Undefined use of weak symbols in gnulib
Fangrui Song
i@maskray.me
Wed May 5 20:31:04 GMT 2021
On 2021-04-27, H.J. Lu via Binutils wrote:
>On Tue, Apr 27, 2021 at 7:10 PM H.J. Lu <hjl.tools@gmail.com> wrote:
>>
>> On Tue, Apr 27, 2021 at 6:57 PM Bruno Haible <bruno@clisp.org> wrote:
>> >
>> > Hi Florian,
>> >
>> > > Here's a fairly representative test case, I think.
>> > >
>> > > #include <pthread.h>
>> > > #include <stdio.h>
>> > >
>> > > extern __typeof (pthread_key_create) __pthread_key_create __attribute__ ((weak));
>> > > extern __typeof (pthread_once) pthread_once __attribute__ ((weak));
>> > >
>> > > void
>> > > f1 (void)
>> > > {
>> > > puts ("f1 called");
>> > > }
>> > >
>> > > pthread_once_t once_var;
>> > >
>> > > void __attribute__ ((weak))
>> > > f2 (void)
>> > > {
>> > > if (__pthread_key_create != NULL)
>> > > pthread_once (&once_var, f1);
>> > > }
>> > >
>> > > int
>> > > main (void)
>> > > {
>> > > f2 ();
>> > > }
>> > >
>> > > Building it with “gcc -O2 -fpie -pie” and linking with binutils 2.30
>> > > does not result in a crash with LD_PRELOAD=libpthread.so.0.
>> >
>> > Thank you for the test case. It helps the understanding.
>> >
>> > But I don't understand
>> > - why anyone would redeclare 'pthread_once', when it's a standard POSIX
>> > function,
>> > - why f2 is declared weak,
>> > - why the program skips its initializations in single-threaded mode,
>> > - why libpthread would be loaded through LD_PRELOAD or dlopen, given
>> > that the long-term statement has been that declaring a symbol weak
>> > has no effect on the dynamic linker [1][2][3][4]?
>> >
>> > How about the following test case instead?
>> >
>> > =====================================================================
>> > #include <pthread.h>
>> > #include <stdio.h>
>> >
>> > #pragma weak pthread_key_create
>> > #pragma weak pthread_once
>> >
>> > void
>> > do_init (void)
>> > {
>> > puts ("initialization code");
>> > }
>> >
>> > pthread_once_t once_var;
>> >
>> > void
>> > init (void)
>> > {
>> > if (pthread_key_create != NULL)
>> > {
>> > puts ("multi-threaded initialization");
>> > pthread_once (&once_var, do_init);
>> > }
>> > else
>> > do_init ();
>> > }
>> >
>> > int
>> > main (void)
>> > {
>> > init ();
>> > }
>> > =====================================================================
>> >
>> > $ gcc -Wall -fpie -pie foo.c ; ./a.out
>> > initialization code
>> >
>> > $ gcc -Wall -fpie -pie foo.c -Wl,--no-as-needed -lpthread ; ./a.out
>> > multi-threaded initialization
>> > initialization code
>> >
>> > What will change for this program with glibc 2.34?
>> >
>> > Bruno
>> >
>> > [1] https://sourceware.org/legacy-ml/libc-hacker/2000-06/msg00029.html
>> > [2] https://www.akkadia.org/drepper/dsohowto.pdf page 6
>> > [3] https://stackoverflow.com/questions/21092601/is-pthread-in-glibc-so-implemented-by-weak-symbol-to-provide-pthread-stub-functi/21103255
>> > [4] https://stackoverflow.com/questions/20658809/dynamic-loading-and-weak-symbol-resolution
>> >
>>
>> Does x86 show the same issue? I fixed several undefined weak symbol
>> bugs on x86:
>>
>> https://sourceware.org/bugzilla/show_bug.cgi?id=19636
>> https://sourceware.org/bugzilla/show_bug.cgi?id=19704
>> https://sourceware.org/bugzilla/show_bug.cgi?id=19719
I don't consider the first two bugs.
Whether a dynamic relocation is emitted depends on
(1) whether .dynsym exists (2) architecture (3) relocation type (4)
-no-pie/-pie/-shared (5) -z {,no}dynamic-undefined-weak.
It is unlikely a user can summarize rules which can be relied upon.
We can step back and look at these from a different perspective:
figure out what should be defined, then everything else has no hard rule
and we can choose whatever to simplify rules.
* absolute relocation resolves to 0. There may or may not be dynamic relocations.
* PC-relative relocation doesn't make sense.
I changed LLD to use a simple rule:
* -no-pie and -pie: no dynamic relocation
* -shared: dynamic relocation
https://maskray.me/blog/2021-04-25-weak-symbol
>> with a linker option:
>>
>> 'dynamic-undefined-weak'
>> 'nodynamic-undefined-weak'
>> Make undefined weak symbols dynamic when building a dynamic
>> object, if they are referenced from a regular object file and
>> not forced local by symbol visibility or versioning. Do not
>> make them dynamic if 'nodynamic-undefined-weak'. If neither
>> option is given, a target may default to either option being
>> in force, or make some other selection of undefined weak
>> symbols dynamic. Not all targets support these options.
>>
>> Alan extended the fix to PPC:
>>
>> commit 954b63d4c8645f86e40c7ef6c6d60acd2bf019de
>> Author: Alan Modra <amodra@gmail.com>
>> Date: Wed Apr 19 01:26:57 2017 +0930
>>
>> Implement -z dynamic-undefined-weak
>>
>> -z nodynamic-undefined-weak is only implemented for x86. (The sparc
>> backend has some support code but doesn't enable the option by
>> including ld/emulparams/dynamic_undefined_weak.sh, and since the
>> support looks like it may be broken I haven't enabled it.) This patch
>> adds the complementary -z dynamic-undefined-weak, extends both options
>> to affect building of shared libraries as well as executables, and
>> adds support for the option on powerpc.
>>
>
>Another undefined weak symbol linker bug:
>
>https://sourceware.org/bugzilla/show_bug.cgi?id=22269
More information about the Libc-alpha
mailing list