[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: GNU dlopen(3) differs from POSIX/IEEE

To: Carlos O'Donell <carlos@redhat.com>, gnu-gabi@sourceware.org
Subject: Re: GNU dlopen(3) differs from POSIX/IEEE
From: Suprateeka R Hegde <hegdesmailbox@gmail.com>
Date: Sat, 18 Jun 2016 13:31:01 +0530
Authentication-results: sourceware.org; auth=none
Delivered-to: listarch-gnu-gabi@sourceware.org
Delivered-to: mailing list gnu-gabi@sourceware.org
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=reply-to:subject:references:to:from:organization:message-id:date :user-agent:mime-version:in-reply-to:content-transfer-encoding; bh=lkkWVN1jeCgG5U8eSaT/NP1fwKd6QZty/gXHw8MfnFI=; b=fPHwD5YjiTfLMBA5Sbw9l5qKsw+bovjY392cHohFBkALRqQq0Sv0AJxqRk0qnCefL4 wzlTnqms3zLZWdEpiPanPHoO5phShtcap2m/0uR6I7sltF8OImwTd5MZeaMa+ywubCZt wqvjc1f5nJWSuDiyayEKDy4fY69q8bIavX72qzyY5+c0sUcK2nlvIV6/BcEC8ZMO6CqB HA6kYP1edSeC2nDoaG9JjO3F2FeDtbym7bdUn/qau1dBOsJRxo/EIV+Ryi9novNj7d8J iqdKJh2erdr+vRbJpIkwYhC3LcBIdfqvMn6Cwsi6hnYiX4C2E4NXV4G5dQj349AX+ssS Iv0Q==
In-reply-to: <42a86c64-a042-0c0d-9601-49729816c825@redhat.com>
List-help: <mailto:gnu-gabi-help@sourceware.org>
List-id: <gnu-gabi.sourceware.org>
List-post: <mailto:gnu-gabi@sourceware.org>
List-subscribe: <mailto:gnu-gabi-subscribe@sourceware.org>
Mailing-list: contact gnu-gabi-help@sourceware.org; run by ezmlm
Organization: HEGDESASPECT
References: <25bc0c78-19ae-8974-b142-bb57f21cdb3d@gmail.com> <ca68d193-0a5d-1dc1-dc8c-bc59c8c27627@redhat.com> <763cd6f7-e33d-8d14-c0ba-f4e5797ddfa6@gmail.com> <42a86c64-a042-0c0d-9601-49729816c825@redhat.com>
Reply-to: hegdesmailbox@gmail.com
Sender: gnu-gabi-owner@sourceware.org
User-agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:45.0) Gecko/20100101 Thunderbird/45.1.1



On 18-Jun-2016 11:02 AM, Carlos O'Donell wrote:

On 06/18/2016 12:11 AM, Suprateeka R Hegde wrote:

All I am saying is, dlopen(3) with RTLD_GLOBAL also should bring in
foo at runtime to be compliant with POSIX.


I disagree. Nothing in POSIX says that needs to be done. The
key failure in your reasoning is that you have assumed lazy
symbol resolution must happen at the point of the first function
call.


ld(1) on a GNU/Linux machine says:
---
-z lazy

When generating an executable or shared library, mark it to tell thedynamic linker to defer function call resolution to the point when thefunction is called (lazy binding)

---

This made me think that GNU implementation also matches with otherimplementations -- that is lazy resolution happens at the time of thefirst call.

You have read "shall be made available for relocation" and
then used implementation knowledge to decide that _today_ those
relocations have a happens-after relationship with dlopen in your
program. But because lazy symbol resolution is not an observable
event for a well-defined program,

Yes. I agree very much. But making some massive enterprise legacyapplication to become "well-defined" now is beyond tool chain writers.

The very use of --unresolved-symbol=ignore all for an executable link isbad in a way.

and no guarantees are made,
you can't make a happens-after relationship, and can't expect
'foo' to resolve to the loaded 'foo' that came into the global
scope with dlopen.

Perhaps in the future you want a mode where all lazy symbol
resolution is done before the first dlopen runs. Say we want to
do this to relocate the whole PLT and mark it read-only for
safety hardening.

This is going to be a "mode". Almost similar to BIND_NOW. But notdefault. Even if decided default, a non-default (lazy writable PLTs)mode still exists.

If you were to _require_ lazy resolution to happen at the point
of the function call, which is what you're assuming here, then
it would prevent the above implementation from being conforming.

Both are mutually exclusive. In my opinion, programs either wantimmediate binding or lazy binding. Not an arbitrary mix of both.

However, because POSIX says nothing about when the lazy symbol
resolution happens, or anything at all about it,


It indeed says something:
---
RTLD_LAZY

Relocations shall be performed at an implementation-defined time,ranging from the time of the dlopen() call until the first reference toa given symbol occurs

---

And then based on the ld(1) manpage, I thought GNU/Linux implementationuses the time of first call.

What is the harm if we go by the existing documentation and under theoption -z lazy or RTLD_LAZY, make lazy resolution happen at the point offunction call?


(BTW, the above is already in place currently and is working as expected)

And eventually change the semantics of RTLD_GLOBAL to match thedescription mentioned in the POSIX spec -- ...relocation processing ofany other executable object file.


--
Supra

Follow-Ups:
- Re: GNU dlopen(3) differs from POSIX/IEEE
  - From: Carlos O'Donell <carlos@redhat.com>

References:
- GNU dlopen(3) differs from POSIX/IEEE
  - From: Suprateeka R Hegde <hegdesmailbox@gmail.com>
- Re: GNU dlopen(3) differs from POSIX/IEEE
  - From: Carlos O'Donell <carlos@redhat.com>
- Re: GNU dlopen(3) differs from POSIX/IEEE
  - From: Suprateeka R Hegde <hegdesmailbox@gmail.com>
- Re: GNU dlopen(3) differs from POSIX/IEEE
  - From: Carlos O'Donell <carlos@redhat.com>

Prev by Date: Re: GNU dlopen(3) differs from POSIX/IEEE
Next by Date: Re: OSABI on Linux Distros
Previous by thread: Re: GNU dlopen(3) differs from POSIX/IEEE
Next by thread: Re: GNU dlopen(3) differs from POSIX/IEEE
Index(es):
- Date
- Thread