This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH] X86-64: Add _dl_runtime_resolve_avx[512]_opt [BZ #20508]

From: Adhemerval Zanella <adhemerval dot zanella at linaro dot org>
To: libc-alpha at sourceware dot org
Date: Tue, 4 Oct 2016 16:54:42 -0300
Subject: Re: [PATCH] X86-64: Add _dl_runtime_resolve_avx[512]_opt [BZ #20508]
Authentication-results: sourceware.org; auth=none
References: <CAMe9rOojpuFz1jTbMpNcqZK1KVDqaWozNuEuS3E67dvD3Rh=hw@mail.gmail.com> <CAMe9rOo7MmGNPKct=AzzbtR564yH1P96tUcLD7pFv7GtxF3-Ng@mail.gmail.com> <48eeea78-99e8-f255-bd26-b6d28929b4f0@twiddle.net> <CAMe9rOqXhVLVbT90JBfkJnCLzZBrkUHtkO4Ddfk9HPH7jiyEzw@mail.gmail.com> <CAMe9rOoDp7XYwLdDetFNghAdDo_zj-4342LKV4i0zCH36aBWtw@mail.gmail.com> <CAMe9rOo+_TQABBp0eC4nQcJ5ENLrZAOSvtAjo17E6H0ezDpWvg@mail.gmail.com> <CAMe9rOqoATTPzXqjtqenV0h+hMtMYcr+W-ZL3Bi4jFJWYBCUpg@mail.gmail.com> <76f9e5c1-d1de-04d1-49f5-30673bf3060b@redhat.com> <CAMe9rOq8yu+b0-N6UR3vb843NApZ+ZjBwAw5GsERHpvuCrVjrw@mail.gmail.com> <14cc7a47-1a39-6285-d4b5-dab2769c092b@redhat.com> <CAMe9rOrqmLmHLhHL7pDSvR=8=dUUGP07zY5A3DiOubtpvR25WQ@mail.gmail.com> <ea7fe73f-b8cb-6049-d4c4-918d724bd883@redhat.com> <CAMe9rOqxqffhZAb1Sj-qxg_1oU0J=0v7rvhoXVbjbJrrthHucg@mail.gmail.com> <b8e16aaf-146b-7b20-4f35-a9fa0af2e042@linaro.org> <e671912a-526a-d54b-c839-cd3a5c6f90b5@redhat.com>


On 04/10/2016 16:18, Florian Weimer wrote:
> On 10/04/2016 08:13 PM, Adhemerval Zanella wrote:
>> I think 2.24 it is ok since it contains the BZ#20139 fix already.  For 2.23,
>> although it was not really explicit in NEWS, AVX512 is suppose to be supported
>> in a set of different implementation (memmove/memcpy/libmvec).  However my
>> understanding of this issue is limited to be a performance one, so I do not
>> see a pressing matter to change a release requirements for such change.
> 
> As far as I understand it, the issue is that the trampoline writes to the SSE/AVX/AVX2 registers, which clears the AVX-512F bits (which are not saved by the trampoline).
> 
> Intel introduced AVX512F in such a manner that you have to upgrade kernel and userspace in lock-step, which is of course unrealistic.
> 
> Florian

Reading the patch and its description leads to see that it tries to
fix a AVX-SEE transition described by this Intel documentation [1].

A more experienced arch developer could correct me, but my understanding
is hardware itself would save/restore the upper AVX 256 and 512 bits 
when SSE/AVX instruction are mixed together.  Am I missing something
here?

[1] https://software.intel.com/sites/default/files/m/d/4/1/d/8/11MC12_Avoiding_2BAVX-SSE_2BTransition_2BPenalties_2Brh_2Bfinal.pdf

References:
- Re: [PATCH] X86-64: Add _dl_runtime_resolve_avx[512]_opt [BZ #20508]
  - From: Florian Weimer
- Re: [PATCH] X86-64: Add _dl_runtime_resolve_avx[512]_opt [BZ #20508]
  - From: H.J. Lu
- Re: [PATCH] X86-64: Add _dl_runtime_resolve_avx[512]_opt [BZ #20508]
  - From: Florian Weimer
- Re: [PATCH] X86-64: Add _dl_runtime_resolve_avx[512]_opt [BZ #20508]
  - From: H.J. Lu
- Re: [PATCH] X86-64: Add _dl_runtime_resolve_avx[512]_opt [BZ #20508]
  - From: Florian Weimer
- Re: [PATCH] X86-64: Add _dl_runtime_resolve_avx[512]_opt [BZ #20508]
  - From: H.J. Lu
- Re: [PATCH] X86-64: Add _dl_runtime_resolve_avx[512]_opt [BZ #20508]
  - From: Adhemerval Zanella
- Re: [PATCH] X86-64: Add _dl_runtime_resolve_avx[512]_opt [BZ #20508]
  - From: Florian Weimer

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]