This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.

Re: [PATCH 0/2] Multiarch hooks for memcpy variants

From: Wilco Dijkstra <Wilco dot Dijkstra at arm dot com>
To: Siddhesh Poyarekar <siddhesh at gotplt dot org>, Zack Weinberg <zackw at panix dot com>
Cc: Szabolcs Nagy <Szabolcs dot Nagy at arm dot com>, "libc-alpha at sourceware dot org" <libc-alpha at sourceware dot org>, nd <nd at arm dot com>
Date: Mon, 14 Aug 2017 13:22:50 +0000
Subject: Re: [PATCH 0/2] Multiarch hooks for memcpy variants

Siddhesh Poyarekar wrote:
> The first part is not true for falkor since its implementation is a good
> 10-15% faster on the falkor chip due to its design differences.  glibc
> makes pretty extensive use of memcpy throughout, but I don't have data
> on how much difference a core-specific memcpy will make there, so I
> don't have enough grounds for a generic change.

66% of memcpy calls are <=16 bytes. Even assuming you can get a 15% gain
for these small sizes (there is very little you can do differently), that's at most 1
cycle faster, so the PLT indirection is going to cost more than it saves.
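
The arithmetic behind that estimate can be sketched as below. This is only an
illustration: the function names are hypothetical, and the base cost of a small
copy and the cost of a PLT call are assumed inputs, not measurements from this
thread.

```c
#include <stdio.h>

/* Cycles saved per call by a faster routine: base cost times the
   fractional gain (0.15 for a 15% speedup).  */
static double
cycles_saved (double base_cycles, double gain)
{
  return base_cycles * gain;
}

/* Net cycles saved per call once the extra indirection is paid for.
   A negative result means the core-specific version loses overall.  */
static double
net_cycles_saved (double base_cycles, double gain, double plt_cycles)
{
  return cycles_saved (base_cycles, gain) - plt_cycles;
}

/* Print the estimate for one set of assumed inputs.  */
static void
print_estimate (double base_cycles, double gain, double plt_cycles)
{
  printf ("saved by routine: %.2f cycles, net after PLT: %.2f cycles\n",
          cycles_saved (base_cycles, gain),
          net_cycles_saved (base_cycles, gain, plt_cycles));
}
```

For example, print_estimate (6.0, 0.15, 2.0) models a 6-cycle small copy with a
15% gain and a 2-cycle indirection (all three numbers assumed): the routine
saves under one cycle, so the net result is negative.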

> Your last point about hurting everything else is very valid though; it's
> very likely that adding an extra indirection in cases where
> __memcpy_generic is going to be called anyway is going to be expensive
> given that a bulk of the memcpy calls will be for small sizes of less
> than 1k.

Note that the falkor version does quite well in memcpy-random across several
microarchitectures, so I think parts of it could be moved into the generic code.

> Allowing a PLT only for __memcpy_chk and mempcpy would need a test case
> waiver in check_localplt and that would become a blanket OK for PLT
> usage for memcpy, which we don't want.  Hence my patch is probably the
> best compromise, especially since there is precedent for the approach in
> x86.

I still can't see any reason to even support these entry points in GLIBC, let
alone optimize them using ifuncs. The _chk functions should obviously be
inlined, which avoids all the target-specific complexity that brings no benefit. I think this
could trivially be done via the GLIBC headers already. (That's assuming they
are in any way performance critical.)
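
A rough sketch of what header-level inlining of a checked memcpy could look
like follows. It is not glibc's actual header code: the names
my_memcpy_chk_inline and MY_FORTIFIED_MEMCPY are hypothetical, abort() stands
in for glibc's failure handler, and it assumes a compiler that provides
__builtin_object_size (GCC or Clang).

```c
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical inline replacement for a call to __memcpy_chk.
   dstlen is the known size of the destination object, or (size_t) -1
   when the compiler cannot determine it.  */
static inline void *
my_memcpy_chk_inline (void *dst, const void *src, size_t n, size_t dstlen)
{
  if (n > dstlen)
    abort ();                  /* Overflow detected; stand-in handler.  */
  return memcpy (dst, src, n); /* Plain memcpy, no separate _chk entry.  */
}

/* Hypothetical wrapper macro: the object size is computed at the call
   site, so the check folds away when both sizes are compile-time
   constants.  */
#define MY_FORTIFIED_MEMCPY(dst, src, n) \
  my_memcpy_chk_inline ((dst), (src), (n), \
                        __builtin_object_size ((dst), 0))
```

With this shape the only out-of-line call is to memcpy itself, so no
core-specific variant of the checked entry point is needed. Whether that is
acceptable for existing binaries that already reference the _chk symbols is a
separate question that this sketch does not address.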

Wilco

Follow-Ups:
- Re: [PATCH 0/2] Multiarch hooks for memcpy variants
  - From: Siddhesh Poyarekar
- Re: [PATCH 0/2] Multiarch hooks for memcpy variants
  - From: Patrick McGehearty
- Re: [PATCH 0/2] Multiarch hooks for memcpy variants
  - From: Zack Weinberg

References:
- Re: [PATCH 0/2] Multiarch hooks for memcpy variants
  - From: Wilco Dijkstra
- Re: [PATCH 0/2] Multiarch hooks for memcpy variants
  - From: Siddhesh Poyarekar
- Re: [PATCH 0/2] Multiarch hooks for memcpy variants
  - From: Szabolcs Nagy
- Re: [PATCH 0/2] Multiarch hooks for memcpy variants
  - From: Zack Weinberg
- Re: [PATCH 0/2] Multiarch hooks for memcpy variants
  - From: Siddhesh Poyarekar
- Re: [PATCH 0/2] Multiarch hooks for memcpy variants
  - From: Zack Weinberg
- Re: [PATCH 0/2] Multiarch hooks for memcpy variants
  - From: Wilco Dijkstra
- Re: [PATCH 0/2] Multiarch hooks for memcpy variants
  - From: Siddhesh Poyarekar
