This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PING][PATCHv3 1/2] aarch64: Hoist ZVA check out of the memset function

From: Szabolcs Nagy <szabolcs dot nagy at arm dot com>
To: siddhesh at sourceware dot org, Wilco Dijkstra <Wilco dot Dijkstra at arm dot com>, "adhemerval dot zanella at linaro dot org" <adhemerval dot zanella at linaro dot org>
Cc: nd at arm dot com, "libc-alpha at sourceware dot org" <libc-alpha at sourceware dot org>
Date: Thu, 12 Oct 2017 15:01:30 +0100
Subject: Re: [PING][PATCHv3 1/2] aarch64: Hoist ZVA check out of the memset function
Authentication-results: sourceware.org; auth=none
Authentication-results: spf=none (sender IP is ) smtp.mailfrom=Szabolcs dot Nagy at arm dot com;
Nodisclaimer: True
References: <DB6PR0801MB20531D1099A99E5A4DF1E042834A0@DB6PR0801MB2053.eurprd08.prod.outlook.com> <69ed23fb-12f9-222e-5c6b-e53fa55b95c4@sourceware.org> <59DF4E71.8020307@arm.com> <bdfb8e17-40d2-f937-a05e-fbf310adc63f@sourceware.org>
Spamdiagnosticmetadata: NSPM
Spamdiagnosticoutput: 1:99

On 12/10/17 13:57, Siddhesh Poyarekar wrote:
> On Thursday 12 October 2017 04:43 PM, Szabolcs Nagy wrote:
>> since the patch modifies the memset code that runs in
>> most cases, the impact of the change should be well
>> understood before it goes in.
>>
>> in the static linked case it will introduce a plt
>> indirection, i don't expect a big gain except on falkor
>> and there is a risk of regression on other cores, it
>> also makes the code less maintainable.
> 
> Why do you think this gives a gain only on falkor, especially when I
> asserted that it gives a significant gain on falkor *and* mustang?  Why
> do you think having a bunch of branches in live code (that's the bit
> that gets hoisted out) would be beneficial to any core?
> 

there was no testing on real workloads only on
a ubenchmark that has issues outlined by wilco.

it is known that some cores are sensitive to the
exact code sequence in memset and you changed it
without testing the affected cores (cortex-a53),
probably only wilco knows why he wrote the code
exactly the way it is written, so i need evidence
that there is no regression or wilco's approval.

mustang is a devboard with x-gene cores, which is
not representative of the existing aarch64 cores
in use, but even on mustang it's not clear that the
ubenchmark speed up mean improvement in practice.
(string functions are hard to benchmark, there are
many cases and concerns, which is why i'm conservative
about accepting patches to them)

in principle the approach is fine, but it will take
more time to get confident about the patch.

Follow-Ups:
- Re: [PING][PATCHv3 1/2] aarch64: Hoist ZVA check out of the memset function
  - From: Siddhesh Poyarekar

References:
- Re: [PING][PATCHv3 1/2] aarch64: Hoist ZVA check out of the memset function
  - From: Wilco Dijkstra
- Re: [PING][PATCHv3 1/2] aarch64: Hoist ZVA check out of the memset function
  - From: Siddhesh Poyarekar
- Re: [PING][PATCHv3 1/2] aarch64: Hoist ZVA check out of the memset function
  - From: Szabolcs Nagy
- Re: [PING][PATCHv3 1/2] aarch64: Hoist ZVA check out of the memset function
  - From: Siddhesh Poyarekar

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]