This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
Re: [PATCH 3/3] powerpc: Use default st{r,p}cpy optimization for POWER7
- From: "Tulio Magno Quites Machado Filho" <tuliom at linux dot vnet dot ibm dot com>
- To: Adhemerval Zanella <adhemerval dot zanella at linaro dot org>
- Cc: GNU C Library <libc-alpha at sourceware dot org>, Steven Munroe <munroesj at linux dot vnet dot ibm dot com>, OndÅej BÃlka <neleai at seznam dot cz>
- Cc:
- Date: Wed, 05 Aug 2015 16:01:24 -0300
- Subject: Re: [PATCH 3/3] powerpc: Use default st{r,p}cpy optimization for POWER7
- Authentication-results: sourceware.org; auth=none
- References: <55B823B2 dot 6060708 at linaro dot org>
Adhemerval Zanella <adhemerval.zanella@linaro.org> writes:
> Following the discussion with Ondrej and recent changes to default
> st{r,á}cpy algorithm, this patches uses it for both powerpc64 and
> powerpc64/power7 instead of optimized ones (which will be removed).
> This is faster in all but few inputs (mostly with very short sizes)
> for benchtests.
>
> It removes the default powerpc64 st{r,p}cpy and uses the same
> optimization, since powerpc64 optimized algorithm only uses a
> slight optimized path for both doubleword aligned source and
> destiny and resorting to byte-per-byte access to unaligned inputs.
>
> Checked on powerpc64le and compared bench output in attachments.
LGTM.
Thanks!
--
Tulio Magno