[PATCH, AARCH64] Optimized memset

Wilco Dijkstra wdijkstr@arm.com
Fri Jul 17 17:46:00 GMT 2015


> Marcus Shawcroft wrote:
> On 10 July 2015 at 13:44, Wilco Dijkstra <wdijkstr@arm.com> wrote:
> 
> > It turns out to be slightly faster on A53 too, so I've updated my patch.
> >
> > Wilco
> >
> > ChangeLog:
> > 2015-07-10  Wilco Dijkstra  <wdijkstr@arm.com>
> >
> >         * newlib/libc/machine/aarch64/memset.S (memset):
> >         Rewrite of optimized memset.
> >
> > OK for commit?
> 
> Wilco, This patch appears to cause various aarch64_be-none-elf gcc
> testsuite tests to drop into infinite loops. One example is:
> 
> RUNTESTFLAGS="builtins.exp=fprintf.c" make check-gcc
> 
> would you mind investigating please?
> 
> I'm going to revert the patch while we investigate this.

It turns out there was a typo in this instruction due to testing for
different zva values:

	and	tmp1w, tmp1w, 4 (should be 15)

As a result memset fails if you use a dz zva value other than the
common setting of 64 bytes. With the change it correctly works with 
any legal setting, and aarch64_be-none-elf passes.

Attached is the fixed version of the patch.

Wilco
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: 0001-Optimized-memset.txt
URL: <http://sourceware.org/pipermail/newlib/attachments/20150717/bc90d457/attachment.txt>


More information about the Newlib mailing list