This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: PowerPC LE memcpy


Hi Alan,

This patch is ok, thanks. I tried the power7 optimized memcpy implementation
from kernel and adjusted to use on glibc. Using the benchtest/bench-memcpy
implementation with some modification to accumulate total time spent in each
call (instead of just use the best timing), I noticed GLIBC implementation
with your cleanings is slight faster for the sizes considered in the 
benchmark (<496 bytes).


On 09-08-2013 02:25, Alan Modra wrote:
> LIttle-endian support for memcpy.  I spent some time cleaning up the
> 64-bit power7 memcpy, in order to avoid the extra alignment traps
> power7 takes for little-endian.  It probably would have been better
> to copy the linux kernel version of memcpy.
>
> 	* sysdeps/powerpc/powerpc32/power4/memcpy.S: Add little endian support.
> 	* sysdeps/powerpc/powerpc32/power6/memcpy.S: Likewise.
> 	* sysdeps/powerpc/powerpc32/power7/memcpy.S: Likewise.
> 	* sysdeps/powerpc/powerpc32/power7/mempcpy.S: Likewise.
> 	* sysdeps/powerpc/powerpc64/memcpy.S: Likewise.
> 	* sysdeps/powerpc/powerpc64/power4/memcpy.S: Likewise.
> 	* sysdeps/powerpc/powerpc64/power6/memcpy.S: Likewise.
> 	* sysdeps/powerpc/powerpc64/power7/memcpy.S: Likewise.
> 	* sysdeps/powerpc/powerpc64/power7/mempcpy.S: Likewise.  Make better
> 	use of regs.  Use power7 mtocrf.  Tidy function tails.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]