Improve memcpy for Atom
Michael Zolotukhin
michael.v.zolotukhin@gmail.com
Wed Oct 19 15:48:00 GMT 2011
Hi,
This patch contains one function:
__memcpy_ssse3
It improves memcpy on small sizes and on sizes between half of shared
cache size and shared cache size for Atom (up to 40% performance
gain).
The patch was tested on Atom.
Change Log:
2011-10-11 Michael Zolotukhin <michael.v.zolotukhin@gmail.com>
* sysdeps/i386/i686/multiarch/memcpy-ssse3.S: Update.
XMM-moves are used for copying on small sizes. Use
SHARED_CACHE_SIZE instead of
SHARED_CACHE_SIZE_HALF.
--
---
Best regards,
Michael V. Zolotukhin,
Software Engineer
Intel Corporation.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: memcpy_32.patch
Type: application/octet-stream
Size: 25733 bytes
Desc: not available
URL: <http://sourceware.org/pipermail/libc-alpha/attachments/20111019/3d4a388e/attachment.obj>
More information about the Libc-alpha
mailing list