Improve memcpy for Atom

Michael Zolotukhin michael.v.zolotukhin@gmail.com
Wed Oct 19 15:48:00 GMT 2011


Hi,

This patch updates a single function:
__memcpy_ssse3

It improves memcpy performance on Atom for small sizes and for sizes
between half of the shared cache size and the full shared cache size
(up to 40% performance gain).
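
To illustrate the second case, here is a minimal sketch in C of how moving a
size threshold from SHARED_CACHE_SIZE_HALF to SHARED_CACHE_SIZE changes which
copy path is taken. It is not taken from the patch: the cache size value, the
enum, and both pick_path_* functions are hypothetical, and what the "large"
path actually does (for example non-temporal stores) is an assumption not
stated in this message.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical value for illustration only; the real constant is
   determined elsewhere in the library. */
#define SHARED_CACHE_SIZE      (1024 * 1024)
#define SHARED_CACHE_SIZE_HALF (SHARED_CACHE_SIZE / 2)

static_assert(SHARED_CACHE_SIZE_HALF * 2 == SHARED_CACHE_SIZE,
              "half must be exactly half of the shared cache size");

/* Hypothetical names for the two copy strategies. */
enum copy_path { PATH_CACHED, PATH_LARGE };

/* Old behaviour (assumed): switch to the large-copy path at half the cache. */
static enum copy_path pick_path_old(size_t n)
{
    return n >= SHARED_CACHE_SIZE_HALF ? PATH_LARGE : PATH_CACHED;
}

/* New behaviour (assumed): switch only at the full shared cache size. */
static enum copy_path pick_path_new(size_t n)
{
    return n >= SHARED_CACHE_SIZE ? PATH_LARGE : PATH_CACHED;
}
```

With these assumed values, a 768 KiB copy takes the large-copy path under the
old threshold and the regular path under the new one, which is the size range
where the gain is reported.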

The patch was tested on Atom.

Change Log:
2011-10-11  Michael Zolotukhin  <michael.v.zolotukhin@gmail.com>

        * sysdeps/i386/i686/multiarch/memcpy-ssse3.S: Update.
        XMM-moves are used for copying on small sizes.  Use
        SHARED_CACHE_SIZE instead of SHARED_CACHE_SIZE_HALF.
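
As a rough illustration of what "XMM-moves on small sizes" can look like, here
is a sketch in C with SSE2 intrinsics rather than the assembly in the patch.
It copies 16 to 32 bytes with two unaligned 16-byte moves whose ranges may
overlap in the middle. The function names and the 16..32 range are
hypothetical and chosen for the example; the patch itself may handle small
sizes differently.

```c
#include <assert.h>
#include <emmintrin.h>  /* SSE2: _mm_loadu_si128, _mm_storeu_si128 */
#include <stddef.h>
#include <string.h>

/* Hypothetical helper, not from the patch: copy n bytes, 16 <= n <= 32,
   using two XMM-sized moves. */
static void copy_16_to_32(void *dst, const void *src, size_t n)
{
    assert(n >= 16 && n <= 32);
    const char *s = (const char *)src;
    char *d = (char *)dst;

    /* Load both chunks before storing anything. */
    __m128i head = _mm_loadu_si128((const __m128i *)s);
    __m128i tail = _mm_loadu_si128((const __m128i *)(s + n - 16));

    /* The first store covers bytes 0..15, the second covers n-16..n-1;
       together they cover all n bytes, overlapping when n < 32. */
    _mm_storeu_si128((__m128i *)d, head);
    _mm_storeu_si128((__m128i *)(d + n - 16), tail);
}

/* Hypothetical wrapper: use the XMM path for 16..32 bytes and fall back
   to the library memcpy for every other size. */
static void *small_memcpy(void *dst, const void *src, size_t n)
{
    if (n >= 16 && n <= 32) {
        copy_16_to_32(dst, src, n);
        return dst;
    }
    return memcpy(dst, src, n);
}
```

The overlapping second move avoids a byte-by-byte tail loop, so any size in
the range is handled with the same four instructions.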

-- 
---
Best regards,
Michael V. Zolotukhin,
Software Engineer
Intel Corporation.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: memcpy_32.patch
Type: application/octet-stream
Size: 25733 bytes
Desc: not available
URL: <http://sourceware.org/pipermail/libc-alpha/attachments/20111019/3d4a388e/attachment.obj>

