[PATCH, AARCH64] Optimized memcpy
Fri Jul 10 12:40:00 GMT 2015
On 9 July 2015 at 16:03, Marcus Shawcroft <email@example.com> wrote:
> On 8 July 2015 at 16:05, Wilco Dijkstra <firstname.lastname@example.org> wrote:
>> This is an optimized memcpy for AArch64. Copies are split into 3 main cases: small copies of up to
>> 16 bytes; medium copies of 17..96 bytes, which are fully unrolled; and large copies of more than
>> 96 bytes, which align the destination and use an unrolled loop processing 64 bytes per iteration.
>> In order to share code with memmove, small and medium copies read all data before writing, allowing
>> any kind of overlap. On a random copy test, memcpy is 40.8% faster on A57 and 28.4% faster on A53.
>> 2015-07-08 Wilco Dijkstra <email@example.com>
>> * newlib/libc/machine/aarch64/memcpy.S (memcpy):
>> Rewrite of optimized memcpy.
>> OK for commit?
> OK /Marcus
Drop the unrelated whitespace changes in the boilerplate first, please.
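
For readers following the thread, the three size classes described in the patch summary can be sketched in portable C roughly as below. This is only an illustration of the idea, not the code in memcpy.S: the names sketch_memcpy and copy_small are hypothetical, the 16-byte destination alignment is an assumption (the summary only says the destination is aligned), and the temporary buffer stands in for the registers the assembly would use to read everything before writing.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper (not from the patch): copy 0..16 bytes.
   All loads happen before any store, so src and dst may overlap. */
static void copy_small(unsigned char *dst, const unsigned char *src, size_t n)
{
    if (n >= 8) {
        /* Two 8-byte chunks, one from each end; they overlap when n < 16. */
        uint64_t head, tail;
        memcpy(&head, src, 8);
        memcpy(&tail, src + n - 8, 8);
        memcpy(dst, &head, 8);
        memcpy(dst + n - 8, &tail, 8);
    } else if (n >= 4) {
        uint32_t head, tail;
        memcpy(&head, src, 4);
        memcpy(&tail, src + n - 4, 4);
        memcpy(dst, &head, 4);
        memcpy(dst + n - 4, &tail, 4);
    } else if (n > 0) {
        /* 1..3 bytes: first, middle and last byte cover every position. */
        unsigned char a = src[0], b = src[n / 2], c = src[n - 1];
        dst[0] = a;
        dst[n / 2] = b;
        dst[n - 1] = c;
    }
}

/* Hypothetical sketch of the three-way split; not the patch's code. */
void *sketch_memcpy(void *dstv, const void *srcv, size_t n)
{
    unsigned char *dst = dstv;
    const unsigned char *src = srcv;

    if (n <= 16) {                      /* small: up to 16 bytes */
        copy_small(dst, src, n);
        return dstv;
    }
    if (n <= 96) {                      /* medium: 17..96 bytes */
        /* Read everything, then write everything. The buffer plays the
           role of registers here. */
        unsigned char tmp[96];
        memcpy(tmp, src, n);
        memcpy(dst, tmp, n);
        return dstv;
    }

    /* large: more than 96 bytes. Copy an unaligned 16-byte head, then
       advance so that dst is 16-byte aligned (assumed alignment). */
    memcpy(dst, src, 16);
    size_t skip = 16 - ((uintptr_t)dst & 15);   /* 1..16 bytes */
    dst += skip;
    src += skip;
    n -= skip;

    while (n >= 64) {                   /* 64 bytes per iteration */
        memcpy(dst, src, 64);
        dst += 64;
        src += 64;
        n -= 64;
    }
    memcpy(dst, src, n);                /* remaining 0..63 bytes */
    return dstv;
}
```

Because the small and medium paths load all of the source before storing, they behave the same whether or not the buffers overlap, which is what would let memmove reuse them. The large path in this sketch copies forwards only, so it assumes non-overlapping buffers, as memcpy does.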