This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.



Re: [PATCH RFC] Improve 64bit memcpy performance for Haswell CPU with AVX instruction


On Thu, Jul 17, 2014 at 8:33 AM, H.J. Lu <hjl.tools@gmail.com> wrote:
> On Tue, Jul 15, 2014 at 11:54 PM, Ling Ma <ling.ma.program@gmail.com> wrote:
>> Any comments please.
>>
>> Thanks
>> Ling
>>
>> 2014-07-14 12:28 GMT+08:00, Ling Ma <ling.ma.program@gmail.com>:
>>> This patch takes advantage of Haswell (HSW) memory bandwidth, reduces
>>> branch mispredictions by avoiding branch instructions, and forces the
>>> destination to be aligned for AVX instructions.
>>>
>>> The CPU2006 403.gcc benchmark indicates that this patch improves overall
>>> performance by 2% to 10%, and by 12% to 60% when the copy size is over
>>> 256 bytes.
>>>
>>> This version is based on the latest ChangeLog and simplifies memmove
>>> according to Ondra's comment.
>>>
>>> Thanks
>>> Ling
>>>
>
> Any more feedback on the latest version:
>
> https://sourceware.org/ml/libc-alpha/2014-07/msg00302.html
>

I will check it in after 24 hours.  Thanks, everyone.


-- 
H.J.
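
For readers following the thread, the two ideas named in the quoted patch
description (avoiding length-dependent branches, and aligning the destination
for wide stores) can be illustrated with a rough, portable C sketch. This is
not the code from the patch, which is available at the URL quoted above.
The vec32 type and the load32/store32 helpers are hypothetical stand-ins for
a 32-byte AVX register and its unaligned load/store instructions, and the
function names are made up for illustration. Both functions assume the source
and destination buffers do not overlap.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for one 32-byte AVX register. */
typedef struct { unsigned char b[32]; } vec32;

/* Stand-ins for unaligned 32-byte vector load and store. */
static vec32 load32(const unsigned char *p)
{
    vec32 v;
    memcpy(v.b, p, 32);
    return v;
}

static void store32(unsigned char *p, vec32 v)
{
    memcpy(p, v.b, 32);
}

/* Copy n bytes, 32 <= n <= 64, without branching on the exact length:
   the first and last 32 bytes are copied, and the two stores simply
   overlap when n < 64. */
static void copy_32_to_64(unsigned char *dst, const unsigned char *src,
                          size_t n)
{
    vec32 head = load32(src);
    vec32 tail = load32(src + n - 32);
    store32(dst, head);
    store32(dst + n - 32, tail);
}

/* Copy n >= 64 bytes.  The head and tail are handled with unaligned
   stores; the main loop starts at the first offset where the
   destination is 32-byte aligned, so every store in the loop is
   aligned. */
static void copy_large(unsigned char *dst, const unsigned char *src,
                       size_t n)
{
    vec32 head = load32(src);
    vec32 tail = load32(src + n - 32);

    /* Offset (1..32) at which dst + skip is 32-byte aligned. */
    size_t skip = 32 - ((uintptr_t)dst & 31);

    for (size_t i = skip; i + 32 <= n; i += 32)
        store32(dst + i, load32(src + i));

    store32(dst, head);           /* covers bytes [0, 32)     */
    store32(dst + n - 32, tail);  /* covers bytes [n - 32, n) */
}
```

The overlapping head and tail stores trade a few redundant bytes for the
absence of a byte-count loop or a chain of size checks, which is one common
way to reduce mispredicted branches in copy routines. Whether the actual
patch uses exactly this arrangement should be checked against the patch
itself.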

