This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
Re: [PATCH RFC 2/2 V3] Improve 64bit memset for Corei7 with avx2 instruction
- From: Andreas Jaeger <aj at suse dot com>
- To: Ling Ma <ling dot ma dot program at gmail dot com>
- Cc: libc-alpha at sourceware dot org, Ma Ling <ling dot ml at alibaba-inc dot com>
- Date: Mon, 29 Jul 2013 12:02:02 +0200
- Subject: Re: [PATCH RFC 2/2 V3] Improve 64bit memset for Corei7 with avx2 instruction
- References: <CAOGi=dP2EeoV4Lc+do+VTRpw4CToa03UYgeeUua9Cn=8YMxSjg at mail dot gmail dot com>
On 07/29/2013 11:49 AM, Ling Ma wrote:
> The Attachment includes how to setup cpu2006 gcc.403 to measure
> memset/memcpy respectively. the readme.txt specify the process.
> Any problem, please let me know.
Ling,
You're comparing against memcpy_sse2_unaligned but wouldn't the selector
use __memcpy_ssse3 on current Haswell cpus and thus you should compare
your new routine against that one?
Andreas
--
Andreas Jaeger aj@{suse.com,opensuse.org} Twitter/Identica: jaegerandi
SUSE LINUX Products GmbH, Maxfeldstr. 5, 90409 NÃrnberg, Germany
GF: Jeff Hawn,Jennifer Guild,Felix ImendÃrffer,HRB16746 (AG NÃrnberg)
GPG fingerprint = 93A3 365E CE47 B889 DF7F FED1 389A 563C C272 A126