This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.


Re: benchmark improvements (Was: Re: [PATCH] sysdeps/arm/armv7/multiarch/memcpy_impl.S: Improve performance.)

From: Ondřej Bílka <neleai at seznam dot cz>
To: Will Newton <will dot newton at linaro dot org>
Cc: Siddhesh Poyarekar <siddhesh at redhat dot com>, Carlos O'Donell <carlos at redhat dot com>, "libc-ports at sourceware dot org" <libc-ports at sourceware dot org>, libc-alpha <libc-alpha at sourceware dot org>
Date: Tue, 3 Sep 2013 19:48:40 +0200
Subject: Re: benchmark improvements (Was: Re: [PATCH] sysdeps/arm/armv7/multiarch/memcpy_impl.S: Improve performance.)
Authentication-results: sourceware.org; auth=none
References: <520894D5 dot 7060207 at linaro dot org> <CANu=DmiBHoymFKTvaW_VsdhWZEYwkfViz1tTeRgj7H80f0FntA at mail dot gmail dot com> <5220D30B dot 9080306 at redhat dot com> <CANu=DmiXLL9v1Z1KS0sBOs-pL8csEUGc9YE829_-tidKd-GruQ at mail dot gmail dot com> <5220F1F0 dot 80501 at redhat dot com> <CANu=DmhA9QvSe6RS72Db2P=yyjC72fsE8d4QZKHEcNiwqxNMvw at mail dot gmail dot com> <20130902142037 dot GH3273 at spoyarek dot pnq dot redhat dot com> <CANu=DmgDYv0JFxqCYUkL2iz_GTSm0LKJvmtGD_i04wB=qDvu7A at mail dot gmail dot com>

On Tue, Sep 03, 2013 at 02:46:13PM +0100, Will Newton wrote:
> On 2 September 2013 15:20, Siddhesh Poyarekar <siddhesh@redhat.com> wrote:
> >> The glibc benchmarks also have some other weaknesses that should
> >> really be addressed, hopefully I'll have some time to write patches
> >> for some of this work.
> >
> > I know Ondrej had proposed a few improvements as well.  I'd like to
> > see those reposted so that we can look at it and if possible, have
> > them merged in.
> 
> I already have a patch to do multiple runs of benchmarks - some
> things like physical page allocation that can impact a benchmark can
> only be controlled for this way. As I mentioned above I'd also like to
> get graphing capability in there too. Beyond that it would be nice to
> have a look at the various sizes and alignments used and make sure
> there is a reasonably complete set, and to make sure the tests are run
> for a useful number of iterations (not too large or too small).
> 
For alignments, which of these timings do you want the existing
implementation to show?

a) 0.031s b) 0.080s c) 0.036s

If you want to get your implementation accepted, pick a); if you do not
like the ACME implementation, pick b); otherwise pick c).

I got those numbers by 'benchmarking' memchr with alignment 15 and size
15 on Ivy Bridge (benchmark attached).

The current memchr implementation has separate branches for loads that
cross a cache line and those that don't. All three runs use addresses that
are 15 modulo 16; they differ only in the address pattern: 64*x+15 for a),
64*x+63 for b), and 16*x+15 for c).
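
To make the difference between the three address patterns concrete, here is
a rough sketch in C. It is not the attached benchmark. It assumes 64-byte
cache lines and a 16-byte unaligned load at the start of the search (both
are assumptions about the implementation, not something measured here), and
all names (crosses_line, count_crossing, time_memchr, SLOTS) are made up
for illustration.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>
#include <string.h>
#include <time.h>

/* Assumed parameters: 64-byte cache lines, 16-byte vector loads,
   1024 test addresses per pattern, search size 15 as in the mail. */
enum { CACHE_LINE = 64, LOAD_WIDTH = 16, SLOTS = 1024, SIZE = 15 };

/* Does a LOAD_WIDTH-byte load starting at byte offset `off` (measured
   from a cache-line-aligned base) straddle two cache lines? */
static int crosses_line(size_t off)
{
    return off / CACHE_LINE != (off + LOAD_WIDTH - 1) / CACHE_LINE;
}

/* Count how many of the offsets stride*x + base, x = 0..SLOTS-1,
   start a load that crosses a cache line. */
static size_t count_crossing(size_t stride, size_t base)
{
    size_t n = 0;
    for (size_t x = 0; x < SLOTS; x++)
        n += (size_t)crosses_line(stride * x + base);
    return n;
}

/* Time `rounds` passes of memchr over every address of the pattern.
   Returns CPU seconds, or -1.0 if the buffer cannot be allocated. */
static double time_memchr(size_t stride, size_t base, int rounds)
{
    assert(stride > 0 && rounds > 0);

    /* Round the length up: aligned_alloc wants a multiple of the alignment. */
    size_t len = stride * SLOTS + base + SIZE;
    len = (len + CACHE_LINE - 1) / CACHE_LINE * CACHE_LINE;

    char *buf = aligned_alloc(CACHE_LINE, len);
    if (buf == NULL)
        return -1.0;
    memset(buf, 'a', len);          /* 'b' never occurs, so memchr scans all SIZE bytes */

    void *volatile sink = NULL;     /* keeps the calls from being optimized away */
    clock_t start = clock();
    for (int r = 0; r < rounds; r++)
        for (size_t x = 0; x < SLOTS; x++)
            sink = memchr(buf + stride * x + base, 'b', SIZE);
    clock_t end = clock();
    (void)sink;

    free(buf);
    return (double)(end - start) / CLOCKS_PER_SEC;
}
```

Under those assumptions, count_crossing(64, 15) gives 0 of 1024,
count_crossing(64, 63) gives 1024 of 1024, and count_crossing(16, 15)
gives 256 of 1024, so the same "alignment 15" can exercise the cheap
branch always, never, or three times out of four. Calling
time_memchr(64, 15, rounds) and so on with a large rounds value would give
timings to compare, but the absolute numbers depend on the machine and the
libc in use.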

Attachment: memchr.tar.bz2
Description: Binary data

