This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH v2] x86-64: Optimize strcmp/wcscmp with AVX2

From: Florian Weimer <fw at deneb dot enyo dot de>
To: Alexander Monakov <amonakov at ispras dot ru>
Cc: Leonardo Sandoval <leonardo dot sandoval dot gonzalez at linux dot intel dot com>, "H.J. Lu" <hjl dot tools at gmail dot com>, GNU C Library <libc-alpha at sourceware dot org>
Date: Sat, 02 Jun 2018 13:37:44 +0200
Subject: Re: [PATCH v2] x86-64: Optimize strcmp/wcscmp with AVX2
References: <20180529185339.11541-1-leonardo.sandoval.gonzalez@linux.intel.com> <CAMe9rOpKpR6pOLkxyMuTPBA1zSx4MmYYsTOwHz5pTxjdR57p1A@mail.gmail.com> <alpine.LNX.2.20.13.1806011824140.1892@monopod.intra.ispras.ru> <03bdf89c47880fd0734fc5b82213fc3c98eab372.camel@linux.intel.com> <alpine.LNX.2.20.13.1806021022140.1892@monopod.intra.ispras.ru>

* Alexander Monakov:

> this does not. The whole point was that frequency behavior means the
> slowdown on programs making *occasional* calls to strcmp will not be
> captured by microbenchmarks. What good is saving dozens of cycles on
> strcmp calls if the remaining program is slowed down by 5%?
>
> I was missing that AVX frequency limits kick in only if "heavy" operations
> are used -- on recent generations. I'm not sure that's true for older, e.g.
> Haswell, generations. Intel's white paper explaining Haswell AVX clocks
> makes no distinction of "light" vs. "heavy" operations:
>
> https://www.intel.com/content/dam/www/public/us/en/documents/white-papers/performance-xeon-e5-v3-advanced-vector-extensions-paper.pdf

This should be easy to measure.  Aren't there perf counters for that?
The CORE_POWER.LVL0_TURBO_LICENSE, CORE_POWER.LVL1_TURBO_LICENSE,
CORE_POWER.LVL2_TURBO_LICENSE counters?

Run the benchmark in parallel with itself, and then with other compute
loads, and see which of the counters increase?

References:
- Re: [PATCH v2] x86-64: Optimize strcmp/wcscmp with AVX2
  - From: H.J. Lu
- Re: [PATCH v2] x86-64: Optimize strcmp/wcscmp with AVX2
  - From: Alexander Monakov
- Re: [PATCH v2] x86-64: Optimize strcmp/wcscmp with AVX2
  - From: Leonardo Sandoval
- Re: [PATCH v2] x86-64: Optimize strcmp/wcscmp with AVX2
  - From: Alexander Monakov

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]