This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

RE: [PATCH 4/*] Generic string memchr and strnlen


> OndÅej BÃlka wrote: 
> On Tue, Jul 28, 2015 at 02:07:35PM +0100, Wilco Dijkstra wrote:
> > > OndÅej BÃlka wrote:
> > > On Mon, Jul 27, 2015 at 02:42:47PM -0400, Chris Metcalf wrote:
> > > > On 07/27/2015 12:56 PM, OndÅej BÃlka wrote:

> Then could you review a generic patch that I am about to ping?

Do you mean https://sourceware.org/ml/libc-alpha/2015-08/msg00443.html?
I don't see a patch attached...

> > > > So I'm not sure which point you are making but unless you know
> > > > something about the average distribution of the characters in the
> > > > strrchr() string to suggest they are likely to occur in the last third
> > > > of the string more than 50% of the time, I don't think I'm convinced.
> >
> > So what we really need is better statistics on strrchr and friends. Looking
> > at GLIBC sources, it seems that >90% search for '/' in a path.
> >
> > Do you have actual stats for strrchr OndÅej? That would really help solving
> > this issue. What I'd like to know is:
> >
> > 1. Average length of the strings
> > 2. What percentage fails to match
> > 3. Average number of matches per string if it matches at least once
> > 4. Average relative position within string of last match
> >
> > With that info we could create a simple patch for bench-strrchr.c to make
> > it use realistic inputs. Then based on that we can fix the generic code and
> > let maintainers further tune their optimized implementations if they do not
> > beat the generic code.
> >
> 
> Yes, I used dryrun for that. You don't need to model microbenchmark from
> that data which could go wrong in several ways but use dryrun to
> directly replay these.

Still bench-strrchr.c needs to be updated to use realistic inputs so people can
optimize for the right dataset. 

Wilco



Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]