This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
Re: [RFC] Statistics of non-ASCII characters in strings
- From: ams at gnu dot org (Alfred M. Szmidt)
- To: "Wilco Dijkstra" <wdijkstr at arm dot com>
- Cc: libc-alpha at sourceware dot org
- Date: Mon, 22 Dec 2014 10:06:20 -0500
- Subject: Re: [RFC] Statistics of non-ASCII characters in strings
- Authentication-results: sourceware.org; auth=none
- References: <001401d01df6$0f7cc5a0$2e7650e0$ at com>
- Reply-to: ams at gnu dot org
Does anyone have statistics of how often strings contain non-ASCII
characters? I'm asking because it's feasible to make many string
functions faster if they are predominantly ASCII by using a
different check for the null byte. So if say 80-90% of strings in
strcpy/strlen are ASCII then it would be well worth optimizing for
it.
Not the whole world is ASCII...
Pójdźże, kiÅ? tÄ? chmurnoÅ?Ä? w gÅ?Ä?b flaszy.