I think there is a huge performance regression in the strstr() function between glibc 2.7 and 2.10.
Our code uses strstr() heavily, and we noticed that it has become strangely slow.
I am attaching two files so you can test it:
$ gcc test-strstr.c -o test
$ time ./test 10000
Some results (User Time):
Ubuntu 8.04 : 10s (glibc 2.7)
Ubuntu 9.04 : 1m20s (glibc 2.10)
Ubuntu 10.04 : 1m20s (glibc 2.10)
Debian lenny : 8s (glibc 2.7)
Debian Squeeze : 1m10s (glibc 2.10)
The results are the same on AMD64 and i386 architectures.
Created attachment 4798 [details]
Created attachment 4799 [details]
In test-strstr.c, we just try to match a string at the end of the file test-strstr.txt.
I tested with needles both longer and shorter than 16 bytes; it doesn't change anything.
In glibc 2.7, the strstr() implementation has worst-case quadratic complexity of
O(m*n) (m the length of the haystack, n the length of the needle); and gnulib
introduced a test case demonstrating that this quadratic complexity could be used
as a denial-of-service attack on any program that searches for a large needle
within a large haystack. With glibc 2.10, strstr() has a guaranteed linear
complexity of O(m+n), but that reduced algorithmic complexity comes with a
larger startup cost of factoring the needle: the shorter the needle,
the larger the proportion of the overall strstr() time spent on that factoring.
Furthermore, the glibc 2.7 implementation has had years of tweaking to be
cache-line friendly and tuned for various architectures, while my 2.10
implementation is relatively new, and probably still has some room for
improvements (at any rate, I know that when I wrote it, I was catering to large
needles and large haystacks, and was focused on the algorithmic improvement,
rather than the cache-line effects of small needles).
With appropriate benchmarking, it may be worth altering the implementation to be
a hybrid, where short-enough needles are handled by the older brute-force
implementation: even though it is quadratic, the lower startup cost might win for
short needles, even if it results in more comparisons against the haystack for
some of them.
Furthermore, strstr() has a lousy interface when compared with regex searching.
That is, if you repeatedly search for the same needle, the cost of re_compile()
is paid only once, and you can reuse that compiled needle across multiple
re_search() calls. But with strstr(), the cost of factoring the needle is paid
on every call. Perhaps we should consider exposing some new interfaces that
separate the strstr() factorization startup code from the body that uses the
factored needle for the actual search.
We expect this to be fixed in 2.17.
Maxim's patches fix the performance regression for small needles -- which is what the testcase for this bug exercises. Ondrej's patches appear to further improve performance for x86.
(In reply to comment #5)
> We expect this to be fixed in 2.17.
> Maxim's patches  fix performance regression for small needles -- which is
> what the testcase for this bug exercises. Ondrej's patches  appear to
> further improve performance for x86.
>  http://sourceware.org/ml/libc-alpha/2012-05/msg01228.html
>  http://sourceware.org/ml/libc-alpha/2012-06/msg00027.html
Note that  was just a summary of Ondrej's benchmarking; searching for an actual patch of his turns up this link:
I thought this had already been checked in and fixed?
Hm, I thought I marked this FIXED. Apparently, I didn't. Thanks Carlos.