This is the mail archive of the
glibc-bugs@sourceware.org
mailing list for the glibc project.
[Bug regex/19376] New: regcomp.c needs to be upgraded to GNU Grep's one
- From: "t.rus76 at ya dot ru" <sourceware-bugzilla at sourceware dot org>
- To: glibc-bugs at sourceware dot org
- Date: Fri, 18 Dec 2015 09:26:14 +0000
- Subject: [Bug regex/19376] New: regcomp.c needs to be upgraded to GNU Grep's one
- Auto-submitted: auto-generated
https://sourceware.org/bugzilla/show_bug.cgi?id=19376
Bug ID: 19376
Summary: regcomp.c needs to be upgraded to GNU Grep's one
Product: glibc
Version: 2.22
Status: NEW
Severity: normal
Priority: P2
Component: regex
Assignee: unassigned at sourceware dot org
Reporter: t.rus76 at ya dot ru
CC: drepper.fsp at gmail dot com
Target Milestone: ---
Symptom: GNU Grep does not handle Syriac characters (U+0700 â U+074F) correctly
$ echo 'ÜÜÜÜ' > peace
$ egrep '\<[Ü-Ü]' peace
grep: Invalid collation character
$ awk /'\<[Ü-Ü]'/ peace
ÜÜÜÜ
However when grep is build with ./configure --with-included-regex
it works just fine and there is no REG_ECOLLATE error
$ echo ÜÜÜÜ | src/egrep [Ü-Ü]
ÜÜÜÜ
$ echo ÜÜÜÜ | src/egrep [Ü-Ü]
$
This is because GNU Grep contains improved version of regcomp.
The bus was found here:
http://forum.rosalab.ru/viewtopic.php?f=53&t=6219&p=54747 (in Russian)
It is tested and confirmed also on Gentoo (both glibc and grep are 2.22).
I expect there are other bugs that could be fixed with this upgrade.
--
You are receiving this mail because:
You are on the CC list for the bug.