This is the mail archive of the
glibc-bugs@sourceware.org
mailing list for the glibc project.
[Bug localedata/13063] New: Can not 'sort -u' all Chinese characters in CJK UNIFIED IDEOGRAPH EXTENSION A/B/C/D
- From: "an.euroford at gmail dot com" <sourceware-bugzilla at sourceware dot org>
- To: glibc-bugs at sources dot redhat dot com
- Date: Sat, 6 Aug 2011 17:21:20 +0000
- Subject: [Bug localedata/13063] New: Can not 'sort -u' all Chinese characters in CJK UNIFIED IDEOGRAPH EXTENSION A/B/C/D
- Auto-submitted: auto-generated
http://sourceware.org/bugzilla/show_bug.cgi?id=13063
Summary: Can not 'sort -u' all Chinese characters in CJK
UNIFIED IDEOGRAPH EXTENSION A/B/C/D
Product: glibc
Version: unspecified
Status: NEW
Severity: critical
Priority: P2
Component: localedata
AssignedTo: libc-locales@sources.redhat.com
ReportedBy: an.euroford@gmail.com
Hi,
Refer to glibc/localedata/locales/zh_CN and iso14651_t1_pinyin or
iso14651_t1, glibc just support unicode3.0.
The new version of unicode is 6.0, it extend CJK UNIFIED IDEOGRAPH with
extension A/B/C/D, and extension A is included in GB18030:2005( China
locale charset standard).
So at least, glibc should sort all Chinese characters in CJK UNIFIED IDEOGRAPH
and EXTENSIONA(U+3400-U+4DBF).
The real effect is sort -u.
If you execute sort -u examples_CJK_extensionA.txt (see attachment), you
will got only one Chinese character "ã".
Regards,
An Yang
--
Configure bugmail: http://sourceware.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.