This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
Is <BAS> required for 1st level difference?
- From: "Reshat Sabiq (Reşat)" <tatar dot iqtelif dot i18n at gmail dot com>
- To: libc-alpha at sourceware dot org
- Date: Mon, 09 Oct 2006 23:37:12 -0500
- Subject: Is <BAS> required for 1st level difference?
- Openpgp: id=262839AF; url=http://keyserver.veridis.com:11371
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
For instance, would the following be identified as separate letters
using collation level 1 and doing comparisons ignoring case (each of the
3 variations of i, and n tilde vs. n)?
reorder-after <h>
<i>
<i.>
<i'>
reorder-after <n>
<n?>
reorder-after <U0068>
<U0131> <i>;<BAS>;<MIN>;IGNORE
<U0069> <i.>;<PCT>;<MIN>;IGNORE
<U00ED> <i'>;<PCT>;<MIN>;IGNORE
reorder-after <U0048>
<U0049> <i>;<BAS>;<CAP>;IGNORE
<U0130> <i.>;<PCT>;<CAP>;IGNORE
<U01CD> <I'>;<PCT>;<CAP>;IGNORE
reorder-after <U006E>
<U00F1> <n?>;<TIL>;<MIN>;IGNORE
reorder-after <U004E>
<U00D1> <N?>;<TIL>;<CAP>;IGNORE
Or do i have to change <PCT> for i. and i', and <TIL> for n? to <BAS> to
achieve this?
Sorry if this is an obvious question, but i'm new to libc i18n, although
i have fixed an i18n bug in Java. I realize that if i spent some time on
writing a test i'd find it out for myself, but i'm hoping some feedback
would give me a shortcut.
P.S. If it helps, it Java the bug was:
- - + "& H < \u0131 , I , i , \u0130 "
The fix was:
+ "& H < \u0131 , I " // dotless i
+ + "& I < i , \u0130 " // dotted i
To me this appears to suggest that to avoid the bug like in Java, i have
to change <PCT> for i. and i', and <TIL> for n? to <BAS>. But i'm sure
some of you folks would be more qualified to tell.
P.P.S. Also, what is the difference between the first syntax and the
2nd, and are both required in a locale definition? Finally, what is the
meaning of ;IGNORE?
Thanks.
- --
My public GPG key (ID 0x262839AF) is at: http://keyserver.veridis.com:11371
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.2.1 (Cygwin)
iD8DBQFFKyN4O75ytyYoOa8RAkbSAKCCXKOfNK1VPtZV0hqiUFkYPb0tFQCfa8r3
6mSoRMk6+poSaN3ypw6fI20=
=0IRJ
-----END PGP SIGNATURE-----