This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Is <BAS> required for 1st level difference?


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

For instance, would the following be identified as separate letters
using collation level 1 and doing comparisons ignoring case (each of the
3 variations of i, and n tilde vs. n)?

reorder-after <h>
<i>
<i.>
<i'>
reorder-after <n>
<n?>

reorder-after <U0068>
<U0131> <i>;<BAS>;<MIN>;IGNORE
<U0069> <i.>;<PCT>;<MIN>;IGNORE
<U00ED> <i'>;<PCT>;<MIN>;IGNORE
reorder-after <U0048>
<U0049> <i>;<BAS>;<CAP>;IGNORE
<U0130> <i.>;<PCT>;<CAP>;IGNORE
<U01CD> <I'>;<PCT>;<CAP>;IGNORE

reorder-after <U006E>
<U00F1> <n?>;<TIL>;<MIN>;IGNORE
reorder-after <U004E>
<U00D1> <N?>;<TIL>;<CAP>;IGNORE

Or do i have to change <PCT> for i. and i', and <TIL> for n? to <BAS> to
achieve this?

Sorry if this is an obvious question, but i'm new to libc i18n, although
i have fixed an i18n bug in Java. I realize that if i spent some time on
writing a test i'd find it out for myself, but i'm hoping some feedback
would give me a shortcut.

P.S. If it helps, it Java the bug was:
- -                + "& H < \u0131 , I , i , \u0130 "
The fix was:
+ "& H < \u0131 , I "        // dotless i
+                + "& I < i , \u0130 "        // dotted i
To me this appears to suggest that to avoid the bug like in Java, i have
to change <PCT> for i. and i', and <TIL> for n? to <BAS>. But i'm sure
some of you folks would be more qualified to tell.
P.P.S. Also, what is the difference between the first syntax and the
2nd, and are both required in a locale definition? Finally, what is the
meaning of ;IGNORE?

Thanks.

- --
My public GPG key (ID 0x262839AF) is at: http://keyserver.veridis.com:11371
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.2.1 (Cygwin)

iD8DBQFFKyN4O75ytyYoOa8RAkbSAKCCXKOfNK1VPtZV0hqiUFkYPb0tFQCfa8r3
6mSoRMk6+poSaN3ypw6fI20=
=0IRJ
-----END PGP SIGNATURE-----


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]