This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
[Patch 0/13] [BZ #14095] update collation data from Unicode / ISO 14651
- From: Mike FABIAN <mfabian at redhat dot com>
- To: libc-alpha at sourceware dot org
- Date: Fri, 26 Jan 2018 11:51:15 +0100
- Subject: [Patch 0/13] [BZ #14095] update collation data from Unicode / ISO 14651
- Authentication-results: sourceware.org; auth=none
This set of patches updates our
glibc/localedata/locales/iso14651_t1_common file to the latest
available version from ISO and adapts the collation rules of all
locales using “copy "iso14651_t1"” to the changes in the new file.
The ISO standard 14651:2016 is available here:
ISO/IEC 14651:2016: https://www.iso.org/standard/68309.html
And a POSIX style LC_COLLATE file is downloadable from:
http://standards.iso.org/ittf/PubliclyAvailableStandards/index.html
http://standards.iso.org/ittf/PubliclyAvailableStandards/c068309_ISO_IEC_14651_2016.zip
This .zip file contains a ISO14651_2017_TABLE1_en.txt which is in a
similar format as our current iso14651_t1_common and can be used as an
update.
That file is unfortunately up-to-date only with Unicode 8.0.0,
but that is already a huge improvement over what we have now.
Also, that file contained some errors which needed to be fixed.
Seems strange for a file release by ISO, but it really contained
some errors.
And as the names for most collation symbols have been changed, all the
collation rules of locales using “copy "iso14651_t1"” needed to be
updated.
While doing that, I made the collation rules of all locales I touched
agree with the CLDR collation rules. glibc has several locales which are
not in CLDR, for these I just adapted the existing rules.
--
Mike FABIAN <mfabian@redhat.com>