This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.



[Patch 0/13] [BZ #14095] update collation data from Unicode / ISO 14651


This set of patches updates our
glibc/localedata/locales/iso14651_t1_common file to the latest
available version from ISO and adapts the collation rules of all
locales using “copy "iso14651_t1"” to the changes in the new file.

The ISO standard 14651:2016 is available here:

ISO/IEC 14651:2016: https://www.iso.org/standard/68309.html

A POSIX-style LC_COLLATE file can be downloaded from:

http://standards.iso.org/ittf/PubliclyAvailableStandards/index.html
http://standards.iso.org/ittf/PubliclyAvailableStandards/c068309_ISO_IEC_14651_2016.zip

This .zip file contains ISO14651_2017_TABLE1_en.txt, which is in a
format similar to our current iso14651_t1_common and can be used as an
update.

That file is unfortunately up-to-date only with Unicode 8.0.0,
but that is already a huge improvement over what we have now.

Also, that file contained some errors which needed to be fixed.
That seems strange for a file released by ISO, but the errors were
real.

And as the names for most collation symbols have been changed, all the
collation rules of locales using “copy "iso14651_t1"” needed to be
updated.
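
To give an idea of which locales are affected, here is a minimal
sketch (not part of the patch set) that lists the locale source files
containing the copy "iso14651_t1" directive. The function name and the
directory argument are made up for illustration; in a glibc checkout
the directory would presumably be localedata/locales.

```python
import re
from pathlib import Path

# Matches a line consisting only of the directive: copy "iso14651_t1"
# The closing quote right after "t1" keeps copy "iso14651_t1_common"
# from matching, and lines starting with a comment character are skipped
# because the directive must be the first thing on the line.
COPY_RE = re.compile(r'^\s*copy\s+"iso14651_t1"\s*$', re.MULTILINE)


def locales_copying_iso14651(locales_dir):
    """Return the sorted names of files in locales_dir that contain
    the copy "iso14651_t1" directive (hypothetical helper)."""
    names = []
    for path in sorted(Path(locales_dir).iterdir()):
        if not path.is_file():
            continue
        # Be tolerant of odd bytes; only the directive line matters here.
        text = path.read_text(encoding="utf-8", errors="replace")
        if COPY_RE.search(text):
            names.append(path.name)
    return names
```

Calling locales_copying_iso14651("localedata/locales") from the top of
a source tree should print roughly the set of locales whose collation
rules needed adapting, assuming that directory layout.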

While doing that, I made the collation rules of all locales I touched
agree with the CLDR collation rules. glibc has several locales which are
not in CLDR; for these I just adapted the existing rules.

-- 
Mike FABIAN <mfabian@redhat.com>

