This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
Re: [Patch v3 13/14] [BZ #14095] update collation data from Unicode / ISO 14651
- From: Carlos O'Donell <carlos at redhat dot com>
- To: Mike FABIAN <mfabian at redhat dot com>, libc-alpha at sourceware dot org
- Cc: "Dmitry V. Levin" <ldv at altlinux dot org>
- Date: Fri, 23 Feb 2018 22:21:28 -0800
- Subject: Re: [Patch v3 13/14] [BZ #14095] update collation data from Unicode / ISO 14651
- Authentication-results: sourceware.org; auth=none
- References: <s9dzi40atop.fsf@taka.site>
On 02/23/2018 02:25 AM, Mike FABIAN wrote:
> From a7ae7090076899ba2c648ceaf2dbd5248dc56041 Mon Sep 17 00:00:00 2001
> From: Mike FABIAN <mfabian@redhat.com>
> Date: Mon, 11 Dec 2017 18:26:22 +0100
> Subject: [PATCH 13/14] Adapt collation in several locales to the new
> iso14651_t1_common file
> MIME-Version: 1.0
> Content-Type: text/plain; charset=UTF-8
> Content-Transfer-Encoding: 8bit
Mike,
This is truly a prolific amount of work. Really really awesome work
with these additional tests input data! You even fixed my pet peeve
that the list of input locales was not sorted 1-per-line-alpha :-)
I spot checked some of these, and they look good to me. Given the
errors in the upstream ISO documents though we'll see how good the
quality of the data is, but we'll rely on our users to tell us that
and feed this data back to the national experts.
Reviewed-by: Carlos O'Donell <carlos@redhat.com>
> [BZ #22550] - es_ES locale (and other es_* locales): collation should treat ñ as a primary different character, sync the collation for Spanish with CLDR
> [BZ #21547] - Tibetan script collation broken (Dzongkha and Tibetan)
>
> * localedata/Makefile: Add new test files.
> * localedata/lv_LV.UTF-8.in: Adapt test file to new collation order.
> * localedata/sv_SE.ISO-8859-1.in: Adapt test file to new collation order.
> * localedata/uk_UA.UTF-8.in: Adapt test file to new collation order.
> * localedata/am_ET.UTF-8.in: New test file.
> * localedata/az_AZ.UTF-8.in: Likewise.
> * localedata/be_BY.UTF-8.in: Likewise.
> * localedata/ber_DZ.UTF-8.in: Likewise.
> * localedata/ber_MA.UTF-8.in: Likewise.
> * localedata/bg_BG.UTF-8.in: Likewise.
> * localedata/br_FR.UTF-8.in: Likewise.
> * localedata/cmn_TW.UTF-8.in: Likewise.
> * localedata/crh_UA.UTF-8.in: Likewise.
> * localedata/csb_PL.UTF-8.in: Likewise.
> * localedata/cv_RU.UTF-8.in: Likewise.
> * localedata/cy_GB.UTF-8.in: Likewise.
> * localedata/dz_BT.UTF-8.in: Likewise.
> * localedata/eo.UTF-8.in: Likewise.
> * localedata/es_ES.UTF-8.in: Likewise.
> * localedata/fa_IR.UTF-8.in: Likewise.
> * localedata/fi_FI.UTF-8.in: Likewise.
> * localedata/fil_PH.UTF-8.in: Likewise.
> * localedata/fur_IT.UTF-8.in: Likewise.
> * localedata/gez_ER.UTF-8@abegede.in: Likewise.
> * localedata/ha_NG.UTF-8.in: Likewise.
> * localedata/ig_NG.UTF-8.in: Likewise.
> * localedata/ik_CA.UTF-8.in: Likewise.
> * localedata/kk_KZ.UTF-8.in: Likewise.
> * localedata/ku_TR.UTF-8.in: Likewise.
> * localedata/ky_KG.UTF-8.in: Likewise.
> * localedata/ln_CD.UTF-8.in: Likewise.
> * localedata/mi_NZ.UTF-8.in: Likewise.
> * localedata/ml_IN.UTF-8.in: Likewise.
> * localedata/mn_MN.UTF-8.in: Likewise.
> * localedata/mr_IN.UTF-8.in: Likewise.
> * localedata/mt_MT.UTF-8.in: Likewise.
> * localedata/nb_NO.UTF-8.in: Likewise.
> * localedata/om_KE.UTF-8.in: Likewise.
> * localedata/os_RU.UTF-8.in: Likewise.
> * localedata/ps_AF.UTF-8.in: Likewise.
> * localedata/ro_RO.UTF-8.in: Likewise.
> * localedata/ru_RU.UTF-8.in: Likewise.
> * localedata/sc_IT.UTF-8.in: Likewise.
> * localedata/se_NO.UTF-8.in: Likewise.
> * localedata/sq_AL.UTF-8.in: Likewise.
> * localedata/sv_SE.UTF-8.in: Likewise.
> * localedata/szl_PL.UTF-8.in: Likewise.
> * localedata/tg_TJ.UTF-8.in: Likewise.
> * localedata/tk_TM.UTF-8.in: Likewise.
> * localedata/tt_RU.UTF-8.in: Likewise.
> * localedata/tt_RU.UTF-8@iqtelif.in: Likewise.
> * localedata/ug_CN.UTF-8.in: Likewise.
> * localedata/uz_UZ.UTF-8.in: Likewise.
> * localedata/vi_VN.UTF-8.in: Likewise.
> * localedata/yi_US.UTF-8.in: Likewise.
> * localedata/yo_NG.UTF-8.in: Likewise.
> * localedata/zh_CN.UTF-8.in: Likewise.
> * localedata/locales/am_ET: Adapt collation rules to new iso14651_t1_common
> file and fix bugs in the collation.
> * localedata/locales/az_AZ: Likewise.
> * localedata/locales/be_BY: Likewise.
> * localedata/locales/ber_DZ: Likewise.
> * localedata/locales/ber_MA: Likewise.
> * localedata/locales/bg_BG: Likewise.
> * localedata/locales/br_FR: Likewise.
> * localedata/locales/br_FR@euro: Likewise.
> * localedata/locales/ca_ES: Likewise.
> * localedata/locales/cns11643_stroke: Likewise.
> * localedata/locales/crh_UA: Likewise.
> * localedata/locales/cs_CZ: Likewise.
> * localedata/locales/csb_PL: Likewise.
> * localedata/locales/cv_RU: Likewise.
> * localedata/locales/cy_GB: Likewise.
> * localedata/locales/da_DK: Likewise.
> * localedata/locales/dz_BT: Likewise.
> * localedata/locales/en_CA: Likewise.
> * localedata/locales/eo: Likewise.
> * localedata/locales/es_CU: Likewise.
> * localedata/locales/es_EC: Likewise.
> * localedata/locales/es_ES: Likewise.
> * localedata/locales/es_US: Likewise.
> * localedata/locales/et_EE: Likewise.
> * localedata/locales/fa_IR: Likewise.
> * localedata/locales/fi_FI: Likewise.
> * localedata/locales/fil_PH: Likewise.
> * localedata/locales/fur_IT: Likewise.
> * localedata/locales/gez_ER@abegede: Likewise.
> * localedata/locales/ha_NG: Likewise.
> * localedata/locales/hr_HR: Likewise.
> * localedata/locales/hsb_DE: Likewise.
> * localedata/locales/hu_HU: Likewise.
> * localedata/locales/ig_NG: Likewise.
> * localedata/locales/ik_CA: Likewise.
> * localedata/locales/is_IS: Likewise.
> * localedata/locales/iso14651_t1_common: Likewise.
> * localedata/locales/iso14651_t1_pinyin: Likewise.
> * localedata/locales/kk_KZ: Likewise.
> * localedata/locales/ku_TR: Likewise.
> * localedata/locales/ky_KG: Likewise.
> * localedata/locales/ln_CD: Likewise.
> * localedata/locales/lt_LT: Likewise.
> * localedata/locales/lv_LV: Likewise.
> * localedata/locales/mi_NZ: Likewise.
> * localedata/locales/ml_IN: Likewise.
> * localedata/locales/mn_MN: Likewise.
> * localedata/locales/mr_IN: Likewise.
> * localedata/locales/mt_MT: Likewise.
> * localedata/locales/nb_NO: Likewise.
> * localedata/locales/om_KE: Likewise.
> * localedata/locales/os_RU: Likewise.
> * localedata/locales/pl_PL: Likewise.
> * localedata/locales/ps_AF: Likewise.
> * localedata/locales/ro_RO: Likewise.
> * localedata/locales/ru_RU: Likewise.
> * localedata/locales/ru_UA: Likewise.
> * localedata/locales/sc_IT: Likewise.
> * localedata/locales/se_NO: Likewise.
> * localedata/locales/si_LK: Likewise.
> * localedata/locales/sq_AL: Likewise.
> * localedata/locales/sv_FI: Likewise.
> * localedata/locales/sv_FI@euro: Likewise.
> * localedata/locales/sv_SE: Likewise.
> * localedata/locales/szl_PL: Likewise.
> * localedata/locales/tg_TJ: Likewise.
> * localedata/locales/ti_ER: Likewise.
> * localedata/locales/tk_TM: Likewise.
> * localedata/locales/tl_PH: Likewise.
> * localedata/locales/tr_TR: Likewise.
> * localedata/locales/tt_RU: Likewise.
> * localedata/locales/tt_RU@iqtelif: Likewise.
> * localedata/locales/ug_CN: Likewise.
> * localedata/locales/uk_UA: Likewise.
> * localedata/locales/uz_UZ: Likewise.
> * localedata/locales/uz_UZ@cyrillic: Likewise.
> * localedata/locales/vi_VN: Likewise.
> * localedata/locales/yi_US: Likewise.
> * localedata/locales/yo_NG: Likewise.
> ---
> localedata/Makefile | 185 +-
> localedata/am_ET.UTF-8.in | 347 +
> localedata/az_AZ.UTF-8.in | 73 +
> localedata/be_BY.UTF-8.in | 16 +
> localedata/ber_DZ.UTF-8.in | 50 +
> localedata/ber_MA.UTF-8.in | 13 +
> localedata/bg_BG.UTF-8.in | 57 +
> localedata/br_FR.UTF-8.in | 15 +
> localedata/cmn_TW.UTF-8.in | 76319 ++++++++++++++++++++++++++++++++
> localedata/crh_UA.UTF-8.in | 50 +
> localedata/csb_PL.UTF-8.in | 70 +
> localedata/cv_RU.UTF-8.in | 45 +
> localedata/cy_GB.UTF-8.in | 72 +
> localedata/dz_BT.UTF-8.in | 789 +
> localedata/eo.UTF-8.in | 32 +
> localedata/es_ES.UTF-8.in | 46 +
> localedata/fa_IR.UTF-8.in | 71 +
> localedata/fi_FI.UTF-8.in | 140 +
> localedata/fil_PH.UTF-8.in | 16 +
> localedata/fur_IT.UTF-8.in | 12 +
> localedata/gez_ER.UTF-8@abegede.in | 365 +
> localedata/ha_NG.UTF-8.in | 47 +
> localedata/ig_NG.UTF-8.in | 93 +
> localedata/ik_CA.UTF-8.in | 60 +
> localedata/kk_KZ.UTF-8.in | 40 +
> localedata/ku_TR.UTF-8.in | 52 +
> localedata/ky_KG.UTF-8.in | 72 +
> localedata/ln_CD.UTF-8.in | 18 +
> localedata/locales/am_ET | 551 +-
> localedata/locales/az_AZ | 201 +-
> localedata/locales/be_BY | 41 +-
> localedata/locales/ber_DZ | 173 +-
> localedata/locales/ber_MA | 42 +-
> localedata/locales/bg_BG | 290 +-
> localedata/locales/br_FR | 55 +-
> localedata/locales/br_FR@euro | 3 +-
> localedata/locales/ca_ES | 16 +-
> localedata/locales/cns11643_stroke | 9 +-
> localedata/locales/crh_UA | 111 +-
> localedata/locales/cs_CZ | 69 +-
> localedata/locales/csb_PL | 83 +-
> localedata/locales/cv_RU | 75 +-
> localedata/locales/cy_GB | 242 +-
> localedata/locales/da_DK | 110 +-
> localedata/locales/dz_BT | 2484 +-
> localedata/locales/en_CA | 8 -
> localedata/locales/eo | 69 +-
> localedata/locales/es_CU | 3 +-
> localedata/locales/es_EC | 2 +-
> localedata/locales/es_ES | 49 +-
> localedata/locales/es_US | 56 +-
> localedata/locales/et_EE | 31 +-
> localedata/locales/fa_IR | 289 +-
> localedata/locales/fi_FI | 173 +-
> localedata/locales/fil_PH | 57 +-
> localedata/locales/fur_IT | 15 +-
> localedata/locales/gez_ER@abegede | 409 +-
> localedata/locales/ha_NG | 165 +-
> localedata/locales/hr_HR | 84 +-
> localedata/locales/hsb_DE | 64 +-
> localedata/locales/hu_HU | 298 +-
> localedata/locales/ig_NG | 453 +-
> localedata/locales/ik_CA | 153 +-
> localedata/locales/is_IS | 72 +-
> localedata/locales/iso14651_t1_pinyin | 9 +-
> localedata/locales/kk_KZ | 132 +-
> localedata/locales/ku_TR | 89 +-
> localedata/locales/ky_KG | 63 +-
> localedata/locales/ln_CD | 51 +-
> localedata/locales/lt_LT | 52 +-
> localedata/locales/lv_LV | 67 +-
> localedata/locales/mi_NZ | 43 +-
> localedata/locales/ml_IN | 158 +-
> localedata/locales/mn_MN | 34 +-
> localedata/locales/mr_IN | 76 +-
> localedata/locales/mt_MT | 144 +-
> localedata/locales/nan_TW@latin | 33 +-
> localedata/locales/nb_NO | 114 +-
> localedata/locales/om_KE | 124 +-
> localedata/locales/os_RU | 14 +-
> localedata/locales/pl_PL | 66 +-
> localedata/locales/ps_AF | 224 +-
> localedata/locales/ro_RO | 99 +-
> localedata/locales/ru_RU | 24 +-
> localedata/locales/ru_UA | 16 +-
> localedata/locales/sc_IT | 15 +-
> localedata/locales/se_NO | 298 +-
> localedata/locales/si_LK | 42 +
> localedata/locales/sq_AL | 291 +-
> localedata/locales/sv_FI | 2 +-
> localedata/locales/sv_FI@euro | 2 +-
> localedata/locales/sv_SE | 109 +-
> localedata/locales/szl_PL | 86 +-
> localedata/locales/tg_TJ | 106 +-
> localedata/locales/ti_ER | 2 +
> localedata/locales/tk_TM | 399 +-
> localedata/locales/tl_PH | 31 +-
> localedata/locales/tr_TR | 47 +-
> localedata/locales/tt_RU | 244 +-
> localedata/locales/tt_RU@iqtelif | 14 +-
> localedata/locales/ug_CN | 198 +-
> localedata/locales/uk_UA | 493 +-
> localedata/locales/uz_UZ | 127 +-
> localedata/locales/uz_UZ@cyrillic | 56 +-
> localedata/locales/vi_VN | 242 +-
> localedata/locales/yi_US | 125 +-
> localedata/locales/yo_NG | 365 +-
> localedata/lv_LV.UTF-8.in | 6 +-
> localedata/mi_NZ.UTF-8.in | 37 +
> localedata/ml_IN.UTF-8.in | 25 +
> localedata/mn_MN.UTF-8.in | 15 +
> localedata/mr_IN.UTF-8.in | 9 +
> localedata/mt_MT.UTF-8.in | 39 +
> localedata/nan_TW.UTF-8@latin.in | 11 +
> localedata/nb_NO.UTF-8.in | 66 +
> localedata/om_KE.UTF-8.in | 36 +
> localedata/os_RU.UTF-8.in | 9 +
> localedata/ps_AF.UTF-8.in | 61 +
> localedata/ro_RO.UTF-8.in | 32 +
> localedata/ru_RU.UTF-8.in | 15 +
> localedata/sc_IT.UTF-8.in | 12 +
> localedata/se_NO.UTF-8.in | 144 +
> localedata/sq_AL.UTF-8.in | 82 +
> localedata/sv_SE.ISO-8859-1.in | 10 +-
> localedata/sv_SE.UTF-8.in | 107 +
> localedata/szl_PL.UTF-8.in | 49 +
> localedata/tg_TJ.UTF-8.in | 105 +
> localedata/tk_TM.UTF-8.in | 213 +
> localedata/tt_RU.UTF-8.in | 194 +
> localedata/tt_RU.UTF-8@iqtelif.in | 53 +
> localedata/ug_CN.UTF-8.in | 16 +
> localedata/uk_UA.UTF-8.in | 14 +
> localedata/uz_UZ.UTF-8.in | 26 +
> localedata/vi_VN.UTF-8.in | 45 +
> localedata/yi_US.UTF-8.in | 39 +
> localedata/yo_NG.UTF-8.in | 30 +
> localedata/zh_CN.UTF-8.in | 25498 +++++++++++
> 137 files changed, 112596 insertions(+), 5498 deletions(-)
> create mode 100644 localedata/am_ET.UTF-8.in
> create mode 100644 localedata/az_AZ.UTF-8.in
> create mode 100644 localedata/be_BY.UTF-8.in
> create mode 100644 localedata/ber_DZ.UTF-8.in
> create mode 100644 localedata/ber_MA.UTF-8.in
> create mode 100644 localedata/bg_BG.UTF-8.in
> create mode 100644 localedata/br_FR.UTF-8.in
> create mode 100644 localedata/cmn_TW.UTF-8.in
> create mode 100644 localedata/crh_UA.UTF-8.in
> create mode 100644 localedata/csb_PL.UTF-8.in
> create mode 100644 localedata/cv_RU.UTF-8.in
> create mode 100644 localedata/cy_GB.UTF-8.in
> create mode 100644 localedata/dz_BT.UTF-8.in
> create mode 100644 localedata/eo.UTF-8.in
> create mode 100644 localedata/es_ES.UTF-8.in
> create mode 100644 localedata/fa_IR.UTF-8.in
> create mode 100644 localedata/fi_FI.UTF-8.in
> create mode 100644 localedata/fil_PH.UTF-8.in
> create mode 100644 localedata/fur_IT.UTF-8.in
> create mode 100644 localedata/gez_ER.UTF-8@abegede.in
> create mode 100644 localedata/ha_NG.UTF-8.in
> create mode 100644 localedata/ig_NG.UTF-8.in
> create mode 100644 localedata/ik_CA.UTF-8.in
> create mode 100644 localedata/kk_KZ.UTF-8.in
> create mode 100644 localedata/ku_TR.UTF-8.in
> create mode 100644 localedata/ky_KG.UTF-8.in
> create mode 100644 localedata/ln_CD.UTF-8.in
> create mode 100644 localedata/mi_NZ.UTF-8.in
> create mode 100644 localedata/ml_IN.UTF-8.in
> create mode 100644 localedata/mn_MN.UTF-8.in
> create mode 100644 localedata/mr_IN.UTF-8.in
> create mode 100644 localedata/mt_MT.UTF-8.in
> create mode 100644 localedata/nan_TW.UTF-8@latin.in
> create mode 100644 localedata/nb_NO.UTF-8.in
> create mode 100644 localedata/om_KE.UTF-8.in
> create mode 100644 localedata/os_RU.UTF-8.in
> create mode 100644 localedata/ps_AF.UTF-8.in
> create mode 100644 localedata/ro_RO.UTF-8.in
> create mode 100644 localedata/ru_RU.UTF-8.in
> create mode 100644 localedata/sc_IT.UTF-8.in
> create mode 100644 localedata/se_NO.UTF-8.in
> create mode 100644 localedata/sq_AL.UTF-8.in
> create mode 100644 localedata/sv_SE.UTF-8.in
> create mode 100644 localedata/szl_PL.UTF-8.in
> create mode 100644 localedata/tg_TJ.UTF-8.in
> create mode 100644 localedata/tk_TM.UTF-8.in
> create mode 100644 localedata/tt_RU.UTF-8.in
> create mode 100644 localedata/tt_RU.UTF-8@iqtelif.in
> create mode 100644 localedata/ug_CN.UTF-8.in
> create mode 100644 localedata/uz_UZ.UTF-8.in
> create mode 100644 localedata/vi_VN.UTF-8.in
> create mode 100644 localedata/yi_US.UTF-8.in
> create mode 100644 localedata/yo_NG.UTF-8.in
> create mode 100644 localedata/zh_CN.UTF-8.in
--
Cheers,
Carlos.