This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.



Re: [Patch v3 13/14] [BZ #14095] update collation data from Unicode / ISO 14651


On 02/23/2018 02:25 AM, Mike FABIAN wrote:
> From a7ae7090076899ba2c648ceaf2dbd5248dc56041 Mon Sep 17 00:00:00 2001
> From: Mike FABIAN <mfabian@redhat.com>
> Date: Mon, 11 Dec 2017 18:26:22 +0100
> Subject: [PATCH 13/14] Adapt collation in several locales to the new
>  iso14651_t1_common file
> MIME-Version: 1.0
> Content-Type: text/plain; charset=UTF-8
> Content-Transfer-Encoding: 8bit

Mike,

This is truly a prodigious amount of work. Really awesome work on
the additional test input data! You even fixed my pet peeve that the
list of input locales was not sorted alphabetically, one per line :-)

I spot-checked some of these, and they look good to me. Given the
errors in the upstream ISO documents, though, we'll have to see how
good the data really is; we'll rely on our users to tell us and feed
their reports back to the national experts.
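
For anyone who wants to repeat such a spot check, here is a minimal
sketch in Python. It assumes that each localedata/*.in test file lists
its lines in the expected collation order, and that the matching locale
is installed on the machine; the helper name first_misordered is
hypothetical and not part of glibc or its test suite.

```python
import locale


def first_misordered(lines, key=locale.strxfrm):
    """Return the index of the first line that sorts before its
    predecessor under `key`, or None if the list is already in order."""
    for i in range(1, len(lines)):
        if key(lines[i - 1]) > key(lines[i]):
            return i
    return None


def check_file(path, locale_name):
    """Check one test input file against an installed locale.

    Raises locale.Error if `locale_name` is not available.
    """
    locale.setlocale(locale.LC_COLLATE, locale_name)
    with open(path, encoding="utf-8") as handle:
        lines = [line.rstrip("\n") for line in handle]
    return first_misordered(lines)
```

Something like check_file("localedata/es_ES.UTF-8.in", "es_ES.UTF-8")
would then return None when the file agrees with the installed collation
data, or the index of the first line that is out of order. This only
compares adjacent lines and is no substitute for the real tests.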

Reviewed-by: Carlos O'Donell <carlos@redhat.com>

> [BZ #22550] - es_ES locale (and other es_* locales): collation should treat ñ as a primary different character, sync the collation for Spanish with CLDR
> [BZ #21547] - Tibetan script collation broken (Dzongkha and Tibetan)
> 
> 	* localedata/Makefile: Add new test files.
> 	* localedata/lv_LV.UTF-8.in: Adapt test file to new collation order.
> 	* localedata/sv_SE.ISO-8859-1.in: Adapt test file to new collation order.
> 	* localedata/uk_UA.UTF-8.in: Adapt test file to new collation order.
> 	* localedata/am_ET.UTF-8.in: New test file.
> 	* localedata/az_AZ.UTF-8.in: Likewise.
> 	* localedata/be_BY.UTF-8.in: Likewise.
> 	* localedata/ber_DZ.UTF-8.in: Likewise.
> 	* localedata/ber_MA.UTF-8.in: Likewise.
> 	* localedata/bg_BG.UTF-8.in: Likewise.
> 	* localedata/br_FR.UTF-8.in: Likewise.
> 	* localedata/cmn_TW.UTF-8.in: Likewise.
> 	* localedata/crh_UA.UTF-8.in: Likewise.
> 	* localedata/csb_PL.UTF-8.in: Likewise.
> 	* localedata/cv_RU.UTF-8.in: Likewise.
> 	* localedata/cy_GB.UTF-8.in: Likewise.
> 	* localedata/dz_BT.UTF-8.in: Likewise.
> 	* localedata/eo.UTF-8.in: Likewise.
> 	* localedata/es_ES.UTF-8.in: Likewise.
> 	* localedata/fa_IR.UTF-8.in: Likewise.
> 	* localedata/fi_FI.UTF-8.in: Likewise.
> 	* localedata/fil_PH.UTF-8.in: Likewise.
> 	* localedata/fur_IT.UTF-8.in: Likewise.
> 	* localedata/gez_ER.UTF-8@abegede.in: Likewise.
> 	* localedata/ha_NG.UTF-8.in: Likewise.
> 	* localedata/ig_NG.UTF-8.in: Likewise.
> 	* localedata/ik_CA.UTF-8.in: Likewise.
> 	* localedata/kk_KZ.UTF-8.in: Likewise.
> 	* localedata/ku_TR.UTF-8.in: Likewise.
> 	* localedata/ky_KG.UTF-8.in: Likewise.
> 	* localedata/ln_CD.UTF-8.in: Likewise.
> 	* localedata/mi_NZ.UTF-8.in: Likewise.
> 	* localedata/ml_IN.UTF-8.in: Likewise.
> 	* localedata/mn_MN.UTF-8.in: Likewise.
> 	* localedata/mr_IN.UTF-8.in: Likewise.
> 	* localedata/mt_MT.UTF-8.in: Likewise.
> 	* localedata/nb_NO.UTF-8.in: Likewise.
> 	* localedata/om_KE.UTF-8.in: Likewise.
> 	* localedata/os_RU.UTF-8.in: Likewise.
> 	* localedata/ps_AF.UTF-8.in: Likewise.
> 	* localedata/ro_RO.UTF-8.in: Likewise.
> 	* localedata/ru_RU.UTF-8.in: Likewise.
> 	* localedata/sc_IT.UTF-8.in: Likewise.
> 	* localedata/se_NO.UTF-8.in: Likewise.
> 	* localedata/sq_AL.UTF-8.in: Likewise.
> 	* localedata/sv_SE.UTF-8.in: Likewise.
> 	* localedata/szl_PL.UTF-8.in: Likewise.
> 	* localedata/tg_TJ.UTF-8.in: Likewise.
> 	* localedata/tk_TM.UTF-8.in: Likewise.
> 	* localedata/tt_RU.UTF-8.in: Likewise.
> 	* localedata/tt_RU.UTF-8@iqtelif.in: Likewise.
> 	* localedata/ug_CN.UTF-8.in: Likewise.
> 	* localedata/uz_UZ.UTF-8.in: Likewise.
> 	* localedata/vi_VN.UTF-8.in: Likewise.
> 	* localedata/yi_US.UTF-8.in: Likewise.
> 	* localedata/yo_NG.UTF-8.in: Likewise.
> 	* localedata/zh_CN.UTF-8.in: Likewise.
> 	* localedata/locales/am_ET: Adapt collation rules to new iso14651_t1_common
>         file and fix bugs in the collation.
> 	* localedata/locales/az_AZ: Likewise.
> 	* localedata/locales/be_BY: Likewise.
> 	* localedata/locales/ber_DZ: Likewise.
> 	* localedata/locales/ber_MA: Likewise.
> 	* localedata/locales/bg_BG: Likewise.
> 	* localedata/locales/br_FR: Likewise.
> 	* localedata/locales/br_FR@euro: Likewise.
> 	* localedata/locales/ca_ES: Likewise.
> 	* localedata/locales/cns11643_stroke: Likewise.
> 	* localedata/locales/crh_UA: Likewise.
> 	* localedata/locales/cs_CZ: Likewise.
> 	* localedata/locales/csb_PL: Likewise.
> 	* localedata/locales/cv_RU: Likewise.
> 	* localedata/locales/cy_GB: Likewise.
> 	* localedata/locales/da_DK: Likewise.
> 	* localedata/locales/dz_BT: Likewise.
> 	* localedata/locales/en_CA: Likewise.
> 	* localedata/locales/eo: Likewise.
> 	* localedata/locales/es_CU: Likewise.
> 	* localedata/locales/es_EC: Likewise.
> 	* localedata/locales/es_ES: Likewise.
> 	* localedata/locales/es_US: Likewise.
> 	* localedata/locales/et_EE: Likewise.
> 	* localedata/locales/fa_IR: Likewise.
> 	* localedata/locales/fi_FI: Likewise.
> 	* localedata/locales/fil_PH: Likewise.
> 	* localedata/locales/fur_IT: Likewise.
> 	* localedata/locales/gez_ER@abegede: Likewise.
> 	* localedata/locales/ha_NG: Likewise.
> 	* localedata/locales/hr_HR: Likewise.
> 	* localedata/locales/hsb_DE: Likewise.
> 	* localedata/locales/hu_HU: Likewise.
> 	* localedata/locales/ig_NG: Likewise.
> 	* localedata/locales/ik_CA: Likewise.
> 	* localedata/locales/is_IS: Likewise.
> 	* localedata/locales/iso14651_t1_common: Likewise.
> 	* localedata/locales/iso14651_t1_pinyin: Likewise.
> 	* localedata/locales/kk_KZ: Likewise.
> 	* localedata/locales/ku_TR: Likewise.
> 	* localedata/locales/ky_KG: Likewise.
> 	* localedata/locales/ln_CD: Likewise.
> 	* localedata/locales/lt_LT: Likewise.
> 	* localedata/locales/lv_LV: Likewise.
> 	* localedata/locales/mi_NZ: Likewise.
> 	* localedata/locales/ml_IN: Likewise.
> 	* localedata/locales/mn_MN: Likewise.
> 	* localedata/locales/mr_IN: Likewise.
> 	* localedata/locales/mt_MT: Likewise.
> 	* localedata/locales/nb_NO: Likewise.
> 	* localedata/locales/om_KE: Likewise.
> 	* localedata/locales/os_RU: Likewise.
> 	* localedata/locales/pl_PL: Likewise.
> 	* localedata/locales/ps_AF: Likewise.
> 	* localedata/locales/ro_RO: Likewise.
> 	* localedata/locales/ru_RU: Likewise.
> 	* localedata/locales/ru_UA: Likewise.
> 	* localedata/locales/sc_IT: Likewise.
> 	* localedata/locales/se_NO: Likewise.
> 	* localedata/locales/si_LK: Likewise.
> 	* localedata/locales/sq_AL: Likewise.
> 	* localedata/locales/sv_FI: Likewise.
> 	* localedata/locales/sv_FI@euro: Likewise.
> 	* localedata/locales/sv_SE: Likewise.
> 	* localedata/locales/szl_PL: Likewise.
> 	* localedata/locales/tg_TJ: Likewise.
> 	* localedata/locales/ti_ER: Likewise.
> 	* localedata/locales/tk_TM: Likewise.
> 	* localedata/locales/tl_PH: Likewise.
> 	* localedata/locales/tr_TR: Likewise.
> 	* localedata/locales/tt_RU: Likewise.
> 	* localedata/locales/tt_RU@iqtelif: Likewise.
> 	* localedata/locales/ug_CN: Likewise.
> 	* localedata/locales/uk_UA: Likewise.
> 	* localedata/locales/uz_UZ: Likewise.
> 	* localedata/locales/uz_UZ@cyrillic: Likewise.
> 	* localedata/locales/vi_VN: Likewise.
> 	* localedata/locales/yi_US: Likewise.
> 	* localedata/locales/yo_NG: Likewise.
> ---
>  localedata/Makefile                   |   185 +-
>  localedata/am_ET.UTF-8.in             |   347 +
>  localedata/az_AZ.UTF-8.in             |    73 +
>  localedata/be_BY.UTF-8.in             |    16 +
>  localedata/ber_DZ.UTF-8.in            |    50 +
>  localedata/ber_MA.UTF-8.in            |    13 +
>  localedata/bg_BG.UTF-8.in             |    57 +
>  localedata/br_FR.UTF-8.in             |    15 +
>  localedata/cmn_TW.UTF-8.in            | 76319 ++++++++++++++++++++++++++++++++
>  localedata/crh_UA.UTF-8.in            |    50 +
>  localedata/csb_PL.UTF-8.in            |    70 +
>  localedata/cv_RU.UTF-8.in             |    45 +
>  localedata/cy_GB.UTF-8.in             |    72 +
>  localedata/dz_BT.UTF-8.in             |   789 +
>  localedata/eo.UTF-8.in                |    32 +
>  localedata/es_ES.UTF-8.in             |    46 +
>  localedata/fa_IR.UTF-8.in             |    71 +
>  localedata/fi_FI.UTF-8.in             |   140 +
>  localedata/fil_PH.UTF-8.in            |    16 +
>  localedata/fur_IT.UTF-8.in            |    12 +
>  localedata/gez_ER.UTF-8@abegede.in    |   365 +
>  localedata/ha_NG.UTF-8.in             |    47 +
>  localedata/ig_NG.UTF-8.in             |    93 +
>  localedata/ik_CA.UTF-8.in             |    60 +
>  localedata/kk_KZ.UTF-8.in             |    40 +
>  localedata/ku_TR.UTF-8.in             |    52 +
>  localedata/ky_KG.UTF-8.in             |    72 +
>  localedata/ln_CD.UTF-8.in             |    18 +
>  localedata/locales/am_ET              |   551 +-
>  localedata/locales/az_AZ              |   201 +-
>  localedata/locales/be_BY              |    41 +-
>  localedata/locales/ber_DZ             |   173 +-
>  localedata/locales/ber_MA             |    42 +-
>  localedata/locales/bg_BG              |   290 +-
>  localedata/locales/br_FR              |    55 +-
>  localedata/locales/br_FR@euro         |     3 +-
>  localedata/locales/ca_ES              |    16 +-
>  localedata/locales/cns11643_stroke    |     9 +-
>  localedata/locales/crh_UA             |   111 +-
>  localedata/locales/cs_CZ              |    69 +-
>  localedata/locales/csb_PL             |    83 +-
>  localedata/locales/cv_RU              |    75 +-
>  localedata/locales/cy_GB              |   242 +-
>  localedata/locales/da_DK              |   110 +-
>  localedata/locales/dz_BT              |  2484 +-
>  localedata/locales/en_CA              |     8 -
>  localedata/locales/eo                 |    69 +-
>  localedata/locales/es_CU              |     3 +-
>  localedata/locales/es_EC              |     2 +-
>  localedata/locales/es_ES              |    49 +-
>  localedata/locales/es_US              |    56 +-
>  localedata/locales/et_EE              |    31 +-
>  localedata/locales/fa_IR              |   289 +-
>  localedata/locales/fi_FI              |   173 +-
>  localedata/locales/fil_PH             |    57 +-
>  localedata/locales/fur_IT             |    15 +-
>  localedata/locales/gez_ER@abegede     |   409 +-
>  localedata/locales/ha_NG              |   165 +-
>  localedata/locales/hr_HR              |    84 +-
>  localedata/locales/hsb_DE             |    64 +-
>  localedata/locales/hu_HU              |   298 +-
>  localedata/locales/ig_NG              |   453 +-
>  localedata/locales/ik_CA              |   153 +-
>  localedata/locales/is_IS              |    72 +-
>  localedata/locales/iso14651_t1_pinyin |     9 +-
>  localedata/locales/kk_KZ              |   132 +-
>  localedata/locales/ku_TR              |    89 +-
>  localedata/locales/ky_KG              |    63 +-
>  localedata/locales/ln_CD              |    51 +-
>  localedata/locales/lt_LT              |    52 +-
>  localedata/locales/lv_LV              |    67 +-
>  localedata/locales/mi_NZ              |    43 +-
>  localedata/locales/ml_IN              |   158 +-
>  localedata/locales/mn_MN              |    34 +-
>  localedata/locales/mr_IN              |    76 +-
>  localedata/locales/mt_MT              |   144 +-
>  localedata/locales/nan_TW@latin       |    33 +-
>  localedata/locales/nb_NO              |   114 +-
>  localedata/locales/om_KE              |   124 +-
>  localedata/locales/os_RU              |    14 +-
>  localedata/locales/pl_PL              |    66 +-
>  localedata/locales/ps_AF              |   224 +-
>  localedata/locales/ro_RO              |    99 +-
>  localedata/locales/ru_RU              |    24 +-
>  localedata/locales/ru_UA              |    16 +-
>  localedata/locales/sc_IT              |    15 +-
>  localedata/locales/se_NO              |   298 +-
>  localedata/locales/si_LK              |    42 +
>  localedata/locales/sq_AL              |   291 +-
>  localedata/locales/sv_FI              |     2 +-
>  localedata/locales/sv_FI@euro         |     2 +-
>  localedata/locales/sv_SE              |   109 +-
>  localedata/locales/szl_PL             |    86 +-
>  localedata/locales/tg_TJ              |   106 +-
>  localedata/locales/ti_ER              |     2 +
>  localedata/locales/tk_TM              |   399 +-
>  localedata/locales/tl_PH              |    31 +-
>  localedata/locales/tr_TR              |    47 +-
>  localedata/locales/tt_RU              |   244 +-
>  localedata/locales/tt_RU@iqtelif      |    14 +-
>  localedata/locales/ug_CN              |   198 +-
>  localedata/locales/uk_UA              |   493 +-
>  localedata/locales/uz_UZ              |   127 +-
>  localedata/locales/uz_UZ@cyrillic     |    56 +-
>  localedata/locales/vi_VN              |   242 +-
>  localedata/locales/yi_US              |   125 +-
>  localedata/locales/yo_NG              |   365 +-
>  localedata/lv_LV.UTF-8.in             |     6 +-
>  localedata/mi_NZ.UTF-8.in             |    37 +
>  localedata/ml_IN.UTF-8.in             |    25 +
>  localedata/mn_MN.UTF-8.in             |    15 +
>  localedata/mr_IN.UTF-8.in             |     9 +
>  localedata/mt_MT.UTF-8.in             |    39 +
>  localedata/nan_TW.UTF-8@latin.in      |    11 +
>  localedata/nb_NO.UTF-8.in             |    66 +
>  localedata/om_KE.UTF-8.in             |    36 +
>  localedata/os_RU.UTF-8.in             |     9 +
>  localedata/ps_AF.UTF-8.in             |    61 +
>  localedata/ro_RO.UTF-8.in             |    32 +
>  localedata/ru_RU.UTF-8.in             |    15 +
>  localedata/sc_IT.UTF-8.in             |    12 +
>  localedata/se_NO.UTF-8.in             |   144 +
>  localedata/sq_AL.UTF-8.in             |    82 +
>  localedata/sv_SE.ISO-8859-1.in        |    10 +-
>  localedata/sv_SE.UTF-8.in             |   107 +
>  localedata/szl_PL.UTF-8.in            |    49 +
>  localedata/tg_TJ.UTF-8.in             |   105 +
>  localedata/tk_TM.UTF-8.in             |   213 +
>  localedata/tt_RU.UTF-8.in             |   194 +
>  localedata/tt_RU.UTF-8@iqtelif.in     |    53 +
>  localedata/ug_CN.UTF-8.in             |    16 +
>  localedata/uk_UA.UTF-8.in             |    14 +
>  localedata/uz_UZ.UTF-8.in             |    26 +
>  localedata/vi_VN.UTF-8.in             |    45 +
>  localedata/yi_US.UTF-8.in             |    39 +
>  localedata/yo_NG.UTF-8.in             |    30 +
>  localedata/zh_CN.UTF-8.in             | 25498 +++++++++++
>  137 files changed, 112596 insertions(+), 5498 deletions(-)
>  create mode 100644 localedata/am_ET.UTF-8.in
>  create mode 100644 localedata/az_AZ.UTF-8.in
>  create mode 100644 localedata/be_BY.UTF-8.in
>  create mode 100644 localedata/ber_DZ.UTF-8.in
>  create mode 100644 localedata/ber_MA.UTF-8.in
>  create mode 100644 localedata/bg_BG.UTF-8.in
>  create mode 100644 localedata/br_FR.UTF-8.in
>  create mode 100644 localedata/cmn_TW.UTF-8.in
>  create mode 100644 localedata/crh_UA.UTF-8.in
>  create mode 100644 localedata/csb_PL.UTF-8.in
>  create mode 100644 localedata/cv_RU.UTF-8.in
>  create mode 100644 localedata/cy_GB.UTF-8.in
>  create mode 100644 localedata/dz_BT.UTF-8.in
>  create mode 100644 localedata/eo.UTF-8.in
>  create mode 100644 localedata/es_ES.UTF-8.in
>  create mode 100644 localedata/fa_IR.UTF-8.in
>  create mode 100644 localedata/fi_FI.UTF-8.in
>  create mode 100644 localedata/fil_PH.UTF-8.in
>  create mode 100644 localedata/fur_IT.UTF-8.in
>  create mode 100644 localedata/gez_ER.UTF-8@abegede.in
>  create mode 100644 localedata/ha_NG.UTF-8.in
>  create mode 100644 localedata/ig_NG.UTF-8.in
>  create mode 100644 localedata/ik_CA.UTF-8.in
>  create mode 100644 localedata/kk_KZ.UTF-8.in
>  create mode 100644 localedata/ku_TR.UTF-8.in
>  create mode 100644 localedata/ky_KG.UTF-8.in
>  create mode 100644 localedata/ln_CD.UTF-8.in
>  create mode 100644 localedata/mi_NZ.UTF-8.in
>  create mode 100644 localedata/ml_IN.UTF-8.in
>  create mode 100644 localedata/mn_MN.UTF-8.in
>  create mode 100644 localedata/mr_IN.UTF-8.in
>  create mode 100644 localedata/mt_MT.UTF-8.in
>  create mode 100644 localedata/nan_TW.UTF-8@latin.in
>  create mode 100644 localedata/nb_NO.UTF-8.in
>  create mode 100644 localedata/om_KE.UTF-8.in
>  create mode 100644 localedata/os_RU.UTF-8.in
>  create mode 100644 localedata/ps_AF.UTF-8.in
>  create mode 100644 localedata/ro_RO.UTF-8.in
>  create mode 100644 localedata/ru_RU.UTF-8.in
>  create mode 100644 localedata/sc_IT.UTF-8.in
>  create mode 100644 localedata/se_NO.UTF-8.in
>  create mode 100644 localedata/sq_AL.UTF-8.in
>  create mode 100644 localedata/sv_SE.UTF-8.in
>  create mode 100644 localedata/szl_PL.UTF-8.in
>  create mode 100644 localedata/tg_TJ.UTF-8.in
>  create mode 100644 localedata/tk_TM.UTF-8.in
>  create mode 100644 localedata/tt_RU.UTF-8.in
>  create mode 100644 localedata/tt_RU.UTF-8@iqtelif.in
>  create mode 100644 localedata/ug_CN.UTF-8.in
>  create mode 100644 localedata/uz_UZ.UTF-8.in
>  create mode 100644 localedata/vi_VN.UTF-8.in
>  create mode 100644 localedata/yi_US.UTF-8.in
>  create mode 100644 localedata/yo_NG.UTF-8.in
>  create mode 100644 localedata/zh_CN.UTF-8.in


-- 
Cheers,
Carlos.

