This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.
Index Nav: | [Date Index] [Subject Index] [Author Index] [Thread Index] | |
---|---|---|
Message Nav: | [Date Prev] [Date Next] | [Thread Prev] [Thread Next] |
Other format: | [Raw text] |
On 07/25/2018 04:31 PM, Florian Weimer wrote: > On 07/25/2018 10:25 PM, Carlos O'Donell wrote: >> On 07/25/2018 04:18 PM, Florian Weimer wrote: >>> On 07/25/2018 05:54 PM, Carlos O'Donell wrote: >>>> Attaching it as swbz23393v3.tar.gz to avoid spam rejection. >>> >>> Quick comment. The middle line here adds trailing whitespace: >>> >>> - { "[a-z]|[^a-z]", "\xcb\xa2", REG_EXTENDED, 2, >>> + >>> + The U+02DA RING ABOVE is chosen because it's not in [s-㏜]. */ >> >> Thanks. I'll fix this with v4. > > I have verified that localedata/locales/iso14651_t1_common is just a reordering (except for the new comments). > > localedata/locales/tr_TR is more complicated, but looks like an order-only change for me too. > >> I had to fix the following locales: >> >> modified: localedata/locales/ar_SA >> modified: localedata/locales/km_KH >> modified: localedata/locales/lo_LA >> modified: localedata/locales/or_IN >> modified: localedata/locales/sl_SI >> modified: localedata/locales/th_TH > > Do you have the actual locale names handy? localedata/SUPPORTED contains charsets, but I'm not sure if the translation to locale names is completely regular. It is completely regular. In that ar_SA => ar_SA.UTF-8. And so forth. >> They all re-arranged ASCII character collation element ordering like tr_TR, >> and so they needed manual fixing. >> >> Could you please add these locales to your tester? > > I will try. I already have an xtests part, and these probably need to go there as well. v4 - Fixed ar_SA, km_KH, lo_LA, or_IN, sl_SI, th_TH. - Added range checking for a-z, A-Z for all supported UTF-8 locales. All of my testers are clean. So the question is now: Do we commit to rational ranges for a-z, A-Z, 0-9 ... for 2.28. or Do we just do the deinterlacing of iso14651_t1_common to fix en_US.UTF-8? Cheers, Carlos.
Attachment:
swbz23393v4.tar.gz
Description: application/gzip
Index Nav: | [Date Index] [Subject Index] [Author Index] [Thread Index] | |
---|---|---|
Message Nav: | [Date Prev] [Date Next] | [Thread Prev] [Thread Next] |