[PATCH v9] Locales: Cyrillic -> ASCII transliteration table [BZ #2872]

Marko Myllynen myllynen@redhat.com
Mon Nov 19 19:36:00 GMT 2018


Hi,

On 19/11/2018 11.21, Egor Kobylkin wrote:
> On 19.11.18 08:13, Marko Myllynen wrote:
>> On 17/11/2018 20.34, Egor Kobylkin wrote:
> 
> Your example only covers _tansliteration_ to Latin Diacritics
> iconv -f UTF-8 -t ISO-8859-15//TRANSLIT \
> | iconv -f ISO-8859-15 -t UTF-8
> 
> while BZ #2872 is about _transcription_ to ASCII
> iconv -f UTF-8 -t ASCII//TRANSLIT

AFAICS v9 (unlike v10) supported both of the above cases.

> The glibc wiki explicitly lists this use case (ASCII) as the test
> example https://sourceware.org/glibc/wiki/Locales#Testing_Locales

I wrote that section and I certainly wasn't considering Cyrillic aspects
at that time (IIRC it was written even before Mike did the major update
for transliteration rules at the end of 2015). The context back then was
mostly about handling Latin letters like Ã
, Ä, Ö, Ø, etc.

> So again, you are asking to have ISO 9.1995. System A but the bug is
> about ISO 9.1995. System B (GOST 7.79-2000)

We certainly can decide here what's the best course of action, we do not
have to slavishly follow some old bug report when deciding the direction
for the implementation. But I think I've made my position clear by now
so I'm not going to repeat it anymore.

In any case once your patch lands I'm going to submit a follow-up patch
for fi_FI to make it compliant with the applicable national standard
(SFS 4900) which defines how to do Cyrillic transliteration /
transcription in the context Finnish.

Thanks,

-- 
Marko Myllynen



More information about the Libc-locales mailing list