This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
Re: [PATCH][BZ 15593] Add transliteration data for "LATIN SMALL LETTER O WITH STROKE" (Ã)
- From: Marko Myllynen <myllynen at redhat dot com>
- To: Toke HÃiland-JÃrgensen <toke at toke dot dk>
- Cc: libc-alpha at sourceware dot org
- Date: Thu, 17 Oct 2013 16:00:08 +0300
- Subject: Re: [PATCH][BZ 15593] Add transliteration data for "LATIN SMALL LETTER O WITH STROKE" (Ã)
- Authentication-results: sourceware.org; auth=none
- References: <87r4bkdsyn dot fsf at toke dot dk>
- Reply-to: myllynen at redhat dot com
Hi,
On 2013-10-17 14:42, Toke HÃiland-JÃrgensen wrote:
>
> After applying the patch and rebuilding the locale, proper
> transliteration is achieved:
>
> $ echo test à test | iconv -f utf8 -t ascii//translit
> test oe test
>
> The choice of transliteration to the phonetic 'oe' is primarily based on
> being a native of Denmark (it's how my name is spelled on plane tickets
> and in my passport...), and on comments from comments from
> keld@keldix.com (see bugzilla comment thread), but these sources seem to
> agree:
>
> The Nordic FAQ:
> http://www.faqs.org/faqs/nordic-faq/part1_INTRODUCTION/section-7.html
>
> +% LATIN CAPITAL LETTER O WITH STROKE -> "OE"
> +<U00D8> "<U004F><U0338>";"<U004F><U0045>"
Having "Ã" -> "OE" transliteration rule seems like a good thing.
Looking at the current tables I see these for à (U00C6) and à (U00D6):
translit_neutral:<U00C6> "<U0041><U0045>"
translit_combining:<U00D6> <U004F>
For à Unicode specifies decomposition (U004F U0308) but not so for Ã/Ã.
So it's a bit unclear to me are all the three rules above as expected?
Also, I wonder how do we cope with these kind of local additions when
resolving bug 14094?
Thanks,
--
Marko Myllynen