This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: [PATCH][BZ 15593] Add transliteration data for "LATIN SMALL LETTER O WITH STROKE" (Ã)


Hi,

On 2013-10-17 14:42, Toke HÃiland-JÃrgensen wrote:
> 
> After applying the patch and rebuilding the locale, proper
> transliteration is achieved:
> 
> $ echo test à test | iconv -f utf8 -t ascii//translit
> test oe test
> 
> The choice of transliteration to the phonetic 'oe' is primarily based on
> being a native of Denmark (it's how my name is spelled on plane tickets
> and in my passport...), and on comments from comments from
> keld@keldix.com (see bugzilla comment thread), but these sources seem to
> agree:
> 
> The Nordic FAQ:
> http://www.faqs.org/faqs/nordic-faq/part1_INTRODUCTION/section-7.html
> 
> +% LATIN CAPITAL LETTER O WITH STROKE -> "OE"
> +<U00D8> "<U004F><U0338>";"<U004F><U0045>"

Having "Ã" -> "OE" transliteration rule seems like a good thing.

Looking at the current tables I see these for à (U00C6) and à (U00D6):

translit_neutral:<U00C6> "<U0041><U0045>"
translit_combining:<U00D6> <U004F>

For à Unicode specifies decomposition (U004F U0308) but not so for Ã/Ã.
So it's a bit unclear to me are all the three rules above as expected?

Also, I wonder how do we cope with these kind of local additions when
resolving bug 14094?

Thanks,

-- 
Marko Myllynen


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]