Bug 11947

Summary: New locale for Meadow Mari language
Product: glibc Reporter: Slava Kileev <slavakileev>
Component: localedataAssignee: GNU C Library Locale Maintainers <libc-locales>
Status: RESOLVED FIXED    
Severity: normal CC: ajlyon, drepper.fsp, glibc-bugs, mistresssilvara
Priority: P2 Flags: fweimer: security-
Version: unspecified   
Target Milestone: ---   
Host: Target:
Build: Last reconfirmed:

Description Slava Kileev 2010-08-26 07:17:44 UTC
I have created locale file for Meadow Mari language. I suppose, it should be 
added to upstream. The file is here:
http://marlamuter.org/public/linux/mhr_RU
Comment 1 Ulrich Drepper 2010-10-04 02:31:57 UTC
You have to use the ru_RU file for the language-independent categories.  Use 
"copy".
Comment 2 Slava Kileev 2010-10-04 07:50:16 UTC
Subject: Re:  New locale for Meadow Mari language



04.10.10, 06:31, "drepper dot fsp at gmail dot com" <sourceware-bugzilla@sourceware.org>:

> 
>  ------- Additional Comments From drepper dot fsp at gmail dot com  2010-10-04 02:31 -------
>  You have to use the ru_RU file for the language-independent categories.  Use 
>  "copy".
>  

Hello, Ulrich Drepper

Could you please specify what categories are language-independent?
This locale file was created according national standards and cultural well-established traditions. Some differences in mhr_RU from ru_RU could be, these differences in files reflect cultural differences. Please specify exact categories that we could look into the problem.

Regards,
Slava
Comment 3 -EMail Hidden- 2011-04-07 11:18:34 UTC
First, as follows from here

http://ru.wikipedia.org/wiki/%D0%9C%D0%B0%D1%80%D0%B8%D0%B9%D1%81%D0%BA%D0%B0%D1%8F_%D0%BF%D0%B8%D1%81%D1%8C%D0%BC%D0%B5%D0%BD%D0%BD%D0%BE%D1%81%D1%82%D1%8C

Marian language has some extra letters over Russian 33, and therefore they must be described in LC_COLLATE section.

Second, as M. probably doesn't have R. words "да" и "нет", yesexpr and noexpr should be fixed to use letters for M. words.

Finally, LC_MONETARY, LC_NUMERIC, LC_PAPER, LC_TELEPHONE, LC_MEASUREMENT, LC_NAME and LC_ADDRESS, being a federal-level standard, should be copied from ru locale.
Comment 4 Slava Kileev 2011-04-18 09:05:14 UTC
Thanks for comments.

Yes, Marian language has some extra letters, that are absent in Russian. These additional letters are fine sorted in iso-14651_t1 that is used in LC_COLLATE section of the file. Is there a necessity to duplicate appropriate rules from iso-14651 files?

Now, I have changed yesexpr and noexpr to appropriate Mari letters. Thanks for remark.

About LC_MONETARY. mhr_RU - locale is for Russian Federation as a country. So, money should be rubles. Rubles in Mari language would be "Теҥге", all prices in Mari language have Mari spelling rather than Russian. So only the "currency symbol" has Mari spelling, all other fields are the same as in Russian locale.

LC_NUMERIC, LC_PAPER, LC_TELEPHONE, LC_MEASUREMENT, LC_NAME and LC_ADDRESS are the same as in ru_RU.

I have updated the file with locale, it is still available for download from here:
http://marlamuter.org/public/linux/mhr_RU

Is there any other things that should I do to make Mari locale file approved and added to glibc?
Comment 5 Avram Lyon 2011-04-23 19:41:34 UTC
(In reply to comment #4)
> About LC_MONETARY. mhr_RU - locale is for Russian Federation as a country. So,
> money should be rubles. Rubles in Mari language would be "Теҥге", all prices in
> Mari language have Mari spelling rather than Russian. So only the "currency
> symbol" has Mari spelling, all other fields are the same as in Russian locale.

And this holds across many of the non-Russian languages of the Russian Federation, such as the nearby Tatar and Bashkir. The non-Russian names of the ruble had well-documented official status in the Soviet Union (see http://en.wikipedia.org/wiki/Soviet_ruble#Ruble_in_the_Soviet_Union, or any banknote from that period), and Mari is one of many local languages that maintains an native term.
Comment 6 -EMail Hidden- 2011-05-09 14:34:20 UTC
> ...These
> additional letters are fine sorted in iso-14651_t1 that is used in LC_COLLATE
> section of the file. Is there a necessity to duplicate appropriate rules from
> iso-14651 files?

In that file they're not defined as letters but as diacritic variants of base letter, and ҥ sorts as н and г combination. If they are actual letters, and have their own section in dictionary, or not intermingled with ы or о, then unique sort weights should be created and assigned.

> Is there any other things that should I do to make Mari locale file approved
> and added to glibc?

I doubt libc developers care (or know) about Mari at all, so most likely you'll have to wait for a couple of years.
Comment 7 Ulrich Drepper 2011-05-09 23:06:13 UTC
You have to change the state of the bug from WAITING.  WAITING is hiding bugs in queries.

I fixed up the locale and added it.
Comment 8 Slava Kileev 2011-05-10 14:32:18 UTC
Thanks a lot! This locale is very important for Mari people. We will be waiting for a new version of glibc.