Bug 13953 - km_KH: locale update
Summary: km_KH: locale update
Status: RESOLVED FIXED
Alias: None
Product: glibc
Classification: Unclassified
Component: localedata
Version: 2.15
Importance: P2 normal
Target Milestone: 2.27
Assignee: David Justin
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2012-04-06 07:31 UTC by Chris Leonard
Modified: 2017-10-30 11:13 UTC

See Also:
Host:
Target:
Build:
Last reconfirmed:
fweimer: security-


Attachments
UTF-8_KhmerSymbol (728 bytes, patch)
2014-12-05 18:12 UTC, David Justin
UFT-8_KHM (126.95 KB, patch)
2014-12-05 18:14 UTC, David Justin

Description Chris Leonard 2012-04-06 07:31:10 UTC
In addition to the errors I can correct with the patch in #13952, there are other sections that appear to carry over incorrect characters from the Lao locale.

Unfortunately, I do not have a fix for these; that will have to come from the Khmer language team. I can only point out the potential errors.

++++++++++++++++++++++

This section of LC_TIME contains characters from the Lao block of Unicode (U+0E80 to U+0EFF, e.g. <U0E9E>) and is unlikely to be correct for Khmer, whose block is U+1780 to U+17FF.

% Era : Buddhist Era
%era     "+:1:-543<U002F>01<U002F>01:+*:<U0E9E><U002E><U0EAA><U002E>:%EC<U0020>%Ey"
%era_d_fmt       "%e<U0020>%b<U0020>%Ey"
%era_t_fmt       "%H<U002E>%M<U002E>%S<U0020><U0E99><U002E>"
%era_d_t_fmt     "<U0EA7><U0EB1><U0E99>%A<U0E97><U0EB5><U0EC8><U0020>%e<U0020>%B<U0020>%EC<U0020>%Ey<U002C><U0020>%H<U002E>%M<U002E>%S<U0020><U0E99><U002E>"

++++++++++++++++++++++

This section of LC_NAME contains characters from the Lao block of Unicode (U+0E80 to U+0EFF, e.g. <U0E97>) and is unlikely to be correct for Khmer, whose block is U+1780 to U+17FF.

name_mr     "<U0E97>."
name_mrs    "<U0E99>."

++++++++++++++++++++++
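
The kind of check described above can be automated. The following is a minimal sketch, not part of the original report or of glibc: it scans locale source text for <UXXXX> symbolic names and flags any that fall in the Lao block (U+0E80 to U+0EFF). The helper name find_lao_codepoints and the sample text are hypothetical; the sample lines are copied from the excerpts quoted in this report.

```python
import re
import unicodedata

# Lao block boundaries as defined by the Unicode standard.
LAO_BLOCK = range(0x0E80, 0x0F00)

# Matches the <UXXXX> symbolic names used in glibc locale sources.
UCS_TAG = re.compile(r"<U([0-9A-Fa-f]{4,8})>")


def find_lao_codepoints(text):
    """Return (line_number, keyword, [code points]) for each line
    that contains at least one <UXXXX> tag in the Lao block.

    Hypothetical helper for illustration only.
    """
    hits = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        codepoints = [int(m.group(1), 16) for m in UCS_TAG.finditer(line)]
        lao = [cp for cp in codepoints if cp in LAO_BLOCK]
        if lao:
            fields = line.split()
            keyword = fields[0] if fields else ""
            hits.append((lineno, keyword, lao))
    return hits


# Sample lines taken from the excerpts quoted in this report.
sample = '''%era_t_fmt       "%H<U002E>%M<U002E>%S<U0020><U0E99><U002E>"
name_mr     "<U0E97>."
name_mrs    "<U0E99>."
'''

for lineno, keyword, lao in find_lao_codepoints(sample):
    names = ", ".join(
        f"U+{cp:04X} {unicodedata.name(chr(cp), 'UNKNOWN')}" for cp in lao
    )
    print(f"line {lineno}: {keyword}: {names}")
```

To check a whole file, the same function could be applied to the contents of localedata/locales/km_KH. Note that it does not distinguish commented-out lines (those starting with %) from active ones, so the era lines above would be reported as well.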
Comment 1 David Justin 2014-12-05 18:12:35 UTC
Created attachment 7999 [details]
UTF-8_KhmerSymbol
Comment 2 David Justin 2014-12-05 18:14:38 UTC
Created attachment 8000 [details]
UFT-8_KHM
Comment 3 Sourceware Commits 2017-10-30 11:10:15 UTC
This is an automated email from the git hooks/post-receive script. It was
generated because a ref change was pushed to the repository containing
the project "GNU C Library master sources".

The branch, master has been updated
       via  b110af8278588c51f29ff00ec5577566b77b9039 (commit)
      from  f301e5334065e93aace667fd4a87bce6fc1dbd13 (commit)

Those revisions listed above that are new to this repository have
not appeared on any other notification email; so we list those
revisions in full, below.

- Log -----------------------------------------------------------------
https://sourceware.org/git/gitweb.cgi?p=glibc.git;h=b110af8278588c51f29ff00ec5577566b77b9039

commit b110af8278588c51f29ff00ec5577566b77b9039
Author: Mike FABIAN <mfabian@redhat.com>
Date:   Mon Oct 30 11:46:05 2017 +0100

    km_KH locale: remove tags which were using Lao characters [BZ #13953]
    
    	[BZ #13953]
    	* localedata/locales/km_KH: Use ASCII as much
    	as possible for better readability of the source and
    	remove useless comments.
    	* localedata/locales/km_KH (LC_TIME): Remove era stuff, it
    	was commented out and apparently wrong anyway because it was
    	using Lao characters. If Buddhist era should be used
    	for km_KH, a native speaker should write the correct format
    	for Khmer.
    	* localedata/locales/km_KH (LC_TIME): Add first_weekday 1
    	(According to CLDR, the first weekday for Cambodia is Sunday).
    	* localedata/locales/km_KH (LC_NAME): Remove name_mr and name_mrs
    	(These were using Lao characters which must be wrong. If we get
    	the correct data from a native speaker, we could add it back, until
    	then it is better not to have name_mr and name_mrs at all than
    	having it wrong).

-----------------------------------------------------------------------

Summary of changes:
 ChangeLog                |   19 +++++++++++
 localedata/locales/km_KH |   78 ++++++++++++---------------------------------
 2 files changed, 40 insertions(+), 57 deletions(-)
Comment 4 Mike FABIAN 2017-10-30 11:13:29 UTC
Fixed in glibc master.