This is the mail archive of the mailing list for the glibc project.

Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: [PATCH 4/4] ja_JP locale: Add entry for the new Japanese era [BZ #22964]

On 4/1/19 6:34 AM, Florian Weimer wrote:
* TAMUKI Shoichi:

Hello Florian-san,

From: Florian Weimer <>
Subject: Re: [PATCH 4/4] ja_JP locale: Add entry for the new Japanese era [BZ #22964]
Date: Mon, 01 Apr 2019 09:04:05 +0200

Based on <>and


the encoding appears to be correct.

In addition to the code of <U4EE4>, note that the code of <UF9A8> is
also present in <REI>.  The latter is a CJK compatibility ideograph
unicode character and is not usually used.

Sorry, I don't understand.  Do you mean that the era name could be
written with <UF9A8> instead of <U4EE4> as the first codepoint?  How
certain are we that <U4EE4> is indeed the official codepoint?

I believe it could be written as <UF9A8>, but that is not a Kanji
character which we can display in shift-jis / euc-jp, while <U4EE4>
is a code point we have mapped to a specific encoding value.

All uses of REI are <U4EE4> on the government site, which is as strong
indicator that this is what we should be using.

Lastly, this CLDR ticket for 35.1:

And these revisions of trunk:
Index: common/uca/FractionalUCA.txt
--- common/uca/FractionalUCA.txt        (revision 14975)
+++ common/uca/FractionalUCA.txt        (revision 14978)
-[UCA version = 12.0.0]
+[UCA version = 12.1.0]
+32FF; [U+4EE4, 31][U+548C, 31] # Zyyy So       [FB40.0020.001C][CEE4.0000.0000][FB40.0020.001C][D48C.0000.0000]        * SQUARE ERA NAME REIWA

Indicate <U4EE4> is what will be used for CLDR and we should match.

No official update from the Unicode standard yet:


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]