[PATCH] Update to Unicode 13.0.0 [BZ #25819]

Mike FABIAN mfabian@redhat.com
Thu Apr 16 11:06:27 GMT 2020


Bug 25819: Update to Unicode 13.0.0

Unicode 13.0.0 Support: Character encoding, character type info, and
transliteration tables are all updated to Unicode 13.0.0, using
the generator scripts contributed by Mike FABIAN (Red Hat).

Total added characters in newly generated CHARMAP: 5930
Total added characters in newly generated WIDTH: 5536


I reviewed the reports generated by the scripts, looks fine to me,
I could not see anything suspicious.

A bit surprising is that these two characters are single width now:

Total removed characters in newly generated WIDTH: 2
(Characters not in WIDTH get width 1 by default, i.e. these have width 1 now.)
removed: <U0001F93B> 2 : eaw=N category=So bidi=ON  name=MODERN PENTATHLON
removed: <U0001F946> 2 : eaw=N category=So bidi=ON  name=RIFLE

But that is OK because they are not emoji.

I noticed a few new Latin characters with upper and lower case versions in ctype:

tolower: Added 3 characters in new ctype which were not in old ctype
tolower: Added: Ꟈ → ꟈ 0xa7c7 → 0xa7c8 name unknown → name unknown
tolower: Added: Ꟊ → ꟊ 0xa7c9 → 0xa7ca name unknown → name unknown
tolower: Added: Ꟶ → ꟶ 0xa7f5 → 0xa7f6 name unknown → name unknown

Looks fine as well. Lots of other characters were added but nothing
suspicious there either. So I think the updating scripts worked well.

-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0001-Bug-25819-Update-to-Unicode-13.0.0.patch
Type: text/x-patch
Size: 523101 bytes
Desc: not available
URL: <https://sourceware.org/pipermail/libc-alpha/attachments/20200416/f166bd49/attachment-0001.bin>


More information about the Libc-alpha mailing list