[PATCH] Update to Unicode 13.0.0 [BZ #25819]

Tom Honermann tom@honermann.net
Thu Apr 16 19:32:59 GMT 2020


Perhaps this update should also modify the following portion of 
include/stdc-predef.h?  Unicode 13 correlates with ISO/IEC 10646:2020, 
sixth edition.  Though, that edition is still in DIS status and hasn't 
been published yet.

/* wchar_t uses Unicode 10.0.0. Version 10.0 of the Unicode Standard is
    synchronized with ISO/IEC 10646:2017, fifth edition, plus
    the following additions from Amendment 1 to the fifth edition:
    - 56 emoji characters
    - 285 hentaigana
    - 3 additional Zanabazar Square characters */
#define __STDC_ISO_10646__              201706L

Tom.

On 4/16/20 7:06 AM, Mike FABIAN via Libc-alpha wrote:
> Bug 25819: Update to Unicode 13.0.0
>
> Unicode 13.0.0 Support: Character encoding, character type info, and
> transliteration tables are all updated to Unicode 13.0.0, using
> the generator scripts contributed by Mike FABIAN (Red Hat).
>
> Total added characters in newly generated CHARMAP: 5930
> Total added characters in newly generated WIDTH: 5536
>
>
> I reviewed the reports generated by the scripts, looks fine to me,
> I could not see anything suspicious.
>
> A bit surprising is that these two characters are single width now:
>
> Total removed characters in newly generated WIDTH: 2
> (Characters not in WIDTH get width 1 by default, i.e. these have width 1 now.)
> removed: <U0001F93B> 2 : eaw=N category=So bidi=ON  name=MODERN PENTATHLON
> removed: <U0001F946> 2 : eaw=N category=So bidi=ON  name=RIFLE
>
> But that is OK because they are not emoji.
>
> I noticed a few new Latin characters with upper and lower case versions in ctype:
>
> tolower: Added 3 characters in new ctype which were not in old ctype
> tolower: Added: Ꟈ → ꟈ 0xa7c7 → 0xa7c8 name unknown → name unknown
> tolower: Added: Ꟊ → ꟊ 0xa7c9 → 0xa7ca name unknown → name unknown
> tolower: Added: Ꟶ → ꟶ 0xa7f5 → 0xa7f6 name unknown → name unknown
>
> Looks fine as well. Lots of other characters were added but nothing
> suspicious there either. So I think the updating scripts worked well.
>



More information about the Libc-alpha mailing list