localedata/unicode-gen/utf8_gen.py lists the 6th element of JAMO_FINAL_SHORT_NAME as NI, but according to Unicode it should be NJ. Every version I've checked agrees, and Unicode states that character names are immutable.

See https://www.unicode.org/Public/4.1.0/ucd/Jamo.txt
11AC; NJ # HANGUL JONGSEONG NIEUN-CIEUC
or https://www.unicode.org/Public/14.0.0/ucd/Jamo.txt
11AC; NJ # HANGUL JONGSEONG NIEUN-CIEUC

This means that the generated UTF-8 charmap contains entries like:
<UAD8D> /xea/xb6/x8d HANGUL SYLLABLE GWEONI
My Unicode name to codepoint function can't recognize that name, while it does map "HANGUL SYLLABLE GWEONJ" to U+AD8D.
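For reference, here is a minimal sketch of how the expected name can be derived from the Jamo short names in Jamo.txt, using the standard Hangul syllable decomposition arithmetic. The identifiers below (`JAMO_INITIAL`, `hangul_syllable_name`, `NAME_TO_CODEPOINT`, and so on) are made up for this illustration and are not the ones used in utf8_gen.py.

```python
# Jamo short names as listed in Unicode's Jamo.txt (initial, medial, final).
JAMO_INITIAL = ["G", "GG", "N", "D", "DD", "R", "M", "B", "BB", "S",
                "SS", "", "J", "JJ", "C", "K", "T", "P", "H"]
JAMO_MEDIAL = ["A", "AE", "YA", "YAE", "EO", "E", "YEO", "YE", "O", "WA",
               "WAE", "OE", "YO", "U", "WEO", "WE", "WI", "YU", "EU", "YI",
               "I"]
# Index 0 means "no final consonant"; index 5 (the 6th element) is NJ.
JAMO_FINAL = ["", "G", "GG", "GS", "N", "NJ", "NH", "D", "L", "LG",
              "LM", "LB", "LS", "LT", "LP", "LH", "M", "B", "BS", "S",
              "SS", "NG", "J", "C", "K", "T", "P", "H"]

S_BASE = 0xAC00                                   # first precomposed syllable
T_COUNT = len(JAMO_FINAL)                         # 28
N_COUNT = len(JAMO_MEDIAL) * T_COUNT              # 588 syllables per initial
S_COUNT = len(JAMO_INITIAL) * N_COUNT             # 11172 syllables in total


def hangul_syllable_name(codepoint):
    """Return the Unicode name of a precomposed Hangul syllable."""
    s_index = codepoint - S_BASE
    if not 0 <= s_index < S_COUNT:
        raise ValueError("not a precomposed Hangul syllable: U+%04X" % codepoint)
    initial = JAMO_INITIAL[s_index // N_COUNT]
    medial = JAMO_MEDIAL[(s_index % N_COUNT) // T_COUNT]
    final = JAMO_FINAL[s_index % T_COUNT]
    return "HANGUL SYLLABLE " + initial + medial + final


# Reverse mapping, as a name-to-codepoint lookup would need it.
NAME_TO_CODEPOINT = {
    hangul_syllable_name(cp): cp for cp in range(S_BASE, S_BASE + S_COUNT)
}

print(hangul_syllable_name(0xAD8D))
print("HANGUL SYLLABLE GWEONI" in NAME_TO_CODEPOINT)
```

For U+AD8D the syllable index is 0x18D = 397, which gives initial index 0 (G), medial index 14 (WEO) and final index 5 (NJ), so the name is HANGUL SYLLABLE GWEONJ, and a lookup table built this way has no entry for the GWEONI spelling.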
The master branch has been updated by Mike Fabian <mfabian@sourceware.org>:

https://sourceware.org/git/gitweb.cgi?p=glibc.git;h=064c708c78cc2a6b5802dce73108fc0c1c6bfc80

commit 064c708c78cc2a6b5802dce73108fc0c1c6bfc80
Author: Mike FABIAN <mfabian@redhat.com>
Date:   Sun Jan 14 11:42:28 2024 +0100

    localedata/unicode-gen/utf8_gen.py: fix Hangul syllable name

    Resolves: BZ # 29506
Fixed in glibc master.