Bug 3954 - iconv : a new character (U+327E) needs to be added to Korean encodings
Status: RESOLVED FIXED
Alias: None
Product: glibc
Classification: Unclassified
Component: libc
Version: unspecified
Importance: P2 normal
Target Milestone: ---
Assignee: Ulrich Drepper
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2007-02-02 02:07 UTC by Jungshik Shin
Modified: 2018-04-20 14:01 UTC

See Also:
Host:
Target:
Build:
Last reconfirmed:
fweimer: security-


Attachments
Proposed fix (untested) (1.90 KB, patch)
2007-02-03 03:54 UTC, Bruno Haible

Description Jungshik Shin 2007-02-02 02:07:08 UTC
In 2002, a new character (Korean postal code mark) was added to KS X 1001 at row
2 and column 72. In GL, that's 0x22 0x68 while it's 0xA2 0xE8 in GR. That is,
for ISO-2022-KR, it's 0x22 0x68 and for EUC-KR it's 0xA2 0xE8.  

For JOHAB, it's 0xD9 0xE8
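
The GL and GR byte values quoted above follow directly from the row and column numbers. The sketch below is only an illustration of that arithmetic, not code from glibc; the type and function names (`ksx1001_bytes`, `ksx1001_gl`, `ksx1001_gr`, `print_cell`) are hypothetical. It assumes the usual 94x94 layout, in which GL adds 0x20 to the row and column and GR adds 0xA0. The JOHAB value is left out because it uses a different mapping rule.

```c
#include <stdio.h>

/* A two-byte code for one KS X 1001 cell (hypothetical helper type). */
struct ksx1001_bytes {
    unsigned char first;
    unsigned char second;
};

/* GL form, as used by ISO-2022-KR: row and column (1..94) are each
   offset by 0x20. */
static struct ksx1001_bytes ksx1001_gl(int row, int col)
{
    struct ksx1001_bytes b = { (unsigned char)(0x20 + row),
                               (unsigned char)(0x20 + col) };
    return b;
}

/* GR form, as used by EUC-KR: same cell with the high bit set,
   i.e. row and column are each offset by 0xA0. */
static struct ksx1001_bytes ksx1001_gr(int row, int col)
{
    struct ksx1001_bytes b = { (unsigned char)(0xA0 + row),
                               (unsigned char)(0xA0 + col) };
    return b;
}

/* Print both forms for one cell. */
static void print_cell(int row, int col)
{
    struct ksx1001_bytes gl = ksx1001_gl(row, col);
    struct ksx1001_bytes gr = ksx1001_gr(row, col);
    printf("GL 0x%02X 0x%02X, GR 0x%02X 0x%02X\n",
           (unsigned)gl.first, (unsigned)gl.second,
           (unsigned)gr.first, (unsigned)gr.second);
}
```

Calling `print_cell(2, 72)` prints `GL 0x22 0x68, GR 0xA2 0xE8`, matching the values in the report.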
Comment 1 Jungshik Shin 2007-02-02 02:15:11 UTC
It's also 0xA2 0xE8 in CP949 (UHC, Windows-949)
Comment 2 Jungshik Shin 2007-02-02 02:16:22 UTC
(In reply to comment #1)
> It's also 0xA2 0xE8 in CP949 (UHC, Windows-949)

Sorry for the bug spam. As of Windows XP, Microsoft doesn't seem to have added it to
their fonts. I haven't checked their converter yet. Windows Vista ships with a
new set of Korean fonts, so things may be different there.


Comment 3 Bruno Haible 2007-02-03 03:53:49 UTC
The document http://std.dkuug.dk/jtc1/sc2/wg2/docs/N2815.doc 
agrees with what you say. The character has been added in Unicode 4.1.0: 
327E;CIRCLED HANGUL IEUNG U;So;0;ON;<circle> 110B 116E;;;;N;;;;; 
 
EUC-KR and JOHAB are based on KS X 1001, therefore I am in favour of adding 
this character to these converters. CP949 = UHC, however, is nowadays defined 
by Microsoft, and the Windows XP mapping does not contain this character 
(see http://www.haible.de/bruno/charsets/conversion-tables/EUC-KR.html), 
therefore I'm against adding it to the CP949 converter at this time. 
 
Attached is a patch that implements this. It assumes PR3955.diff is applied
first. Untested.
 
 
Comment 4 Bruno Haible 2007-02-03 03:54:46 UTC
Created attachment 1534
Proposed fix (untested)
Comment 5 Ulrich Drepper 2007-02-16 01:25:46 UTC
After fixing one typo I added the patch to the trunk.
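
One way to check whether a given installation carries this mapping is to push the two bytes through iconv(3) and inspect the result. The following is a minimal sketch, not part of the patch; the helper name `two_bytes_to_utf8` is hypothetical, and the charset names accepted by `iconv_open` depend on the installed converters. With the EUC-KR mapping in place, 0xA2 0xE8 should convert to the UTF-8 encoding of U+327E, which is the three bytes 0xE3 0x89 0xBE.

```c
#include <iconv.h>
#include <stddef.h>

/* Convert one two-byte sequence from `charset` to UTF-8.
   Returns the number of UTF-8 bytes written to `out`, or -1 if the
   converter is unavailable or rejects the input (hypothetical helper). */
static int two_bytes_to_utf8(const char *charset,
                             unsigned char b1, unsigned char b2,
                             unsigned char out[8])
{
    iconv_t cd = iconv_open("UTF-8", charset);
    if (cd == (iconv_t)-1)
        return -1;                      /* no such converter installed */

    char in[2] = { (char)b1, (char)b2 };
    char *inp = in;
    size_t inleft = sizeof in;
    char *outp = (char *)out;
    size_t outleft = 8;

    size_t rc = iconv(cd, &inp, &inleft, &outp, &outleft);
    iconv_close(cd);

    if (rc == (size_t)-1 || inleft != 0)
        return -1;                      /* unmapped or incomplete input */
    return (int)(8 - outleft);
}
```

Calling `two_bytes_to_utf8("EUC-KR", 0xA2, 0xE8, out)` on a build with the fix is expected to return 3; the same call with "CP949" can be used to see whether that converter accepts the sequence, which per comment 3 it was not meant to at the time.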
Comment 6 Sourceware Commits 2007-07-12 14:48:33 UTC
Subject: Bug 3954

CVSROOT:	/cvs/glibc
Module name:	libc
Branch: 	glibc-2_5-branch
Changes by:	jakub@sourceware.org	2007-07-12 14:48:20

Modified files:
	.              : ChangeLog 
	iconvdata      : johab.c ksc5601.c ksc5601.h uhc.c 
	localedata     : ChangeLog 
	localedata/charmaps: EUC-KR JOHAB 

Log message:
	2007-02-02  Bruno Haible  <bruno@clisp.org>
	
	[BZ #3954]
	* iconvdata/ksc5601.c (__ksc5601_sym_to_ucs, __ksc5601_sym_from_ucs):
	Add mapping for U+327E.
	* iconvdata/ksc5601.h (KSC5601_SYMBOL): Increment.
	* iconvdata/johab.c (BODY for FROM_LOOP, BODY for TO_LOOP): Enable
	mapping of 0xD9 0xE8.
	* iconvdata/uhc.c (BODY for FROM_LOOP, BODY for TO_LOOP): Disable
	mapping of U+327E.
	Reported by Jungshik Shin <jungshik@google.com>.
	
	[BZ #3955]
	* iconvdata/johab.c (BODY for FROM_LOOP, BODY for TO_LOOP): Enable
	mapping of 0xD9 0xE6 and of 0xD9 0xE7.
	Reported by Jungshik Shin <jungshik@google.com>.
	
	2007-02-02  Bruno Haible  <bruno@clisp.org>
	
	* charmaps/EUC-KR: Add mapping for 0xA2 0xE8.
	* charmaps/JOHAB: Add mapping for 0xD9 0xE8.
	Reported by Jungshik Shin <jungshik@google.com>.
	
	* charmaps/JOHAB: Add mappings for 0xD9 0xE6 and 0xD9 0xE7.
	Reported by Jungshik Shin <jungshik@google.com>.

Patches:
http://sourceware.org/cgi-bin/cvsweb.cgi/libc/ChangeLog.diff?cvsroot=glibc&only_with_tag=glibc-2_5-branch&r1=1.10362.2.38&r2=1.10362.2.39
http://sourceware.org/cgi-bin/cvsweb.cgi/libc/iconvdata/johab.c.diff?cvsroot=glibc&only_with_tag=glibc-2_5-branch&r1=1.28&r2=1.28.8.1
http://sourceware.org/cgi-bin/cvsweb.cgi/libc/iconvdata/ksc5601.c.diff?cvsroot=glibc&only_with_tag=glibc-2_5-branch&r1=1.6&r2=1.6.10.1
http://sourceware.org/cgi-bin/cvsweb.cgi/libc/iconvdata/ksc5601.h.diff?cvsroot=glibc&only_with_tag=glibc-2_5-branch&r1=1.14&r2=1.14.8.1
http://sourceware.org/cgi-bin/cvsweb.cgi/libc/iconvdata/uhc.c.diff?cvsroot=glibc&only_with_tag=glibc-2_5-branch&r1=1.26&r2=1.26.8.1
http://sourceware.org/cgi-bin/cvsweb.cgi/libc/localedata/ChangeLog.diff?cvsroot=glibc&only_with_tag=glibc-2_5-branch&r1=1.603.2.3&r2=1.603.2.4
http://sourceware.org/cgi-bin/cvsweb.cgi/libc/localedata/charmaps/EUC-KR.diff?cvsroot=glibc&only_with_tag=glibc-2_5-branch&r1=1.10&r2=1.10.8.1
http://sourceware.org/cgi-bin/cvsweb.cgi/libc/localedata/charmaps/JOHAB.diff?cvsroot=glibc&only_with_tag=glibc-2_5-branch&r1=1.4&r2=1.4.8.1