This is the mail archive of the
glibc-bugs@sourceware.org
mailing list for the glibc project.
[Bug libc/1386] iconv incorrectly convert bytes 1A, 1C and 7F for IBM943 and IBM942
- From: "grhoten at gmail dot com" <sourceware-bugzilla at sourceware dot org>
- To: glibc-bugs at sources dot redhat dot com
- Date: 15 Oct 2005 00:05:00 -0000
- Subject: [Bug libc/1386] iconv incorrectly convert bytes 1A, 1C and 7F for IBM943 and IBM942
- References: <20050928034819.1386.grhoten@gmail.com>
- Reply-to: sourceware-bugzilla at sourceware dot org
------- Additional Comments From grhoten at gmail dot com 2005-10-15 00:05 -------
(In reply to comment #1)
You're right. iconv doesn't have ibm-942. I meant ibm-932. Sorry about that.
The ibm-* tables from ICU's charset repository are generated directly from IBM's
CDRA. I'm sure that the ibm943 iconv module was also generated from IBM, but
this seems to be a typo in the iconv module.
The main issue is not whether \u007F goes to \x7F or \x1C. Both mapping
behaviors are considered valid in IBM's CDRA. The problem is that those bytes
don't map back to the original Unicode character. You have to round trip convert
your data three times to get your original data back.
--
http://sourceware.org/bugzilla/show_bug.cgi?id=1386
------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.