This is the mail archive of the glibc-bugs@sourceware.org mailing list for the glibc project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

[Bug libc/1386] iconv incorrectly convert bytes 1A, 1C and 7F for IBM943 and IBM942

From: "grhoten at gmail dot com" <sourceware-bugzilla at sourceware dot org>
To: glibc-bugs at sources dot redhat dot com
Date: 15 Oct 2005 00:05:00 -0000
Subject: [Bug libc/1386] iconv incorrectly convert bytes 1A, 1C and 7F for IBM943 and IBM942
References: <20050928034819.1386.grhoten@gmail.com>
Reply-to: sourceware-bugzilla at sourceware dot org

------- Additional Comments From grhoten at gmail dot com  2005-10-15 00:05 -------
(In reply to comment #1)

You're right. iconv doesn't have ibm-942. I meant ibm-932. Sorry about that.

The ibm-* tables from ICU's charset repository are generated directly from IBM's
CDRA. I'm sure that the ibm943 iconv module was also generated from IBM, but
this seems to be a typo in the iconv module.

The main issue is not whether \u007F goes to \x7F or \x1C. Both mapping
behaviors are considered valid in IBM's CDRA. The problem is that those bytes
don't map back to the original Unicode character. You have to round trip convert
your data three times to get your original data back.

-- 


http://sourceware.org/bugzilla/show_bug.cgi?id=1386

------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]