This is the mail archive of the
libc-alpha@sources.redhat.com
mailing list for the glibc project.
code_set_name of GB2312
- To: libc-alpha at sources dot redhat dot com
- Subject: code_set_name of GB2312
- From: Bruno Haible <haible at ilog dot fr>
- Date: Thu, 14 Sep 2000 22:30:38 +0200 (CEST)
In a locale with encoding GB2312, a call to nl_langinfo(CODESET)
currently returns "GB2312-1980". This is bad because
1. This is not a standardized encoding name,
2. this is different from the preferred MIME name "GB2312",
3. even glibc's own iconv does not know about this encoding name.
Remember that these are different:
- the doublewidth coded character set (does not include ASCII)
IANA names: GB_2312-80 = iso-ir-58 = chinese = csISO58GB231280
X11R6 name: GB2312.1980-0
- the character encoding scheme a.k.a. EUC-CN (includes ASCII)
IANA names: GB2312 = csGB2312
MIME name: GB2312
and the charmap stands for the latter. Here is a patch.
2000-09-14 Bruno Haible <haible@clisp.cons.org>
* charmaps/GB2312: Change code_set_name to GB2312.
*** localedata/charmaps/GB2312.bak Tue Sep 5 15:25:23 2000
--- localedata/charmaps/GB2312 Thu Sep 14 22:19:26 2000
***************
*** 1,9 ****
! <code_set_name> GB2312-1980
<mb_cur_max> 2
<mb_cur_min> 1
<comment_char> %
<escape_char> /
! % Chinese charmap for GB2312-1980
% version: 1.0
% Contact: ha_shao
% Email: hashao@china.com
--- 1,9 ----
! <code_set_name> GB2312
<mb_cur_max> 2
<mb_cur_min> 1
<comment_char> %
<escape_char> /
! % Chinese charmap for EUC-CN = GB2312 = union of ASCII and GB_2312-80
% version: 1.0
% Contact: ha_shao
% Email: hashao@china.com