This is the mail archive of the libc-alpha@sources.redhat.com mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]

code_set_name of GB2312



In a locale with encoding GB2312, a call to nl_langinfo(CODESET)
currently returns "GB2312-1980". This is bad because
  1. This is not a standardized encoding name,
  2. this is different from the preferred MIME name "GB2312",
  3. even glibc's own iconv does not know about this encoding name.

Remember that these are different:

  - the doublewidth coded character set (does not include ASCII)
    IANA names: GB_2312-80 = iso-ir-58 = chinese = csISO58GB231280
    X11R6 name: GB2312.1980-0

  - the character encoding scheme a.k.a. EUC-CN (includes ASCII)
    IANA names: GB2312 = csGB2312
    MIME name: GB2312

and the charmap stands for the latter. Here is a patch.


2000-09-14  Bruno Haible  <haible@clisp.cons.org>

	* charmaps/GB2312: Change code_set_name to GB2312.

*** localedata/charmaps/GB2312.bak	Tue Sep  5 15:25:23 2000
--- localedata/charmaps/GB2312	Thu Sep 14 22:19:26 2000
***************
*** 1,9 ****
! <code_set_name> GB2312-1980
  <mb_cur_max> 2
  <mb_cur_min> 1
  <comment_char> %
  <escape_char> /
! % Chinese charmap for GB2312-1980
  % version: 1.0
  % Contact: ha_shao
  % Email: hashao@china.com
--- 1,9 ----
! <code_set_name> GB2312
  <mb_cur_max> 2
  <mb_cur_min> 1
  <comment_char> %
  <escape_char> /
! % Chinese charmap for EUC-CN = GB2312 = union of ASCII and GB_2312-80
  % version: 1.0
  % Contact: ha_shao
  % Email: hashao@china.com

Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]