This is the mail archive of the cygwin mailing list for the Cygwin project.



Re: LANG=ja_JP.Shift_JIS


2010/5/21 Andy Koppe <andy.koppe@gmail.com>:
>
> So why do you need to set it to ja_JP.Shift_JIS if ja_JP.CP932 and
> ja_JP.SJIS do the same thing?

There is no serious reason.
I just think the IANA name is better known.

> I'm not convinced that adding Shift_JIS to Cygwin as another alias for
> CP932 is a good idea, because they aren't actually the same, e.g.
> iconv treats them as different charsets. The SJIS alias is dodgy
> enough already.

Yes, I know it is a difficult problem.

@centos5.5
$ echo -ne '\x5c ~ \x81\x60'|iconv -f Shift_JIS -t UTF-16LE|hexdump
0000000 00a5 0020 203e 0020 301c
000000a
$ echo -ne '\x5c ~ \x81\x60'|iconv -f SJIS -t UTF-16LE|hexdump
0000000 00a5 0020 203e 0020 301c
000000a
$ echo -ne '\x5c ~ \x81\x60'|iconv -f CP932 -t UTF-16LE|hexdump
0000000 005c 0020 007e 0020 ff5e
000000a
$ echo -ne '\x5c ~ \x81\x60'|iconv -f Windows-31J -t UTF-16LE|hexdump
0000000 005c 0020 007e 0020 ff5e
000000a

@cygwin-1.7
$ echo -ne '\x5c ~ \x81\x60'|iconv -f Shift_JIS -t UTF-16LE|hexdump
0000000 00a5 0020 203e 0020 301c
000000a
$ echo -ne '\x5c ~ \x81\x60'|iconv -f SJIS -t UTF-16LE|hexdump
0000000 00a5 0020 203e 0020 301c
000000a
$ echo -ne '\x5c ~ \x81\x60'|iconv -f CP932 -t UTF-16LE|hexdump
0000000 005c 0020 007e 0020 301c
000000a
$ echo -ne '\x5c ~ \x81\x60'|iconv -f Windows-31J -t UTF-16LE|hexdump
iconv: conversion from Windows-31J unsupported
iconv: try 'iconv -l' to get the list of supported encodings
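
For a cross-check that does not depend on the local iconv build, the same probe bytes can be decoded with Python's built-in codecs. This is only a sketch: the helper names `code_points` and `compare` are made up for illustration, and Python's `shift_jis` and `cp932` tables are a third implementation, so they need not agree with either iconv shown above for every byte. It does show that the two names decode 0x81 0x60 to different characters.

```python
# Probe bytes from the iconv commands above: 0x5C, space, '~', space, 0x81 0x60.
PROBE = b"\x5c ~ \x81\x60"


def code_points(data, encoding):
    """Decode data with the named Python codec and return 'U+XXXX' strings."""
    return ["U+%04X" % ord(ch) for ch in data.decode(encoding)]


def compare(encodings=("shift_jis", "cp932")):
    """Map each codec name to the code points it produces for PROBE."""
    return {name: code_points(PROBE, name) for name in encodings}


if __name__ == "__main__":
    for name, points in compare().items():
        print("%-10s %s" % (name, " ".join(points)))
```

With these codecs the double-byte sequence 0x81 0x60 becomes U+301C (WAVE DASH) under `shift_jis` and U+FF5E (FULLWIDTH TILDE) under `cp932`, the same split as between the Shift_JIS and CP932 rows in the CentOS output.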


