I'm running sys-libs/glibc-2.9_p20081201-r2 on Gento Linux. I'm attempting to use iconv to convert sjis/cp932 input into utf8 output. However, some apparently valid input is causing iconv to return EILSEQ. It is worthwhile to note that this content can be displayed by Firefox both on Linux and a Windows Vista test box as well as in IE8. It appears that there is some problem within iconv, but there is no other charset that matches the input (as seen here): http://www.gnu.org/software/libiconv/documentation/libiconv/iconv_open.3.html
Created attachment 4054 [details] test case for failed input Open this file in Firefox and manually set encoding to ShiftJIS. Then run it through iconv and notice that it fails.
It appears that the first 64 indices of row 81 are invalid: http://web.mit.edu/shutkin/MacData_1124b/afs/sipb/project/dia/src/libunicode-0.4/msft/cp932.h but then why does this character still render -- apparently correctly -- elsewhere?
What code positions do you mean? When I read It appears that the first 64 indices of row 81 are invalid: I hope you don't mean 0x81,0x00 to 0x81,0x3f. These are of course invalid. Just look at http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP932.TXT The tables is used to generate the tables.
No reply in more than 3 months. Closing.