default codepage
Thomas Wolff
towo@towo.net
Tue Jun 23 15:01:00 GMT 2009
Corinna Vinschen wrote:
> On Jun 22 16:48, Thomas Wolff wrote:
> > Since the latest locale-related changes, the default codepage after
> > starting cygwin _without_ explicit setting (of a locale variable)
> > seems to have changed from CP1252 ("Windows ANSI") to ISO 8859-1 ("Latin 1").
> > Was this change on purpose?
>
> There was no such change at all. The default codepage is still the
> default ANSI codepage on your system. The internal conversion from
> Windows functions to the POSIX multibyte environment and vice versa
> uses UTF-8, though, so that all existing filenames have a valid
> representation even when using characters not available in your
> current codepage.
If I do the following:
* Open cmd console window.
* Go into cygwin 1.7 directory.
* Call cygwin.bat.
* In cygwin, "cat" a file with all 8 bit characters from U+20 to U+FF.
Then there are no printable characters in the range U+80...U+9F
(the difference between ISO 8859-1 and Windows "Western" CP1252).
If I set LC_CTYPE=en_US.CP1252 before invoking cygwin.bat, I get
full CP1252.
The script calls bash --login. If I start only bash (without --login
and without LC_CTYPE), I get CP1252 as well, which appears somehow
inconsistent to me.
[I'll attach screen shots and the test file to a copy of this mail only
sent to Corinna, as I seem to remember attachments are not desired on
this mailing list.]
Thomas
--
Problem reports: http://cygwin.com/problems.html
FAQ: http://cygwin.com/faq/
Documentation: http://cygwin.com/docs.html
Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple
More information about the Cygwin
mailing list