This is the mail archive of the cygwin-developers mailing list for the Cygwin project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: Console codepage setting via chcp?

From: IWAMURO Motonori <deenheart at gmail dot com>
To: cygwin-developers at cygwin dot com
Date: Tue, 29 Sep 2009 00:50:15 +0900
Subject: Re: Console codepage setting via chcp?
References: <20090924201753.GK30851@calimero.vinschen.de> <20090925093910.GM30851@calimero.vinschen.de> <416096c60909250413o743e2f74g6ef66bdff8006ad@mail.gmail.com> <20090925141042.GN30851@calimero.vinschen.de> <416096c60909250941n135d0bd4gfd7f4d5d90ac308@mail.gmail.com> <20090925172744.GQ30851@calimero.vinschen.de> <416096c60909251142w56af59c4n1896cade0e21600c@mail.gmail.com> <20090926122018.GT30851@calimero.vinschen.de> <3f0ad08d0909270725p670c66c2qae9071d2c6d8e231@mail.gmail.com> <20090927162356.GH30851@calimero.vinschen.de>

2009/9/28 Corinna Vinschen <corinna-cygwin@cygwin.com>:
>> > - System objects will always be *initially* translated using UTF-8. This
>> > Âincludes file names, user names, and initial environment variables.
>> > - By setting the locale environ variables you can switch the charset
>> > Âused to translate filenames on a per-process base.
>> > ÂThis would be only a stop-gap measure, to allow to re-use old archives
>> > Âor scripts. ÂThose should be converted to UTF-8 ASAP. ÂExpect complaints.
>
> Basically, either the above, or just always UTF-8 for filenames
> everywhere, every time. ÂI have a local implementation now which
> behaves according to the above proposal.

My opinion:
I think that mb*/wc*/ctypes functions should accept any 8bit byte data
when use C locales.
In other words, the charset of C.<charset> should affect only
filenames and console I/O.
I uncommonly use LANG=C to treat the content in file/stream as 8bit byte data.

>> > - The "C" locale's charset will be UTF-8.
>> > - There'll be language-neutral "C.<charset>" locales.
>> > - The user's ANSI codepage will remain the default charset for
>> > "language_TERRITORY" locales.
>> > - The console charset will be set according to LC_ALL/LC_CTYPE/LANG
>> > Âat the time the application starts.
>>
>> * Is other issue of existing only the thread "Lone surrogates in UTF-8?"?
>> Â(Does the thread exist in the ML archive page?)
>
> Sorry, I don't understand the question. ÂBut, yes, the thread exists
> in the cygwin-developers mailing list archives.

Excluding above issues, any problem exists?
(Please forget the question in parentheses)
-- 
IWAMURO Motnori <http://vmi.jp/>

Follow-Ups:
- Re: Console codepage setting via chcp?
  - From: Corinna Vinschen

References:
- Re: Console codepage setting via chcp?
  - From: Corinna Vinschen
- Re: Console codepage setting via chcp?
  - From: Corinna Vinschen
- Re: Console codepage setting via chcp?
  - From: Andy Koppe
- Re: Console codepage setting via chcp?
  - From: Corinna Vinschen
- Re: Console codepage setting via chcp?
  - From: Andy Koppe
- Re: Console codepage setting via chcp?
  - From: Corinna Vinschen
- Re: Console codepage setting via chcp?
  - From: Andy Koppe
- Re: Console codepage setting via chcp?
  - From: Corinna Vinschen
- Re: Console codepage setting via chcp?
  - From: IWAMURO Motonori
- Re: Console codepage setting via chcp?
  - From: Corinna Vinschen

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]