This is the mail archive of the cygwin mailing list for the Cygwin project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: non-BMP character width

From: Lapo Luchini <lapo at lapo dot it>
To: cygwin at cygwin dot com
Date: Mon, 21 Sep 2009 18:52:55 +0200
Subject: Re: non-BMP character width
Openpgp: id=C8F252FB
References: <200909161148.n8GBm4ha001469@mail.bln1.bf.nsn-intra.net> <20090921163348.GL20981@calimero.vinschen.de>

Corinna Vinschen wrote:
> On Sep 16 13:48, Thomas Wolff wrote:
>> Hi,
>> I see one small remaining glitch with Unicode display; non-BMP characters 
>> (those with Unicode value > 0xFFFF) are displayed as two boxes.
> 
> Can you please create a simple self-contained testcase?  I'm not exactly
> sure how this is supposed to work and if a solution exists.  Is that a
> problem for the non-UTF-8 case, too, or for UTF-8 only?

I guess he meant anything like U+10001, which seems to be assigned to
linear-B charset on the DecodeUnicode database:

ð = http://www.decodeunicode.org/U+10001
UTF-8 as F0 90 80 81

Or this (Iguess that's traditional Chinese?) example taken from en.wiki:
ð = http://www.decodeunicode.org/U+24B62
UTF-8 as F0 A4 AD A2

-- 
Lapo Luchini - http://lapo.it/


--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple

Follow-Ups:
- Re: non-BMP character width
  - From: Corinna Vinschen

References:
- non-BMP character width
  - From: Thomas Wolff
- Re: non-BMP character width
  - From: Corinna Vinschen

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]