This is the mail archive of the
mailing list for the Cygwin project.
Re: non-BMP character width
Corinna Vinschen wrote:
> On Sep 16 13:48, Thomas Wolff wrote:
>> I see one small remaining glitch with Unicode display; non-BMP characters
>> (those with Unicode value > 0xFFFF) are displayed as two boxes.
> Can you please create a simple self-contained testcase? I'm not exactly
> sure how this is supposed to work and if a solution exists. Is that a
> problem for the non-UTF-8 case, too, or for UTF-8 only?
I guess he meant anything like U+10001, which seems to be assigned to
linear-B charset on the DecodeUnicode database:
ð = http://www.decodeunicode.org/U+10001
UTF-8 as F0 90 80 81
Or this (Iguess that's traditional Chinese?) example taken from en.wiki:
ð = http://www.decodeunicode.org/U+24B62
UTF-8 as F0 A4 AD A2
Lapo Luchini - http://lapo.it/
Problem reports: http://cygwin.com/problems.html
Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple