This is the mail archive of the
xsl-list@mulberrytech.com
mailing list .
Re: Upper ASCII chars
- From: David Carlisle <davidc at nag dot co dot uk>
- To: xsl-list at lists dot mulberrytech dot com
- Date: Tue, 5 Feb 2002 18:19:16 GMT
- Subject: Re: [xsl] Upper ASCII chars
- References: <5.1.0.14.0.20020205115241.00a48410@digexch1>
- Reply-to: xsl-list at lists dot mulberrytech dot com
> It's not relevant which codepage was in use when the data was entered,
> as we simply pass the binary value of the character through
You can not do that in XSLT (or in XML, generally) If the byte in the
range 128-255 appears in the input document then it will be mapped to
some unicode number depending on the encoding specified for the input
document (The xslt stylesheet does not know what this encoding is).
So if the input document happened to be latin1 the unicode numbers are
the same in that range, but if it was some other 8bit encoding such as
latin2, then thise input bytes will be reported as higher unicode
character numbers. You can not tell what bytes were in the input
stream, and you can not tell what the original encoding was.
If you select an output encoding that is different to the input
encoding, characters may pass straight through but the bytes will be
changed.
David
_____________________________________________________________________
This message has been checked for all known viruses by Star Internet
delivered through the MessageLabs Virus Scanning Service. For further
information visit http://www.star.net.uk/stats.asp or alternatively call
Star Internet for details on the Virus Scanning Service.
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list