This is the mail archive of the xsl-list@mulberrytech.com mailing list .


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]

RE: SAXON and UTF-8


Seems to me that

http://www.w3.org/TR/2000/REC-xml-20001006#sec-guessing-no-ext-info

is quite clear about it...

(I think this was changed in the second edition)

> -----Original Message-----
> From: owner-xsl-list@lists.mulberrytech.com
> [mailto:owner-xsl-list@lists.mulberrytech.com]On Behalf Of Michael Kay
> Sent: Thursday, September 27, 2001 6:28 PM
> To: xsl-list@lists.mulberrytech.com
> Subject: RE: [xsl] SAXON and UTF-8
>
>
> > Windows Notepad saves UTF8 files with Byte Order Mark, and
> > AFAIK, the XML
> > parser in Saxon (AElfred) doesn't support this (at least it
> > didn't last time I checked).
> >
>
> The question is, can an XML document (or entity) in UTF-8 encoding start
> with a BOM? The fact that Unicode allows it, and the fact that Notepad can
> create it, doesn't make it legal XML.
>
> My reading of the XML spec is that it expects to find BOM only in UTF-16
> files. I can't see any total prohibition of a BOM in a UTF-8 file, but the
> spec certainly seems to assume that they won't occur. If anyone thinks
> otherwise, I'd like to see evidence from the XML specification,
> which is the
> only definitive source.
>
> This is of course totally off-topic for XSLT.
>
> Mike Kay
>
>
>  XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
>


 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]