This is the mail archive of the
xsl-list@mulberrytech.com
mailing list .
SAXON and UTF-8
- To: "XSL List" <xsl-list at lists dot mulberrytech dot com>
- Subject: [xsl] SAXON and UTF-8
- From: "Gustaf Liljegren" <gustaf dot liljegren at xml dot se>
- Date: Thu, 27 Sep 2001 15:09:17 +0200
- Reply-To: xsl-list at lists dot mulberrytech dot com
Newbie observations: I get the following error when feeding SAXON with a XML
document with UTF-8 encoding.
--
E:\test\sampledocs>saxon dataseq.xml sampledoc.xsl > dataseq.fo
Fatal error reported by XML parser: required character (found "?") (expected
"<"
)
URL: file:/E:/test/sampledocs/dataseq.xml
Line: 1
Column: 5
Error
required character (found "?") (expected "<")
Transformation failed
--
The document contains one 'ä' character and is saved in UTF-8 according to
my text editor, EditPlus 2.10. The document has no encoding declaration.
Saving in "plain text" triggers the appropriate error message from SAXON:
E:\test\sampledocs>saxon dataseq.xml sampledoc.xsl > dataseq.fo
Fatal error reported by XML parser: bad continuation of multi-byte UTF-8
sequenc
e (character code: 0x72)
URL: file:/E:/test/sampledocs/dataseq.xml
Line: -1
Column: 1477
Error
bad continuation of multi-byte UTF-8 sequence (character code: 0x72)
Transformation failed
That error message could have been better. It's acctually on line 14, and
it's far from column 1477. :-) It's the XML parser's fault, isn't it?
The third alternative in EditPlus, "Unicode", works in SAXON.
The question is of course why the XML parser in SAXON doesn't seem to like
XML documents in UTF-8.
Regards,
Gustaf Liljegren
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list