This is the mail archive of the xsl-list@mulberrytech.com mailing list .


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]

SAXON and UTF-8


Newbie observations: I get the following error when feeding SAXON with a XML
document with UTF-8 encoding.

--
E:\test\sampledocs>saxon dataseq.xml sampledoc.xsl > dataseq.fo
Fatal error reported by XML parser: required character (found "?") (expected
"<"
)
  URL:    file:/E:/test/sampledocs/dataseq.xml
  Line:   1
  Column: 5
Error
  required character (found "?") (expected "<")
Transformation failed
--

The document contains one 'ä' character and is saved in UTF-8 according to
my text editor, EditPlus 2.10. The document has no encoding declaration.
Saving in "plain text" triggers the appropriate error message from SAXON:

E:\test\sampledocs>saxon dataseq.xml sampledoc.xsl > dataseq.fo
Fatal error reported by XML parser: bad continuation of multi-byte UTF-8
sequenc
e (character code: 0x72)
  URL:    file:/E:/test/sampledocs/dataseq.xml
  Line:   -1
  Column: 1477
Error
  bad continuation of multi-byte UTF-8 sequence (character code: 0x72)
Transformation failed

That error message could have been better. It's acctually on line 14, and
it's far from column 1477. :-) It's the XML parser's fault, isn't it?

The third alternative in EditPlus, "Unicode", works in SAXON.

The question is of course why the XML parser in SAXON doesn't seem to like
XML documents in UTF-8.

Regards,

Gustaf Liljegren



 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]