This is the mail archive of the xsl-list@mulberrytech.com mailing list.



RE: SAXON and UTF-8


Michael Kay wrote at 28 Sep 2001 09:25:29 +0100:
 > When I need to check what the XML spec says, I usually turn to Bob
 > DuCharme's book. Unfortunately this means I sometimes miss things that
 > changed in the second edition.

It's always been possible to use U+FEFF with UTF-8 in XML.  It just
wasn't mentioned in the XML Recommendation (and still isn't all that
explicit).

ISO/IEC 10646-1:1993 has "always" supported use of ZERO WIDTH NO-BREAK
SPACE (U+FEFF) as an encoding signature for UTF-8 (where "always"
probably means "since UTF-8 was added to ISO/IEC 10646-1:1993 as
Amendment 2 some time before there was a Unicode 2.0").
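
For anyone who wants to see what the signature looks like on the wire,
here is a minimal Python sketch.  It is only an illustration, not how
SAXON or any particular parser handles it; the function name
strip_utf8_signature is hypothetical.  U+FEFF encoded as UTF-8 is the
three bytes EF BB BF, so checking for the signature amounts to looking
for those bytes at the very start of the entity.

```python
import codecs

# U+FEFF (ZERO WIDTH NO-BREAK SPACE) encoded as UTF-8 is EF BB BF.
UTF8_SIGNATURE = "\ufeff".encode("utf-8")


def strip_utf8_signature(data):
    """Return (payload, had_signature) for a byte string.

    Hypothetical helper for illustration: if the data starts with the
    UTF-8 encoding signature, drop it and report that it was present.
    """
    if data.startswith(UTF8_SIGNATURE):
        return data[len(UTF8_SIGNATURE):], True
    return data, False


if __name__ == "__main__":
    # The standard library exposes the same three bytes as a constant.
    print(UTF8_SIGNATURE == codecs.BOM_UTF8)
    print(strip_utf8_signature(UTF8_SIGNATURE + b"<doc/>"))
    print(strip_utf8_signature(b"<doc/>"))
```

Python's built-in "utf-8-sig" codec does the equivalent when decoding,
so in practice the explicit check is rarely needed there.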

The Unicode side of the Unicode==ISO/IEC 10646 equation was ambivalent
(at best) about U+FEFF as an encoding signature for UTF-8 for quite
a long time after ISO/IEC 10646 blessed the idea, but the signature is
now listed as such in Section 13.6, Specials, of the Unicode Standard,
Version 3.0.

Regards,


Tony Graham
------------------------------------------------------------------------
XML Technology Center - Dublin        mailto:tony.graham@ireland.sun.com
Sun Microsystems Ireland Ltd                       Phone: +353 1 8199708
Hamilton House, East Point Business Park, Dublin 3            x(70)19708

 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list

