This is the mail archive of the docbook@lists.oasis-open.org mailing list for the DocBook project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Codepoint attribute for character elements


In the XML Character Entities  draft, I wonder if there shouldn't be 
an option (or perhaps even a requirement) for a codepoint attribute 
on the character element that gives the decimal or hexadecimal value 
of the Unicode character; e.g.:

<doc xmlns:e="http://www.oasis-open.org/docbook/xmlcharent/names";>
<p>This document uses the character names element to access
character entities, such as "<e:char codepoint="0xE9"/>".</p>
</doc>

In many cases this would be easier to author, especially when looking 
up characters in the Unicode book or similar books.

It would also be much easier for software to transform to the proper 
character than a name based lookup mechanism.

Most importantly, this would allow this mechanism to be used for the 
unnamed Hangul syllables and CJK ideographs.
-- 

+-----------------------+------------------------+-------------------+
| Elliotte Rusty Harold | elharo@metalab.unc.edu | Writer/Programmer |
+-----------------------+------------------------+-------------------+
|          The XML Bible, 2nd Edition (Hungry Minds, 2001)           |
|             http://www.cafeconleche.org/books/bible2/              |
|   http://www.amazon.com/exec/obidos/ISBN=0764547607/cafeaulaitA/   |
+----------------------------------+---------------------------------+
|  Read Cafe au Lait for Java News:  http://www.cafeaulait.org/      |
|  Read Cafe con Leche for XML News: http://www.cafeconleche.org/    |
+----------------------------------+---------------------------------+


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]