Bug 5464 - hp-greek8 not supported
Summary: hp-greek8 not supported
Status: RESOLVED FIXED
Alias: None
Product: glibc
Classification: Unclassified
Component: localedata (show other bugs)
Version: unspecified
: P2 enhancement
Target Milestone: ---
Assignee: GNU C Library Locale Maintainers
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2007-12-09 20:34 UTC by Sean Young
Modified: 2014-07-03 11:52 UTC (History)
1 user (show)

See Also:
Host:
Target:
Build:
Last reconfirmed:
fweimer: security-


Attachments
charmap of hpux greek8 encoding (2.56 KB, text/plain)
2007-12-09 20:35 UTC, Sean Young
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Sean Young 2007-12-09 20:34:46 UTC
HP-UX has a character set greek8 which is not compatible with ISO-8859-7. I
don't know why iconv thinks that greek8 and iso-8869-7 are compatible; possibly
this is referring to another encoding. ICU has the same "issue".

In light of this I would propose the name HP-GREEK8.
Comment 1 Sean Young 2007-12-09 20:35:28 UTC
Created attachment 2130 [details]
charmap of hpux greek8 encoding
Comment 2 Ulrich Drepper 2007-12-10 15:27:12 UTC
Where did you get this information?  Without reference I have no reason to trust
the information.
Comment 3 Sean Young 2007-12-10 18:09:20 UTC
This is generated on an HP-UX machine.

1) 
/* l.c */
#include <stdio.h>

int main(int argc, char *argv[])
{
        int i;

        for (i=32; i<256; i++)
                printf("%02x: %c\n", i, (char)i);

        return 0;
}

2) HP-UX iconv cannot convert greek8 to utf8 directly.

./l | iconv -f gree8 -t iso897 | iconv -f iso897 -t utf8 > greek8-utf8.txt

3) Then converted the output file to the format described. Names are taken from
unicode NamesList.txt. A little python script came in helpful.

4) Also manually compared to greek8.cm (charmap) from HP-UX,
/usr/lib/nls/loc/charmaps/greek8.cm

Note greek8 is an obsolete encoding:

http://docs.hp.com/en/5971-2270/ch03s01.html
http://docs.hp.com/en/5971-2270/ch08s11.html

I've just found a picture of the character set as well:

http://docs.hp.com/en/32650-90207/apa.html
Comment 4 Ulrich Drepper 2007-12-12 08:00:39 UTC
I verified the data myself.  The trunk has the new module.