This is the mail archive of the
mailing list for the glibc project.
Re: Improved check-localedef script
4.08.2017 11:50 Mike FABIAN <email@example.com> wrote:
> Rafal Luzynski <firstname.lastname@example.org> wrote:
> > They may describe something more complex like <U00E0>. But even this
> > is not UTF-8 because UTF-8 would be <C3> <A0> (UTF-8 is 8-bit). The
> > closest charset would be UCS-2 or simply a generic Unicode.
> My understanding at the moment is that the “% Charset: ...” comment
> indicates the encoding used to write the source file.
> So something like
> “<U20AC>” is definitely ASCII. Non-ASCII stuff in locale source files
> seems to exist only in comments at the moment.
So UTF-8 is probably a good choice, and anything that is not
UTF-8 should be converted to UTF-8.
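
A minimal sketch of that idea, assuming Python 3 and assuming
ISO-8859-1 as the legacy encoding of any non-UTF-8 file (both are
assumptions for illustration, not what the existing script does).
The helper names is_ascii and to_utf8 are hypothetical:

```python
def is_ascii(data: bytes) -> bool:
    """Return True if every byte is in the 7-bit ASCII range."""
    return all(b < 0x80 for b in data)


def to_utf8(data: bytes, fallback: str = "iso-8859-1") -> bytes:
    """Return data unchanged if it is valid UTF-8, else re-encode it.

    The fallback encoding is an assumption; a real script would need
    to know (or guess) the legacy encoding of each file.
    """
    try:
        data.decode("utf-8")
        return data
    except UnicodeDecodeError:
        return data.decode(fallback).encode("utf-8")


# The symbolic name is plain ASCII text in the source file ...
assert is_ascii(b"<U20AC>")
# ... while the character U+00E0 itself is two bytes in UTF-8.
assert "\u00e0".encode("utf-8") == b"\xc3\xa0"
# A Latin-1 encoded comment is converted; a lone 0xE0 is invalid UTF-8.
assert to_utf8(b"% \xe0") == b"% \xc3\xa0"
```

Files that are already ASCII or valid UTF-8 pass through unchanged,
so running the conversion over all locale sources should only touch
the ones with legacy-encoded comments.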