Bug 16052 - Provide European ordering rules (EOR / EN 13710) for locales
Summary: Provide European ordering rules (EOR / EN 13710) for locales
Status: NEW
Alias: None
Product: glibc
Classification: Unclassified
Component: localedata (show other bugs)
Version: 2.18
: P2 normal
Target Milestone: ---
Assignee: Mike FABIAN
URL:
Keywords:
Depends on: 14095
Blocks:
  Show dependency treegraph
 
Reported: 2013-10-15 13:36 UTC by Marko Myllynen
Modified: 2018-01-25 10:41 UTC (History)
4 users (show)

See Also:
Host:
Target:
Build:
Last reconfirmed:
fweimer: security-


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Marko Myllynen 2013-10-15 13:36:45 UTC
From http://en.wikipedia.org/wiki/European_ordering_rules:

The European ordering rules (EOR / EN 13710) define an ordering for strings written in languages that use the Latin, Greek and Cyrillic alphabets. The standard covers languages used in Europe (i.e., the European Union, the European Free Trade Association, and parts of the former Soviet Union). It is a tailoring of the Common Tailorable Template of ISO/IEC 14651. EOR can [and in most cases should] in turn be tailored for different (European) languages as the primary language. But in inter-European multilingual contexts, EOR can be used without further tailoring.

In some countries/languages, like Finland/Finnish, the new national recommendation is based on EN 13710. If EN 13710 would be available, the new Finnish sorting standard could be easily implemented by providing the national delta defined in SFS-EN 13710.

Please consider providing EN 13710 ordering rules to allow implementation of new national standards based on it.

Thanks.
Comment 1 joseph@codesourcery.com 2013-10-15 15:20:33 UTC
Note that the ISO 14651 data in glibc is extremely out of date; see bug 
14095.  I don't know whether that would affect this issue - whether it 
would be necessary to update the ISO 14651 data first, understanding 
exactly how it relates to (an old version of) ISO 14651 in the process.  
We certainly need someone willing to spend a substantial amount of time 
understanding collation issues and how we got to the current state.  (To a 
lesser extent, there are such issues for character map / LC_CTYPE 
information - see bug 14094 - both need substantial work to produce 
appropriate automation for easier future updates of this data while 
demonstrating that previous unautomated local changes aren't being 
inappropriately lost in the process.)
Comment 2 Carlos O'Donell 2013-10-15 15:23:48 UTC
(In reply to joseph@codesourcery.com from comment #1)
> Note that the ISO 14651 data in glibc is extremely out of date; see bug 
> 14095.  I don't know whether that would affect this issue - whether it 
> would be necessary to update the ISO 14651 data first, understanding 
> exactly how it relates to (an old version of) ISO 14651 in the process.  
> We certainly need someone willing to spend a substantial amount of time 
> understanding collation issues and how we got to the current state.  (To a 
> lesser extent, there are such issues for character map / LC_CTYPE 
> information - see bug 14094 - both need substantial work to produce 
> appropriate automation for easier future updates of this data while 
> demonstrating that previous unautomated local changes aren't being 
> inappropriately lost in the process.)

Agreed. I think this work should fall on the shoulders of the distribution maintainers. At some point I will be doing some work here, but my first goal is to fix the state of getaddrinfo against the various RFCs.
Comment 3 Marko Myllynen 2013-10-15 15:43:04 UTC
(In reply to joseph@codesourcery.com from comment #1)
> Note that the ISO 14651 data in glibc is extremely out of date; see bug 
> 14095.  I don't know whether that would affect this issue - whether it 
> would be necessary to update the ISO 14651 data first, understanding 
> exactly how it relates to (an old version of) ISO 14651 in the process.  

Thanks for pointing this out. I have a copy of the SFS-EN 13710 standard (which I am, unfortunately, not allowed to distribute publicly) which contains also the EN 13710 text and it states the following:

"
4 Conformance

In order to be conformant to this European Standard an application shall meet the requirements prescribed in ISO/IEC 14651:2007, Clause 6 and its Common Template Table ISO14651_2006_TABLE1 after the application of the EOR delta table specified in Clause 6 of this European Standard. An equivalent description of the resulting tailored table shall equally conform to this European Standard.
"

and then goes on to define the EOR delta:

"
6 EOR Delta Table

NOTE For the syntax of the table please consult ISO/IEC 14651.

%% EOR's EORDeltaTable
%
%% European Ordering Rules.
%
% EOR delta for MES-3 from ISO/IEC 14651:2007's CTT (ISO14651_2006_TABLE1_en.txt).
...
"