Bug 9844 - Wrong collation of ;: should sort after 'q', not 'k'
Summary: Wrong collation of ;: should sort after 'q', not 'k'
Status: RESOLVED FIXED
Alias: None
Product: glibc
Classification: Unclassified
Component: localedata (show other bugs)
Version: 2.8
: P2 normal
Target Milestone: ---
Assignee: GNU C Library Locale Maintainers
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2009-02-14 13:25 UTC by Martin Pitt
Modified: 2014-07-01 20:54 UTC (History)
1 user (show)

See Also:
Host:
Target:
Build:
Last reconfirmed:
fweimer: security-


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Martin Pitt 2009-02-14 13:25:56 UTC
From https://launchpad.net/bugs/325159:

Quoting http://en.wikipedia.org/wiki/ĸ:

> It is used to denote the sound written as [q] in the International Phonetic
Alphabet (the voiceless uvular plosive). For collation purposes, it is therefore
considered to be a type of 'q', rather than a type of 'k', and should sort near 'q'.

At least in fi_FI.UTF-8 and en_US.UTF-8, ĸ appears next to k when sorted:

% for l in j k l p q r ĸ; do echo "$l"; done | sort | xargs
j k ĸ l p q r

Both Unicode[1] and ISO-14651[2] place ĸ after q and before r.

[1]: http://unicode.org/charts/collation/chart_Latin.html
[2]: http://www.iso.org/ittf/ISO14651_2006_TABLE1_En.txt

The locales seem to base the collation on
/usr/share/i18n/locales/iso14651_t1_common, which places ĸ after k. Perhaps the
file is based on an outdated version of the ISO-14651 table and needs to be updated.
Comment 1 Martin Pitt 2009-02-14 13:27:08 UTC
Sorry, bz seems to have broken the wikipedia link. This is a safer one:

  http://en.wikipedia.org/wiki/%C4%B8
Comment 2 Martin Pitt 2009-02-14 13:29:47 UTC
Meh, it broke the character in the entire text as well. Please look at the
wikipedia or launchpad page. It is a small-caps "K".
Comment 3 Ulrich Drepper 2009-03-15 21:12:43 UTC
I moved the entry.