[Bug localedata/23421] Strange collation rules for A and space with UTF-8 locale when other characters appended

b.cama at kerlink dot fr sourceware-bugzilla@sourceware.org
Tue Jul 17 16:42:00 GMT 2018


https://sourceware.org/bugzilla/show_bug.cgi?id=23421

Benjamin Cama <b.cama at kerlink dot fr> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|RESOLVED                    |UNCONFIRMED
         Resolution|INVALID                     |---

--- Comment #3 from Benjamin Cama <b.cama at kerlink dot fr> ---
Thanks a lot for these explanations! I did not expect that.

My use case is just basic ASCII sorting of space separated values lines using
“sort”, which does not work with “default” locales and is quite a PITA. E.g
with my default UTF-8 locale (fr_FR.UTF-8):

% printf "a b\naa b\n" | sort
aa b
a b
% printf "a b\naa b\n" | LANG=C sort
a b
aa b

This is very weird.

Should I assume that “basic” sorting in Unix should always explicitly state to
use the C locale?

Thanks for your time.

-- 
You are receiving this mail because:
You are on the CC list for the bug.


More information about the Libc-locales mailing list