[Bug localedata/23421] Strange collation rules for A and space with UTF-8 locale when other characters appended
b.cama at kerlink dot fr
sourceware-bugzilla@sourceware.org
Tue Jul 17 16:42:00 GMT 2018
https://sourceware.org/bugzilla/show_bug.cgi?id=23421
Benjamin Cama <b.cama at kerlink dot fr> changed:
           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|RESOLVED                    |UNCONFIRMED
         Resolution|INVALID                     |---
--- Comment #3 from Benjamin Cama <b.cama at kerlink dot fr> ---
Thanks a lot for these explanations! I did not expect that.
My use case is plain ASCII sorting of lines of space-separated values with
“sort”, which does not behave as I expected under “default” locales and is
quite a PITA. E.g., with my default UTF-8 locale (fr_FR.UTF-8):
% printf "a b\naa b\n" | sort
aa b
a b
% printf "a b\naa b\n" | LANG=C sort
a b
aa b
This is very weird.
Should I assume that “basic” sorting on Unix must always explicitly select
the C locale?
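
For reference, a minimal sketch of forcing byte-order collation for a single
command, assuming a POSIX shell and a “sort” that honors the locale
environment variables. The helper name csort is hypothetical, not a standard
command. It sets LC_ALL rather than LANG because LC_ALL takes precedence over
both LC_COLLATE and LANG, so the override still works when other LC_*
variables are already set in the environment:

```shell
# csort: hypothetical wrapper name, used here for illustration only.
# LC_ALL=C applies to this one invocation of sort and overrides
# LC_COLLATE and LANG, so lines are compared by byte value.
csort() {
    LC_ALL=C sort "$@"
}

# Same input as in the transcript above.
printf 'a b\naa b\n' | csort
```

In the C locale the space (0x20) compares lower than “a” (0x61), so “a b” is
listed before “aa b”, matching the LANG=C run above.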
Thanks for your time.