[PATCH v4 0/3] C.UTF-8
Mike Frysinger
vapier@gentoo.org
Wed Aug 18 08:12:43 GMT 2021
On 29 Jul 2021 09:53, Florian Weimer via Libc-alpha wrote:
> * Carlos O'Donell via Libc-alpha:
> > The following changes implement a minimally sized C.UTF-8.
> > First we implement the 'strcmp_collation' directive.
> > Then we implement C.UTF-8 with an LC_COLLATE that uses the
> > 'strcmp_collation' directive to support using strcmp for
> > collation i.e. code point sorting. The final C.UTF-8 is
> > only ~396KiB with the largest ~346KiB in LC_CTYPE for all
> > of Unicode.
> >
> > This v4 fixes the regressions detected in Fedora Rawhide
> > here: https://bugzilla.redhat.com/show_bug.cgi?id=1986421
> > Additional testing coverage is provided for fnmatch, regcomp,
> > and regexec (which would have caught the regression).
>
> From a high-level point of view I wonder if the more conservative choice
> would be to fix the localdef generation for LC_COLLATE, at least for
> this release. It would also mean that we do not break statically linked
> executables.
glibc already (somewhat regularly) breaks statically linked programs due to
nss incompatibilities. unless/until we take that seriously, i'm not sure we
should bother expending effort on these trade-offs. just go with whatever
makes sense long term.
-mike
More information about the Libc-alpha
mailing list