This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
Re: The collation update to the new ISO file made all locales almost twice as big
- From: Zack Weinberg <zackw at panix dot com>
- To: Mike FABIAN <mfabian at redhat dot com>
- Cc: GNU C Library <libc-alpha at sourceware dot org>
- Date: Mon, 5 Mar 2018 09:40:19 -0500
- Subject: Re: The collation update to the new ISO file made all locales almost twice as big
- References: <s9d1sgy8ww9.fsf@taka.site>
On Mon, Mar 5, 2018 at 8:39 AM, Mike FABIAN <mfabian@redhat.com> wrote:
> I added this to the wiki for the 2.28 release notes:
> https://sourceware.org/glibc/wiki/Release/2.28#The_locale-archive_file_is_much_bigger
Thanks for writing this up.
> As LC_COLLATE makes up the bulk of the locale data, the size of
> the locales increased a lot. The locale-archive file which contains the
> data for all locales grew from 126 MiB to 206 MiB.
I wonder if we should spend some time thinking about ways to compact
this data or factor it out. I realize it's not as simple as putting
the compiled form of iso14651_t1_common in its own file that all the
locales refer to, because of the "locale specific rules", but maybe it
could be _almost_ that simple? Alternatively, perhaps some simple
compression could be applied?
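
As a rough way to gauge the compression idea, here is a minimal Python
sketch that measures how much a file shrinks under plain zlib. It is not
glibc code: the helper names compressed_size and read_chunks are made up
for illustration, and zlib is only a stand-in for whatever "simple
compression" might turn out to be suitable.

```python
import zlib


def compressed_size(chunks, level=6):
    """Return (raw_bytes, compressed_bytes) for an iterable of byte chunks.

    The data is streamed through one zlib compressor, so large files
    never need to be held in memory all at once.
    """
    comp = zlib.compressobj(level)
    raw = 0
    packed = 0
    for chunk in chunks:
        raw += len(chunk)
        packed += len(comp.compress(chunk))
    packed += len(comp.flush())  # emit whatever is still buffered
    return raw, packed


def read_chunks(path, size=1 << 20):
    """Yield the contents of the file at `path` in `size`-byte pieces."""
    with open(path, "rb") as f:
        while True:
            chunk = f.read(size)
            if not chunk:
                break
            yield chunk
```

Calling compressed_size(read_chunks(path)) on a locale-archive file
(assumed here to be installed somewhere like
/usr/lib/locale/locale-archive; the location varies by distribution)
gives the raw and compressed byte counts. Note that zlib only finds
repeats within a 32 KiB window, so this says little about how much
could be saved by factoring the shared collation data out of each
locale.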