This is the mail archive of the glibc-bugs@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

[Bug localedata/23774] New: lv_LV collates Y/y incorrectly


https://sourceware.org/bugzilla/show_bug.cgi?id=23774

            Bug ID: 23774
           Summary: lv_LV collates Y/y incorrectly
           Product: glibc
           Version: unspecified
            Status: UNCONFIRMED
          Severity: minor
          Priority: P2
         Component: localedata
          Assignee: unassigned at sourceware dot org
          Reporter: danko at very dot lv
                CC: libc-locales at sourceware dot org, maiku.fabian at gmail dot com
  Target Milestone: ---

Commit 159738548130d5ac4fe6178977e940ed5f8cfdc4 introduced this change in the
lv_LV locale:

-<U0079> <i>;<PCL>;<MIN>;IGNORE % y
-<U0059> <i>;<PCL>;<CAP>;IGNORE % Y
+<U0079> <S0069>;<LOWLINE>;<MIN>;IGNORE % y
+<U0059> <S0069>;<LOWLINE>;<CAP>;IGNORE % Y

I don't know what "PCL" meant and whether "Y" was supposed to be "BASE" in the
first place, but "LOWLINE" certainly looks like a bug.

Letter Y is not present in the Latvian alphabet, however it is present in
Latgalian and is located after I, which is what the CLDR rule seems to suggest:

&I<<y<<<Y

I found this by accident while investigating the result of this command on my
system (with LANG being lv_LV.UTF-8)

$ echo abcxyz | grep -Eo '[a-z]+'
abcx
z

I'm sorry if I misunderstood something as I've never worked with either glibc
or CLDR locales directly before.

-- 
You are receiving this mail because:
You are on the CC list for the bug.

Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]