Bug 18384 - iswalnum(0x345) incorrectly return 1
Summary: iswalnum(0x345) incorrectly return 1
Status: NEW
Alias: None
Product: glibc
Classification: Unclassified
Component: localedata (show other bugs)
Version: unspecified
: P2 normal
Target Milestone: ---
Assignee: Not yet assigned to anyone
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2015-05-06 20:27 UTC by Michael Vlasenko
Modified: 2015-06-06 06:34 UTC (History)
3 users (show)

See Also:
Host:
Target:
Build:
Last reconfirmed:
fweimer: security-


Attachments
Screenshot (36.38 KB, image/png)
2015-05-06 20:27 UTC, Michael Vlasenko
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Michael Vlasenko 2015-05-06 20:27:08 UTC
Created attachment 8303 [details]
Screenshot

In Greek traditional text analysis, the iota subscript (U+0345) is often talked about as a "diacritic", see:

"http://en.wikipedia.org/wiki/Iota_subscript"

In the recourse: http://www.unicode.org/ucd/ 
I found two documents:
1. http://www.unicode.org/Public/UCD/latest/charts/CodeCharts.pdf
2. http://www.unicode.org/Public/UCD/latest/ucd/PropList.txt
In the first document in the section: "Combining Diacritical Marks" I found this character in the table. See at the attached screenshot.
In the second document in line 752 we have:
0300..034E    ; Diacritic # Mn  [79] COMBINING GRAVE ACCENT..COMBINING UPWARDS ARROW BELOW

As a result, I think, it is pretty clear that iswalnum(0x345) should not return 1, but rather 0.