bug-libunistring
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[bug-libunistring] Collation on unassigned code points


From: Aleksander Morgado
Subject: [bug-libunistring] Collation on unassigned code points
Date: Mon, 29 Nov 2010 13:55:34 +0100

Hi Bruno,

According to section 7.1.2 in UTS#10, unassigned code points, as the
ones in private planes, have a implicit weight to be used during
collation:
http://unicode.org/reports/tr10/#Unassigned_And_Other

But my tests with u8_strcoll() show that when having for example U
+10FFFD, the collator function in libunistring will actually ignore that
code point. Attached a test program which orders several strings, which
gives the following output:
 Africa
 america
 Ball
 land
 􏿽 last
 lazy
 zzz

When using the same approach with libicu, it orders in the expected way:
 Africa
 america
 Ball
 land
 lazy
 zzz
 􏿽 last


Any reason for this behavior in libunistring?

Cheers!

-- 
Aleksander

Attachment: libunistring.c
Description: Text Data


reply via email to

[Prev in Thread] Current Thread [Next in Thread]