bug-bash
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: locale specific ordering in EN_US vs. characterset collation rules f


From: Linda Walsh
Subject: Re: locale specific ordering in EN_US vs. characterset collation rules for UTF-8
Date: Thu, 27 Jun 2013 22:04:21 -0700
User-agent: Thunderbird



Chet Ramey wrote:
The world is larger than glibc and the glibc locale definitions.  We need
a solution that encompasses all of it.  That solution should, and maybe
will, include glibc, but that is not sufficient by itself.
----
        I don't suppose it is possible to use the Unicode
collation order when using unicode?

algorithm reference: http://www.unicode.org/reports/tr10/tr10-24.html

Collation order chart:
http://www.unicode.org/Public/UCA/latest/allkeys.txt

How does one get UTF-8 collation order?

I would think think that a character specific ordering specified
in LC_COLLATE would take precedence over a less specific regional ordering.

I.e LC_COLLATE="XXX.UTF-8" -- Seems like it should use the UTF-8 rules
over the XXX rules for COLLATION.  If they wanted regional rules,
then "XXX" alone without specifying an international standard like unicode,
would allow regional rules to take precedence.

But if they specify a specific character encoding for the characters,
under collation, why wouldn't the character set's collation order be used?

So how does one get UTF-8's Unicode collation ordering?





reply via email to

[Prev in Thread] Current Thread [Next in Thread]