[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Case insensitivity seems to ignore lower bound of interval
From: |
Eric Bischoff |
Subject: |
Re: Case insensitivity seems to ignore lower bound of interval |
Date: |
Thu, 28 Apr 2011 07:32:18 +0200 |
User-agent: |
KMail/1.13.6 (Linux/2.6.38-8-generic; KDE/4.6.2; x86_64; ; ) |
Le jeudi 28 avril 2011 06:17:07, Paul Jarc a écrit :
> That example behaves as described in the documentation for some
> locales, but not in others (such as yours, apparently). That's the
> whole point of that section of the documentation--different locales
> have different behavior for character ranges.
Oh, ok, now I understand, then the problem is that
[R-Z]
evaluates either as
RrSsTtUuVvWwXxYyZz
or as
rRsStTuUvVwWxXyYzZ
> Note that case-insensitivity is not an intended feature at all. It's
> just an accidental result of the character collation of some locales.
> Some locales arrange characters in the order aAbBcC...zZ, so a range
> like [A-Z] includes all upper- and lowercase letters except lowercase
> a. Other locales may arrange them as AaBbCc...Zz, so [A-Z] excludes
> lowercase z instead. But the usual expectation, and the actual
> behavior in the C locale, is that [A-Z] includes only uppercase
> letters, and [a-z] includes only lowercase letters.
OK. Thanks for explanation, now I get it.
But there is still something deeply wrong here.
My understanding of the word "collation" is :
"A", "a", and "à" are "equivalent"with respect to alphabetical order
It is not :
"A", "a" and "à" are "next to each other" in alphabetical order.
Current situation is :
a < à < A < b < B < c < C ...
while it should be :
a = à = A < b = B < c = C ...
My two cents.
--
Éric Bischoff - Bureau Cornavin
Technical writing and translations
http://www.bureau-cornavin.com
(+33) 3 68 46 00 85
sip:address@hidden