bug-gnu-utils
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Case insensitivity seems to ignore lower bound of interval


From: Eric Bischoff
Subject: Re: Case insensitivity seems to ignore lower bound of interval
Date: Thu, 28 Apr 2011 07:32:18 +0200
User-agent: KMail/1.13.6 (Linux/2.6.38-8-generic; KDE/4.6.2; x86_64; ; )

Le jeudi 28 avril 2011 06:17:07, Paul Jarc a écrit :
> That example behaves as described in the documentation for some
> locales, but not in others (such as yours, apparently).  That's the
> whole point of that section of the documentation--different locales
> have different behavior for character ranges.


Oh, ok, now I understand, then the problem is that
         [R-Z]
evaluates either as
      RrSsTtUuVvWwXxYyZz
or as
      rRsStTuUvVwWxXyYzZ
 
> Note that case-insensitivity is not an intended feature at all.  It's
> just an accidental result of the character collation of some locales.
> Some locales arrange characters in the order aAbBcC...zZ, so a range
> like [A-Z] includes all upper- and lowercase letters except lowercase
> a.  Other locales may arrange them as AaBbCc...Zz, so [A-Z] excludes
> lowercase z instead.  But the usual expectation, and the actual
> behavior in the C locale, is that [A-Z] includes only uppercase
> letters, and [a-z] includes only lowercase letters.

OK. Thanks for explanation, now I get it.

But there is still something deeply wrong here.

My understanding of the word "collation" is :
   "A", "a",  and "à" are "equivalent"with respect to alphabetical order
It is not :
   "A", "a" and "à" are "next to each other" in alphabetical order.

Current situation is :
         a < à < A < b < B < c < C ...
while it should be :
       a = à = A   <   b = B   <   c = C ...


My two cents.

-- 
Éric Bischoff - Bureau Cornavin
Technical writing and translations
http://www.bureau-cornavin.com
(+33) 3 68 46 00 85
sip:address@hidden



reply via email to

[Prev in Thread] Current Thread [Next in Thread]