Re: texi2dvi: locale-dependent error in egrep [A-z]

From: Martin von Gagern
Subject: Re: texi2dvi: locale-dependent error in egrep [A-z]
Date: Wed, 31 Mar 2010 09:54:46 +0200
User-agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv: Gecko/20100322 Thunderbird/3.0.3

On 31.03.2010 09:47, Eli Zaretskii wrote:
> Does it work to use [a-Z] instead?

[a-Z] works in de_DE.utf8 but not in C.

> I think this has to do with a
> different collating order under UTF-8, which is aAbB...zZ.

aAbB...zZ would include a valid range a-Z. It seems to be more like
ab...zAB...Z, for reasons I cannot begin to fathom.
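
For comparison, the C-locale ordering can be checked with a small sketch like the one below. It assumes a POSIX shell with sort and paste; c_order is a hypothetical helper name, not anything from texi2dvi. It only shows byte-value order, in which every uppercase letter comes before every lowercase one, so a-Z runs backwards there. It says nothing about how a given UTF-8 locale collates.

```shell
# Hypothetical helper: print the arguments in C-locale (byte value) order,
# joined by single spaces.
c_order() {
  printf '%s\n' "$@" | LC_ALL=C sort | paste -sd ' ' -
}

# In the C locale uppercase letters sort before lowercase ones,
# so 'Z' comes before 'a' and the range a-Z is reversed.
c_order a Z A z
```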

>> Of course I know that [A-z] includes the ASCII characters between Z
>> and a, namely  [\]^_`  which technically aren't allowed as DOS drive
>> letters

In locales other than "C", the new sequence [A-Za-z] might include even
stranger letters:

$ echo Ä | LC_ALL=de_DE.utf8 egrep '[A-Z]'

So if you want to exclude these, you'd have to go for LC_ALL=C. Dunno if
it's worth the trouble, though.
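
A rough sketch of what the LC_ALL=C route could look like, assuming a POSIX shell and a grep that supports -E. The function name is_drive_letter is made up for illustration and is not taken from texi2dvi; the point is only that the locale override is scoped to the one grep call.

```shell
# Hypothetical helper: succeed only if $1 is a single ASCII letter.
# LC_ALL=C applies to this grep invocation only, so the ranges
# A-Z and a-z follow ASCII byte order regardless of the user's locale.
is_drive_letter() {
  printf '%s\n' "$1" | LC_ALL=C grep -E '^[A-Za-z]$' >/dev/null
}

is_drive_letter c && echo "c: match"
is_drive_letter Ä || echo "Ä: no match"
is_drive_letter _ || echo "_: no match"
```

Because the override is set on the command itself, the rest of the script keeps running in the user's locale.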


