Re: Case insensitivity seems to ignore lower bound of interval
From: Aharon Robbins
Subject: Re: Case insensitivity seems to ignore lower bound of interval
Date: Fri, 29 Apr 2011 10:57:12 +0300
User-agent: Heirloom mailx 12.4 7/29/08

Hi.
> Date: Wed, 27 Apr 2011 16:40:21 -0400
> From: John Cowan <address@hidden>
> To: Aharon Robbins <address@hidden>
> Cc: address@hidden, address@hidden, address@hidden
> Subject: Re: Case insensitivity seems to ignore lower bound of interval
>
> Aharon Robbins scripsit:
>
> > I do agree that the behavior is surprising, disconcerting, undesirable,
> > and so on. For this reason, the upcoming version of gawk translates
> > ranges of the form [d-h] into '[defgh]' before compiling the regular
> > expression.
>
> Alas, that means that in a locale where e-acute sorts after e, the regex
> [d-h] will not match it. You can't have everything at once, but it
> would be good to have a switch to turn this behavior on and off.
I have been fighting this issue for at least 10 years (I think). To turn
the range translation off, just use --posix. Gawk already has too many
command-line options; I don't want to add another one.
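
To make the trade-off concrete, here is a rough Python sketch of the kind of
rewrite described above: a range such as [d-h] is replaced by the explicit
list [defgh] before the pattern is compiled. This is only an illustration,
not gawk's actual code. The function names are hypothetical, and the sketch
assumes simple ASCII letter or digit endpoints expanded in code point order.

```python
import re

# Hypothetical helpers for illustration only; gawk's real implementation differs.
_RANGE = re.compile(r"([A-Za-z0-9])-([A-Za-z0-9])")
_BRACKET = re.compile(r"\[([^\[\]]+)\]")


def expand_range(lo: str, hi: str) -> str:
    """Return every character from lo to hi in code point order."""
    if ord(lo) > ord(hi):
        raise ValueError(f"invalid range {lo}-{hi}")
    return "".join(chr(c) for c in range(ord(lo), ord(hi) + 1))


def expand_bracket_ranges(pattern: str) -> str:
    """Rewrite simple x-y ranges inside [...] as explicit character lists."""
    def fix_bracket(match: re.Match) -> str:
        # Expand each range found inside this one bracket expression.
        body = _RANGE.sub(
            lambda r: expand_range(r.group(1), r.group(2)), match.group(1)
        )
        return "[" + body + "]"

    return _BRACKET.sub(fix_bracket, pattern)


if __name__ == "__main__":
    rewritten = expand_bracket_ranges("[d-h]")
    print(rewritten)
    # The explicit list no longer depends on locale collation, so an
    # accented letter such as e-acute is not matched.
    print(re.fullmatch(rewritten, "\u00e9"))
```

With the explicit list, the set of matching characters is the same in every
locale, which is also why e-acute is excluded, as noted in the quoted message.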
Thanks,
Arnold