bug-gnulib
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Dealing with character ranges in grep


From: Bruno Haible
Subject: Re: Dealing with character ranges in grep
Date: Thu, 9 Jun 2011 11:58:29 +0200
User-agent: KMail/1.9.9

Paolo,

> > [=e=] to match "e" as well as accented versions like é, è and ê).
> > That is the one feature that you get with glibc, and that you would
> > sacrifice when building --with-included-regex.
> 
> I agree.  It's up to distros to choose, of course.

If you are on the point of sacrificing a glibc feature in many programs,
then IMO you should first talk with the glibc people to see what alternative
they can offer.

> It's strictly about gawk/grep/gnulib; no need to involve 
> glibc from the beginning.

I disagree. It must involve glibc.

It is probably futile to ask Ulrich Drepper to change how [a-z] is interpreted
by default. But what would gnulib need so as to implement our "desired"
behaviour? As far as I understand, you want to keep the interpretation of
[=e=] in the POSIX + glibc way, but change the interpretation of [a-z]?
Then, what do we need from glibc?
  - Do we need a RE_RANGES_IGNORE_LOCALES flag, like Arnold proposed?
  - Do we need an API that allows us to access the collation elements?
    (Or is strcoll and wcscoll sufficient?)

Bruno
-- 
In memoriam Johanna Kirchner <http://en.wikipedia.org/wiki/Johanna_Kirchner>



reply via email to

[Prev in Thread] Current Thread [Next in Thread]