bug-gnulib
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: bug in join: case comparisons don't work in multibyte locales


From: Pádraig Brady
Subject: Re: bug in join: case comparisons don't work in multibyte locales
Date: Wed, 11 Mar 2009 02:55:09 +0000
User-agent: Thunderbird 2.0.0.6 (X11/20071008)

Pádraig Brady wrote:
> Bruno Haible wrote:
>> Hi Jim,
> 
> Thanks for looking at this Bruno.
> 
>> In coreutils/src/join.c, there is a FIXME mentioning that the -i option for
>> case insensitive comparison of the input lines does not work in multibyte
>> locales.
> 
> Utils that have this issue are:
> join -i, uniq -i, sort -f, ptx -f

Note as well as folding case I think it might
be useful to fold other forms like:
  Diacritics:  À -> A
  Enclosed:  \u24b6 -> A
  Stylistic: \uff21-> A

I.E. have more general function like:
ulc_coll(fold={Case|Diactritics|Stylistic}, ...);

I'm not suggesting it's required or anything,
just something to keep in mind while designing
these interfaces.

cheers,
Pádraig.




reply via email to

[Prev in Thread] Current Thread [Next in Thread]