bug-grep
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[bug #28275] Ranges like [a-z] incorrectly match in UTF systems


From: Makar
Subject: [bug #28275] Ranges like [a-z] incorrectly match in UTF systems
Date: Mon, 14 Dec 2009 23:10:11 +0000
User-agent: Mozilla/5.0 (X11; U; Linux x86_64; ru-RU; rv:1.9.1.5) Gecko/20091129 Sabayon Firefox/3.5.5

Follow-up Comment #4, bug #28275 (project grep):

Does [a-z] mean any Latin-based symbol? Does [а-я] mean any extended
Cyrillic? Is this behavior in glibc by design? Thanks for the explanation.

Still, in Perl mode bracket expressions work incorrectly with non-ASCII
symbols.

grep -Poha '[бв]' random-symbols.txt 

treats the expression in brackets as 4 1-byte synbols: 'xd0xb1xd0xb2'. It
gives a column of 1-byte symbols xb1 xb2 and xd0.

    _______________________________________________________

Reply to this item at:

  <http://savannah.gnu.org/bugs/?28275>

_______________________________________________
  Message sent via/by Savannah
  http://savannah.gnu.org/





reply via email to

[Prev in Thread] Current Thread [Next in Thread]