bug-grep

[bug #28275] Ranges like [a-z] incorrectly match in UTF systems


From: Makar
Subject: [bug #28275] Ranges like [a-z] incorrectly match in UTF systems
Date: Mon, 14 Dec 2009 20:57:33 +0000
User-agent: Mozilla/5.0 (X11; U; Linux x86_64; ru-RU; rv:1.9.1.5) Gecko/20091129 Sabayon Firefox/3.5.5

Follow-up Comment #2, bug #28275 (project grep):

No. It matches various non-ASCII characters, such as:
ǹûṣőṏŭṋṽẚęčẉįļẹèĕểöâǩǝŏä

For example, run the following on a UTF-8 system:

dd if=/dev/urandom bs=1024000 count=1 | iconv -c -f ucs-2 -t utf-8 > random-symbols.txt

grep -oha '[a-z]' random-symbols.txt > 'random [a-z].txt'

and you'll see what I mean.
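
For a smaller, repeatable comparison than the random file, the sketch below runs the same bracket expression under the C locale and under whatever locale is currently active. The file name sample.txt and the variable names are made up for illustration and are not part of the attachments. It assumes a grep that honours LC_ALL; the C-locale result should contain only ASCII letters, while the current-locale result depends on that locale's collation rules and may differ.

```shell
# sample.txt is a hypothetical test file: the bytes for "a", "è" (UTF-8), "z".
printf 'a\303\250z\n' > sample.txt

# In the C locale the range [a-z] covers only the ASCII bytes a through z.
ascii_only=$(LC_ALL=C grep -o '[a-z]' sample.txt | tr -d '\n')
echo "C locale:       $ascii_only"

# In the current locale (possibly UTF-8) the match set depends on collation,
# so no particular output is assumed here.
current=$(grep -o '[a-z]' sample.txt | tr -d '\n')
echo "current locale: $current"
```

If the two lines differ, the extra characters are the ones the range picks up only because of the locale.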

(file #19265, file #19266)
    _______________________________________________________

Additional Item Attachment:

File name: random-symbols.txt             Size:169 KB
File name: random [a-z].txt               Size:0 KB


    _______________________________________________________

Reply to this item at:

  <http://savannah.gnu.org/bugs/?28275>

_______________________________________________
  Message sent via/by Savannah
  http://savannah.gnu.org/




