emacs-bug-tracker
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#43225: closed (Grep treats extended Latin characters like whitespace


From: GNU bug Tracking System
Subject: bug#43225: closed (Grep treats extended Latin characters like whitespace)
Date: Wed, 09 Sep 2020 19:46:01 +0000

Your message dated Wed, 9 Sep 2020 12:45:11 -0700
with message-id <87d378cf-2c5b-c0aa-a9c4-1557ecb7c40e@cs.ucla.edu>
and subject line Re: bug#43225: Grep treats extended Latin characters like 
whitespace
has caused the debbugs.gnu.org bug report #43225,
regarding Grep treats extended Latin characters like whitespace
to be marked as done.

(If you believe you have received this mail in error, please contact
help-debbugs@gnu.org.)


-- 
43225: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=43225
GNU Bug Tracking System
Contact help-debbugs@gnu.org with problems
--- Begin Message --- Subject: Grep treats extended Latin characters like whitespace Date: Sat, 5 Sep 2020 14:27:56 +0000
What I did:
```
grep -Riw cone *
'''

Expected result: lines with the word "cone" surrounded by whitespace, ignoring case.

What I got instead:
```
data/po/pt_BR.po:msgstr "Pressione o ícone de pódio para iniciar o tutorial"
'''

Why this is a bug: the word ícone is not the same as cone and should not have been returned in the result set. It appears that grep treats the í character in ícone as whitespace, which affects other extended-Latin characters as well.



--- End Message ---
--- Begin Message --- Subject: Re: bug#43225: Grep treats extended Latin characters like whitespace Date: Wed, 9 Sep 2020 12:45:11 -0700 User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.10.0
On 9/5/20 7:27 AM, Mayo Fark wrote:

grep -Riw cone *
...
data/po/pt_BR.po:msgstr "Pressione o ícone de pódio para iniciar o tutorial"

Thanks for the bug report. This bug is due to an overenthusiastic optimization that I installed in late 2016. I installed the attached patch to fix the bug.

Attachment: 0001-grep-fix-w-bug-in-UTF-8-locales.patch
Description: Text Data


--- End Message ---

reply via email to

[Prev in Thread] Current Thread [Next in Thread]