|
From: | GNU bug Tracking System |
Subject: | bug#43225: closed (Grep treats extended Latin characters like whitespace) |
Date: | Wed, 09 Sep 2020 19:46:01 +0000 |
Your message dated Wed, 9 Sep 2020 12:45:11 -0700 with message-id <87d378cf-2c5b-c0aa-a9c4-1557ecb7c40e@cs.ucla.edu> and subject line Re: bug#43225: Grep treats extended Latin characters like whitespace has caused the debbugs.gnu.org bug report #43225, regarding Grep treats extended Latin characters like whitespace to be marked as done. (If you believe you have received this mail in error, please contact help-debbugs@gnu.org.) -- 43225: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=43225 GNU Bug Tracking System Contact help-debbugs@gnu.org with problems
--- Begin Message ---Subject: Grep treats extended Latin characters like whitespace Date: Sat, 5 Sep 2020 14:27:56 +0000
What I did:```grep -Riw cone *'''
Expected result: lines with the word "cone" surrounded by whitespace, ignoring case.
What I got instead:```data/po/pt_BR.po:msgstr "Pressione o ícone de pódio para iniciar o tutorial"'''
Why this is a bug: the word ícone is not the same as cone and should not have been returned in the result set. It appears that grep treats the í character in ícone as whitespace, which affects other extended-Latin characters as well.
--- End Message ---
--- Begin Message ---Subject: Re: bug#43225: Grep treats extended Latin characters like whitespace Date: Wed, 9 Sep 2020 12:45:11 -0700 User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.10.0 On 9/5/20 7:27 AM, Mayo Fark wrote:grep -Riw cone * ... data/po/pt_BR.po:msgstr "Pressione o ícone de pódio para iniciar o tutorial"Thanks for the bug report. This bug is due to an overenthusiastic optimization that I installed in late 2016. I installed the attached patch to fix the bug.0001-grep-fix-w-bug-in-UTF-8-locales.patch
Description: Text Data
--- End Message ---
[Prev in Thread] | Current Thread | [Next in Thread] |