|
From: | Paul Eggert |
Subject: | bug#19388: grep 2.21-1 identifies iso encoded text files as binary |
Date: | Mon, 15 Dec 2014 23:12:10 -0800 |
User-agent: | Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.3.0 |
Martin Hoch wrote:
I noticed that grep 2.21-1 regards ISO-8859-15 encoded files as binary, if LC_ALL is set to en_US.UTF. I am not sure if this is a bug or an expected behaviour change in 2.21-1
It's an expected change. Although this was documented in NEWS: If a file contains data improperly encoded for the current locale, and this is discovered before any of the file's contents are output, grep now treats the file as binary.the grep manual is not so clear about it. I installed the attached patch to try to fix that.
0001-doc-document-binary-data-heuristic-better.patch
Description: Text Data
[Prev in Thread] | Current Thread | [Next in Thread] |