bug-grep
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#19388: grep 2.21-1 identifies iso encoded text files as binary


From: Paul Eggert
Subject: bug#19388: grep 2.21-1 identifies iso encoded text files as binary
Date: Mon, 15 Dec 2014 23:12:10 -0800
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.3.0

Martin Hoch wrote:
I noticed that grep 2.21-1 regards ISO-8859-15 encoded files as binary, if
LC_ALL is set to en_US.UTF.

I am not sure if this is a bug or an expected behaviour change in 2.21-1

It's an expected change.  Although this was documented in NEWS:

  If a file contains data improperly encoded for the current locale,
  and this is discovered before any of the file's contents are output,
  grep now treats the file as binary.

the grep manual is not so clear about it. I installed the attached patch to try to fix that.

Attachment: 0001-doc-document-binary-data-heuristic-better.patch
Description: Text Data


reply via email to

[Prev in Thread] Current Thread [Next in Thread]