Re: dired-do-find-regexp failure with latin-1 encoding

emacs-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: dired-do-find-regexp failure with latin-1 encoding

From:	Eli Zaretskii
Subject:	Re: dired-do-find-regexp failure with latin-1 encoding
Date:	Sun, 29 Nov 2020 19:12:51 +0200

> Cc: stephen.berman@gmx.net, emacs-devel@gnu.org
> From: Dmitry Gutov <dgutov@yandex.ru>
> Date: Sun, 29 Nov 2020 18:07:38 +0200
> 
> Adding -a or prepending 'LC_ALL=C' changes that:
> $ LC_ALL=C grep "prem" latin1.txt
> premi�re is first
> premie?re is slightly different

Is that � what Grep actually produced?

> > What is not clear to me is whether the _output_ is always in some
> > fixed encoding, like UTF-8.  That doesn't seem to be stated in the
> > docs there.
> 
> Judging by a small experiment, rg's output is in the same encoding as 
> input, for each file.

So in this aspect it is not better than Grep: it is still impractical
to search through files that have different encodings.

> In any case, if one takes the pre-processing route, the end encoding 
> will be UTF-8.

But then the pre-processor will have to guess the encoding (if it is
not the same for all the files), which we know is not simple.

[Prev in Thread]

Current Thread

[Next in Thread]

Re: dired-do-find-regexp failure with latin-1 encoding, (continued)

Prev by Date: Re: dired-do-find-regexp failure with latin-1 encoding
Next by Date: Re: dired-do-find-regexp failure with latin-1 encoding
Previous by thread: Re: dired-do-find-regexp failure with latin-1 encoding
Next by thread: Re: dired-do-find-regexp failure with latin-1 encoding
Index(es):
- Date
- Thread