Re: Automatic (e)tags generation and incremental updates

From: Eli Zaretskii
Subject: Re: Automatic (e)tags generation and incremental updates
Date: Wed, 13 Jan 2021 17:58:16 +0200

> Cc: tom@tromey.com, john@yates-sheets.org, philipk@posteo.net,
>  emacs-devel@gnu.org
> From: Dmitry Gutov <dgutov@yandex.ru>
> Date: Wed, 13 Jan 2021 17:52:16 +0200
> > Almost all the identifiers are ASCII, right?  So maybe optimize 99.9%
> > of use cases by storing such tags tables in a unibyte buffer, read
> > with insert-file-contents-literally?
> All right, and that option is probably handled well enough already by 
> the user choosing (l) in the prompt when the tags file is very big.

Yes, but my idea was to do that automatically.  After all, the size
threshold beyond which we prompt the user is customizable, so it could
be very large.

> > As for why utf-8-emacs didn't help: I'm not really sure why Stefan
> > thought it would.  I mean, look at the code: it still encodes, just
> > differently.
> My (apparently faulty) intuition was that if utf-8-emacs is the memory 
> representation of buffer text, converting it into that encoding can be 
> faster because it could be done by copying from memory rather than 
> having to do the work of recoding every character.

We don't recode characters when they are valid UTF-8 sequences, but
you forget the raw bytes: they are converted from internal multibyte
representation to single bytes, and that requires walking the buffer
one character at a time.

IOW, utf-8-emacs is the same as utf-8 for this purpose.
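
To make the raw-byte point concrete, here is a rough Python model of the
walk described above.  It is not Emacs code: the function names are made
up for illustration, and it assumes well-formed input.  The one detail
taken from Emacs itself is that a raw byte 0x80..0xFF is kept internally
as a two-byte sequence whose lead byte is 0xC0 or 0xC1.

```python
def raw_byte_to_internal(b: int) -> bytes:
    """Two-byte internal form of a raw byte 0x80..0xFF (lead 0xC0 or 0xC1)."""
    assert 0x80 <= b <= 0xFF
    return bytes([0xC0 | ((b >> 6) & 0x01), 0x80 | (b & 0x3F)])


def char_length(lead: int) -> int:
    """Byte length of the character that starts with LEAD (model only)."""
    if lead < 0x80:
        return 1
    if lead < 0xE0:
        return 2
    if lead < 0xF0:
        return 3
    if lead < 0xF8:
        return 4
    return 5  # the internal representation extends UTF-8 to 5 bytes


def encode_internal(buf: bytes) -> bytes:
    """Model of writing internal multibyte text out as utf-8-emacs.

    Valid UTF-8 sequences are copied unchanged, but every character
    boundary must still be visited so that raw bytes can be folded
    back from two bytes into one.
    """
    out = bytearray()
    i = 0
    while i < len(buf):
        lead = buf[i]
        if lead in (0xC0, 0xC1):
            # Raw byte: rebuild the single byte from the two-byte form.
            out.append(0x80 | ((lead & 0x01) << 6) | (buf[i + 1] & 0x3F))
            i += 2
        else:
            # Ordinary character: copy its bytes as they are.
            n = char_length(lead)
            out += buf[i:i + n]
            i += n
    return bytes(out)
```

Even when the buffer holds nothing but ASCII, the loop cannot know that
in advance, so it cannot be replaced by a plain block copy.  That is why
the encoder pays the per-character cost either way.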
