Re: Bug 130397

emacs-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Bug 130397

From:	David Kastrup
Subject:	Re: Bug 130397
Date:	Sat, 08 Jan 2005 13:47:09 +0100
User-agent:	Gnus/5.11 (Gnus v5.11) Emacs/21.3.50 (gnu/linux)

Geoff Kuenning <address@hidden> writes:

> Ken writes:
>
>> Geoff has a much better understanding of the underlying spell search
>> engine.  Perhaps he can shed additional light on this topic.
>
> I just looked at the code to be sure my memory is correct.  Here's
> the short rundown: in the '-a' interface, ispell interfaces with the
> outside world purely in a byte-indexed mode.  It is perfectly
> capable of handling UTF-8 and similar multi-byte encodings, but when
> it reports the offsets of incorrect words, it does so as a byte
> offset, not a character offset.
>
> Does emacs provide an underlying byte-indexed interface to the
> buffer?  If so, life should be easy: just have ispell.el use that
> interface.

You are wrongly assuming that the buffer is maintained in UTF-8.  It
isn't.  Byte indexing is not going to be fun with regard to
efficiency, unless we get some interface that will, while writing out
a file in UTF-8, store an array of byte/character correspondences for
the UTF-8 (or whatever other) character conversion somewhere.

-- 
David Kastrup, Kriemhildstr. 15, 44793 Bochum

[Prev in Thread]

Current Thread

[Next in Thread]

Re: Bug 130397, (continued)

Prev by Date: Re: Bug 130397
Next by Date: Re: Bug in Info - can use mouse to click links (w32)
Previous by thread: Re: Bug 130397
Next by thread: Re: Bug 130397
Index(es):
- Date
- Thread