[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Unibyte characters, strings, and buffers

From: David Kastrup
Subject: Re: Unibyte characters, strings, and buffers
Date: Sun, 30 Mar 2014 17:05:19 +0200
User-agent: Gnus/5.13 (Gnus v5.13) Emacs/24.4.50 (gnu/linux)

Andreas Schwab <address@hidden> writes:

> David Kastrup <address@hidden> writes:
>> I don't think it gets much more transparent than "unibyte flag only
>> marks the valid Unicode-in-Emacs character range".  I'm for the range
>> 0..255, Andreas for something like 0..127 U 4194176..4194303 which
>> IĀ find cumbersome for little return.
> Before decoding there is no charset information yet, so using anything
> other than the eight-bit charset would be wrong.

When "right" does not buy you anything but trouble, why bother?

> After decoding, the eight-bit charset is used only for undecodable
> bytes.  That preserves the distinction between encoded and decoded
> strings/buffers (except for the uninteresting trivial ASCII decoding)
> in a world without unibyte flag.

The "uninteresting trivial ASCII" listens to case-fold-search just as
much as the latin-1 code page does.  So being "right" for half of the
coding range does not really buy anything.

David Kastrup

reply via email to

[Prev in Thread] Current Thread [Next in Thread]