[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [AUCTeX] Update: Text corruption

From: David Kastrup
Subject: Re: [AUCTeX] Update: Text corruption
Date: Tue, 22 Jan 2013 11:20:46 +0100
User-agent: Gnus/5.13 (Gnus v5.13) Emacs/24.3.50 (gnu/linux)

Tassilo Horn <address@hidden> writes:

> "Axel E. Retif" <address@hidden> writes:
> Hi Axel,
> [sorry for the duplicate; previously I've replied to you privately.]
>>>> Curiously enough, sometimes Meld have shown some text corruption,
>>>> but reopening the same diff documents with Meld makes the
>>>> ``corruption'' disappear (and checking with Emacs, TeXworks and Kate
>>>> show that indeed there wasn't any corruption).
>>> Did that happen with the new memory modules, i.e., is the issue still
>>> there, or has it been solved with the new modules?
>> With the new memory modules, but it has been different. I has happened
>> twice exactly:
>> The first, instead of the word ``según'' Meld showed in the new
>> version of the file something like ``segÃn'',
> Hm, assuming UTF-8 and again the bit-2-flip, it would be:
> ú = 11111010

Huh?  That's the Latin-1 interpretation (or the Unicode code point).  In
utf-8, this is rather 0xc3 0xba.

> þ = 11111110
> Maybe Meld prints à in case the font doesn't have a glyph for the
> character to display?

More banal than that, I guess.  Misinterpreting UTF-8 as Latin-1
frequently cranks out à (character code 0xc3 in Latin-1 which is the
lead-in character for Unicode characters 0xc0 to 0xff).

So this was more likely an encoding problem than a flipping bit (unless
a bit flipping _elsewhere_ confused the encoding detection).

David Kastrup

reply via email to

[Prev in Thread] Current Thread [Next in Thread]