[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: converting between charsets

From: Stefan Monnier
Subject: Re: converting between charsets
Date: Mon, 15 May 2006 23:50:50 -0400
User-agent: Gnus/5.11 (Gnus v5.11) Emacs/22.0.50 (gnu/linux)

SM> If you put the cursor on the russian chars before calling
SM> encode-coding-region and hit C-u C-x = what does it say?

>   character: ΠΆ (01212102, 332866, 0x51442)
>     charset: mule-unicode-0100-24ff
>              (Unicode characters of the range U+0100..U+24FF.)
>  code point: 40 66
>      syntax: word
>    category: y:Cyrillic  
> buffer code: 0x9C 0xF4 0xA8 0xC2
>   file code: 0xD0 0xA2 (encoded by coding system utf-8)
>        font: -monotype-courier 
> new-medium-r-normal--13-94-99-99-m-80-iso10646-1

SM> If you put the cursor on the `?' that replaced that char and hit C-u C-x =
SM> what does it say?

>   character: ? (077, 63, 0x3f)
>     charset: ascii (ASCII (ISO646 IRV))
>  code point: 63
>      syntax: punctuation
>    category: a:ASCII   l:Latin  
> buffer code: 0x3F
>   file code: 0x3F (encoded by coding system utf-8)
>        font: -monotype-courier 
> new-medium-r-normal--13-94-99-99-m-80-adobe-standard

Hmm... with my Emacs (a recent CVS checkout), if I do

   M-: (encode-coding-string (string 332866) 'koi8-r) RET

I get "\364" rather than "?".  So either you're running an older Emacs and
the problem has been fixed, or there's something else going that
I don't understand.


reply via email to

[Prev in Thread] Current Thread [Next in Thread]