[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: garbage chars when pasting French chars into emacs
From: |
ken |
Subject: |
Re: garbage chars when pasting French chars into emacs |
Date: |
Thu, 02 Feb 2012 15:00:37 -0500 |
User-agent: |
Thunderbird 2.0.0.24 (X11/20111109) |
On 02/01/2012 10:55 PM Eli Zaretskii wrote:
Date: Wed, 01 Feb 2012 21:39:22 -0500
From: ken <gebser@mousecar.com>
What is your value of selection-coding-system? Try setting it to
something like ctext-with-extensions.
Thanks, Eli,
Immediately prior to doing the copy-and-paste I ran all of these:
(set-language-environment 'UTF-8)
(set-default-coding-systems 'utf-8)
(setq file-name-coding-system 'utf-8)
(setq default-buffer-file-coding-system 'utf-8)
(setq coding-system-for-write 'utf-8)
(set-keyboard-coding-system 'utf-8)
(set-terminal-coding-system 'utf-8)
(set-clipboard-coding-system 'utf-8)
(set-selection-coding-system 'utf-8)
(prefer-coding-system 'utf-8)
(modify-coding-system-alist 'process "\\*shell\\*\\'" 'utf-8-unix)
Not a good idea, I'm afraid: the UTF-8 support in Emacs 21 left a lot
to be desired.
Following your advice, I ran
(set-selection-coding-system 'ctext-with-extensions)
and then did the same copy-and-paste again. This got more of the
characters correct, but not all of them.
Looking again, I see my eyes must have been malfunctioning for the text
pasted into emacs is rendered correctly. However, when I try to save
the text, I'm presented with the minibuffer message "Select coding
system (default iso-2022-jp-2): ". At the same time a second buffer
opens under the first giving the options for coding system:
===================================================================
These default coding systems were tried:
mule-utf-8-unix iso-latin-1
However, none of them safely encodes the target text.
Select one of the following safe coding systems:
iso-2022-jp-2 x-ctext iso-2022-7bit raw-text emacs-mule no-conversion
ctext-no-compositions iso-2022-8bit-ss2 iso-2022-7bit-lock
iso-2022-7bit-ss2 tibetan-iso-8bit-with-esc thai-tis620-with-esc
lao-with-esc korean-iso-8bit-with-esc hebrew-iso-8bit-with-esc
greek-iso-8bit-with-esc iso-latin-9-with-esc iso-latin-8-with-esc
iso-latin-5-with-esc iso-latin-4-with-esc iso-latin-3-with-esc
iso-latin-2-with-esc iso-latin-1-with-esc
in-is13194-devanagari-with-esc cyrillic-iso-8bit-with-esc
chinese-iso-8bit-with-esc japanese-iso-8bit-with-esc
===================================================================
Entering "utf-8" into the minibuffer, of course, doesn't work. Frankly,
I'd like to save into utf-8, this to avoid having problems with this
same file at a later time when I add more text to it.
Can you show the original text, and then what you have after pasting?
I need to see which characters aren't pasted correctly.
The original text must have been edited out somewhere in the thread.
It's at <http://www.wikilivres.info/wiki/Maurice_Merleau-Ponty>, from
"Francais" through the end of the unordered list, ie, the line ending
with "et l’invisible, 1964".
Here it is:
==========================================================
Français
* La Structure du comportement, 1942
* La Phénoménologie de la perception, 1945
* Humanisme et terreur, 1947
* Sens et non-sens, 1948
* Les Sciences de l’homme et la phénoménologie
* Les Relations avec autrui chez l’enfant
* Éloge de la philosophie, leçon inaugurale faite au collège de
France, le jeudi 15 janvier 1953
* Signes, 1960
* L’œil et l’esprit, Gallimard, 1960
* Le visible et l’invisible, 1964
==========================================================
Thanks again for your help.
Re: garbage chars when pasting French chars into emacs, Philipp Haselwarter, 2012/02/01