[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: umlauts (8bit characters) input
From: |
Peter Dyballa |
Subject: |
Re: umlauts (8bit characters) input |
Date: |
Wed, 2 Feb 2005 20:37:54 +0100 |
Am 02.02.2005 um 16:19 schrieb Stefan Monnier:
real umlauts! But in shell they're gone ... a bunch of control codes?
(I
am using tcsh and see
a\314?o\314?u\314?\303?U\314?O\314?A\314?\342?\254
instead of äöü?ÜÖÄÛ, shell has only 'uu-')
Could you try posting in a more widely available charset than
mac-roman?
The last char of "uu-" is meaningless in a shell buffer (it's the
encoding
of the file associated with the buffer, and since shell buffers have no
associated file, ...).
Sorry, this is no mac-roman, it's exactly what I see in xterm or in
Terminal in the shell buffer -- although I should see something
different: äöüßÜÖÄ€. That's the file's name. (Mail.app claims that it
is using a 7bit encoding.)
The X11 client is even worse: copy and paste in this same Emacs does
not
work (a simple ä is converted to a whole book volume of glyphs that
are
hard to describe),
Huh? You mean you can't copy&paste from Emacs to itself correctly?
Or do you mean you can't correctly copy from Emacs to something else?
Or you can't correctly paste from something else to Emacs?
My .emacs file is only 8bit, so I chose ISO Latin-15. In its header it
is explicitly written
;;; -*- mode: Emacs-Lisp; coding: iso-8859-15; -*-
When I copy a string with characters from this set via double-clicking
the selection and paste it either in the same or in the (Unicode)
scratch buffer the simple accented chars become each a series of this
character and mostly \216 -- me and others have seen similiar things in
GNOME under Linux when copying from outside GNU Emacs. Pressing umlauts
etc. on my keyboard enter exactly these umlauts into the buffers. C-x =
explains in hex that they're taken from the usual code positions (for
example Ä is 0x8c4, ä is 0x8e4).
I can send you a snapshot privately ...
and displays 'a<box>o<box>u...Û', and shell uses no mode '--:**...'
as in
Terminal and shows the name as in Terminal.
The <box> just means that it couldn't find a font to display the char.
Go to that char and hit C-u C-x = to see which char it is. Maybe you
just
need to help Emacs find the right font.
There is no char. So there should not be a box. The correct string is
äöüß... and as it's a file name, it's UTF-8 encoded. (Since the
displayed glyphs are the chars stripped off their diaeresis the box
could be ... but it's : "(01211310, 332488, 0x512c8, file ...). A bit
astronomical high. æ and Æ and € are represented by themselves.
But of course, first we need to know which Emacs version you're
running.
If it's Emacs-CVS, please move this discussion to emacs-devel@gnu.org
or
emacs-pretest-bug@gnu.org.
Since the Fink team decided to distribute in the unstable section an
Emacs from CVS ... (but there was not much difference to 21.3.50, it
might be exactly 0 in this case)
--
Greetings
Pete
- Re: umlauts (8bit characters) input, (continued)
- Re: umlauts (8bit characters) input, Peter Dyballa, 2005/02/02
- Message not available
- Re: umlauts (8bit characters) input, David Kastrup, 2005/02/02
- Re: umlauts (8bit characters) input, Peter Dyballa, 2005/02/02
- Message not available
- Re: umlauts (8bit characters) input, David Kastrup, 2005/02/02
- Re: umlauts (8bit characters) input, Ismael Valladolid Torres, 2005/02/02
- Re: umlauts (8bit characters) input, Peter Dyballa, 2005/02/02
- Re: umlauts (8bit characters) input, Peter Dyballa, 2005/02/02
- Message not available
- Re: umlauts (8bit characters) input, Stefan Monnier, 2005/02/02
- Re: umlauts (8bit characters) input,
Peter Dyballa <=
- Message not available
- Re: umlauts (8bit characters) input, Stefan Monnier, 2005/02/02
Re: umlauts (8bit characters) input, Reiner Steib, 2005/02/01
Re: umlauts (8bit characters) input, Peter Dyballa, 2005/02/01
Re: umlauts (8bit characters) input, Hendrik Sattler, 2005/02/01