On 06/09/2009 09:03 AM B. T. Raven wrote:
ken wrote:
On 06/08/2009 04:43 PM B. T. Raven wrote:
ken wrote:
....
C-x ret C-\ latin-4-postfix
then a,e,i,o,u followed by hyphen generate macroned vowels
....
Fantastic! But... when I save and close the buffer and then open it up
again, in place of the beautiful and correct characters, there are
little boxes.
After you see then correctly in the buffer do:
C-x ret c utf-8
then
C-x C-s
Now next time you load that file it should appear correctly.
ā and ī are not in iso-8859-1 and so you must use a more comprehensive
coding system.
Hmmm... it doesn't. Doing everything just as you say above, I still get
the little boxes in place of the non-English characters.
When after reloading the buffer, I run "describe-coding-system" on this
buffer, I get:
=============================================
Coding system for saving this buffer:
u -- mule-utf-8-unix
Default coding system (for new files):
u -- mule-utf-8 (alias: utf-8)
Coding system for keyboard input:
nil
Coding system for terminal output:
0 -- iso-latin-9 (alias: iso-8859-15 latin-9 latin-0)
Defaults for subprocess I/O:
decoding: u -- mule-utf-8 (alias: utf-8)
encoding: u -- mule-utf-8 (alias: utf-8)
Priority order for recognizing coding systems when reading files:
1. mule-utf-8 (alias: utf-8)
2. iso-latin-1 (alias: iso-8859-1 latin-1)
3. iso-2022-jp (alias: junet)
4. iso-2022-7bit
5. iso-2022-7bit-lock (alias: iso-2022-int-1)
6. iso-2022-8bit-ss2
7. emacs-mule
8. raw-text
9. japanese-shift-jis (alias: shift_jis sjis)
10. chinese-big5 (alias: big5 cn-big5)
11. no-conversion (alias: binary)
Other coding systems cannot be distinguished automatically
from these, and therefore cannot be recognized automatically
with the present coding system priorities.
The followings are decoded correctly but recognized as iso-2022-7bit-lock:
iso-2022-7bit-ss2 iso-2022-7bit-lock-ss2 iso-2022-cn iso-2022-cn-ext
iso-2022-jp-2 iso-2022-kr
....
==================================================================
I don't know... does utf-8 or mule-utf-8 contain latin-4, greek, and/or
German characters? (This file has some of each.)
I tried using ‘C-x C-m c utf-8 RET’ prior to 'C-x C-f filename'... but
no joy. Same no-go with 'C-x C-m c mule-utf-8 RET'.
The fact that these non-English characters display properly in the
buffer initially tells me that I have the requisite fonts installed. So
what little connection is emacs not making (and how do I tell it to make
that connection)?
If you use utf-8 a lot you can put ;; -*- coding: utf-8[;] -*- into the
first line of the file. I don't know whether that sem in brackets is
needed or not.
Sorry, I should have mentioned that I have this (with the semi-colon) at
the top of the file.
Let me also say that, though the little boxes appear in the emacs
buffer, the proper non-English characters appear when the file is loaded
into firefox. (Yeah, this emacs file is an HTML page.)
Thanks, all.