Re: wikipedia and UTF8 coding (emacs21)

From: Stefan Monnier
Subject: Re: wikipedia and UTF8 coding (emacs21)
Date: Tue, 22 Nov 2005 14:46:52 -0500
> I presume you don't see a difference in 
> Brüder and BrÃ¼der 
This shows that the utf-8 format was not treated as such and instead Emacs
read the file as if it were encoded in latin-1 or binary.

> I did your C-u C-x = which tells me:

> character: Ã (0303, 195, 0xc3)
>     charset: eight-bit-graphic (8-bit graphic char (0xA0..0xFF))

This means "unknown char, thus treated as a byte".
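To see where that Ã comes from, here is a minimal Python sketch (not anything
Emacs runs; the variable names are only for illustration). It assumes the file
holds the UTF-8 bytes for "Brüder" and that they get decoded as latin-1, one
character per byte:

```python
# Illustrative sketch: decode UTF-8 bytes as latin-1 to reproduce the symptom.
word = "Br\u00fcder"                      # "Brüder", with a precomposed ü (U+00FC)
utf8_bytes = word.encode("utf-8")         # ü becomes the two bytes 0xC3 0xBC
misread = utf8_bytes.decode("latin-1")    # each byte turns into its own character

first_bad = misread[2]                    # the character sitting where ü should be
code = ord(first_bad)
print(misread)
print(oct(code), code, hex(code))         # 0o303 195 0xc3
```

The three numbers printed are the same ones C-u C-x = reports above (0303, 195,
0xc3): the first byte of the UTF-8 sequence for ü, shown as a character of its
own.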

If you tell your Emacs to use utf-8 for this file (try C-x RET c utf-8 RET
before C-x C-f, or set file-coding-system-alist so it's done automatically
every time), your char's charset should be either latin-iso8859-1 or

All the above assumes you're not using Emacs in unibyte mode.  Unibyte mode
(which you sadly get if you have a lingering `standard-display-european' in
your .emacs) can't handle UTF-8.


