[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Problem with national characters in XHTML

From: Mathias Dahl
Subject: Re: Problem with national characters in XHTML
Date: Thu, 29 Sep 2005 13:11:49 +0200
User-agent: Gnus/5.11 (Gnus v5.11) Emacs/22.0.50 (windows-nt)

Juanma Barranquero <address@hidden> writes:

> On 9/28/05, LENNART BORGMAN <address@hidden> wrote:
>> I have run into a problem with swedish national characters in an
>> XHTML document. The header of the document is like this:
>>   <?xml version="1.0" encoding="utf-8"?>
>>   <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"
>>             "http://www.w3.org/TR/REC-html40/loose.dtd";>
>>   <html xmlns="http://www.w3.org/1999/xhtml"; xml:lang="en">
>> The swedish character รค looks like \344 in CVS Emacs (2005-09-23).
> Hmm. An XHTML document with encoding="utf-8" should not have
> "swedish national characters" in it, should it? Upon reading the
> file, Emacs will set its coding system to mule-utf-8, so it's no
> surprise than high-bit, non-valid utf8 byte sequences appear as
> \xxx...

I might be wrong here, but doesn't UTF-8 encode all characters in
Latin-1 (ISO 8859-1) exactly as they are *in* Latin-1 encoding?

reply via email to

[Prev in Thread] Current Thread [Next in Thread]