Re: undecided vs utf-8

emacs-devel

From:	Eli Zaretskii
Subject:	Re: undecided vs utf-8
Date:	Fri, 05 Nov 2010 09:56:58 +0200

> From: Lars Magne Ingebrigtsen <address@hidden>
> Date: Thu, 04 Nov 2010 23:27:57 +0100
> 
> When using erc, it decodes iso-8859-1 fine with the default `undecided'
> into encoding.  However, any utf-8 strings are, sort of, just translated
> into the same coding system:
> 
> (decode-coding-string "u-te-\303\246ff \303\245tte" 'undecided)
> => "u-te-Ã¦ff Ã¥tte"
> 
> (decode-coding-string "u-te-\303\246ff \303\245tte" 'utf-8)
> => "u-te-æff åtte"

Please show the output of "M-x mule-diag RET" on the machine where
this happens.

> Shouldn't `decode-coding-string' prefer utf-8 if it's actually
> valid?

Depending on the user's locale and preferences, this could easily
backfire, especially if the text is too short to distinguish
between the two encodings.
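
To illustrate the ambiguity, here is a minimal sketch using the string
from the report above.  The helper name `my-decode-both-ways' is
hypothetical, introduced only for this example.  The same bytes are
valid in both encodings, so the bytes alone cannot say which one the
sender meant:

```elisp
;; Hypothetical helper, not part of Emacs: decode the same raw bytes
;; once as Latin-1 and once as UTF-8 and return both results.
(defun my-decode-both-ways (bytes)
  (list
   ;; Every byte value is a valid Latin-1 character, so this decoding
   ;; always "succeeds", whatever the sender intended.
   (decode-coding-string bytes 'iso-8859-1)
   ;; The same bytes also happen to form valid UTF-8 sequences.
   (decode-coding-string bytes 'utf-8)))

;; The octal escapes make this a unibyte string of raw bytes.
(my-decode-both-ways "u-te-\303\246ff \303\245tte")
```

Both results are legitimate text as far as the decoder can tell.
Which one `undecided' picks should depend on the coding-system
priorities in effect, which is why the locale and commands such as
`prefer-coding-system' matter here.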

Using an incorrect decoder in a small fraction of cases is a fact of
life; every program out there hits this from time to time.  What we
need are good defaults, and ways to customize them for specific
situations.  In this case, perhaps erc should use its own defaults, if
UTF-8 is widely (or solely) used there.
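
As an example of such customization, a user whose IRC networks all
use UTF-8 could override the decoding side in their init file.  This
is only a sketch, assuming the installed ERC provides the
`erc-server-coding-system' option with a value of the form
(ENCODING . DECODING):

```elisp
;; Assumption: every server this user connects to sends UTF-8.
;; Encode outgoing text as UTF-8, and decode incoming text as UTF-8
;; instead of letting `undecided' guess.
(setq erc-server-coding-system '(utf-8 . utf-8))
```

The trade-off is the reverse of the one reported: with this setting,
messages that really are Latin-1 would no longer be decoded as such.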
