[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: eww doesn't decode %AA%BB%CC URL names

From: Eli Zaretskii
Subject: Re: eww doesn't decode %AA%BB%CC URL names
Date: Thu, 24 Dec 2015 21:03:10 +0200

> From: Yuri Khan <address@hidden>
> Date: Fri, 25 Dec 2015 00:07:40 +0600
> Cc: Eli Zaretskii <address@hidden>, Emacs developers <address@hidden>
> On Thu, Dec 24, 2015 at 11:40 PM, Lars Ingebrigtsen <address@hidden> wrote:
> > (decode-coding-string (url-unhex-string
> > "%D0%A1%D0%B5%D1%80%D0%B4%D1%86%D0%B5")
> > 'utf-8)
> > => "Сердце"
> >
> > Right.  What charset do we choose?  I guess using the charset of the
> > document we're in doesn't make much sense (because it's linking to
> > something off-site which may be in a different charset)...
> By RFC 3986, percent-encoded URLs SHOULD use UTF-8 encoding. If the
> URL does not decode into a valid UTF-8 string, it is ok to fall back
> to a heuristic, though.

Yes, I think this is a good policy, thanks.  Bonus points for
implementing the command in a way that it will be able to accept user
choice of the encoding via "C-x RET c", like file operations do.

reply via email to

[Prev in Thread] Current Thread [Next in Thread]