[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
bug#6252: Emacs does not implement URL (aka "percent") decoding correctl
From: |
Lars Magne Ingebrigtsen |
Subject: |
bug#6252: Emacs does not implement URL (aka "percent") decoding correctly. |
Date: |
Wed, 21 Sep 2011 22:17:52 +0200 |
User-agent: |
Gnus/5.110018 (No Gnus v0.18) Emacs/24.0.50 (gnu/linux) |
José A. Romero L. <escherdragon@gmail.com> writes:
> On May 18, 20:14, Xah Lee <xah...@gmail.com> wrote:
>
>> is there emacs lisp function that decode the url percent encoding?
>> e.g.http://en.wikipedia.org/wiki/Sylvester%E2%80%93Gallai_theorem
>> should become
>> http://en.wikipedia.org/wiki/Sylvester–Gallai_theorem
>> that's a EN DASH (unicode 8211, #o20023, #x2013).
>> I know there's a
>> (require 'gnus-util)
>> gnus-url-unhex-string
>> but that just unhex, and generate gibberish if the url contain unicode
>> chars.
> (...)
>
> Seems that RFC 3986 has not been implemented correctly in Emacs. IMHO
> that is an important hole you have found there. The standard requires
> that all unreserved characters be encoded/decoded as UTF8 bytes. Even
> though the encoding part looks OK (in url-util.el), the decoding does
> not go that last mile to interpret the decoded bytes as UTF-8.
I'm not quite sure I understand what the problem is. Do you have a test
case that illustrates what url.el does wrong?
--
(domestic pets only, the antidote for overdose, milk.)
bloggy blog http://lars.ingebrigtsen.no/
- bug#6252: Emacs does not implement URL (aka "percent") decoding correctly.,
Lars Magne Ingebrigtsen <=