[Bug-wget] Save 3 byte utf8 url

From: bes
Subject: [Bug-wget] Save 3 byte utf8 url
Date: Thu, 7 Feb 2013 17:06:47 +0300


I found a bug in how wget interprets and saves a URL that contains a
percent-encoded 3-byte UTF-8 sequence.

1. Create a URL containing "—". This is U+2014 (EM DASH). Its percent-encoded
UTF-8 form is %E2%80%94.
2. Fetch it with wget, either as wget "http://example.com/abc—d" or directly
as wget "http://example.com/abc%E2%80%94d".
3. Wget saves this URL to the file "abc\342%80%94d". The expected file name is
"abc%E2%80%94d". This is a bug.

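To make the byte values in the steps above concrete, here is a small Python sketch (not part of wget; the variable names are only illustrative). It assumes that the "\342" in the observed file name is an octal escape for a single raw byte, which is how many tools display non-printable bytes. Under that assumption, the first byte of the sequence was written out raw while the other two stayed percent-encoded.

```python
from urllib.parse import quote

EM_DASH = "\u2014"  # U+2014 EM DASH, the character from step 1

# UTF-8 encodes U+2014 as three bytes.
utf8_bytes = EM_DASH.encode("utf-8")

# Percent-encode every byte (safe="" leaves nothing unescaped).
percent_encoded = quote(EM_DASH, safe="")

# Octal form of the first byte; assumption: "\342" in the report is this byte.
first_byte_octal = format(utf8_bytes[0], "o")

# File name the report expects versus the one it observed.
expected_name = "abc" + percent_encoded + "d"
observed_name = b"abc" + utf8_bytes[:1] + b"%80%94d"

print(utf8_bytes.hex(" "))   # e2 80 94
print(percent_encoded)       # %E2%80%94
print(first_byte_octal)      # 342
print(expected_name)         # abc%E2%80%94d
print(observed_name)         # b'abc\xe2%80%94d'
```

The sketch only shows what the two file names contain. It does not explain why wget decodes the first byte and leaves the rest escaped.
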
$ wget -V
GNU Wget 1.14 built on linux-gnu.

+digest +https -ipv6 -iri +large-file -nls -ntlm +opie +ssl/openssl

/etc/wgetrc (system)
Compile: x86_64-pc-linux-gnu-gcc -DHAVE_CONFIG_H
-DLOCALEDIR="/usr/share/locale" -I. -I../lib -I../lib -DHAVE_LIBSSL
-O2 -march=native -pipe -fomit-frame-pointer
Link: x86_64-pc-linux-gnu-gcc -DHAVE_LIBSSL -O2 -march=native -pipe
-fomit-frame-pointer -Wl,-O1 -Wl,--as-needed -lssl -lcrypto -lpcre
-lrt ftp-opie.o openssl.o ../lib/libgnu.a