[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Bug-wget] [bug #47701] wget 1.17.1 fails to convert from percent encodi
From: |
Anonymous Coward |
Subject: |
[Bug-wget] [bug #47701] wget 1.17.1 fails to convert from percent encoding to unicode correctly (mingw32) |
Date: |
Thu, 21 Apr 2016 22:17:16 +0000 |
User-agent: |
Mozilla/5.0 (Windows NT 10.0) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/49.0.2623.112 Safari/537.36 |
Follow-up Comment #3, bug #47701 (project wget):
I tried adding the --local-encoding=UTF-8 command-line option as suggested by
Eli, (Thanks, Eli!) but that also didn't quite work 100% for me.
Instead of simply failing to download anything, as before, now it makes a good
attempt at downloading the requested directory, but (presumably because f'ing
losedows doesn't use UTF-8 internally) it not only still encodes it wrong, it
encodes it wrong _differently._ In this case it encodes as a capital A with
tilde followed by the pilcrow, which would be C3 B6 in Latin-1 or Windows
Codepage 1252.
Don't get me wrong, it does actually download the files, they're just encoded
wrong on my hard drive now.
When I run wget with debug on, it throws a lot of stuff at me I don't
understand, but it might be useful to you so I'll include it here.
It displays it in 'cmd' in _a-whole-nother_ encoding, as two box-drawing
characters in DOS Codepage 437! (AAARGH! DIE, MICROSOFT, DIE!)
Due to one or more of these encoding issues, wget *seems* to decide not to get
some of the stuff I asked it to:
Deciding whether to enqueue "https://leoandpeto.com/icons/blank.gif".
Going to "icons" would escape "Music/Peto/Non/o-z/R├╢yksopp - The
Understanding" with no_parent on.
Decided NOT to load it.
Deciding whether to enqueue
"https://leoandpeto.com/Music/Peto/Non/o-z/R%C3%B6yksopp%20-%20The%20Understanding/?C=N;O=A".
Already on the black list.
Decided NOT to load it.
Deciding whether to enqueue
"https://leoandpeto.com/Music/Peto/Non/o-z/R%C3%B6yksopp%20-%20The%20Understanding/?C=M;O=A".
Already on the black list.
Decided NOT to load it.
and something seems to make windows-sockets/winsock output lots of stuff, too.
This might be normal for debug mode though.
200 OK
Length: 5409000 (5.2M) [audio/mpeg]
Saving to: 'R303266yksopp - The Understanding/01 Triumphant.mp3'
R├╢yksopp - The Understanding/01 Triumphant. 0%[
] 0
--.-KB/s seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
R├╢yksopp - The Understanding/01 Triumphant. 0%[
] 24.00K
113KB/s seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
R├╢yksopp - The Understanding/01 Triumphant. 1%[
] 56.00K
119KB/s seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
R├╢yksopp - The Understanding/01 Triumphant. 1%[>
] 104.00K
152KB/s seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
[.......]
> thanks anonymous for the detailed bug report.
Oh you're quite welcome, I know a detail-less bug report is useless.
-- Anonymous Coward.
_______________________________________________________
Reply to this item at:
<http://savannah.gnu.org/bugs/?47701>
_______________________________________________
Message sent via/by Savannah
http://savannah.gnu.org/