bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Bug-wget] [bug #47701] wget 1.17.1 fails to convert from percent encodi


From: Anonymous Coward
Subject: [Bug-wget] [bug #47701] wget 1.17.1 fails to convert from percent encoding to unicode correctly (mingw32)
Date: Thu, 21 Apr 2016 22:17:16 +0000
User-agent: Mozilla/5.0 (Windows NT 10.0) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/49.0.2623.112 Safari/537.36

Follow-up Comment #3, bug #47701 (project wget):

I tried adding the --local-encoding=UTF-8 command-line option as suggested by
Eli, (Thanks, Eli!) but that also didn't quite work 100% for me.

Instead of simply failing to download anything, as before, now it makes a good
attempt at downloading the requested directory, but (presumably because f'ing
losedows doesn't use UTF-8 internally) it not only still encodes it wrong, it
encodes it wrong _differently._ In this case it encodes as a capital A with
tilde followed by the pilcrow, which would be C3 B6 in Latin-1 or Windows
Codepage 1252.

Don't get me wrong, it does actually download the files, they're just encoded
wrong on my hard drive now.

When I run wget with debug on, it throws a lot of stuff at me I don't
understand, but it might be useful to you so I'll include it here.

It displays it in 'cmd' in _a-whole-nother_ encoding, as two box-drawing
characters in DOS Codepage 437! (AAARGH! DIE, MICROSOFT, DIE!)

Due to one or more of these encoding issues, wget *seems* to decide not to get
some of the stuff I asked it to:


Deciding whether to enqueue "https://leoandpeto.com/icons/blank.gif";.
Going to "icons" would escape "Music/Peto/Non/o-z/R├╢yksopp - The
Understanding" with no_parent on.
Decided NOT to load it.
Deciding whether to enqueue
"https://leoandpeto.com/Music/Peto/Non/o-z/R%C3%B6yksopp%20-%20The%20Understanding/?C=N;O=A";.
Already on the black list.
Decided NOT to load it.
Deciding whether to enqueue
"https://leoandpeto.com/Music/Peto/Non/o-z/R%C3%B6yksopp%20-%20The%20Understanding/?C=M;O=A";.
Already on the black list.
Decided NOT to load it.


and something seems to make windows-sockets/winsock output lots of stuff, too.
This might be normal for debug mode though.


200 OK
Length: 5409000 (5.2M) [audio/mpeg]
Saving to: 'R303266yksopp - The Understanding/01 Triumphant.mp3'

R├╢yksopp - The Understanding/01 Triumphant.   0%[                        
                                                                     ]       0
 --.-KB/s               seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
R├╢yksopp - The Understanding/01 Triumphant.   0%[                        
                                                                     ]  24.00K
  113KB/s               seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
R├╢yksopp - The Understanding/01 Triumphant.   1%[                        
                                                                     ]  56.00K
  119KB/s               seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
R├╢yksopp - The Understanding/01 Triumphant.   1%[>                       
                                                                     ] 104.00K
  152KB/s               seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
seconds 900.00, Winsock error: 0
[.......]


> thanks anonymous for the detailed bug report.

Oh you're quite welcome, I know a detail-less bug report is useless.

-- Anonymous Coward.

    _______________________________________________________

Reply to this item at:

  <http://savannah.gnu.org/bugs/?47701>

_______________________________________________
  Message sent via/by Savannah
  http://savannah.gnu.org/




reply via email to

[Prev in Thread] Current Thread [Next in Thread]