On Fri, Dec 11, 2015 at 08:22:23PM +0100, Giuseppe Scrivano wrote:
> Hello,
> I am pleased to announce the new version of GNU wget. We consider it a
> bug fixes release as it addresses issues found in 1.17, which contained
> quite a few new features.
> Please report any problem you may experience to the address@hidden
> mailing list.
Four months ago I mentioned two bugs:
1. Non-ASCII filenames are mistreated.
2. The progress bar is broken when the filename is non-ASCII.
And I provided patches that fix 2, and fix 1 on Unix.
Tim Ruehsen polished the second patch a bit more, and then
nothing more was heard about it.
The first patch was for Unix, and there was some amount of discussion
of the Windows situation with Eli Zaretskii. But nobody offered
a Windows patch, and my patch is OK, but Unix-only.
Now that wget-1.17.1 is out, let me try it on the Russian Wikipedia page
for "heart", Сердце, the same example we used last August.
With my patch I get:
Saving to: ‘Сердце’
With wget-1.17.1 I get:
% wget/wget-1.17.1/src/wget https://ru.wikipedia.org/wiki/%D0%A1%D0%B5%D1%80%D0%B4%D1%86%D0%B5
Saving to: ‘Се\321%80д\321%86е’.
Here wget saves to a name that is not a legal name on this filesystem.
The progress bar is still broken and contains illegal characters.
So I find that wget-1.17.1 is still broken, and no good for downloading
files with UTF-8 filenames from a UTF-8 site to a local UTF-8 site.
There is some default built-in ISO 8859-* ugliness.
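The mangled name is consistent with a filter that treats bytes 0x80-0x9F
as ISO 8859-* C1 control characters and percent-escapes them, which cuts
two-byte UTF-8 sequences in half. A minimal Python sketch of that
hypothesis follows. The function names are made up for illustration, and
the rule is inferred from the output above, not copied from the wget source.

```python
from urllib.parse import unquote_to_bytes

def escape_c1_bytes(raw: bytes) -> bytes:
    # Hypothetical filter: percent-escape bytes 0x80-0x9F, which are
    # C1 control characters in ISO 8859-* but continuation bytes in UTF-8.
    out = bytearray()
    for b in raw:
        if 0x80 <= b <= 0x9F:
            out += b"%%%02X" % b
        else:
            out.append(b)
    return bytes(out)

def show(raw: bytes) -> str:
    # Print valid UTF-8 as text and each stray byte as a \ooo octal escape.
    text = raw.decode("utf-8", errors="surrogateescape")
    return "".join(
        "\\%03o" % (ord(c) - 0xDC00) if 0xDC80 <= ord(c) <= 0xDCFF else c
        for c in text
    )

encoded = "%D0%A1%D0%B5%D1%80%D0%B4%D1%86%D0%B5"  # last URL path component
raw = unquote_to_bytes(encoded)

print(raw.decode("utf-8"))         # the name a UTF-8 user expects
print(show(escape_c1_bytes(raw)))  # the name after the assumed C1 filter
```

For this URL the first line printed is Сердце and the second is the same
mangled string that wget-1.17.1 reported. Only р (D1 80) and ц (D1 86) are
damaged, because only their second bytes fall in the 0x80-0x9F range.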
Andries