|
From: | HB |
Subject: | [Bug-wget] [bug #47689] Support parsing of UTF-16 HTML encoding |
Date: | Fri, 8 Jul 2016 06:46:06 +0000 (UTC) |
User-agent: | Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Ubuntu Chromium/51.0.2704.79 Chrome/51.0.2704.79 Safari/537.36 |
Follow-up Comment #4, bug #47689 (project wget): >The following site has UTF-16 encoding: >http://www.free-energy-info.co.uk/ >W3C claim it's UTF-16LE, but it's not relevant. I have just spent the last two hours trying to wget this same site. When I finally figured out that it wasn't working because of UTF-16 I googled how to get wget to support UTF-16 and found this bug. I want to mirror this site but wget finds no urls to follow in the index.html I was able to convert the index.html to UTF-8 but no way that I know of to easily feed that back to wget for mirroring. Pls advise. Or fix. Thanks _______________________________________________________ Reply to this item at: <http://savannah.gnu.org/bugs/?47689> _______________________________________________ Message sent via/by Savannah http://savannah.gnu.org/
[Prev in Thread] | Current Thread | [Next in Thread] |