[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Bug-wget] [bug #47689] Support for UTF-16 encoding.
From: |
kenorb |
Subject: |
[Bug-wget] [bug #47689] Support for UTF-16 encoding. |
Date: |
Wed, 13 Apr 2016 18:42:53 +0000 |
User-agent: |
Mozilla/5.0 (Macintosh; Intel Mac OS X 10_11_2) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/49.0.2623.87 Safari/537.36 |
URL:
<http://savannah.gnu.org/bugs/?47689>
Summary: Support for UTF-16 encoding.
Project: GNU Wget
Submitted by: kenorb
Submitted on: Wed 13 Apr 2016 06:42:52 PM GMT
Category: Localization
Severity: 3 - Normal
Priority: 5 - Normal
Status: None
Privacy: Public
Assigned to: None
Originator Name:
Originator Email:
Open/Closed: Open
Discussion Lock: Any
Release: 1.16.3
Operating System: Mac OS
Reproducibility: Every Time
Fixed Release: None
Planned Release: None
Regression: None
Work Required: None
Patch Included: None
_______________________________________________________
Details:
The following site has UTF-16 encoding:
http://www.free-energy-info.co.uk/
W3C claim it's UTF-16LE, but it's not relevant.
By default wget doesn't recognise the source of it, because it's not following
any links when using with -m or -r.
When specifying remote-encoding, it doesn't work either:
$ wget --remote-encoding=UTF-16 http://www.free-energy-info.co.uk/
This version does not have support for IRIs
The same for any format, including when specifying `--no-iri`.
What should be the fix in order that encoding of that site can be parsed by
wget?
Related: http://stackoverflow.com/q/36605946/55075
_______________________________________________________
Reply to this item at:
<http://savannah.gnu.org/bugs/?47689>
_______________________________________________
Message sent via/by Savannah
http://savannah.gnu.org/
- [Bug-wget] [bug #47689] Support for UTF-16 encoding.,
kenorb <=