[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Bug-wget] How remove all after ? or @ (REAL PROBLEM)
From: |
Dale R. Worley |
Subject: |
Re: [Bug-wget] How remove all after ? or @ (REAL PROBLEM) |
Date: |
Tue, 23 Aug 2016 11:07:19 -0400 |
address@hidden (Dale R. Worley) writes:
> Павел Серегов <address@hidden> writes:
>> file on server http://site.com/style.css?v1000
>> downloaded file address@hidden
>>
>> How remove @v1000
>> I want result: style.css (without @v1000)
>
> The easiest way is to specify the output file name you want with
> "--output-file=style.css".
Correction, that should be "--output-document=style.css" (for a single file).
Павел Серегов <address@hidden> writes:
> The file is not one. I'm downloading the whole site.
> wget -m -E -o wget_log.txt http://www.store-discount.ru/
The problem is that the URL contains a query part ("?v1000"), and wget
needs to record that in the file name. In general, it is possible that
wget will download both http://site.com/style.css?v1000 and
http://site.com/style.css?v2000, and it needs separate file names for
both.
wget does not provide a facility to adjust the names of the downloaded
files, except in a few particular ways. You will probably have to first
download all the files and then run a program to rename the downloaded
files to the names you want.
> =============================
> Next problem "duplicate". if use "-E"
>
> without:
> address@hidden
>
> with "-E"
> address@hidden
> =============================
That is the behavior that is prescribed for -E. Similarly, you will
probably have to run a separate program to rename the files.
Dale