bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] possible bug? pathconf: Not a directory


From: Ed Summers
Subject: Re: [Bug-wget] possible bug? pathconf: Not a directory
Date: Mon, 15 Jun 2015 16:26:16 -0400

Just in case this particular problem comes again…

It turns out that adding adding --adjust-extension and --convert-links forced 
the desired behavior of downloading

    http://politwoops.sunlightfoundation.com/tweet/599258910776754176

as

    politwoops.sunlightfoundation.com/tweet/599258910776754176.html

which then allowed the subsequent fetch for
    
http://politwoops.sunlightfoundation.com/tweet/599258910776754176/thumb/599258910776754176-0.jpg

to succeed since it was able to create a directory at:

    politwoops.sunlightfoundation.com/tweet/599258910776754176

for the jpg. The nice thing is that the WARC reflects the actual URLs 
requested, not the rewritten ones, which is exactly the behavior I wanted.

Thanks for the help,

//Ed

> On Jun 9, 2015, at 12:12 AM, Ed Summers <address@hidden> wrote:
> 
> Hi Ander,
> 
>> On Jun 8, 2015, at 2:48 AM, Ander Juaristi <address@hidden> wrote:
>> 
>> You can work around the issue with -nd, which will download all the files in 
>> the same directory (will not recreate the directory structure). I've tested 
>> it and it correctly downloads the missing image.
> 
> Thanks so much for this. -nd did in fact result in the representation being 
> fetched and stored in the WARC. I really appreciate your help!
> 
> //Ed

Attachment: signature.asc
Description: Message signed with OpenPGP using GPGMail


reply via email to

[Prev in Thread] Current Thread [Next in Thread]