[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Bug-wget] possible bug? pathconf: Not a directory
From: |
Ed Summers |
Subject: |
Re: [Bug-wget] possible bug? pathconf: Not a directory |
Date: |
Mon, 15 Jun 2015 16:26:16 -0400 |
Just in case this particular problem comes again…
It turns out that adding adding --adjust-extension and --convert-links forced
the desired behavior of downloading
http://politwoops.sunlightfoundation.com/tweet/599258910776754176
as
politwoops.sunlightfoundation.com/tweet/599258910776754176.html
which then allowed the subsequent fetch for
http://politwoops.sunlightfoundation.com/tweet/599258910776754176/thumb/599258910776754176-0.jpg
to succeed since it was able to create a directory at:
politwoops.sunlightfoundation.com/tweet/599258910776754176
for the jpg. The nice thing is that the WARC reflects the actual URLs
requested, not the rewritten ones, which is exactly the behavior I wanted.
Thanks for the help,
//Ed
> On Jun 9, 2015, at 12:12 AM, Ed Summers <address@hidden> wrote:
>
> Hi Ander,
>
>> On Jun 8, 2015, at 2:48 AM, Ander Juaristi <address@hidden> wrote:
>>
>> You can work around the issue with -nd, which will download all the files in
>> the same directory (will not recreate the directory structure). I've tested
>> it and it correctly downloads the missing image.
>
> Thanks so much for this. -nd did in fact result in the representation being
> fetched and stored in the WARC. I really appreciate your help!
>
> //Ed
signature.asc
Description: Message signed with OpenPGP using GPGMail