bug-wget archive search

Search String: Display: Description: Sort:

Results:

References: [ warc: 295 ]

Total 295 documents matching your query.

21. [Bug-wget] [patch] uuid generation in warc.c (score: 40)
Author: HIDDEN
Date: Tue, 16 Dec 2014 00:26:35 +0100
Hi, in src/warc.c three methods are provided to generate uuids: libuuid, uuid functions from libc, and a fallback method. At least OpenBSD, FreeBSD and NetBSD provide those uuid functions in their li
/archive/html/bug-wget/2014-12/msg00079.html (8,239 bytes)

22. [Bug-wget] [PATCH] warc: Fix UUID generation on FreeBSD (score: 40)
Author: HIDDEN
Date: Sun, 22 Jun 2014 15:31:48 +0200
-- Reported off-list by Alex Zimnitsky <address@hidden>. ChangeLog | 4 ++++ configure.ac | 6 ++++++ src/ChangeLog | 7 +++++++ src/warc.c | 10 +++++++++- 4 files changed, 26 insertions(+), 1 deletion(
/archive/html/bug-wget/2014-06/msg00103.html (6,586 bytes)

23. Re: [Bug-wget] wget 1.14 possibly writing off-spec warc.gz files (score: 40)
Author: HIDDEN
Date: Sun, 31 Mar 2013 00:46:00 +0100
Hi, I think that's correct: Wget doesn't write the subfield length in the "extra field" section of the header. After the subfield ID "sl" it should write the length LEN (see RFC 1952 [1]), but it doe
/archive/html/bug-wget/2013-03/msg00082.html (6,747 bytes)

24. Re: [Bug-wget] [PATCH] Invalid Content-Length header in WARC files, on some platforms (score: 40)
Author: HIDDEN
Date: Wed, 14 Nov 2012 11:32:10 +0100
Hi, as David Ryskalczyk stated, just two printf format specifiers might cause the havoc. I think, there is not need to use wgint instead of off_t. @Guiseppe: please apply the appended patches (maybe
/archive/html/bug-wget/2012-11/msg00031.html (9,118 bytes)

25. Re: [Bug-wget] Invalid Content-Length header in WARC files, on some platforms (score: 40)
Author: HIDDEN
Date: Tue, 13 Nov 2012 09:48:37 +0100
Hello Gis, just out of curiosity. What about setting the compiler option -D _FILE_OFFSET_BITS=64 on these systems ? Since off_t is used in many places for file length, there should be many more probl
/archive/html/bug-wget/2012-11/msg00025.html (8,121 bytes)

26. Re: [Bug-wget] Validity of angle brackets around WARC-Target-URI value (score: 39)
Author: HIDDEN
Date: Fri, 17 Nov 2017 03:31:16 -0500
For what it's worth, I confirmed that Heritrix (Internet Archive's crawling tool) produces WARC files without the angle brackets for WARC-Target-URI. Best regards, William Prescott
/archive/html/bug-wget/2017-11/msg00057.html (6,986 bytes)

27. Re: [Bug-wget] WARC File Creation - Scope Issues (score: 39)
Author: HIDDEN
Date: Fri, 12 Apr 2013 10:32:13 +0200
Hello Mark, to capture a single document just execute e.g. wget --warc-file single_page 'https://webarchive.jira.com/wiki/display/wayback/Wayback+Installation+and+Configuration+Guide#WaybackInstallat
/archive/html/bug-wget/2013-04/msg00026.html (6,734 bytes)

28. Re: [Bug-wget] wget 1.14 possibly writing off-spec warc.gz files (score: 39)
Author: HIDDEN
Date: Sat, 30 Mar 2013 21:54:23 +0100
Am Freitag, 29. März 2013 schrieb Andy Jackson: Just a very quick test (before I go to bed) shows an unexpected behaviour to me: $ wget -O tempname --warc-file="output" "http://example.com" results
/archive/html/bug-wget/2013-03/msg00080.html (6,324 bytes)

29. [Bug-wget] Fix for gzip bug in WARC + zlib 1.2.4 (score: 39)
Author: HIDDEN
Date: Wed, 09 Nov 2011 14:14:28 +0100
Hi, There is a small bug in the WARC methods. The function gzdopen () is called with 'wb+9'. The '+' is ignored by zlib 1.2.3.*, but it causes an error with zlib 1.2.4. The attached patch removes the
/archive/html/bug-wget/2011-11/msg00005.html (4,817 bytes)

30. Re: [Bug-wget] WARC, new version (score: 39)
Author: HIDDEN
Date: Fri, 04 Nov 2011 23:14:57 +0100
That's good to hear. There's one other small adjustment that you may want to make, see the attached patch. One of the WARC functions uses the basename function, which causes problems on OS X. Includ
/archive/html/bug-wget/2011-11/msg00002.html (5,887 bytes)

31. Re: [Bug-wget] WARC, new version (score: 39)
Author: HIDDEN
Date: Mon, 24 Oct 2011 00:15:37 +0200
Hi Giuseppe, Thanks for your reply. I've attached a new version of the patch that includes a fallback function that generates UUIDs from rand (version 4 from RFC 4122, the UUID description). The only
/archive/html/bug-wget/2011-10/msg00041.html (6,372 bytes)

32. [bug #59086] --page-requisites not always working when creating a warc file (score: 38)
Author: HIDDEN
Date: Wed, 9 Sep 2020 04:52:04 -0400 (EDT)
URL: <https://savannah.gnu.org/bugs/?59086> Summary: --page-requisites not always working when creating a warc file Project: GNU Wget Submitted by: thomasegense Submitted on: Wed 09 Sep 2020 08:52:02
/archive/html/bug-wget/2020-09/msg00001.html (6,503 bytes)

33. [Bug-wget] [bug #47281] WARC URI Headers Improperly Quoted (score: 38)
Author: HIDDEN
Date: Sat, 27 Feb 2016 11:13:30 +0000
URL: <http://savannah.gnu.org/bugs/?47281> Summary: WARC URI Headers Improperly Quoted Project: GNU Wget Submitted by: None Submitted on: Sat 27 Feb 2016 11:13:29 UTC Category: Program Logic Severity
/archive/html/bug-wget/2016-02/msg00046.html (6,107 bytes)

34. Re: [Bug-wget] [patch] uuid generation in warc.c (score: 38)
Author: HIDDEN
Date: Sun, 15 Feb 2015 20:27:31 +0200
[Please CC me directly, as I'm not subscribed to the list.] Yes, thanks. Updated patch is attached. 2015-02-14 Eli Zaretskii <address@hidden> Gisle Vanem <address@hidden> * warc.c (windows_uuid_str)
/archive/html/bug-wget/2015-02/msg00046.html (9,024 bytes)

35. Re: [Bug-wget] [patch] uuid generation in warc.c (score: 38)
Author: HIDDEN
Date: Sat, 14 Feb 2015 13:10:07 +0200
The patch I suggest is below. It uses the fallback method if Rpcrt4.dll cannot be loaded, or if the functions from that DLL fail for some reason. 2015-02-14 Eli Zaretskii <address@hidden> Gisle Vane
/archive/html/bug-wget/2015-02/msg00043.html (8,262 bytes)

36. [Bug-wget] WARC File Creation - Scope Issues (score: 38)
Author: HIDDEN
Date: Thu, 11 Apr 2013 15:13:57 +0000
This is not a 'bug' by any means, but I could find no better place to post this so please forgive me... I've used 'wget' for years but am just now discovering the real power it has. Lately I have upg
/archive/html/bug-wget/2013-04/msg00022.html (5,666 bytes)

37. Re: [Bug-wget] WARC, new version (score: 38)
Author: HIDDEN
Date: Sun, 30 Oct 2011 18:18:48 +0100
Hi Giuseppe, * I've changed the configure.ac and src/Makefile.am. * I've added a ChangeLog entry. See the new version of the patch. I've also attached a patch with just the changes in these three fil
/archive/html/bug-wget/2011-10/msg00047.html (6,447 bytes)

38. only the final 206 partial content response included in WARC file (score: 37)
Author: HIDDEN
Date: Sun, 30 Apr 2023 08:32:09 -0700
Hi, I believe I found a bug. While downloading a large file with wget, the connection failed multiple times. Wget retried with a range request until it had the entire file downloaded. In the resultin
/archive/html/bug-wget/2023-04/msg00013.html (4,482 bytes)

39. WARC outputs (score: 37)
Author: HIDDEN
Date: Fri, 11 Nov 2022 11:42:17 +0000
Good morning, New to wget and web archiving in general here. I've been trying to use wget to mirror a couple of my websites and output WARC files however I am unable to view the WARCs in webarchivepl
/archive/html/bug-wget/2022-11/msg00001.html (3,916 bytes)

40. [Bug-wget] Validity of angle brackets around WARC-Target-URI value (score: 37)
Author: HIDDEN
Date: Tue, 14 Nov 2017 23:45:12 -0500
Hello, It seems that there may be some ambiguity in the WARC standard regarding the usage of angle brackets surrounding the URI given for a WARC-Target-URI field. In short, while the BNF grammar incl
/archive/html/bug-wget/2017-11/msg00050.html (6,170 bytes)


This search system is powered by Namazu