Namazu: a Full-Text Search Engine

This index contains 0 documents and 0 keywords.

Last modified: date


Query: [How to search]

Display: Description: Sort:

Results:

References: [ warc: 270 ]

Total 270 documents matching your query.

1. Re: [Bug-wget] GSOC Project Availability (score: 3)
Author: Tim Rühsen <tim.ruehsen@gmx.de>
Date: Tue, 27 Mar 2018 16:00:58 +0200
Hi Eric, IMO the deadline for GSOC student applications is today 18:00 CEST, so you have to hurry. During 27th March and 23rd April the organizations review and decide for/against proposals. Both, ht
/archive/html/bug-wget/2018-03/msg00031.html (5,405 bytes)

2. [Bug-wget] GSOC Project Availability (score: 2)
Author: Eric Ngo <eric.ngo1xyz@gmail.com>
Date: Mon, 26 Mar 2018 14:34:02 -0700
To whom this may concern, My name is Eric Ngo and I am a computer science major at San Francisco State University. I was looking for open-source projects to contribute to in GSOC 2018 and came across
/archive/html/bug-wget/2018-03/msg00030.html (4,163 bytes)

3. Re: [Bug-wget] GSoC'18: DNS over HTTPS. (score: 2)
Author: Darshit Shah <darnir@gmail.com>
Date: Thu, 22 Mar 2018 14:38:14 +0100
Hi, I'll get to a discussion about the proposal shortly, but in the meantime, may I please request everyone to avoid continuing this email thread on address@hidden That is a generic mailing list for
/archive/html/bug-wget/2018-03/msg00028.html (7,052 bytes)

4. Re: [Bug-wget] GSoC'18: DNS over HTTPS. (score: 2)
Author: Tim Rühsen <tim.ruehsen@gmx.de>
Date: Thu, 22 Mar 2018 14:16:25 +0100
Since your code will likely use functions from libwget and the other way round, we should place it in libwget/. But if it makes your development easier during GSOC, feel free to put it into a separat
/archive/html/bug-wget/2018-03/msg00027.html (6,137 bytes)

5. Re: [Bug-wget] How to intercept wget to extract the raw requests and the raw responses? (score: 10)
Author: Bykov Alexey <gnfalex@rambler.ru>
Date: Thu, 15 Feb 2018 21:34:22 +0200
wget --warc-file=httpbin -qO- https://httpbin.org/get How to convert the warc format to the actual header of requests and responses? Greetings WARC is gzipped plain text. wget --warc-file=httpbin --n
/archive/html/bug-wget/2018-02/msg00025.html (6,867 bytes)

6. Re: [Bug-wget] How to intercept wget to extract the raw requests and the raw responses? (score: 4)
Author: Peng Yu <pengyu.ut@gmail.com>
Date: Thu, 15 Feb 2018 14:27:14 +0000
How to convert the warc format to the actual header of requests and responses? <https://httpbin.org/get> -- Regards, Peng
/archive/html/bug-wget/2018-02/msg00024.html (5,886 bytes)

7. Re: [Bug-wget] How to intercept wget to extract the raw requests and the raw responses? (score: 3)
Author: Bykov Alexey <gnfalex@rambler.ru>
Date: Wed, 14 Feb 2018 20:46:46 +0200
Greetings Did You tried "--warc-file" option? wget --warc-file=httpbin -qO- https://httpbin.org/get Best regards.
/archive/html/bug-wget/2018-02/msg00023.html (5,371 bytes)

8. [Bug-wget] [bug #52705] HTML assets embedding with --page-requisites (score: 4)
Author: Darshit Shah <INVALID.NOREPLY@gnu.org>
Date: Thu, 21 Dec 2017 07:59:46 -0500 (EST)
Follow-up Comment #2, bug #52705 (project wget): While MHTML was a convenient way to create snapshots of pages, sadly it was never properly standardized and most popular browsers no longer support it
/archive/html/bug-wget/2017-12/msg00021.html (5,659 bytes)

9. Re: [Bug-wget] Validity of angle brackets around WARC-Target-URI value (score: 39)
Author: William Prescott <appledesktopbus@gmail.com>
Date: Fri, 17 Nov 2017 03:31:16 -0500
For what it's worth, I confirmed that Heritrix (Internet Archive's crawling tool) produces WARC files without the angle brackets for WARC-Target-URI. Best regards, William Prescott
/archive/html/bug-wget/2017-11/msg00057.html (6,986 bytes)

10. [Bug-wget] Validity of angle brackets around WARC-Target-URI value (score: 37)
Author: William Prescott <appledesktopbus@gmail.com>
Date: Tue, 14 Nov 2017 23:45:12 -0500
Hello, It seems that there may be some ambiguity in the WARC standard regarding the usage of angle brackets surrounding the URI given for a WARC-Target-URI field. In short, while the BNF grammar incl
/archive/html/bug-wget/2017-11/msg00050.html (6,170 bytes)

11. Re: [Bug-wget] Wget1 Gzip Compression (score: 2)
Author: Giuseppe Scrivano <gscrivano@gnu.org>
Date: Wed, 26 Jul 2017 10:37:50 +0200
Hi Tim, I think that would be a nice feature to have. We are already linking to libz for the WARC support so gzip compression won't require a new dependency for wget. Regards, Giuseppe
/archive/html/bug-wget/2017-07/msg00034.html (5,122 bytes)

12. [Bug-wget] [bug #51029] Reproducible Segmentation Fault in 1.16, 1.18, 1.19 (score: 2)
Author: anonymous <INVALID.NOREPLY@gnu.org>
Date: Tue, 16 May 2017 03:53:41 -0400 (EDT)
Follow-up Comment #4, bug #51029 (project wget): Hi again, our system hit another website with the same behavior. It's the same call as in the original post but with https://www.sparkasse.at as targe
/archive/html/bug-wget/2017-05/msg00070.html (6,740 bytes)

13. [Bug-wget] [bug #51029] Reproducible Segmentation Fault in 1.16, 1.18, 1.19 (score: 5)
Author: anonymous <INVALID.NOREPLY@gnu.org>
Date: Mon, 15 May 2017 11:07:10 -0400 (EDT)
URL: <http://savannah.gnu.org/bugs/?51029> Summary: Reproducible Segmentation Fault in 1.16, 1.18, 1.19 Project: GNU Wget Submitted by: None Submitted on: Mon 15 May 2017 03:07:09 PM UTC Category: Pr
/archive/html/bug-wget/2017-05/msg00060.html (13,226 bytes)

14. Re: [Bug-wget] Wget2 plans (Was Re: PATCH: tests for SSL) (score: 4)
Author: Tim Rühsen <tim.ruehsen@gmx.de>
Date: Sun, 30 Apr 2017 11:52:24 +0200
Hi Vijo, We try to be backward compatible with options (name and functionality). But it's not a must. We are free to fix bugs or change/extend behavior. That's why we call the executable 'wget2'. It
/archive/html/bug-wget/2017-04/msg00061.html (6,277 bytes)

15. [Bug-wget] [bug #50788] Build failure against openssl-1.1 that lacks deprecated features (score: 2)
Author: Lars Wendler <INVALID.NOREPLY@gnu.org>
Date: Wed, 12 Apr 2017 05:05:52 -0400 (EDT)
URL: <http://savannah.gnu.org/bugs/?50788> Summary: Build failure against openssl-1.1 that lacks deprecated features Project: GNU Wget Submitted by: polyc Submitted on: Wed 12 Apr 2017 11:05:51 AM CE
/archive/html/bug-wget/2017-04/msg00028.html (8,540 bytes)

16. [Bug-wget] patch for writing cdx records (score: 9)
Author: Christof Horschitz <christof@nimbusec.com>
Date: Wed, 22 Mar 2017 14:01:50 +0100
Hi, attached you can find a patch that proposes a change to the file warc.c. The change will use url_escape to escape reserved characters in the redirect_location. Up to the current version (1.19) wg
/archive/html/bug-wget/2017-03/msg00115.html (5,026 bytes)

17. Re: [Bug-wget] [GSOC 2017] Design and implementation of a statistics module (score: 2)
Author: Avinash Sonawane <rootkea@gmail.com>
Date: Tue, 21 Mar 2017 10:59:52 +0530
Sure. I'll add the docs to wiki. But first let me fix few hangs. :) Thanks Tim! Just what I was looking for. -- Avinash Sonawane (rootKea) PICT, Pune https://rootkea.wordpress.com
/archive/html/bug-wget/2017-03/msg00099.html (6,183 bytes)

18. Re: [Bug-wget] [GSOC 2017] Design and implementation of a statistics module (score: 2)
Author: Tim Ruehsen <tim.ruehsen@gmx.de>
Date: Mon, 20 Mar 2017 17:09:45 +0100
Welcome Avinash ! Oh yes, I wish I had some more time to do that... :-) wget.addictivecode.org is for Wget1.x only. There is not yet a similar doc, but it is time to write it. This is low hanging fru
/archive/html/bug-wget/2017-03/msg00093.html (7,780 bytes)

19. Re: [Bug-wget] Patch: Always surround the "WARC-Target-URI" value with angle brackets (score: 48)
Author: Tim Rühsen <tim.ruehsen@gmx.de>
Date: Sat, 04 Mar 2017 12:55:39 +0100
Thanks, Bejamin, your patch is applied (trivial, no FSF copyright assignment required). Regards, Tim Attachment: signature.asc Description: This is a digitally signed message part.
/archive/html/bug-wget/2017-03/msg00013.html (7,355 bytes)

20. [Bug-wget] Patch: Always surround the "WARC-Target-URI" value with angle brackets (score: 48)
Author: Benjamin Esham <benjamin@esham.io>
Date: Fri, 3 Mar 2017 09:00:57 -0500
Hello, When producing WARC files, Wget records the requested URI in the "WARC-Target-URI" field. I noticed that Wget encloses the value of this URI within <angle brackets> in blocks with "WARC-Type:
/archive/html/bug-wget/2017-03/msg00006.html (6,557 bytes)


This search system is powered by Namazu v

foobar@namazu.org