Big XML files... (was Re: [Pan-users] Re: Better processing of very larg

pan-users

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Big XML files... (was Re: [Pan-users] Re: Better processing of very larg

From:	Ron Johnson
Subject:	Big XML files... (was Re: [Pan-users] Re: Better processing of very large groups?)
Date:	Fri, 03 Jul 2009 21:56:36 -0500
User-agent:	Mozilla/5.0 (X11; U; Linux i686 (x86_64); en-US; rv:1.8.1.19) Gecko/20090103 Thunderbird/2.0.0.19 Mnenhy/0.7.6.666

On 2009-07-02 18:53, Duncan wrote:

Ron Johnson <address@hidden> posted
address@hidden, excerpted below, on  Thu, 02 Jul 2009 13:14:20
-0500:

Because giganews has such a long retention period, some groups can have
a very *large number* of messages.  If you subscribe to two or more of
them, you could run out of memory.

As it is, pan seems to sequentially scan thru all messages when marking
a group of them as Read.

There needs to be a better and less memory intensive method of handling
huge groups.  B-trees, hash tables, SQL-Lite, I don't know, but
*something* better than the status quo.

This is true, tho pan is far better than it used to be (it deals withmulti-million messages now, where old-pan had problems with 100k).

One of the problems seems to be his use of big flat files. It'sgreat for being able to peek into the inner working of tasks.nzb,but every time an article gets successfully downloaded, pan mustmake a copy of the file in order to get ride of that one article.If tasks.nzb is large, that takes a while.


Similar problems in groups/.

This reminds me of mbox files in the email world, and it's whyMaildir (where each email is a separate file) is so much moreefficient at doing things other than adding new emails to the end ofthe file.

Also (and maybe because I'm a DBA), this problem just *screams* forSQLite and a database in the "First Normal Form".


--
Scooty Puff, Sr
The Doom-Bringer

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [Pan-users] Re: Better processing of very large groups?, (continued)

Prev by Date: [Pan-users] Re: another update
Next by Date: Re: Big XML files... (was Re: [Pan-users] Re: Better processing of very large groups?)
Previous by thread: Re: [Pan-users] Re: Better processing of very large groups?
Next by thread: Re: Big XML files... (was Re: [Pan-users] Re: Better processing of very large groups?)
Index(es):
- Date
- Thread