Subject: Re: [Gnu-arch-users] Storage efficiency of revlibs
From: Ludovic Courtès
Date: Thu, 08 Dec 2005 09:08:47 +0100
User-agent: Gnus/5.110004 (No Gnus v0.4) Emacs/21.4 (gnu/linux)
Mikhael Goikhman <address@hidden> writes:
> % revision=archzoom--devel--0--patch-300
> % cd `tla library-find $revision`/..
> % tar cf - --exclude $revision/,,patch-set --exclude $revision/,,index \
> --exclude $revision/,,index-by-name $revision | gzip -9 >$revision.tar.gz
> % du -s --block-size=1 $revision
> % ls -s --block-size=1 $revision.tar.gz
> 3403776 archzoom--devel--0--patch-300
> 163840 archzoom--devel--0--patch-300.tar.gz
>
> The ratio is 21. There is a small, but increasing gain when compared with
> earlier revisions (18), in particular because {arch} contains a lot of
> small files that are compressed nicely. Probably better than hardlinking.
You're comparing the size of a *single* revision directory against
tar+gz. This doesn't make much sense since, by definition, the hard
link trick compresses data *across* several revisions.
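To illustrate the point, here is a minimal Python sketch (not part of tla; the helper name `library_usage` is made up for this example) that measures several revision trees at once. It counts each inode only once, which is roughly what hard linking buys across revisions. It sums apparent file sizes rather than disk blocks and ignores directory and inode overhead, so treat the numbers as an approximation.

```python
import os
import stat


def library_usage(revision_dirs):
    """Return (naive_bytes, shared_bytes) for a list of revision trees.

    naive_bytes  counts every regular file in every tree.
    shared_bytes counts each inode once, so files hard-linked between
                 revisions are only charged to the first tree seen.
    """
    seen = set()
    naive = 0
    shared = 0
    for root in revision_dirs:
        for dirpath, _dirnames, filenames in os.walk(root):
            for name in filenames:
                st = os.lstat(os.path.join(dirpath, name))
                if not stat.S_ISREG(st.st_mode):
                    continue  # skip symlinks and special files
                naive += st.st_size
                key = (st.st_dev, st.st_ino)
                if key not in seen:
                    seen.add(key)
                    shared += st.st_size
    return naive, shared
```

Called on a single revision the two figures are identical, which is why a one-revision measurement cannot show any benefit from hard links. The gap only appears when several revisions that share unchanged files are passed together.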
> Please don't forget that a hardlink costs more than 0,
Can you elaborate on that?
> and also that for
> every merged external revision there are at least 2 more files, in {arch}
> and ,,patch-log/, and possibly new subdirs too (not hardlink-able).
Right.
> For me (and for du/rm) it is not the size, but number of inodes that is
> more important, so this very CPU expensive solution would not solve much.
There are several good papers on the topic [0,1,2]. I'm pretty
confident that hard links plus gzip of individual files would yield a
better compression ratio than keeping a whole tarball per revision,
*when* several consecutive revisions are kept.
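One way to check this claim on a real library would be something like the following Python sketch. It is only an estimate under stated assumptions: both function names are hypothetical, compression is done in memory with the standard `gzip` and `tarfile` modules rather than by tla, and filesystem overhead (inodes, directories, block rounding) is not counted.

```python
import gzip
import io
import os
import stat
import tarfile


def per_file_gzip_bytes(revision_dirs):
    """Total size if each distinct inode is gzipped once (hard links shared)."""
    seen = set()
    total = 0
    for root in revision_dirs:
        for dirpath, _dirnames, filenames in os.walk(root):
            for name in filenames:
                path = os.path.join(dirpath, name)
                st = os.lstat(path)
                if not stat.S_ISREG(st.st_mode):
                    continue  # skip symlinks and special files
                key = (st.st_dev, st.st_ino)
                if key in seen:
                    continue  # already stored through an earlier revision
                seen.add(key)
                with open(path, "rb") as f:
                    total += len(gzip.compress(f.read(), compresslevel=9))
    return total


def per_revision_targz_bytes(revision_dirs):
    """Total size if every revision is kept as its own gzip -9 tarball."""
    total = 0
    for root in revision_dirs:
        buf = io.BytesIO()
        arcname = os.path.basename(os.path.normpath(root))
        with tarfile.open(fileobj=buf, mode="w:gz", compresslevel=9) as tar:
            tar.add(root, arcname=arcname)
        total += len(buf.getvalue())
    return total
```

Comparing the two totals for a run of consecutive revisions gives a rough idea of which scheme stores fewer bytes. It says nothing about the inode count, which is a separate concern.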
Thanks,
Ludovic.
[0] http://ssrc.cse.ucsc.edu/Papers/you-mss04.pdf
[1] http://ssrc.cse.ucsc.edu/Papers/you-icde05.pdf
[2] http://www.usenix.org/events/usenix04/tech/general/full_papers/kulkarni/kulkarni_html/paper.html