Re: [Gnu-arch-users] Re: darcs vs tla

gnu-arch-users

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Gnu-arch-users] Re: darcs vs tla

From:	Dustin Sallings
Subject:	Re: [Gnu-arch-users] Re: darcs vs tla
Date:	Mon, 8 Nov 2004 15:20:51 -0800


On Nov 8, 2004, at 2:08, Catalin Marinas wrote:

Timothy Webster <address@hidden> writes:

I would like to hear from users who have tried both tla and
darcs. And specifically why I should not go with darcs.

I use both quite a bit and really have a difficult time trying tofigure out which one I like more.

(I'm trying to use generic language below since there are differentterms to describe concepts in arch and darcs...hopefully, like me,you'll find that fact more annoying than the terms I'm using)

Darcs has some nice concepts such as breaking a checkin into multiplecheckins at commit time (i.e. I changed two parts of a file, but thischangeset should only reflect the changes I made to the bottom of thefile, not the stuff I did at the top or in the middle.).

It's also very nice that a working directory is effectively a branchfrom a checked out tree. This is a very natural concept to branching.I was arguing with a friend about how bad things like CVS breakpeople's mentalities and prevent them from doing better and he gave mesome sort of ``9x% of the time all I do is ci and up,'' regardingbranching. I pointed out that in darcs, an ``update'' is a branchintegration and there's no way to distinguish the two. That's one ofthe nicest things about using it.

The branching in general is actually very nice in its simplicity.There are no limits on integrations (that I can tell) short of patchdependencies, and it doesn't require you to think about branching oroffline development before you find yourself in a foxhole somewherewithout connectivity.

I end up sending patches via email to a central repository as well,which I find to be very nice.

However, the lack of separation between a repository and a workingtree can be a little odd. Having multiple projects in a single unit ofrepository in arch seems nice in that there's one thing I have to worryabout setting up and incrementally adding distinct projects to it thatare related only in how I think about them. With darcs, I do havecommon project directories, but I have to manage each piece separately(although I have scripts that help with a lot of this).

Also, in arch, a branch has a separate patch space than the tree fromwhich you branched. This, along with cacherevs can make things a lotsmaller and easier to look at (although darcs has some similarconcepts, they're not quite the same). This also gives you theopportunity to have a long-developed feature in a branch be merged as asingle changeset instead of having each little checkin pulled in.

main problem with it - it is incredibly slow. I tried it with the
Linux kernel (~300MB sources) and the commit operation (after applying
an 18MB patch) took around 3 hours, in which time my machine was
completely unusable.

The extreme case isn't handled all that efficiently yet, no. Ibelieve all this tells you is that it's possible to handle very largeprojects, although if you actually have a project with this much sort,it might not be recommended just yet.

existing structure or not). Even if this would be implemented, more
engineering needs to go into it before it could cope with the level of
patches in the Linux kernel (around 50 patches a day).

Again, that's a phenomenally big project. I'm not arguing that Linuxis particularly well designed or managed, but it's extremely rare forany project with that kind of commit rate to exist in the world withany sort of quality. Plenty of software houses might have that kind ofrate, but not necessarily in a single project.

A second problem I think is Haskell. Not so many people can help with
coding and it is also much slower than C or C++. The today's compilers
are not smart enough to optimally deal with pure functional
languages.

This is clearly wrong. Haskell was the #1 reason that pointed me inthe direction of darcs (and no, I didn't know very much of it at thetime). I greatly support projects creating software in higher levellanguages instead of holding so fast to the belief that it'll be slowif they do it in anything other than C.

I write a lot of code in OCaml (not purely functional, though most ofmy code is), and I can assure you *that* compiler optimizes very wellcompared to gcc. It does not seem intuitive to me that a low-levelcompiler such as C could optimize better than a high level compilersuch as that of ghc, ocaml, eiffel, etc... Expressing what you want ata high level gives the compiler much more flexibility in how it candeal with it (i.e. ``move data over there'' vs. ``allocate a 32-bitinteger pointer to the top of this buffer and seek to the first nullposition [...]'').

Anyway, my experience has shown me that I can get far faster apps withless code (i.e. sooner) by avoiding C, and have them be more stable toboot (we all make mistakes).

With darcs you also need to understand its theory of patches since it
doesn't report a conflict for cases where arch does (this is where I
think darcs should at least let you know).

Perhaps it should let you know, but I think this is more of a workflowissue. I.e. perforce lets me know when there are conflicts, but it'snot that easy to read (conflicts and updates look very similar), so Iend up wrapping my updates in a script that does my update, automaticconflict resolution, tagging, and occasionally branch integration atone time.

Arch's patches are more
readable since they are based on the diff format.

That's not exactly true. A darcs patch is a single text file, whilean arch patch is a tarred up directory with standard diffs along withother supporting files.

While darcs is a nice research project, my recommendation would be to
stay with arch, at least until you hear somebody happily using darcs
with a huge source tree like the Linux kernel.

I don't know that a source tree like the Linux kernel is all thatnecessary, but darcs itself has had nearly 2,200 patches since 2002.This is compared to about 4,000 in a project at my company with what Iconsider to be a fairly rapidly developed project since December 2001.(Actually, this project, too, is broken into two trees of about 4,000patches and 3,500 patches in the same timeline).

While I do believe it's a good metric, how a system handles the mostextreme case you can find isn't necessarily a practical way todetermine what's a good fit for you.


--
Dustin Sallings

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [Gnu-arch-users] Re: darcs vs tla, (continued)

Prev by Date: Re: [OT] Re: [Gnu-arch-users] Re: Re: community spirit
Next by Date: Re: [Gnu-arch-users] Re: darcs vs tla
Previous by thread: Re: [Gnu-arch-users] Re: darcs vs tla
Next by thread: Re: [Gnu-arch-users] Re: darcs vs tla
Index(es):
- Date
- Thread