Re: [Monotone-devel] partial pull #2 - gaps instead of a single horizon
From: Markus Schiltknecht
Subject: Re: [Monotone-devel] partial pull #2 - gaps instead of a single horizon
Date: Fri, 01 Jun 2007 18:36:28 +0200
User-agent: Icedove 1.5.0.10 (X11/20070329)
Hi,
Christian Ohler wrote:
> This looks like two separate issues to me:
> (1) The total history size of a project in monotone grows without bound.
> (2) The time it takes for a new developer to get a local workspace of a
> project is too high with monotone.
>
> As far as I can tell, problem (1) on its own isn't affecting anyone
> right now -- even though there are a handful of projects in existence
> that would run into it should they ever convert their history to
> monotone. Problem (1) does imply problem (2) in theory, but the real
> reason typical projects have problem (2) right now is unrelated to
> problem (1). The reason is that mtn pull is too CPU-intensive and/or
> not doing proper pipelining.
Agreed, as long as you are talking about relatively young repositories.
But there are repositories with a very large history-size to
checkout-size ratio, well beyond the factor of 3 that Nathaniel
claims is the average. For example, my PostgreSQL repository (the
monotone database) is about 250 MB, while a tar.gz of a fresh checkout
is only 14 MB.
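To put a number on that example (a back-of-the-envelope check using the
figures from this mail, not something computed in the thread):

```python
# History-size vs. checkout-size ratio for the PostgreSQL example above
# (sizes in MB, taken from this mail).
history_mb = 250   # monotone database
checkout_mb = 14   # tar.gz of a fresh checkout
ratio = history_mb / checkout_mb
print(round(ratio))  # roughly 18, far above the claimed average of 3
```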
And things will get worse as soon as people really start using tools
like monotone. Think about merge_into_dir, for example: with it you can
easily drag in a complete foreign repository, possibly hundreds of
megabytes, just to be able to propagate. But that is most probably
exactly the feature you want, and why you chose to have monotone
track that import from the foreign repository in the first place.
Basically, what I'm saying is that the average history vs. checkout size
ratio is probably that low because the tools for tracking history were
lacking. I bet this ratio will grow as soon as people learn about
the benefits of properly tracking history.
> In fact, what the Pidgin project is doing (download compressed mtn
> database snapshots over HTTP) is a solution to (2) that doesn't solve
> (1). Too bad mtn isn't smart enough to offer similar efficiency for
> this particular case. It's a special case, but it's the case that matters.
Why do you want a solution which solves only one of the problems?
Partial pull would solve both (1) and (2), no?
> A complete pull of Pidgin's current database transfers 120 MB. Is this
> the size of history that we want to give up on and recommend partial
> pull for? That doesn't seem very satisfactory.
Huh? Why not? Having to download 10 MB versus 120 MB still makes a
difference of a few minutes on the average internet connection.
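As a rough illustration (the bandwidth figure is my assumption; the mail
only says "a few minutes on the average internet connection"):

```python
# Rough extra transfer time for a full pull vs. a partial pull,
# assuming a ~0.5 MB/s connection (an assumed figure, not from the thread).
full_mb, partial_mb = 120, 10
rate_mb_per_s = 0.5
extra_seconds = (full_mb - partial_mb) / rate_mb_per_s
print(extra_seconds / 60)  # ~3.7 minutes saved at this rate
```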
> It's nowhere near the
> several gigabytes of history that Nathaniel is calling an unreasonable
> size. It should be within the range that mtn pull can deal with.
> Partial pull would just be a workaround for mtn's inefficient pull
> mechanism.
No, it would solve issue (1), too.
> Maybe it's just a matter of optimizing the roster manipulation code. Or
> maybe there's a way to avoid or defer some of the work that the code is
> currently doing during pull. Maybe there's a way to short-circuit the
> expensive roster manipulation and just copy node ids from the server
> (with some simple adjustments) if the local database does not contain
> any revisions connected to the subgraph being pulled?
I'm all for these optimizations. Please go ahead and optimize netsync;
that would be very nice.
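For what it's worth, the short-circuit idea quoted above could look
roughly like this (a hypothetical sketch with invented names, not
monotone's actual netsync code):

```python
# Sketch of the proposed fast path: if no local revision is connected to
# the subgraph being pulled, copy the server's rosters/node ids verbatim
# instead of rebuilding them through the expensive roster manipulation.

def pull_rosters(server_revs, local_revs, ancestors):
    """server_revs: dict rev id -> roster data as sent by the server
    local_revs:  set of rev ids already in the local database
    ancestors:   dict rev id -> set of ancestor rev ids (server's view)
    """
    merged = {}
    for rev, roster in server_revs.items():
        if local_revs.isdisjoint(ancestors.get(rev, set())):
            # Fast path: nothing local touches this revision's ancestry,
            # so the server's node ids can be taken over as-is.
            merged[rev] = roster
        else:
            # Slow path: stands in for the real, CPU-intensive roster
            # reconstruction that mtn pull currently always performs.
            merged[rev] = rebuild_roster(rev, roster, local_revs)
    return merged

def rebuild_roster(rev, roster, local_revs):
    # Placeholder for monotone's actual roster rebuild logic.
    return roster
```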
[ Please note that all of this has nothing to do with the debate about
a single-horizon vs. gaps implementation of partial pull. ]
Regards
Markus