Re: [Monotone-devel] resolving name conflicts; file suturing vs drop

monotone-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Monotone-devel] resolving name conflicts; file suturing vs drop

From:	William Uther
Subject:	Re: [Monotone-devel] resolving name conflicts; file suturing vs drop
Date:	Wed, 7 May 2008 10:37:16 +1000


On 06/05/2008, at 5:58 PM, Markus Schiltknecht wrote:

Hi,

William Uther wrote:
I can't see an easy way to implement this without a graveyard. Ifyou'regoing to implement a graveyard, then I'd get rid of DieDieDie mergefirst.
Hm.. I don't see what file resurrection has to do with suturing. Ofcourse, resurrection would help the user to revert from erroneouslysutured files. But that's the point of it.

Superficially, suturing and resurrection have nothing to do with eachother. However, if you implement suturing using drop, and you stillhave DieDieDie merge, then suturing is a VERY dangerous operation.Suturing (as I currently understand it - based on drop) would be sodangerous with DieDieDie merge that I would oppose implementing it.

You could then implement the 'drop one side' approximation to asuture, and
know that DieDieDie merge wont kill you.
To be symmetric, suturing will have to drop both source files andcreate a new destination node. Only that way you can resurrect anyof the two files later on, for example.
I'm thinking of suturing as an atomic "delete two, add one" operation.


I can see two options for suturing here:

i) Keep it symmetric as a "delete two, add one" operation. In thiscase you need to implement some form of "merge through suture"ability. e.g. Imagine the following:


       o
      / \
     a   b
    / \ / \
   c   d   e
    \  |  /
     \ | /
       f

o is our original revision.

In a and b, two people independently introduced new files with thesame name and purpose.

In revision d, the files introduced in a and b were sutured together.

In c and e, the files introduced in a and b were each independentlyedited.

In f we're trying to merge everything together again.

Note that merging c and d, or d and e would require merging "throughthe drop".

ii) Pick one side and drop that side. This still requires a "mergethrough suture" function, but that function can be more like 'pluck'in that you just move the patch from the dead node-id and re-apply tothe live node-id. Eventually all instances of the dead node-id woulddisappear.

Option ii) is much messier, but much easier to implement. Hrm. Maybenot. Maybe you could do both quick and dirty...

Ahhh. There may be a third option here. Use a disjoint sets (akaunion-find) algorithm on node-ids.

http://en.wikipedia.org/wiki/Disjoint-set_data_structure

In this option, each node-id would have a pointer to a 'parent' node-id. If the parent is null then you use the node-id itself. Otherwiseyou keep following parent ids until you get to the final id of anode. You then merge as normal... (have to think about how to find a'common ancestor' for three-way merge).

This third option would avoid the drops entirely. It has the problemthat I don't know how to reverse it. i.e. if you merge two node-idsthen you could never tease them apart again. Hrm. You could justintroduce a new node-id with the current contents, but you'd have lostsome of the details in the history.


Well, it is more food for thought anyway.

Once you have a graveyard, appending information to dead nodes,such as"this node was merged into this other node" would make futuremerges easier.
Hm.. maybe you need to outline your graveyard concept a littlebetter. Last I've heard about file resurrection, we should simplyadd a boolean flag for alive or dead. That hardly carries any extrainformation, but could be merged the same as other attributes.

At the moment dropped node-ids are gone. Introducing a graveyardmeans keeping all node-ids around. The standard thought forresurrection is to keep them around with an 'isLive' boolean attachedto them that can be mark-merged. But once you're keeping around allthe node-ids, it wouldn't be hard to keep around more information.That extra information could be the "replacement" node-id for node idsthat were dropped as part of a suture. The extra information could bethe 'parent' node-id from a disjoint sets data structure.

I don't know how to merge "replacement" node-ids. Merging of the'parent' node-id for a disjoint sets data structure is easy - it isthe union operation in "union-find".


Cheers,

Will       :-}

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [Monotone-devel] resolving name conflicts; code style, (continued)

Prev by Date: Re: [Monotone-devel] resolving name conflicts; file suturing vs drop
Next by Date: Re: [Monotone-devel] resolving name conflicts; file suturing vs drop
Previous by thread: Re: [Monotone-devel] resolving name conflicts; file suturing vs drop
Next by thread: Re: [Monotone-devel] resolving name conflicts; file suturing vs drop
Index(es):
- Date
- Thread