Re: [rdiff-backup-users] Problem with Detection of Multiple rdiff-backup

From:	Steven Willoughby
Subject:	Re: [rdiff-backup-users] Problem with Detection of Multiple rdiff-backup instances
Date:	Thu, 24 Sep 2009 19:16:48 -0600
User-agent:	Thunderbird 2.0.0.23 (X11/20090817)

Dean Cording wrote:

I've come across an issue with the way that rdiff-backup ensures that only one server is accessing a backup dataset.

...

Recently I had a backup fail, probably because of a network outage. All subsequent backups refuse to run because rdiff-backup believes the failed rdiff-backup instance is still running - even though this is clearly impossible because it is a totally different instance of the virtual server.
This had me stumped for a while but I finally figured out what is happening.
Because I start a new virtual server instance each time and I run the backup from a script, everything happens in a consistent order. As a result the instance of rdiff-backup running on the server for each backup session almost always has the same PID. So when a backup fails, the subsequent backup looks at the metadata, finds the PID of the failed backup and sees that that PID is still running - not realising that the other instance is actually itself.

A cursory look at regress.py seems to confirm this behavior. Specifically, in check_pids() it says:


    if pid is not None and pid_running(pid):

This could say:

    if pid is not None and pid != os.getpid() and pid_running(pid):
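
A minimal, self-contained sketch of that check, assuming a POSIX system. The pid_running() below is a stand-in written for this example, not rdiff-backup's own function, and other_instance_running() is a hypothetical helper name. PIDs are compared by value (`!=`), since identity comparison between integers is not reliable in Python.

```python
import os


def pid_running(pid):
    """Stand-in for illustration: True if a process with this PID exists (POSIX)."""
    try:
        os.kill(pid, 0)  # signal 0 only checks for existence, nothing is delivered
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # the process exists but belongs to another user
    return True


def other_instance_running(pid):
    """Hypothetical helper: True only if pid is a live process other than this one."""
    return pid is not None and pid != os.getpid() and pid_running(pid)
```

With this, a recorded PID that happens to equal the current process's own PID no longer counts as a competing instance.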

I'm not sure of a way of working around this problem as the virtual machine is always started from a known state and hasn't been running long enough to build up any entropy to generate unique random numbers between different sessions.

The current time adds a little randomness. A silly workaround would be to call the following Perl script before running rdiff-backup:


#!/usr/bin/perl
`/bin/true` for 0..int(rand(100));

This will increase the pid and should stop your job from failing continuously.
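
For anyone who would rather not depend on Perl, roughly the same trick can be sketched in Python. This assumes the kernel hands out PIDs sequentially, so each short-lived child moves the next PID along; burn_pids() and its max_count parameter are names made up for this example.

```python
import random
import subprocess
import sys


def burn_pids(max_count=100):
    """Spawn between 1 and max_count short-lived child processes.

    Returns the number of children spawned. Each child exits immediately;
    its only purpose is to consume one PID.
    """
    count = random.randint(1, max_count)
    for _ in range(count):
        # A do-nothing interpreter stands in for /bin/true so the sketch
        # does not depend on that path existing.
        subprocess.run([sys.executable, "-c", "pass"], check=True)
    return count


if __name__ == "__main__":
    burn_pids()
```

Run it just before rdiff-backup in the backup script. It is the same kind of workaround as the Perl version and does not fix the underlying check.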


Steven
