rdiff-backup-bugs
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Rdiff-backup-bugs] [bug #27296] RFE: hardlink sharing between multiple


From: Joshua Jensen
Subject: [Rdiff-backup-bugs] [bug #27296] RFE: hardlink sharing between multiple archive backups
Date: Fri, 21 Aug 2009 16:05:33 +0000
User-agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.2) Gecko/20090803 Fedora/3.5.2-2.fc11 Firefox/3.5.2

URL:
  <http://savannah.nongnu.org/bugs/?27296>

                 Summary: RFE:  hardlink sharing between multiple archive
backups
                 Project: rdiff-backup
            Submitted by: joshmule
            Submitted on: Fri 21 Aug 2009 04:05:32 PM GMT
                Category: None
                Severity: 3 - Normal
              Item Group: None
                  Status: None
                 Privacy: Public
             Assigned to: None
             Open/Closed: Open
         Discussion Lock: Any

    _______________________________________________________

Details:

I ask because I have 2000 (yes, two thousand) machines to back up.  At least
90% of data on each machine is the same... basic Linux filesystems, mostly /
and /var and /usr partitions.

I wonder if it would be worthwhile to add a feature to rdiff-backup that
would look to hardlink files that different rdiff-backup archives have in
common?

Right now, the hard link savings is only per machine's, say, /usr
partition.  Sure, the incremental backup from the same machine
benefits from this; only any differences are recorded.

But right next to that archive directory on the backup server I have a
thousand other archvies with a /usr archive that is mostly identical.  Why not
have the option to make rdiff-backup
be able to use some sort of higher level "all these archives can share
hardlinks" capability?  The space saving would be staggering for multihost
backups.

I was thinking that rdiff-backup could just keep an additional
hardlink-only list of files/checksums that any individual run could
reference, thus giving it a perspective outside of that individual backup. 
When it found a file that was already on that global/group list, it need not
transfer it, only hardlink to it.  This could make storing similar files MUCH
more space and network efficient.

Thoughts?




    _______________________________________________________

Reply to this item at:

  <http://savannah.nongnu.org/bugs/?27296>

_______________________________________________
  Message sent via/by Savannah
  http://savannah.nongnu.org/





reply via email to

[Prev in Thread] Current Thread [Next in Thread]