[Qemu-devel] Re: [PATCH v5 4/5] Inter-VM shared memory PCI device


From: Anthony Liguori
Subject: [Qemu-devel] Re: [PATCH v5 4/5] Inter-VM shared memory PCI device
Date: Tue, 11 May 2010 10:51:02 -0500
User-agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.5) Gecko/20091209 Fedora/3.0-4.fc12 Lightning/1.0pre Thunderbird/3.0

On 05/11/2010 09:53 AM, Avi Kivity wrote:
On 05/11/2010 05:17 PM, Cam Macdonell wrote:

The master is the shared memory area. It's a completely separate entity that is represented by the backing file (or by the shared memory server handing out the fd to mmap). It can exist independently of any guest.
I think the master/peer idea would be necessary if we were sharing
guest memory (sharing guest A's memory with guest B).  Then if the
master (guest A) dies, perhaps something needs to happen to preserve
the memory contents.

Definitely.  But we aren't...

Then transparent live migration is impossible. IMHO, that's a fundamental mistake that we will regret down the road.

But since we're sharing host memory, the applications in the guests can race to determine the master by grabbing a lock at offset 0 or by using the lowest VM ID.

Looking at it another way, it is the applications using shared memory that may or may not need a master; the Qemu processes don't need the concept of a master since the memory belongs to the host.

Exactly. Furthermore, even in a master/slave relationship there will be different masters for different sub-areas; it would be a pity to expose all this in the hardware abstraction. This way we have an external device, and PCI HBAs which connect to it - just like a multi-tailed SCSI disk.
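
As an illustration of the application-level race described above, a guest program could claim the master role with an atomic compare-and-swap on the word at offset 0 of the mapped shared memory region. This is only a sketch under assumptions: the sysfs resource path, the BAR number and the region size are made up and depend on the actual guest configuration.

/* Sketch only: race for the master role by flipping the lock word at
 * offset 0 of the shared memory region from 0 to 1.  The resource path
 * and size below are hypothetical. */
#include <fcntl.h>
#include <stdint.h>
#include <stdio.h>
#include <sys/mman.h>
#include <unistd.h>

#define SHM_SIZE (1024 * 1024)

int main(void)
{
    int fd = open("/sys/bus/pci/devices/0000:00:04.0/resource2", O_RDWR);
    if (fd < 0) {
        perror("open");
        return 1;
    }

    volatile uint32_t *shm = mmap(NULL, SHM_SIZE, PROT_READ | PROT_WRITE,
                                  MAP_SHARED, fd, 0);
    if (shm == MAP_FAILED) {
        perror("mmap");
        return 1;
    }

    /* Whoever swaps the word at offset 0 from 0 to 1 first becomes master. */
    if (__sync_bool_compare_and_swap(&shm[0], 0, 1)) {
        printf("this guest won the race and acts as master\n");
    } else {
        printf("another application already claimed the master role\n");
    }

    munmap((void *)shm, SHM_SIZE);
    close(fd);
    return 0;
}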

To support transparent live migration, it's necessary to do two things:

1) Preserve the memory contents of the PCI BAR after disconnecting from a shared memory segment.

2) Synchronize any changes made to the PCI BAR with the shared memory segment upon reconnect/initial connection.

N.B.: savevm and loadvm constitute disconnect and reconnect events, respectively.

Supporting (1) is easy since we just need to memcpy() the contents of the shared memory segment to a temporary RAM area upon disconnect.
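
A minimal sketch of (1), written as plain C rather than against QEMU's actual memory API; detach_shared_memory, shm_ptr and bar_size are names invented here to stand for the mapping and size the device model already tracks:

#include <stdlib.h>
#include <string.h>
#include <sys/mman.h>

/* On disconnect (or before savevm), snapshot the segment so the guest
 * continues to see stable contents behind the BAR.  Re-pointing the BAR
 * at the copy is left out of the sketch. */
static void *detach_shared_memory(void *shm_ptr, size_t bar_size)
{
    void *stable_copy = malloc(bar_size);
    if (!stable_copy) {
        return NULL;
    }
    memcpy(stable_copy, shm_ptr, bar_size);   /* preserve the contents */
    munmap(shm_ptr, bar_size);                /* drop the host segment */
    return stable_copy;                       /* back the BAR with this */
}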

Supporting (2) is easy when the shared memory segment is viewed as owned by the guest, since the guest then has the definitive copy of the data. IMHO, this is what role=master means. However, if we want to support a model where the guest does not have the definitive copy of the data, then upon reconnect we need to throw away the guest's changes and make the shared memory segment appear to the guest to update all at once. This is what role=peer means.
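
And a matching sketch of (2) for the reconnect side, again with invented names; the role_master flag mirrors the role=master/role=peer distinction described above:

#include <stdbool.h>
#include <stdlib.h>
#include <string.h>
#include <sys/mman.h>

/* Re-attach the BAR to a (possibly new) shared memory fd. */
static void *reattach_shared_memory(void *stable_copy, size_t bar_size,
                                    int shm_fd, bool role_master)
{
    void *shm_ptr = mmap(NULL, bar_size, PROT_READ | PROT_WRITE,
                         MAP_SHARED, shm_fd, 0);
    if (shm_ptr == MAP_FAILED) {
        return NULL;
    }
    if (role_master) {
        /* The guest owns the definitive copy: push its changes back
         * into the segment. */
        memcpy(shm_ptr, stable_copy, bar_size);
    }
    /* For role=peer we copy nothing: the guest's changes are thrown away
     * and the segment's current contents become visible all at once. */
    free(stable_copy);
    return shm_ptr;   /* back the BAR with the live segment again */
}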

For role=peer, it's necessary to signal to the guest when it's not connected. This means prior to savevm it's necessary to indicate to the guest that it's been disconnected.
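
One way to picture the guest-visible side of that signal is a status bit the device model clears before savevm and sets again after re-attaching; the register layout and bit below are purely hypothetical, not something the patch defines:

#include <stdbool.h>
#include <stdint.h>

#define IVSHMEM_STATUS_CONNECTED (1u << 0)   /* made-up status bit */

/* Guest code would check this before trusting the shared memory BAR. */
static bool shared_memory_connected(volatile uint32_t *status_reg)
{
    return (*status_reg & IVSHMEM_STATUS_CONNECTED) != 0;
}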

I think it's important that we build this mechanism in from the start because, as I've stated in the past, I don't think role=peer is going to be the dominant use-case. I actually don't think that shared memory between guests is all that interesting compared to shared memory with an external process on the host.

Regards,

Anthony Liguori



