Re: Issue with QEMU Live Migration

qemu-block

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Issue with QEMU Live Migration

From:	Arisetty, Chakri
Subject:	Re: Issue with QEMU Live Migration
Date:	Mon, 26 Aug 2024 19:05:23 +0000

> * How is 'issue block-job-cancel' command issued exactly at the
> PRE_SWITCHOVER stage? virsh blockjob --abort?

We are currently not using libvirt/virsh to issue QMP command. Our software 
makes up QMP connection and sends the QMP command to cancel the job when RAM 
migration state is in pre-switchover.

qemu_blockdev_cancel($user, "node-ssci-disk-0");

sub qemu_blockdev_cancel {
    my $user   = shift;
    my $device = shift;

    return qemu_control(
        $user,
        {
            "execute"   => "block-job-cancel",
            "arguments" => {
                "device" => $device,
            }
        }
    );
}

>* Recently a postcopy issue, wherein the migrated guest on the
> destination machine hangs sometimes with migrate-postcopy but not with
> virsh ---postcopy-after-precopy. It seems virsh(1) handles the switch
> better. Wondering if it's similar with 'block-job-cancel'.

Thank you for the pointer, we are currently using pre-switchover. II will look 
more into libvirt as how it is implemented.

Thanks
Chakri

On 8/26/24, 5:05 AM, "Prasad Pandit" <ppandit@redhat.com 
<mailto:ppandit@redhat.com>> wrote:

!-------------------------------------------------------------------|
This Message Is From an External Sender
This message came from outside your organization.
|-------------------------------------------------------------------!

On Sun, 25 Aug 2024 at 22:40, Arisetty, Chakri <carisett@akamai.com 
<mailto:carisett@akamai.com>> wrote:
> > - start the mirror job
> > - qmp_migrate
> > - once PRE_SWITCHOVER is reached, issue block-job-cancel
> > - qmp_migrate_continue
>
> We use exact same steps to do live migration. I repeated the test now
>
> Sure, as you suggested to rule out any incorrect usage, I repeated the test 
> with above steps.
> once RAM migration state moved to pre-switchover, issued block-job-cancel. 
> There are no more dirty blocks.
> But all the disk writes from 'pre-switchover' state to 'complete' state are 
> lost.
> Thus, it is creating loss of customer data.
>

* How is 'issue block-job-cancel' command issued exactly at the
PRE_SWITCHOVER stage? virsh blockjob --abort?

* Recently a postcopy issue, wherein the migrated guest on the
destination machine hangs sometimes with migrate-postcopy but not with
virsh ---postcopy-after-precopy. It seems virsh(1) handles the switch
better. Wondering if it's similar with 'block-job-cancel'.

Thank you.
---
- Prasad

[Prev in Thread]

Current Thread

[Next in Thread]

Issue with QEMU Live Migration, Arisetty, Chakri, 2024/08/21
- Re: Issue with QEMU Live Migration, Fabiano Rosas, 2024/08/21
  - Re: Issue with QEMU Live Migration, Arisetty, Chakri, 2024/08/21
    - Re: Issue with QEMU Live Migration, Fabiano Rosas, 2024/08/22
    - Re: Issue with QEMU Live Migration, Arisetty, Chakri, 2024/08/23
    - Re: Issue with QEMU Live Migration, Arisetty, Chakri, 2024/08/23
    - Re: Issue with QEMU Live Migration, Fabiano Rosas, 2024/08/23
    - Re: Issue with QEMU Live Migration, Arisetty, Chakri, 2024/08/25
    - Re: Issue with QEMU Live Migration, Prasad Pandit, 2024/08/26
    - Re: Issue with QEMU Live Migration, Arisetty, Chakri <=

Prev by Date: Re: [PATCH v4 6/7] memory: Do not create circular reference with subregion
Next by Date: Re: [PATCH v4 6/7] memory: Do not create circular reference with subregion
Previous by thread: Re: Issue with QEMU Live Migration
Next by thread: Re: [PATCH v4] scripts/qcow2-to-stdout.py: Add script to write qcow2 images to stdout
Index(es):
- Date
- Thread