[Qemu-devel] Migration auto-converge problem


From: Jason J. Herne
Subject: [Qemu-devel] Migration auto-converge problem
Date: Mon, 02 Mar 2015 16:04:54 -0500
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.4.0

We have a test case that dirties memory very, very quickly. When we run this test case in a guest and attempt a migration, that migration never converges, even with auto-converge enabled.

The auto-converge behavior of Qemu serves a different purpose than I had expected. In my mind, I expected auto-converge to continuously apply adaptive throttling of the cpu utilization of a busy guest whenever Qemu detects that the guest memory transfer is not making progress quickly enough. The idea is that a guest dirtying pages too quickly will be adaptively slowed down by the throttling until migration is able to transfer pages fast enough to complete within the max downtime. Qemu's current auto-converge does not appear to do this in practice.

A quick look at the source code shows the following:
- Autoconverge keeps a counter. This counter is only incremented if, for a completed memory pass, the guest is dirtying pages at a rate of 50% (or more) of our transfer rate.
- The counter only increments at most once per pass through memory.
- The counter must reach 4 before any throttling is done (a minimum of 4 memory passes have to occur).
- Once the counter reaches 4, it is immediately reset to 0, and then throttling action is taken.
- Throttling occurs by doing an async sleep on each guest cpu for 30ms, exactly one time.
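
To make the scale of that throttling concrete, here is a standalone sketch that simulates the counter behavior described above. This is not the actual QEMU source; the pass count and dirty/transfer numbers are assumptions chosen to represent a guest that always dirties more than half of what we manage to transfer per pass.

/* Simulates the described auto-converge trigger: the counter only trips
 * after 4 "too dirty" passes, then resets, and each trip costs the guest
 * only ~30ms of sleep per vcpu. */
#include <stdio.h>
#include <stdint.h>

#define TRIGGER_PASSES 4
#define SLEEP_MS       30

int main(void)
{
    int high_cnt = 0;
    int total_sleep_ms = 0;

    for (int pass = 1; pass <= 20; pass++) {
        uint64_t dirtied = 1000, transferred = 1200;   /* assumed per-pass numbers */

        if (dirtied > transferred / 2) {    /* dirtying at >= 50% of transfer rate */
            high_cnt++;
        }
        if (high_cnt >= TRIGGER_PASSES) {   /* trip on the 4th pass, then reset */
            high_cnt = 0;
            total_sleep_ms += SLEEP_MS;     /* one 30ms async sleep per vcpu */
        }
    }
    printf("guest throttled for only %dms over 20 memory passes\n",
           total_sleep_ms);
    return 0;
}

With 20 passes, each of which can take hundreds of milliseconds or more to transfer, the guest loses a grand total of 150ms to throttling.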

Now consider the scenario auto-converge is meant to solve (I think): a guest touching lots of memory very quickly. Each pass through memory is going to be sending a lot of pages and thus taking a decent amount of time to complete. If, for every four passes, we are *only* sleeping the guest for 30ms, our guest is still going to be able to dirty pages faster than we can transfer them. We will never catch up, because the sleep time relative to guest execution time is very, very small.

Auto-converge, as it is implemented today, does not address the problem I expected it to solve. However, after rapidly prototyping a new version of auto-converge that performs adaptive modeling, I've learned something. The workload I'm attempting to migrate is actually a pathological case. It is an excellent example of why throttling cpu is not always a good method of limiting memory access. In this test case we are able to touch over 600 MB of pages in 50 ms of continuous execution. Here, even if I throttle the guest to 5% (50ms runtime, 950ms sleep), we still cannot come close to catching up, even with a fairly speedy network link (which not every user will have).
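
For a rough sense of why this case is pathological, here is a back-of-envelope sketch. It assumes the dirty rate scales linearly with run time (roughly the best case for throttling); the 600 MB per 50 ms figure is from the test case above, and the throttle levels are just illustrative.

/* Effective dirty rate under cpu throttling, assuming ~600 MB dirtied
 * per 50 ms of execution, i.e. ~12 MB per ms of run time. */
#include <stdio.h>

int main(void)
{
    const double mb_per_run_ms = 600.0 / 50.0;
    const double throttle[] = { 0.0, 0.50, 0.90, 0.95 };

    for (int i = 0; i < 4; i++) {
        double run_ms_per_sec = 1000.0 * (1.0 - throttle[i]);
        printf("throttle %3.0f%% -> up to %6.0f MB/s dirtied\n",
               throttle[i] * 100.0, mb_per_run_ms * run_ms_per_sec);
    }
    return 0;
}

Even at 95% throttle the guest can still dirty on the order of 600 MB per wall-clock second, and migration only converges once the dirty rate drops below what we can actually push over the wire.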

Given the above, I believe that some workloads touch memory so fast that we'll never be able to live migrate them with auto-converge. On the lower end, there are workloads with a very small/stagnant working set size that will be live-migratable without the need for auto-converge. Lastly, we have "the nebulous middle". These are workloads that would benefit from auto-converge because they touch pages too fast for migration to be able to deal with them, AND (important conditional here) throttling will (may?) actually reduce their rate of page modifications. I would like to try to define this "middle" set of workloads.

A question with no obvious answer: how much throttling is acceptable? If I have to throttle a guest 90% and he ends up failing 75% of whatever transactions he is attempting to process, then we have quite likely defeated the entire purpose of "live" migration. Perhaps it would be better in this case to just stop the guest and do a non-live migration. Maybe by reverting to non-live we actually save time, and thus more transactions would have completed. This one may take some experimenting to get a good idea of what makes the most sense. Maybe even have max throttling be user configurable.
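
As a strawman for the configurable idea, here is a hypothetical sketch of an adaptive throttle with a user-settable cap. None of these names exist in QEMU today (next_throttle and max_throttle are invented for illustration); it just shows the shape of "raise throttling while the dirty rate outruns the transfer rate, but never beyond the configured maximum".

#include <stdio.h>

/* Raise throttling while the guest dirties faster than we can transfer,
 * back off once we are keeping up, never exceeding the configured cap. */
static double next_throttle(double cur_pct, double dirty_rate_mb_s,
                            double xfer_rate_mb_s, double max_pct)
{
    cur_pct += (dirty_rate_mb_s > xfer_rate_mb_s) ? 10.0 : -10.0;
    if (cur_pct > max_pct) {
        cur_pct = max_pct;
    }
    if (cur_pct < 0.0) {
        cur_pct = 0.0;
    }
    return cur_pct;
}

int main(void)
{
    double throttle = 0.0;
    const double max_throttle = 90.0;   /* assumed user-configured cap */

    /* Illustrative numbers: unthrottled dirty rate 600 MB/s, link
     * sustains 400 MB/s of migration traffic. */
    for (int pass = 1; pass <= 10; pass++) {
        double dirty_rate = 600.0 * (1.0 - throttle / 100.0);
        throttle = next_throttle(throttle, dirty_rate, 400.0, max_throttle);
        printf("pass %2d: throttle %3.0f%%\n", pass, throttle);
    }
    return 0;
}

With those numbers the throttle settles around 30-40% and never needs the cap; with a faster dirtier or a slower link it would climb until it hits max_throttle, which is exactly the point where the "just stop the guest" question above becomes relevant.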

With all this said, I still wonder exactly how big this "nebulous middle" really is. If, in practice, that "middle" only accounts for 1% of the workloads out there, then is it really worth spending time fixing it? Keep in mind this is a two-pronged test:
1. The guest cannot migrate because it changes memory too fast.
2. CPU throttling slows the guest's memory writes down enough that he can now migrate.

I'm interested in any thoughts anyone has. Thanks!

--
-- Jason J. Herne (address@hidden)



