
Re: [RFC PATCH 0/1] QEMU: Dirty quota-based throttling of vcpus


From: Shivam Kumar
Subject: Re: [RFC PATCH 0/1] QEMU: Dirty quota-based throttling of vcpus
Date: Mon, 19 Dec 2022 00:42:27 +0530
User-agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:102.0) Gecko/20100101 Thunderbird/102.6.0



On 06/12/22 10:59 pm, Hyman Huang wrote:


On 2022/12/7 0:00, Peter Xu wrote:
Hi, Shivam,

On Tue, Dec 06, 2022 at 11:18:52AM +0530, Shivam Kumar wrote:

[...]

Note
----

We understand that there is good scope for improvement in the current
implementation. Here is a list of things we are working on:
1) Adding dirty quota as a migration capability so that it can be toggled
through a QMP command.
2) Adding support for throttling guest DMAs.
3) Not enabling dirty quota for the first migration iteration.

Agreed.

4) Falling back to the current auto-converge-based throttling in cases where
dirty quota throttling could overthrottle.

If overthrottling happens, would auto-converge always be better?


Please stay tuned for the next patchset.

Shivam Kumar (1):
    Dirty quota-based throttling of vcpus

   accel/kvm/kvm-all.c       | 91 +++++++++++++++++++++++++++++++++++++++
   include/exec/memory.h     |  3 ++
   include/hw/core/cpu.h     |  5 +++
   include/sysemu/kvm_int.h  |  1 +
   linux-headers/linux/kvm.h |  9 ++++
   migration/migration.c     | 22 ++++++++++
   migration/migration.h     | 31 +++++++++++++
   softmmu/memory.c          | 64 +++++++++++++++++++++++++++
   8 files changed, 226 insertions(+)


It'd be great if I could get some more feedback before I send v2. Thanks.

Sorry to respond late.

What's the status of the kernel patchset?

From a high level the approach looks good, at least to me.  It's just that
(as I mentioned before) we now have two similar approaches to throttling
the guest for precopy.  I'm not sure what the best way to move forward is
without doing a comparison of the two.

https://lore.kernel.org/all/cover.1669047366.git.huangy81@chinatelecom.cn/
Sorry to say so, and no intention to create contention, but merging the
two without further thought will definitely confuse everybody.  We need to
figure out a way forward.

From what I can tell...

One way is we choose whichever of them is superior and all of us stick
with it (for a higher chance of the migration converging, less
interference with the workloads, and so on).

The other way is we take both, since each of them may be suitable for
different scenarios.  In that latter case, however, we'd better at least be
aware of the differences (which suits what); that will then be part of the
documentation we need for each of the features when a user wants to choose
between them.

Add Yong into the loop.

Any thoughts?

This is quite different from the "dirtylimit capability of migration". IMHO, the quota-based implementation seems a little more complicated, because it depends on the correctness of the dirty quota and the measured data, and it involves patchsets in both QEMU and the kernel. It seems that dirtylimit and quota-based throttling are not mutually exclusive; at the least, we can
first figure out which suits what based on test results, as Peter said.

Thank you for sharing the link to this alternate approach to throttling, the "dirtylimit capability of migration". I am sharing the key points from my understanding and some questions below:

1) The alternate approach works exclusively with the dirty ring interface, whereas the dirty quota approach is orthogonal to the dirty logging interface: it works with both the dirty ring and the dirty bitmap interfaces (see the sketch after this list).

2) Can we achieve micro-stunning with the alternate approach? Can we say with good confidence that, most of the time, we stun the vcpu only when it is dirtying memory? Last time I checked, the dirty ring size could only be a multiple of 512, which makes it difficult to stun the vcpu at microscopic intervals (see the illustrative arithmetic after this list).

3) Also, are we relying on the system administrator to select a limit on the dirty rate for "dirtylimit capability of migration"?

4) Also, does "dirtylimit capability of migration" tune the dirty ring size, using a larger ring for higher dirty rate limits and a smaller ring for lower ones? I think the dirty rate limit is good information for choosing a good-enough dirty ring size (a purely illustrative sketch follows at the end of this list).
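
On point 1, here is a minimal sketch of why dirty quota is logging-interface agnostic: the throttle hooks into the KVM_RUN exit path rather than into the dirty tracking mechanism itself. The exit reason and kvm_run fields follow the UAPI proposed in the kernel series; DIRTY_QUOTA_STUN_US and compute_next_quota() are hypothetical placeholders, so this is a sketch of the idea, not the posted patch:

    #include <stdint.h>
    #include <unistd.h>
    #include <linux/kvm.h>  /* assumes the proposed dirty quota UAPI */

    #define DIRTY_QUOTA_STUN_US 1000  /* illustrative stun length */

    /* Illustrative refill: pages the vcpu may dirty before its next
     * quota exit.  A real implementation would derive this from the
     * migration's target dirty rate. */
    static uint64_t compute_next_quota(void)
    {
        return 256;
    }

    /* Hypothetical helper for the exit-reason switch in kvm_cpu_exec()
     * (accel/kvm/kvm-all.c).  Returning 0 re-enters the guest. */
    static int handle_dirty_quota_exit(struct kvm_run *run)
    {
        /* Stun the vcpu right where it exhausted its quota... */
        usleep(DIRTY_QUOTA_STUN_US);
        /* ...then top up the quota and let it run again. */
        run->dirty_quota = run->dirty_quota_exit.count +
                           compute_next_quota();
        return 0;
    }

Nothing in this path touches the bitmap or the ring, which is why the same handler works with either.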
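
On point 2, a purely illustrative calculation of the granularity gap: with 4 KiB pages, a 4096-entry dirty ring lets a vcpu dirty up to 4096 * 4 KiB = 16 MiB before a ring-full exit can stun it, whereas a dirty quota can in principle be set as low as a few pages, bounding each burst between throttle points to a few KiB.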
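
On point 4, a purely illustrative sketch (not part of either patchset) of deriving a ring size from a dirty rate limit, so that a vcpu running at the limit fills the ring roughly once per chosen interval. KVM requires the ring size to be a power of two, hence the rounding:

    #include <stdint.h>

    /* Hypothetical: pick a dirty-ring size (in entries) from a dirty
     * rate limit so that a vcpu at the limit fills the ring roughly
     * once per interval_ms.  Rounds up to a power of two, as KVM
     * requires, and clamps to a sane maximum. */
    static uint32_t ring_size_for_limit(uint64_t limit_pages_per_sec,
                                        uint32_t interval_ms)
    {
        uint64_t entries = limit_pages_per_sec * interval_ms / 1000;
        uint32_t size = 1;

        if (entries > (1u << 30)) {
            entries = 1u << 30;  /* clamp to avoid overflowing size */
        }
        while (size < entries) {
            size <<= 1;
        }
        return size;
    }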


Thanks,
Shivam


