[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Qemu-block] [Qemu-devel] [PULL 7/7] vl: introduce vm_shutdown()
From: |
John Snow |
Subject: |
Re: [Qemu-block] [Qemu-devel] [PULL 7/7] vl: introduce vm_shutdown() |
Date: |
Mon, 12 Mar 2018 15:05:44 -0400 |
User-agent: |
Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.6.0 |
On 03/09/2018 08:19 AM, Stefan Hajnoczi wrote:
> Commit 00d09fdbbae5f7864ce754913efc84c12fdf9f1a ("vl: pause vcpus before
> stopping iothreads") and commit dce8921b2baaf95974af8176406881872067adfa
> ("iothread: Stop threads before main() quits") tried to work around the
> fact that emulation was still active during termination by stopping
> iothreads. They suffer from race conditions:
> 1. virtio_scsi_handle_cmd_vq() racing with iothread_stop_all() hits the
> virtio_scsi_ctx_check() assertion failure because the BDS AioContext
> has been modified by iothread_stop_all().
> 2. Guest vq kick racing with main loop termination leaves a readable
> ioeventfd that is handled by the next aio_poll() when external
> clients are enabled again, resulting in unwanted emulation activity.
>
> This patch obsoletes those commits by fully disabling emulation activity
> when vcpus are stopped.
>
> Use the new vm_shutdown() function instead of pause_all_vcpus() so that
> vm change state handlers are invoked too. Virtio devices will now stop
> their ioeventfds, preventing further emulation activity after vm_stop().
>
> Note that vm_stop(RUN_STATE_SHUTDOWN) cannot be used because it emits a
> QMP STOP event that may affect existing clients.
>
> It is no longer necessary to call replay_disable_events() directly since
> vm_shutdown() does so already.
>
> Drop iothread_stop_all() since it is no longer used.
>
> Cc: Fam Zheng <address@hidden>
> Cc: Kevin Wolf <address@hidden>
> Signed-off-by: Stefan Hajnoczi <address@hidden>
> Reviewed-by: Fam Zheng <address@hidden>
> Acked-by: Paolo Bonzini <address@hidden>
> Message-id: address@hidden
> Signed-off-by: Stefan Hajnoczi <address@hidden>
> ---
> include/sysemu/iothread.h | 1 -
> include/sysemu/sysemu.h | 1 +
> cpus.c | 16 +++++++++++++---
> iothread.c | 31 -------------------------------
> vl.c | 13 +++----------
> 5 files changed, 17 insertions(+), 45 deletions(-)
>
> diff --git a/include/sysemu/iothread.h b/include/sysemu/iothread.h
> index 799614ffd2..8a7ac2c528 100644
> --- a/include/sysemu/iothread.h
> +++ b/include/sysemu/iothread.h
> @@ -45,7 +45,6 @@ typedef struct {
> char *iothread_get_id(IOThread *iothread);
> IOThread *iothread_by_id(const char *id);
> AioContext *iothread_get_aio_context(IOThread *iothread);
> -void iothread_stop_all(void);
> GMainContext *iothread_get_g_main_context(IOThread *iothread);
>
> /*
> diff --git a/include/sysemu/sysemu.h b/include/sysemu/sysemu.h
> index d24ad09f37..356bfdc1c1 100644
> --- a/include/sysemu/sysemu.h
> +++ b/include/sysemu/sysemu.h
> @@ -56,6 +56,7 @@ void vm_start(void);
> int vm_prepare_start(void);
> int vm_stop(RunState state);
> int vm_stop_force_state(RunState state);
> +int vm_shutdown(void);
>
> typedef enum WakeupReason {
> /* Always keep QEMU_WAKEUP_REASON_NONE = 0 */
> diff --git a/cpus.c b/cpus.c
> index 9bcff7d63c..d8fe90eafe 100644
> --- a/cpus.c
> +++ b/cpus.c
> @@ -993,7 +993,7 @@ void cpu_synchronize_all_pre_loadvm(void)
> }
> }
>
> -static int do_vm_stop(RunState state)
> +static int do_vm_stop(RunState state, bool send_stop)
> {
> int ret = 0;
>
> @@ -1002,7 +1002,9 @@ static int do_vm_stop(RunState state)
> pause_all_vcpus();
> runstate_set(state);
> vm_state_notify(0, state);
> - qapi_event_send_stop(&error_abort);
> + if (send_stop) {
> + qapi_event_send_stop(&error_abort);
> + }
> }
>
> bdrv_drain_all();
> @@ -1012,6 +1014,14 @@ static int do_vm_stop(RunState state)
> return ret;
> }
>
> +/* Special vm_stop() variant for terminating the process. Historically
> clients
> + * did not expect a QMP STOP event and so we need to retain compatibility.
> + */
> +int vm_shutdown(void)
> +{
> + return do_vm_stop(RUN_STATE_SHUTDOWN, false);
> +}
> +
> static bool cpu_can_run(CPUState *cpu)
> {
> if (cpu->stop) {
> @@ -1994,7 +2004,7 @@ int vm_stop(RunState state)
> return 0;
> }
>
> - return do_vm_stop(state);
> + return do_vm_stop(state, true);
> }
>
> /**
> diff --git a/iothread.c b/iothread.c
> index 2ec5a3bffe..1b3463cb00 100644
> --- a/iothread.c
> +++ b/iothread.c
> @@ -101,18 +101,6 @@ void iothread_stop(IOThread *iothread)
> qemu_thread_join(&iothread->thread);
> }
>
> -static int iothread_stop_iter(Object *object, void *opaque)
> -{
> - IOThread *iothread;
> -
> - iothread = (IOThread *)object_dynamic_cast(object, TYPE_IOTHREAD);
> - if (!iothread) {
> - return 0;
> - }
> - iothread_stop(iothread);
> - return 0;
> -}
> -
> static void iothread_instance_init(Object *obj)
> {
> IOThread *iothread = IOTHREAD(obj);
> @@ -333,25 +321,6 @@ IOThreadInfoList *qmp_query_iothreads(Error **errp)
> return head;
> }
>
> -void iothread_stop_all(void)
> -{
> - Object *container = object_get_objects_root();
> - BlockDriverState *bs;
> - BdrvNextIterator it;
> -
> - for (bs = bdrv_first(&it); bs; bs = bdrv_next(&it)) {
> - AioContext *ctx = bdrv_get_aio_context(bs);
> - if (ctx == qemu_get_aio_context()) {
> - continue;
> - }
> - aio_context_acquire(ctx);
> - bdrv_set_aio_context(bs, qemu_get_aio_context());
> - aio_context_release(ctx);
> - }
> -
> - object_child_foreach(container, iothread_stop_iter, NULL);
> -}
> -
> static gpointer iothread_g_main_context_init(gpointer opaque)
> {
> AioContext *ctx;
> diff --git a/vl.c b/vl.c
> index dae986b352..3ef04ce991 100644
> --- a/vl.c
> +++ b/vl.c
> @@ -4722,17 +4722,10 @@ int main(int argc, char **argv, char **envp)
> os_setup_post();
>
> main_loop();
> - replay_disable_events();
>
> - /* The ordering of the following is delicate. Stop vcpus to prevent new
> - * I/O requests being queued by the guest. Then stop IOThreads (this
> - * includes a drain operation and completes all request processing). At
> - * this point emulated devices are still associated with their IOThreads
> - * (if any) but no longer have any work to do. Only then can we close
> - * block devices safely because we know there is no more I/O coming.
> - */
> - pause_all_vcpus();
> - iothread_stop_all();
> + /* No more vcpu or device emulation activity beyond this point */
> + vm_shutdown();
> +
> bdrv_close_all();
>
> res_free();
>
This appears to cause a regression in qemu-iotest 185:
--- 185.out.bad 2018-03-12 14:54:25.692884537 -0400
+++ ../../../../tests/qemu-iotests/185.out 2017-12-21 16:15:50.879455552
-0500
@@ -20,7 +20,7 @@
{"return": {}}
{"return": {}}
{"timestamp": {"seconds": TIMESTAMP, "microseconds": TIMESTAMP},
"event": "SHUTDOWN", "data": {"guest": false}}
-{"timestamp": {"seconds": TIMESTAMP, "microseconds": TIMESTAMP},
"event": "BLOCK_JOB_CANCELLED", "data": {"device": "disk", "len":
67108864, "offset": 1048576, "speed": 65536, "type": "commit"}}
+{"timestamp": {"seconds": TIMESTAMP, "microseconds": TIMESTAMP},
"event": "BLOCK_JOB_CANCELLED", "data": {"device": "disk", "len":
67108864, "offset": 524288, "speed": 65536, "type": "commit"}}
=== Start active commit job and exit qemu ===
@@ -28,8 +28,7 @@
{"return": {}}
{"return": {}}
{"timestamp": {"seconds": TIMESTAMP, "microseconds": TIMESTAMP},
"event": "SHUTDOWN", "data": {"guest": false}}
-{"timestamp": {"seconds": TIMESTAMP, "microseconds": TIMESTAMP},
"event": "BLOCK_JOB_READY", "data": {"device": "disk", "len": 4194304,
"offset": 4194304, "speed": 65536, "type": "commit"}}
-{"timestamp": {"seconds": TIMESTAMP, "microseconds": TIMESTAMP},
"event": "BLOCK_JOB_COMPLETED", "data": {"device": "disk", "len":
4194304, "offset": 4194304, "speed": 65536, "type": "commit"}}
+{"timestamp": {"seconds": TIMESTAMP, "microseconds": TIMESTAMP},
"event": "BLOCK_JOB_CANCELLED", "data": {"device": "disk", "len":
4194304, "offset": 4194304, "speed": 65536, "type": "commit"}}
=== Start mirror job and exit qemu ===
@@ -38,8 +37,7 @@
{"return": {}}
{"return": {}}
{"timestamp": {"seconds": TIMESTAMP, "microseconds": TIMESTAMP},
"event": "SHUTDOWN", "data": {"guest": false}}
-{"timestamp": {"seconds": TIMESTAMP, "microseconds": TIMESTAMP},
"event": "BLOCK_JOB_READY", "data": {"device": "disk", "len": 4194304,
"offset": 4194304, "speed": 65536, "type": "mirror"}}
-{"timestamp": {"seconds": TIMESTAMP, "microseconds": TIMESTAMP},
"event": "BLOCK_JOB_COMPLETED", "data": {"device": "disk", "len":
4194304, "offset": 4194304, "speed": 65536, "type": "mirror"}}
+{"timestamp": {"seconds": TIMESTAMP, "microseconds": TIMESTAMP},
"event": "BLOCK_JOB_CANCELLED", "data": {"device": "disk", "len":
4194304, "offset": 4194304, "speed": 65536, "type": "mirror"}}
=== Start backup job and exit qemu ===
@@ -48,7 +46,7 @@
{"return": {}}
{"return": {}}
{"timestamp": {"seconds": TIMESTAMP, "microseconds": TIMESTAMP},
"event": "SHUTDOWN", "data": {"guest": false}}
-{"timestamp": {"seconds": TIMESTAMP, "microseconds": TIMESTAMP},
"event": "BLOCK_JOB_CANCELLED", "data": {"device": "disk", "len":
67108864, "offset": 131072, "speed": 65536, "type": "backup"}}
+{"timestamp": {"seconds": TIMESTAMP, "microseconds": TIMESTAMP},
"event": "BLOCK_JOB_CANCELLED", "data": {"device": "disk", "len":
67108864, "offset": 65536, "speed": 65536, "type": "backup"}}
=== Start streaming job and exit qemu ===
@@ -56,6 +54,6 @@
{"return": {}}
{"return": {}}
{"timestamp": {"seconds": TIMESTAMP, "microseconds": TIMESTAMP},
"event": "SHUTDOWN", "data": {"guest": false}}
-{"timestamp": {"seconds": TIMESTAMP, "microseconds": TIMESTAMP},
"event": "BLOCK_JOB_CANCELLED", "data": {"device": "disk", "len":
67108864, "offset": 1048576, "speed": 65536, "type": "stream"}}
+{"timestamp": {"seconds": TIMESTAMP, "microseconds": TIMESTAMP},
"event": "BLOCK_JOB_CANCELLED", "data": {"device": "disk", "len":
67108864, "offset": 524288, "speed": 65536, "type": "stream"}}
No errors were found on the image.
*** done
- [Qemu-block] [PULL 0/7] Block patches, Stefan Hajnoczi, 2018/03/09
- [Qemu-block] [PULL 1/7] block: Fix qemu crash when using scsi-block, Stefan Hajnoczi, 2018/03/09
- [Qemu-block] [PULL 2/7] README: Fix typo 'git-publish', Stefan Hajnoczi, 2018/03/09
- [Qemu-block] [PULL 4/7] block: add aio_wait_bh_oneshot(), Stefan Hajnoczi, 2018/03/09
- [Qemu-block] [PULL 3/7] virtio-blk: dataplane: Don't batch notifications if EVENT_IDX is present, Stefan Hajnoczi, 2018/03/09
- [Qemu-block] [PULL 6/7] virtio-scsi: fix race between .ioeventfd_stop() and vq handler, Stefan Hajnoczi, 2018/03/09
- [Qemu-block] [PULL 7/7] vl: introduce vm_shutdown(), Stefan Hajnoczi, 2018/03/09
- Re: [Qemu-block] [Qemu-devel] [PULL 7/7] vl: introduce vm_shutdown(),
John Snow <=
- [Qemu-block] [PULL 5/7] virtio-blk: fix race between .ioeventfd_stop() and vq handler, Stefan Hajnoczi, 2018/03/09
- Re: [Qemu-block] [PULL 0/7] Block patches, Peter Maydell, 2018/03/09