Re: [Qemu-devel] crash in vhost-user-bridge on migration
From: Marc-André Lureau
Subject: Re: [Qemu-devel] crash in vhost-user-bridge on migration
Date: Wed, 03 May 2017 10:31:17 +0000
Hi
On Tue, May 2, 2017 at 11:17 PM Dr. David Alan Gilbert <address@hidden>
wrote:
> Hi,
> I've started playing with vhost-user-bridge and have it
> basically up and going, but I've just tried migration and
> I've got a reliable crash for it; I'm not that sure I've
> got it set up right, so suggestions please:
>
> This is with qemu head, on an f26 host running an f25-ish
> guest.
>
> Program received signal SIGSEGV, Segmentation fault.
> 0x000055c414112ce4 in vring_avail_idx (vq=0x55c41582fd68,
> vq=0x55c41582fd68)
> at /home/dgilbert/git/qemu/contrib/libvhost-user/libvhost-user.c:940
> 940 vq->shadow_avail_idx = vq->vring.avail->idx;
> (gdb) p vq
> $1 = (VuVirtq *) 0x55c41582fd68
> (gdb) p vq->vring
> $2 = {num = 0, desc = 0x0, avail = 0x0, used = 0x0, log_guest_addr = 0,
> flags = 0}
> (gdb) p vq->shadow_avail_idx
> $3 = 0
>
> #0 0x000055c414112ce4 in vring_avail_idx (vq=0x55c41582fd68,
> vq=0x55c41582fd68)
> at /home/dgilbert/git/qemu/contrib/libvhost-user/libvhost-user.c:940
> No locals.
> #1 virtqueue_num_heads (idx=0, vq=0x55c41582fd68, dev=0x55c41582fc20)
> at /home/dgilbert/git/qemu/contrib/libvhost-user/libvhost-user.c:960
> num_heads = <optimized out>
> #2 vu_queue_get_avail_bytes (dev=0x55c41582fc20, vq=0x55c41582fd68,
> address@hidden,
> address@hidden,
> address@hidden,
> address@hidden) at
> /home/dgilbert/git/qemu/contrib/libvhost-user/libvhost-user.c:1034
> idx = 0
> total_bufs = 0
> in_total = 0
> out_total = 0
> rc = <optimized out>
> #3 0x000055c414112fbd in vu_queue_avail_bytes (dev=<optimized out>,
> vq=<optimized out>, in_bytes=0, out_bytes=0)
> at /home/dgilbert/git/qemu/contrib/libvhost-user/libvhost-user.c:1116
> in_total = 0
> out_total = 0
> #4 0x000055c4141114da in vubr_backend_recv_cb (sock=<optimized out>,
> ctx=0x55c41582fc20)
> at /home/dgilbert/git/qemu/tests/vhost-user-bridge.c:276
> vubr = 0x55c41582fc20
> dev = 0x55c41582fc20
> vq = 0x55c41582fd68
> elem = 0x0
> mhdr_sg = {{iov_base = 0x0, iov_len = 0} <repeats 740 times>,
> {iov_base = 0x0, iov_len = 140512740079088}, {
> .....}
>
> mhdr = {hdr = {flags = 0 '\000', gso_type = 0 '\000', hdr_len = 0,
> gso_size = 0, csum_start = 0,
> csum_offset = 0}, num_buffers = 0}
> mhdr_cnt = 0
> hdrlen = 0
> i = 0
> hdr = {flags = 0 '\000', gso_type = 0 '\000', hdr_len = 0,
> gso_size = 0, csum_start = 0, csum_offset = 0}
> __PRETTY_FUNCTION__ = "vubr_backend_recv_cb"
> #5 0x000055c414110ad3 in dispatcher_wait (timeout=200000,
> dispr=0x55c4158300b8)
> at /home/dgilbert/git/qemu/tests/vhost-user-bridge.c:154
> e = 0x55c415830180
>
>
This happens because vubr assumes that if a ring is enabled, it should be
able to use/read it.
This is not the case, as qemu may enable a ring before the slave has the
ring addresses.
The protocol has evolved regarding this. There are multiple ways to fix
this; I think the simplest is for vu_queue_avail_bytes() to return false if
the ring is not available. I'll send a patch.
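Something along these lines (a rough sketch, not the final patch; the
wrapper body and signatures are inferred from frames #2/#3 of the
backtrace above):

/* Sketch: treat a ring whose addresses have not been received yet
 * (vq->vring.avail == NULL, as in frame #0) as having no buffers,
 * instead of dereferencing the NULL avail pointer. */
bool
vu_queue_avail_bytes(VuDev *dev, VuVirtq *vq, unsigned int in_bytes,
                     unsigned int out_bytes)
{
    unsigned int in_total, out_total;

    if (!vq->vring.avail) {
        /* ring not ready yet: report nothing available */
        return false;
    }

    vu_queue_get_avail_bytes(dev, vq, &in_total, &out_total,
                             in_bytes, out_bytes);
    return in_bytes <= in_total && out_bytes <= out_total;
}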
> That's from the destination bridge; I'm running both on a single
> host and that's happening when I just do a :
> migrate_set_speed 1G
> migrate tcp:localhost:8888
>
> The destination qemu spits out:
> qemu-system-x86_64: Failed to set msg fds.
> qemu-system-x86_64: vhost VQ 0 ring restore failed: -1: Resource
> temporarily unavailable (11)
> qemu-system-x86_64: Failed to set msg fds.
> qemu-system-x86_64: vhost VQ 1 ring restore failed: -1: Resource
> temporarily unavailable (11)
>
> but I'm not sure if that's before or after the seg of the bridge.
>
That is after, on disconnect.
>
> I've got:
> a) One qemu that just has the -net socket / -net user setup as per the
> docs - but I've got
> two lots of sockets for either side
> b) Two qemus for the guests, the second with just the -incoming
> c) The two vhost-user-bridge instances - the destination being pointed
> at the second set of sockets.
>
> My test is run by doing:
> #!/bin/bash -x
> SESS=vhost
> tmux -L $SESS new-session -d
> tmux -L $SESS set-option -g set-remain-on-exit on
> # Start a router using the system qemu
> tmux -L $SESS new-window -n router qemu-system-x86_64 -M none -nographic
> -net socket,vlan=0,udp=localhost:4444,localaddr=localhost:5555 -net
> socket,vlan=0,udp=localhost:4445,localaddr=localhost:5556 -net user,vlan=0
> # Start source vhost bridge
> tmux -L $SESS new-window -n srcvhostbr ./tests/vhost-user-bridge -u
> /tmp/vubrsrc.sock
> tmux -L $SESS new-window -n source "./x86_64-softmmu/qemu-system-x86_64
> -enable-kvm -m 1G -smp 2 -object
> memory-backend-file,id=mem,size=1G,mem-path=/dev/shm,share=on -numa
> node,memdev=mem -mem-prealloc -chardev
> socket,id=char0,path=/tmp/vubrsrc.sock -netdev
> type=vhost-user,id=mynet1,chardev=char0,vhostforce -device
> virtio-net-pci,netdev=mynet1 /home/vmimages/f25.qcow2 -net none"
> # Start dest vhost bridge
> tmux -L $SESS new-window -n destvhostbr ./tests/vhost-user-bridge -u
> /tmp/vubrdst.sock -l 127.0.0.1:4445 -r 127.0.0.1:5556
> tmux -L $SESS new-window -n dest "./x86_64-softmmu/qemu-system-x86_64
> -enable-kvm -m 1G -smp 2 -object
> memory-backend-file,id=mem,size=1G,mem-path=/dev/shm,share=on -numa
> node,memdev=mem -mem-prealloc -chardev
> socket,id=char0,path=/tmp/vubrdst.sock -netdev
> type=vhost-user,id=mynet1,chardev=char0,vhostforce -device
> virtio-net-pci,netdev=mynet1 /home/vmimages/f25.qcow2 -net none -incoming
> tcp::8888"
>
> (I've got a few added printf's so the lines might be off by a few).
>
>
Thanks for the report
--
Marc-André Lureau