Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6

From:	Paolo Bonzini
Subject:	Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2
Date:	Wed, 02 Jul 2014 11:13:32 +0200
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:24.0) Gecko/20100101 Thunderbird/24.6.0

Il 02/07/2014 10:54, Stefan Hajnoczi ha scritto:

Both can be eliminated by introducing a fast path in bdrv_aio_{read,write}v,
that bypasses coroutines in the common case of no I/O throttling, no
copy-on-write, etc.


I tried that in 2012 and couldn't measure an improvement above the noise
threshold, although it was without dataplane.

BTW, we cannot eliminate the BH because the block layer guarantees that
callbacks are not invoked with reentrancy.  They are always invoked
directly from the event loop through a BH.  This simplifies callers
since they don't need to worry about callbacks happening while they are
still in bdrv_aio_readv(), for example.

Removing this guarantee (by making callers safe first) is orthogonal to
coroutines.  But it's hard to do since it requires auditing a lot of
code.

You could also audit the few implementations of bdrv_aio_readv/writev(including bdrv_aio_readv/writev_em) to guarantee that they do notdirectly invoke the callbacks. The rule was there before conversion tocoroutine, so the implementations should be fine. In fact, most of themare just forwarding to another bdrv_aio_readv/writev, and the others gothrough an EventNotifier or bottom half.

Drivers that implement bdrv_co_readv/writev would not enjoy the fastpath, and would keep using the BH.

Another idea is to skip aio_notify() when we're sure the event loop
isn't blocked in g_poll().  Doing this is a thread-safe and lockless way
might be tricky though.


Yes, good idea for 2.2 but not now.

So to recap, three issues are being discussed here:

1. rt_sigprocmask due to getcontext() in qemu_coroutine_new().  We
shouldn't be invoking qemu_coroutine_new() often.  The freelist is
probably too small.

Yes, right now it's 64 for the whole of QEMU. Originally it was 64 perthread (using TLS) but then TLS was dropped because of problems whencoroutines were created in the VCPU thread and destroyed in theiothread. Nowadays, the size should probably be dynamic---like 64 periothread to keep it simple.

2. Coroutines might be slower than the non-coroutine aio codepath.  I
don't think this is the case, they are very cheap and I was never able
to measure a real difference.

3. The block layer requires a BH with aio_notify() for
bdrv_aio_readv()/bdrv_aio_writev()/bdrv_aio_flush() callbacks regardless
of coroutines or not.  Eliminating the BH or skipping aio_notify() will
take some work but could speed up QEMU as a whole.

I think (3) is not really true, and the BH is the actual reason whycoroutines are slower.


Paolo

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2, Ming Lei, 2014/07/01
- Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2, Stefan Hajnoczi, 2014/07/01
  - Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2, Ming Lei, 2014/07/01
    - Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2, Paolo Bonzini, 2014/07/01
    - Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2, Ming Lei, 2014/07/01
    - Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2, Stefan Hajnoczi, 2014/07/02
    - Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2, Paolo Bonzini <=
    - Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2, Kevin Wolf, 2014/07/02
    - Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2, Paolo Bonzini, 2014/07/02
    - Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2, Kevin Wolf, 2014/07/02
    - Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2, Paolo Bonzini, 2014/07/02
    - Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2, Ming Lei, 2014/07/02
    - Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2, Ming Lei, 2014/07/02
    - Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2, Paolo Bonzini, 2014/07/02
    - Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2, Ming Lei, 2014/07/02
    - Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2, Paolo Bonzini, 2014/07/02
    - Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2, Ming Lei, 2014/07/02

Prev by Date: Re: [Qemu-devel] [PATCH v2 1/3] block: block: introduce bdrv_io_plug() and bdrv_io_unplug()
Next by Date: [Qemu-devel] [PATCH for-2.1] acpi: fix typo in memory hotplug MMIO region name
Previous by thread: Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2
Next by thread: Re: [Qemu-devel] [regression] dataplane: throughout -40% by commit 580b6b2aa2
Index(es):
- Date
- Thread