Re: [Qemu-devel] [PATCH RFC] virtio: put last seen used index into ring

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] [PATCH RFC] virtio: put last seen used index into ring

From:	Avi Kivity
Subject:	Re: [Qemu-devel] [PATCH RFC] virtio: put last seen used index into ring itself
Date:	Thu, 20 May 2010 10:00:56 +0300
User-agent:	Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.9) Gecko/20100330 Fedora/3.0.4-1.fc12 Thunderbird/3.0.4

On 05/20/2010 08:01 AM, Rusty Russell wrote:

A device with out of order
completion (like virtio-blk) will quickly randomize the unused
descriptor indexes, so every descriptor fetch will require a bounce.

In contrast, if the rings hold the descriptors themselves instead of
pointers, we bounce (sizeof(descriptor)/cache_line_size) cache lines for
every descriptor, amortized.

We already have indirect, this would be a logical next step.  So let's
think about it. The avail ring would contain 64 bit values, the used ring
would contain indexes into the avail ring.

Have just one ring, no indexes. The producer places descriptors intothe ring and updates the head, The consumer copies out descriptors tobe processed and copies back in completed descriptors. Chaining isalways linear. The descriptors contain a tag that allow the producer toidentify the completion.

Indirect only pays when there are enough descriptors in it to fill acouple of cache lines. Otherwise it's an extra bounce.

We will always bounce here, that what happens when transferring data.The question is whether how many cache lines per descriptor. A pointeradds 1 bounce, linear descriptors cost 1/4 bounce, chained descriptorscost a bounce. So best is one ring of linearly chained descriptors.Indirect works when you have large requests (like block).

So client writes descriptor page and adds to avail ring, then writes to
index.
Server reads index, avail ring, descriptor page (3).  Writes used
entry (1).  Updates last_used (1).  Client reads used (1), derefs avail (1),
updates last_used (1), cleans descriptor page (1).

That's 9 cacheline transfers, worst case.  Best case of a half-full ring
in steady state, assuming 128-byte cache lines, the avail ring costs are
1/16, the used entry is 1/64.  This drops it to 6 and 9/64 transfers.


Cache lines are 64 bytes these days.

With a single ring, client writes descriptors (ceil(N/4)), updates head(1). Server reads head (1) copies out descriptors (ceil(N/4)), issuesrequests, copies back completions ((ceil(N/4)), updates tail (1).Client reads back tail and descriptors (1 + ceil(N/4))


Worst case: 4 + 4 * ceil(N/4).  Best case I think this drops by half.

(Note, the current scheme adds 2 more cacheline transfers, for the descriptor
table, worst case.


2 bounces per descriptor due to random access.

   Assuming indirect, we get 2/8 xfer best case.  Either way,
it's not the main source of cacheline xfers).

Indirect adds a double bounce to get to the descriptor table, but anydescriptors there are accessed linearly. It's only good when you havelarge chains.

Can we do better?  The obvious idea is to try to get rid of last_used and
used, and use the ring itself.  We would use an invalid entry to mark the
head of the ring.

Interesting! So a peer will read until it hits a wall. But how toupdate the wall atomically?

Maybe we can have a flag in the descriptor indicate headness ortailness. Update looks ugly though: write descriptor with head flag,write next descriptor with head flag, remove flag from previous descriptor.


--
Do not meddle in the internals of kernels, for they are subtle and quick to 
panic.

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [Qemu-devel] [PATCH RFC] virtio: put last seen used index into ring itself, (continued)
- [Qemu-devel] Re: [PATCH RFC] virtio: put last seen used index into ring itself, Ryan Harper, 2010/05/11
  - [Qemu-devel] Re: [PATCH RFC] virtio: put last seen used index into ring itself, Michael S. Tsirkin, 2010/05/11

Prev by Date: [Qemu-devel] [PATCH] pc: fix segfault introduced by 3d53f5c36ff6
Next by Date: Re: [Qemu-devel] [PATCH] Add QEMU DirectFB display driver
Previous by thread: Re: [Qemu-devel] [PATCH RFC] virtio: put last seen used index into ring itself
Next by thread: Re: [Qemu-devel] [PATCH RFC] virtio: put last seen used index into ring itself
Index(es):
- Date
- Thread