From: Avi Kivity
Subject: Re: [kvm-devel] [Qemu-devel] Re: [PATCH 1/3] Refactor AIO interface to allow other AIO implementations
Date: Mon, 21 Apr 2008 09:39:31 +0300
User-agent: Thunderbird 2.0.0.12 (X11/20080226)

Jamie Lokier wrote:
Avi Kivity wrote:
Does that mean "for the majority of deployments, the slow version is
sufficient.  The few that care about performance can use Linux AIO?"
In essence, yes. s/slow/slower/ and s/performance/ultimate block device performance/.

Many deployments don't care at all about block device performance; they care mostly about networking performance.

That's interesting.  I'd have expected block device performance to be
important for most things, for the same reason that disk performance
is (well, reasonably) important for non-virtual machines.


Seek time is important. Bandwidth is somewhat important. But for one- and two-spindle workloads (the majority), the cpu utilization induced by getting requests to the disk is not important, and that's what we're optimizing here.

Disks work at around 300 Hz. Processors at around 3 GHz. That's seven orders of magnitude of difference. Even if you spend 100 usec calculating the next best seek, and it saves you only 10% of seeks, it's still a win. And of course modern processors spend at most a few microseconds getting a request out.
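
Putting rough numbers on that argument, using the ~300 Hz figure above (i.e. on the order of 3.3 ms per disk operation):

    disk time saved by avoiding 10% of seeks  ~ 0.10 x 3,300 us = ~330 us per request, on average
    cpu time spent choosing the better seek   ~ 100 us per request

That is roughly 330 us of scarce disk time bought with 100 us of comparatively plentiful cpu time.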

You really need 50+ disks or a large write-back cache before micro-optimizations in the submission path become noticeable.

But as you say next:

I'm under the impression that the entire and only point of Linux AIO
is that it's faster than POSIX AIO on Linux.
It is. I estimate posix aio adds a few microseconds above linux aio per I/O request, when using O_DIRECT. Assuming 10 microseconds, you will need 10,000 I/O requests per second per vcpu to have a 10% performance difference. That's definitely rare.
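
The arithmetic behind that estimate, spelled out with the figures given above:

    ~10 us of extra cpu time per request x 10,000 requests/s = 100 ms of cpu time per second
                                                             = 10% of one vcpu

So only at around 10K IOPS per vcpu does the posix aio overhead reach the 10% mark.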

Oh, I didn't realise the difference was so small.

At such a tiny difference, I'm wondering why Linux AIO exists at all, as it complicates the kernel rather a lot. I can see the theoretical appeal, but if the performance benefit is so marginal, I'm surprised it's in there.


Linux aio exists, but that's all that can be said for it. It works mostly for raw disks, doesn't integrate with networking, and doesn't advance at the same pace as the rest of the kernel. I believe only databases use it (and a userspace filesystem I wrote some time ago).
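
For readers who have not used it, the interface under discussion looks roughly like this. A minimal, illustrative libaio sketch against an O_DIRECT file descriptor; the device path is made up, error handling is omitted, and this is not QEMU code. Build with -laio.

    #define _GNU_SOURCE                           /* for O_DIRECT */
    #include <libaio.h>
    #include <fcntl.h>
    #include <stdlib.h>

    int main(void)
    {
        /* Hypothetical raw device; O_DIRECT requires aligned buffers. */
        int fd = open("/dev/sdb", O_RDONLY | O_DIRECT);
        void *buf;
        posix_memalign(&buf, 512, 4096);

        io_context_t ctx = 0;
        io_setup(128, &ctx);                      /* create the AIO context */

        struct iocb cb, *cbs[1] = { &cb };
        io_prep_pread(&cb, fd, buf, 4096, 0);     /* one 4k read at offset 0 */
        io_submit(ctx, 1, cbs);                   /* hand the request to the kernel */

        struct io_event ev;
        io_getevents(ctx, 1, 1, &ev, NULL);       /* wait for completion */

        io_destroy(ctx);
        free(buf);
        return 0;
    }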

I'm also surprised the Glibc implementation of AIO using ordinary
threads is so close to it.

Why are you surprised?

Actually the glibc implementation could be improved, from what I've heard. My estimates are for a thread pool implementation, but there is no reason why glibc couldn't achieve exactly the same performance.
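
A thread-pool implementation in the sense used here is nothing exotic. The sketch below shows the idea: worker threads servicing queued requests with ordinary blocking pread(). Names and structure are illustrative, not glibc's (or QEMU's) actual code; completion notification is elided. Build with -lpthread.

    #define _GNU_SOURCE
    #include <pthread.h>
    #include <unistd.h>

    struct io_req {                               /* hypothetical request type */
        int fd; void *buf; size_t len; off_t off;
        struct io_req *next;
    };

    static struct io_req *queue;                  /* LIFO, for brevity */
    static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;
    static pthread_cond_t  cond = PTHREAD_COND_INITIALIZER;

    static void *worker(void *arg)
    {
        (void)arg;
        for (;;) {
            pthread_mutex_lock(&lock);
            while (queue == NULL)
                pthread_cond_wait(&cond, &lock);
            struct io_req *r = queue;             /* pop a request */
            queue = r->next;
            pthread_mutex_unlock(&lock);

            pread(r->fd, r->buf, r->len, r->off); /* plain synchronous I/O */
            /* ... notify the submitter of completion here ... */
        }
        return NULL;
    }

    void aio_pool_start(int nthreads)             /* spawn a few workers */
    {
        while (nthreads--) {
            pthread_t t;
            pthread_create(&t, NULL, worker, NULL);
        }
    }

    void aio_pool_submit(struct io_req *r)        /* called by the emulator thread */
    {
        pthread_mutex_lock(&lock);
        r->next = queue;
        queue = r;
        pthread_mutex_unlock(&lock);
        pthread_cond_signal(&cond);
    }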

And then, I'm wondering why use AIO at all: it suggests QEMU would run about as fast doing synchronous I/O in a few dedicated I/O threads.


Posix aio is the unix API for this, why not use it?
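
For comparison, the POSIX AIO interface itself, again as a minimal, illustrative sketch rather than anything resembling QEMU's actual usage. Real code would use aio_suspend() or completion notification instead of spinning; link with -lrt on glibc.

    #include <aio.h>
    #include <errno.h>
    #include <string.h>

    /* One asynchronous read, completed by polling. */
    ssize_t read_async(int fd, void *buf, size_t len, off_t off)
    {
        struct aiocb cb;
        memset(&cb, 0, sizeof(cb));
        cb.aio_fildes = fd;
        cb.aio_buf    = buf;
        cb.aio_nbytes = len;
        cb.aio_offset = off;

        if (aio_read(&cb) < 0)                    /* queue the request */
            return -1;

        while (aio_error(&cb) == EINPROGRESS)
            ;                                     /* wait for it to finish */

        return aio_return(&cb);                   /* same meaning as read()'s return value */
    }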

Also, I'd presume that those who need 10K IOPS and above will not place their high-throughput images on a filesystem, but rather on a separate SAN LUN.

Does the separate LUN make any difference?  I thought O_DIRECT on a
filesystem was meant to be pretty close to block device performance.

On a good extent-based filesystem like XFS you will get good performance (though with more cpu overhead, due to going through additional mapping layers). Old clunkers like ext3 will require additional seeks or a ton of cache (1 GB per 1 TB).


I base this on messages here and there which say swapping to a file is
about as fast as swapping to a block device, nowadays.

Swapping to a file preloads the block mapping into memory, so the filesystem is not involved at all in the I/O path.

--
Do not meddle in the internals of kernels, for they are subtle and quick to 
panic.
