[Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to users

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to users

From:	Avi Kivity
Subject:	[Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to userspace
Date:	Mon, 07 Jun 2010 21:42:08 +0300
User-agent:	Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.9) Gecko/20100430 Fedora/3.0.4-3.fc13 Thunderbird/3.0.4

On 06/07/2010 08:04 PM, Anthony Liguori wrote:


I think we could also move the local APIC.

I'm not even sure we can safely move the ioapic/pic (mostly due tochurn). But the local APIC is so heavily accessed by the guest thatit's impossible to move it. Run an ftrace one day, especially on an smpguest. Every IPI requires several APIC accesses. Before a halt atickless kernel sets the wakeup timer. EOIs.

To optimize device models, we've tended to put the full device modelin the kernel whereas the hardware vendors have tended to put only thefast paths of the devices models in hardware.
For instance, we could introduce a userspace interface similar tovapic support whereas a shared page that mapped the APIC's layout wasused with a mask to select which registers trapped on read/write.

That leads to very problematic interfaces. When you separate along adevice boundary, you have a spec that defines the software interfaces.When you separate along a boundary that you define, it's up to you toget everything right.

In fact with the ioapic/pic/lapic one of the problems is that theinterconnection between the devices that is not well defined, and that'swhere we have bugs.

That said, I can understand an argument that the local APIC is part ofthe CPU state since it's a very special type of device.
A better example would be a generic counter kernel mechanism. I canenvision such a device as doing nothing more than providing aread-only view of a counter with a userspace configurable divider andwidth. Any write to the counter or read of any other byte outside thecounter register would result in a trap to userspace.

What about latches? byte access to word registers? There will be asmany special cases as there are timers.

If the kernel supported a bytecode/jit facility I'd happily use that todownload portions of the device model into the kernel.

That should allow both the PIT and the HPET to be accelerated withminimal effort in the kernel.

IMO it's probably more effort than porting HPET to the kernel. Tryoutlining an interface that supports PIT, HPET, RTC, and ACPI PMTIMER.

I'd be in favor of a straight port to userspace. We already have theinterfaces to communicate with an external device model for thesedevices so let's just take the kernel code and stick it into dedicatedthreads in userspace.

Currently we support an all-or-nothing approach. I don't think localAPIC in userspace is worthwhile. Esp. as it will slow down vhost andassigned devices significantly - interrupts will have to be mediated byuserspace.

I think it's easier to then work to merge the two bits of code in thesame tree than it is to try and take out-of-tree code and merge itincrementally.

Are you talking about qemu.git/qemu-kvm.git? That's the least of myconcerns, I'm worried about kvm.git.

5. Risk
We may find out after all this is implemented that performance is notacceptable and all the work will have to be dropped.
That's another advantage to a straight port to userspace. We cancollect performance data with only a modest amount of engineering effort.

Port what exactly? We have a userspace irqchip implementation. What wedon't have is just the ioapic/pic/pit in userspace, and the only way totry it out is to implement the whole thing.


--
I have a truly marvellous patch that fixes the bug which this
signature is too narrow to contain.

[Prev in Thread]

Current Thread

[Next in Thread]

[Qemu-devel] [RFC] Moving the kvm ioapic, pic, and pit back to userspace, Avi Kivity, 2010/06/07
- [Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to userspace, David S. Ahern, 2010/06/07
  - [Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to userspace, Avi Kivity, 2010/06/07
    - [Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to userspace, David S. Ahern, 2010/06/07
    - [Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to userspace, Avi Kivity, 2010/06/07
- [Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to userspace, Anthony Liguori, 2010/06/07
  - [Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to userspace, Avi Kivity <=
    - [Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to userspace, Anthony Liguori, 2010/06/07
    - [Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to userspace, Avi Kivity, 2010/06/08
- [Qemu-devel] RE: [RFC] Moving the kvm ioapic, pic, and pit back to userspace, Dong, Eddie, 2010/06/09
  - [Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to userspace, Avi Kivity, 2010/06/09
    - [Qemu-devel] RE: [RFC] Moving the kvm ioapic, pic, and pit back to userspace, Dong, Eddie, 2010/06/09
    - [Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to userspace, Avi Kivity, 2010/06/09
    - [Qemu-devel] RE: [RFC] Moving the kvm ioapic, pic, and pit back to userspace, Dong, Eddie, 2010/06/10

Prev by Date: Re: [Qemu-devel] [PATCH] configure: add an option to disable vlans
Next by Date: [Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to userspace
Previous by thread: [Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to userspace
Next by thread: [Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to userspace
Index(es):
- Date
- Thread