[Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to users


From:	Avi Kivity
Subject:	[Qemu-devel] Re: [RFC] Moving the kvm ioapic, pic, and pit back to userspace
Date:	Tue, 08 Jun 2010 08:48:13 +0300
User-agent:	Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.9) Gecko/20100430 Fedora/3.0.4-3.fc13 Thunderbird/3.0.4

On 06/08/2010 01:23 AM, Anthony Liguori wrote:

A better example would be a generic counter kernel mechanism. I can envision such a device as doing nothing more than providing a read-only view of a counter with a userspace configurable divider and width. Any write to the counter or read of any other byte outside the counter register would result in a trap to userspace.
What about latches? Byte access to word registers? There will be as many special cases as there are timers.
If the kernel supported a bytecode/jit facility I'd happily use that to download portions of the device model into the kernel.
That should allow both the PIT and the HPET to be accelerated with minimal effort in the kernel.
IMO it's probably more effort than porting HPET to the kernel. Try outlining an interface that supports PIT, HPET, RTC, and ACPI PMTIMER.
I was referring specifically to time sources, not time events.
An accelerated counter for HPET is pretty trivial. It's a 32-bit register that's actually a nanosecond value in qemu. We need to be able to set an offset from the host wall clock time, a means to stop it, and a means to start it.
The PIT is latched, so the kernel needs to know enough about how to decode the PIT state to understand the latching. There's very little state associated with latching though, so I don't think this is a huge problem. It's a fixed value write to a fixed register followed by a read to a fixed register. The act of latching doesn't affect the state beyond the fact that you need to save the latched value in the event that you have a live migration before reading the latched value.
The PMTIMER is also pretty straightforward. It's a variable port address (that's fixed during execution).
Even if we require three separate interfaces, the interfaces are so simple that it seems like an obvious win.


So a non-generic interface - 4x the interfaces (including RTC).

Those counters raise interrupts when they expire, and set various status bits in their hardware. So we need 4x of:


  set counter value, frequency, and reload interval
  raise alarm to userspace on expiration
  set counter memory/ioport location and availability
  read counter value

and we haven't solved interrupt coalescing.

5. Risk
We may find out after all this is implemented that performance is not acceptable and all the work will have to be dropped.
That's another advantage to a straight port to userspace. We can collect performance data with only a modest amount of engineering effort.
Port what exactly? We have a userspace irqchip implementation. What we don't have is just the ioapic/pic/pit in userspace, and the only way to try it out is to implement the whole thing.
If you take the kernel code and do a pretty straight port (switching kernel functions to libc functions and maintaining all the existing locking via pthreads), you could then implement a very simple MMIO/PIO dispatch mechanism in the kvm code that shortcuts those devices before we ever hit the qemu_mutex and the traditional qemu code paths. It should be a relatively easy conversion and it gives a proper vehicle for experimentation.
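The "very simple MMIO/PIO dispatch mechanism" could be as small as a range table consulted before the regular path. The sketch below is only an illustration for port I/O: the types, the table, and the stub handler are hypothetical, and only the PIT port range (0x40 to 0x43) is taken from the standard PC layout.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical fast-path handler: returns true if the access was handled. */
typedef bool (*pio_handler)(uint16_t port, bool is_write, uint8_t *data);

struct pio_range {
    uint16_t base;
    uint16_t len;
    pio_handler fn;
};

static unsigned pit_hits;  /* illustration only: counts fast-path accesses */

/* Stub standing in for a ported PIT model with its own locking. */
static bool pit_pio(uint16_t port, bool is_write, uint8_t *data)
{
    (void)port;
    pit_hits++;
    if (!is_write)
        *data = 0;
    return true;
}

static const struct pio_range fast_ranges[] = {
    { 0x40, 4, pit_pio },  /* PIT channels 0-2 and control word */
};

/*
 * Try the shortcut table first. A false return means the caller should take
 * the global lock and go through the traditional device dispatch.
 */
static bool fast_dispatch(uint16_t port, bool is_write, uint8_t *data)
{
    assert(data != NULL);
    for (size_t i = 0; i < sizeof(fast_ranges) / sizeof(fast_ranges[0]); i++) {
        const struct pio_range *r = &fast_ranges[i];
        if (port >= r->base && port - r->base < r->len)
            return r->fn(port, is_write, data);
    }
    return false;
}
```

The sketch says nothing about how such a device would post interrupts, which is where the shortcut has to rejoin the rest of the device model.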

Those devices don't exist independently of the rest of the devices. If they need to post interrupts, they will need the traditional qemu code paths.

(I'm trying to view the move from the POV of the kernel first, assuming userspace is as efficient as possible; so I'm not arguing qemu inefficiencies should prevent us from doing it. But they do add up considerably to the amount of work involved.)

In fact, you could pretty quickly determine viability by porting the PIT to userspace and implementing a vpit interface in the kernel that allowed the channel 0 counters to be latched and read within lightweight exits.

Just looking at it shows the interface is incredibly messy. You have to maintain the control word in the kernel (since it tells you which counter to read or write), so now you need a userspace interface to read and write the control word. With the current interface, you have the entire thing in a black box that you don't need to worry about (except for the speaker port...).
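To show how much the control word carries, here is a minimal sketch of decoding it using the standard 8254 bit layout (bits 7:6 counter select, bits 5:4 access mode, bits 3:1 operating mode, bit 0 BCD). The struct and function names are hypothetical.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Decoded 8254 control word; field names are illustrative. */
struct pit_ctrl {
    uint8_t channel;  /* bits 7:6, 3 = read-back command on the 8254 */
    uint8_t access;   /* bits 5:4, 0 = latch, 1 = LSB, 2 = MSB, 3 = LSB then MSB */
    uint8_t mode;     /* bits 3:1, operating mode */
    bool bcd;         /* bit 0, BCD counting */
};

static struct pit_ctrl pit_decode_ctrl(uint8_t val)
{
    struct pit_ctrl c;

    c.channel = (val >> 6) & 0x3;
    c.access = (val >> 4) & 0x3;
    c.mode = (val >> 1) & 0x7;
    c.bcd = (val & 0x1) != 0;
    assert(c.channel <= 3 && c.access <= 3);
    return c;
}
```

For example, `pit_decode_ctrl(0x34)` describes channel 0, LSB-then-MSB access, mode 2, binary counting. Whichever side services data-port accesses needs the access field to interpret them, which is why splitting the device means this state has to be readable and writable across the kernel/userspace boundary.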



--
I have a truly marvellous patch that fixes the bug which this
signature is too narrow to contain.
