提交 · 6fc138d2278078990f597cb1f62fde9e5b458f96 · openanolis / cloud-kernel

30 1月, 2008 2 次提交

KVM: Support assigning userspace memory to the guest · 6fc138d2

由 Izik Eidus 提交于 10月 09, 2007

Instead of having the kernel allocate memory to the guest, let userspace
allocate it and pass the address to the kernel.

This is required for s390 support, but also enables features like memory
sharing and using hugetlbfs backed memory.
Signed-off-by: NIzik Eidus <izike@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

6fc138d2

KVM: Allow dynamic allocation of the mmu shadow cache size · 82ce2c96

由 Izik Eidus 提交于 10月 02, 2007

The user is now able to set how many mmu pages will be allocated to the guest.
Signed-off-by: NIzik Eidus <izike@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

82ce2c96

13 10月, 2007 12 次提交

KVM: Replace enum by #define · 8a45450d

由 Avi Kivity 提交于 10月 10, 2007

Easier for existence test (#ifdef) in userspace.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

8a45450d

KVM: in-kernel LAPIC save and restore support · 96ad2cc6

由 Eddie Dong 提交于 9月 06, 2007

This patch adds a new vcpu-based IOCTL to save and restore the local
apic registers for a single vcpu. The kernel only copies the apic page as
a whole, extraction of registers is left to userspace side. On restore, the
APIC timer is restarted from the initial count, this introduces a little
delay, but works fine.
Signed-off-by: NYaozu (Eddie) Dong <eddie.dong@intel.com>
Signed-off-by: NQing He <qing.he@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

96ad2cc6

KVM: in-kernel IOAPIC save and restore support · 6bf9e962

由 He, Qing 提交于 8月 05, 2007

This patch adds support for in-kernel ioapic save and restore (to
and from userspace). It uses the same get/set_irqchip ioctl as
in-kernel PIC.
Signed-off-by: NQing He <qing.he@intel.com>
Signed-off-by: NYaozu (Eddie) Dong <eddie.dong@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

6bf9e962

KVM: Add get/set irqchip ioctls for in-kernel PIC live migration support · 6ceb9d79

由 He, Qing 提交于 7月 26, 2007

This patch adds two new ioctls to dump and write kernel irqchips for
save/restore and live migration. PIC s/r and l/m is implemented in this
patch.
Signed-off-by: NYaozu (Eddie) Dong <eddie.dong@intel.com>
Signed-off-by: NQing He <qing.he@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

6ceb9d79

KVM: Emulate hlt in the kernel · b6958ce4

由 Eddie Dong 提交于 7月 18, 2007

By sleeping in the kernel when hlt is executed, we simplify the in-kernel
guest interrupt path considerably.
Signed-off-by: NGregory Haskins <ghaskins@novell.com>
Signed-off-by: NYaozu (Eddie) Dong <eddie.dong@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

b6958ce4

KVM: Emulate local APIC in kernel · 97222cc8

由 Eddie Dong 提交于 9月 12, 2007

Because lightweight exits (exits which don't involve userspace) are many
times faster than heavyweight exits, it makes sense to emulate high usage
devices in the kernel.  The local APIC is one such device, especially for
Windows and for SMP, so we add an APIC model to kvm.

It also allows in-kernel host-side drivers to inject interrupts without
going through userspace.

[compile fix on i386 from Jindrich Makovicka]
Signed-off-by: NYaozu (Eddie) Dong <Eddie.Dong@intel.com>
Signed-off-by: NQing He <qing.he@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

97222cc8

KVM: Add support for in-kernel PIC emulation · 85f455f7

由 Eddie Dong 提交于 7月 06, 2007

Signed-off-by: NYaozu (Eddie) Dong <eddie.dong@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

85f455f7

KVM: Communicate cr8 changes to userspace · 253abdee

由 Yang, Sheng 提交于 8月 16, 2007

This allows running 64-bit Windows.
Signed-off-by: NSheng Yang <sheng.yang@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

253abdee

KVM: add hypercall nr to kvm_run · 519ef353

由 Jeff Dike 提交于 7月 16, 2007

Add the hypercall number to kvm_run and initialize it. This changes the ABI,
but as this particular ABI was unusable before this no users are affected.
Signed-off-by: NJeff Dike <jdike@linux.intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

519ef353

KVM: Trivial: Use standard BITMAP macros, open-code userspace-exposed header · 9eb829ce

由 Rusty Russell 提交于 7月 18, 2007

Creating one's own BITMAP macro seems suboptimal: if we use manual
arithmetic in the one place exposed to userspace, we can use standard
macros elsewhere.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

9eb829ce

KVM: Trivial: /dev/kvm interface is no longer experimental. · dea8caee

由 Rusty Russell 提交于 7月 17, 2007

KVM interface is no longer experimental.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

dea8caee

KVM: Future-proof the exit information union ABI · 24cbc7e9

由 Avi Kivity 提交于 7月 17, 2007

Note that as the size of struct kvm_run is not part of the ABI, we can add
things at the end.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

24cbc7e9

03 5月, 2007 17 次提交

KVM: Remove unused 'instruction_length' · 2ff81f70

由 Avi Kivity 提交于 4月 29, 2007

As we no longer emulate in userspace, this is meaningless.  We don't
compute it on SVM anyway.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

2ff81f70

KVM: Don't require explicit indication of completion of mmio or pio · 02c83209

由 Avi Kivity 提交于 4月 29, 2007

It is illegal not to return from a pio or mmio request without completing
it, as mmio or pio is an atomic operation. Therefore, we can simplify
the userspace interface by avoiding the completion indication.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

02c83209

KVM: Add fpu get/set operations · b8836737

由 Avi Kivity 提交于 4月 01, 2007

These are really helpful when migrating an floating point app to another
machine.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

b8836737

KVM: Add physical memory aliasing feature · e8207547

由 Avi Kivity 提交于 3月 30, 2007

With this, we can specify that accesses to one physical memory range will
be remapped to another. This is useful for the vga window at 0xa0000 which
is used as a movable window into the (much larger) framebuffer.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

e8207547

KVM: Avoid guest virtual addresses in string pio userspace interface · 039576c0

由 Avi Kivity 提交于 3月 20, 2007

The current string pio interface communicates using guest virtual addresses,
relying on userspace to translate addresses and to check permissions. This
interface cannot fully support guest smp, as the check needs to take into
account two pages at one in case an unaligned string transfer straddles a
page boundary.

Change the interface not to communicate guest addresses at all; instead use
a buffer page (mmaped by userspace) and do transfers there. The kernel
manages the virtual to physical translation and can perform the checks
atomically by taking the appropriate locks.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

039576c0

KVM: Allow kernel to select size of mmap() buffer · 07c45a36

由 Avi Kivity 提交于 3月 07, 2007

This allows us to store offsets in the kernel/user kvm_run area, and be
sure that userspace has them mapped. As offsets can be outside the
kvm_run struct, userspace has no way of knowing how much to mmap.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

07c45a36

KVM: Add guest mode signal mask · 1961d276

由 Avi Kivity 提交于 3月 05, 2007

Allow a special signal mask to be used while executing in guest mode. This
allows signals to be used to interrupt a vcpu without requiring signal
delivery to a userspace handler, which is quite expensive. Userspace still
receives -EINTR and can get the signal via sigwait().
Signed-off-by: NAvi Kivity <avi@qumranet.com>

1961d276

KVM: Add a special exit reason when exiting due to an interrupt · 1b19f3e6

由 Avi Kivity 提交于 3月 04, 2007

This is redundant, as we also return -EINTR from the ioctl, but it
allows us to examine the exit_reason field on resume without seeing
old data.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

1b19f3e6

KVM: Fold kvm_run::exit_type into kvm_run::exit_reason · 8eb7d334

由 Avi Kivity 提交于 3月 04, 2007

Currently, userspace is told about the nature of the last exit from the
guest using two fields, exit_type and exit_reason, where exit_type has
just two enumerations (and no need for more). So fold exit_type into
exit_reason, reducing the complexity of determining what really happened.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

8eb7d334

A
KVM: Allow userspace to process hypercalls which have no kernel handler · b4e63f56
由 Avi Kivity 提交于 3月 04, 2007
```
This is useful for paravirtualized graphics devices, for example.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
b4e63f56
A
KVM: Add method to check for backwards-compatible API extensions · 5d308f45
由 Avi Kivity 提交于 3月 01, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
5d308f45

KVM: Renumber ioctls · 739872c5

由 Avi Kivity 提交于 3月 01, 2007

The recent changes have left the ioctl numbers in complete disarray.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

739872c5

KVM: Remove minor wart from KVM_CREATE_VCPU ioctl · 2a4dac39

由 Avi Kivity 提交于 3月 01, 2007

That ioctl does not transfer any data, so it should be an _IO rather than an
_IOW.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

2a4dac39

KVM: Remove the 'emulated' field from the userspace interface · 106b552b

由 Avi Kivity 提交于 3月 01, 2007

We no longer emulate single instructions in userspace.  Instead, we service
mmio or pio requests.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

106b552b

KVM: Handle cpuid in the kernel instead of punting to userspace · 06465c5a

由 Avi Kivity 提交于 2月 28, 2007

KVM used to handle cpuid by letting userspace decide what values to
return to the guest.  We now handle cpuid completely in the kernel.  We
still let userspace decide which values the guest will see by having
userspace set up the value table beforehand (this is necessary to allow
management software to set the cpu features to the least common denominator,
so that live migration can work).

The motivation for the change is that kvm kernel code can be impacted by
cpuid features, for example the x86 emulator.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

06465c5a

KVM: Do not communicate to userspace through cpu registers during PIO · 46fc1477

由 Avi Kivity 提交于 2月 22, 2007

Currently when passing the a PIO emulation request to userspace, we
rely on userspace updating %rax (on 'in' instructions) and %rsi/%rdi/%rcx
(on string instructions).  This (a) requires two extra ioctls for getting
and setting the registers and (b) is unfriendly to non-x86 archs, when
they get kvm ports.

So fix by doing the register fixups in the kernel and passing to userspace
only an abstract description of the PIO to be done.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

46fc1477

KVM: Use a shared page for kernel/user communication when runing a vcpu · 9a2bb7f4

由 Avi Kivity 提交于 2月 22, 2007

Instead of passing a 'struct kvm_run' back and forth between the kernel and
userspace, allocate a page and allow the user to mmap() it.  This reduces
needless copying and makes the interface expandable by providing lots of
free space.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

9a2bb7f4

04 3月, 2007 3 次提交

A
KVM: Bump API version · f7e6a45a
由 Avi Kivity 提交于 2月 21, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
f7e6a45a

KVM: Per-vcpu inodes · bccf2150

由 Avi Kivity 提交于 2月 21, 2007

Allocate a distinct inode for every vcpu in a VM.  This has the following
benefits:

 - the filp cachelines are no longer bounced when f_count is incremented on
   every ioctl()
 - the API and internal code are distinctly clearer; for example, on the
   KVM_GET_REGS ioctl, there is no need to copy the vcpu number from
   userspace and then copy the registers back; the vcpu identity is derived
   from the fd used to make the call

Right now the performance benefits are completely theoretical since (a) we
don't support more than one vcpu per VM and (b) virtualization hardware
inefficiencies completely everwhelm any cacheline bouncing effects.  But
both of these will change, and we need to prepare the API today.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

bccf2150

KVM: Create an inode per virtual machine · f17abe9a

由 Avi Kivity 提交于 2月 21, 2007

This avoids having filp->f_op and the corresponding inode->i_fop different,
which is a little unorthodox.

The ioctl list is split into two: global kvm ioctls and per-vm ioctls.  A new
ioctl, KVM_CREATE_VM, is used to create VMs and return the VM fd.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

f17abe9a

13 2月, 2007 2 次提交

[PATCH] kvm: Fix mismatch between 32-bit and 64-bit abi · 8cd13307

由 Avi Kivity 提交于 2月 12, 2007

Unfortunately requiring a version bump.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
Cc: Ingo Molnar <mingo@elte.hu>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

8cd13307

[PATCH] kvm: Two-way apic tpr synchronization · 54810342

由 Dor Laor 提交于 2月 12, 2007

We report the value of cr8 to userspace on an exit.  Also let userspace change
cr8 when we re-enter the guest.  The lets 64-bit guest code maintain the tpr
correctly.

Thanks for Yaniv Kamay for the idea.
Signed-off-by: NDor Laor <dor.laor@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>
Cc: Ingo Molnar <mingo@elte.hu>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

54810342

27 1月, 2007 1 次提交

[PATCH] KVM: SVM: Propagate cpu shutdown events to userspace · 46fe4ddd

由 Joerg Roedel 提交于 1月 26, 2007

This patch implements forwarding of SHUTDOWN intercepts from the guest on to
userspace on AMD SVM.  A SHUTDOWN event occurs when the guest produces a
triple fault (e.g.  on reboot).  This also fixes the bug that a guest reboot
actually causes a host reboot under some circumstances.
Signed-off-by: NJoerg Roedel <joerg.roedel@amd.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

46fe4ddd

06 1月, 2007 1 次提交

[PATCH] KVM: Improve interrupt response · c1150d8c

由 Dor Laor 提交于 1月 05, 2007

The current interrupt injection mechanism might delay an interrupt under
the following circumstances:

 - if injection fails because the guest is not interruptible (rflags.IF clear,
   or after a 'mov ss' or 'sti' instruction).  Userspace can check rflags,
   but the other cases or not testable under the current API.
 - if injection fails because of a fault during delivery.  This probably
   never happens under normal guests.
 - if injection fails due to a physical interrupt causing a vmexit so that
   it can be handled by the host.

In all cases the guest proceeds without processing the interrupt, reducing
the interactive feel and interrupt throughput of the guest.

This patch fixes the situation by allowing userspace to request an exit
when the 'interrupt window' opens, so that it can re-inject the interrupt
at the right time.  Guest interactivity is very visibly improved.
Signed-off-by: NDor Laor <dor.laor@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>
Acked-by: NIngo Molnar <mingo@elte.hu>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

c1150d8c

23 12月, 2006 1 次提交

[PATCH] KVM: API versioning · 0b76e20b

由 Avi Kivity 提交于 12月 22, 2006

Add compile-time and run-time API versioning.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

0b76e20b

11 12月, 2006 1 次提交

[PATCH] kvm: userspace interface · 6aa8b732

由 Avi Kivity 提交于 12月 10, 2006

web site: http://kvm.sourceforge.net

mailing list: kvm-devel@lists.sourceforge.net
  (http://lists.sourceforge.net/lists/listinfo/kvm-devel)

The following patchset adds a driver for Intel's hardware virtualization
extensions to the x86 architecture.  The driver adds a character device
(/dev/kvm) that exposes the virtualization capabilities to userspace.  Using
this driver, a process can run a virtual machine (a "guest") in a fully
virtualized PC containing its own virtual hard disks, network adapters, and
display.

Using this driver, one can start multiple virtual machines on a host.

Each virtual machine is a process on the host; a virtual cpu is a thread in
that process.  kill(1), nice(1), top(1) work as expected.  In effect, the
driver adds a third execution mode to the existing two: we now have kernel
mode, user mode, and guest mode.  Guest mode has its own address space mapping
guest physical memory (which is accessible to user mode by mmap()ing
/dev/kvm).  Guest mode has no access to any I/O devices; any such access is
intercepted and directed to user mode for emulation.

The driver supports i386 and x86_64 hosts and guests.  All combinations are
allowed except x86_64 guest on i386 host.  For i386 guests and hosts, both pae
and non-pae paging modes are supported.

SMP hosts and UP guests are supported.  At the moment only Intel
hardware is supported, but AMD virtualization support is being worked on.

Performance currently is non-stellar due to the naive implementation of the
mmu virtualization, which throws away most of the shadow page table entries
every context switch.  We plan to address this in two ways:

- cache shadow page tables across tlb flushes
- wait until AMD and Intel release processors with nested page tables

Currently a virtual desktop is responsive but consumes a lot of CPU.  Under
Windows I tried playing pinball and watching a few flash movies; with a recent
CPU one can hardly feel the virtualization.  Linux/X is slower, probably due
to X being in a separate process.

In addition to the driver, you need a slightly modified qemu to provide I/O
device emulation and the BIOS.

Caveats (akpm: might no longer be true):

- The Windows install currently bluescreens due to a problem with the
  virtual APIC.  We are working on a fix.  A temporary workaround is to
  use an existing image or install through qemu
- Windows 64-bit does not work.  That's also true for qemu, so it's
  probably a problem with the device model.

[bero@arklinux.org: build fix]
[simon.kagstrom@bth.se: build fix, other fixes]
[uril@qumranet.com: KVM: Expose interrupt bitmap]
[akpm@osdl.org: i386 build fix]
[mingo@elte.hu: i386 fixes]
[rdreier@cisco.com: add log levels to all printks]
[randy.dunlap@oracle.com: Fix sparse NULL and C99 struct init warnings]
[anthony@codemonkey.ws: KVM: AMD SVM: 32-bit host support]
Signed-off-by: NYaniv Kamay <yaniv@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>
Cc: Simon Kagstrom <simon.kagstrom@bth.se>
Cc: Bernhard Rosenkraenzer <bero@arklinux.org>
Signed-off-by: NUri Lublin <uril@qumranet.com>
Cc: Ingo Molnar <mingo@elte.hu>
Cc: Roland Dreier <rolandd@cisco.com>
Signed-off-by: NRandy Dunlap <randy.dunlap@oracle.com>
Signed-off-by: NAnthony Liguori <anthony@codemonkey.ws>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

6aa8b732

openanolis / cloud-kernel 大约 1 年 前同步成功

openanolis / cloud-kernel
大约 1 年前同步成功