Commits · 54738c097163c3f01e67ccc85462b78d4d4f495f · OpenHarmony / kernel_linux

12 Jul, 2011 7 commits

KVM: PPC: Accelerate H_PUT_TCE by implementing it in real mode · 54738c09

Authored by David Gibson on Jun 29, 2011

This improves I/O performance for guests using the PAPR
paravirtualization interface by making the H_PUT_TCE hcall faster, by
implementing it in real mode.  H_PUT_TCE is used for updating virtual
IOMMU tables, and is used both for virtual I/O and for real I/O in the
PAPR interface.

Since this moves the IOMMU tables into the kernel, we define a new
KVM_CREATE_SPAPR_TCE ioctl to allow qemu to create the tables.  The
ioctl returns a file descriptor which can be used to mmap the newly
created table.  The qemu driver models use them in the same way as
userspace managed tables, but they can be updated directly by the
guest with a real-mode H_PUT_TCE implementation, reducing the number
of host/guest context switches during guest IO.

There are certain circumstances where it is useful for userland qemu
to write to the TCE table even if the kernel H_PUT_TCE path is used
most of the time.  Specifically, allowing this will avoid awkwardness
when we need to reset the table.  More importantly, we will in the
future need to write the table in order to restore its state after a
checkpoint resume or migration.
Signed-off-by: David Gibson <david@gibson.dropbear.id.au>
Signed-off-by: Paul Mackerras <paulus@samba.org>
Signed-off-by: Alexander Graf <agraf@suse.de>
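As a rough illustration of what an H_PUT_TCE handler has to do with such a table, the sketch below models one TCE table as a flat array indexed by I/O bus address. It is a user-space model, not the kernel code: `tce_table`, `tce_table_init`, `model_put_tce`, and the 4 KiB IOMMU page size are assumptions made for the example. The return codes follow the PAPR hcall convention (`H_SUCCESS` is 0, `H_PARAMETER` is -4).

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

#define H_SUCCESS    0L     /* PAPR hcall return code: success */
#define H_PARAMETER  (-4L)  /* PAPR hcall return code: bad parameter */
#define TCE_SHIFT    12     /* assumption: 4 KiB IOMMU pages */

/* Hypothetical model of one guest-visible TCE table (one DMA window). */
struct tce_table {
    uint64_t liobn;        /* logical I/O bus number naming the window */
    uint64_t window_size;  /* bytes of I/O address space covered */
    uint64_t *entries;     /* one 64-bit TCE per IOMMU page */
};

/* Allocate a zeroed table; returns 0 on success, -1 on allocation failure. */
int tce_table_init(struct tce_table *t, uint64_t liobn, uint64_t window_size)
{
    /* The window must cover a whole number of IOMMU pages. */
    assert(window_size % (1ULL << TCE_SHIFT) == 0);
    t->liobn = liobn;
    t->window_size = window_size;
    t->entries = calloc(window_size >> TCE_SHIFT, sizeof(uint64_t));
    return t->entries ? 0 : -1;
}

/* Model of the hcall: store `tce` in the slot selected by `ioba`. */
long model_put_tce(struct tce_table *t, uint64_t liobn,
                   uint64_t ioba, uint64_t tce)
{
    /* Reject an unknown window or an address outside the window. */
    if (liobn != t->liobn || ioba >= t->window_size)
        return H_PARAMETER;
    t->entries[ioba >> TCE_SHIFT] = tce;
    return H_SUCCESS;
}
```

Because the table in this model is ordinary memory, a second writer that maps the same pages (the role the commit message gives to qemu through the mmap-able file descriptor) could update `entries` directly, for example to reset the table or restore it after migration.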


KVM: PPC: Add support for Book3S processors in hypervisor mode · de56a948

Authored by Paul Mackerras on Jun 29, 2011

This adds support for KVM running on 64-bit Book 3S processors,
specifically POWER7, in hypervisor mode. Using hypervisor mode means
that the guest can use the processor's supervisor mode. That means
that the guest can execute privileged instructions and access privileged
registers itself without trapping to the host. This gives excellent
performance, but does mean that KVM cannot emulate a processor
architecture other than the one that the hardware implements.

This code assumes that the guest is running paravirtualized using the
PAPR (Power Architecture Platform Requirements) interface, which is the
interface that IBM's PowerVM hypervisor uses. That means that existing
Linux distributions that run on IBM pSeries machines will also run
under KVM without modification. In order to communicate the PAPR
hypercalls to qemu, this adds a new KVM_EXIT_PAPR_HCALL exit code
to include/linux/kvm.h.

Currently the choice between book3s_hv support and book3s_pr support
(i.e. the existing code, which runs the guest in user mode) has to be
made at kernel configuration time, so a given kernel binary can only
do one or the other.

This new book3s_hv code doesn't support MMIO emulation at present.
Since we are running paravirtualized guests, this isn't a serious
restriction.

With the guest running in supervisor mode, most exceptions go straight
to the guest. We will never get data or instruction storage or segment
interrupts, alignment interrupts, decrementer interrupts, program
interrupts, single-step interrupts, etc., coming to the hypervisor from
the guest. Therefore this introduces a new KVMTEST_NONHV macro for the
exception entry path so that we don't have to do the KVM test on entry
to those exception handlers.

We do however get hypervisor decrementer, hypervisor data storage,
hypervisor instruction storage, and hypervisor emulation assist
interrupts, so we have to handle those.

In hypervisor mode, real-mode accesses can access all of RAM, not just
a limited amount. Therefore we put all the guest state in the vcpu.arch
and use the shadow_vcpu in the PACA only for temporary scratch space.
We allocate the vcpu with kzalloc rather than vzalloc, and we don't use
anything in the kvmppc_vcpu_book3s struct, so we don't allocate it.
We don't have a shared page with the guest, but we still need a
kvm_vcpu_arch_shared struct to store the values of various registers,
so we include one in the vcpu_arch struct.

The POWER7 processor has a restriction that all threads in a core have
to be in the same partition. MMU-on kernel code counts as a partition
(partition 0), so we have to do a partition switch on every entry to and
exit from the guest. At present we require the host and guest to run
in single-thread mode because of this hardware restriction.

This code allocates a hashed page table for the guest and initializes
it with HPTEs for the guest's Virtual Real Memory Area (VRMA). We
require that the guest memory is allocated using 16MB huge pages, in
order to simplify the low-level memory management. This also means that
we can get away without tracking paging activity in the host for now,
since huge pages can't be paged or swapped.

This also adds a few new exports needed by the book3s_hv code.
Signed-off-by: Paul Mackerras <paulus@samba.org>
Signed-off-by: Alexander Graf <agraf@suse.de>

KVM: Fix KVM_ASSIGN_SET_MSIX_ENTRY documentation · 58f0964e

Authored by Jan Kiszka on Jun 11, 2011

The documented behavior did not match the implemented one (which also
never changed).
Signed-off-by: Jan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

KVM: Clarify KVM_ASSIGN_PCI_DEVICE documentation · 91e3d71d

Authored by Jan Kiszka on Jun 03, 2011

Neither host_irq nor the guest_msi struct are used anymore today.
Tag the former, drop the latter to avoid confusion.
Signed-off-by: Jan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

KVM: Fixup documentation section numbering · 7f4382e8

Authored by Jan Kiszka on Jun 02, 2011

Signed-off-by: Jan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

KVM: Document KVM_IOEVENTFD · 55399a02

Authored by Sasha Levin on May 28, 2011

Document KVM_IOEVENTFD that can be used to receive
notifications of PIO/MMIO events without triggering
an exit.
Signed-off-by: Sasha Levin <levinsasha928@gmail.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

KVM: Document KVM_GET_LAPIC, KVM_SET_LAPIC ioctl · e7677933

Authored by Avi Kivity on May 11, 2011

Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

07 May, 2011 1 commit

Move kvm, uml, and lguest subdirectories under a common "virtual" directory, I.E: · ed16648e

Authored by Rob Landley on May 06, 2011

  cd Documentation
  mkdir virtual
  git mv kvm uml lguest virtual
Signed-off-by: Rob Landley <rlandley@parallels.com>
Signed-off-by: Randy Dunlap <randy.dunlap@oracle.com>

15 Feb, 2011 1 commit

doc: Fix numbering of KVM API description sections · 68ba6974

Authored by Paul Bolle on Feb 15, 2011

Signed-off-by: Paul Bolle <pebolle@tiscali.nl>
Reviewed-by: Jesper Juhl <jj@chaosbits.net>
Signed-off-by: Jiri Kosina <jkosina@suse.cz>

12 Jan, 2011 1 commit

KVM: Document device assigment API · 49f48172

Authored by Jan Kiszka on Nov 16, 2010

Adds API documentation for KVM_[DE]ASSIGN_PCI_DEVICE,
KVM_[DE]ASSIGN_DEV_IRQ, KVM_SET_GSI_ROUTING, KVM_ASSIGN_SET_MSIX_NR, and
KVM_ASSIGN_SET_MSIX_ENTRY.
Acked-by: Alex Williamson <alex.williamson@redhat.com>
Acked-by: Michael S. Tsirkin <mst@redhat.com>
Signed-off-by: Jan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

02 Nov, 2010 1 commit

tree-wide: fix comment/printk typos · b595076a

Authored by Uwe Kleine-König on Nov 01, 2010

"gadget", "through", "command", "maintain", "maintain", "controller", "address",
"between", "initiali[zs]e", "instead", "function", "select", "already",
"equal", "access", "management", "hierarchy", "registration", "interest",
"relative", "memory", "offset", "already",
Signed-off-by: Uwe Kleine-König <u.kleine-koenig@pengutronix.de>
Signed-off-by: Jiri Kosina <jkosina@suse.cz>

24 Oct, 2010 3 commits

KVM: Document that KVM_GET_SUPPORTED_CPUID may return emulated values · c39cbd2a

Authored by Avi Kivity on Sep 12, 2010

Signed-off-by: Avi Kivity <avi@redhat.com>

KVM: PPC: Document KVM_INTERRUPT ioctl · 6f7a2bd4

Authored by Alexander Graf on Aug 31, 2010

This adds some documentation for the KVM_INTERRUPT special cases that
PowerPC now implements.
Signed-off-by: Alexander Graf <agraf@suse.de>

KVM: PPC: Add get_pvinfo interface to query hypercall instructions · 15711e9c

Authored by Alexander Graf on Jul 29, 2010

We need to tell the guest the opcodes that make up a hypercall through
interfaces that are controlled by userspace. So we need to add a call
for userspace to allow it to query those opcodes so it can pass them
on.

This is required because the hypercall opcodes can change based on
the hypervisor conditions. If we're running in hardware accelerated
hypervisor mode, a hypercall looks different from when we're running
without hardware acceleration.
Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

02 Aug, 2010 2 commits

KVM: Document KVM_GET_SUPPORTED_CPUID2 ioctl · d153513d

Authored by Avi Kivity on Jul 14, 2010

Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

KVM: Document MCE banks non-exposure via KVM_GET_MSR_INDEX_LIST · 2e2602ca

Authored by Avi Kivity on Jul 07, 2010

Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

01 Aug, 2010 5 commits

KVM: Remove kernel-allocated memory regions · b74a07be

Authored by Avi Kivity on Jun 21, 2010

Equivalent (and better) functionality is provided by user-allocated memory
regions.
Signed-off-by: Avi Kivity <avi@redhat.com>

KVM: Remove memory alias support · a1f4d395

Authored by Avi Kivity on Jun 21, 2010

As advertised in feature-removal-schedule.txt.  Equivalent support is provided
by overlapping memory regions.
Signed-off-by: Avi Kivity <avi@redhat.com>

KVM: x86: XSAVE/XRSTOR live migration support · 2d5b5a66

Authored by Sheng Yang on Jun 13, 2010

This patch enable save/restore of xsave state.
Signed-off-by: Sheng Yang <sheng@linux.intel.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

KVM: Document KVM_SET_BOOT_CPU_ID · 57bc24cf

Authored by Avi Kivity on Apr 29, 2010

Signed-off-by: Avi Kivity <avi@redhat.com>

KVM: Document KVM_SET_IDENTITY_MAP ioctl · 47dbb84f

Authored by Avi Kivity on Apr 29, 2010

Signed-off-by: Avi Kivity <avi@redhat.com>

17 May, 2010 6 commits

KVM: Document KVM_GET_MP_STATE and KVM_SET_MP_STATE · b843f065

Authored by Avi Kivity on Apr 25, 2010

Acked-by: Pekka Enberg <penberg@cs.helsinki.fi>
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

KVM: Document replacements for KVM_EXIT_HYPERCALL · 647dc49e

Authored by Avi Kivity on Apr 01, 2010

Signed-off-by: Avi Kivity <avi@redhat.com>

KVM: PPC: Add OSI hypercall interface · ad0a048b

Authored by Alexander Graf on Mar 24, 2010

MOL uses its own hypercall interface to call back into userspace when
the guest wants to do something.

So let's implement that as an exit reason, specify it with a CAP and
only really use it when userspace wants us to.

The only user of it so far is MOL.
Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

KVM: Add support for enabling capabilities per-vcpu · 71fbfd5f

Authored by Alexander Graf on Mar 24, 2010

Some times we don't want all capabilities to be available to all
our vcpus. One example for that is the OSI interface, implemented
in the next patch.

In order to have a generic mechanism in how to enable capabilities
individually, this patch introduces a new ioctl that can be used
for this purpose. That way features we don't want in all guests or
userspace configurations can just not be enabled and we're good.
Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

KVM: Document KVM_SET_TSS_ADDR · 8a5416db

Authored by Avi Kivity on Mar 25, 2010

Signed-off-by: Avi Kivity <avi@redhat.com>

KVM: Document KVM_SET_USER_MEMORY_REGION · 0f2d8f4d

Authored by Avi Kivity on Mar 25, 2010

Acked-by: Pekka Enberg <penberg@cs.helsinki.fi>
Signed-off-by: Avi Kivity <avi@redhat.com>

25 Apr, 2010 3 commits

KVM: x86: Add support for saving&restoring debug registers · a1efbe77

Authored by Jan Kiszka on Feb 15, 2010

So far user space was not able to save and restore debug registers for
migration or after reset. Plug this hole.
Signed-off-by: Jan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

KVM: x86: Save&restore interrupt shadow mask · 48005f64

Authored by Jan Kiszka on Feb 19, 2010

The interrupt shadow created by STI or MOV-SS-like operations is part of
the VCPU state and must be preserved across migration. Transfer it in
the spare padding field of kvm_vcpu_events.interrupt.

As a side effect we now have to make vmx_set_interrupt_shadow robust
against both shadow types being set. Give MOV SS a higher priority and
skip STI in that case to avoid that VMX throws a fault on next entry.
Signed-off-by: Jan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
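The priority rule described in this message (MOV SS wins, STI is skipped when both are requested) can be sketched in isolation. This is a stand-alone model rather than the actual `vmx_set_interrupt_shadow`: the function name `model_set_interrupt_shadow` and all bit values below are assumptions chosen for the example.

```c
#include <assert.h>
#include <stdint.h>

/* Shadow bits as requested by user space (values assumed for this sketch). */
#define SHADOW_INT_MOV_SS     0x01u
#define SHADOW_INT_STI        0x02u

/* Hardware interruptibility-state bits (values assumed for this sketch). */
#define HW_INTR_STATE_STI     0x1u
#define HW_INTR_STATE_MOV_SS  0x2u

/*
 * Translate a requested shadow mask into a hardware state that never has
 * both blocking bits set at once. Other bits in hw_state are preserved.
 */
uint32_t model_set_interrupt_shadow(uint32_t hw_state, uint32_t mask)
{
    /* Only the two known shadow types are meaningful in this model. */
    assert((mask & ~(SHADOW_INT_MOV_SS | SHADOW_INT_STI)) == 0);

    hw_state &= ~(HW_INTR_STATE_STI | HW_INTR_STATE_MOV_SS);
    if (mask & SHADOW_INT_MOV_SS)
        hw_state |= HW_INTR_STATE_MOV_SS;  /* MOV SS takes priority */
    else if (mask & SHADOW_INT_STI)
        hw_state |= HW_INTR_STATE_STI;     /* used only if MOV SS is absent */
    return hw_state;
}
```

Using `else if` rather than two independent `if` statements is what makes the function robust when a restored state carries both shadow types.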


KVM: add doc note about PIO/MMIO completion API · 67961344

Authored by Marcelo Tosatti on Feb 13, 2010

Document that partially emulated instructions leave the guest state
inconsistent, and that the kernel will complete operations before
checking for pending signals.
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

01 Mar, 2010 1 commit

KVM: trivial document fixes · 2044892d

Authored by Wu Fengguang on Dec 24, 2009

Signed-off-by: Wu Fengguang <fengguang.wu@intel.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

27 Dec, 2009 1 commit

KVM: x86: Extend KVM_SET_VCPU_EVENTS with selective updates · dab4b911

Authored by Jan Kiszka on Dec 06, 2009

User space may not want to overwrite asynchronously changing VCPU event
states on write-back. So allow to skip nmi.pending and sipi_vector by
setting corresponding bits in the flags field of kvm_vcpu_events.

[avi: advertise the bits in KVM_GET_VCPU_EVENTS]
Signed-off-by: Jan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
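The selective write-back can be pictured with a small model in which each optional field is copied only when its flag bit is set. The structs here are heavily simplified stand-ins for `kvm_vcpu_events` and the in-kernel VCPU state, and the names `model_vcpu_events`, `model_vcpu_state`, `model_set_vcpu_events` and the flag values are assumptions for illustration only.

```c
#include <assert.h>
#include <stdint.h>

/* Flag bits selecting which fields to write back (values assumed). */
#define VCPUEVENT_VALID_NMI_PENDING  0x1u
#define VCPUEVENT_VALID_SIPI_VECTOR  0x2u

/* Simplified stand-in for the structure passed in from user space. */
struct model_vcpu_events {
    uint8_t  nmi_pending;
    uint8_t  nmi_masked;
    uint32_t sipi_vector;
    uint32_t flags;
};

/* Simplified stand-in for the state held inside the hypervisor. */
struct model_vcpu_state {
    uint8_t  nmi_pending;
    uint8_t  nmi_masked;
    uint32_t sipi_vector;
};

/* Returns 0 on success, -1 if unknown flag bits are set. */
int model_set_vcpu_events(struct model_vcpu_state *vcpu,
                          const struct model_vcpu_events *ev)
{
    assert(vcpu && ev);
    if (ev->flags & ~(VCPUEVENT_VALID_NMI_PENDING | VCPUEVENT_VALID_SIPI_VECTOR))
        return -1;

    /* Fields without a flag are always written back. */
    vcpu->nmi_masked = ev->nmi_masked;

    /* Asynchronously changing fields are written only on request. */
    if (ev->flags & VCPUEVENT_VALID_NMI_PENDING)
        vcpu->nmi_pending = ev->nmi_pending;
    if (ev->flags & VCPUEVENT_VALID_SIPI_VECTOR)
        vcpu->sipi_vector = ev->sipi_vector;
    return 0;
}
```

A caller that leaves `flags` at zero keeps whatever `nmi_pending` and `sipi_vector` values arrived since it last read the state, which is the behavior the commit message asks for.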


03 Dec, 2009 3 commits

KVM: x86: Add KVM_GET/SET_VCPU_EVENTS · 3cfc3092

Authored by Jan Kiszka on Nov 12, 2009

This new IOCTL exports all yet user-invisible states related to
exceptions, interrupts, and NMIs. Together with appropriate user space
changes, this fixes sporadic problems of vmsave/restore, live migration
and system reset.

[avi: future-proof abi by adding a flags field]
Signed-off-by: Jan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

KVM: allow userspace to adjust kvmclock offset · afbcf7ab

Authored by Glauber Costa on Oct 16, 2009

When we migrate a kvm guest that uses pvclock between two hosts, we may
suffer a large skew. This is because there can be significant differences
between the monotonic clock of the hosts involved. When a new host with
a much larger monotonic time starts running the guest, the view of time
will be significantly impacted.

Situation is much worse when we do the opposite, and migrate to a host with
a smaller monotonic clock.

This proposed ioctl will allow userspace to inform us what is the monotonic
clock value in the source host, so we can keep the time skew short, and
more importantly, never goes backwards. Userspace may also need to trigger
the current data, since from the first migration onwards, it won't be
reflected by a simple call to clock_gettime() anymore.

[marcelo: future-proof abi with a flags field]
[jan: fix KVM_GET_CLOCK by clearing flags field instead of checking it]
Signed-off-by: Glauber Costa <glommer@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
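One way to picture the offset adjustment is a model in which the guest-visible clock is the host monotonic clock plus a per-VM offset, and setting the clock simply recomputes that offset. The sketch below is an assumption-laden illustration, not the kernel implementation: `model_kvmclock`, `model_get_clock`, and `model_set_clock` are hypothetical names, and the host monotonic time is passed in as a parameter so the arithmetic can be followed by hand.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical per-VM state: guest clock = host monotonic + offset. */
struct model_kvmclock {
    int64_t offset_ns;
};

/* Read the guest-visible clock for a given host monotonic time. */
uint64_t model_get_clock(const struct model_kvmclock *c, uint64_t host_now_ns)
{
    assert(c);
    return (uint64_t)((int64_t)host_now_ns + c->offset_ns);
}

/*
 * Set the guest-visible clock: choose the offset so that the guest reads
 * guest_clock_ns at host time host_now_ns. The offset may be negative
 * when the destination host has the larger monotonic clock.
 */
void model_set_clock(struct model_kvmclock *c, uint64_t guest_clock_ns,
                     uint64_t host_now_ns)
{
    assert(c);
    c->offset_ns = (int64_t)guest_clock_ns - (int64_t)host_now_ns;
}
```

In this model a migration reads the clock on the source and sets it on the destination. The guest then continues from the saved value regardless of which host has the larger monotonic clock, so its time does not jump backwards.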


KVM: Xen PV-on-HVM guest support · ffde22ac

Authored by Ed Swierk on Oct 15, 2009

Support for Xen PV-on-HVM guests can be implemented almost entirely in
userspace, except for handling one annoying MSR that maps a Xen
hypercall blob into guest address space.

A generic mechanism to delegate MSR writes to userspace seems overkill
and risks encouraging similar MSR abuse in the future.  Thus this patch
adds special support for the Xen HVM MSR.

I implemented a new ioctl, KVM_XEN_HVM_CONFIG, that lets userspace tell
KVM which MSR the guest will write to, as well as the starting address
and size of the hypercall blobs (one each for 32-bit and 64-bit) that
userspace has loaded from files.  When the guest writes to the MSR, KVM
copies one page of the blob from userspace to the guest.

I've tested this patch with a hacked-up version of Gerd's userspace
code, booting a number of guests (CentOS 5.3 i386 and x86_64, and
FreeBSD 8.0-RC1 amd64) and exercising PV network and block devices.

[jan: fix i386 build warning]
[avi: future proof abi with a flags field]
Signed-off-by: Ed Swierk <eswierk@aristanetworks.com>
Signed-off-by: Jan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

10 Sep, 2009 2 commits

KVM: Document KVM_CAP_IRQCHIP · 5dadbfd6

Authored by Avi Kivity on Aug 23, 2009

Signed-off-by: Avi Kivity <avi@redhat.com>

KVM: Document basic API · 9c1b96e3

Authored by Avi Kivity on Jun 09, 2009

Document the basic API corresponding to the 2.6.22 release.
Signed-off-by: Avi Kivity <avi@redhat.com>

OpenHarmony / kernel_linux · last synced more than 3 years ago
