提交 · 9647c14c98687d0abf5197e74b9d1448ab6ebb95 · openeuler / raspberrypi-kernel

30 1月, 2008 40 次提交

KVM: MMU: Keep a reverse mapping of non-writable translations · 9647c14c

由 Izik Eidus 提交于 10月 16, 2007

The current kvm mmu only reverse maps writable translation.  This is used
to write-protect a page in case it becomes a pagetable.

But with swapping support, we need a reverse mapping of read-only pages as
well:  when we evict a page, we need to remove any mapping to it, whether
writable or not.
Signed-off-by: NIzik Eidus <izike@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

9647c14c

I
KVM: MMU: Add rmap_next(), a helper for walking kvm rmaps · 98348e95
由 Izik Eidus 提交于 10月 16, 2007
```
Signed-off-by: NIzik Eidus <izike@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
98348e95

KVM: x86 emulator: cmc, clc, cli, sti · b284be57

由 Nitin A Kamble 提交于 10月 16, 2007

Instruction: cmc, clc, cli, sti
opcodes: 0xf5, 0xf8, 0xfa, 0xfb respectively.

[avi: fix reference to EFLG_IF which is not defined anywhere]
Signed-off-by: NNitin A Kamble <nitin.a.kamble@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

b284be57

KVM: MMU: Simplify page table walker · 42bf3f0a

由 Avi Kivity 提交于 10月 17, 2007

Simplify the walker level loop not to carry so much information from one
loop to the next. In addition to being complex, this made kmap_atomic()
critical sections difficult to manage.

As a result of this change, kmap_atomic() sections are limited to actually
touching the guest pte, which allows the other functions called from the
walker to do sleepy operations. This will happen when we enable swapping.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

42bf3f0a

KVM: x86 emulator: Implement emulation of instruction: inc & dec · d77a2507

由 Nitin A Kamble 提交于 10月 12, 2007

Instructions:
	inc r16/r32 (opcode 0x40-0x47)
	dec r16/r32 (opcode 0x48-0x4f)
Signed-off-by: NNitin A Kamble <nitin.a.kamble@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

d77a2507

KVM: Rename KVM_TLB_FLUSH to KVM_REQ_TLB_FLUSH · 3176bc3e

由 Avi Kivity 提交于 10月 16, 2007

We now have a new namespace, KVM_REQ_*, for bits in vcpu->requests.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

3176bc3e

KVM: Move apic timer interrupt backlog processing to common code · ab6ef34b

由 Avi Kivity 提交于 10月 16, 2007

Beside the obvious goodness of making code more common, this prevents
a livelock with the next patch which moves interrupt injection out of the
critical section.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

ab6ef34b

KVM: Add some \n in ioapic_debug() · e25e3ed5

由 Laurent Vivier 提交于 10月 12, 2007

Add new-line at end of debug strings.
Signed-off-by: NLaurent Vivier <Laurent.Vivier@bull.net>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

e25e3ed5

KVM: apic round robin cleanup · e4d47f40

由 Qing He 提交于 9月 24, 2007

If no apic is enabled in the bitmap of an interrupt delivery with delivery
mode of lowest priority, a warning should be reported rather than select
a fallback vcpu
Signed-off-by: NQing He <qing.he@intel.com>
Signed-off-by: NEddie (Yaozu) Dong <eddie.dong@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

e4d47f40

KVM: Portability: split kvm_vcpu_ioctl · 313a3dc7

由 Carsten Otte 提交于 10月 11, 2007

This patch splits kvm_vcpu_ioctl into archtecture independent parts, and
x86 specific parts which go to kvm_arch_vcpu_ioctl in x86.c.

Common ioctls for all architectures are:
KVM_RUN, KVM_GET/SET_(S-)REGS, KVM_TRANSLATE, KVM_INTERRUPT,
KVM_DEBUG_GUEST, KVM_SET_SIGNAL_MASK, KVM_GET/SET_FPU
Note that some PPC chips don't have an FPU, so we might need an #ifdef
around KVM_GET/SET_FPU one day.

x86 specific ioctls are:
KVM_GET/SET_LAPIC, KVM_SET_CPUID, KVM_GET/SET_MSRS

An interresting aspect is vcpu_load/vcpu_put. We now have a common
vcpu_load/put which does the preemption stuff, and an architecture
specific kvm_arch_vcpu_load/put. In the x86 case, this one calls the
vmx/svm function defined in kvm_x86_ops.
Signed-off-by: NCarsten Otte <cotte@de.ibm.com>
Reviewed-by: NChristian Borntraeger <borntraeger@de.ibm.com>
Reviewed-by: NChristian Ehrhardt <ehrhardt@linux.vnet.ibm.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

313a3dc7

KVM: MMU: When updating the dirty bit, inform the mmu about it · c4fcc272

由 Avi Kivity 提交于 10月 11, 2007

Since the mmu uses different shadow pages for dirty large pages and clean
large pages, this allows the mmu to drop ptes that are now invalid.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

c4fcc272

A
KVM: MMU: Move dirty bit updates to a separate function · 5df34a86
由 Avi Kivity 提交于 10月 11, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
5df34a86
A
KVM: MMU: Instantiate real-mode shadows as user writable shadows · 6bfccdc9
由 Avi Kivity 提交于 10月 11, 2007
```
This is consistent with real-mode permissions.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
6bfccdc9

KVM: MMU: Disable write access on clean large pages · cc70e737

由 Avi Kivity 提交于 10月 11, 2007

By forcing clean huge pages to be read-only, we have separate roles
for the shadow of a clean large page and the shadow of a dirty large
page.  This is necessary because different ptes will be instantiated
for the two cases, even for read faults.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

cc70e737

KVM: MMU: Fix nx access bit for huge pages · c22e3514

由 Avi Kivity 提交于 10月 11, 2007

We must set the bit before the shift, otherwise the wrong bit gets set.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

c22e3514

KVM: Move guest pte dirty bit management to the guest pagetable walker · e3c5e7ec

由 Avi Kivity 提交于 10月 11, 2007

This is more consistent with the accessed bit management, and makes the dirty
bit available earlier for other purposes.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

e3c5e7ec

KVM: MMU: More struct kvm_vcpu -> struct kvm cleanups · 4a4c9924

由 Anthony Liguori 提交于 10月 10, 2007

This time, the biggest change is gpa_to_hpa. The translation of GPA to HPA does
not depend on the VCPU state unlike GVA to GPA so there's no need to pass in
the kvm_vcpu.
Signed-off-by: NAnthony Liguori <aliguori@us.ibm.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

4a4c9924

KVM: MMU: Clean up MMU functions to take struct kvm when appropriate · f67a46f4

由 Anthony Liguori 提交于 10月 10, 2007

Some of the MMU functions take a struct kvm_vcpu even though they affect all
VCPUs.  This patch cleans up some of them to instead take a struct kvm.  This
makes things a bit more clear.

The main thing that was confusing me was whether certain functions need to be
called on all VCPUs.
Signed-off-by: NAnthony Liguori <aliguori@us.ibm.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

f67a46f4

C
KVM: Move x86 msr handling to new files x86.[ch] · 043405e1
由 Carsten Otte 提交于 10月 10, 2007
```
Signed-off-by: NCarsten Otte <cotte@de.ibm.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
043405e1

KVM: Support assigning userspace memory to the guest · 6fc138d2

由 Izik Eidus 提交于 10月 09, 2007

Instead of having the kernel allocate memory to the guest, let userspace
allocate it and pass the address to the kernel.

This is required for s390 support, but also enables features like memory
sharing and using hugetlbfs backed memory.
Signed-off-by: NIzik Eidus <izike@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

6fc138d2

KVM: CodingStyle cleanup · d77c26fc

由 Mike Day 提交于 10月 08, 2007

Signed-off-by: NMike D. Day <ncmike@ncultra.org>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

d77c26fc

KVM: Remove gratuitous casts from lapic.c · 7e620d16

由 Rusty Russell 提交于 10月 08, 2007

Since vcpu->apic is of the correct type, there's not need to cast.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

7e620d16

KVM: Hoist kvm_create_lapic() into kvm_vcpu_init() · 76fafa5e

由 Rusty Russell 提交于 10月 08, 2007

Move kvm_create_lapic() into kvm_vcpu_init(), rather than having svm
and vmx do it.  And make it return the error rather than a fairly
random -ENOMEM.

This also solves the problem that neither svm.c nor vmx.c actually
handles the error path properly.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

76fafa5e

KVM: Add kvm_free_lapic() to pair with kvm_create_lapic() · d589444e

由 Rusty Russell 提交于 10月 08, 2007

Instead of the asymetry of kvm_free_apic, implement kvm_free_lapic().
And guess what?  I found a minor bug: we don't need to hrtimer_cancel()
from kvm_main.c, because we do that in kvm_free_apic().

Also:
1) kvm_vcpu_uninit should be the reverse order from kvm_vcpu_init.
2) Don't set apic->regs_page to zero before freeing apic.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

d589444e

KVM: Allow dynamic allocation of the mmu shadow cache size · 82ce2c96

由 Izik Eidus 提交于 10月 02, 2007

The user is now able to set how many mmu pages will be allocated to the guest.
Signed-off-by: NIzik Eidus <izike@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

82ce2c96

I
KVM: Add general accessors to read and write guest memory · 195aefde
由 Izik Eidus 提交于 10月 01, 2007
```
Signed-off-by: NIzik Eidus <izike@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
195aefde

KVM: Remove the usage of page->private field by rmap · 290fc38d

由 Izik Eidus 提交于 9月 27, 2007

When kvm uses user-allocated pages in the future for the guest, we won't
be able to use page->private for rmap, since page->rmap is reserved for
the filesystem.  So we move the rmap base pointers to the memory slot.

A side effect of this is that we need to store the gfn of each gpte in
the shadow pages, since the memory slot is addressed by gfn, instead of
hfn like struct page.
Signed-off-by: NIzik Eidus <izik@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

290fc38d

KVM: VMX: Simplify vcpu_clear() · f566e09f

由 Avi Kivity 提交于 9月 30, 2007

Now that smp_call_function_single() knows how to call a function on the
current cpu, there's no need to check explicitly.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

f566e09f

A
KVM: VMX: Don't clear the vmcs if the vcpu is not loaded on any processor · eae5ecb5
由 Avi Kivity 提交于 9月 30, 2007
```
Noted by Eddie Dong.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
eae5ecb5

KVM: x86 emulator: Any legacy prefix after a REX prefix nullifies its effect · b4c6abfe

由 Laurent Vivier 提交于 9月 25, 2007

This patch modifies the management of REX prefix according behavior
I saw in Xen 3.1. In Xen, this modification has been introduced by
Jan Beulich.

http://lists.xensource.com/archives/html/xen-changelog/2007-01/msg00081.htmlSigned-off-by: NLaurent Vivier <Laurent.Vivier@bull.net>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

b4c6abfe

KVM: Purify x86_decode_insn() error case management · a22436b7

由 Laurent Vivier 提交于 9月 24, 2007

The only valid case is on protected page access, other cases are errors.
Signed-off-by: NLaurent Vivier <Laurent.Vivier@bull.net>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

a22436b7

KVM: x86_emulator: no writeback for bt · e4f8e039

由 Qing He 提交于 9月 24, 2007

Signed-off-by: NQing He <qing.he@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

e4f8e039

KVM: x86 emulator: Remove no_wb, use dst.type = OP_NONE instead · a01af5ec

由 Laurent Vivier 提交于 9月 24, 2007

Remove no_wb, use dst.type = OP_NONE instead, idea stollen from xen-3.1
Signed-off-by: NLaurent Vivier <Laurent.Vivier@bull.net>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

a01af5ec

KVM: x86 emulator: remove _eflags and use directly ctxt->eflags. · 05f086f8

由 Laurent Vivier 提交于 9月 24, 2007

Remove _eflags and use directly ctxt->eflags. Caching eflags is not needed as
it is restored to vcpu by kvm_main.c:emulate_instruction() from ctxt->eflags
only if emulation doesn't fail.
Signed-off-by: NLaurent Vivier <Laurent.Vivier@bull.net>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

05f086f8

KVM: x86 emulator: split some decoding into functions for readability · 8cdbd2c9

由 Laurent Vivier 提交于 9月 24, 2007

To improve readability, move push, writeback, and grp 1a/2/3/4/5/9 emulation
parts into functions.
Signed-off-by: NLaurent Vivier <Laurent.Vivier@bull.net>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

8cdbd2c9

KVM: MMU: Ignore reserved bits in cr3 in non-pae mode · 21764863

由 Ryan Harper 提交于 9月 18, 2007

This patch removes the fault injected when the guest attempts to set reserved
bits in cr3.  X86 hardware doesn't generate a fault when setting reserved bits.
The result of this patch is that vmware-server, running within a kvm guest,
boots and runs memtest from an iso.
Signed-off-by: NRyan Harper <ryanh@us.ibm.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

21764863

KVM: MMU: Make flooding detection work when guest page faults are bypassed · 12b7d28f

由 Avi Kivity 提交于 9月 23, 2007

When we allow guest page faults to reach the guests directly, we lose
the fault tracking which allows us to detect demand paging. So we provide
an alternate mechnism by clearing the accessed bit when we set a pte, and
checking it later to see if the guest actually used it.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

12b7d28f

KVM: Allow not-present guest page faults to bypass kvm · c7addb90

由 Avi Kivity 提交于 9月 16, 2007

There are two classes of page faults trapped by kvm:
 - host page faults, where the fault is needed to allow kvm to install
   the shadow pte or update the guest accessed and dirty bits
 - guest page faults, where the guest has faulted and kvm simply injects
   the fault back into the guest to handle

The second class, guest page faults, is pure overhead.  We can eliminate
some of it on vmx using the following evil trick:
 - when we set up a shadow page table entry, if the corresponding guest pte
   is not present, set up the shadow pte as not present
 - if the guest pte _is_ present, mark the shadow pte as present but also
   set one of the reserved bits in the shadow pte
 - tell the vmx hardware not to trap faults which have the present bit clear

With this, normal page-not-present faults go directly to the guest,
bypassing kvm entirely.

Unfortunately, this trick only works on Intel hardware, as AMD lacks a
way to discriminate among page faults based on error code.  It is also
a little risky since it uses reserved bits which might become unreserved
in the future, so a module parameter is provided to disable it.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

c7addb90

KVM: VMX: Further reduce efer reloads · 51c6cf66

由 Avi Kivity 提交于 8月 29, 2007

KVM avoids reloading the efer msr when the difference between the guest
and host values consist of the long mode bits (which are switched by
hardware) and the NX bit (which is emulated by the KVM MMU).

This patch also allows KVM to ignore SCE (syscall enable) when the guest
is running in 32-bit mode.  This is because the syscall instruction is
not available in 32-bit mode on Intel processors, so the SCE bit is
effectively meaningless.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

51c6cf66

KVM: Call x86_decode_insn() only when needed · 3427318f

由 Laurent Vivier 提交于 9月 18, 2007

Move emulate_ctxt to kvm_vcpu to keep emulate context when we exit from kvm
module. Call x86_decode_insn() only when needed. Modify x86_emulate_insn() to
not modify the context if it must be re-entered.
Signed-off-by: NLaurent Vivier <Laurent.Vivier@bull.net>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

3427318f