提交 · 4a4c99248713e878e1e2880015d01049aec805f3 · openeuler / raspberrypi-kernel

30 1月, 2008 6 次提交

KVM: MMU: More struct kvm_vcpu -> struct kvm cleanups · 4a4c9924

由 Anthony Liguori 提交于 10月 10, 2007

This time, the biggest change is gpa_to_hpa. The translation of GPA to HPA does
not depend on the VCPU state unlike GVA to GPA so there's no need to pass in
the kvm_vcpu.
Signed-off-by: NAnthony Liguori <aliguori@us.ibm.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

4a4c9924

KVM: MMU: Clean up MMU functions to take struct kvm when appropriate · f67a46f4

由 Anthony Liguori 提交于 10月 10, 2007

Some of the MMU functions take a struct kvm_vcpu even though they affect all
VCPUs.  This patch cleans up some of them to instead take a struct kvm.  This
makes things a bit more clear.

The main thing that was confusing me was whether certain functions need to be
called on all VCPUs.
Signed-off-by: NAnthony Liguori <aliguori@us.ibm.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

f67a46f4

KVM: CodingStyle cleanup · d77c26fc

由 Mike Day 提交于 10月 08, 2007

Signed-off-by: NMike D. Day <ncmike@ncultra.org>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

d77c26fc

KVM: Remove the usage of page->private field by rmap · 290fc38d

由 Izik Eidus 提交于 9月 27, 2007

When kvm uses user-allocated pages in the future for the guest, we won't
be able to use page->private for rmap, since page->rmap is reserved for
the filesystem.  So we move the rmap base pointers to the memory slot.

A side effect of this is that we need to store the gfn of each gpte in
the shadow pages, since the memory slot is addressed by gfn, instead of
hfn like struct page.
Signed-off-by: NIzik Eidus <izik@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

290fc38d

KVM: MMU: Make flooding detection work when guest page faults are bypassed · 12b7d28f

由 Avi Kivity 提交于 9月 23, 2007

When we allow guest page faults to reach the guests directly, we lose
the fault tracking which allows us to detect demand paging. So we provide
an alternate mechnism by clearing the accessed bit when we set a pte, and
checking it later to see if the guest actually used it.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

12b7d28f

KVM: Allow not-present guest page faults to bypass kvm · c7addb90

由 Avi Kivity 提交于 9月 16, 2007

There are two classes of page faults trapped by kvm:
 - host page faults, where the fault is needed to allow kvm to install
   the shadow pte or update the guest accessed and dirty bits
 - guest page faults, where the guest has faulted and kvm simply injects
   the fault back into the guest to handle

The second class, guest page faults, is pure overhead.  We can eliminate
some of it on vmx using the following evil trick:
 - when we set up a shadow page table entry, if the corresponding guest pte
   is not present, set up the shadow pte as not present
 - if the guest pte _is_ present, mark the shadow pte as present but also
   set one of the reserved bits in the shadow pte
 - tell the vmx hardware not to trap faults which have the present bit clear

With this, normal page-not-present faults go directly to the guest,
bypassing kvm entirely.

Unfortunately, this trick only works on Intel hardware, as AMD lacks a
way to discriminate among page faults based on error code.  It is also
a little risky since it uses reserved bits which might become unreserved
in the future, so a module parameter is provided to disable it.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

c7addb90

13 10月, 2007 3 次提交

KVM: Rename kvm_arch_ops to kvm_x86_ops · cbdd1bea

由 Christian Ehrhardt 提交于 9月 09, 2007

This patch just renames the current (misnamed) _arch namings to _x86 to
ensure better readability when a real arch layer takes place.
Signed-off-by: NChristian Ehrhardt <ehrhardt@linux.vnet.ibm.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

cbdd1bea

KVM: Move gfn_to_page out of kmap/unmap pairs · fe551881

由 Shaohua Li 提交于 7月 23, 2007

gfn_to_page might sleep with swap support. Move it out of the kmap calls.
Signed-off-by: NShaohua Li <shaohua.li@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

fe551881

KVM: Use standard CR3 flags, tighten checking · f802a307

由 Rusty Russell 提交于 7月 17, 2007

The kernel now has asm/cpu-features.h: use those macros instead of inventing
our own.

Also spell out definition of CR3_RESEVED_BITS, fix spelling and
tighten it for the non-PAE case.
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

f802a307

21 7月, 2007 1 次提交

KVM: MMU: Store nx bit for large page shadows · d55e2cb2

由 Avi Kivity 提交于 7月 10, 2007

We need to distinguish between large page shadows which have the nx bit set
and those which don't. The problem shows up when booting a newer smp Linux
kernel, where the trampoline page (which is in real mode, which uses the
same shadow pages as large pages) is using the same mapping as a kernel data
page, which is mapped using nx, causing kvm to spin on that page.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

d55e2cb2

16 7月, 2007 16 次提交

KVM: MMU: Remove unused large page marker · bd2b2baa

由 Avi Kivity 提交于 5月 31, 2007

This has not been used for some time, as the same information is available
in the page header.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

bd2b2baa

KVM: MMU: Don't cache guest access bits in the shadow page table · b64b3763

由 Avi Kivity 提交于 5月 31, 2007

This was once used to avoid accessing the guest pte when upgrading
the shadow pte from read-only to read-write.  But usually we need
to set the guest pte dirty or accessed bits anyway, so this wasn't
really exploited.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

b64b3763

KVM: MMU: Simpify accessed/dirty/present/nx bit handling · fd97dc51

由 Avi Kivity 提交于 5月 31, 2007

Always set the accessed and dirty bit (since having them cleared causes
a read-modify-write cycle), always set the present bit, and copy the
nx bit from the guest.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

fd97dc51

KVM: MMU: Remove cr0.wp tricks · 4436d466

由 Avi Kivity 提交于 5月 31, 2007

No longer needed as we do everything in one place.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

4436d466

A
KVM: MMU: Make setting shadow ptes atomic on i386 · e663ee64
由 Avi Kivity 提交于 5月 31, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
e663ee64

KVM: Make shadow pte updates atomic · 0d551bb6

由 Avi Kivity 提交于 5月 31, 2007

With guest smp, a second vcpu might see partial updates when the first
vcpu services a page fault.  So delay all updates until we have figured
out what the pte should look like.

Note that on i386, this is still not completely atomic as a 64-bit write
will be split into two on a 32-bit machine.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

0d551bb6

A
KVM: Move shadow pte modifications from set_pte/set_pde to set_pde_common() · a18de5a4
由 Avi Kivity 提交于 5月 31, 2007
```
We want all shadow pte modifications in one place.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
a18de5a4

KVM: MMU: Fold fix_write_pf() into set_pte_common() · 97a0a01e

由 Avi Kivity 提交于 5月 31, 2007

This prevents some work from being performed twice, and, more importantly,
reduces the number of places where we modify shadow ptes.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

97a0a01e

A
KVM: MMU: Fold fix_read_pf() into set_pte_common() · 63b1ad24
由 Avi Kivity 提交于 5月 31, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
63b1ad24

KVM: MMU: Pass the guest pde to set_pte_common · 6598c8b2

由 Avi Kivity 提交于 5月 31, 2007

We will need the accessed bit (in addition to the dirty bit) and
also write access (for setting the dirty bit) in a future patch.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

6598c8b2

A
KVM: MMU: Move set_pte_common() to pte width dependent code · e60d75ea
由 Avi Kivity 提交于 5月 30, 2007
```
In preparation of some modifications.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
e60d75ea
A
KVM: MMU: Simplify fetch() a little bit · ef0197e8
由 Avi Kivity 提交于 5月 30, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
ef0197e8
E
KVM: Use symbolic constants instead of magic numbers · 8d728203
由 Eddie Dong 提交于 5月 29, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
8d728203
A
KVM: MMU: Store shadow page tables as kernel virtual addresses, not physical · 47ad8e68
由 Avi Kivity 提交于 5月 06, 2007
```
Simpifies things a bit.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
47ad8e68

KVM: Update shadow pte on write to guest pte · 0028425f

由 Avi Kivity 提交于 5月 01, 2007

A typical demand page/copy on write pattern is:

- page fault on vaddr
- kvm propagates fault to guest
- guest handles fault, updates pte
- kvm traps write, clears shadow pte, resumes guest
- guest returns to userspace, re-faults on same vaddr
- kvm installs shadow pte, resumes guest
- guest continues

So, three vmexits for a single guest page fault.  But if instead of clearing
the page table entry, we update to correspond to the value that the guest
has just written, we eliminate the third vmexit.

This patch does exactly that, reducing kbuild time by about 10%.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

0028425f

KVM: Reduce misfirings of the fork detector · a25f7e1f

由 Avi Kivity 提交于 4月 30, 2007

The kvm mmu tries to detects forks by looking for repeated writes to a
page table. If it sees a fork, it unshadows the page table so the page
table copying can proceed at native speed instead of being emulated.

However, the detector also triggered on simple demand paging access patterns:
a linear walk of memory would of course cause repeated writes to the same
pagetable page, causing it to unshadow prematurely.

Fix by resetting the fork detector if we detect a demand fault.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

a25f7e1f

03 5月, 2007 3 次提交

KVM: Per-vcpu statistics · 1165f5fe

由 Avi Kivity 提交于 4月 19, 2007

Make the exit statistics per-vcpu instead of global.  This gives a 3.5%
boost when running one virtual machine per core on my two socket dual core
(4 cores total) machine.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

1165f5fe

KVM: MMU: Fix hugepage pdes mapping same physical address with different access · d28c6cfb

由 Avi Kivity 提交于 3月 23, 2007

The kvm mmu keeps a shadow page for hugepage pdes; if several such pdes map
the same physical address, they share the same shadow page. This is a fairly
common case (kernel mappings on i386 nonpae Linux, for example).

However, if the two pdes map the same memory but with different permissions, kvm
will happily use the cached shadow page. If the access through the more
permissive pde will occur after the access to the strict pde, an endless pagefault
loop will be generated and the guest will make no progress.

Fix by making the access permissions part of the cache lookup key.

The fix allows Xen pae to boot on kvm and run guest domains.

Thanks to Jeremy Fitzhardinge for reporting the bug and testing the fix.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

d28c6cfb

KVM: MMU: Remove unnecessary check for pdptr access · ca5aac1f

由 Avi Kivity 提交于 3月 20, 2007

We already special case the pdptr access, so no need to check it again.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

ca5aac1f

04 3月, 2007 2 次提交

A
KVM: Cosmetics · d27d4aca
由 Avi Kivity 提交于 2月 19, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
d27d4aca

KVM: mmu: add missing dirty page tracking cases · bf3f8e86

由 Avi Kivity 提交于 2月 19, 2007

We fail to mark a page dirty in three cases:

- setting the accessed bit in a pte
- setting the dirty bit in a pte
- emulating a write into a pagetable

This fix adds the missing cases.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

bf3f8e86

13 2月, 2007 1 次提交

[PATCH] kvm: Fix gva_to_gpa() · e119d117

由 Avi Kivity 提交于 2月 12, 2007

gva_to_gpa() needs to be updated to the new walk_addr() calling convention,
otherwise it may oops under some circumstances.

Use the opportunity to remove all the code duplication in gva_to_gpa(), which
essentially repeats the calculations in walk_addr().
Signed-off-by: NAvi Kivity <avi@qumranet.com>
Cc: Ingo Molnar <mingo@elte.hu>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

e119d117

27 1月, 2007 2 次提交

[PATCH] KVM: MMU: Report nx faults to the guest · 73b1087e

由 Avi Kivity 提交于 1月 26, 2007

With the recent guest page fault change, we perform access checks on our
own instead of relying on the cpu.  This means we have to perform the nx
checks as well.

Software like the google toolbar on windows appears to rely on this
somehow.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

73b1087e

[PATCH] KVM: MMU: Perform access checks in walk_addr() · 7993ba43

由 Avi Kivity 提交于 1月 26, 2007

Check pte permission bits in walk_addr(), instead of scattering the checks all
over the code.  This has the following benefits:

1. We no longer set the accessed bit for accessed which fail permission checks.
2. Setting the accessed bit is simplified.
3. Under some circumstances, we used to pretend a page fault was fixed when
   it would actually fail the access checks.  This caused an unnecessary
   vmexit.
4. The error code for guest page faults is now correct.

The fix helps netbsd further along booting, and allows kvm to pass the new mmu
testsuite.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

7993ba43

23 1月, 2007 1 次提交

[PATCH] KVM: fix bogus pagefault on writable pages · fc3dffe1

由 Avi Kivity 提交于 1月 22, 2007

If a page is marked as dirty in the guest pte, set_pte_common() can set the
writable bit on newly-instantiated shadow pte.  This optimization avoids
a write fault after the initial read fault.

However, if a write fault instantiates the pte, fix_write_pf() incorrectly
reports the fault as a guest page fault, and the guest oopses on what appears
to be a correctly-mapped page.

Fix is to detect the condition and only report a guest page fault on a user
access to a kernel page.

With the fix, a kvm guest can survive a whole night of running the kernel
hacker's screensaver (make -j9 in a loop).
Signed-off-by: NAvi Kivity <avi@qumranet.com>
Cc: Ingo Molnar <mingo@elte.hu>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

fc3dffe1

06 1月, 2007 5 次提交

[PATCH] KVM: MMU: Add missing dirty bit · 760db773

由 Avi Kivity 提交于 1月 05, 2007

If we emulate a write, we fail to set the dirty bit on the guest pte, leading
the guest to believe the page is clean, and thus lose data.  Bad.

Fix by setting the guest pte dirty bit under such conditions.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

760db773

[PATCH] KVM: MMU: add audit code to check mappings, etc are correct · 37a7d8b0

由 Avi Kivity 提交于 1月 05, 2007

Signed-off-by: NAvi Kivity <avi@qumranet.com>
Acked-by: NIngo Molnar <mingo@elte.hu>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

37a7d8b0

[PATCH] KVM: MMU: Detect oom conditions and propagate error to userspace · e2dec939

由 Avi Kivity 提交于 1月 05, 2007

Signed-off-by: NAvi Kivity <avi@qumranet.com>
Acked-by: NIngo Molnar <mingo@elte.hu>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

e2dec939

[PATCH] KVM: MMU: Replace atomic allocations by preallocated objects · 714b93da

由 Avi Kivity 提交于 1月 05, 2007

The mmu sometimes needs memory for reverse mapping and parent pte chains.
however, we can't allocate from within the mmu because of the atomic context.

So, move the allocations to a central place that can be executed before the
main mmu machinery, where we can bail out on failure before any damage is
done.

(error handling is deffered for now, but the basic structure is there)
Signed-off-by: NAvi Kivity <avi@qumranet.com>
Acked-by: NIngo Molnar <mingo@elte.hu>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

714b93da

[PATCH] KVM: MMU: Treat user-mode faults as a hint that a page is no longer a page table · 14364656

由 Avi Kivity 提交于 1月 05, 2007

Signed-off-by: NAvi Kivity <avi@qumranet.com>
Acked-by: NIngo Molnar <mingo@elte.hu>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

14364656