- 30 January 2008, 6 commits
-
-
Submitted by Anthony Liguori
Some of the MMU functions take a struct kvm_vcpu even though they affect all VCPUs. This patch cleans up some of them to instead take a struct kvm. This makes things a bit clearer. The main thing that was confusing me was whether certain functions need to be called on all VCPUs. Signed-off-by: Anthony Liguori <aliguori@us.ibm.com> Signed-off-by: Avi Kivity <avi@qumranet.com>
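As a rough illustration of the scope change (a sketch only; the function names below are hypothetical, not the ones actually touched by the patch):

```c
struct kvm;        /* VM-wide state shared by every VCPU */
struct kvm_vcpu;   /* state of a single virtual CPU      */

/* Before: takes a vcpu even though it walks VM-wide MMU structures,
 * leaving callers to wonder whether it must run once per VCPU. */
void mmu_free_all_shadow_pages_old(struct kvm_vcpu *vcpu);

/* After: the argument reflects that the whole VM is affected, so it
 * clearly only needs to be called once per VM. */
void mmu_free_all_shadow_pages(struct kvm *kvm);
```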
-
Submitted by Mike Day
Signed-off-by: Mike D. Day <ncmike@ncultra.org> Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Izik Eidus
The user is now able to set how many mmu pages will be allocated to the guest. Signed-off-by: Izik Eidus <izike@qumranet.com> Signed-off-by: Avi Kivity <avi@qumranet.com>
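From userspace this is driven by a VM ioctl; the sketch below assumes it is the KVM_SET_NR_MMU_PAGES ioctl that the KVM API documents for this purpose, and omits most error handling:

```c
#include <fcntl.h>
#include <stdio.h>
#include <sys/ioctl.h>
#include <linux/kvm.h>

int main(void)
{
    int sys_fd = open("/dev/kvm", O_RDWR);
    int vm_fd;

    if (sys_fd < 0) {
        perror("/dev/kvm");
        return 1;
    }
    vm_fd = ioctl(sys_fd, KVM_CREATE_VM, 0);

    /* Cap this guest at 512 shadow page-table pages. */
    if (ioctl(vm_fd, KVM_SET_NR_MMU_PAGES, 512) < 0)
        perror("KVM_SET_NR_MMU_PAGES");

    return 0;
}
```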
-
Submitted by Izik Eidus
When kvm uses user-allocated pages for the guest in the future, we won't be able to use page->private for rmap, since page->rmap is reserved for the filesystem. So we move the rmap base pointers to the memory slot. A side effect of this is that we need to store the gfn of each gpte in the shadow pages, since the memory slot is addressed by gfn instead of hfn like struct page. Signed-off-by: Izik Eidus <izike@qumranet.com> Signed-off-by: Avi Kivity <avi@qumranet.com>
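A conceptual sketch of the layout after the move (struct and field names are illustrative, not the exact kernel definitions):

```c
#include <stdint.h>

typedef uint64_t gfn_t;

struct kvm_memory_slot {
    gfn_t          base_gfn;  /* first guest frame in the slot */
    unsigned long  npages;
    unsigned long *rmap;      /* one reverse-map head per guest page,
                               * indexed by (gfn - base_gfn), instead of
                               * hiding the pointer in page->private */
};

struct kvm_mmu_page {
    uint64_t *spt;            /* the shadow page table itself */
    gfn_t    *gfns;           /* gfn backing each shadow pte, needed to
                               * find the right slot/rmap entry again */
};

/* Look up the rmap head for a guest frame within its memory slot. */
static unsigned long *gfn_to_rmap(struct kvm_memory_slot *slot, gfn_t gfn)
{
    return &slot->rmap[gfn - slot->base_gfn];
}
```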
-
Submitted by Avi Kivity
When we allow guest page faults to reach the guest directly, we lose the fault tracking which allows us to detect demand paging. So we provide an alternate mechanism by clearing the accessed bit when we set a pte, and checking it later to see if the guest actually used it. Signed-off-by: Avi Kivity <avi@qumranet.com>
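A minimal sketch of the heuristic, assuming the usual x86 pte bit layout (helper names are made up for illustration):

```c
#include <stdint.h>
#include <stdbool.h>

#define PT_ACCESSED_MASK (1ULL << 5)

/* Install a shadow pte with the accessed bit deliberately cleared. */
static void set_spte_for_tracking(uint64_t *sptep, uint64_t spte)
{
    *sptep = spte & ~PT_ACCESSED_MASK;
}

/* Later: if the CPU set the accessed bit, the guest really used the
 * mapping, so it counts as demand-paged (young). */
static bool spte_was_used(uint64_t spte)
{
    return (spte & PT_ACCESSED_MASK) != 0;
}
```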
-
Submitted by Avi Kivity
There are two classes of page faults trapped by kvm:
- host page faults, where the fault is needed to allow kvm to install the shadow pte or update the guest accessed and dirty bits
- guest page faults, where the guest has faulted and kvm simply injects the fault back into the guest to handle
The second class, guest page faults, is pure overhead. We can eliminate some of it on vmx using the following evil trick:
- when we set up a shadow page table entry, if the corresponding guest pte is not present, set up the shadow pte as not present
- if the guest pte _is_ present, mark the shadow pte as present but also set one of the reserved bits in the shadow pte
- tell the vmx hardware not to trap faults which have the present bit clear
With this, normal page-not-present faults go directly to the guest, bypassing kvm entirely. Unfortunately, this trick only works on Intel hardware, as AMD lacks a way to discriminate among page faults based on error code. It is also a little risky since it uses reserved bits which might become unreserved in the future, so a module parameter is provided to disable it. Signed-off-by: Avi Kivity <avi@qumranet.com>
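A hedged sketch of the shadow-pte setup rule this describes; the marker bit, the helper, and the parameter name below are illustrative stand-ins, not the real vmx code:

```c
#include <stdint.h>
#include <stdbool.h>

#define PT_PRESENT_MASK   (1ULL << 0)
#define SPTE_TRAP_MARKER  (1ULL << 62)   /* a reserved bit used as a marker */

static bool bypass_guest_pf = true;      /* module parameter in the patch */

/* Build the "not yet shadowed" shadow pte for a given guest pte. */
static uint64_t make_nonpresent_spte(uint64_t gpte)
{
    if (!bypass_guest_pf)
        return 0;   /* plain not-present; hardware traps every fault here */

    if (!(gpte & PT_PRESENT_MASK))
        return 0;   /* guest pte absent: let the not-present fault go
                     * straight to the guest, no vmexit needed */

    /* Guest pte present but shadow missing: keep the present bit set and
     * tag a reserved bit, so the resulting fault still traps to kvm,
     * which can then build the real shadow pte. */
    return PT_PRESENT_MASK | SPTE_TRAP_MARKER;
}
```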
-
- 22 October 2007, 2 commits
-
-
Submitted by Eddie Dong
Resetting an SMP guest forces an AP to enter real mode (RESET) with paging enabled in protected mode, but the current enter_rmode() can only handle the mode switch from nonpaging mode to real mode, which leads to SMP reboot failure. Fix by reloading the mmu context on entering real mode. Signed-off-by: Yaozu (Eddie) Dong <eddie.dong@intel.com> Signed-off-by: Qing He <qing.he@intel.com> Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Izik Eidus
Shadow page table entries should be set atomically, using set_shadow_pte(). Signed-off-by: Izik Eidus <izike@qumranet.com> Signed-off-by: Avi Kivity <avi@qumranet.com>
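The point is easiest to see on 32-bit PAE hosts, where a 64-bit shadow pte written as two 32-bit stores could be observed half-updated by the hardware page walker. A sketch using compiler builtins rather than the kernel's own helper:

```c
#include <stdint.h>

static void set_shadow_pte(uint64_t *sptep, uint64_t spte)
{
    /* Single atomic 64-bit store, never a pair of 32-bit stores. */
    __atomic_store_n(sptep, spte, __ATOMIC_RELAXED);
}
```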
-
- 13 October 2007, 6 commits
-
-
Submitted by Avi Kivity
Before preempt notifiers, kvm needed to allocate memory with GFP_NOWAIT so as not to have to enable preemption and take a heavyweight exit. On oom, we'd fall back to a GFP_KERNEL allocation. With preempt notifiers, we can do a GFP_KERNEL allocation and perform the heavyweight exit only if the kernel decides to put us to sleep. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Christian Ehrhardt
This patch just renames the current (misnamed) _arch names to _x86 to ensure better readability when a real arch layer is introduced. Signed-off-by: Christian Ehrhardt <ehrhardt@linux.vnet.ibm.com> Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Shaohua Li
This allows the kvm mmu to perform operations that may sleep, such as memory allocation. Signed-off-by: Shaohua Li <shaohua.li@intel.com> Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Avi Kivity
Currently kvm disables preemption while the new virtualization registers are in use. This of course is not very good for latency-sensitive workloads (one use of virtualization is to offload the user interface and other latency-insensitive stuff to a container, so that it is easier to analyze the remaining workload). This patch re-enables preemption for kvm; preemption is now only disabled when switching the registers in and out, and during the switch to guest mode and back. Contains fixes from Shaohua Li <shaohua.li@intel.com>. Signed-off-by: Avi Kivity <avi@qumranet.com>
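The mechanism relies on per-vcpu preempt notifiers; the sketch below shows the registration pattern as I understand the contemporaneous API, so treat the callback signatures as an assumption rather than the exact code:

```c
#include <linux/preempt.h>
#include <linux/sched.h>

/* Assumed callback shapes; the real kvm code hangs a notifier off each vcpu. */
static void kvm_sched_in(struct preempt_notifier *pn, int cpu)
{
    /* Scheduled back in: reload the per-cpu virtualization register
     * state (VMCS/VMCB) before re-entering the guest. */
}

static void kvm_sched_out(struct preempt_notifier *pn, struct task_struct *next)
{
    /* About to be preempted: save the virtualization register state so
     * the physical cpu can safely run another task or vcpu. */
}

static struct preempt_ops kvm_preempt_ops = {
    .sched_in  = kvm_sched_in,
    .sched_out = kvm_sched_out,
};
```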
-
Submitted by Shaohua Li
gfn_to_page() might sleep with swap support. Move it out of the kmap calls. Signed-off-by: Shaohua Li <shaohua.li@intel.com> Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Rusty Russell
The kernel now has asm/cpu-features.h: use those macros instead of inventing our own. Also spell out the definition of CR0_RESEVED_BITS (no code change) and fix a typo. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> Signed-off-by: Avi Kivity <avi@qumranet.com>
-
- 15 September 2007, 1 commit
-
-
Submitted by Avi Kivity
A guest context switch to an uncached cr3 can require allocation of shadow pages, but we only recycle shadow pages in kvm_mmu_page_fault(). Move shadow page recycling to mmu_topup_memory_caches(), which is called from both the page fault handler and from guest cr3 reload. Signed-off-by: Avi Kivity <avi@qumranet.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
-
- 21 July 2007, 3 commits
-
-
Submitted by Avi Kivity
__free_page() wants a struct page, not a virtual address. Signed-off-by: Avi Kivity <avi@qumranet.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
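For illustration, the difference between the two page-freeing interfaces (generic kernel-style code, not the exact kvm lines):

```c
#include <linux/gfp.h>
#include <linux/mm.h>

static void demo_free(void)
{
    void *va = (void *)__get_free_page(GFP_KERNEL);
    if (!va)
        return;

    /* Wrong: __free_page() expects a struct page *, not a virtual address:
     *     __free_page(va);
     * Either convert the address first ... */
    __free_page(virt_to_page(va));

    /* ... or use the variant that does take an address:
     *     free_page((unsigned long)va); */
}
```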
-
Submitted by Avi Kivity
The kvm mmu uses page->private on shadow page tables; so does slub, and an oops results. Fix by allocating regular pages for shadows instead of using slub. Tested-by: S.Çağlar Onur <caglar@pardus.org.tr> Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Avi Kivity
The memory slot management functions were oriented against vcpu 0, where they should be kvm-wide. This causes hangs when starting X on a guest with smp. Fix by making the functions (and the resultant tail in the mmu) non-vcpu-specific. Unfortunately this reduces the efficiency of the mmu object cache a bit. We may have to revisit this later. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
- 20 July 2007, 1 commit
-
-
Submitted by Paul Mundt
Slab destructors were no longer supported after Christoph's c59def9f change. They've been BUGs for both slab and slub, and slob never supported them either. This rips out support for the dtor pointer from kmem_cache_create() completely and fixes up every single callsite in the kernel (there were about 224, not including the slab allocator definitions themselves, or the documentation references). Signed-off-by: Paul Mundt <lethal@linux-sh.org>
-
- 16 July 2007, 19 commits
-
-
Submitted by Avi Kivity
Remove unnecessary ones, and rearrange the remaining in the standard order. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Shaohua Li
Need to flush the tlb after updating a pte, not before. Signed-off-by: Shaohua Li <shaohua.li@intel.com> Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Avi Kivity
When a vcpu causes a shadow tlb entry to have reduced permissions, it must also clear the tlb on remote vcpus. We do that by:
- setting a bit on the vcpu that requests a tlb flush before the next entry
- if the vcpu is currently executing, sending an ipi to make sure it exits before we continue
Signed-off-by: Avi Kivity <avi@qumranet.com>
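A conceptual sketch of that protocol (the flags, the kick, and the helper names are illustrative; the real code uses kvm's own request bits and IPIs):

```c
#include <stdatomic.h>
#include <stdbool.h>

struct vcpu {
    atomic_bool tlb_flush_requested;  /* checked before every guest entry */
    atomic_bool in_guest_mode;
};

/* Called by the vcpu that reduced permissions on a shadow pte. */
static void request_remote_tlb_flush(struct vcpu *v)
{
    atomic_store(&v->tlb_flush_requested, true);

    if (atomic_load(&v->in_guest_mode)) {
        /* The target is running guest code: kick it with an IPI so it
         * exits and notices the request before continuing. */
        /* send_ipi(v); -- platform specific, omitted in this sketch */
    }
}

/* Run by each vcpu on its way back into the guest. */
static void maybe_flush_tlb(struct vcpu *v)
{
    if (atomic_exchange(&v->tlb_flush_requested, false)) {
        /* flush_guest_tlb(v); */
    }
}
```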
-
Submitted by Avi Kivity
A vcpu can pin up to four mmu shadow pages, which means the freeing loop will never terminate. Fix by first unpinning shadow pages on all vcpus, then freeing shadow pages. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Avi Kivity
Switching the guest paging context may require us to allocate memory, which might fail. Instead of wiring up error paths everywhere, make context switching lazy and actually do the switch before the next guest entry, where we can return an error if allocation fails. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Avi Kivity
This has not been used for some time, as the same information is available in the page header. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Avi Kivity
This was once used to avoid accessing the guest pte when upgrading the shadow pte from read-only to read-write. But usually we need to set the guest pte dirty or accessed bits anyway, so this wasn't really exploited. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Avi Kivity
Always set the accessed and dirty bits (since having them cleared causes a read-modify-write cycle), always set the present bit, and copy the nx bit from the guest. Signed-off-by: Avi Kivity <avi@qumranet.com>
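In pte terms, using the standard x86 bit positions (the helper is an illustrative sketch, not the patch's code):

```c
#include <stdint.h>

#define PT_PRESENT_MASK  (1ULL << 0)
#define PT_ACCESSED_MASK (1ULL << 5)
#define PT_DIRTY_MASK    (1ULL << 6)
#define PT_NX_MASK       (1ULL << 63)

static uint64_t shadow_pte_flags(uint64_t guest_pte)
{
    /* Always present, always pre-accessed and pre-dirtied so the CPU never
     * needs a read-modify-write cycle on the shadow entry; only the NX bit
     * is inherited from the guest pte. */
    return PT_PRESENT_MASK | PT_ACCESSED_MASK | PT_DIRTY_MASK |
           (guest_pte & PT_NX_MASK);
}
```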
-
Submitted by Avi Kivity
Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Avi Kivity
This prevents some work from being performed twice, and, more importantly, reduces the number of places where we modify shadow ptes. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Avi Kivity
Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Avi Kivity
In preparation for some modifications. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Avi Kivity
Use slab caches instead of a simple custom list. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Avi Kivity
Simplifies things a bit. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Avi Kivity
Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Avi Kivity
A typical demand page/copy-on-write pattern is:
- page fault on vaddr
- kvm propagates fault to guest
- guest handles fault, updates pte
- kvm traps write, clears shadow pte, resumes guest
- guest returns to userspace, re-faults on same vaddr
- kvm installs shadow pte, resumes guest
- guest continues
So, three vmexits for a single guest page fault. But if instead of clearing the page table entry, we update it to correspond to the value that the guest has just written, we eliminate the third vmexit. This patch does exactly that, reducing kbuild time by about 10%. Signed-off-by: Avi Kivity <avi@qumranet.com>
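A schematic before/after of the write-trap handler's behaviour (illustrative helpers only, not the patch's code):

```c
#include <stdint.h>

/* Old behaviour: any guest write to a shadowed page table just zapped the
 * corresponding shadow pte, forcing a later re-fault (the third vmexit). */
static void old_pte_write(uint64_t *sptep)
{
    *sptep = 0;
}

/* New behaviour: since we trapped the write, we already know the value the
 * guest stored, so build the matching shadow pte right away. */
static void new_pte_write(uint64_t *sptep, uint64_t new_gpte,
                          uint64_t (*gpte_to_spte)(uint64_t))
{
    *sptep = gpte_to_spte(new_gpte);
}
```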
-
Submitted by Avi Kivity
When a guest writes to a page that has an mmu shadow, we have to clear the shadow pte corresponding to the memory location touched by the guest. Now, in nonpae mode, a single guest page may have two or four shadow pages (because a nonpae page maps 4MB or 4GB, whereas the pae shadow maps 2MB or 1GB), so when we look up the page we find up to three additional aliases for the page. Since we _clear_ the shadow pte, it doesn't matter except for a slight performance penalty, but if we want to _update_ the shadow pte instead of clearing it, it is vital that we don't modify the aliases. Fortunately, exactly which page is needed (the "quadrant") is easily computed, and is accessible in the shadow page header. All we need is to ignore shadow pages from the wrong quadrants. Signed-off-by: Avi Kivity <avi@qumranet.com>
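For the simplest case, a nonpae guest page table (1024 four-byte entries mapping 4MB) shadowed by two pae pages (512 eight-byte entries, 2MB each), the quadrant of a guest write works out as in this illustrative sketch (constants and helpers are made up, not the mmu code):

```c
#include <stdint.h>
#include <stdbool.h>

#define PAGE_SIZE 4096u

struct kvm_mmu_page {
    unsigned quadrant;   /* which slice of the guest page this shadows */
};

/* For a guest write at byte_offset inside its page-table page, return the
 * quadrant whose shadow actually mirrors that entry. */
static unsigned gpte_write_quadrant(unsigned byte_offset)
{
    unsigned spte_offset = byte_offset * 2;  /* 4-byte gpte -> 8-byte spte */
    return spte_offset / PAGE_SIZE;          /* 0 or 1 for a page table    */
}

/* Only shadow pages in the matching quadrant may be updated in place;
 * aliases in other quadrants must be left alone. */
static bool shadow_matches_write(struct kvm_mmu_page *sp, unsigned byte_offset)
{
    return sp->quadrant == gpte_write_quadrant(byte_offset);
}
```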
-
Submitted by Avi Kivity
Instead of calling two functions and repeating expensive checks, call one function and provide it with before/after information. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Avi Kivity
This allows us to remove write protection earlier than otherwise. Should some mad OS choose to use byte writes to update pagetables, it will suffer a performance hit, but still work correctly. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
- 03 May 2007, 2 commits
-
-
Submitted by Adrian Bunk
It might have worked in this case since PT_PRESENT_MASK is 1, but let's express this correctly. Signed-off-by: Adrian Bunk <bunk@stusta.de> Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Submitted by Avi Kivity
Make the exit statistics per-vcpu instead of global. This gives a 3.5% boost when running one virtual machine per core on my two-socket dual-core (4 cores total) machine. Signed-off-by: Avi Kivity <avi@qumranet.com>
-