- 21 July 2007, 3 commits
-
-
Committed by Avi Kivity
__free_page() wants a struct page, not a virtual address. Signed-off-by: Avi Kivity <avi@qumranet.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
-
Committed by Avi Kivity
The kvm mmu uses page->private on shadow page tables; so does slub, and an oops results. Fix by allocating regular pages for shadows instead of using slub. Tested-by: S.Çağlar Onur <caglar@pardus.org.tr> Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
The memory slot management functions were oriented against vcpu 0, where they should be kvm-wide. This causes hangs when starting X on smp guests. Fix by making the functions (and resultant tail in the mmu) non-vcpu-specific. Unfortunately this reduces the efficiency of the mmu object cache a bit. We may have to revisit this later. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
- 20 July 2007, 1 commit
-
-
Committed by Paul Mundt
Slab destructors were no longer supported after Christoph's c59def9f change. They've been BUGs for both slab and slub, and slob never supported them either. This rips out support for the dtor pointer from kmem_cache_create() completely and fixes up every single callsite in the kernel (there were about 224, not including the slab allocator definitions themselves, or the documentation references). Signed-off-by: Paul Mundt <lethal@linux-sh.org>
-
- 16 July 2007, 19 commits
-
-
Committed by Avi Kivity
Remove unnecessary ones, and rearrange the remaining in the standard order. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Shaohua Li
Need to flush the tlb after updating a pte, not before. Signed-off-by: Shaohua Li <shaohua.li@intel.com> Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
When a vcpu causes a shadow tlb entry to have reduced permissions, it must also clear the tlb on remote vcpus. We do that by:
- setting a bit on the vcpu that requests a tlb flush before the next entry
- if the vcpu is currently executing, sending an ipi to make sure it exits before we continue
Signed-off-by: Avi Kivity <avi@qumranet.com>
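For illustration, here is a minimal userspace sketch of the two-step protocol the commit describes; all names (struct vcpu, kick_vcpu, and so on) are hypothetical stand-ins, not the kernel's symbols.

```c
#include <stdatomic.h>
#include <stdbool.h>
#include <stdio.h>

struct vcpu {
	atomic_bool tlb_flush_requested; /* checked before each guest entry */
	atomic_bool running;             /* true while executing guest code */
};

static void kick_vcpu(struct vcpu *v) /* stand-in for sending an ipi */
{
	(void)v;
	printf("ipi: forcing vcpu out of guest mode\n");
}

static void request_remote_flush(struct vcpu *v)
{
	/* step 1: ask for a tlb flush before the next guest entry */
	atomic_store(&v->tlb_flush_requested, true);
	/* step 2: if the vcpu is in guest mode right now, make it exit */
	if (atomic_load(&v->running))
		kick_vcpu(v);
}

static void vcpu_enter_guest(struct vcpu *v)
{
	if (atomic_exchange(&v->tlb_flush_requested, false))
		printf("flushing shadow tlb before entry\n");
	/* ... enter guest ... */
}

int main(void)
{
	struct vcpu v = { false, false };
	atomic_store(&v.running, true);
	request_remote_flush(&v);
	atomic_store(&v.running, false);
	vcpu_enter_guest(&v);
	return 0;
}
```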
-
Committed by Avi Kivity
A vcpu can pin up to four mmu shadow pages, which means the freeing loop will never terminate. Fix by first unpinning shadow pages on all vcpus, then freeing shadow pages. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
Switching the guest paging context may require us to allocate memory, which might fail. Instead of wiring up error paths everywhere, make context switching lazy and actually do the switch before the next guest entry, where we can return an error if allocation fails. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
This has not been used for some time, as the same information is available in the page header. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
This was once used to avoid accessing the guest pte when upgrading the shadow pte from read-only to read-write. But usually we need to set the guest pte dirty or accessed bits anyway, so this wasn't really exploited. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
Always set the accessed and dirty bits (since having them cleared causes a read-modify-write cycle), always set the present bit, and copy the nx bit from the guest. Signed-off-by: Avi Kivity <avi@qumranet.com>
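A rough sketch of the resulting bit policy; the bit positions follow the standard x86 page-table layout, and the helper name is made up:

```c
#include <stdint.h>

#define PT_PRESENT_MASK  (1ULL << 0)
#define PT_ACCESSED_MASK (1ULL << 5)
#define PT_DIRTY_MASK    (1ULL << 6)
#define PT64_NX_MASK     (1ULL << 63)

/* always present, accessed and dirty; nx is copied from the guest pte */
static uint64_t shadow_pte_base(uint64_t guest_pte)
{
	return PT_PRESENT_MASK | PT_ACCESSED_MASK | PT_DIRTY_MASK
	       | (guest_pte & PT64_NX_MASK);
}
```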
-
Committed by Avi Kivity
Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
This prevents some work from being performed twice, and, more importantly, reduces the number of places where we modify shadow ptes. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
In preparation for some modifications. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
Use slab caches instead of a simple custom list. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
Simplifies things a bit. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
A typical demand page/copy-on-write pattern is:
- page fault on vaddr
- kvm propagates fault to guest
- guest handles fault, updates pte
- kvm traps write, clears shadow pte, resumes guest
- guest returns to userspace, re-faults on same vaddr
- kvm installs shadow pte, resumes guest
- guest continues
So, three vmexits for a single guest page fault. But if instead of clearing the page table entry, we update it to correspond to the value that the guest has just written, we eliminate the third vmexit. This patch does exactly that, reducing kbuild time by about 10%. Signed-off-by: Avi Kivity <avi@qumranet.com>
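A hedged sketch of the idea in the write-intercept path; the helper names are made up, and the real translation from guest pte to shadow pte is far richer than the identity used here:

```c
#include <stdint.h>

#define PT_PRESENT_MASK (1ULL << 0)

/* made-up stand-in for the guest-pte -> shadow-pte translation */
static uint64_t translate_gpte(uint64_t gpte)
{
	return gpte; /* identity, for the sketch only */
}

/* on a trapped guest pte write: install the new mapping now instead of
 * clearing the shadow pte and paying a third vmexit on the re-fault */
static void guest_pte_written(uint64_t *spte, uint64_t new_gpte)
{
	if (new_gpte & PT_PRESENT_MASK)
		*spte = translate_gpte(new_gpte);
	else
		*spte = 0; /* not present: clearing is still correct */
}
```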
-
Committed by Avi Kivity
When a guest writes to a page that has an mmu shadow, we have to clear the shadow pte corresponding to the memory location touched by the guest. Now, in nonpae mode, a single guest page may have two or four shadow pages (because a nonpae page maps 4MB or 4GB, whereas the pae shadow maps 2MB or 1GB), so when we look up the page we find up to three additional aliases for it. Since we _clear_ the shadow pte, it doesn't matter except for a slight performance penalty, but if we want to _update_ the shadow pte instead of clearing it, it is vital that we don't modify the aliases. Fortunately, exactly which page is needed (the "quadrant") is easily computed, and is accessible in the shadow page header. All we need is to ignore shadow pages from the wrong quadrants. Signed-off-by: Avi Kivity <avi@qumranet.com>
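The quadrant computation itself is small. A sketch under the commit's numbers (nonpae guest tables hold 1024 4-byte entries, pae shadow tables 512 8-byte entries); the function name is illustrative:

```c
#include <stdint.h>

/* level 1: a guest pte page maps 4MB, each pae shadow maps 2MB -> 2 quadrants
 * level 2: a guest pde page maps 4GB, each pae shadow maps 1GB -> 4 quadrants */
static unsigned quadrant_of(uint32_t gaddr, int level)
{
	unsigned quadrant = gaddr >> (12 + 9 * level); /* 512 entries per shadow level */
	return quadrant & ((1u << level) - 1);         /* 2 values at level 1, 4 at level 2 */
}
```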
-
Committed by Avi Kivity
Instead of calling two functions and repeating expensive checks, call one function and provide it with before/after information. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
This allows us to remove write protection earlier than otherwise. Should some mad OS choose to use byte writes to update pagetables, it will suffer a performance hit, but still work correctly. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
- 03 May 2007, 13 commits
-
-
Committed by Adrian Bunk
It might have worked in this case since PT_PRESENT_MASK is 1, but let's express this correctly. Signed-off-by: Adrian Bunk <bunk@stusta.de> Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
Make the exit statistics per-vcpu instead of global. This gives a 3.5% boost when running one virtual machine per core on my two-socket dual-core (4 cores total) machine. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Yaozu Dong
Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
This avoids -ENOMEM under memory pressure. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
Better leak detection, statistics, memory use, speed -- goodness all around. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
Some guests (Solaris) do not set up all four pdptrs, but leave some invalid. kvm incorrectly treated these as valid page directories, pinning the wrong pages and causing general confusion. Fix by checking the valid bit of a pae pdpte. This closes sourceforge bug 1698922. Signed-off-by: Avi Kivity <avi@qumranet.com>
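The check amounts to testing the present bit of each pdpte before treating it as a page-directory pointer; a minimal sketch with an illustrative helper name:

```c
#include <stdbool.h>
#include <stdint.h>

#define PT_PRESENT_MASK (1ULL << 0)

/* only pdptes with the present bit set point at a real page directory */
static bool pdpte_valid(uint64_t pdpte)
{
	return (pdpte & PT_PRESENT_MASK) != 0;
}
```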
-
Committed by Avi Kivity
Mapping a guest page to a host page is a common operation. Currently, one has first to find the memory slot where the page belongs (gfn_to_memslot), then locate the page itself (gfn_to_page()). This is clumsy, and also won't work well with memory aliases. So simplify gfn_to_page() not to require memory slot translation first, and instead do it internally. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Dor Laor
Functions that play around with the physical memory map need a way to clear mappings to possibly nonexistent or invalid memory. Both the mmu cache and the processor tlb are cleared. Signed-off-by: Dor Laor <dor.laor@qumranet.com> Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
Use list_move() where possible. Noticed by Dor Laor. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
The kvm mmu keeps a shadow page for hugepage pdes; if several such pdes map the same physical address, they share the same shadow page. This is a fairly common case (kernel mappings on i386 nonpae Linux, for example). However, if the two pdes map the same memory but with different permissions, kvm will happily use the cached shadow page. If the access through the more permissive pde occurs after the access through the strict pde, an endless pagefault loop is generated and the guest makes no progress. Fix by making the access permissions part of the cache lookup key. The fix allows Xen pae to boot on kvm and run guest domains. Thanks to Jeremy Fitzhardinge for reporting the bug and testing the fix. Signed-off-by: Avi Kivity <avi@qumranet.com>
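A sketch of the fix's shape: fold the access permissions into the shadow-page lookup key so differently-permissioned pdes never match the same cached shadow page. The structure and names are hypothetical, not the kernel's:

```c
#include <stdbool.h>
#include <stdint.h>

struct shadow_key {
	uint64_t gfn;     /* guest frame the shadow page maps */
	unsigned access;  /* write/user/nx permission bits */
};

/* two hugepage pdes mapping the same gfn with different permissions now
 * compare unequal, so each gets its own shadow page */
static bool shadow_key_eq(struct shadow_key a, struct shadow_key b)
{
	return a.gfn == b.gfn && a.access == b.access;
}
```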
-
Committed by Avi Kivity
The initial, noncaching, version of the kvm mmu flushed all nonglobal shadow page table translations (much like a native tlb flush). The new implementation flushes translations only when they change, rendering global pte tracking superfluous. This removes the unused tracking mechanism and storage space. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
The current string pio interface communicates using guest virtual addresses, relying on userspace to translate addresses and to check permissions. This interface cannot fully support guest smp, as the check needs to take into account two pages at once in case an unaligned string transfer straddles a page boundary. Change the interface not to communicate guest addresses at all; instead use a buffer page (mmaped by userspace) and do transfers there. The kernel manages the virtual-to-physical translation and can perform the checks atomically by taking the appropriate locks. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity
When auditing a 32-bit guest on a 64-bit host, sign extension of the page directory pointer table index caused bogus addresses to be shown on audit errors. Fix by declaring the index unsigned. Signed-off-by: Avi Kivity <avi@qumranet.com>
-
- 19 April 2007, 1 commit
-
-
Committed by Avi Kivity
Nonpae guest pdes are shadowed by two pae ptes, so we double the offset twice: once to account for the pte size difference, and once because we need two shadow pdes for a single guest pde. But when writing to the upper guest pde we also need to truncate the lower bits, otherwise the multiply shifts these bits into the pde index and causes an access to the wrong shadow pde. If we're at the end of the page (accessing the very last guest pde) we can even overflow into the next host page and oops. Signed-off-by: Avi Kivity <avi@qumranet.com>
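A sketch of the arithmetic: a nonpae guest pde is 4 bytes and becomes two 8-byte pae shadow pdes, so guest byte offsets are scaled by 4, and the scaling shifts any stray low bits into the index unless they are truncated first. The helper name is made up:

```c
#include <stdint.h>

/* map a byte offset inside the guest pde page to a byte offset inside
 * the shadow pde pages: x2 for the 4->8 byte entry size, x2 because one
 * guest pde becomes two shadow pdes */
static unsigned shadow_pde_offset(unsigned guest_offset)
{
	unsigned pde_start = guest_offset & ~3u; /* the truncation this commit adds */
	return pde_start * 4;
}
```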
-
- 18 March 2007, 2 commits
-
-
Committed by Avi Kivity
PAGE_MASK is an unsigned long, so using it to mask physical addresses on i386 (which are 64 bits wide) leads to truncation. This can result in page->private of unrelated memory pages being modified, with disastrous results. Fix by not using PAGE_MASK for physical addresses; instead calculate the correct value directly from PAGE_SIZE. Also fix a similar BUG_ON(). Acked-by: Ingo Molnar <mingo@elte.hu> Signed-off-by: Avi Kivity <avi@qumranet.com>
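A small self-contained illustration of the failure mode and the fix. To keep the demo reproducible on any host, a 32-bit type stands in for i386's unsigned long; the constants mirror the kernel's but are defined locally:

```c
#include <stdint.h>
#include <stdio.h>

typedef uint32_t i386_ulong;                 /* unsigned long on i386 */
#define PAGE_SIZE 4096u
#define PAGE_MASK ((i386_ulong)~(PAGE_SIZE - 1))

int main(void)
{
	uint64_t paddr = 0x123456789000ULL;  /* pae physical address above 4GB */

	uint64_t wrong = paddr & PAGE_MASK;  /* 32-bit mask eats the high bits */
	uint64_t right = paddr & ~((uint64_t)PAGE_SIZE - 1); /* the fix */

	printf("wrong = %#llx\n", (unsigned long long)wrong);
	printf("right = %#llx\n", (unsigned long long)right);
	return 0;
}
```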
-
Committed by Avi Kivity
KVM shadow page tables are always in pae mode, regardless of the guest setting. This means that a guest pde (mapping 4MB of memory) is mapped to two shadow pdes (mapping 2MB each). When the guest writes to a pte or pde, we intercept the write and emulate it. We also remove any shadowed mappings corresponding to the write. Since the mmu did not account for the doubling in the number of pdes, it removed the wrong entry, resulting in a mismatch between shadow page tables and guest page tables, followed shortly by guest memory corruption. This patch fixes the problem by detecting the special case of writing to a non-pae pde and adjusting the address and number of shadow pdes zapped accordingly. Acked-by: Ingo Molnar <mingo@elte.hu> Signed-off-by: Avi Kivity <avi@qumranet.com>
-
- 04 March 2007, 1 commit
-
-
Committed by Markus Rechberger
Besides using an established api, this allows using kvm in older kernels. Signed-off-by: Markus Rechberger <markus.rechberger@amd.com> Signed-off-by: Avi Kivity <avi@qumranet.com>
-