提交 · 10589a4699bb978c781ce73bbae8ca942c5250c9 · OpenHarmony / kernel_linux

31 1月, 2008 2 次提交

KVM: MMU: Concurrent guest walkers · 10589a46

由 Marcelo Tosatti 提交于 12月 20, 2007

Do not hold kvm->lock mutex across the entire pagefault code,
only acquire it in places where it is necessary, such as mmu
hash list, active list, rmap and parent pte handling.

Allow concurrent guest walkers by switching walk_addr() to use
mmap_sem in read-mode.

And get rid of the lockless __gfn_to_page.

[avi: move kvm_mmu_pte_write() locking inside the function]
[avi: add locking for real mode]
[avi: fix cmpxchg locking]
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

10589a46

KVM: Move arch dependent files to new directory arch/x86/kvm/ · edf88417

由 Avi Kivity 提交于 12月 16, 2007

This paves the way for multiple architecture support.  Note that while
ioapic.c could potentially be shared with ia64, it is also moved.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

edf88417

30 1月, 2008 38 次提交

KVM: Portability: Introduce kvm_vcpu_arch · ad312c7c

由 Zhang Xiantao 提交于 12月 13, 2007

Move all the architecture-specific fields in kvm_vcpu into a new struct
kvm_vcpu_arch.
Signed-off-by: NZhang Xiantao <xiantao.zhang@intel.com>
Acked-by: NCarsten Otte <cotte@de.ibm.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

ad312c7c

KVM: MMU: Fix SMP shadow instantiation race · 7819026e

由 Marcelo Tosatti 提交于 12月 11, 2007

There is a race where VCPU0 is shadowing a pagetable entry while VCPU1
is updating it, which results in a stale shadow copy.

Fix that by comparing the contents of the cached guest pte with the
current guest pte after write-protecting the guest pagetable.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

7819026e

KVM: MMU: Move set_pte() into guest paging mode independent code · 1c4f1fd6

由 Avi Kivity 提交于 12月 09, 2007

As set_pte() no longer references either a gpte or the guest walker, we can
move it out of paging mode dependent code (which compiles twice and is
generally nasty).
Signed-off-by: NAvi Kivity <avi@qumranet.com>

1c4f1fd6

A
KVM: MMU: Remove walker argument to set_pte() · 2fbf4cf1
由 Avi Kivity 提交于 12月 09, 2007
```
Unused.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
2fbf4cf1
A
KVM: MMU: Pass pte dirty flag to set_pte() instead of calculating it on-site · e3f95504
由 Avi Kivity 提交于 12月 09, 2007
```
This allows us to remove its dependency on pt_element_t.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
e3f95504

KVM: MMU: No need to pick up nx bit from guest pte · b4ab019c

由 Avi Kivity 提交于 12月 09, 2007

We already set it according to cumulative access permissions.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

b4ab019c

KVM: MMU: Fix inherited permissions for emulated guest pte updates · 41074d07

由 Avi Kivity 提交于 12月 09, 2007

When we emulate a guest pte write, we fail to apply the correct inherited
permissions from the parent ptes. Now that we store inherited permissions
in the shadow page, we can use that to update the pte permissions correctly.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

41074d07

A
KVM: MMU: Move pte access calculation into a helper function · bedbe4ee
由 Avi Kivity 提交于 12月 09, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
bedbe4ee

KVM: MMU: Set nx bit correctly on shadow ptes · 8d87a03a

由 Avi Kivity 提交于 12月 09, 2007

While the page table walker correctly generates a guest page fault
if a guest tries to execute a non-executable page, the shadow code does
not mark it non-executable.  This means that if a guest accesses an nx
page first with a read access, then subsequent code fetch accesses will
succeed.

Fix by setting the nx bit on shadow ptes.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

8d87a03a

KVM: MMU: Simplify calculation of pte access · fe135d2c

由 Avi Kivity 提交于 12月 09, 2007

The nx bit is awkwardly placed in the 63rd bit position; furthermore it
has a reversed meaning compared to the other bits, which means we can't use
a bitwise and to calculate compounded access masks.

So, we simplify things by creating a new 3-bit exec/write/user access word,
and doing all calculations in that.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

fe135d2c

KVM: MMU: Use cmpxchg for pte updates on walk_addr() · b3e4e63f

由 Marcelo Tosatti 提交于 12月 07, 2007

In preparation for multi-threaded guest pte walking, use cmpxchg()
when updating guest pte's. This guarantees that the assignment of the
dirty bit can't be lost if two CPU's are faulting the same address
simultaneously.

[avi: fix kunmap_atomic() parameters]
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

b3e4e63f

M
KVM: MMU: Remove unused prev_shadow_ent variable from fetch() · 4bf8ed8d
由 Marcelo Tosatti 提交于 12月 04, 2007
```
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
4bf8ed8d

KVM: MMU: Introduce gfn_to_gpa() · 1755fbcc

由 Avi Kivity 提交于 11月 21, 2007

Converting a frame number to an address is tricky since the data type changes
size.  Introduce a function to do it.  This fixes an actual bug when
accessing guest ptes.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

1755fbcc

A
KVM: MMU: Adjust page_header_update_slot() to accept a gfn instead of a gpa · 38c335f1
由 Avi Kivity 提交于 11月 21, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
38c335f1

KVM: MMU: Merge set_pte() and set_pte_common() · 230c9a8f

由 Avi Kivity 提交于 11月 21, 2007

Since set_pte() is now the only caller of set_pte_common(), merge the two
functions.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

230c9a8f

A
KVM: MMU: Remove set_pde() · 050e6499
由 Avi Kivity 提交于 11月 21, 2007
```
It is now identical to set_pte().
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
050e6499
A
KVM: MMU: Remove extra gaddr parameter from set_pte_common() · 4e542370
由 Avi Kivity 提交于 11月 21, 2007
```
Similar information is available in the gfn parameter, so use that.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
4e542370
A
KVM: MMU: Move pse36 handling to the guest walker · da928521
由 Avi Kivity 提交于 11月 21, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
da928521
A
KVM: MMU: Introduce and use gpte_to_gfn() · 5fb07ddb
由 Avi Kivity 提交于 11月 21, 2007
```
Instead of repretitively open-coding this.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
5fb07ddb

KVM: MMU: Code cleanup · b238f7bc

由 Izik Eidus 提交于 11月 20, 2007

Signed-off-by: NIzik Eidus <izike@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

b238f7bc

KVM: MMU: Implement guest page fault bypass for nonpae · e5a4c8ca

由 Avi Kivity 提交于 11月 20, 2007

I spent an hour worrying why I see so many guest page faults on FC6 i386.
Turns out bypass wasn't implemented for nonpae.  Implement it so it doesn't
happen again.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

e5a4c8ca

KVM: MMU: Selectively set PageDirty when releasing guest memory · b4231d61

由 Izik Eidus 提交于 11月 20, 2007

Improve dirty bit setting for pages that kvm release, until now every page
that we released we marked dirty, from now only pages that have potential
to get dirty we mark dirty.
Signed-off-by: NIzik Eidus <izike@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

b4231d61

A
KVM: MMU: Remove unused variable · 971535ff
由 Avi Kivity 提交于 11月 19, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
971535ff

KVM: MMU: Change guest pte access to kvm_{read,write}_guest() · ec8d4eae

由 Izik Eidus 提交于 11月 19, 2007

Things are simpler and more regular this way.
Signed-off-by: NIzik Eidus <izike@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

ec8d4eae

KVM: MMU: Partial swapping of guest memory · 8a7ae055

由 Izik Eidus 提交于 10月 18, 2007

This allows guest memory to be swapped.  Pages which are currently mapped
via shadow page tables are pinned into memory, but all other pages can
be freely swapped.

The patch makes gfn_to_page() elevate the page's reference count, and
introduces kvm_release_page() that pairs with it.
Signed-off-by: NIzik Eidus <izike@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

8a7ae055

KVM: MMU: Make gfn_to_page() always safe · cea7bb21

由 Izik Eidus 提交于 10月 17, 2007

In case the page is not present in the guest memory map, return a dummy
page the guest can scribble on.

This simplifies error checking in its users.
Signed-off-by: NIzik Eidus <izike@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

cea7bb21

KVM: MMU: Simplify page table walker · 42bf3f0a

由 Avi Kivity 提交于 10月 17, 2007

Simplify the walker level loop not to carry so much information from one
loop to the next. In addition to being complex, this made kmap_atomic()
critical sections difficult to manage.

As a result of this change, kmap_atomic() sections are limited to actually
touching the guest pte, which allows the other functions called from the
walker to do sleepy operations. This will happen when we enable swapping.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

42bf3f0a

KVM: MMU: When updating the dirty bit, inform the mmu about it · c4fcc272

由 Avi Kivity 提交于 10月 11, 2007

Since the mmu uses different shadow pages for dirty large pages and clean
large pages, this allows the mmu to drop ptes that are now invalid.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

c4fcc272

A
KVM: MMU: Move dirty bit updates to a separate function · 5df34a86
由 Avi Kivity 提交于 10月 11, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
5df34a86

KVM: MMU: Disable write access on clean large pages · cc70e737

由 Avi Kivity 提交于 10月 11, 2007

By forcing clean huge pages to be read-only, we have separate roles
for the shadow of a clean large page and the shadow of a dirty large
page.  This is necessary because different ptes will be instantiated
for the two cases, even for read faults.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

cc70e737

KVM: MMU: Fix nx access bit for huge pages · c22e3514

由 Avi Kivity 提交于 10月 11, 2007

We must set the bit before the shift, otherwise the wrong bit gets set.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

c22e3514

KVM: Move guest pte dirty bit management to the guest pagetable walker · e3c5e7ec

由 Avi Kivity 提交于 10月 11, 2007

This is more consistent with the accessed bit management, and makes the dirty
bit available earlier for other purposes.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

e3c5e7ec

KVM: MMU: More struct kvm_vcpu -> struct kvm cleanups · 4a4c9924

由 Anthony Liguori 提交于 10月 10, 2007

This time, the biggest change is gpa_to_hpa. The translation of GPA to HPA does
not depend on the VCPU state unlike GVA to GPA so there's no need to pass in
the kvm_vcpu.
Signed-off-by: NAnthony Liguori <aliguori@us.ibm.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

4a4c9924

KVM: MMU: Clean up MMU functions to take struct kvm when appropriate · f67a46f4

由 Anthony Liguori 提交于 10月 10, 2007

Some of the MMU functions take a struct kvm_vcpu even though they affect all
VCPUs.  This patch cleans up some of them to instead take a struct kvm.  This
makes things a bit more clear.

The main thing that was confusing me was whether certain functions need to be
called on all VCPUs.
Signed-off-by: NAnthony Liguori <aliguori@us.ibm.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

f67a46f4

KVM: CodingStyle cleanup · d77c26fc

由 Mike Day 提交于 10月 08, 2007

Signed-off-by: NMike D. Day <ncmike@ncultra.org>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

d77c26fc

KVM: Remove the usage of page->private field by rmap · 290fc38d

由 Izik Eidus 提交于 9月 27, 2007

When kvm uses user-allocated pages in the future for the guest, we won't
be able to use page->private for rmap, since page->rmap is reserved for
the filesystem.  So we move the rmap base pointers to the memory slot.

A side effect of this is that we need to store the gfn of each gpte in
the shadow pages, since the memory slot is addressed by gfn, instead of
hfn like struct page.
Signed-off-by: NIzik Eidus <izik@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

290fc38d

KVM: MMU: Make flooding detection work when guest page faults are bypassed · 12b7d28f

由 Avi Kivity 提交于 9月 23, 2007

When we allow guest page faults to reach the guests directly, we lose
the fault tracking which allows us to detect demand paging. So we provide
an alternate mechnism by clearing the accessed bit when we set a pte, and
checking it later to see if the guest actually used it.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

12b7d28f

KVM: Allow not-present guest page faults to bypass kvm · c7addb90

由 Avi Kivity 提交于 9月 16, 2007

There are two classes of page faults trapped by kvm:
 - host page faults, where the fault is needed to allow kvm to install
   the shadow pte or update the guest accessed and dirty bits
 - guest page faults, where the guest has faulted and kvm simply injects
   the fault back into the guest to handle

The second class, guest page faults, is pure overhead.  We can eliminate
some of it on vmx using the following evil trick:
 - when we set up a shadow page table entry, if the corresponding guest pte
   is not present, set up the shadow pte as not present
 - if the guest pte _is_ present, mark the shadow pte as present but also
   set one of the reserved bits in the shadow pte
 - tell the vmx hardware not to trap faults which have the present bit clear

With this, normal page-not-present faults go directly to the guest,
bypassing kvm entirely.

Unfortunately, this trick only works on Intel hardware, as AMD lacks a
way to discriminate among page faults based on error code.  It is also
a little risky since it uses reserved bits which might become unreserved
in the future, so a module parameter is provided to disable it.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

c7addb90

OpenHarmony / kernel_linux 上一次同步 3 年多

OpenHarmony / kernel_linux
上一次同步 3 年多