提交 · fdae862f91728aec6dd8fd62cd2398868c906b6b · OpenHarmony / kernel_linux

27 4月, 2008 6 次提交

KVM: MMU: unify slots_lock usage · 3200f405

由 Marcelo Tosatti 提交于 3月 29, 2008

Unify slots_lock acquision around vcpu_run(). This is simpler and less
error-prone.

Also fix some callsites that were not grabbing the lock properly.

[avi: drop slots_lock while in guest mode to avoid holding the lock
      for indefinite periods]
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

3200f405

KVM: MMU: Set the accessed bit on non-speculative shadow ptes · 947da538

由 Avi Kivity 提交于 3月 18, 2008

If we populate a shadow pte due to a fault (and not speculatively due to a
pte write) then we can set the accessed bit on it, as we know it will be
set immediately on the next guest instruction.  This saves a read-modify-write
operation.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

947da538

KVM: replace remaining __FUNCTION__ occurances · b8688d51

由 Harvey Harrison 提交于 3月 03, 2008

__FUNCTION__ is gcc-specific, use __func__
Signed-off-by: NHarvey Harrison <harvey.harrison@gmail.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

b8688d51

KVM: MMU: large page support · 05da4558

由 Marcelo Tosatti 提交于 2月 23, 2008

Create large pages mappings if the guest PTE's are marked as such and
the underlying memory is hugetlbfs backed.  If the largepage contains
write-protected pages, a large pte is not used.

Gives a consistent 2% improvement for data copies on ram mounted
filesystem, without NPT/EPT.

Anthony measures a 4% improvement on 4-way kernbench, with NPT.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

05da4558

KVM: MMU: Decouple mmio from shadow page tables · d196e343

由 Avi Kivity 提交于 1月 24, 2008

Currently an mmio guest pte is encoded in the shadow pagetable as a
not-present trapping pte, with the SHADOW_IO_MARK bit set. However
nothing is ever done with this information, so maintaining it is a
useless complication.

This patch moves the check for mmio to before shadow ptes are instantiated,
so the shadow code is never invoked for ptes that reference mmio. The code
is simpler, and with future work, can be made to handle mmio concurrently.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

d196e343

KVM: MMU: Update shadow ptes on partial guest pte writes · 489f1d65

由 Dong, Eddie 提交于 1月 07, 2008

A guest partial guest pte write will leave shadow_trap_nonpresent_pte
in spte, which generates a vmexit at the next guest access through that pte.

This patch improves this by reading the full guest pte in advance and thus
being able to update the spte and eliminate the vmexit.

This helps pae guests which use two 32-bit writes to set a single 64-bit pte.

[truncation fix by Eric]
Signed-off-by: NYaozu (Eddie) Dong <eddie.dong@intel.com>
Signed-off-by: NFeng (Eric) Liu <eric.e.liu@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

489f1d65

04 3月, 2008 3 次提交

KVM: MMU: Fix race when instantiating a shadow pte · f7d9c7b7

由 Avi Kivity 提交于 2月 26, 2008

For improved concurrency, the guest walk is performed concurrently with other
vcpus.  This means that we need to revalidate the guest ptes once we have
write-protected the guest page tables, at which point they can no longer be
modified.

The current code attempts to avoid this check if the shadow page table is not
new, on the assumption that if it has existed before, the guest could not have
modified the pte without the shadow lock.  However the assumption is incorrect,
as the racing vcpu could have modified the pte, then instantiated the shadow
page, before our vcpu regains control:

  vcpu0        vcpu1

  fault
  walk pte

               modify pte
               fault in same pagetable
               instantiate shadow page

  lookup shadow page
  conclude it is old
  instantiate spte based on stale guest pte

We could do something clever with generation counters, but a test run by
Marcelo suggests this is unnecessary and we can just do the revalidation
unconditionally.  The pte will be in the processor cache and the check can
be quite fast.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

f7d9c7b7

KVM: make MMU_DEBUG compile again · 24993d53

由 Marcelo Tosatti 提交于 2月 14, 2008

the cr3 variable is now inside the vcpu->arch structure.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

24993d53

KVM: remove the usage of the mmap_sem for the protection of the memory slots. · 72dc67a6

由 Izik Eidus 提交于 2月 10, 2008

This patch replaces the mmap_sem lock for the memory slots with a new
kvm private lock, it is needed beacuse untill now there were cases where
kvm accesses user memory while holding the mmap semaphore.
Signed-off-by: NIzik Eidus <izike@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

72dc67a6

31 1月, 2008 7 次提交

KVM: MMU: Merge shadow level check in FNAME(fetch) · 5882842f

由 Dong, Eddie 提交于 1月 02, 2008

Remove the redundant level check when fetching
shadow pte for present & non-present spte.
Signed-off-by: NYaozu (Eddie) Dong <eddie.dong@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

5882842f

KVM: MMU: Move kvm_free_some_pages() into critical section · eb787d10

由 Avi Kivity 提交于 12月 31, 2007

If some other cpu steals mmu pages between our check and an attempt to
allocate, we can run out of mmu pages.  Fix by moving the check into the
same critical section as the allocation.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

eb787d10

KVM: MMU: Switch to mmu spinlock · aaee2c94

由 Marcelo Tosatti 提交于 12月 20, 2007

Convert the synchronization of the shadow handling to a separate mmu_lock
spinlock.

Also guard fetch() by mmap_sem in read-mode to protect against alias
and memslot changes.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

aaee2c94

KVM: MMU: Avoid calling gfn_to_page() in mmu_set_spte() · d7824fff

由 Avi Kivity 提交于 12月 30, 2007

Since gfn_to_page() is a sleeping function, and we want to make the core mmu
spinlocked, we need to pass the page from the walker context (which can sleep)
to the shadow context (which cannot).

[marcelo: avoid recursive locking of mmap_sem]
Signed-off-by: NAvi Kivity <avi@qumranet.com>

d7824fff

KVM: Add kvm_read_guest_atomic() · 7ec54588

由 Marcelo Tosatti 提交于 12月 20, 2007

In preparation for a mmu spinlock, add kvm_read_guest_atomic()
and use it in fetch() and prefetch_page().
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

7ec54588

KVM: MMU: Concurrent guest walkers · 10589a46

由 Marcelo Tosatti 提交于 12月 20, 2007

Do not hold kvm->lock mutex across the entire pagefault code,
only acquire it in places where it is necessary, such as mmu
hash list, active list, rmap and parent pte handling.

Allow concurrent guest walkers by switching walk_addr() to use
mmap_sem in read-mode.

And get rid of the lockless __gfn_to_page.

[avi: move kvm_mmu_pte_write() locking inside the function]
[avi: add locking for real mode]
[avi: fix cmpxchg locking]
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

10589a46

KVM: Move arch dependent files to new directory arch/x86/kvm/ · edf88417

由 Avi Kivity 提交于 12月 16, 2007

This paves the way for multiple architecture support.  Note that while
ioapic.c could potentially be shared with ia64, it is also moved.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

edf88417

30 1月, 2008 24 次提交

KVM: Portability: Introduce kvm_vcpu_arch · ad312c7c

由 Zhang Xiantao 提交于 12月 13, 2007

Move all the architecture-specific fields in kvm_vcpu into a new struct
kvm_vcpu_arch.
Signed-off-by: NZhang Xiantao <xiantao.zhang@intel.com>
Acked-by: NCarsten Otte <cotte@de.ibm.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

ad312c7c

KVM: MMU: Fix SMP shadow instantiation race · 7819026e

由 Marcelo Tosatti 提交于 12月 11, 2007

There is a race where VCPU0 is shadowing a pagetable entry while VCPU1
is updating it, which results in a stale shadow copy.

Fix that by comparing the contents of the cached guest pte with the
current guest pte after write-protecting the guest pagetable.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

7819026e

KVM: MMU: Move set_pte() into guest paging mode independent code · 1c4f1fd6

由 Avi Kivity 提交于 12月 09, 2007

As set_pte() no longer references either a gpte or the guest walker, we can
move it out of paging mode dependent code (which compiles twice and is
generally nasty).
Signed-off-by: NAvi Kivity <avi@qumranet.com>

1c4f1fd6

A
KVM: MMU: Remove walker argument to set_pte() · 2fbf4cf1
由 Avi Kivity 提交于 12月 09, 2007
```
Unused.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
2fbf4cf1
A
KVM: MMU: Pass pte dirty flag to set_pte() instead of calculating it on-site · e3f95504
由 Avi Kivity 提交于 12月 09, 2007
```
This allows us to remove its dependency on pt_element_t.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
e3f95504

KVM: MMU: No need to pick up nx bit from guest pte · b4ab019c

由 Avi Kivity 提交于 12月 09, 2007

We already set it according to cumulative access permissions.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

b4ab019c

KVM: MMU: Fix inherited permissions for emulated guest pte updates · 41074d07

由 Avi Kivity 提交于 12月 09, 2007

When we emulate a guest pte write, we fail to apply the correct inherited
permissions from the parent ptes. Now that we store inherited permissions
in the shadow page, we can use that to update the pte permissions correctly.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

41074d07

A
KVM: MMU: Move pte access calculation into a helper function · bedbe4ee
由 Avi Kivity 提交于 12月 09, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
bedbe4ee

KVM: MMU: Set nx bit correctly on shadow ptes · 8d87a03a

由 Avi Kivity 提交于 12月 09, 2007

While the page table walker correctly generates a guest page fault
if a guest tries to execute a non-executable page, the shadow code does
not mark it non-executable.  This means that if a guest accesses an nx
page first with a read access, then subsequent code fetch accesses will
succeed.

Fix by setting the nx bit on shadow ptes.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

8d87a03a

KVM: MMU: Simplify calculation of pte access · fe135d2c

由 Avi Kivity 提交于 12月 09, 2007

The nx bit is awkwardly placed in the 63rd bit position; furthermore it
has a reversed meaning compared to the other bits, which means we can't use
a bitwise and to calculate compounded access masks.

So, we simplify things by creating a new 3-bit exec/write/user access word,
and doing all calculations in that.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

fe135d2c

KVM: MMU: Use cmpxchg for pte updates on walk_addr() · b3e4e63f

由 Marcelo Tosatti 提交于 12月 07, 2007

In preparation for multi-threaded guest pte walking, use cmpxchg()
when updating guest pte's. This guarantees that the assignment of the
dirty bit can't be lost if two CPU's are faulting the same address
simultaneously.

[avi: fix kunmap_atomic() parameters]
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

b3e4e63f

M
KVM: MMU: Remove unused prev_shadow_ent variable from fetch() · 4bf8ed8d
由 Marcelo Tosatti 提交于 12月 04, 2007
```
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
4bf8ed8d

KVM: MMU: Introduce gfn_to_gpa() · 1755fbcc

由 Avi Kivity 提交于 11月 21, 2007

Converting a frame number to an address is tricky since the data type changes
size.  Introduce a function to do it.  This fixes an actual bug when
accessing guest ptes.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

1755fbcc

A
KVM: MMU: Adjust page_header_update_slot() to accept a gfn instead of a gpa · 38c335f1
由 Avi Kivity 提交于 11月 21, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
38c335f1

KVM: MMU: Merge set_pte() and set_pte_common() · 230c9a8f

由 Avi Kivity 提交于 11月 21, 2007

Since set_pte() is now the only caller of set_pte_common(), merge the two
functions.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

230c9a8f

A
KVM: MMU: Remove set_pde() · 050e6499
由 Avi Kivity 提交于 11月 21, 2007
```
It is now identical to set_pte().
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
050e6499
A
KVM: MMU: Remove extra gaddr parameter from set_pte_common() · 4e542370
由 Avi Kivity 提交于 11月 21, 2007
```
Similar information is available in the gfn parameter, so use that.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
4e542370
A
KVM: MMU: Move pse36 handling to the guest walker · da928521
由 Avi Kivity 提交于 11月 21, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
da928521
A
KVM: MMU: Introduce and use gpte_to_gfn() · 5fb07ddb
由 Avi Kivity 提交于 11月 21, 2007
```
Instead of repretitively open-coding this.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
5fb07ddb

KVM: MMU: Code cleanup · b238f7bc

由 Izik Eidus 提交于 11月 20, 2007

Signed-off-by: NIzik Eidus <izike@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

b238f7bc

KVM: MMU: Implement guest page fault bypass for nonpae · e5a4c8ca

由 Avi Kivity 提交于 11月 20, 2007

I spent an hour worrying why I see so many guest page faults on FC6 i386.
Turns out bypass wasn't implemented for nonpae.  Implement it so it doesn't
happen again.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

e5a4c8ca

KVM: MMU: Selectively set PageDirty when releasing guest memory · b4231d61

由 Izik Eidus 提交于 11月 20, 2007

Improve dirty bit setting for pages that kvm release, until now every page
that we released we marked dirty, from now only pages that have potential
to get dirty we mark dirty.
Signed-off-by: NIzik Eidus <izike@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

b4231d61

A
KVM: MMU: Remove unused variable · 971535ff
由 Avi Kivity 提交于 11月 19, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
971535ff

KVM: MMU: Change guest pte access to kvm_{read,write}_guest() · ec8d4eae

由 Izik Eidus 提交于 11月 19, 2007

Things are simpler and more regular this way.
Signed-off-by: NIzik Eidus <izike@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

ec8d4eae

OpenHarmony / kernel_linux 上一次同步 3 年多

OpenHarmony / kernel_linux
上一次同步 3 年多