- 10 September 2009, 9 commits
-
Committed by Joerg Roedel

This patch adds support for shadow paging to the 1gb page table code in KVM. With this code the guest can use 1gb pages even if the host does not support them.

[ Marcelo: fix shadow page collision on the pmd level if a guest 1gb page is mapped with 4kb ptes on the host level ]

Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
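The rule behind the bracketed fix, that the shadow mapping can never be larger than what both the guest mapping and the host backing support, fits in a minimal userspace sketch. The level constants follow KVM's naming, but shadow_mapping_level() is an illustrative stand-in, not the kernel function:

    #include <stdio.h>

    /* x86 paging levels: 1 = 4kb pte, 2 = 2mb pde, 3 = 1gb pdpte. */
    enum { PT_PAGE_TABLE_LEVEL = 1, PT_DIRECTORY_LEVEL = 2, PT_PDPE_LEVEL = 3 };

    /* The shadow mapping can never be larger than what both sides support:
     * take the minimum of the guest mapping level and the host backing level. */
    static int shadow_mapping_level(int guest_level, int host_level)
    {
        return guest_level < host_level ? guest_level : host_level;
    }

    int main(void)
    {
        /* Guest maps a 1gb page, host backs it with 4kb pages: shadow at 4kb. */
        printf("shadow level = %d\n",
               shadow_mapping_level(PT_PDPE_LEVEL, PT_PAGE_TABLE_LEVEL));
        return 0;
    }

A guest 1gb page backed by 4kb host pages is thus shadowed with 4kb sptes, which is exactly the pmd-level collision case the bracketed fix addresses.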
-
Committed by Joerg Roedel

The page walker may be used with nested paging too when accessing mmio areas. Make it support the additional page level too.

[ Marcelo: fix reserved bit check for the 1gb pte ]

Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Joerg Roedel

Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Joerg Roedel

With the new name and the corresponding backend changes, this function can now support multiple hugepage sizes.

Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Avi Kivity

Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Joerg Roedel

[ avi: fix build on non-x86 ]

Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Avi Kivity

We use shadow_pte and spte inconsistently; switch to the shorter spelling. Rename set_shadow_pte() to __set_spte() to avoid a conflict with the existing set_spte(), and to indicate that it is a low-level function.

Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Avi Kivity

Since the guest and host ptes can have wildly different formats, adjust the pte accessor names to indicate which type of pte they operate on. No functional changes.

Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Avi Kivity

Instead of reloading the pdptrs on every entry and exit (vmcs writes on vmx, guest memory access on svm), extract them on demand.

Signed-off-by: Avi Kivity <avi@redhat.com>
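The on-demand scheme is a plain invalidate-then-lazy-reload cache. A minimal sketch, assuming a hypothetical pdptr_cache type and a stubbed load function; the real code reads the pdptes from the vmcs on vmx or from guest memory on svm:

    #include <stdbool.h>
    #include <stdint.h>

    /* Hypothetical cache; the real state lives in struct kvm_vcpu. */
    struct pdptr_cache {
        uint64_t pdptrs[4];
        bool     valid;   /* cleared whenever the guest may have changed them */
    };

    /* Stand-in for the real backend read of the four pdptes. */
    static void load_pdptrs(struct pdptr_cache *c)
    {
        for (int i = 0; i < 4; i++)
            c->pdptrs[i] = 0;
        c->valid = true;
    }

    /* Instead of refreshing on every vm entry/exit, refresh only when a
     * value is actually needed and the cache is stale. */
    static uint64_t get_pdptr(struct pdptr_cache *c, int index)
    {
        if (!c->valid)
            load_pdptrs(c);
        return c->pdptrs[index];
    }

    int main(void)
    {
        struct pdptr_cache c = { .valid = false };

        return (int)get_pdptr(&c, 0);
    }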
-
- 28 June 2009, 1 commit
-
Committed by Jaswinder Singh Rajput

Fixes a compilation warning:

  CC arch/x86/kernel/io_delay.o
  arch/x86/kvm/paging_tmpl.h: In function ‘paging64_fetch’:
  arch/x86/kvm/paging_tmpl.h:279: warning: ‘sptep’ may be used uninitialized in this function
  arch/x86/kvm/paging_tmpl.h: In function ‘paging32_fetch’:
  arch/x86/kvm/paging_tmpl.h:279: warning: ‘sptep’ may be used uninitialized in this function

The warning is bogus (there is always at least one level), but we need to silence the compiler.

Signed-off-by: Jaswinder Singh Rajput <jaswinderrajput@gmail.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
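The usual way to quiet a bogus "may be used uninitialized" warning is a dead initialization, as in this illustrative (non-KVM) pattern:

    #include <stddef.h>

    /* gcc cannot prove the loop below executes at least once, so it warns
     * that 'p' may be used uninitialized.  The NULL initialization is dead
     * at runtime but silences the warning. */
    static int *last_entry(int *table, int levels)   /* levels is always >= 1 */
    {
        int *p = NULL;   /* dead store, present only to quiet gcc */
        for (int i = 0; i < levels; i++)
            p = &table[i];
        return p;
    }

    int main(void)
    {
        int t[4] = {0, 1, 2, 3};
        return *last_entry(t, 4) == 3 ? 0 : 1;
    }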
-
- 10 June 2009, 4 commits
-
Committed by Marcelo Tosatti

The complexity required to fix it is not worth the gains, as discussed in http://article.gmane.org/gmane.comp.emulators.kvm.devel/28649.

Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Dong, Eddie

Detect, indicate, and propagate page faults where reserved bits are set. Take care to handle the different paging modes, each of which has different sets of reserved bits.

[ avi: fix pte reserved bits for efer.nxe=0 ]

Signed-off-by: Eddie Dong <eddie.dong@intel.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
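A reduced model of the per-level reserved-bit masks involved, for 64-bit ptes on a CPU with 48 physical address bits. rsvd_bits() mirrors KVM's helper of the same name, but the mask logic here is cut down to two cases and is a sketch, not the full per-mode computation:

    #include <stdbool.h>
    #include <stdint.h>
    #include <stdio.h>

    /* Mask with bits s..e (inclusive) set, as in KVM's rsvd_bits() helper. */
    static uint64_t rsvd_bits(int s, int e)
    {
        return ((1ULL << (e - s + 1)) - 1) << s;
    }

    /* Bits above MAXPHYADDR are always reserved; bit 63 (NX) is reserved
     * when EFER.NXE=0 (the avi fix above); in a 2mb pde with PS=1, bits
     * 20:13 must be zero.  The real code covers more modes and levels. */
    static uint64_t rsvd_mask(int level, int maxphyaddr, bool nxe)
    {
        uint64_t mask = rsvd_bits(maxphyaddr, 51);

        if (!nxe)
            mask |= 1ULL << 63;
        if (level == 2)
            mask |= rsvd_bits(13, 20);
        return mask;
    }

    int main(void)
    {
        /* A pte faults with the rsvd error code if (pte & mask) != 0. */
        printf("level-2 large-pde mask, nxe=0: %#llx\n",
               (unsigned long long)rsvd_mask(2, 48, false));
        return 0;
    }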
-
Committed by Eddie Dong

The original one applied to the code before refactoring.

Signed-off-by: Yaozu (Eddie) Dong <eddie.dong@intel.com>
Signed-off-by: Sheng Yang <sheng@linux.intel.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Joerg Roedel

There is no reason to update the shadow pte here, because the guest pte is only being marked dirty.

Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
-
- 24 March 2009, 5 commits
-
Committed by Andrea Arcangeli

When kvm emulates an invlpg instruction, it can drop a shadow pte but leave the guest tlbs intact. This can cause memory corruption when swapping out: without this fix, another cpu can still write to a freed host physical page. An smp tlb flush must happen before mmu_lock is released whenever rmap_remove is called, because the VM will take mmu_lock before it can finally add the page to the freelist after swapout. The mmu notifier makes it safe to flush the tlb after freeing the page (otherwise it would never be safe), so we can do a single flush for multiple invalidated sptes.

Cc: stable@kernel.org
Signed-off-by: Andrea Arcangeli <aarcange@redhat.com>
Acked-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
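The invariant the patch establishes, flush before dropping mmu_lock whenever an spte was removed, can be modeled in a few lines. This is a userspace pthread sketch; zap_spte() and flush_remote_tlbs() are stand-ins for the real rmap_remove path and kvm_flush_remote_tlbs():

    #include <pthread.h>
    #include <stdbool.h>

    static pthread_mutex_t mmu_lock = PTHREAD_MUTEX_INITIALIZER;

    static bool zap_spte(unsigned long gva)   /* stand-in: rmap_remove path */
    {
        (void)gva;
        return true;                          /* an spte was dropped */
    }

    static void flush_remote_tlbs(void)       /* stand-in for the real flush */
    {
    }

    /* If any spte was dropped, the remote tlb flush happens *before*
     * mmu_lock is released, so no other cpu can keep writing through a
     * stale translation to a page about to be freed.  One flush covers
     * all sptes invalidated under the lock. */
    static void emulate_invlpg(unsigned long gva)
    {
        bool need_flush;

        pthread_mutex_lock(&mmu_lock);
        need_flush = zap_spte(gva);
        if (need_flush)
            flush_remote_tlbs();
        pthread_mutex_unlock(&mmu_lock);
    }

    int main(void)
    {
        emulate_invlpg(0xdeadb000UL);
        return 0;
    }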
-
Committed by Joerg Roedel

In the paging_fetch function, rmap_remove is called after a large pte has been set to non-present. This causes rmap_remove not to drop the reference to the large page, so that page is leaked.

Cc: stable@kernel.org
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Acked-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Avi Kivity

This actually describes what is going on, rather than alerting the reader that something strange is going on.

Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Avi Kivity

Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Avi Kivity

Effectively reverts to the pre-walk_shadow() version, but now with the reusable for_each().

Signed-off-by: Avi Kivity <avi@redhat.com>
-
- 31 December 2008, 3 commits
-
Committed by Marcelo Tosatti

The invlpg and sync walkers lack knowledge of large host sptes, descending to a non-existent pagetable level. Stop at the directory level in such a case. Fixes SMP Windows XP with hugepages.

Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
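What "stop at directory level" means in walker terms: if an entry has the PS bit set, it is the leaf, and descending further would reinterpret page-frame bits as a table pointer. A self-contained model, with sptes holding host pointers directly as a simplification of the real pfn-based sptes:

    #include <stdalign.h>
    #include <stddef.h>
    #include <stdint.h>

    #define PT_PRESENT_MASK   (1ULL << 0)
    #define PT_PAGE_SIZE_MASK (1ULL << 7)   /* PS: entry maps a large page */

    /* Stop as soon as the entry is a large-page mapping; there is no
     * lower table level to descend into. */
    static uint64_t *walk_to_leaf(uint64_t *table, uint64_t addr, int start_level)
    {
        for (int level = start_level; level > 1; level--) {
            uint64_t *sptep = &table[(addr >> (12 + 9 * (level - 1))) & 0x1ff];
            if (!(*sptep & PT_PRESENT_MASK))
                return NULL;
            if (*sptep & PT_PAGE_SIZE_MASK)
                return sptep;   /* large host spte: stop at directory level */
            table = (uint64_t *)(uintptr_t)(*sptep & ~0xfffULL);
        }
        return &table[(addr >> 12) & 0x1ff];
    }

    int main(void)
    {
        static alignas(4096) uint64_t pml4[512], pdpt[512];
        uint64_t addr = 1ULL << 30;   /* pdpt index 1 */

        pml4[0] = (uint64_t)(uintptr_t)pdpt | PT_PRESENT_MASK;
        pdpt[1] = PT_PRESENT_MASK | PT_PAGE_SIZE_MASK;   /* "1gb" host mapping */

        /* The walk stops at level 3 instead of descending past it. */
        return walk_to_leaf(pml4, addr, 4) == &pdpt[1] ? 0 : 1;
    }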
-
Committed by Marcelo Tosatti

If the guest executes invlpg, peek into the pagetable and attempt to prepopulate the shadow entry. Also stop dirty fault updates from interfering with the fork detector. 2% improvement on RHEL3/AIM7.

Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Marcelo Tosatti

Skip syncing global pages on cr3 switch (but not on cr4/cr0 changes). This is important for Linux 32-bit guests with PAE, where the kmap page is marked as global.

Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
-
- 26 November 2008, 1 commit
-
Committed by Marcelo Tosatti

It is possible for a shadow page to have a parent link pointing to a freed page. When zapping a high-level table, kvm_mmu_page_unlink_children fails to remove the parent_pte link. For that to happen, the child must be unreachable via the shadow tree, which can happen in shadow_walk_entry if the guest pte was modified in between walk() and fetch(). Remove the parent pte reference in such a case. Possible cause for the oops in bug #2217430.

Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
-
- 15 October 2008, 8 commits
-
Committed by Marcelo Tosatti

Allow guest pagetables to go out of sync. Instead of emulating write accesses to guest pagetables, or unshadowing them, we un-write-protect the page table and allow the guest to modify it at will. We rely on invlpg executions to synchronize individual ptes, and will synchronize the entire pagetable on tlb flushes.

Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Marcelo Tosatti

With pages out of sync, invlpg needs to be trapped. For now, simply nuke the entry. Untested on AMD.

Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Marcelo Tosatti

Examine the guest pagetable and bring the shadow back in sync. The caller is responsible for a local TLB flush before re-entering guest mode.

Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
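A sketch of such a sync pass, under the assumption that stale entries are simply dropped and refaulted; the shadow_page layout and zap_spte() are illustrative, and the real code also revalidates protections in place rather than always zapping:

    #include <stdint.h>

    #define PT_PRESENT_MASK (1ULL << 0)
    #define ENTRIES 512

    /* Illustrative layout: cached guest ptes alongside the sptes derived
     * from them.  The real kvm_mmu_page is considerably richer. */
    struct shadow_page {
        uint64_t gptes[ENTRIES];
        uint64_t sptes[ENTRIES];
    };

    static void zap_spte(struct shadow_page *sp, int i)   /* stand-in */
    {
        sp->sptes[i] = 0;
    }

    /* Compare each cached guest pte against the live guest pagetable and
     * drop sptes whose guest pte changed; the next fault rebuilds them.
     * The caller must flush the local tlb before re-entering guest mode. */
    static void sync_page(struct shadow_page *sp, const uint64_t *guest_table)
    {
        for (int i = 0; i < ENTRIES; i++) {
            if (!(sp->sptes[i] & PT_PRESENT_MASK))
                continue;
            if (guest_table[i] != sp->gptes[i]) {
                zap_spte(sp, i);
                sp->gptes[i] = guest_table[i];
            }
        }
    }

    int main(void)
    {
        static struct shadow_page sp;
        static uint64_t guest[ENTRIES];

        sp.gptes[3] = 0x1000 | 1;
        sp.sptes[3] = 0x2000 | 1;
        guest[3]    = 0x5000 | 1;     /* guest rewrote pte 3 */

        sync_page(&sp, guest);
        return sp.sptes[3] == 0 ? 0 : 1;
    }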
-
Committed by Marcelo Tosatti

It is necessary to flush all TLBs when a large spte entry is overwritten with a normal page directory pointer.

Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Marcelo Tosatti

Convert gfn_to_pfn to use get_user_pages_fast, which can do lockless pagetable lookups on x86. Kernel compilation on a 4-way guest is 3.7% faster on VMX.

Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Sheng Yang

EPT is 4-level by default in 32-bit PAE mode (48 bits), but the addr parameter of kvm_shadow_walk->entry() only accepts an unsigned long as the virtual address, which is 32 bits wide in 32-bit PAE. This results in a SHADOW_PT_INDEX() overflow when trying to fetch the level-4 index. Fix it by extending kvm_shadow_walk->entry() to accept a 64-bit addr parameter.

Signed-off-by: Sheng Yang <sheng.yang@intel.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>
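The bug is a plain C shift-width problem: the level-4 index lives at bits 47:39, and shifting a 32-bit unsigned long right by 39 exceeds the type width. A small demonstration, with PT64_INDEX as a stand-in for the SHADOW_PT_INDEX() computation:

    #include <stdint.h>
    #include <stdio.h>

    /* 9 index bits per level, 4kb pages: level-4 index is at bits 47:39. */
    #define PT64_INDEX(addr, level) \
        (((addr) >> (12 + 9 * ((level) - 1))) & 0x1ff)

    int main(void)
    {
        uint64_t addr = 0x0000008000000000ULL;   /* bit 39 set */

        /* With a 64-bit address the level-4 index is computed correctly: */
        printf("level-4 index: %llu\n",
               (unsigned long long)PT64_INDEX(addr, 4));   /* prints 1 */

        /* Had 'addr' been a 32-bit unsigned long (as on i386), the '>> 39'
         * would exceed the type width: undefined behaviour, and in practice
         * a garbage index.  Hence the fix: pass the address as a u64. */
        return 0;
    }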
-
Committed by Avi Kivity

Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity

It is not specific to the paging mode, so it can be made global (and reusable).

Signed-off-by: Avi Kivity <avi@qumranet.com>
-
- 25 August 2008, 1 commit
-
Committed by Avi Kivity

The shadow code assigns a pte directly in one place, which is nonatomic on i386 and can cause random memory references. Fix by using an atomic setter.

Signed-off-by: Avi Kivity <avi@qumranet.com>
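Why the plain assignment is dangerous: on i386 a 64-bit PAE pte store compiles to two 32-bit stores, so a concurrent hardware page walk can observe a torn pte. The kernel-side fix would use an atomic 64-bit setter such as set_64bit() (cmpxchg8b); the sketch below is the portable-C equivalent, not the kernel's code:

    #include <stdint.h>

    typedef uint64_t pte_t;

    /* An atomic 64-bit store closes the torn-write window that two
     * separate 32-bit stores would leave open. */
    static void set_pte_atomic(pte_t *ptep, pte_t val)
    {
        __atomic_store_n(ptep, val, __ATOMIC_RELAXED);
    }

    int main(void)
    {
        pte_t pte = 0;

        set_pte_atomic(&pte, 0x1000 | 1);   /* frame | present, in one shot */
        return pte & 1 ? 0 : 1;
    }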
-
- 29 July 2008, 1 commit
-
Committed by Andrea Arcangeli

Synchronize changes to host virtual addresses which are part of a KVM memory slot to the KVM shadow mmu. This allows pte operations like swapping, page migration, and madvise() to work transparently with KVM.

Signed-off-by: Andrea Arcangeli <andrea@qumranet.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>
-
- 20 July 2008, 1 commit
-
Committed by Avi Kivity

Instead of reading each pte individually, read 256 bytes worth of ptes and batch process them.

Signed-off-by: Avi Kivity <avi@qumranet.com>
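The batching amounts to one guest-memory copy followed by local processing of the 32 ptes it contains. A sketch, with read_guest() as a stand-in for KVM's guest-memory read helper:

    #include <stdint.h>
    #include <string.h>

    #define BATCH_BYTES 256
    #define PTES_PER_BATCH (BATCH_BYTES / sizeof(uint64_t))   /* 32 ptes */

    /* Stand-in: one bulk copy replaces 32 individual guest-memory reads. */
    static void read_guest(void *dst, const void *guest_src, size_t len)
    {
        memcpy(dst, guest_src, len);
    }

    static void batch_process_ptes(const uint64_t *guest_table)
    {
        uint64_t batch[PTES_PER_BATCH];

        read_guest(batch, guest_table, sizeof(batch));
        for (size_t i = 0; i < PTES_PER_BATCH; i++) {
            /* ...speculatively update the spte for batch[i]... */
            (void)batch[i];
        }
    }

    int main(void)
    {
        static uint64_t guest_table[512];

        batch_process_ptes(guest_table);
        return 0;
    }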
-
- 07 June 2008, 1 commit
-
Committed by Avi Kivity

Signed-off-by: Avi Kivity <avi@qumranet.com>
-
- 27 April 2008, 5 commits
-
Committed by Anthony Liguori

This patch introduces a gfn_to_pfn() function and corresponding functions like kvm_release_pfn_dirty(). Using these new functions, we can modify the x86 MMU to no longer assume that it can always get a struct page for any given gfn. We don't want to eliminate gfn_to_page() entirely, because a number of places assume they can do gfn_to_page() and then kmap() the results. When we support IO memory, gfn_to_page() will fail for IO pages although gfn_to_pfn() will succeed. This does not implement support for avoiding reference counting for reserved RAM or for IO memory. However, it should make those things fairly straightforward. Since we're only introducing new common symbols, I don't think it will break the non-x86 architectures, but I haven't tested those. I've tested Intel, AMD, NPT, and hugetlbfs with Windows and Linux guests.

[ avi: fix overflow when shifting left pfns by adding casts ]

Signed-off-by: Anthony Liguori <aliguori@us.ibm.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Marcelo Tosatti

Unify slots_lock acquisition around vcpu_run(). This is simpler and less error-prone. Also fix some callsites that were not grabbing the lock properly.

[ avi: drop slots_lock while in guest mode to avoid holding the lock for indefinite periods ]

Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Avi Kivity

If we populate a shadow pte due to a fault (and not speculatively due to a pte write), then we can set the accessed bit on it, as we know it will be set immediately on the next guest instruction. This saves a read-modify-write operation.

Signed-off-by: Avi Kivity <avi@qumranet.com>
-
Committed by Harvey Harrison

__FUNCTION__ is gcc-specific; use __func__ instead.

Signed-off-by: Harvey Harrison <harvey.harrison@gmail.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>
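For reference, the two spellings behave identically under gcc, but only __func__ is standard C99:

    #include <stdio.h>

    /* __func__ expands to the enclosing function's name, exactly like
     * the gcc-specific __FUNCTION__ it replaces. */
    static void report(void)
    {
        printf("%s: unexpected pte state\n", __func__);
    }

    int main(void)
    {
        report();
        return 0;
    }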
-
Committed by Marcelo Tosatti

Create large-page mappings if the guest PTEs are marked as such and the underlying memory is hugetlbfs-backed. If the large page contains write-protected pages, a large pte is not used. Gives a consistent 2% improvement for data copies on a ram-mounted filesystem, without NPT/EPT. Anthony measures a 4% improvement on 4-way kernbench, with NPT.

Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>
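The eligibility test described above reduces to three checks. A sketch with the two host-side predicates stubbed out; their names are hypothetical, not KVM's actual helpers:

    #include <stdbool.h>
    #include <stdint.h>

    #define PT_PAGE_SIZE_MASK (1ULL << 7)

    /* Hypothetical stand-ins for the host-side checks. */
    static bool host_largepage_backed(uint64_t gfn)
    {
        (void)gfn;
        return true;    /* pretend the slot is hugetlbfs-backed */
    }

    static bool range_has_write_protected_page(uint64_t gfn)
    {
        (void)gfn;
        return false;   /* pretend nothing in the range shadows a pagetable */
    }

    /* A large spte is used only when all three conditions hold. */
    static bool can_use_large_pte(uint64_t guest_pde, uint64_t gfn)
    {
        return (guest_pde & PT_PAGE_SIZE_MASK)
            && host_largepage_backed(gfn)
            && !range_has_write_protected_page(gfn);
    }

    int main(void)
    {
        uint64_t pde = PT_PAGE_SIZE_MASK | 1;   /* present large guest pde */

        return can_use_large_pte(pde, 0x1000) ? 0 : 1;
    }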
-