- 31 December 2008, 40 commits
-
Committed by Hollis Blanchard
Store shadow TLB entries in memory, but only use them on host context switch (instead of on every guest entry). This improves performance for most workloads on 440 by reducing the guest TLB miss rate. Signed-off-by: Hollis Blanchard <hollisb@us.ibm.com> Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Hollis Blanchard
Formerly, we maintained a per-vcpu shadow TLB and loaded this array into the hardware TLB on every entry to the guest. This consumed 1280 bytes of memory (64 entries of 16 bytes plus a struct page pointer each), and also required some assembly to loop over the array on every entry. Instead of saving a copy in memory, we can just store shadow mappings directly into the hardware TLB, accepting that the host kernel will clobber these as part of the normal 440 TLB round robin. When we do that we need less than half the memory, and we have decreased the exit handling time for all guest exits, at the cost of an increased number of TLB misses because the host overwrites some guest entries. These savings will grow on processors with larger TLBs or ones that implement intelligent flush instructions like tlbivax (which avoid the need to walk arrays in software). In addition to the memory savings and the code simplification, we have a greater chance of leaving other host userspace mappings in the TLB, instead of forcing all subsequent tasks to re-fault all their mappings. Signed-off-by: Hollis Blanchard <hollisb@us.ibm.com> Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Hollis Blanchard
KVM currently ignores the host's round-robin TLB eviction selection, instead maintaining its own TLB state and its own round-robin index. However, by participating in the normal 44x TLB selection, we can drop the alternate TLB processing in KVM. This results in a significant performance improvement, since that processing currently must be done on *every* guest exit. Accordingly, KVM needs to be able to access and increment tlb_44x_index. (KVM on 440 cannot be a module, so there is no need to export this symbol.) Signed-off-by: Hollis Blanchard <hollisb@us.ibm.com> Acked-by: Josh Boyer <jwboyer@linux.vnet.ibm.com> Signed-off-by: Avi Kivity <avi@redhat.com>
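A minimal sketch of the shared round-robin idea. The names tlb_44x_index and PPC44x_TLB_SIZE come from the 44x port of this era; the helper itself is illustrative, not the literal patch:

```c
/* tlb_44x_index is the host's round-robin victim pointer, made
 * visible to KVM by this patch (44x KVM cannot be a module). */
extern unsigned int tlb_44x_index;

static unsigned int kvmppc_44x_next_victim(void)
{
	unsigned int victim = tlb_44x_index;

	/* Advance the *shared* pointer, wrapping at the TLB size, so
	 * KVM and the bare-metal miss handlers evict in lockstep. */
	tlb_44x_index = (victim + 1) % PPC44x_TLB_SIZE;
	return victim;
}
```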
-
Committed by Hollis Blanchard
KVM on 440 has always been able to handle large guest mappings with 4K host pages -- we must, since the guest kernel uses 256MB mappings. This patch makes KVM work when the host has large pages too (tested with 64K). Signed-off-by: Hollis Blanchard <hollisb@us.ibm.com> Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Hannes Eder
Impact: make a global symbol static. arch/x86/kvm/vmx.c:134:3: warning: symbol 'vmx_capability' was not declared. Should it be static? Signed-off-by: Hannes Eder <hannes@hanneseder.net> Signed-off-by: Avi Kivity <avi@redhat.com>
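The fix itself is one keyword; a sketch of the declaration in vmx.c (the field names are an assumption):

```c
/* File-local now: nothing outside vmx.c references the symbol, so
 * internal linkage silences sparse and shrinks the symbol table. */
static struct vmx_capability {
	u32 ept;
	u32 vpid;
} vmx_capability;
```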
-
Committed by Avi Kivity
Noticed by Guillaume Thouvenin. Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Avi Kivity
Set operand type and size to get correct writeback behavior. Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Avi Kivity
'ret' did not set the operand type or size for the destination, so writeback ignored it. Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Avi Kivity
Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Avi Kivity
Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Avi Kivity
Switch the 'pop r/m' instruction to use the new function. Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Christian Borntraeger
The s390 backend of kvm never calls kvm_vcpu_uninit. This causes a memory leak of vcpu->run pages. Let's call kvm_vcpu_uninit in kvm_arch_vcpu_destroy to free vcpu->run. Signed-off-by: Christian Borntraeger <borntraeger@de.ibm.com> Acked-by: Carsten Otte <cotte@de.ibm.com> Signed-off-by: Avi Kivity <avi@redhat.com>
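A hedged sketch of what the fixed destroy path looks like; the sie_block detail is an assumption about the s390 per-vcpu arch state:

```c
void kvm_arch_vcpu_destroy(struct kvm_vcpu *vcpu)
{
	free_page((unsigned long)vcpu->arch.sie_block);	/* arch state */
	kvm_vcpu_uninit(vcpu);	/* previously missing: frees vcpu->run */
	kfree(vcpu);
}
```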
-
Committed by Christian Borntraeger
Currently it is impossible to unload the kvm module on s390. This patch fixes kvm_arch_destroy_vm to release all cpus, which makes it possible to unload the module. In addition, we stop messing with the module refcount in arch code. Signed-off-by: Christian Borntraeger <borntraeger@de.ibm.com> Acked-by: Carsten Otte <cotte@de.ibm.com> Signed-off-by: Avi Kivity <avi@redhat.com>
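A sketch of the shape of this fix; other arch teardown (debug facilities, the SCA page) is elided:

```c
static void kvm_free_vcpus(struct kvm *kvm)
{
	unsigned int i;

	/* Destroy every vcpu so nothing keeps the module pinned. */
	for (i = 0; i < KVM_MAX_VCPUS; ++i) {
		if (kvm->vcpus[i]) {
			kvm_arch_vcpu_destroy(kvm->vcpus[i]);
			kvm->vcpus[i] = NULL;
		}
	}
}

void kvm_arch_destroy_vm(struct kvm *kvm)
{
	kvm_free_vcpus(kvm);	/* releases all cpus; no refcount games */
	kvm_free_physmem(kvm);
	kfree(kvm);
}
```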
-
Committed by Avi Kivity
No need to repeat the same assembly block over and over. Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Avi Kivity
Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Marcelo Tosatti
The write protect verification in set_spte is unnecessary for page sync. It's guaranteed that, if the unsync spte was writable, the target page does not have a write-protected shadow (if it did, the spte would have been write-protected under mmu_lock by rmap_write_protect before). The same reasoning applies to mark_page_dirty: the gfn has already been marked dirty via the pagefault path. The cost of the hash table and memslot lookups is quite significant if the workload is pagetable-write-intensive, resulting in increased mmu_lock contention. Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com> Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Glauber Costa
Currently, we only set the KVM paravirt signature in the CONFIG_KVM_GUEST case. However, it is possible to have it turned off while CONFIG_KVM_CLOCK is turned on. This is also a paravirt case, and should be reported accordingly. Signed-off-by: Glauber Costa <glommer@redhat.com> Signed-off-by: Avi Kivity <avi@redhat.com>
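A hedged sketch of the idea in arch/x86/kernel/kvmclock.c: set the paravirt signature from the kvmclock path too. The placement inside kvmclock_init() is an assumption:

```c
void __init kvmclock_init(void)
{
	if (!kvm_para_available())
		return;

	/* ... register the kvmclock clocksource ... */

	/* kvmclock alone is paravirtualization too; advertise it even
	 * when CONFIG_KVM_GUEST is off. */
	pv_info.paravirt_enabled = 1;
	pv_info.name = "KVM";
}
```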
-
Committed by Avi Kivity
If we're injecting an interrupt and another one is pending, request an interrupt window notification so we don't have excess latency on the second interrupt. This shouldn't happen in practice since an EOI will be issued, giving a second chance to request an interrupt window, but... Signed-off-by: Avi Kivity <avi@redhat.com>
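A sketch of the injection path described above, using hook names from this era's vmx.c as assumptions:

```c
if (vcpu->arch.interrupt.pending) {
	vmx_inject_irq(vcpu, vcpu->arch.interrupt.nr);
	/* Another interrupt already queued? Ask for an interrupt
	 * window now rather than relying on the EOI for a second
	 * chance. */
	if (kvm_cpu_has_interrupt(vcpu))
		enable_irq_window(vcpu);
}
```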
-
Committed by Xiantao Zhang
Use tabs for indentation in vmm_ivt.S. Signed-off-by: Xiantao Zhang <xiantao.zhang@intel.com> Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Xiantao Zhang
Since the vmm runs in an isolated address space and is just a copy of the host's kvm-intel module, once the vmm crashes we can kill only the guests running on it instead of crashing the whole kernel. Signed-off-by: Xiantao Zhang <xiantao.zhang@intel.com> Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Xiantao Zhang
Use the printk infrastructure to print out some debug info once a VM crashes. Signed-off-by: Xiantao Zhang <xiantao.zhang@intel.com> Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Xiantao Zhang
The kvm-intel module is relocated to an address space isolated from the kernel, so it can't call the host kernel's printk for debugging. Implement printk in the module to output vmm debug info. Signed-off-by: Xiantao Zhang <xiantao.zhang@intel.com> Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Eduardo Habkost
On emergency_restart, we may need to use an NMI to disable virtualization on all CPUs. We do that using nmi_shootdown_cpus() if VMX is enabled. Note: with this patch, we will run the NMI stuff only when the CPU where emergency_restart() was called has VMX enabled. This should work in most cases because KVM enables VMX on all CPUs, but we may miss the small window where KVM is doing that. Also, I don't know if all code using VMX out there always enables VMX on all CPUs the way KVM does. We have two other alternatives for that:
a) Have an API that all code that enables VMX on any CPU should use to tell the kernel core that it is going to enable VMX on the CPUs.
b) Always call nmi_shootdown_cpus() if the CPU supports VMX. This is a bit intrusive and more risky, as it would run nmi_shootdown_cpus() on emergency_reboot() even on systems where virtualization is never enabled.
Finding a proper point to hook the nmi_shootdown_cpus() call isn't trivial, as the non-emergency machine_restart() (which doesn't need the NMI tricks) uses machine_emergency_restart() directly. The solution that makes this work without adding a new function or argument to machine_ops is to set a 'reboot_emergency' flag that tells native_machine_emergency_restart() whether it needs to do the virt cleanup. Signed-off-by: Eduardo Habkost <ehabkost@redhat.com> Signed-off-by: Avi Kivity <avi@redhat.com>
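A sketch of the reboot_emergency plumbing described above (abridged; emergency_vmx_disable_all() is taken as the name of the shootdown helper, an assumption):

```c
static int reboot_emergency;

/* Route both reboot flavors through one helper that records intent. */
static void __machine_emergency_restart(int emergency)
{
	reboot_emergency = emergency;
	machine_ops.emergency_restart();
}

static void native_machine_restart(char *__unused)
{
	__machine_emergency_restart(0);	/* ordinary reboot: no NMI games */
}

static void native_machine_emergency_restart(void)
{
	if (reboot_emergency)
		emergency_vmx_disable_all();	/* NMI shootdown + vmxoff */
	/* ... the usual restart sequence ... */
}
```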
-
Committed by Eduardo Habkost
We need to disable virtualization extensions on all CPUs before booting the kdump kernel; otherwise the kdump kernel boot will fail, and rebooting after the kdump kernel did its task may also fail. We do it using cpu_emergency_vmxoff() and cpu_emergency_svm_disable(), which should always work, because those functions check whether the CPU supports SVM or VMX before doing their tasks. Signed-off-by: Eduardo Habkost <ehabkost@redhat.com> Signed-off-by: Avi Kivity <avi@redhat.com>
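A sketch of the crash-shutdown hook; the function name is approximate, but the two helpers are the ones this series adds:

```c
/* Run on each CPU (via NMI) and on the crashing CPU itself before the
 * kdump kernel boots. Both calls are safe unconditionally because the
 * helpers check for VMX/SVM support before touching anything. */
static void crash_disable_virtualization(void)
{
	cpu_emergency_vmxoff();		/* no-op unless VMX is usable */
	cpu_emergency_svm_disable();	/* no-op unless SVM is usable */
}
```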
-
Committed by Eduardo Habkost
This function can be used by the reboot or kdump code to forcibly disable SVM on the CPU. Signed-off-by: Eduardo Habkost <ehabkost@redhat.com> Signed-off-by: Avi Kivity <avi@redhat.com>
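The wrapper is small enough to show in full; this mirrors the asm/virtext.h helper, modulo details:

```c
static inline void cpu_emergency_svm_disable(void)
{
	if (cpu_has_svm(NULL))	/* NULL: caller doesn't want the reason */
		cpu_svm_disable();
}
```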
-
Committed by Eduardo Habkost
Create the cpu_svm_disable() function. Signed-off-by: Eduardo Habkost <ehabkost@redhat.com> Signed-off-by: Avi Kivity <avi@redhat.com>
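Roughly the asm/virtext.h implementation: clear the host save area pointer and the EFER.SVME bit, which turns SVM off on this CPU:

```c
static inline void cpu_svm_disable(void)
{
	uint64_t efer;

	wrmsrl(MSR_VM_HSAVE_PA, 0);	/* drop the host save area */
	rdmsrl(MSR_EFER, efer);
	wrmsrl(MSR_EFER, efer & ~EFER_SVME);	/* disable SVM */
}
```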
-
Committed by Eduardo Habkost
Use a trick to keep the printk()s on has_svm() working as before. gcc will take care of not generating code for the 'msg' stuff when the function is called with a NULL msg argument. Signed-off-by: Eduardo Habkost <ehabkost@redhat.com> Signed-off-by: Avi Kivity <avi@redhat.com>
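A sketch of the trick: callers that want a diagnostic pass a string pointer, emergency callers pass NULL, and the compiler drops the message stores in the NULL case. Constant names follow svm.h as assumptions:

```c
static inline int cpu_has_svm(const char **msg)
{
	uint32_t eax, ebx, ecx, edx;

	if (boot_cpu_data.x86_vendor != X86_VENDOR_AMD) {
		if (msg)
			*msg = "not amd";
		return 0;
	}

	cpuid(0x80000000, &eax, &ebx, &ecx, &edx);
	if (eax < SVM_CPUID_FUNC) {	/* 0x8000000a */
		if (msg)
			*msg = "can't execute cpuid_8000000a";
		return 0;
	}

	cpuid(0x80000001, &eax, &ebx, &ecx, &edx);
	if (!(ecx & (1 << SVM_CPUID_FEATURE_SHIFT))) {
		if (msg)
			*msg = "svm not available";
		return 0;
	}
	return 1;
}
```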
-
Committed by Eduardo Habkost
Add cpu_emergency_vmxoff() and its friends: cpu_vmx_enabled() and __cpu_emergency_vmxoff(). Signed-off-by: Eduardo Habkost <ehabkost@redhat.com> Signed-off-by: Avi Kivity <avi@redhat.com>
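The three helpers, roughly as they land in asm/virtext.h (read_cr4() is the accessor of this era):

```c
static inline int cpu_vmx_enabled(void)
{
	return read_cr4() & X86_CR4_VMXE;
}

/* Disable VMX if it is already on; no support check. */
static inline void __cpu_emergency_vmxoff(void)
{
	if (cpu_vmx_enabled())
		cpu_vmxoff();
}

/* Disable VMX if the CPU supports it at all; safe to call anywhere. */
static inline void cpu_emergency_vmxoff(void)
{
	if (cpu_has_vmx())
		__cpu_emergency_vmxoff();
}
```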
-
Committed by Eduardo Habkost
Along with some comments on why it is different from the core cpu_vmxoff() function. Signed-off-by: Eduardo Habkost <ehabkost@redhat.com> Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Eduardo Habkost
Unfortunately we can't use exactly the same code as vmx hardware_disable(), because the KVM function uses the __kvm_handle_fault_on_reboot() tricks. Signed-off-by: Eduardo Habkost <ehabkost@redhat.com> Signed-off-by: Avi Kivity <avi@redhat.com>
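A sketch of the standalone cpu_vmxoff() helper; the real header encodes VMXOFF as a .byte sequence for old assemblers, and it needs no fault-on-reboot wrapper since the caller is the reboot path itself:

```c
static inline void cpu_vmxoff(void)
{
	asm volatile ("vmxoff" : : : "cc");	/* leave VMX root mode */
	write_cr4(read_cr4() & ~X86_CR4_VMXE);	/* then turn VMX off */
}
```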
-
Committed by Eduardo Habkost
It will be used by core code on kdump and reboot, to disable vmx if needed. Signed-off-by: Eduardo Habkost <ehabkost@redhat.com> Signed-off-by: Avi Kivity <avi@redhat.com>
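Presumably this is the cpu_has_vmx() test used to decide whether disabling is needed; a sketch checking CPUID.1:ECX bit 5:

```c
static inline int cpu_has_vmx(void)
{
	unsigned long ecx = cpuid_ecx(1);

	return test_bit(5, &ecx);	/* CPUID.1:ECX.VMX[bit 5] -> VT */
}
```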
-
Committed by Eduardo Habkost
Those definitions will be used by code outside KVM, so move them out of a KVM-specific source file. Those definitions are used only in kvm/vmx.c, which already includes asm/vmx.h, so they can be moved safely. Signed-off-by: Eduardo Habkost <ehabkost@redhat.com> Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Eduardo Habkost
svm.h will be used by core code that is independent of KVM, so I am moving it outside the arch/x86/kvm directory. Signed-off-by: Eduardo Habkost <ehabkost@redhat.com> Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Eduardo Habkost
vmx.h will be used by core code that is independent of KVM, so I am moving it outside the arch/x86/kvm directory. Signed-off-by: Eduardo Habkost <ehabkost@redhat.com> Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Hollis Blanchard
We used to defer invalidating userspace TLB entries until jumping out of the kernel. This was causing MMU weirdness most easily triggered by using a pipe in the guest, e.g. "dmesg | tail". I believe the problem was that after the guest kernel changed the PID (part of context switch), the old process's mappings were still present, and so copy_to_user() on the "return to new process" path ended up using stale mappings. Testing with large pages (64K) exposed the problem, probably because with 4K pages, pressure on the TLB faulted all of process A's mappings out before the guest kernel could insert any for process B. Signed-off-by: Hollis Blanchard <hollisb@us.ibm.com> Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Hollis Blanchard
Bare metal Linux on 440 can "overmap" RAM in the kernel linear map, so that it can use large (256MB) mappings even if memory isn't a multiple of 256MB. To prevent the hardware prefetcher from loading from an invalid physical address through that mapping, it's marked Guarded. However, KVM must ensure that all guest mappings are backed by real physical RAM (since a deliberate access through a guarded mapping could still cause a machine check). Accordingly, we don't need to make our mappings guarded, so let's allow prefetching as the designers intended. Curiously this patch didn't affect performance at all on the quick test I tried, but it's clearly the right thing to do anyway and may improve other workloads. Signed-off-by: Hollis Blanchard <hollisb@us.ibm.com> Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Hollis Blanchard
We have an accessor; might as well use it. Signed-off-by: Hollis Blanchard <hollisb@us.ibm.com> Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Xiantao Zhang
Use the kernel's corresponding macro instead. Signed-off-by: Xiantao Zhang <xiantao.zhang@intel.com> Signed-off-by: Avi Kivity <avi@redhat.com>
-
Committed by Hollis Blanchard
Make sure that CONFIG_KVM cannot be selected without processor support (currently, 440 is the only processor implementation available). Signed-off-by: Hollis Blanchard <hollisb@us.ibm.com> Signed-off-by: Avi Kivity <avi@redhat.com>
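A hedged sketch of the Kconfig shape (symbol names and selects assumed): KVM stops being user-selectable on its own and is instead selected by a processor implementation:

```
config KVM
	bool
	select PREEMPT_NOTIFIERS
	select ANON_INODES

config KVM_440
	bool "KVM support for PowerPC 440 processors"
	depends on 44x
	select KVM
```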
-
Committed by Nitin A Kamble
The code that traverses the cpuid data array to count leaves of a given type is currently broken. This patch fixes two things in it:
1. Set the first counting entry's KVM_CPUID_FLAG_STATE_READ_NEXT flag. Without it, the code will never find a valid entry.
2. The stop condition in the for loop that looks for the next unflagged entry is broken. It needs to stop when it finds a matching entry; in the case of a count of 1, that will be the same entry found in this iteration.
Signed-off-by: Nitin A Kamble <nitin.a.kamble@intel.com> Signed-off-by: Avi Kivity <avi@redhat.com>
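A hedged sketch of the fixed traversal loop; the function and flag names follow the x86 cpuid code of this era, but treat the details as illustrative:

```c
static int move_to_next_stateful_cpuid_entry(struct kvm_vcpu *vcpu, int i)
{
	struct kvm_cpuid_entry2 *e = &vcpu->arch.cpuid_entries[i];
	int j, nent = vcpu->arch.cpuid_nent;

	e->flags &= ~KVM_CPUID_FLAG_STATE_READ_NEXT;
	/* Scan with wrap-around and stop at the first same-function
	 * entry; with a count of 1 that is entry i itself. */
	for (j = i + 1; ; j = (j + 1) % nent) {
		struct kvm_cpuid_entry2 *ej = &vcpu->arch.cpuid_entries[j];

		if (ej->function == e->function) {
			ej->flags |= KVM_CPUID_FLAG_STATE_READ_NEXT;
			return j;
		}
	}
}
```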
-