提交 · 44e3ff32ac229a10a30b7b840f092f5b32a5f72a · openeuler / raspberrypi-kernel

21 7月, 2007 3 次提交

KVM: x86 emulator: implement rdmsr and wrmsr · 35f3f286

由 Avi Kivity 提交于 7月 17, 2007

Allow real-mode emulation of rdmsr and wrmsr.  This allows smp Windows to
boot, presumably for its sipi trampoline.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

35f3f286

KVM: Fix memory slot management functions for guest smp · 90cb0529

由 Avi Kivity 提交于 7月 17, 2007

The memory slot management functions were oriented against vcpu 0, where
they should be kvm-wide. This causes hangs starting X on guest smp.

Fix by making the functions (and resultant tail in the mmu) non-vcpu-specific.
Unfortunately this reduces the efficiency of the mmu object cache a bit. We
may have to revisit this later.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

90cb0529

KVM: MMU: Store nx bit for large page shadows · d55e2cb2

由 Avi Kivity 提交于 7月 10, 2007

We need to distinguish between large page shadows which have the nx bit set
and those which don't. The problem shows up when booting a newer smp Linux
kernel, where the trampoline page (which is in real mode, which uses the
same shadow pages as large pages) is using the same mapping as a kernel data
page, which is mapped using nx, causing kvm to spin on that page.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

d55e2cb2

16 7月, 2007 19 次提交

KVM: Add support for in-kernel pio handlers · 74906345

由 Eddie Dong 提交于 6月 19, 2007

Useful for the PIC and PIT.
Signed-off-by: NYaozu (Eddie) Dong <eddie.dong@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

74906345

G
KVM: Adds support for in-kernel mmio handlers · 2eeb2e94
由 Gregory Haskins 提交于 5月 31, 2007
```
Signed-off-by: NGregory Haskins <ghaskins@novell.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
2eeb2e94

KVM: Flush remote tlbs when reducing shadow pte permissions · d9e368d6

由 Avi Kivity 提交于 6月 07, 2007

When a vcpu causes a shadow tlb entry to have reduced permissions, it
must also clear the tlb on remote vcpus.  We do that by:

- setting a bit on the vcpu that requests a tlb flush before the next entry
- if the vcpu is currently executing, we send an ipi to make sure it
  exits before we continue
Signed-off-by: NAvi Kivity <avi@qumranet.com>

d9e368d6

KVM: Keep an upper bound of initialized vcpus · 39c3b86e

由 Avi Kivity 提交于 6月 07, 2007

That way, we don't need to loop for KVM_MAX_VCPUS for a single vcpu
vm.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

39c3b86e

KVM: Emulate hlt on real mode for Intel · 72d6e5a0

由 Avi Kivity 提交于 6月 05, 2007

This has two use cases: the bios can't boot from disk, and guest smp
bootstrap.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

72d6e5a0

A
KVM: Move duplicate halt handling code into kvm_main.c · d3bef15f
由 Avi Kivity 提交于 6月 05, 2007
```
Will soon have a thid user.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
d3bef15f

KVM: Enable guest smp · ef9254df

由 Avi Kivity 提交于 6月 05, 2007

As we don't support guest tlb shootdown yet, this is only reliable
for real-mode guests.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

ef9254df

KVM: Lazy guest cr3 switching · 17c3ba9d

由 Avi Kivity 提交于 6月 04, 2007

Switch guest paging context may require us to allocate memory, which
might fail.  Instead of wiring up error paths everywhere, make context
switching lazy and actually do the switch before the next guest entry,
where we can return an error if allocation fails.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

17c3ba9d

A
KVM: MMU: Use slab caches for shadow pages and their headers · d3d25b04
由 Avi Kivity 提交于 5月 30, 2007
```
Use slab caches instead of a simple custom list.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
d3d25b04

KVM: Fix includes · 06ff0d37

由 Markus Rechberger 提交于 5月 27, 2007

KVM compilation fails for some .configs.  This fixes it.
Signed-off-by: NMarkus Rechberger <markus.rechberger@amd.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

06ff0d37

KVM: VMX: Avoid saving and restoring msr_efer on lightweight vmexit · 2cc51560

由 Eddie Dong 提交于 5月 21, 2007

MSR_EFER.LME/LMA bits are automatically save/restored by VMX
hardware, KVM only needs to save NX/SCE bits at time of heavy
weight VM Exit. But clearing NX bits in host envirnment may
cause system hang if the host page table is using EXB bits,
thus we leave NX bits as it is. If Host NX=1 and guest NX=0, we
can do guest page table EXB bits check before inserting a shadow
pte (though no guest is expecting to see this kind of gp fault).
If host NX=0, we present guest no Execute-Disable feature to guest,
thus no host NX=0, guest NX=1 combination.

This patch reduces raw vmexit time by ~27%.

Me: fix compile warnings on i386.
Signed-off-by: NYaozu (Eddie) Dong <eddie.dong@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

2cc51560

KVM: VMX: Avoid saving and restoring msrs on lightweight vmexit · a75beee6

由 Eddie Dong 提交于 5月 17, 2007

In a lightweight exit (where we exit and reenter the guest without
scheduling or exiting to userspace in between), we don't need various
msrs on the host, and avoiding shuffling them around reduces raw exit
time by 8%.

i386 compile fix by Daniel Hecken <dh@bahntechnik.de>.
Signed-off-by: NYaozu (Eddie) Dong <eddie.dong@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

a75beee6

A
KVM: MMU: Store shadow page tables as kernel virtual addresses, not physical · 47ad8e68
由 Avi Kivity 提交于 5月 06, 2007
```
Simpifies things a bit.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
47ad8e68

KVM: Set cr0.mp for guests · a3a06367

由 Avi Kivity 提交于 5月 02, 2007

This allows fwait instructions to be trapped when the guest fpu is not
loaded.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

a3a06367

A
KVM: Consolidate guest fpu activation and deactivation · 5fd86fcf
由 Avi Kivity 提交于 5月 02, 2007
```
Easier to keep track of where the fpu is this way.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
5fd86fcf

KVM: Fix potential guest state leak into host · 33ed6329

由 Avi Kivity 提交于 5月 02, 2007

The lightweight vmexit path avoids saving and reloading certain host
state.  However in certain cases lightweight vmexit handling can schedule()
which requires reloading the host state.

So we store the host state in the vcpu structure, and reloaded it if we
relinquish the vcpu.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

33ed6329

KVM: Increase mmu shadow cache to 1024 pages · 7494c0cc

由 Avi Kivity 提交于 5月 01, 2007

This improves kbuild times by about 10%, bringing it within a respectable
25% of native.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

7494c0cc

KVM: Unify kvm_mmu_pre_write() and kvm_mmu_post_write() · 09072daf

由 Avi Kivity 提交于 5月 01, 2007

Instead of calling two functions and repeating expensive checks, call one
function and provide it with before/after information.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

09072daf

KVM: Avoid saving and restoring some host CPU state on lightweight vmexit · e6adf283

由 Avi Kivity 提交于 4月 30, 2007

Many msrs and the like will only be used by the host if we schedule() or
return to userspace.  Therefore, we avoid saving them if we handle the
exit within the kernel, and if a reschedule is not requested.

Based on a patch from Eddie Dong <eddie.dong@intel.com> with a couple of
fixes by me.
Signed-off-by: NYaozu(Eddie) Dong <eddie.dong@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

e6adf283

15 6月, 2007 1 次提交

KVM: Prevent guest fpu state from leaking into the host · 7702fd1f

由 Avi Kivity 提交于 6月 14, 2007

The lazy fpu changes did not take into account that some vmexit handlers
can sleep. Move loading the guest state into the inner loop so that it
can be reloaded if necessary, and move loading the host state into
vmx_vcpu_put() so it can be performed whenever we relinquish the vcpu.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

7702fd1f

22 5月, 2007 1 次提交

Detach sched.h from mm.h · e8edc6e0

由 Alexey Dobriyan 提交于 5月 21, 2007

First thing mm.h does is including sched.h solely for can_do_mlock() inline
function which has "current" dereference inside. By dealing with can_do_mlock()
mm.h can be detached from sched.h which is good. See below, why.

This patch
a) removes unconditional inclusion of sched.h from mm.h
b) makes can_do_mlock() normal function in mm/mlock.c
c) exports can_do_mlock() to not break compilation
d) adds sched.h inclusions back to files that were getting it indirectly.
e) adds less bloated headers to some files (asm/signal.h, jiffies.h) that were
   getting them indirectly

Net result is:
a) mm.h users would get less code to open, read, preprocess, parse, ... if
   they don't need sched.h
b) sched.h stops being dependency for significant number of files:
   on x86_64 allmodconfig touching sched.h results in recompile of 4083 files,
   after patch it's only 3744 (-8.3%).

Cross-compile tested on

	all arm defconfigs, all mips defconfigs, all powerpc defconfigs,
	alpha alpha-up
	arm
	i386 i386-up i386-defconfig i386-allnoconfig
	ia64 ia64-up
	m68k
	mips
	parisc parisc-up
	powerpc powerpc-up
	s390 s390-up
	sparc sparc-up
	sparc64 sparc64-up
	um-x86_64
	x86_64 x86_64-up x86_64-defconfig x86_64-allnoconfig

as well as my two usual configs.
Signed-off-by: NAlexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

e8edc6e0

03 5月, 2007 16 次提交

KVM: Remove extraneous guest entry on mmio read · e7df56e4

由 Avi Kivity 提交于 3月 14, 2007

When emulating an mmio read, we actually emulate twice: once to determine
the physical address of the mmio, and, after we've exited to userspace to
get the mmio value, we emulate again to place the value in the result
register and update any flags.

But we don't really need to enter the guest again for that, only to take
an immediate vmexit. So, if we detect that we're doing an mmio read,
emulate a single instruction before entering the guest again.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

e7df56e4

KVM: VMX: Properly shadow the CR0 register in the vcpu struct · 25c4c276

由 Anthony Liguori 提交于 4月 27, 2007

Set all of the host mask bits for CR0 so that we can maintain a proper
shadow of CR0.  This exposes CR0.TS, paving the way for lazy fpu handling.
Signed-off-by: NAnthony Liguori <aliguori@us.ibm.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

25c4c276

KVM: Lazy FPU support for SVM · 7807fa6c

由 Anthony Liguori 提交于 4月 23, 2007

Avoid saving and restoring the guest fpu state on every exit.  This
shaves ~100 cycles off the guest/host switch.
Signed-off-by: NAnthony Liguori <aliguori@us.ibm.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

7807fa6c

KVM: Per-vcpu statistics · 1165f5fe

由 Avi Kivity 提交于 4月 19, 2007

Make the exit statistics per-vcpu instead of global.  This gives a 3.5%
boost when running one virtual machine per core on my two socket dual core
(4 cores total) machine.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

1165f5fe

KVM: Use slab caches to allocate mmu data structures · b5a33a75

由 Avi Kivity 提交于 4月 15, 2007

Better leak detection, statistics, memory use, speed -- goodness all
around.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

b5a33a75

KVM: Add physical memory aliasing feature · e8207547

由 Avi Kivity 提交于 3月 30, 2007

With this, we can specify that accesses to one physical memory range will
be remapped to another. This is useful for the vga window at 0xa0000 which
is used as a movable window into the (much larger) framebuffer.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

e8207547

KVM: Simply gfn_to_page() · 954bbbc2

由 Avi Kivity 提交于 3月 30, 2007

Mapping a guest page to a host page is a common operation.  Currently,
one has first to find the memory slot where the page belongs (gfn_to_memslot),
then locate the page itself (gfn_to_page()).

This is clumsy, and also won't work well with memory aliases.  So simplify
gfn_to_page() not to require memory slot translation first, and instead do it
internally.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

954bbbc2

KVM: Add mmu cache clear function · e0fa826f

由 Dor Laor 提交于 3月 30, 2007

Functions that play around with the physical memory map
need a way to clear mappings to possibly nonexistent or
invalid memory.  Both the mmu cache and the processor tlb
are cleared.
Signed-off-by: NDor Laor <dor.laor@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

e0fa826f

KVM: SVM: Ensure timestamp counter monotonicity · 0cc5064d

由 Avi Kivity 提交于 3月 25, 2007

When a vcpu is migrated from one cpu to another, its timestamp counter
may lose its monotonic property if the host has unsynced timestamp counters.
This can confuse the guest, sometimes to the point of refusing to boot.

As the rdtsc instruction is rather fast on AMD processors (7-10 cycles),
we can simply record the last host tsc when we drop the cpu, and adjust
the vcpu tsc offset when we detect that we've migrated to a different cpu.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

0cc5064d

KVM: MMU: Fix hugepage pdes mapping same physical address with different access · d28c6cfb

由 Avi Kivity 提交于 3月 23, 2007

The kvm mmu keeps a shadow page for hugepage pdes; if several such pdes map
the same physical address, they share the same shadow page. This is a fairly
common case (kernel mappings on i386 nonpae Linux, for example).

However, if the two pdes map the same memory but with different permissions, kvm
will happily use the cached shadow page. If the access through the more
permissive pde will occur after the access to the strict pde, an endless pagefault
loop will be generated and the guest will make no progress.

Fix by making the access permissions part of the cache lookup key.

The fix allows Xen pae to boot on kvm and run guest domains.

Thanks to Jeremy Fitzhardinge for reporting the bug and testing the fix.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

d28c6cfb

KVM: Remove set_cr0_no_modeswitch() arch op · f6528b03

由 Avi Kivity 提交于 3月 20, 2007

set_cr0_no_modeswitch() was a hack to avoid corrupting segment registers.
As we now cache the protected mode values on entry to real mode, this
isn't an issue anymore, and it interferes with reboot (which usually _is_
a modeswitch).
Signed-off-by: NAvi Kivity <avi@qumranet.com>

f6528b03

KVM: MMU: Remove global pte tracking · aac01224

由 Avi Kivity 提交于 3月 20, 2007

The initial, noncaching, version of the kvm mmu flushed the all nonglobal
shadow page table translations (much like a native tlb flush).  The new
implementation flushes translations only when they change, rendering global
pte tracking superfluous.

This removes the unused tracking mechanism and storage space.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

aac01224

KVM: Avoid guest virtual addresses in string pio userspace interface · 039576c0

由 Avi Kivity 提交于 3月 20, 2007

The current string pio interface communicates using guest virtual addresses,
relying on userspace to translate addresses and to check permissions. This
interface cannot fully support guest smp, as the check needs to take into
account two pages at one in case an unaligned string transfer straddles a
page boundary.

Change the interface not to communicate guest addresses at all; instead use
a buffer page (mmaped by userspace) and do transfers there. The kernel
manages the virtual to physical translation and can perform the checks
atomically by taking the appropriate locks.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

039576c0

KVM: Add guest mode signal mask · 1961d276

由 Avi Kivity 提交于 3月 05, 2007

Allow a special signal mask to be used while executing in guest mode. This
allows signals to be used to interrupt a vcpu without requiring signal
delivery to a userspace handler, which is quite expensive. Userspace still
receives -EINTR and can get the signal via sigwait().
Signed-off-by: NAvi Kivity <avi@qumranet.com>

1961d276

KVM: Handle cpuid in the kernel instead of punting to userspace · 06465c5a

由 Avi Kivity 提交于 2月 28, 2007

KVM used to handle cpuid by letting userspace decide what values to
return to the guest.  We now handle cpuid completely in the kernel.  We
still let userspace decide which values the guest will see by having
userspace set up the value table beforehand (this is necessary to allow
management software to set the cpu features to the least common denominator,
so that live migration can work).

The motivation for the change is that kvm kernel code can be impacted by
cpuid features, for example the x86 emulator.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

06465c5a

KVM: Do not communicate to userspace through cpu registers during PIO · 46fc1477

由 Avi Kivity 提交于 2月 22, 2007

Currently when passing the a PIO emulation request to userspace, we
rely on userspace updating %rax (on 'in' instructions) and %rsi/%rdi/%rcx
(on string instructions).  This (a) requires two extra ioctls for getting
and setting the registers and (b) is unfriendly to non-x86 archs, when
they get kvm ports.

So fix by doing the register fixups in the kernel and passing to userspace
only an abstract description of the PIO to be done.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

46fc1477