提交 · abd3f2d622a810b7f6687f7ddb405e90e4cfb7ab · openeuler / raspberrypi-kernel

16 7月, 2007 7 次提交

KVM: Rationalize exception bitmap usage · abd3f2d6

由 Avi Kivity 提交于 5月 02, 2007

Everyone owns a piece of the exception bitmap, but they happily write to
the entire thing like there's no tomorrow.  Centralize handling in
update_exception_bitmap() and have everyone call that.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

abd3f2d6

A
KVM: Move some more msr mangling into vmx_save_host_state() · 707c0874
由 Avi Kivity 提交于 5月 02, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
707c0874

KVM: Fix potential guest state leak into host · 33ed6329

由 Avi Kivity 提交于 5月 02, 2007

The lightweight vmexit path avoids saving and reloading certain host
state.  However in certain cases lightweight vmexit handling can schedule()
which requires reloading the host state.

So we store the host state in the vcpu structure, and reloaded it if we
relinquish the vcpu.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

33ed6329

KVM: Be more careful restoring fs on lightweight vmexit · 62135845

由 Avi Kivity 提交于 5月 01, 2007

i386 wants fs for accessing the pda even on a lightweight exit, so ensure
we can always restore it.  This fixes a regression on i386 introduced by
the lightweight vmexit patch.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

62135845

A
KVM: Unindent some code · 05e0c8c3
由 Avi Kivity 提交于 4月 30, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
05e0c8c3

KVM: Avoid saving and restoring some host CPU state on lightweight vmexit · e6adf283

由 Avi Kivity 提交于 4月 30, 2007

Many msrs and the like will only be used by the host if we schedule() or
return to userspace.  Therefore, we avoid saving them if we handle the
exit within the kernel, and if a reschedule is not requested.

Based on a patch from Eddie Dong <eddie.dong@intel.com> with a couple of
fixes by me.
Signed-off-by: NYaozu(Eddie) Dong <eddie.dong@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

e6adf283

KVM: VMX: Enable io bitmaps to avoid IO port 0x80 VMEXITs · fdef3ad1

由 He, Qing 提交于 4月 30, 2007

This patch enables IO bitmaps control on vmx and unmask the 0x80 port to
avoid VMEXITs caused by accessing port 0x80. 0x80 is used as delays (see
include/asm/io.h), and handling VMEXITs on its access is unnecessary but
slows things down. This patch improves kernel build test at around
3%~5%.
	Because every VM uses the same io bitmap, it is shared between
all VMs rather than a per-VM data structure.
Signed-off-by: NQing He <qing.he@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

fdef3ad1

15 6月, 2007 1 次提交

KVM: Prevent guest fpu state from leaking into the host · 7702fd1f

由 Avi Kivity 提交于 6月 14, 2007

The lazy fpu changes did not take into account that some vmexit handlers
can sleep. Move loading the guest state into the inner loop so that it
can be reloaded if necessary, and move loading the host state into
vmx_vcpu_put() so it can be performed whenever we relinquish the vcpu.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

7702fd1f

01 6月, 2007 1 次提交

kvm: fix section mismatch warning in kvm-intel.o · 39959588

由 Sam Ravnborg 提交于 6月 01, 2007

Fix following section mismatch warning in kvm-intel.o:
WARNING: o-i386/drivers/kvm/kvm-intel.o(.init.text+0xbd): Section mismatch: reference to .exit.text: (between 'hardware_setup' and 'vmx_disabled_by_bios')

The function free_kvm_area is used in the function alloc_kvm_area which
is marked __init.
The __exit area is discarded by some archs during link-time if a
module is built-in resulting in an oops.

Note: This warning is only seen by my local copy of modpost
      but the change will soon hit upstream.
Signed-off-by: NSam Ravnborg <sam@ravnborg.org>
Cc: Avi Kivity <avi@qumranet.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

39959588

22 5月, 2007 1 次提交

Detach sched.h from mm.h · e8edc6e0

由 Alexey Dobriyan 提交于 5月 21, 2007

First thing mm.h does is including sched.h solely for can_do_mlock() inline
function which has "current" dereference inside. By dealing with can_do_mlock()
mm.h can be detached from sched.h which is good. See below, why.

This patch
a) removes unconditional inclusion of sched.h from mm.h
b) makes can_do_mlock() normal function in mm/mlock.c
c) exports can_do_mlock() to not break compilation
d) adds sched.h inclusions back to files that were getting it indirectly.
e) adds less bloated headers to some files (asm/signal.h, jiffies.h) that were
   getting them indirectly

Net result is:
a) mm.h users would get less code to open, read, preprocess, parse, ... if
   they don't need sched.h
b) sched.h stops being dependency for significant number of files:
   on x86_64 allmodconfig touching sched.h results in recompile of 4083 files,
   after patch it's only 3744 (-8.3%).

Cross-compile tested on

	all arm defconfigs, all mips defconfigs, all powerpc defconfigs,
	alpha alpha-up
	arm
	i386 i386-up i386-defconfig i386-allnoconfig
	ia64 ia64-up
	m68k
	mips
	parisc parisc-up
	powerpc powerpc-up
	s390 s390-up
	sparc sparc-up
	sparc64 sparc64-up
	um-x86_64
	x86_64 x86_64-up x86_64-defconfig x86_64-allnoconfig

as well as my two usual configs.
Signed-off-by: NAlexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

e8edc6e0

03 5月, 2007 21 次提交

KVM: Remove unused 'instruction_length' · 2ff81f70

由 Avi Kivity 提交于 4月 29, 2007

As we no longer emulate in userspace, this is meaningless.  We don't
compute it on SVM anyway.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

2ff81f70

KVM: VMX: Add lazy FPU support for VT · 2ab455cc

由 Anthony Liguori 提交于 4月 27, 2007

Only save/restore the FPU host state when the guest is actually using the
FPU.
Signed-off-by: NAnthony Liguori <aliguori@us.ibm.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

2ab455cc

KVM: VMX: Properly shadow the CR0 register in the vcpu struct · 25c4c276

由 Anthony Liguori 提交于 4月 27, 2007

Set all of the host mask bits for CR0 so that we can maintain a proper
shadow of CR0.  This exposes CR0.TS, paving the way for lazy fpu handling.
Signed-off-by: NAnthony Liguori <aliguori@us.ibm.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

25c4c276

A
KVM: Don't complain about cpu erratum AA15 · e0e5127d
由 Avi Kivity 提交于 4月 25, 2007
```
It slows down Windows x64 horribly.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
e0e5127d

KVM: Per-vcpu statistics · 1165f5fe

由 Avi Kivity 提交于 4月 19, 2007

Make the exit statistics per-vcpu instead of global.  This gives a 3.5%
boost when running one virtual machine per core on my two socket dual core
(4 cores total) machine.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

1165f5fe

KVM: VMX: Only save/restore MSR_K6_STAR if necessary · 4d56c8a7

由 Avi Kivity 提交于 4月 19, 2007

Intel hosts only support syscall/sysret in long more (and only if efer.sce
is enabled), so only reload the related MSR_K6_STAR if the guest will
actually be able to use it.

This reduces vmexit cost by about 500 cycles (6400 -> 5870) on my setup.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

4d56c8a7

A
KVM: Fold drivers/kvm/kvm_vmx.h into drivers/kvm/vmx.c · 35cc7f97
由 Avi Kivity 提交于 4月 19, 2007
```
No meat in that file.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
35cc7f97

KVM: VMX: Don't switch 64-bit msrs for 32-bit guests · e38aea3e

由 Avi Kivity 提交于 4月 19, 2007

Some msrs are only used by x86_64 instructions, and are therefore
not needed when the guest is legacy mode.  By not bothering to switch
them, we reduce vmexit latency by 2400 cycles (from about 8800) when
running a 32-bt guest on a 64-bit host.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

e38aea3e

KVM: VMX: Reduce unnecessary saving of host msrs · 2345df8c

由 Avi Kivity 提交于 4月 17, 2007

THe automatically switched msrs are never changed on the host (with
the exception of MSR_KERNEL_GS_BASE) and thus there is no need to save
them on every vm entry.

This reduces vmexit latency by ~400 cycles on i386 and by ~900 cycles (10%)
on x86_64.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

2345df8c

KVM: Fix overflow bug in overflow detection code · 3964994b

由 Eric Sesterhenn / Snakebyte 提交于 4月 09, 2007

The expression

   sp - 6 < sp

where sp is a u16 is undefined in C since 'sp - 6' is promoted to int,
and signed overflow is undefined in C.  gcc 4.2 actually warns about it.
Replace with a simpler test.
Signed-off-by: NEric Sesterhenn <snakebyte@gmx.de>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

3964994b

KVM: Simply gfn_to_page() · 954bbbc2

由 Avi Kivity 提交于 3月 30, 2007

Mapping a guest page to a host page is a common operation.  Currently,
one has first to find the memory slot where the page belongs (gfn_to_memslot),
then locate the page itself (gfn_to_page()).

This is clumsy, and also won't work well with memory aliases.  So simplify
gfn_to_page() not to require memory slot translation first, and instead do it
internally.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

954bbbc2

A
KVM: Remove debug message · afeb1f14
由 Avi Kivity 提交于 3月 27, 2007
```
No longer interesting.
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
afeb1f14

KVM: Hack real-mode segments on vmx from KVM_SET_SREGS · 038881c8

由 Avi Kivity 提交于 3月 21, 2007

As usual, we need to mangle segment registers when emulating real mode
as vm86 has specific constraints.  We special case the reset segment base,
and set the "access rights" (or descriptor flags) to vm86 comaptible values.

This fixes reboot on vmx.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

038881c8

KVM: Remove set_cr0_no_modeswitch() arch op · f6528b03

由 Avi Kivity 提交于 3月 20, 2007

set_cr0_no_modeswitch() was a hack to avoid corrupting segment registers.
As we now cache the protected mode values on entry to real mode, this
isn't an issue anymore, and it interferes with reboot (which usually _is_
a modeswitch).
Signed-off-by: NAvi Kivity <avi@qumranet.com>

f6528b03

KVM: Workaround vmx inability to virtualize the reset state · 8cb5b033

由 Avi Kivity 提交于 3月 20, 2007

The reset state has cs.selector == 0xf000 and cs.base == 0xffff0000,
which aren't compatible with vm86 mode, which is used for real mode
virtualization.

When we create a vcpu, we set cs.base to 0xf0000, but if we get there by
way of a reset, the values are inconsistent and vmx refuses to enter
guest mode.

Workaround by detecting the state and munging it appropriately.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

8cb5b033

KVM: Avoid guest virtual addresses in string pio userspace interface · 039576c0

由 Avi Kivity 提交于 3月 20, 2007

The current string pio interface communicates using guest virtual addresses,
relying on userspace to translate addresses and to check permissions. This
interface cannot fully support guest smp, as the check needs to take into
account two pages at one in case an unaligned string transfer straddles a
page boundary.

Change the interface not to communicate guest addresses at all; instead use
a buffer page (mmaped by userspace) and do transfers there. The kernel
manages the virtual to physical translation and can perform the checks
atomically by taking the appropriate locks.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

039576c0

KVM: Add a special exit reason when exiting due to an interrupt · 1b19f3e6

由 Avi Kivity 提交于 3月 04, 2007

This is redundant, as we also return -EINTR from the ioctl, but it
allows us to examine the exit_reason field on resume without seeing
old data.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

1b19f3e6

KVM: Fold kvm_run::exit_type into kvm_run::exit_reason · 8eb7d334

由 Avi Kivity 提交于 3月 04, 2007

Currently, userspace is told about the nature of the last exit from the
guest using two fields, exit_type and exit_reason, where exit_type has
just two enumerations (and no need for more). So fold exit_type into
exit_reason, reducing the complexity of determining what really happened.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

8eb7d334

KVM: Handle cpuid in the kernel instead of punting to userspace · 06465c5a

由 Avi Kivity 提交于 2月 28, 2007

KVM used to handle cpuid by letting userspace decide what values to
return to the guest.  We now handle cpuid completely in the kernel.  We
still let userspace decide which values the guest will see by having
userspace set up the value table beforehand (this is necessary to allow
management software to set the cpu features to the least common denominator,
so that live migration can work).

The motivation for the change is that kvm kernel code can be impacted by
cpuid features, for example the x86 emulator.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

06465c5a

KVM: Do not communicate to userspace through cpu registers during PIO · 46fc1477

由 Avi Kivity 提交于 2月 22, 2007

Currently when passing the a PIO emulation request to userspace, we
rely on userspace updating %rax (on 'in' instructions) and %rsi/%rdi/%rcx
(on string instructions).  This (a) requires two extra ioctls for getting
and setting the registers and (b) is unfriendly to non-x86 archs, when
they get kvm ports.

So fix by doing the register fixups in the kernel and passing to userspace
only an abstract description of the PIO to be done.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

46fc1477

KVM: Use the generic skip_emulated_instruction() in hypercall code · 510043da

由 Dor Laor 提交于 2月 19, 2007

Instead of twiddling the rip registers directly, use the
skip_emulated_instruction() function to do that for us.
Signed-off-by: NDor Laor <dor.laor@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

510043da

27 3月, 2007 2 次提交

KVM: always reload segment selectors · 6d9658df

由 Ingo Molnar 提交于 3月 11, 2007

failed VM entry on VMX might still change %fs or %gs, thus make sure
that KVM always reloads the segment selectors. This is crutial on both
x86 and x86_64: x86 has __KERNEL_PDA in %fs on which things like
'current' depends and x86_64 has 0 there and needs MSR_GS_BASE to work.
Signed-off-by: NIngo Molnar <mingo@elte.hu>

6d9658df

KVM: Prevent system selectors leaking into guest on real->protected mode transition on vmx · 6af11b9e

由 Avi Kivity 提交于 3月 19, 2007

Intel virtualization extensions do not support virtualizing real mode. So
kvm uses virtualized vm86 mode to run real mode code. Unfortunately, this
virtualized vm86 mode does not support the so called "big real" mode, where
the segment selector and base do not agree with each other according to the
real mode rules (base == selector << 4).

To work around this, kvm checks whether a selector/base pair violates the
virtualized vm86 rules, and if so, forces it into conformance. On a
transition back to protected mode, if we see that the guest did not touch
a forced segment, we restore it back to the original protected mode value.

This pile of hacks breaks down if the gdt has changed in real mode, as it
can cause a segment selector to point to a system descriptor instead of a
normal data segment. In fact, this happens with the Windows bootloader
and the qemu acpi bios, where a protected mode memcpy routine issues an
innocent 'pop %es' and traps on an attempt to load a system descriptor.

"Fix" by checking if the to-be-restored selector points at a system segment,
and if so, coercing it into a normal data segment. The long term solution,
of course, is to abandon vm86 mode and use emulation for big real mode.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

6af11b9e

18 3月, 2007 1 次提交

KVM: Fix guest sysenter on vmx · f5b42c33

由 Avi Kivity 提交于 3月 06, 2007

The vmx code currently treats the guest's sysenter support msrs as 32-bit
values, which breaks 32-bit compat mode userspace on 64-bit guests.  Fix by
using the native word width of the machine.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

f5b42c33

04 3月, 2007 6 次提交

KVM: Per-vcpu inodes · bccf2150

由 Avi Kivity 提交于 2月 21, 2007

Allocate a distinct inode for every vcpu in a VM.  This has the following
benefits:

 - the filp cachelines are no longer bounced when f_count is incremented on
   every ioctl()
 - the API and internal code are distinctly clearer; for example, on the
   KVM_GET_REGS ioctl, there is no need to copy the vcpu number from
   userspace and then copy the registers back; the vcpu identity is derived
   from the fd used to make the call

Right now the performance benefits are completely theoretical since (a) we
don't support more than one vcpu per VM and (b) virtualization hardware
inefficiencies completely everwhelm any cacheline bouncing effects.  But
both of these will change, and we need to prepare the API today.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

bccf2150

A
KVM: Wire up hypercall handlers to a central arch-independent location · 270fd9b9
由 Avi Kivity 提交于 2月 19, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
270fd9b9
I
KVM: Add host hypercall support for vmx · c21415e8
由 Ingo Molnar 提交于 2月 19, 2007
```
Signed-off-by: NAvi Kivity <avi@qumranet.com>
```
c21415e8

KVM: add MSR based hypercall API · 102d8325

由 Ingo Molnar 提交于 2月 19, 2007

This adds a special MSR based hypercall API to KVM. This is to be
used by paravirtual kernels and virtual drivers.
Signed-off-by: NIngo Molnar <mingo@elte.hu>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

102d8325

KVM: Use ARRAY_SIZE macro instead of manual calculation. · 9d8f549d

由 Ahmed S. Darwish 提交于 2月 19, 2007

Signed-off-by: NAhmed S. Darwish <darwish.07@gmail.com>
Signed-off-by: NDor Laor <dor.laor@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

9d8f549d

KVM: vmx: hack set_cr0_no_modeswitch() to actually do modeswitch · de979caa

由 Joerg Roedel 提交于 2月 19, 2007

The whole thing is rotten, but this allows vmx to boot with the guest reboot
fix.
Signed-off-by: NMarkus Rechberger <markus.rechberger@amd.com>
Signed-off-by: NJoerg Roedel <joerg.roedel@amd.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

de979caa