Commits · 1390a28b274e2e45f89bac67c435cbcbc5cc0790 · openeuler / raspberrypi-kernel

28 Aug, 2012 12 commits

KVM: VMX: Preserve segment limit and access rights in real mode · 1390a28b

Committed by Avi Kivity on Aug 21, 2012

While this is undocumented, real processors do not reload the segment
limit and access rights when loading a segment register in real mode.
Real programs rely on it so we need to comply with this behaviour.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

KVM: VMX: Return real real-mode segment data even if emulate_invalid_guest_state=1 · 72636420

Committed by Avi Kivity on Aug 21, 2012

emulate_invalid_guest_state=1 doesn't mean we don't munge the segments in the
vmcs; we do. So we need to return the real ones (maintained by vmx_set_segment).
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

KVM: x86 emulator: Fix #GP error code during linearization · 0afbe2f8

Committed by Avi Kivity on Aug 21, 2012

We want the segment selector, not the segment number.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

KVM: x86 emulator: Check segment limits in real mode too · a5625189

Committed by Avi Kivity on Aug 21, 2012

Segment limits are verified in real mode, not just protected mode.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
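
As an illustration of the kind of check this commit describes, here is a minimal sketch of a limit test for an expand-up segment. The helper name within_segment_limit() is hypothetical and is not taken from the kernel source; the real emulator's linearization code handles more cases (expand-down segments, long mode, and so on).

```c
#include <stdbool.h>
#include <stdint.h>

/*
 * Hypothetical helper (not the kernel's code): returns true if an access
 * of `size` bytes starting at `offset` fits inside an expand-up segment
 * whose last valid offset is `limit`.
 */
static bool within_segment_limit(uint32_t offset, uint32_t size, uint32_t limit)
{
    uint64_t last;

    if (size == 0)
        return true;

    /* 64-bit arithmetic so that offset + size - 1 cannot wrap around. */
    last = (uint64_t)offset + size - 1;
    return last <= limit;
}
```

With a 0xffff limit, a 2-byte access at offset 0xfffe passes and the same access at 0xffff fails, which is the case that would raise #GP.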

KVM: x86 emulator: Leave segment limit and attributes alone in real mode · 03ebebeb

Committed by Avi Kivity on Aug 21, 2012

When loading a segment in real mode, only the base and selector must
be modified.  The limit needs to be left alone, otherwise big real mode
users will hit a #GP due to limit checking (currently this is suppressed
because we don't check limits in real mode).
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
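
The rule described above can be sketched as follows. This is only an illustration under stated assumptions: struct seg_cache and load_segment_real_mode() are hypothetical names, and the single attrs byte stands in for the full set of access rights.

```c
#include <stdint.h>

/* Hypothetical model of a cached segment register (not the kernel's struct). */
struct seg_cache {
    uint16_t selector;
    uint32_t base;
    uint32_t limit;
    uint8_t  attrs;   /* opaque access-rights byte in this sketch */
};

/*
 * Real-mode segment load: only the selector and the base change.
 * The limit and attributes keep whatever value they had before, which
 * is what lets "big real mode" code keep its 4GB limit.
 */
static void load_segment_real_mode(struct seg_cache *seg, uint16_t selector)
{
    seg->selector = selector;
    seg->base = (uint32_t)selector << 4;
    /* seg->limit and seg->attrs are deliberately left untouched. */
}
```

Loading selector 0x1234 into a segment that previously had a 4GB limit yields base 0x12340 and still a 4GB limit.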

KVM: VMX: Allow vm86 virtualization of big real mode · e2a610d7

Committed by Avi Kivity on Aug 21, 2012

Usually, big real mode uses large (4GB) segments. Currently we don't
virtualize this; if any segment has a limit other than 0xffff, we emulate.
But if we set the vmx-visible limit to 0xffff, we can use vm86 to virtualize
real mode; if an access overruns the segment limit, the guest will #GP, which
we will trap and forward to the emulator. This results in significantly
faster execution, and less risk of hitting an unemulated instruction.

If the limit is less than 0xffff, we retain the existing behaviour.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
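
The decision described above might be modelled roughly like this. Both function names and the VM86_LIMIT constant are hypothetical; in particular, what the vmcs holds when the limit is below 0xffff is an assumption of this sketch, since the message only says the existing behaviour is retained.

```c
#include <stdbool.h>
#include <stdint.h>

#define VM86_LIMIT 0xffffu   /* the only limit vm86 mode can represent */

/*
 * Hypothetical predicate: a real-mode segment can run under vm86 when its
 * guest limit is at least 0xffff. Accesses beyond 0xffff then fault with
 * #GP and are forwarded to the emulator.
 */
static bool rmode_limit_allows_vm86(uint32_t guest_limit)
{
    return guest_limit >= VM86_LIMIT;
}

/*
 * Hypothetical helper: the limit exposed to the hardware. Large limits
 * are clamped to 0xffff; smaller ones are passed through unchanged here,
 * which is an assumption of this sketch.
 */
static uint32_t vmcs_visible_limit(uint32_t guest_limit)
{
    return guest_limit >= VM86_LIMIT ? VM86_LIMIT : guest_limit;
}
```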

KVM: VMX: Allow real mode emulation using vm86 with dpl=0 · 495e1166

Committed by Avi Kivity on Aug 21, 2012

Real mode is always entered from protected mode with dpl=0.  Since
the dpl doesn't affect execution, and we already override it to 3
in the vmcs (as vmx requires), we can allow execution in that state.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

KVM: VMX: Retain limit and attributes when entering protected mode · c865c43d

Committed by Avi Kivity on Aug 21, 2012

Real processors don't change segment limits and attributes while in
real mode.  Mimic that behaviour.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

KVM: VMX: Use kvm_segment to save protected-mode segments when entering realmode · f5f7b2fe

Committed by Avi Kivity on Aug 21, 2012

Instead of using struct kvm_save_segment, use struct kvm_segment, which is what
the other APIs use. This leads to some simplification.

We replace save_rmode_seg() with a call to vmx_save_segment(). Since this depends
on rmode.vm86_active, we move the call to before setting the flag.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

KVM: VMX: Fix incorrect lookup of segment S flag in fix_pmode_dataseg() · 72fbefec

Committed by Avi Kivity on Aug 21, 2012

fix_pmode_dataseg() looks up S in ->base instead of ->ar_bytes.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

KVM: VMX: Separate saving pre-realmode state from setting segments · baa7e81e

Committed by Avi Kivity on Aug 21, 2012

Commit b246dd5d ("KVM: VMX: Fix KVM_SET_SREGS with big real mode
segments") moved fix_rmode_seg() to vmx_set_segment(), so that it is
applied not just on transitions to real mode, but also on KVM_SET_SREGS
(migration).  However fix_rmode_seg() not only munges the vmcs segments,
it also sets up the save area for us to restore when returning to
protected mode or to return in vmx_get_segment().

Move saving the segment into a new function, save_rmode_seg(), and
call it just during the transition.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

KVM: x86 emulator: access GPRs on demand · dd856efa

Committed by Avi Kivity on Aug 27, 2012

Instead of populating the entire register file, read in registers
as they are accessed, and write back only the modified ones.  This
saves a VMREAD and VMWRITE on Intel (for rsp, since it is not usually
used during emulation), and two 128-byte copies for the registers.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
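
The on-demand scheme can be illustrated with a pair of bitmasks, one marking which registers have been fetched and one marking which were modified. The sketch below is a simplified model, not the emulator's code: struct gpr_cache and its helpers are hypothetical, and a plain array with access counters stands in for the vcpu register file (where a read of rsp would cost a VMREAD on Intel).

```c
#include <stdint.h>

#define NR_GPRS 16

/* Hypothetical lazy register cache (a model, not the kernel's struct). */
struct gpr_cache {
    uint64_t regs[NR_GPRS];  /* emulator-local copies */
    uint32_t valid;          /* bit i set: regs[i] has been fetched */
    uint32_t dirty;          /* bit i set: regs[i] must be written back */
    uint64_t *backing;       /* stand-in for the vcpu register file */
    unsigned reads, writes;  /* count accesses to the backing store */
};

/* Fetch a register from the backing store only on first use. */
static uint64_t gpr_read(struct gpr_cache *c, unsigned nr)
{
    if (!(c->valid & (1u << nr))) {
        c->regs[nr] = c->backing[nr];
        c->reads++;
        c->valid |= 1u << nr;
    }
    return c->regs[nr];
}

/* A full overwrite needs no prior fetch; just mark valid and dirty. */
static void gpr_write(struct gpr_cache *c, unsigned nr, uint64_t val)
{
    c->regs[nr] = val;
    c->valid |= 1u << nr;
    c->dirty |= 1u << nr;
}

/* Write back only the registers that were actually modified. */
static void gpr_writeback(struct gpr_cache *c)
{
    for (unsigned nr = 0; nr < NR_GPRS; nr++) {
        if (c->dirty & (1u << nr)) {
            c->backing[nr] = c->regs[nr];
            c->writes++;
        }
    }
    c->dirty = 0;
}
```

An instruction that reads one register and writes another touches the backing store twice in total, instead of copying all sixteen registers in and out.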

23 Aug, 2012 1 commit

KVM: x86 emulator: use stack size attribute to mask rsp in stack ops · 5ad105e5

Committed by Avi Kivity on Aug 19, 2012

The sub-register used to access the stack (sp, esp, or rsp) is not
determined by the address size attribute like other memory references,
but by the stack segment's B bit (if not in x86_64 mode).

Fix by using the existing stack_mask() to figure out the correct mask.

This long-existing bug was exposed by a combination of a27685c3
(emulate invalid guest state by default), which causes many more
instructions to be emulated, and a seabios change (possibly a bug) which
causes the high 16 bits of esp to become polluted across calls to real
mode software interrupts.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
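
To make the masking rule concrete, here is a small sketch of how the stack-pointer width could be chosen and applied. The names stack_mask_sketch() and stack_adjust() are hypothetical and do not reproduce the emulator's stack_mask(); the sketch only models the rule stated above.

```c
#include <stdbool.h>
#include <stdint.h>

/*
 * Hypothetical model of the rule: in 64-bit mode the full rsp is used;
 * otherwise the stack segment's B bit selects esp (32 bits) or sp (16 bits).
 */
static uint64_t stack_mask_sketch(bool long_mode, bool ss_b_bit)
{
    if (long_mode)
        return ~0ull;
    return ss_b_bit ? 0xffffffffull : 0xffffull;
}

/*
 * Adjust only the sub-register selected by `mask`, leaving the bits above
 * it untouched. `delta` is negative for a push and positive for a pop.
 */
static uint64_t stack_adjust(uint64_t rsp, int64_t delta, uint64_t mask)
{
    return (rsp & ~mask) | ((rsp + (uint64_t)delta) & mask);
}
```

With a 16-bit stack and polluted upper bits, such as rsp = 0xdead0000, a 2-byte push gives 0xdeadfffe: the stack offset actually used (rsp & mask) wraps to 0xfffe and the upper bits are left alone.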

22 Aug, 2012 4 commits

KVM: MMU: Fix mmu_shrink() so that it can free mmu pages as intended · 35f2d16b

Committed by Takuya Yoshikawa on Aug 20, 2012

Although the possible race described in

  commit 85b70591
  KVM: MMU: fix shrinking page from the empty mmu

was correct, the real cause of that issue was a more trivial bug of
mmu_shrink() introduced by

  commit 19526396
  KVM: MMU: do not iterate over all VMs in mmu_shrink()

Here is the bug:

	if (kvm->arch.n_used_mmu_pages > 0) {
		if (!nr_to_scan--)
			break;
		continue;
	}

We skip VMs whose n_used_mmu_pages is not zero and try to shrink others:
in other words we try to shrink empty ones by mistake.

This patch reverses the logic so that mmu_shrink() can free pages from
the first VM whose n_used_mmu_pages is not zero.  Note that we also add
comments explaining the role of nr_to_scan which is not practically
important now, hoping this will be improved in the future.
Signed-off-by: Takuya Yoshikawa <yoshikawa.takuya@oss.ntt.co.jp>
Cc: Gleb Natapov <gleb@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
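
The reversed logic can be sketched as a standalone selection loop. This is a simplified model under stated assumptions: struct vm_sketch and pick_vm_to_shrink() are hypothetical, locking is omitted, and the real shrinker goes on to free pages from the chosen VM rather than returning an index.

```c
#include <stddef.h>

/* Hypothetical stand-in for the per-VM MMU page count. */
struct vm_sketch {
    unsigned int n_used_mmu_pages;
};

/*
 * Returns the index of the first VM that has MMU pages to free, or -1 if
 * none is found within the nr_to_scan budget.
 */
static int pick_vm_to_shrink(const struct vm_sketch *vms, size_t nr_vms,
                             int nr_to_scan)
{
    for (size_t i = 0; i < nr_vms; i++) {
        /* Never look at more than nr_to_scan VM instances. */
        if (!nr_to_scan--)
            break;
        /* Nothing to free in an empty VM; try the next one. */
        if (!vms[i].n_used_mmu_pages)
            continue;
        return (int)i;
    }
    return -1;
}
```

Compared with the buggy snippet quoted in the message, the skip condition is inverted: empty VMs are skipped, and the first non-empty VM is selected.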

KVM: introduce readonly memslot · 4d8b81ab

Committed by Xiao Guangrong on Aug 21, 2012

In the current code, if we map a read-only memory region from host to guest
and the page is not currently mapped in the host, we get a fault
pfn and async is not allowed, so the VM crashes.

We introduce read-only memory regions to map ROM/ROMD to the guest. Read access
to a readonly memslot proceeds normally; write access to a readonly memslot
causes a KVM_EXIT_MMIO exit.
Signed-off-by: Xiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

KVM: introduce gfn_to_pfn_memslot_atomic · 037d92dc

Committed by Xiao Guangrong on Aug 21, 2012

It can be used instead of hva_to_pfn_atomic.
Signed-off-by: Xiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

KVM: x86: fix possible infinite loop caused by reexecute_instruction · 8e3d9d06

Committed by Xiao Guangrong on Aug 21, 2012

Currently, we re-execute all unhandleable instructions if they do not
access MMIO. However, this cannot work if the host maps read-only
memory to the guest. If the instruction tries to write to this kind of memory,
it faults again when the guest retries it, and we enter an infinite
loop: retry instruction -> write #PF -> emulation fail ->
retry instruction -> ...

Fix it by retrying the instruction only when it faults on writable
memory.
Signed-off-by: Xiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

15 Aug, 2012 1 commit

KVM: x86: drop parameter validation in ioapic/pic · 28a6fdab

Committed by Michael S. Tsirkin on Aug 14, 2012

We validate the irq pin number when routing is set up, so the
code handling an illegal irq # in pic and ioapic on each injection
is never called.
Drop it and replace it with BUG_ON to catch out-of-bounds access bugs.
Signed-off-by: Michael S. Tsirkin <mst@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

14 Aug, 2012 3 commits

KVM: VMX: Advertize RDTSC exiting to nested guests · dbcb4e79

Committed by Avi Kivity on Aug 13, 2012

All processors that support VMX have that feature, and guests (Xen) depend on
it.  As we already implement it, advertize it to the guest.
Signed-off-by: Avi Kivity <avi@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

KVM: VMX: restore MSR_IA32_DEBUGCTLMSR after VMEXIT · 2a7921b7

Committed by Gleb Natapov on Aug 12, 2012

MSR_IA32_DEBUGCTLMSR is zeroed on VMEXIT. Restore it to the correct
value.
Signed-off-by: Gleb Natapov <gleb@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

KVM: x86: fix pvclock guest stopped flag reporting · 51d59c6b

Committed by Marcelo Tosatti on Aug 03, 2012

kvm_guest_time_update unconditionally clears hv_clock.flags field,
so the notification never reaches the guest.

Fix it by allowing PVCLOCK_GUEST_STOPPED to pass through.
Reviewed-by: Eric B Munson <emunson@mgebm.net>
Reviewed-by: Amit Shah <amit.shah@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
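
One way to let the flag pass through is to latch the request separately and fold it into the flags on the next update. The sketch below illustrates that idea only; struct vcpu_clock_sketch, its guest_stopped_request field, and update_clock_flags() are hypothetical, and the bit value used for PVCLOCK_GUEST_STOPPED is assumed for the sketch rather than quoted from the commit.

```c
#include <stdbool.h>
#include <stdint.h>

/* Bit position assumed for this sketch. */
#define PVCLOCK_GUEST_STOPPED (1u << 1)

/* Hypothetical per-vcpu clock state (not the kernel's struct). */
struct vcpu_clock_sketch {
    uint8_t flags;               /* what the guest will see */
    bool guest_stopped_request;  /* latched "guest was stopped" request */
};

/*
 * Rebuild the flags on each update, but carry a pending stopped request
 * into them instead of clearing everything unconditionally.
 */
static void update_clock_flags(struct vcpu_clock_sketch *v)
{
    uint8_t flags = 0;

    if (v->guest_stopped_request) {
        flags |= PVCLOCK_GUEST_STOPPED;
        v->guest_stopped_request = false;  /* report it exactly once */
    }
    v->flags = flags;
}
```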

09 Aug, 2012 1 commit

KVM: correctly detect APIC SW state in kvm_apic_post_state_restore() · 64eb0620

Committed by Gleb Natapov on Aug 08, 2012

For apic_set_spiv() to track APIC SW state correctly it needs to see
the previous and next values of the spurious vector register, but currently
memset() overwrites the old value before apic_set_spiv() gets a chance to
do the tracking. Fix it by calling apic_set_spiv() before overwriting the old
value.
Signed-off-by: Gleb Natapov <gleb@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
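
The ordering problem can be shown with a toy model in which a global counter tracks how many APICs are software-disabled. Everything here is hypothetical (struct apic_sketch, set_spiv(), restore_state(), and the counter); only the software-enable bit of the spurious vector register, bit 8, follows the architecture.

```c
#include <stdint.h>

#define SPIV_APIC_ENABLED (1u << 8)  /* APIC software-enable bit */

/* Hypothetical APIC model holding just the spurious vector register. */
struct apic_sketch {
    uint32_t spiv;
};

/* Number of APICs currently software-disabled (toy global state). */
static int sw_disabled_count;

/* Tracking depends on comparing the OLD register value with the new one. */
static void set_spiv(struct apic_sketch *apic, uint32_t val)
{
    if ((apic->spiv ^ val) & SPIV_APIC_ENABLED) {
        if (val & SPIV_APIC_ENABLED)
            sw_disabled_count--;
        else
            sw_disabled_count++;
    }
    apic->spiv = val;
}

/*
 * Restore saved state: run the tracking first, while the old value is
 * still visible, and only then overwrite the register state.
 */
static void restore_state(struct apic_sketch *apic,
                          const struct apic_sketch *saved)
{
    set_spiv(apic, saved->spiv);
    *apic = *saved;
}
```

If the overwrite came first, set_spiv() would compare the new value with itself, see no transition, and leave the counter stale.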

07 Aug, 2012 4 commits

KVM: inline kvm_apic_present() and kvm_lapic_enabled() · c48f1496