Commits · fc73373b33f5f965f2f82bfbc40ef8e6072e986d · openeuler / Kernel

12 Jul, 2012 1 commit

KVM: Add x86_hyper_kvm to complete detect_hypervisor_platform check · fc73373b

Authored by Prarit Bhargava on Jul 06, 2012

While debugging I noticed that, unlike all the other hypervisor code in the
kernel, kvm does not have an x86_hyper entry.  That entry is used in
detect_hypervisor_platform(), which results in a nice printk in the
syslog.  This is only really a stub function but it
does make kvm more consistent with the other hypervisors.
Signed-off-by: Prarit Bhargava <prarit@redhat.com>
Cc: Avi Kivity <avi@redhat.com>
Cc: Gleb Natapov <gleb@redhat.com>
Cc: Alex Williamson <alex.williamson@redhat.com>
Cc: Konrad Rzeszutek Wilk <konrad.wilk@oracle.com>
Cc: Marcelo Tosatti <mtosatti@redhat.com>
Cc: kvm@vger.kernel.org
Signed-off-by: Avi Kivity <avi@redhat.com>

fc73373b
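
The pattern this commit describes can be modeled outside the kernel as a table of hypervisor descriptors that a detection routine walks. The following is a minimal sketch under stated assumptions, not the kernel code: `struct hypervisor_desc`, `detect_platform()`, and the stub detectors are hypothetical stand-ins for the kernel's `x86_hyper` machinery.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for a per-hypervisor descriptor. */
struct hypervisor_desc {
    const char *name;
    bool (*detect)(void);
};

/* Stub detectors; a real one would probe for a hypervisor signature. */
static bool detect_none(void) { return false; }
static bool detect_kvm_stub(void) { return true; }

static const struct hypervisor_desc hyper_other = { "Other", detect_none };
static const struct hypervisor_desc hyper_kvm   = { "KVM",   detect_kvm_stub };

/* Walk the table and return the first descriptor whose detect() succeeds. */
static const struct hypervisor_desc *
detect_platform(const struct hypervisor_desc *const *table, size_t n)
{
    for (size_t i = 0; i < n; i++)
        if (table[i]->detect())
            return table[i];
    return NULL; /* nothing in the table matched */
}
```

Without a KVM entry in the table, the walk simply finds nothing to report, which is the inconsistency the commit message points out.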

11 Jul, 2012 10 commits

KVM: MMU: document mmu-lock and fast page fault · 58d8b172

Authored by Xiao Guangrong on Jun 20, 2012

Document fast page fault and mmu-lock in locking.txt
Signed-off-by: Xiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

58d8b172

KVM: MMU: fix kvm_mmu_pagetable_walk tracepoint · 6fbc2770

Authored by Xiao Guangrong on Jun 20, 2012

The P bit of the page fault error code is missing from this tracepoint; fix it by
passing the full error code
Signed-off-by: Xiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

6fbc2770

KVM: MMU: trace fast page fault · a72faf25

Authored by Xiao Guangrong on Jun 20, 2012

To see what happens on this path and help us optimize it
Signed-off-by: Xiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

a72faf25

KVM: MMU: fast path of handling guest page fault · c7ba5b48

Authored by Xiao Guangrong on Jun 20, 2012

If the present bit of the page fault error code is set, it indicates
that the shadow page is populated on all levels; this means all we need to do is
modify the access bits, which can be done outside of mmu-lock

Currently, in order to simplify the code, we only fix page faults
caused by write-protect on the fast path
Signed-off-by: Xiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

c7ba5b48
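
A lockless fix of a write-protect fault can be sketched with a compare-and-swap on the page table entry. This is an illustrative model only, assuming a hypothetical bit layout and a hypothetical software flag (`SPTE_MAY_WRITE`) that says the entry is allowed to become writable; it is not the kernel's implementation.

```c
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical bit layout, chosen only for this sketch. */
#define SPTE_PRESENT   (UINT64_C(1) << 0)
#define SPTE_WRITABLE  (UINT64_C(1) << 1)
#define SPTE_MAY_WRITE (UINT64_C(1) << 62) /* software hint: may become writable */

/* Try to make a present, write-protected entry writable without a lock. */
static bool fast_fix_write_fault(_Atomic uint64_t *sptep)
{
    uint64_t old = atomic_load(sptep);

    if (!(old & SPTE_PRESENT))
        return false; /* not populated: needs the slow path */
    if (!(old & SPTE_MAY_WRITE))
        return false; /* write protection must stay in place */

    /* Publish the writable bit only if nobody changed the entry meanwhile. */
    return atomic_compare_exchange_strong(sptep, &old, old | SPTE_WRITABLE);
}
```

If the compare-and-swap fails, another thread modified the entry concurrently, and a caller would retry or fall back to the locked path.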

KVM: MMU: introduce SPTE_MMU_WRITEABLE bit · 49fde340

Authored by Xiao Guangrong on Jun 20, 2012

This bit indicates whether the spte can be writable on the MMU, which means
the corresponding gpte is writable and the corresponding gfn is not
protected by shadow page protection

In a later patch, SPTE_MMU_WRITEABLE will indicate whether the spte
can be locklessly updated
Signed-off-by: Xiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

49fde340

KVM: MMU: fold tlb flush judgement into mmu_spte_update · 6e7d0354

Authored by Xiao Guangrong on Jun 20, 2012

mmu_spte_update() is the common function, so we can easily audit the path
Signed-off-by: Xiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

6e7d0354

KVM: VMX: export PFEC.P bit on ept · 4f5982a5

Authored by Xiao Guangrong on Jun 20, 2012

Export the present bit of the page fault error code; a later patch
will use it
Signed-off-by: Xiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

4f5982a5

KVM: MMU: cleanup spte_write_protect · 8e22f955

Authored by Xiao Guangrong on Jun 20, 2012

Use __drop_large_spte to clean up this function and comment spte_write_protect
Signed-off-by: Xiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

8e22f955

KVM: MMU: abstract spte write-protect · d13bc5b5

Authored by Xiao Guangrong on Jun 20, 2012

Introduce a common function to abstract spte write-protect to
clean up the code
Signed-off-by: Xiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

d13bc5b5

KVM: MMU: return bool in __rmap_write_protect · 2f84569f

Authored by Xiao Guangrong on Jun 20, 2012

The return value of __rmap_write_protect is either 1 or 0; use
true/false instead
Signed-off-by: Xiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

2f84569f

09 Jul, 2012 27 commits

KVM: VMX: Emulate invalid guest state by default · a27685c3

Authored by Avi Kivity on Jun 12, 2012

Our emulation should be complete enough that we can emulate guests
while they are in big real mode, or in a mode transition that is not
virtualizable without unrestricted guest support.
Signed-off-by: Avi Kivity <avi@redhat.com>

a27685c3

KVM: x86 emulator: implement LTR · 80890006

Authored by Avi Kivity on Jun 13, 2012

Opcode 0F 00 /3.  Encountered during Windows XP secondary processor bringup.
Signed-off-by: Avi Kivity <avi@redhat.com>

80890006

KVM: x86 emulator: make loading TR set the busy bit · 869be99c

Authored by Avi Kivity on Jun 13, 2012

Guest software doesn't actually depend on it, but vmx will refuse us
entry if we don't.  Set the bit in both the cached segment and memory,
just to be nice.
Signed-off-by: Avi Kivity <avi@redhat.com>

869be99c
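
On x86, the busy flag of a TSS descriptor is bit 1 of the 4-bit type field, so an available 32-bit TSS (type 9) becomes a busy one (type 0xB). The sketch below shows that bit manipulation on a raw 8-byte descriptor; the helper names `tss_mark_busy()` and `desc_type()` are hypothetical and do not appear in the emulator.

```c
#include <stdint.h>

/* In an 8-byte descriptor, the 4-bit type field occupies bits 40..43. */
#define DESC_TYPE_SHIFT 40
#define DESC_TYPE_BUSY  UINT64_C(2) /* bit 1 of the type field */

/* Return the descriptor with the TSS busy bit set. */
static uint64_t tss_mark_busy(uint64_t desc)
{
    return desc | (DESC_TYPE_BUSY << DESC_TYPE_SHIFT);
}

/* Extract the type field for inspection. */
static unsigned desc_type(uint64_t desc)
{
    return (unsigned)((desc >> DESC_TYPE_SHIFT) & 0xf);
}
```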

KVM: x86 emulator: make read_segment_descriptor() return the address · e919464b

Authored by Avi Kivity on Jun 13, 2012

Some operations want to modify the descriptor later on, so save the
address for future use.
Signed-off-by: Avi Kivity <avi@redhat.com>

e919464b

KVM: x86 emulator: emulate LLDT · a14e579f

Authored by Avi Kivity on Jun 13, 2012

Opcode 0F 00 /2. Used by isolinux during the protected mode transition.
Signed-off-by: Avi Kivity <avi@redhat.com>

a14e579f

KVM: x86 emulator: emulate BSWAP · 9299836e

Authored by Avi Kivity on Jun 13, 2012

Opcodes 0F C8 - 0F CF.

Used by the SeaBIOS cdrom code (though not in big real mode).
Signed-off-by: Avi Kivity <avi@redhat.com>

9299836e
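
For reference, the effect of a 32-bit BSWAP is a reversal of the four bytes of the operand. A minimal portable sketch follows; the function name `bswap32()` is chosen here for illustration.

```c
#include <stdint.h>

/* Reverse the byte order of a 32-bit value, as 32-bit BSWAP does. */
static uint32_t bswap32(uint32_t v)
{
    return (v >> 24) |
           ((v >> 8) & 0x0000ff00u) |
           ((v << 8) & 0x00ff0000u) |
           (v << 24);
}
```

Applying the function twice returns the original value.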

KVM: VMX: Improve error reporting during invalid guest state emulation · de5f70e0

Authored by Avi Kivity on Jun 12, 2012

If instruction emulation fails, report it properly to userspace.
Signed-off-by: Avi Kivity <avi@redhat.com>

de5f70e0

KVM: VMX: Stop invalid guest state emulation on pending event · de87dcdd

Authored by Avi Kivity on Jun 12, 2012

Process the event, possibly injecting an interrupt, before continuing.
Signed-off-by: Avi Kivity <avi@redhat.com>

de87dcdd

KVM: x86 emulator: implement ENTER · 612e89f0

Authored by Avi Kivity on Jun 12, 2012

Opcode C8.

Only ENTER with lexical nesting depth 0 is implemented, since others are
very rare.  We'll fail emulation if nonzero lexical depth is used so data
is not corrupted.
Signed-off-by: Avi Kivity <avi@redhat.com>

612e89f0
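
With nesting depth 0, ENTER reduces to three steps: push the frame pointer, copy the stack pointer into the frame pointer, and reserve the frame. The toy model below illustrates this on a 32-bit stack; `struct toy_cpu`, `push32()`, and `emulate_enter()` are hypothetical names, and the model ignores segmentation and stack-size attributes.

```c
#include <stdint.h>
#include <string.h>

struct toy_cpu {
    uint32_t esp, ebp;
    uint8_t mem[256]; /* flat stack memory, indexed by esp */
};

static void push32(struct toy_cpu *c, uint32_t v)
{
    c->esp -= 4;
    memcpy(&c->mem[c->esp], &v, sizeof(v));
}

/* Returns 0 on success, -1 if this form of ENTER is not handled. */
static int emulate_enter(struct toy_cpu *c, uint16_t frame_size, uint8_t nesting)
{
    if (nesting != 0)
        return -1;        /* refuse rather than corrupt state */
    push32(c, c->ebp);    /* save the caller's frame pointer */
    c->ebp = c->esp;      /* new frame starts at the saved value */
    c->esp -= frame_size; /* reserve space for locals */
    return 0;
}
```

Rejecting nonzero nesting before touching any state mirrors the commit's stated goal of failing emulation without corrupting data.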

KVM: x86 emulator: split push logic from push opcode emulation · 51ddff50

Authored by Avi Kivity on Jun 12, 2012

This allows us to reuse the code without populating ctxt->src and
overriding ctxt->op_bytes.
Signed-off-by: Avi Kivity <avi@redhat.com>

51ddff50

KVM: x86 emulator: fix byte-sized MOVZX/MOVSX · 361cad2b

Authored by Avi Kivity on Jun 11, 2012

Commit 2adb5ad9 removed ByteOp from MOVZX/MOVSX, replacing them by
SrcMem8, but neglected to fix the dependency in the emulation code
on ByteOp.  This caused the instruction not to have any effect in
some circumstances.

Fix by replacing the check for ByteOp with the equivalent src.op_bytes == 1.
Signed-off-by: Avi Kivity <avi@redhat.com>

361cad2b
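
The idea of keying the extension on the source operand size can be sketched as follows. This is an illustration, not the emulator's code: `movzx()` and `movsx()` are hypothetical helpers, and `src_bytes` plays the role of the source operand size check the commit message describes.

```c
#include <stdint.h>

/* Zero-extend an 8-bit or 16-bit source, selected by its size in bytes. */
static uint32_t movzx(uint32_t src_val, unsigned src_bytes)
{
    return src_bytes == 1 ? (uint8_t)src_val : (uint16_t)src_val;
}

/*
 * Sign-extend an 8-bit or 16-bit source. The narrowing casts rely on
 * two's-complement wraparound, as provided by GCC and Clang.
 */
static int32_t movsx(uint32_t src_val, unsigned src_bytes)
{
    return src_bytes == 1 ? (int8_t)src_val : (int16_t)src_val;
}
```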

KVM: x86 emulator: emulate LAHF · 2dd7caa0

Authored by Avi Kivity on Jun 11, 2012

Opcode 9F.
Signed-off-by: Avi Kivity <avi@redhat.com>

2dd7caa0
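
LAHF copies the low byte of the flags register into AH, leaving the rest of the accumulator untouched. A minimal sketch of that data movement, with a hypothetical helper name and the registers passed as plain integers:

```c
#include <stdint.h>

/* Return rax with AH (bits 8..15) replaced by the low byte of eflags. */
static uint64_t lahf(uint64_t rax, uint64_t eflags)
{
    rax &= ~UINT64_C(0xff00);    /* clear AH */
    rax |= (eflags & 0xff) << 8; /* load flags low byte into AH */
    return rax;
}
```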

KVM: VMX: Continue emulating after batch exhausted · 7c068e45

Authored by Avi Kivity on Jun 10, 2012

If we return early from an invalid guest state emulation loop, make
sure we return to it later if the guest state is still invalid.
Signed-off-by: Avi Kivity <avi@redhat.com>

7c068e45

KVM: VMX: Fix interrupt exit condition during emulation · bdea48e3

Authored by Avi Kivity on Jun 10, 2012

Checking EFLAGS.IF is incorrect as we might be in interrupt shadow.  If
that is the case, the main loop will notice that and not inject the interrupt,
causing an endless loop.

Fix by using vmx_interrupt_allowed() to check if we can inject an interrupt
instead.
Signed-off-by: Avi Kivity <avi@redhat.com>

bdea48e3

KVM: x86 emulator: emulate SGDT/SIDT · 96051572

Authored by Avi Kivity on Jun 10, 2012

Opcodes 0F 01 /0 and 0F 01 /1
Signed-off-by: Avi Kivity <avi@redhat.com>

96051572

KVM: Fix SS default ESP/EBP based addressing · a6e3407b

Authored by Avi Kivity on Jun 10, 2012

We correctly default to SS when BP is used as a base in 16-bit address mode,
but we don't do that for 32-bit mode.

Fix by adjusting the default to SS when either ESP or EBP is used as the base
register.
Signed-off-by: Avi Kivity <avi@redhat.com>

a6e3407b
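
The default-segment rule for 32-bit addressing can be sketched as a small decision function. The names below are hypothetical; `has_base` is an assumption of this sketch that models encodings with no base register at all (plain displacement), which keep the DS default.

```c
enum seg { SEG_DS, SEG_SS };

/* x86 register encodings for the two stack-related base registers. */
enum { REG_ESP = 4, REG_EBP = 5 };

/* Pick the default segment for a 32-bit memory operand. */
static enum seg default_segment32(int has_base, int base_reg)
{
    if (has_base && (base_reg == REG_ESP || base_reg == REG_EBP))
        return SEG_SS; /* stack-relative addressing defaults to SS */
    return SEG_DS;
}
```

An explicit segment override prefix would still take precedence over this default.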

KVM: x86 emulator: initialize memop · cbd27ee7

Authored by Avi Kivity on Jun 10, 2012

memop is not initialized; this can cause a two-byte operation
following a 4-byte operation to see garbage values.  Usually
truncation fixes things for us later on, but at least in one case
(call abs) it doesn't.

Fix by moving memop to the auto-initialized field area.
Signed-off-by: Avi Kivity <avi@redhat.com>

cbd27ee7

KVM: x86 emulator: emulate LEAVE · f47cfa31

Authored by Avi Kivity on Jun 07, 2012

Opcode c9; used by some variants of Windows during boot, in big real mode.
Signed-off-by: Avi Kivity <avi@redhat.com>

f47cfa31

KVM: VMX: Limit iterations with emulator_invalid_guest_state · b8405c18

Authored by Avi Kivity on Jun 07, 2012

Otherwise, if the guest ends up looping, we never exit the srcu critical
section, which causes synchronize_srcu() to hang.
Signed-off-by: Avi Kivity <avi@redhat.com>

b8405c18

KVM: VMX: Relax check on unusable segment · f0495f9b

Authored by Avi Kivity on Jun 07, 2012

Some userspace (e.g. QEMU 1.1) munges the d and g bits of segment
descriptors, causing us not to recognize them as unusable segments
with emulate_invalid_guest_state=1.  Relax the check by testing for
segment not present (a non-present segment cannot be usable).
Signed-off-by: Avi Kivity <avi@redhat.com>

f0495f9b

KVM: x86 emulator: fix LIDT/LGDT in long mode · 510425ff

Authored by Avi Kivity on Jun 07, 2012

The operand size for these instructions is 8 bytes in long mode, even without
a REX prefix.  Set it explicitly.

Triggered while booting Linux with emulate_invalid_guest_state=1.
Signed-off-by: Avi Kivity <avi@redhat.com>

510425ff

KVM: x86 emulator: allow loading null SS in long mode · 79d5b4c3

Authored by Avi Kivity on Jun 07, 2012

Null SS is valid in long mode; allow loading it.
Signed-off-by: Avi Kivity <avi@redhat.com>

79d5b4c3

KVM: x86 emulator: emulate cpuid · 6d6eede4

Authored by Avi Kivity on Jun 07, 2012

Opcode 0F A2.

Used by Linux during the mode change trampoline while in a state that is
not virtualizable on vmx without unrestricted_guest, so we need to emulate
it if emulate_invalid_guest_state=1.
Signed-off-by: Avi Kivity <avi@redhat.com>

6d6eede4

KVM: x86 emulator: change ->get_cpuid() accessor to use the x86 semantics · 0017f93a

Authored by Avi Kivity on Jun 07, 2012

Instead of getting an exact leaf, follow the spec and fall back to the last
main leaf instead.  This lets us easily emulate the cpuid instruction in the
emulator.
Signed-off-by: Avi Kivity <avi@redhat.com>

0017f93a
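
The fallback behavior described above can be illustrated with a simple table lookup. This is a simplified sketch under stated assumptions, not the KVM accessor: `struct cpuid_leaf`, `find_exact()`, and `cpuid_lookup()` are hypothetical, subleaf indices are ignored, and the separate limit of the extended (0x80000000) range is not modeled.

```c
#include <stddef.h>
#include <stdint.h>

struct cpuid_leaf {
    uint32_t function;
    uint32_t eax, ebx, ecx, edx;
};

static const struct cpuid_leaf *
find_exact(const struct cpuid_leaf *t, size_t n, uint32_t function)
{
    for (size_t i = 0; i < n; i++)
        if (t[i].function == function)
            return &t[i];
    return NULL;
}

/*
 * Look up a leaf. If the requested function lies beyond the highest basic
 * leaf (reported in EAX of leaf 0), return the highest basic leaf instead.
 * A missing leaf inside the valid range yields NULL.
 */
static const struct cpuid_leaf *
cpuid_lookup(const struct cpuid_leaf *t, size_t n, uint32_t function)
{
    const struct cpuid_leaf *leaf = find_exact(t, n, function);
    const struct cpuid_leaf *leaf0;

    if (leaf)
        return leaf;
    leaf0 = find_exact(t, n, 0);
    if (!leaf0 || function <= leaf0->eax)
        return NULL;
    return find_exact(t, n, leaf0->eax);
}
```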

KVM: Split cpuid register access from computation · 62046e5a

Authored by Avi Kivity on Jun 07, 2012

Introduce kvm_cpuid() to perform the leaf limit check and calculate
register values, and let kvm_emulate_cpuid() just handle reading and
writing the registers from/to the vcpu.  This allows us to reuse
kvm_cpuid() in a context where directly reading and writing registers
is not desired.
Signed-off-by: Avi Kivity <avi@redhat.com>

62046e5a

KVM: VMX: Return correct CPL during transition to protected mode · d881e6f6

Authored by Avi Kivity on Jun 06, 2012

In protected mode, the CPL is defined as the lower two bits of CS, as set by
the last far jump. But during the transition to protected mode, there is no
last far jump, so we need to return zero (the inherited real mode CPL).

Fix by reading CPL from the cache during the transition. This isn't 100%
correct since we don't set the CPL cache on a far jump, but since protected
mode transition will always jump to a segment with RPL=0, it will always
work.
Signed-off-by: Avi Kivity <avi@redhat.com>

d881e6f6

KVM: MMU: Force cr3 reload with two dimensional paging on mov cr3 emulation · e676505a

Authored by Avi Kivity on Jul 08, 2012

Currently the MMU's ->new_cr3() callback does nothing when guest paging
is disabled or when two-dimensional paging (e.g. EPT on Intel) is active.
This means that an emulated write to cr3 can be lost; kvm_set_cr3() will
write vcpu->arch.cr3, but the GUEST_CR3 field in the VMCS will retain its
old value and this is what the guest sees.

This bug did not have any effect until now because:
- with unrestricted guest, or with svm, we never emulate a mov cr3 instruction
- without unrestricted guest, and with paging enabled, we also never emulate a
  mov cr3 instruction
- without unrestricted guest, but with paging disabled, the guest's cr3 is
  ignored until the guest enables paging; at this point the value from arch.cr3
  is loaded correctly by the mov cr0 instruction which turns on paging

However, the patchset that enables big real mode causes us to emulate mov cr3
instructions in protected mode sometimes (when guest state is not virtualizable
by vmx); this mov cr3 is effectively ignored and will crash the guest.

The fix is to make nonpaging_new_cr3() call mmu_free_roots() to force a cr3
reload.  This is awkward because now all the new_cr3 callbacks do the same
thing, and because mmu_free_roots() is somewhat of an overkill; but fixing
that is more complicated and will be done after this minimal fix.

Observed in the Windows XP 32-bit installer while bringing up secondary vcpus.
Signed-off-by: Avi Kivity <avi@redhat.com>

e676505a

07 Jul, 2012 1 commit

KVM: handle last_boosted_vcpu = 0 case · 5cfc2aab

Authored by Rik van Riel on Jun 19, 2012

If last_boosted_vcpu == 0, then we fall through all test cases and
may end up with all VCPUs pouncing on vcpu 0.  With a large enough
guest, this can result in enormous runqueue lock contention, which
can prevent vcpu0 from running, leading to a livelock.

Changing < to <= makes sure we properly handle that case.
Signed-off-by: Rik van Riel <riel@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

5cfc2aab
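
The effect of the comparison change can be seen in a simplified model of a two-pass candidate scan that starts after the last boosted vcpu and wraps around. The function below is a hypothetical sketch: `first_candidate()` and its `inclusive` switch are not kernel names, and the per-vcpu eligibility checks a real scan would perform are omitted.

```c
/*
 * Return the index of the first vcpu the scan would consider.
 * inclusive == 0 models the old "<" test, inclusive != 0 the new "<=" test.
 */
static int first_candidate(int nvcpus, int last_boosted, int inclusive)
{
    for (int pass = 0; pass < 2; pass++) {
        for (int i = 0; i < nvcpus; i++) {
            int skip = inclusive ? (i <= last_boosted)
                                 : (i < last_boosted);

            if (pass == 0 && skip) {
                /* Jump ahead; the loop increment moves past last_boosted. */
                i = last_boosted;
                continue;
            } else if (pass == 1 && i > last_boosted) {
                break; /* second pass covers only 0..last_boosted */
            }
            return i;
        }
    }
    return -1;
}
```

In this model, with the old test and `last_boosted == 0`, every scan picks vcpu 0 first; with the new test the scan starts at vcpu 1, so successive boosts rotate through the vcpus.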

04 Jul, 2012 1 commit

KVM: s390: Fix sigp sense handling. · 21b26c08

Authored by Cornelia Huck on Jun 26, 2012

If sigp sense doesn't have any status bits to report, it should set
cc 0 and leave the register as-is.

Since we know about the external call pending bit, we should report
it if it is set as well.
Acked-by: Heiko Carstens <heiko.carstens@de.ibm.com>
Reviewed-by: Christian Borntraeger <borntraeger@de.ibm.com>
Signed-off-by: Cornelia Huck <cornelia.huck@de.ibm.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

21b26c08

openeuler / Kernel: synced successfully more than 1 year ago