提交 · 8a395363e2f9f52ec44a1cd892881e8ee1a53269 · OpenHarmony / kernel_linux

30 1月, 2015 5 次提交

KVM: x86: fix x2apic logical address matching · 8a395363

由 Radim Krčmář 提交于 1月 29, 2015

We cannot hit the bug now, but future patches will expose this path.
Signed-off-by: NRadim Krčmář <rkrcmar@redhat.com>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

8a395363

KVM: x86: replace 0 with APIC_DEST_PHYSICAL · 3697f302

由 Radim Krčmář 提交于 1月 29, 2015

To make the code self-documenting.
Signed-off-by: NRadim Krčmář <rkrcmar@redhat.com>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

3697f302

KVM: x86: cleanup kvm_apic_match_*() · 9368b567

由 Radim Krčmář 提交于 1月 29, 2015

The majority of this patch turns
  result = 0; if (CODE) result = 1; return result;
into
  return CODE;
because we return bool now.
Signed-off-by: NRadim Krčmář <rkrcmar@redhat.com>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

9368b567

KVM: x86: return bool from kvm_apic_match*() · 52c233a4

由 Radim Krčmář 提交于 1月 29, 2015

And don't export the internal ones while at it.
Signed-off-by: NRadim Krčmář <rkrcmar@redhat.com>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

52c233a4

KVM: VMX: Add PML support in VMX · 843e4330

由 Kai Huang 提交于 1月 28, 2015

This patch adds PML support in VMX. A new module parameter 'enable_pml' is added
to allow user to enable/disable it manually.
Signed-off-by: NKai Huang <kai.huang@linux.intel.com>
Reviewed-by: NXiao Guangrong <guangrong.xiao@linux.intel.com>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

843e4330

29 1月, 2015 5 次提交

KVM: x86: Add new dirty logging kvm_x86_ops for PML · 88178fd4

由 Kai Huang 提交于 1月 28, 2015

This patch adds new kvm_x86_ops dirty logging hooks to enable/disable dirty
logging for particular memory slot, and to flush potentially logged dirty GPAs
before reporting slot->dirty_bitmap to userspace.

kvm x86 common code calls these hooks when they are available so PML logic can
be hidden to VMX specific. SVM won't be impacted as these hooks remain NULL
there.
Signed-off-by: NKai Huang <kai.huang@linux.intel.com>
Reviewed-by: NXiao Guangrong <guangrong.xiao@linux.intel.com>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

88178fd4

KVM: x86: Change parameter of kvm_mmu_slot_remove_write_access · 1c91cad4

由 Kai Huang 提交于 1月 28, 2015

This patch changes the second parameter of kvm_mmu_slot_remove_write_access from
'slot id' to 'struct kvm_memory_slot *' to align with kvm_x86_ops dirty logging
hooks, which will be introduced in further patch.

Better way is to change second parameter of kvm_arch_commit_memory_region from
'struct kvm_userspace_memory_region *' to 'struct kvm_memory_slot * new', but it
requires changes on other non-x86 ARCH too, so avoid it now.
Signed-off-by: NKai Huang <kai.huang@linux.intel.com>
Reviewed-by: NXiao Guangrong <guangrong.xiao@linux.intel.com>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

1c91cad4

KVM: MMU: Explicitly set D-bit for writable spte. · 9b51a630

由 Kai Huang 提交于 1月 28, 2015

This patch avoids unnecessary dirty GPA logging to PML buffer in EPT violation
path by setting D-bit manually prior to the occurrence of the write from guest.

We only set D-bit manually in set_spte, and leave fast_page_fault path
unchanged, as fast_page_fault is very unlikely to happen in case of PML.

For the hva <-> pa change case, the spte is updated to either read-only (host
pte is read-only) or be dropped (host pte is writeable), and both cases will be
handled by above changes, therefore no change is necessary.
Signed-off-by: NKai Huang <kai.huang@linux.intel.com>
Reviewed-by: NXiao Guangrong <guangrong.xiao@linux.intel.com>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

9b51a630

KVM: MMU: Add mmu help functions to support PML · f4b4b180

由 Kai Huang 提交于 1月 28, 2015

This patch adds new mmu layer functions to clear/set D-bit for memory slot, and
to write protect superpages for memory slot.

In case of PML, CPU logs the dirty GPA automatically to PML buffer when CPU
updates D-bit from 0 to 1, therefore we don't have to write protect 4K pages,
instead, we only need to clear D-bit in order to log that GPA.

For superpages, we still write protect it and let page fault code to handle
dirty page logging, as we still need to split superpage to 4K pages in PML.

As PML is always enabled during guest's lifetime, to eliminate unnecessary PML
GPA logging, we set D-bit manually for the slot with dirty logging disabled.
Signed-off-by: NKai Huang <kai.huang@linux.intel.com>
Reviewed-by: NXiao Guangrong <guangrong.xiao@linux.intel.com>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

f4b4b180

KVM: Rename kvm_arch_mmu_write_protect_pt_masked to be more generic for log dirty · 3b0f1d01

由 Kai Huang 提交于 1月 28, 2015

We don't have to write protect guest memory for dirty logging if architecture
supports hardware dirty logging, such as PML on VMX, so rename it to be more
generic.
Signed-off-by: NKai Huang <kai.huang@linux.intel.com>
Reviewed-by: NXiao Guangrong <guangrong.xiao@linux.intel.com>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

3b0f1d01

28 1月, 2015 1 次提交

kvm: iommu: Add cond_resched to legacy device assignment code · 128ca093

由 Joerg Roedel 提交于 1月 27, 2015

When assigning devices to large memory guests (>=128GB guest
memory in the failure case) the functions to create the
IOMMU page-tables for the whole guest might run for a very
long time. On non-preemptible kernels this might cause
Soft-Lockup warnings. Fix these by adding a cond_resched()
to the mapping and unmapping loops.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

128ca093

26 1月, 2015 7 次提交

KVM: x86: Emulation of call may use incorrect stack size · 82268083

由 Nadav Amit 提交于 1月 26, 2015

On long-mode, when far call that changes cs.l takes place, the stack size is
determined by the new mode.  For instance, if we go from 32-bit mode to 64-bit
mode, the stack-size if 64.  KVM uses the old stack size.

Fix it.
Signed-off-by: NNadav Amit <namit@cs.technion.ac.il>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

82268083

KVM: x86: 32-bit wraparound read/write not emulated correctly · bac15531

由 Nadav Amit 提交于 1月 26, 2015

If we got a wraparound of 32-bit operand, and the limit is 0xffffffff, read and
writes should be successful. It just needs to be done in two segments.
Signed-off-by: NNadav Amit <namit@cs.technion.ac.il>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

bac15531

KVM: x86: Fix defines in emulator.c · 2b42fce6

由 Nadav Amit 提交于 1月 26, 2015

Unnecassary define was left after commit 7d882ffa ("KVM: x86: Revert
NoBigReal patch in the emulator").

Commit 39f062ff ("KVM: x86: Generate #UD when memory operand is required")
was missing undef.

Fix it.
Signed-off-by: NNadav Amit <namit@cs.technion.ac.il>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

2b42fce6

KVM: x86: ARPL emulation can cause spurious exceptions · 2276b511

由 Nadav Amit 提交于 1月 26, 2015

ARPL and MOVSXD are encoded the same and their execution depends on the
execution mode. The operand sizes of each instruction are different.
Currently, ARPL is detected too late, after the decoding was already done, and
therefore may result in spurious exception (instead of failed emulation).

Introduce a group to the emulator to handle instructions according to execution
mode (32/64 bits). Note: in order not to make changes that may affect
performance, the new ModeDual can only be applied to instructions with ModRM.
Signed-off-by: NNadav Amit <namit@cs.technion.ac.il>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

2276b511

KVM: x86: IRET emulation does not clear NMI masking · 801806d9

由 Nadav Amit 提交于 1月 26, 2015

The IRET instruction should clear NMI masking, but the current implementation
does not do so.
Signed-off-by: NNadav Amit <namit@cs.technion.ac.il>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

801806d9

KVM: x86: Wrong operand size for far ret · 16794aaa

由 Nadav Amit 提交于 1月 26, 2015

Indeed, Intel SDM specifically states that for the RET instruction "In 64-bit
mode, the default operation size of this instruction is the stack-address size,
i.e. 64 bits."

However, experiments show this is not the case. Here is for example objdump of
small 64-bit asm:

  4004f1:	ca 14 00             	lret   $0x14
  4004f4:	48 cb                	lretq
  4004f6:	48 ca 14 00          	lretq  $0x14

Therefore, remove the Stack flag from far-ret instructions.
Signed-off-by: NNadav Amit <namit@cs.technion.ac.il>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

16794aaa

KVM: x86: Dirty the dest op page on cmpxchg emulation · 2fcf5c8a

由 Nadav Amit 提交于 1月 26, 2015

Intel SDM says for CMPXCHG: "To simplify the interface to the processorâ€™s bus,
the destination operand receives a write cycle without regard to the result of
the comparison.". This means the destination page should be dirtied.

Fix it to by writing back the original value if cmpxchg failed.
Signed-off-by: NNadav Amit <namit@cs.technion.ac.il>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

2fcf5c8a

23 1月, 2015 1 次提交

KVM: remove unneeded return value of vcpu_postcreate · 31928aa5

由 Dominik Dingel 提交于 12月 04, 2014

The return value of kvm_arch_vcpu_postcreate is not checked in its
caller.  This is okay, because only x86 provides vcpu_postcreate right
now and it could only fail if vcpu_load failed.  But that is not
possible during KVM_CREATE_VCPU (kvm_arch_vcpu_load is void, too), so
just get rid of the unchecked return value.
Signed-off-by: NDominik Dingel <dingel@linux.vnet.ibm.com>
Acked-by: NCornelia Huck <cornelia.huck@de.ibm.com>
Signed-off-by: NChristian Borntraeger <borntraeger@de.ibm.com>

31928aa5

21 1月, 2015 2 次提交

KVM: x86: workaround SuSE's 2.6.16 pvclock vs masterclock issue · 54750f2c

由 Marcelo Tosatti 提交于 1月 20, 2015

SuSE's 2.6.16 kernel fails to boot if the delta between tsc_timestamp
and rdtsc is larger than a given threshold:

 * If we get more than the below threshold into the future, we rerequest
 * the real time from the host again which has only little offset then
 * that we need to adjust using the TSC.
 *
 * For now that threshold is 1/5th of a jiffie. That should be good
 * enough accuracy for completely broken systems, but also give us swing
 * to not call out to the host all the time.
 */
#define PVCLOCK_DELTA_MAX ((1000000000ULL / HZ) / 5)

Disable masterclock support (which increases said delta) in case the
boot vcpu does not use MSR_KVM_SYSTEM_TIME_NEW.

Upstreams kernels which support pvclock vsyscalls (and therefore make
use of PVCLOCK_STABLE_BIT) use MSR_KVM_SYSTEM_TIME_NEW.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

54750f2c

KVM: fix "Should it be static?" warnings from sparse · 69b0049a

由 Fengguang Wu 提交于 1月 19, 2015

arch/x86/kvm/x86.c:495:5: sparse: symbol 'kvm_read_nested_guest_page' was not declared. Should it be static?
arch/x86/kvm/x86.c:646:5: sparse: symbol '__kvm_set_xcr' was not declared. Should it be static?
arch/x86/kvm/x86.c:1183:15: sparse: symbol 'max_tsc_khz' was not declared. Should it be static?
arch/x86/kvm/x86.c:1237:6: sparse: symbol 'kvm_track_tsc_matching' was not declared. Should it be static?
Signed-off-by: NFengguang Wu <fengguang.wu@intel.com>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

69b0049a

19 1月, 2015 2 次提交

Optimize TLB flush in kvm_mmu_slot_remove_write_access. · d91ffee9

由 Kai Huang 提交于 1月 12, 2015

No TLB flush is needed when there's no valid rmap in memory slot.
Signed-off-by: NKai Huang <kai.huang@linux.intel.com>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

d91ffee9

x86: kvm: vmx: Remove some unused functions · 0c55d6d9

由 Rickard Strandqvist 提交于 1月 11, 2015

Removes some functions that are not used anywhere:
cpu_has_vmx_eptp_writeback() cpu_has_vmx_eptp_uncacheable()

This was partially found by using a static code analysis program called cppcheck.
Signed-off-by: NRickard Strandqvist <rickard_strandqvist@spectrumdigital.se>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

0c55d6d9

16 1月, 2015 1 次提交

KVM: x86: switch to kvm_get_dirty_log_protect · e108ff2f

由 Paolo Bonzini 提交于 1月 15, 2015

We now have a generic function that does most of the work of
kvm_vm_ioctl_get_dirty_log, now use it.
Acked-by: NChristoffer Dall <christoffer.dall@linaro.org>
Signed-off-by: NMario Smarduch <m.smarduch@samsung.com>

e108ff2f

09 1月, 2015 16 次提交

KVM: x86: #PF error-code on R/W operations is wrong · c205fb7d

由 Nadav Amit 提交于 12月 25, 2014

When emulating an instruction that reads the destination memory operand (i.e.,
instructions without the Mov flag in the emulator), the operand is first read.
If a page-fault is detected in this phase, the error-code which would be
delivered to the VM does not indicate that the access that caused the exception
is a write one. This does not conform with real hardware, and may cause the VM
to enter the page-fault handler twice for no reason (once for read, once for
write).
Signed-off-by: NNadav Amit <namit@cs.technion.ac.il>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

c205fb7d

KVM: x86: flush TLB when D bit is manually changed. · 7e71a59b

由 Kai Huang 提交于 1月 09, 2015

When software changes D bit (either from 1 to 0, or 0 to 1), the
corresponding TLB entity in the hardware won't be updated immediately. We
should flush it to guarantee the consistence of D bit between TLB and
MMU page table in memory.  This is especially important when clearing
the D bit, since it may cause false negatives in reporting dirtiness.

Sanity test was done on my machine with Intel processor.
Signed-off-by: NKai Huang <kai.huang@linux.intel.com>
[Check A bit too. - Paolo]
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

7e71a59b

KVM: x86: allow TSC deadline timer on all hosts · defcf51f

由 Radim Krčmář 提交于 1月 08, 2015

Emulation does not utilize the feature.
Signed-off-by: NRadim Krčmář <rkrcmar@redhat.com>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

defcf51f

kvm: x86: Remove kvm_make_request from lapic.c · bab5bb39

由 Nicholas Krause 提交于 1月 01, 2015

Adds a function kvm_vcpu_set_pending_timer instead of calling
kvm_make_request in lapic.c.
Signed-off-by: NNicholas Krause <xerofoify@gmail.com>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

bab5bb39

KVM: x86: Access to LDT/GDT that wraparound is incorrect · edccda7c

由 Nadav Amit 提交于 12月 25, 2014

When access to descriptor in LDT/GDT wraparound outside long-mode, the address
of the descriptor should be truncated to 32-bit.  Citing Intel SDM 2.1.1.1
"Global and Local Descriptor Tables in IA-32e Mode": "GDTR and LDTR registers
are expanded to 64-bits wide in both IA-32e sub-modes (64-bit mode and
compatibility mode)."

So in other cases, we need to truncate. Creating new function to return a
pointer to descriptor table to avoid too much code duplication.
Signed-off-by: NNadav Amit <namit@cs.technion.ac.il>
[Wrap 64-bit check with #ifdef CONFIG_X86_64, to avoid a "right shift count
 >= width of type" warning and consequent undefined behavior. - Paolo]
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

edccda7c

KVM: x86: Do not set access bit on accessed segments · e2cefa74

由 Nadav Amit 提交于 12月 25, 2014

When segment is loaded, the segment access bit is set unconditionally. In
fact, it should be set conditionally, based on whether the segment had the
accessed bit set before. In addition, it can improve performance.
Signed-off-by: NNadav Amit <namit@cs.technion.ac.il>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

e2cefa74

KVM: x86: POP [ESP] is not emulated correctly · ab708099

由 Nadav Amit 提交于 12月 25, 2014

According to Intel SDM: "If the ESP register is used as a base register for
addressing a destination operand in memory, the POP instruction computes the
effective address of the operand after it increments the ESP register."

The current emulation does not behave so. The fix required to waste another
of the precious instruction flags and to check the flag in decode_modrm.
Signed-off-by: NNadav Amit <namit@cs.technion.ac.il>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

ab708099

KVM: x86: em_call_far should return failure result · 80976dbb

由 Nadav Amit 提交于 12月 25, 2014

Currently, if em_call_far fails it returns success instead of the resulting
error-code. Fix it.
Signed-off-by: NNadav Amit <namit@cs.technion.ac.il>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

80976dbb

KVM: x86: JMP/CALL using call- or task-gate causes exception · 3dc4bc4f

由 Nadav Amit 提交于 12月 25, 2014

The KVM emulator does not emulate JMP and CALL that target a call gate or a
task gate.  This patch does not try to implement these scenario as they are
presumably rare; yet it returns X86EMUL_UNHANDLEABLE error in such cases
instead of generating an exception.
Signed-off-by: NNadav Amit <namit@cs.technion.ac.il>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

3dc4bc4f

KVM: x86: fnstcw and fnstsw may cause spurious exception · 16bebefe

由 Nadav Amit 提交于 12月 25, 2014

Since the operand size of fnstcw and fnstsw is updated during the execution,
the emulation may cause spurious exceptions as it reads the memory beforehand.

Marking these instructions as Mov (since the previous value is ignored) and
DstMem16 to simplify the setting of operand size.
Signed-off-by: NNadav Amit <namit@cs.technion.ac.il>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

16bebefe

KVM: x86: pop sreg accesses only 2 bytes · 3313bc4e

由 Nadav Amit 提交于 12月 25, 2014

Although pop sreg updates RSP according to the operand size, only 2 bytes are
read. The current behavior may result in incorrect #GP or #PF exceptions.
Signed-off-by: NNadav Amit <namit@cs.technion.ac.il>
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

3313bc4e

KVM: x86: mmu: replace assertions with MMU_WARN_ON, a conditional WARN_ON · fa4a2c08

由 Paolo Bonzini 提交于 10月 02, 2013

This makes the direction of the conditions consistent with code that
is already using WARN_ON.
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

fa4a2c08

KVM: x86: mmu: remove ASSERT(vcpu) · 4c1a50de

由 Paolo Bonzini 提交于 10月 02, 2013

Because ASSERT is just a printk, these would oops right away.
The assertion thus hardly adds anything.
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

4c1a50de

KVM: x86: mmu: remove argument to kvm_init_shadow_mmu and kvm_init_shadow_ept_mmu · ad896af0

由 Paolo Bonzini 提交于 10月 02, 2013

The initialization function in mmu.c can always use walk_mmu, which
is known to be vcpu->arch.mmu.  Only init_kvm_nested_mmu is used to
initialize vcpu->arch.nested_mmu.
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

ad896af0

P
KVM: x86: mmu: do not use return to tail-call functions that return void · e0c6db3e
由 Paolo Bonzini 提交于 12月 23, 2014
```
This is, pedantically, not valid C.  It also looks weird.
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>
```
e0c6db3e

KVM: x86: add tracepoint to wait_lapic_expire · 6c19b753

由 Marcelo Tosatti 提交于 12月 16, 2014

Add tracepoint to wait_lapic_expire.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
[Remind reader if early or late. - Paolo]
Signed-off-by: NPaolo Bonzini <pbonzini@redhat.com>

6c19b753

OpenHarmony / kernel_linux 上一次同步 3 年多

OpenHarmony / kernel_linux
上一次同步 3 年多