提交 · 0853d2c1d849ef69884d2447d90d04007590b72b · openanolis / cloud-kernel

31 12月, 2008 11 次提交

KVM: Fix cpuid leaf 0xb loop termination · 0853d2c1

由 Nitin A Kamble 提交于 11月 05, 2008

For cpuid leaf 0xb the bits 8-15 in ECX register define the end of counting
leaf.      The previous code was using bits 0-7 for this purpose, which is
a bug.
Signed-off-by: NNitin A Kamble <nitin.a.kamble@intel.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

0853d2c1

KVM: Enable Function Level Reset for assigned device · 6eb55818

由 Sheng Yang 提交于 10月 31, 2008

Ideally, every assigned device should in a clear condition before and after
assignment, so that the former state of device won't affect later work.
Some devices provide a mechanism named Function Level Reset, which is
defined in PCI/PCI-e document. We should execute it before and after device
assignment.

(But sadly, the feature is new, and most device on the market now don't
support it. We are considering using D0/D3hot transmit to emulate it later,
but not that elegant and reliable as FLR itself.)

[Update: Reminded by Xiantao, execute FLR after we ensure that the device can
be assigned to the guest.]
Signed-off-by: NSheng Yang <sheng@linux.intel.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

6eb55818

KVM: allow emulator to adjust rip for emulated pio instructions · e93f36bc

由 Guillaume Thouvenin 提交于 10月 28, 2008

If we call the emulator we shouldn't call skip_emulated_instruction()
in the first place, since the emulator already computes the next rip
for us. Thus we move ->skip_emulated_instruction() out of
kvm_emulate_pio() and into handle_io() (and the svm equivalent). We
also replaced "return 0" by "break" in the "do_io:" case because now
the shadow register state needs to be committed. Otherwise eip will never
be updated.
Signed-off-by: NGuillaume Thouvenin <guillaume.thouvenin@ext.bull.net>
Signed-off-by: NAvi Kivity <avi@redhat.com>

e93f36bc

KVM: x86: Fix typo in function name · b8222ad2

由 Amit Shah 提交于 10月 22, 2008

get_segment_descritptor_dtable() contains an obvious type.
Signed-off-by: NAmit Shah <amit.shah@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

b8222ad2

KVM: Enable MTRR for EPT · 64d4d521

由 Sheng Yang 提交于 10月 09, 2008

The effective memory type of EPT is the mixture of MSR_IA32_CR_PAT and memory
type field of EPT entry.
Signed-off-by: NSheng Yang <sheng@linux.intel.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

64d4d521

KVM: VMX: Add PAT support for EPT · 468d472f

由 Sheng Yang 提交于 10月 09, 2008

GUEST_PAT support is a new feature introduced by Intel Core i7 architecture.
With this, cpu would save/load guest and host PAT automatically, for EPT memory
type in guest depends on MSR_IA32_CR_PAT.

Also add save/restore for MSR_IA32_CR_PAT.
Signed-off-by: NSheng Yang <sheng@linux.intel.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

468d472f

KVM: Improve MTRR structure · 0bed3b56

由 Sheng Yang 提交于 10月 09, 2008

As well as reset mmu context when set MTRR.
Signed-off-by: NSheng Yang <sheng@linux.intel.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

0bed3b56

KVM: call kvm_arch_vcpu_reset() instead of the kvm_x86_ops callback · 5f179287

由 Gleb Natapov 提交于 10月 07, 2008

Call kvm_arch_vcpu_reset() instead of directly using arch callback.
The function does additional things.
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

5f179287

KVM: x86: Support for user space injected NMIs · c4abb7c9

由 Jan Kiszka 提交于 9月 26, 2008

Introduces the KVM_NMI IOCTL to the generic x86 part of KVM for
injecting NMIs from user space and also extends the statistic report
accordingly.

Based on the original patch by Sheng Yang.
Signed-off-by: NJan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: NSheng Yang <sheng.yang@intel.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

c4abb7c9

KVM: x86: VCPU with pending NMI is runnabled · 0496fbb9

由 Jan Kiszka 提交于 9月 26, 2008

Ensure that a VCPU with pending NMIs is considered runnable.
Signed-off-by: NJan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

0496fbb9

KVM: x86: Reset pending/inject NMI state on CPU reset · 448fa4a9

由 Jan Kiszka 提交于 9月 26, 2008

CPU reset invalidates pending or already injected NMIs, therefore reset
the related state variables.

Based on original patch by Gleb Natapov.
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Signed-off-by: NJan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

448fa4a9

28 10月, 2008 1 次提交

KVM: Fix guest shared interrupt with in-kernel irqchip · 5550af4d

由 Sheng Yang 提交于 10月 15, 2008

Every call of kvm_set_irq() should offer an irq_source_id, which is
allocated by kvm_request_irq_source_id(). Based on irq_source_id, we
identify the irq source and implement logical OR for shared level
interrupts.

The allocated irq_source_id can be freed by kvm_free_irq_source_id().

Currently, we support at most sizeof(unsigned long) different irq sources.

[Amit: - rebase to kvm.git HEAD
       - move definition of KVM_USERSPACE_IRQ_SOURCE_ID to common file
       - move kvm_request_irq_source_id to the update_irq ioctl]

[Xiantao: - Add kvm/ia64 stuff and make it work for kvm/ia64 guests]
Signed-off-by: NSheng Yang <sheng@linux.intel.com>
Signed-off-by: NAmit Shah <amit.shah@redhat.com>
Signed-off-by: NXiantao Zhang <xiantao.zhang@intel.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

5550af4d

17 10月, 2008 1 次提交

misc: replace __FUNCTION__ with __func__ · 80a914dc

由 Harvey Harrison 提交于 10月 15, 2008

__FUNCTION__ is gcc-specific, use __func__
Signed-off-by: NHarvey Harrison <harvey.harrison@gmail.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

80a914dc

15 10月, 2008 27 次提交

KVM: Move device assignment logic to common code · 8a98f664

由 Xiantao Zhang 提交于 10月 06, 2008

To share with other archs, this patch moves device assignment
logic to common parts.
Signed-off-by: NXiantao Zhang <xiantao.zhang@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

8a98f664

KVM: x86: Silence various LAPIC-related host kernel messages · 1b10bf31

由 Jan Kiszka 提交于 9月 30, 2008

KVM-x86 dumps a lot of debug messages that have no meaning for normal
operation:
 - INIT de-assertion is ignored
 - SIPIs are sent and received
 - APIC writes are unaligned or < 4 byte long
   (Windows Server 2003 triggers this on SMP)

Degrade them to true debug messages, keeping the host kernel log clean
for real problems.
Signed-off-by: NJan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

1b10bf31

KVM: PIC: enhance IPI avoidance · e4825800

由 Marcelo Tosatti 提交于 9月 24, 2008

The PIC code makes little effort to avoid kvm_vcpu_kick(), resulting in
unnecessary guest exits in some conditions.

For example, if the timer interrupt is routed through the IOAPIC, IRR
for IRQ 0 will get set but not cleared, since the APIC is handling the
acks.

This means that everytime an interrupt < 16 is triggered, the priority
logic will find IRQ0 pending and send an IPI to vcpu0 (in case IRQ0 is
not masked, which is Linux's case).

Introduce a new variable isr_ack to represent the IRQ's for which the
guest has been signalled / cleared the ISR. Use it to avoid more than
one IPI per trigger-ack cycle, in addition to the avoidance when ISR is
set in get_priority().
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

e4825800

KVM: MMU: out of sync shadow core · 4731d4c7

由 Marcelo Tosatti 提交于 9月 23, 2008

Allow guest pagetables to go out of sync.  Instead of emulating write
accesses to guest pagetables, or unshadowing them, we un-write-protect
the page table and allow the guest to modify it at will.  We rely on
invlpg executions to synchronize individual ptes, and will synchronize
the entire pagetable on tlb flushes.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

4731d4c7

KVM: x86: trap invlpg · a7052897

由 Marcelo Tosatti 提交于 9月 23, 2008

With pages out of sync invlpg needs to be trapped. For now simply nuke
the entry.

Untested on AMD.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

a7052897

KVM: MMU: sync roots on mmu reload · 0ba73cda

由 Marcelo Tosatti 提交于 9月 23, 2008

Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

0ba73cda

KVM: don't enter guest after SIPI was received by a CPU · af2152f5

由 Gleb Natapov 提交于 9月 22, 2008

The vcpu should process pending SIPI message before entering guest mode again.
kvm_arch_vcpu_runnable() returns true if the vcpu is in SIPI state, so
we can't call it here.
Signed-off-by: NGleb Natapov <gleb@qumranet.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

af2152f5

KVM: x86.c make kvm_load_realmode_segment static · 2259e3a7

由 Harvey Harrison 提交于 8月 22, 2008

Noticed by sparse:
arch/x86/kvm/x86.c:3591:5: warning: symbol 'kvm_load_realmode_segment' was not declared. Should it be static?
Signed-off-by: NHarvey Harrison <harvey.harrison@gmail.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

2259e3a7

KVM: switch to get_user_pages_fast · 4c2155ce

由 Marcelo Tosatti 提交于 9月 16, 2008

Convert gfn_to_pfn to use get_user_pages_fast, which can do lockless
pagetable lookups on x86. Kernel compilation on 4-way guest is 3.7%
faster on VMX.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

4c2155ce

KVM: Device Assignment: Free device structures if IRQ allocation fails · bfadaded

由 Amit Shah 提交于 9月 16, 2008

When an IRQ allocation fails, we free up the device structures and
disable the device so that we can unregister the device in the
userspace and not expose it to the guest at all.
Signed-off-by: NAmit Shah <amit.shah@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

bfadaded

KVM: Device Assignment with VT-d · 62c476c7

由 Ben-Ami Yassour 提交于 9月 14, 2008

Based on a patch by: Kay, Allen M <allen.m.kay@intel.com>

This patch enables PCI device assignment based on VT-d support.
When a device is assigned to the guest, the guest memory is pinned and
the mapping is updated in the VT-d IOMMU.

[Amit: Expose KVM_CAP_IOMMU so we can check if an IOMMU is present
and also control enable/disable from userspace]
Signed-off-by: NKay, Allen M <allen.m.kay@intel.com>
Signed-off-by: NWeidong Han <weidong.han@intel.com>
Signed-off-by: NBen-Ami Yassour <benami@il.ibm.com>
Signed-off-by: NAmit Shah <amit.shah@qumranet.com>
Acked-by: NMark Gross <mgross@linux.intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

62c476c7

KVM: x86: unhalt vcpu0 on reset · 9c3e4aab

由 Marcelo Tosatti 提交于 9月 10, 2008

Since "KVM: x86: do not execute halted vcpus", HLT by vcpu0 before system
reset by the IO thread will hang the guest.

Mark vcpu as runnable in such case.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

9c3e4aab

KVM: x86: do not execute halted vcpus · d7690175

由 Marcelo Tosatti 提交于 9月 08, 2008

Offline or uninitialized vcpu's can be executed if requested to perform
userspace work.

Follow Avi's suggestion to handle halted vcpu's in the main loop,
simplifying kvm_emulate_halt(). Introduce a new vcpu->requests bit to
indicate events that promote state from halted to running.

Also standardize vcpu wake sites.

Signed-off-by: Marcelo Tosatti <mtosatti <at> redhat.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

d7690175

KVM: Add statistics for guest irq injections · fa89a817

由 Avi Kivity 提交于 9月 01, 2008

These can help show whether a guest is making progress or not.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

fa89a817

KVM: add MC5_MISC msr read support · a89c1ad2

由 Joerg Roedel 提交于 8月 29, 2008

Currently KVM implements MC0-MC4_MISC read support. When booting Linux this
results in KVM warnings in the kernel log when the guest tries to read
MC5_MISC. Fix this warnings with this patch.
Signed-off-by: NJoerg Roedel <joerg.roedel@amd.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

a89c1ad2

KVM: Allocate guest memory as MAP_PRIVATE, not MAP_SHARED · acee3c04

由 Avi Kivity 提交于 8月 26, 2008

There is no reason to share internal memory slots with fork()ed instances.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

acee3c04

KVM: Load real mode segments correctly · f4bbd9aa

由 Avi Kivity 提交于 8月 20, 2008

Real mode segments to not reference the GDT or LDT; they simply compute
base = selector * 16.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

f4bbd9aa

KVM: make irq ack notifier functions static · ee032c99

由 Harvey Harrison 提交于 8月 11, 2008

sparse says:

arch/x86/kvm/x86.c:107:32: warning: symbol 'kvm_find_assigned_dev' was not declared. Should it be static?
arch/x86/kvm/i8254.c:225:6: warning: symbol 'kvm_pit_ack_irq' was not declared. Should it be static?
Signed-off-by: NHarvey Harrison <harvey.harrison@gmail.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

ee032c99

KVM: Use kvm_set_irq to inject interrupts · 29c8fa32

由 Amit Shah 提交于 8月 18, 2008

... instead of using the pic and ioapic variants
Signed-off-by: NAmit Shah <amit.shah@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

29c8fa32

KVM: Device assignment: Check for privileges before assigning irq · 6762b729

由 Amit Shah 提交于 8月 13, 2008

Even though we don't share irqs at the moment, we should ensure
regular user processes don't try to allocate system resources.

We check for capability to access IO devices (CAP_SYS_RAWIO) before
we request_irq on behalf of the guest.

Noticed by Avi.
Signed-off-by: NAmit Shah <amit.shah@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

6762b729

KVM: set debug registers after "schedulable" section · 29415c37

由 Marcelo Tosatti 提交于 8月 01, 2008

The vcpu thread can be preempted after the guest_debug_pre() callback,
resulting in invalid debug registers on the new vcpu.

Move it inside the non-preemptable section.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

29415c37

KVM: Reduce stack usage in kvm_arch_vcpu_ioctl() · b772ff36

由 Dave Hansen 提交于 8月 11, 2008

[sheng: fix KVM_GET_LAPIC using wrong size]
Signed-off-by: NDave Hansen <dave@linux.vnet.ibm.com>
Signed-off-by: NSheng Yang <sheng.yang@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

b772ff36

KVM: Reduce kvm stack usage in kvm_arch_vm_ioctl() · f0d66275

由 Dave Hansen 提交于 8月 11, 2008

On my machine with gcc 3.4, kvm uses ~2k of stack in a few
select functions.  This is mostly because gcc fails to
notice that the different case: statements could have their
stack usage combined.  It overflows very nicely if interrupts
happen during one of these large uses.

This patch uses two methods for reducing stack usage.
1. dynamically allocate large objects instead of putting
   on the stack.
2. Use a union{} member for all of the case variables. This
   tricks gcc into combining them all into a single stack
   allocation. (There's also a comment on this)
Signed-off-by: NDave Hansen <dave@linux.vnet.ibm.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

f0d66275

KVM: pci device assignment · 4d5c5d0f

由 Ben-Ami Yassour 提交于 7月 28, 2008

Based on a patch from: Amit Shah <amit.shah@qumranet.com>

This patch adds support for handling PCI devices that are assigned to
the guest.

The device to be assigned to the guest is registered in the host kernel
and interrupt delivery is handled.  If a device is already assigned, or
the device driver for it is still loaded on the host, the device
assignment is failed by conveying a -EBUSY reply to the userspace.

Devices that share their interrupt line are not supported at the moment.

By itself, this patch will not make devices work within the guest.
The VT-d extension is required to enable the device to perform DMA.
Another alternative is PVDMA.
Signed-off-by: NAmit Shah <amit.shah@qumranet.com>
Signed-off-by: NBen-Ami Yassour <benami@il.ibm.com>
Signed-off-by: NWeidong Han <weidong.han@intel.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

4d5c5d0f

KVM: Ignore DEBUGCTL MSRs with no effect · b5e2fec0

由 Alexander Graf 提交于 7月 22, 2008

Netware writes to DEBUGCTL and reads from the DEBUGCTL and LAST*IP MSRs
without further checks and is really confused to receive a #GP during that.
To make it happy we should just make them stubs, which is exactly what SVM
already does.

Writes to DEBUGCTL that are vendor-specific are resembled to behave as if the
virtual CPU does not know them.
Signed-off-by: NAlexander Graf <agraf@suse.de>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

b5e2fec0

KVM: Clear exception queue before emulating an instruction · 26eef70c

由 Avi Kivity 提交于 7月 03, 2008

If we're emulating an instruction, either it will succeed, in which case
any previously queued exception will be spurious, or we will requeue the
same exception.
Signed-off-by: NAvi Kivity <avi@qumranet.com>

26eef70c

KVM: x86: accessors for guest registers · 5fdbf976

由 Marcelo Tosatti 提交于 6月 27, 2008

As suggested by Avi, introduce accessors to read/write guest registers.
This simplifies the ->cache_regs/->decache_regs interface, and improves
register caching which is important for VMX, where the cost of
vmcs_read/vmcs_write is significant.

[avi: fix warnings]
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

5fdbf976

openanolis / cloud-kernel 大约 1 年 前同步成功

openanolis / cloud-kernel
大约 1 年前同步成功