提交 · a8a93f3f03b7a8008d720e8d91798efe599d416c · openanolis / cloud-kernel

28 3月, 2009 1 次提交

由 Christoph Hellwig 提交于 11月 28, 2008

Due to a different size of ino_t ustat needs a compat handler, but
currently only x86 and mips provide one.  Add a generic compat_sys_ustat
and switch all architectures over to it.  Instead of doing various
user copy hacks compat_sys_ustat just reimplements sys_ustat as
it's trivial.  This was suggested by Arnd Bergmann.

Found by Eric Sandeen when running xfstests/017 on ppc64, which causes
stack smashing warnings on RHEL/Fedora due to the too large amount of
data writen by the syscall.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

2b1c6bd7

27 3月, 2009 1 次提交

x86: headers cleanup - setup.h · 17d14040

由 Cyrill Gorcunov 提交于 1月 14, 2009

Impact: cleanup

'make headers_check' warn us about leaking of kernel private
(mostly compile time vars) data to userspace in headers. Fix it.

Guard this one by __KERNEL__.
Signed-off-by: NCyrill Gorcunov <gorcunov@openvz.org>
Signed-off-by: NH. Peter Anvin <hpa@linux.intel.com>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

17d14040

26 3月, 2009 1 次提交

x86: disable __do_IRQ support · fc2869f6

由 Thomas Gleixner 提交于 3月 13, 2009

Impact: disable unused code

x86 is fully converted to flow handlers. No need to keep the
deprecated __do_IRQ() support active.
Signed-off-by: NThomas Gleixner <tglx@linutronix.de>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

fc2869f6

24 3月, 2009 37 次提交

KVM: VMX: Don't allow uninhibited access to EFER on i386 · 16175a79

由 Avi Kivity 提交于 3月 23, 2009

vmx_set_msr() does not allow i386 guests to touch EFER, but they can still
do so through the default: label in the switch.  If they set EFER_LME, they
can oops the host.

Fix by having EFER access through the normal channel (which will check for
EFER_LME) even on i386.
Reported-and-tested-by: NBenjamin Gilbert <bgilbert@cs.cmu.edu>
Cc: stable@kernel.org
Signed-off-by: NAvi Kivity <avi@redhat.com>

16175a79

KVM: Fix missing smp tlb flush in invlpg · 4539b358

由 Andrea Arcangeli 提交于 3月 12, 2009

When kvm emulates an invlpg instruction, it can drop a shadow pte, but
leaves the guest tlbs intact.  This can cause memory corruption when
swapping out.

Without this the other cpu can still write to a freed host physical page.
tlb smp flush must happen if rmap_remove is called always before mmu_lock
is released because the VM will take the mmu_lock before it can finally add
the page to the freelist after swapout. mmu notifier makes it safe to flush
the tlb after freeing the page (otherwise it would never be safe) so we can do
a single flush for multiple sptes invalidated.

Cc: stable@kernel.org
Signed-off-by: NAndrea Arcangeli <aarcange@redhat.com>
Acked-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

4539b358

KVM: fix sparse warnings: Should it be static? · cded19f3

由 Hannes Eder 提交于 2月 21, 2009

Impact: Make symbols static.

Fix this sparse warnings:
arch/x86/kvm/mmu.c:992:5: warning: symbol 'mmu_pages_add' was not declared. Should it be static?
arch/x86/kvm/mmu.c:1124:5: warning: symbol 'mmu_pages_next' was not declared. Should it be static?
arch/x86/kvm/mmu.c:1144:6: warning: symbol 'mmu_pages_clear_parents' was not declared. Should it be static?
arch/x86/kvm/x86.c:2037:5: warning: symbol 'kvm_read_guest_virt' was not declared. Should it be static?
arch/x86/kvm/x86.c:2067:5: warning: symbol 'kvm_write_guest_virt' was not declared. Should it be static?
virt/kvm/irq_comm.c:220:5: warning: symbol 'setup_routing_entry' was not declared. Should it be static?
Signed-off-by: NHannes Eder <hannes@hanneseder.net>
Signed-off-by: NAvi Kivity <avi@redhat.com>

cded19f3

KVM: fix sparse warnings: context imbalance · d7364a29

由 Hannes Eder 提交于 2月 21, 2009

Impact: Attribute function with __acquires(...) resp. __releases(...).

Fix this sparse warnings:
arch/x86/kvm/i8259.c:34:13: warning: context imbalance in 'pic_lock' - wrong count at exit
arch/x86/kvm/i8259.c:39:13: warning: context imbalance in 'pic_unlock' - unexpected unlock
Signed-off-by: NHannes Eder <hannes@hanneseder.net>
Signed-off-by: NAvi Kivity <avi@redhat.com>

d7364a29

KVM: is_long_mode() should check for EFER.LMA · 41d6af11

由 Amit Shah 提交于 2月 28, 2008

is_long_mode currently checks the LongModeEnable bit in
EFER instead of the LongModeActive bit. This is wrong, but
we survived this till now since it wasn't triggered. This
breaks guests that go from long mode to compatibility mode.

This is noticed on a solaris guest and fixes bug #1842160
Signed-off-by: NAmit Shah <amit.shah@qumranet.com>
Signed-off-by: NAvi Kivity <avi@qumranet.com>

41d6af11

KVM: VMX: Update necessary state when guest enters long mode · 401d10de

由 Amit Shah 提交于 2月 20, 2009

setup_msrs() should be called when entering long mode to save the
shadow state for the 64-bit guest state.

Using vmx_set_efer() in enter_lmode() removes some duplicated code
and also ensures we call setup_msrs(). We can safely pass the value
of shadow_efer to vmx_set_efer() as no other bits in the efer change
while enabling long mode (guest first sets EFER.LME, then sets CR0.PG
which causes a vmexit where we activate long mode).

With this fix, is_long_mode() can check for EFER.LMA set instead of
EFER.LME and 5e23049e86dd298b72e206b420513dbc3a240cd9 can be reverted.
Signed-off-by: NAmit Shah <amit.shah@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

401d10de

KVM: MMU: Fix another largepage memory leak · c5bc2242

由 Joerg Roedel 提交于 2月 19, 2009

In the paging_fetch function rmap_remove is called after setting a large
pte to non-present. This causes rmap_remove to not drop the reference to
the large page. The result is a memory leak of that page.

Cc: stable@kernel.org
Signed-off-by: NJoerg Roedel <joerg.roedel@amd.com>
Acked-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

c5bc2242

KVM: SVM: set accessed bit for VMCB segment selectors · 1fbdc7a5

由 Andre Przywara 提交于 1月 11, 2009

In the segment descriptor _cache_ the accessed bit is always set
(although it can be cleared in the descriptor itself). Since Intel
checks for this condition on a VMENTRY, set this bit in the AMD path
to enable cross vendor migration.

Cc: stable@kernel.org
Signed-off-by: NAndre Przywara <andre.przywara@amd.com>
Acked-By: NAmit Shah <amit.shah@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

1fbdc7a5

KVM: Report IRQ injection status to userspace. · 4925663a

由 Gleb Natapov 提交于 2月 04, 2009

IRQ injection status is either -1 (if there was no CPU found
that should except the interrupt because IRQ was masked or
ioapic was misconfigured or ...) or >= 0 in that case the
number indicates to how many CPUs interrupt was injected.
If the value is 0 it means that the interrupt was coalesced
and probably should be reinjected.
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

4925663a

KVM: MMU: remove assertion in kvm_mmu_alloc_page · 452425db

由 Joerg Roedel 提交于 2月 18, 2009

The assertion no longer makes sense since we don't clear page tables on
allocation; instead we clear them during prefetch.
Signed-off-by: NJoerg Roedel <joerg.roedel@amd.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

452425db

KVM: MMU: remove redundant check in mmu_set_spte · 6bed6b9e

由 Joerg Roedel 提交于 2月 18, 2009

The following code flow is unnecessary:

	if (largepage)
		was_rmapped = is_large_pte(*shadow_pte);
	 else
	 	was_rmapped = 1;

The is_large_pte() function will always evaluate to one here because the
(largepage && !is_large_pte) case is already handled in the first
if-clause. So we can remove this check and set was_rmapped to one always
here.
Signed-off-by: NJoerg Roedel <joerg.roedel@amd.com>
Acked-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

6bed6b9e

KVM: Fix kvmclock on !constant_tsc boxes · c8076604

由 Gerd Hoffmann 提交于 2月 04, 2009

kvmclock currently falls apart on machines without constant tsc.
This patch fixes it.  Changes:

  * keep tsc frequency in a per-cpu variable.
  * handle kvmclock update using a new request flag, thus checking
    whenever we need an update each time we enter guest context.
  * use a cpufreq notifier to track frequency changes and force
    kvmclock updates.
  * send ipis to kick cpu out of guest context if needed to make
    sure the guest doesn't see stale values.
Signed-off-by: NGerd Hoffmann <kraxel@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

c8076604

KVM: VMX: Use kvm_mmu_page_fault() handle EPT violation mmio · 49cd7d22

由 Sheng Yang 提交于 2月 11, 2009

Removed duplicated code.
Signed-off-by: NSheng Yang <sheng@linux.intel.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

49cd7d22

KVM: Drop unused evaluations from string pio handlers · 34c33d16

由 Jan Kiszka 提交于 2月 08, 2009

Looks like neither the direction nor the rep prefix are used anymore.
Drop related evaluations from SVM's and VMX's I/O exit handlers.
Signed-off-by: NJan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

34c33d16

KVM: Add FFXSR support · 1b2fd70c

由 Alexander Graf 提交于 2月 02, 2009

AMD K10 CPUs implement the FFXSR feature that gets enabled using
EFER. Let's check if the virtual CPU description includes that
CPUID feature bit and allow enabling it then.

This is required for Windows Server 2008 in Hyper-V mode.

v2 adds CPUID capability exposure
Signed-off-by: NAlexander Graf <agraf@suse.de>
Signed-off-by: NAvi Kivity <avi@redhat.com>

1b2fd70c

x86: Add EFER descriptions for FFXSR · d2062693

由 Alexander Graf 提交于 2月 02, 2009

AMD k10 includes support for the FFXSR feature, which leaves out
XMM registers on FXSAVE/FXSAVE when the EFER_FFXSR bit is set in
EFER.

The CPUID feature bit exists already, but the EFER bit is missing
currently, so this patch adds it to the list of known EFER bits.
Signed-off-by: NAlexander Graf <agraf@suse.de>
CC: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

d2062693

KVM: make irq ack notifications aware of routing table · 44882eed

由 Marcelo Tosatti 提交于 1月 27, 2009

IRQ ack notifications assume an identity mapping between pin->gsi,
which might not be the case with, for example, HPET.

Translate before acking.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Acked-by: NGleb Natapov <gleb@redhat.com>

44882eed

KVM: Avoid using CONFIG_ in userspace visible headers · 91b2ae77

由 Avi Kivity 提交于 1月 19, 2009

Kconfig symbols are not available in userspace, and are not stripped by
headers-install.  Avoid their use by adding #defines in <asm/kvm.h> to
suit each architecture.
Signed-off-by: NAvi Kivity <avi@redhat.com>

91b2ae77

KVM: Userspace controlled irq routing · 399ec807

由 Avi Kivity 提交于 11月 19, 2008

Currently KVM has a static routing from GSI numbers to interrupts (namely,
0-15 are mapped 1:1 to both PIC and IOAPIC, and 16:23 are mapped 1:1 to
the IOAPIC).  This is insufficient for several reasons:

- HPET requires non 1:1 mapping for the timer interrupt
- MSIs need a new method to assign interrupt numbers and dispatch them
- ACPI APIC mode needs to be able to reassign the PCI LINK interrupts to the
  ioapics

This patch implements an interrupt routing table (as a linked list, but this
can be easily changed) and a userspace interface to replace the table.  The
routing table is initialized according to the current hardwired mapping.
Signed-off-by: NAvi Kivity <avi@redhat.com>

399ec807

KVM: x86: Fix typos and whitespace errors · 19355475

由 Amit Shah 提交于 1月 14, 2009

Some typos, comments, whitespace errors corrected in the cpuid code
Signed-off-by: NAmit Shah <amit.shah@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

19355475

A
KVM: MMU: Only enable cr4_pge role in shadow mode · 5a41accd
由 Avi Kivity 提交于 1月 11, 2009
```
Two dimensional paging is only confused by it.
Signed-off-by: NAvi Kivity <avi@redhat.com>
```
5a41accd

KVM: MMU: Rename "metaphysical" attribute to "direct" · f6e2c02b

由 Avi Kivity 提交于 1月 11, 2009

This actually describes what is going on, rather than alerting the reader
that something strange is going on.
Signed-off-by: NAvi Kivity <avi@redhat.com>

f6e2c02b

KVM: MMU: drop zeroing on mmu_memory_cache_alloc · 9903a927

由 Marcelo Tosatti 提交于 1月 08, 2009

Zeroing on mmu_memory_cache_alloc is unnecessary since:

- Smaller areas are pre-allocated with kmem_cache_zalloc.
- Page pointed by ->spt is overwritten with prefetch_page
  and entries in page pointed by ->gfns are initialized
  before reading.

[avi: zeroing pages is unnecessary]
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

9903a927

KVM: SVM: Fix typo in has_svm() · ff81ff10

由 Joe Perches 提交于 1月 08, 2009

Signed-off-by: NJoe Perches <joe@perches.com>
Acked-by: NJoerg Roedel <joerg.roedel@amd.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

ff81ff10

KVM: Reset PIT irq injection logic when the PIT IRQ is unmasked · 4780c659

由 Avi Kivity 提交于 1月 04, 2009

While the PIT is masked the guest cannot ack the irq, so the reinject logic
will never allow the interrupt to be injected.

Fix by resetting the reinjection counters on unmask.

Unbreaks Xen.
Signed-off-by: NAvi Kivity <avi@redhat.com>

4780c659

KVM: Add CONFIG_HAVE_KVM_IRQCHIP · 5d9b8e30

由 Avi Kivity 提交于 1月 04, 2009

Two KVM archs support irqchips and two don't.  Add a Kconfig item to
make selecting between the two models easier.
Signed-off-by: NAvi Kivity <avi@redhat.com>

5d9b8e30

KVM: MMU: Optimize page unshadowing · 4677a3b6

由 Avi Kivity 提交于 1月 06, 2009

Using kvm_mmu_lookup_page() will result in multiple scans of the hash chains;
use hlist_for_each_entry_safe() to achieve a single scan instead.
Signed-off-by: NAvi Kivity <avi@redhat.com>

4677a3b6

KVM: SVM: Add microcode patch level dummy · c8a73f18

由 Alexander Graf 提交于 1月 05, 2009

VMware ESX checks if the microcode level is correct when using a barcelona
CPU, in order to see if it actually can use SVM. Let's tell it we're on the
safe side...
Signed-off-by: NAlexander Graf <agraf@suse.de>
Signed-off-by: NAvi Kivity <avi@redhat.com>

c8a73f18

KVM: Properly lock PIT creation · 269e05e4

由 Avi Kivity 提交于 1月 05, 2009

Otherwise, two threads can create a PIT in parallel and cause a memory leak.
Signed-off-by: NAvi Kivity <avi@redhat.com>

269e05e4

A
KVM: x86 emulator: implement 'ret far' instruction (opcode 0xcb) · a77ab5ea
由 Avi Kivity 提交于 1月 05, 2009
```
Signed-off-by: NAvi Kivity <avi@redhat.com>
```
a77ab5ea
A
KVM: VMX: When emulating on invalid vmx state, don't return to userspace unnecessarily · 8b3079a5
由 Avi Kivity 提交于 1月 05, 2009
```
If we aren't doing mmio there's no need to exit to userspace (which will
just be confused).
Signed-off-by: NAvi Kivity <avi@redhat.com>
```
8b3079a5

KVM: x86 emulator: Make emulate_pop() a little more generic · 350f69dc

由 Avi Kivity 提交于 1月 05, 2009

Allow emulate_pop() to read into arbitrary memory rather than just the
source operand. Needed for complicated instructions like far returns.
Signed-off-by: NAvi Kivity <avi@redhat.com>

350f69dc

KVM: VMX: Prevent exit handler from running if emulating due to invalid state · 10f32d84

由 Avi Kivity 提交于 1月 05, 2009

If we've just emulated an instruction, we won't have any valid exit
reason and associated information.

Fix by moving the clearing of the emulation_required flag to the exit handler.
This way the exit handler can notice that we've been emulating and abort
early.
Signed-off-by: NAvi Kivity <avi@redhat.com>

10f32d84

KVM: VMX: don't clobber segment AR if emulating invalid state · 9fd4a3b7

由 Avi Kivity 提交于 1月 04, 2009

The ususable bit is important for determining state validity; don't
clobber it.
Signed-off-by: NAvi Kivity <avi@redhat.com>

9fd4a3b7

KVM: VMX: Fix guest state validity checks · 1872a3f4

由 Avi Kivity 提交于 1月 04, 2009

The vmx guest state validity checks are full of bugs.  Make them
conform to the manual.
Signed-off-by: NAvi Kivity <avi@redhat.com>

1872a3f4

KVM: Move struct kvm_pio_request into x86 kvm_host.h · 1c08364c

由 Avi Kivity 提交于 1月 04, 2009

This is an x86 specific stucture and has no business living in common code.
Signed-off-by: NAvi Kivity <avi@redhat.com>

1c08364c

KVM: PIT: provide an option to disable interrupt reinjection · 52d939a0

由 Marcelo Tosatti 提交于 12月 30, 2008

Certain clocks (such as TSC) in older 2.6 guests overaccount for lost
ticks, causing severe time drift. Interrupt reinjection magnifies the
problem.

Provide an option to disable it.

[avi: allow room for expansion in case we want to disable reinjection
      of other timers]
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

52d939a0

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功