提交 · fb3ff69d1397ce4bd2441c87b1daea67cb945ac6 · openanolis / cloud-kernel

18 11月, 2010 2 次提交

KVM: VMX: Fix host userspace gsbase corruption · c8770e7b

由 Avi Kivity 提交于 11月 11, 2010

We now use load_gs_index() to load gs safely; unfortunately this also
changes MSR_KERNEL_GS_BASE, which we managed separately.  This resulted
in confusion and breakage running 32-bit host userspace on a 64-bit kernel.

Fix by
- saving guest MSR_KERNEL_GS_BASE before we we reload the host's gs
- doing the host save/load unconditionally, instead of only when in guest
  long mode

Things can be cleaned up further, but this is the minmal fix for now.
Signed-off-by: NAvi Kivity <avi@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

c8770e7b

KVM: Correct ordering of ldt reload wrt fs/gs reload · 0a77fe4c

由 Avi Kivity 提交于 10月 19, 2010

If fs or gs refer to the ldt, they must be reloaded after the ldt.  Reorder
the code to that effect.

Userspace code that uses the ldt with kvm is nonexistent, so this doesn't fix
a user-visible bug.
Signed-off-by: NAvi Kivity <avi@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

0a77fe4c

06 11月, 2010 4 次提交

KVM: x86: Issue smp_call_function_many with preemption disabled · 453d9c57

由 Jan Kiszka 提交于 11月 01, 2010

smp_call_function_many is specified to be called only with preemption
disabled. Fulfill this requirement.
Signed-off-by: NJan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

453d9c57

KVM: x86: fix information leak to userland · 97e69aa6

由 Vasiliy Kulikov 提交于 10月 30, 2010

Structures kvm_vcpu_events, kvm_debugregs, kvm_pit_state2 and
kvm_clock_data are copied to userland with some padding and reserved
fields unitialized.  It leads to leaking of contents of kernel stack
memory.  We have to initialize them to zero.

In patch v1 Jan Kiszka suggested to fill reserved fields with zeros
instead of memset'ting the whole struct.  It makes sense as these
fields are explicitly marked as padding.  No more fields need zeroing.

KVM-Stable-Tag.
Signed-off-by: NVasiliy Kulikov <segooon@gmail.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

97e69aa6

KVM: MMU: fix rmap_remove on non present sptes · eb45fda4

由 Marcelo Tosatti 提交于 10月 25, 2010

drop_spte should not attempt to rmap_remove a non present shadow pte.

This fixes a BUG_ON seen on kvm-autotest.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Reported-by: NLucas Meneghel Rodrigues <lmr@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

eb45fda4

KVM: Write protect memory after slot swap · edde99ce

由 Michael S. Tsirkin 提交于 10月 25, 2010

I have observed the following bug trigger:

1. userspace calls GET_DIRTY_LOG
2. kvm_mmu_slot_remove_write_access is called and makes a page ro
3. page fault happens and makes the page writeable
   fault is logged in the bitmap appropriately
4. kvm_vm_ioctl_get_dirty_log swaps slot pointers

a lot of time passes

5. guest writes into the page
6. userspace calls GET_DIRTY_LOG

At point (5), bitmap is clean and page is writeable,
thus, guest modification of memory is not logged
and GET_DIRTY_LOG returns an empty bitmap.

The rule is that all pages are either dirty in the current bitmap,
or write-protected, which is violated here.

It seems that just moving kvm_mmu_slot_remove_write_access down
to after the slot pointer swap should fix this bug.

KVM-Stable-Tag.
Signed-off-by: NMichael S. Tsirkin <mst@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

edde99ce

24 10月, 2010 34 次提交

KVM: MCE: Send SRAR SIGBUS directly · 77db5cbd

由 Huang Ying 提交于 10月 08, 2010

Originally, SRAR SIGBUS is sent to QEMU-KVM via touching the poisoned
page. But commit 96054569 prevents the
signal from being sent. So now the signal is sent via
force_sig_info_fault directly.

[marcelo: use send_sig_info instead]
Reported-by: NDean Nelson <dnelson@redhat.com>
Signed-off-by: NHuang Ying <ying.huang@intel.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

77db5cbd

KVM: MCE: Add MCG_SER_P into KVM_MCE_CAP_SUPPORTED · 5854dbca

由 Huang Ying 提交于 10月 08, 2010

Now we have MCG_SER_P (and corresponding SRAO/SRAR MCE) support in
kernel and QEMU-KVM, the MCG_SER_P should be added into
KVM_MCE_CAP_SUPPORTED to make all these code really works.
Reported-by: NDean Nelson <dnelson@redhat.com>
Signed-off-by: NHuang Ying <ying.huang@intel.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

5854dbca

KVM: fix typo in copyright notice · 9611c187

由 Nicolas Kaiser 提交于 10月 06, 2010

Fix typo in copyright notice.
Signed-off-by: NNicolas Kaiser <nikai@nikai.net>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

9611c187

KVM: Disable interrupts around get_kernel_ns() · 395c6b0a

由 Avi Kivity 提交于 10月 04, 2010

get_kernel_ns() wants preemption disabled.  It doesn't make a lot of sense
during the get/set ioctls (no way to make them non-racy) but the callee wants
it.
Signed-off-by: NAvi Kivity <avi@redhat.com>

395c6b0a

A
KVM: MMU: Avoid sign extension in mmu_alloc_direct_roots() pae root address · 7ebaf15e
由 Avi Kivity 提交于 10月 03, 2010
```
Signed-off-by: NAvi Kivity <avi@redhat.com>
```
7ebaf15e

KVM: MMU: move access code parsing to FNAME(walk_addr) function · 33770780

由 Xiao Guangrong 提交于 9月 28, 2010

Move access code parsing from caller site to FNAME(walk_addr) function
Signed-off-by: NXiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

33770780

KVM: MMU: audit: check whether have unsync sps after root sync · 6903074c

由 Xiao Guangrong 提交于 9月 27, 2010

After root synced, all unsync sps are synced, this patch add a check to make
sure it's no unsync sps in VCPU's page table
Signed-off-by: NXiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

6903074c

KVM: MMU: audit: introduce audit_printk to cleanup audit code · 38904e12

由 Xiao Guangrong 提交于 9月 27, 2010

Introduce audit_printk, and record audit point instead audit name
Signed-off-by: NXiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

38904e12

KVM: MMU: audit: unregister audit tracepoints before module unloaded · c42fffe3

由 Xiao Guangrong 提交于 9月 27, 2010

fix:

Call Trace:
 [<ffffffffa01e46ba>] ? kvm_mmu_pte_write+0x229/0x911 [kvm]
 [<ffffffffa01c6ba9>] ? gfn_to_memslot+0x39/0xa0 [kvm]
 [<ffffffffa01c6c26>] ? mark_page_dirty+0x16/0x2e [kvm]
 [<ffffffffa01c6d6f>] ? kvm_write_guest_page+0x67/0x7f [kvm]
 [<ffffffff81066fbd>] ? local_clock+0x2a/0x3b
 [<ffffffffa01d52ce>] emulator_write_phys+0x46/0x54 [kvm]
 ......
Code:  Bad RIP value.
RIP  [<ffffffffa0172056>] 0xffffffffa0172056
 RSP <ffff880134f69a70>
CR2: ffffffffa0172056
Signed-off-by: NXiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

c42fffe3

KVM: MMU: audit: fix vcpu's spte walking · 98224bf1

由 Xiao Guangrong 提交于 9月 27, 2010

After nested nested paging, it may using long mode to shadow 32/PAE paging
guest, so this patch fix it
Signed-off-by: NXiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

98224bf1

KVM: MMU: set access bit for direct mapping · 33f91edb

由 Xiao Guangrong 提交于 9月 27, 2010

Set access bit while setup up direct page table if it's nonpaing or npt enabled,
it's good for CPU's speculate access
Signed-off-by: NXiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

33f91edb

KVM: MMU: cleanup for error mask set while walk guest page table · 20bd40dc

由 Xiao Guangrong 提交于 9月 27, 2010

Small cleanup for set page fault error code
Signed-off-by: NXiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

20bd40dc

KVM: MMU: update 'root_hpa' out of loop in PAE shadow path · 6292757f

由 Xiao Guangrong 提交于 9月 27, 2010

The value of 'vcpu->arch.mmu.pae_root' is not modified, so we can update
'root_hpa' out of the loop.
Signed-off-by: NXiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

6292757f

KVM: x86 emulator: Eliminate compilation warning in x86_decode_insn() · 7129eeca

由 Sheng Yang 提交于 9月 28, 2010

Eliminate:
arch/x86/kvm/emulate.c:801: warning: ‘sv’ may be used uninitialized in this
function

on gcc 4.1.2
Signed-off-by: NSheng Yang <sheng@linux.intel.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

7129eeca

KVM: x86: Fix constant type in kvm_get_time_scale · 50933623

由 Jan Kiszka 提交于 9月 26, 2010

Older gcc versions complain about the improper type (for x86-32), 4.5
seems to fix this silently. However, we should better use the right type
initially.
Signed-off-by: NJan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

50933623

KVM: VMX: Add AX to list of registers clobbered by guest switch · 07d6f555

由 Jan Kiszka 提交于 9月 28, 2010

By chance this caused no harm so far. We overwrite AX during switch
to/from guest context, so we must declare this.
Signed-off-by: NJan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

07d6f555

KVM: x86: TSC catchup mode · c285545f

由 Zachary Amsden 提交于 9月 18, 2010

Negate the effects of AN TYM spell while kvm thread is preempted by tracking
conversion factor to the highest TSC rate and catching the TSC up when it has
fallen behind the kernel view of time.  Note that once triggered, we don't
turn off catchup mode.

A slightly more clever version of this is possible, which only does catchup
when TSC rate drops, and which specifically targets only CPUs with broken
TSC, but since these all are considered unstable_tsc(), this patch covers
all necessary cases.
Signed-off-by: NZachary Amsden <zamsden@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

c285545f

KVM: x86: Rename timer function · 34c238a1

由 Zachary Amsden 提交于 9月 18, 2010

This just changes some names to better reflect the usage they
will be given.  Separated out to keep confusion to a minimum.
Signed-off-by: NZachary Amsden <zamsden@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

34c238a1

KVM: x86: Make math work for other scales · 5f4e3f88

由 Zachary Amsden 提交于 9月 18, 2010

The math in kvm_get_time_scale relies on the fact that
NSEC_PER_SEC < 2^32.  To use the same function to compute
arbitrary time scales, we must extend the first reduction
step to shrink the base rate to a 32-bit value, and
possibly reduce the scaled rate into a 32-bit as well.

Note we must take care to avoid an arithmetic overflow
when scaling up the tps32 value (this could not happen
with the fixed scaled value of NSEC_PER_SEC, but can
happen with scaled rates above 2^31.
Signed-off-by: NZachary Amsden <zamsden@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

5f4e3f88

KVM: VMX: Respect interrupt window in big real mode · 49e9d557

由 Avi Kivity 提交于 9月 19, 2010

If an interrupt is pending, we need to stop emulation so we
can inject it.
Signed-off-by: NAvi Kivity <avi@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

49e9d557

KVM: VMX: Emulated real mode interrupt injection · a92601bb

由 Mohammed Gamal 提交于 9月 19, 2010

Replace the inject-as-software-interrupt hack we currently have with
emulated injection.
Signed-off-by: NMohammed Gamal <m.gamal005@gmail.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

a92601bb

KVM: Add kvm_inject_realmode_interrupt() wrapper · 63995653

由 Mohammed Gamal 提交于 9月 19, 2010

This adds a wrapper function kvm_inject_realmode_interrupt() around the
emulator function emulate_int_real() to allow real mode interrupt injection.

[avi: initialize operand and address sizes before emulating interrupts]
[avi: initialize rip for real mode interrupt injection]
[avi: clear interrupt pending flag after emulating interrupt injection]
Signed-off-by: NMohammed Gamal <m.gamal005@gmail.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

63995653

KVM: MMU: fix counting of rmap entries in rmap_add() · cb16a7b3

由 Hillf Danton 提交于 9月 18, 2010

It seems that rmap entries are under counted.
Signed-off-by: NHillf Danton <dhillf@gmail.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

cb16a7b3

KVM: SVM: do not generate "external interrupt exit" if other exit is pending · a0a07cd2

由 Gleb Natapov 提交于 9月 20, 2010

Nested SVM checks for external interrupt after injecting nested exception.
In case there is external interrupt pending the code generates "external
interrupt exit" and overwrites previous exit info. If previously injected
exception already generated exit it will be lost.
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Acked-by: NJoerg Roedel <joerg.roedel@amd.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

a0a07cd2

KVM: Convert PIC lock from raw spinlock to ordinary spinlock · f4f51050

由 Avi Kivity 提交于 9月 19, 2010

The PIC code used to be called from preempt_disable() context, which
wasn't very good for PREEMPT_RT.  That is no longer the case, so move
back from raw_spinlock_t to spinlock_t.
Signed-off-by: NAvi Kivity <avi@redhat.com>
Acked-by: NThomas Gleixner <tglx@linutronix.de>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

f4f51050

KVM: x86: Fix kvmclock bug · 28e4639a

由 Zachary Amsden 提交于 9月 18, 2010

If preempted after kvmclock values are updated, but before hardware
virtualization is entered, the last tsc time as read by the guest is
never set. It underflows the next time kvmclock is updated if there
has not yet been a successful entry / exit into hardware virt.

Fix this by simply setting last_tsc to the newly read tsc value so
that any computed nsec advance of kvmclock is nulled.
Signed-off-by: NZachary Amsden <zamsden@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

28e4639a

KVM: MMU: Don't track nested fault info in error-code · 0959ffac

由 Joerg Roedel 提交于 9月 14, 2010

This patch moves the detection whether a page-fault was
nested or not out of the error code and moves it into a
separate variable in the fault struct.
Signed-off-by: NJoerg Roedel <joerg.roedel@amd.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

0959ffac

A
KVM: VMX: Move fixup_rmode_irq() to avoid forward declaration · 625831a3
由 Avi Kivity 提交于 7月 22, 2010
```
No code changes.
Signed-off-by: NAvi Kivity <avi@redhat.com>
```
625831a3

KVM: Non-atomic interrupt injection · b463a6f7

由 Avi Kivity 提交于 7月 20, 2010

Change the interrupt injection code to work from preemptible, interrupts
enabled context.  This works by adding a ->cancel_injection() operation
that undoes an injection in case we were not able to actually enter the guest
(this condition could never happen with atomic injection).
Signed-off-by: NAvi Kivity <avi@redhat.com>

b463a6f7

KVM: VMX: Parameterize vmx_complete_interrupts() for both exit and entry · 83422e17

由 Avi Kivity 提交于 7月 20, 2010

Currently vmx_complete_interrupts() can decode event information from vmx
exit fields into the generic kvm event queues. Make it able to decode
the information from the entry fields as well by parametrizing it.
Signed-off-by: NAvi Kivity <avi@redhat.com>

83422e17

A
KVM: VMX: Move real-mode interrupt injection fixup to vmx_complete_interrupts() · 537b37e2
由 Avi Kivity 提交于 7月 22, 2010
```
This allows reuse of vmx_complete_interrupts() for cancelling injections.
Signed-off-by: NAvi Kivity <avi@redhat.com>
```
537b37e2

KVM: VMX: Split up vmx_complete_interrupts() · 51aa01d1

由 Avi Kivity 提交于 7月 20, 2010

vmx_complete_interrupts() does too much, split it up:
 - vmx_vcpu_run() gets the "cache important vmcs fields" part
 - a new vmx_complete_atomic_exit() gets the parts that must be done atomically
 - a new vmx_recover_nmi_blocking() does what its name says
 - vmx_complete_interrupts() retains the event injection recovery code

This helps in reducing the work done in atomic context.
Signed-off-by: NAvi Kivity <avi@redhat.com>

51aa01d1

KVM: Check for pending events before attempting injection · 3842d135

由 Avi Kivity 提交于 7月 27, 2010

Instead of blindly attempting to inject an event before each guest entry,
check for a possible event first in vcpu->requests.  Sites that can trigger
event injection are modified to set KVM_REQ_EVENT:

- interrupt, nmi window opening
- ppr updates
- i8259 output changes
- local apic irr changes
- rflags updates
- gif flag set
- event set on exit

This improves non-injecting entry performance, and sets the stage for
non-atomic injection.
Signed-off-by: NAvi Kivity <avi@redhat.com>

3842d135

KVM: MMU: Fix regression with ept memory types merged into non-ept page tables · b0bc3ee2

由 Avi Kivity 提交于 9月 13, 2010

Commit "KVM: MMU: Make tdp_enabled a mmu-context parameter" made real-mode
set ->direct_map, and changed the code that merges in the memory type depend
on direct_map instead of tdp_enabled. However, in this case what really
matters is tdp, not direct_map, since tdp changes the pte format regardless
of whether the mapping is direct or not.

As a result, real-mode shadow mappings got corrupted with ept memory types.
The result was a huge slowdown, likely due to the cache being disabled.

Change it back as the simplest fix for the regression (real fix is to move
all that to vmx code, and not use tdp_enabled as a synonym for ept).
Signed-off-by: NAvi Kivity <avi@redhat.com>

b0bc3ee2

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功