提交 · 2eec73437487aa690882cafddca6e4d93df46f26 · openeuler / raspberrypi-kernel

12 1月, 2011 21 次提交

KVM: x86: Avoid issuing wbinvd twice · 2eec7343

由 Jan Kiszka 提交于 11月 01, 2010

Micro optimization to avoid calling wbinvd twice on the CPU that has to
emulate it. As we might be preempted between smp_call_function_many and
the local wbinvd, the cache might be filled again so that real work
could be done uselessly.
Signed-off-by: NJan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

2eec7343

KVM: pre-allocate one more dirty bitmap to avoid vmalloc() · 515a0127

由 Takuya Yoshikawa 提交于 10月 27, 2010

Currently x86's kvm_vm_ioctl_get_dirty_log() needs to allocate a bitmap by
vmalloc() which will be used in the next logging and this has been causing
bad effect to VGA and live-migration: vmalloc() consumes extra systime,
triggers tlb flush, etc.

This patch resolves this issue by pre-allocating one more bitmap and switching
between two bitmaps during dirty logging.

Performance improvement:
  I measured performance for the case of VGA update by trace-cmd.
  The result was 1.5 times faster than the original one.

  In the case of live migration, the improvement ratio depends on the workload
  and the guest memory size. In general, the larger the memory size is the more
  benefits we get.

Note:
  This does not change other architectures's logic but the allocation size
  becomes twice. This will increase the actual memory consumption only when
  the new size changes the number of pages allocated by vmalloc().
Signed-off-by: NTakuya Yoshikawa <yoshikawa.takuya@oss.ntt.co.jp>
Signed-off-by: NFernando Luis Vazquez Cao <fernando@oss.ntt.co.jp>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

515a0127

KVM: propagate fault r/w information to gup(), allow read-only memory · 612819c3

由 Marcelo Tosatti 提交于 10月 22, 2010

As suggested by Andrea, pass r/w error code to gup(), upgrading read fault
to writable if host pte allows it.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

612819c3

KVM: MMU: flush TLBs on writable -> read-only spte overwrite · 7905d9a5

由 Marcelo Tosatti 提交于 10月 22, 2010

This can happen in the following scenario:

vcpu0			vcpu1
read fault
gup(.write=0)
			gup(.write=1)
			reuse swap cache, no COW
			set writable spte
			use writable spte
set read-only spte
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

7905d9a5

KVM: MMU: remove kvm_mmu_set_base_ptes · 982c2565

由 Marcelo Tosatti 提交于 10月 22, 2010

Unused.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

982c2565

KVM: VMX: remove setting of shadow_base_ptes for EPT · ff1fcb9e

由 Marcelo Tosatti 提交于 10月 22, 2010

The EPT present/writable bits use the same position as normal
pagetable bits.

Since direct_map passes ACC_ALL to mmu_set_spte, thus always setting
the writable bit on sptes, use the generic PT_PRESENT shadow_base_pte.

Also pass present/writable error code information from EPT violation
to generic pagefault handler.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

ff1fcb9e

KVM: Avoid double interrupt injection with vapic · 83bcacb1

由 Avi Kivity 提交于 10月 25, 2010

After an interrupt injection, the PPR changes, and we have to reflect that
into the vapic. This causes a KVM_REQ_EVENT to be set, which causes the
whole interrupt injection routine to be run again (harmlessly).

Optimize by only setting KVM_REQ_EVENT if the ppr was lowered; otherwise
there is no chance that a new injection is needed.
Signed-off-by: NAvi Kivity <avi@redhat.com>

83bcacb1

KVM: SVM: Fold save_host_msrs() and load_host_msrs() into their callers · 82ca2d10

由 Avi Kivity 提交于 10月 21, 2010

This abstraction only serves to obfuscate.  Remove.
Signed-off-by: NAvi Kivity <avi@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

82ca2d10

KVM: SVM: Move fs/gs/ldt save/restore to heavyweight exit path · dacccfdd

由 Avi Kivity 提交于 10月 21, 2010

ldt is never used in the kernel context; same goes for fs (x86_64) and gs
(i386).  So save/restore them in the heavyweight exit path instead
of the lightweight path.

By itself, this doesn't buy us much, but it paves the way for moving vmload
and vmsave to the heavyweight exit path, since they modify the same registers.

[jan: fix copy/pase mistake on i386]
Signed-off-by: NAvi Kivity <avi@redhat.com>
Signed-off-by: NJan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

dacccfdd

KVM: SVM: Move svm->host_gs_base into a separate structure · afe9e66f

由 Avi Kivity 提交于 10月 21, 2010

More members will join it soon.
Signed-off-by: NAvi Kivity <avi@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

afe9e66f

KVM: SVM: Move guest register save out of interrupts disabled section · 13c34e07

由 Avi Kivity 提交于 10月 21, 2010

Saving guest registers is just a memory copy, and does not need to be in the
critical section.  Move outside the critical section to improve latency a
bit.
Signed-off-by: NAvi Kivity <avi@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

13c34e07

KVM: Move KVM context switch into own function · f56f5369

由 Andi Kleen 提交于 10月 20, 2010

gcc 4.5 with some special options is able to duplicate the VMX
context switch asm in vmx_vcpu_run(). This results in a compile error
because the inline asm sequence uses an on local label. The non local
label is needed because other code wants to set up the return address.

This patch moves the asm code into an own function and marks
that explicitely noinline to avoid this problem.

Better would be probably to just move it into an .S file.

The diff looks worse than the change really is, it's all just
code movement and no logic change.
Signed-off-by: NAndi Kleen <ak@linux.intel.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

f56f5369

KVM: x86: Mark kvm_arch_setup_async_pf static · 7e1fbeac

由 Jan Kiszka 提交于 10月 20, 2010

It has no user outside mmu.c and also no prototype.
Signed-off-by: NJan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

7e1fbeac

KVM: Send async PF when guest is not in userspace too. · fc5f06fa

由 Gleb Natapov 提交于 10月 14, 2010

If guest indicates that it can handle async pf in kernel mode too send
it, but only if interrupts are enabled.
Acked-by: NRik van Riel <riel@redhat.com>
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

fc5f06fa

KVM: Let host know whether the guest can handle async PF in non-userspace context. · 6adba527

由 Gleb Natapov 提交于 10月 14, 2010

If guest can detect that it runs in non-preemptable context it can
handle async PFs at any time, so let host know that it can send async
PF even if guest cpu is not in userspace.
Acked-by: NRik van Riel <riel@redhat.com>
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

6adba527

KVM: Inject asynchronous page fault into a PV guest if page is swapped out. · 7c90705b

由 Gleb Natapov 提交于 10月 14, 2010

Send async page fault to a PV guest if it accesses swapped out memory.
Guest will choose another task to run upon receiving the fault.

Allow async page fault injection only when guest is in user mode since
otherwise guest may be in non-sleepable context and will not be able
to reschedule.

Vcpu will be halted if guest will fault on the same page again or if
vcpu executes kernel code.
Acked-by: NRik van Riel <riel@redhat.com>
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

7c90705b

KVM: Handle async PF in a guest. · 631bc487

由 Gleb Natapov 提交于 10月 14, 2010

When async PF capability is detected hook up special page fault handler
that will handle async page fault events and bypass other page faults to
regular page fault handler. Also add async PF handling to nested SVM
emulation. Async PF always generates exit to L1 where vcpu thread will
be scheduled out until page is available.
Acked-by: NRik van Riel <riel@redhat.com>
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

631bc487

KVM: Add PV MSR to enable asynchronous page faults delivery. · 344d9588

由 Gleb Natapov 提交于 10月 14, 2010

Guest enables async PF vcpu functionality using this MSR.
Reviewed-by: NRik van Riel <riel@redhat.com>
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

344d9588

KVM: Add memory slot versioning and use it to provide fast guest write interface · 49c7754c

由 Gleb Natapov 提交于 10月 18, 2010

Keep track of memslots changes by keeping generation number in memslots
structure. Provide kvm_write_guest_cached() function that skips
gfn_to_hva() translation if memslots was not changed since previous
invocation.
Acked-by: NRik van Riel <riel@redhat.com>
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

49c7754c

KVM: Retry fault before vmentry · 56028d08

由 Gleb Natapov 提交于 10月 17, 2010

When page is swapped in it is mapped into guest memory only after guest
tries to access it again and generate another fault. To save this fault
we can map it immediately since we know that guest is going to access
the page. Do it only when tdp is enabled for now. Shadow paging case is
more complicated. CR[034] and EFER registers should be switched before
doing mapping and then switched back.
Acked-by: NRik van Riel <riel@redhat.com>
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

56028d08

KVM: Halt vcpu if page it tries to access is swapped out · af585b92

由 Gleb Natapov 提交于 10月 14, 2010

If a guest accesses swapped out memory do not swap it in from vcpu thread
context. Schedule work to do swapping and put vcpu into halted state
instead.

Interrupts will still be delivered to the guest and if interrupt will
cause reschedule guest will continue to run another task.

[avi: remove call to get_user_pages_noio(), nacked by Linus; this
      makes everything synchrnous again]
Acked-by: NRik van Riel <riel@redhat.com>
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

af585b92

02 1月, 2011 2 次提交

KVM: Don't reset mmu context unnecessarily when updating EFER · 010c520e

由 Avi Kivity 提交于 10月 11, 2010

The only bit of EFER that affects the mmu is NX, and this is already
accounted for (LME only takes effect when changing cr0).

Based on a patch by Hillf Danton.
Signed-off-by: NAvi Kivity <avi@redhat.com>

010c520e

KVM: i8259: initialize isr_ack · d0dfc6b7

由 Avi Kivity 提交于 12月 31, 2010

isr_ack is never initialized.  So, until the first PIC reset, interrupts
may fail to be injected.  This can cause Windows XP to fail to boot, as
reported in the fallout from the fix to
https://bugzilla.kernel.org/show_bug.cgi?id=21962.
Reported-and-tested-by: NNicolas Prochazka <prochazka.nicolas@gmail.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

d0dfc6b7

29 12月, 2010 1 次提交

KVM: MMU: Fix incorrect direct gfn for unpaged mode shadow · 649497d1

由 Avi Kivity 提交于 12月 28, 2010

We use the physical address instead of the base gfn for the four
PAE page directories we use in unpaged mode.  When the guest accesses
an address above 1GB that is backed by a large host page, a BUG_ON()
in kvm_mmu_set_gfn() triggers.

Resolves: https://bugzilla.kernel.org/show_bug.cgi?id=21962Reported-and-tested-by: NNicolas Prochazka <prochazka.nicolas@gmail.com>
KVM-Stable-Tag.
Signed-off-by: NAvi Kivity <avi@redhat.com>

649497d1

16 12月, 2010 1 次提交
- A
  KVM: Fix preemption counter leak in kvm_timer_init() · 3e26f230
  由 Avi Kivity 提交于 12月 16, 2010
```
Based on a patch from Thomas Meyer.
Signed-off-by: NAvi Kivity <avi@redhat.com>
```
  3e26f230
08 12月, 2010 2 次提交

KVM: SVM: Do not report xsave in supported cpuid · 24d1b15f

由 Joerg Roedel 提交于 12月 07, 2010

To support xsave properly for the guest the SVM module need
software support for it. As long as this is not present do
not report the xsave as supported feature in cpuid.
As a side-effect this patch moves the bit() helper function
into the x86.h file so that it can be used in svm.c too.

KVM-Stable-Tag.
Signed-off-by: NJoerg Roedel <joerg.roedel@amd.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

24d1b15f

KVM: Fix OSXSAVE after migration · 3ea3aa8c

由 Sheng Yang 提交于 12月 08, 2010

CPUID's OSXSAVE is a mirror of CR4.OSXSAVE bit. We need to update the CPUID
after migration.

KVM-Stable-Tag.
Signed-off-by: NSheng Yang <sheng@linux.intel.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

3ea3aa8c

18 11月, 2010 2 次提交

KVM: VMX: Fix host userspace gsbase corruption · c8770e7b

由 Avi Kivity 提交于 11月 11, 2010

We now use load_gs_index() to load gs safely; unfortunately this also
changes MSR_KERNEL_GS_BASE, which we managed separately.  This resulted
in confusion and breakage running 32-bit host userspace on a 64-bit kernel.

Fix by
- saving guest MSR_KERNEL_GS_BASE before we we reload the host's gs
- doing the host save/load unconditionally, instead of only when in guest
  long mode

Things can be cleaned up further, but this is the minmal fix for now.
Signed-off-by: NAvi Kivity <avi@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

c8770e7b

KVM: Correct ordering of ldt reload wrt fs/gs reload · 0a77fe4c

由 Avi Kivity 提交于 10月 19, 2010

If fs or gs refer to the ldt, they must be reloaded after the ldt.  Reorder
the code to that effect.

Userspace code that uses the ldt with kvm is nonexistent, so this doesn't fix
a user-visible bug.
Signed-off-by: NAvi Kivity <avi@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

0a77fe4c

06 11月, 2010 4 次提交

KVM: x86: Issue smp_call_function_many with preemption disabled · 453d9c57

由 Jan Kiszka 提交于 11月 01, 2010

smp_call_function_many is specified to be called only with preemption
disabled. Fulfill this requirement.
Signed-off-by: NJan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

453d9c57

KVM: x86: fix information leak to userland · 97e69aa6

由 Vasiliy Kulikov 提交于 10月 30, 2010

Structures kvm_vcpu_events, kvm_debugregs, kvm_pit_state2 and
kvm_clock_data are copied to userland with some padding and reserved
fields unitialized.  It leads to leaking of contents of kernel stack
memory.  We have to initialize them to zero.

In patch v1 Jan Kiszka suggested to fill reserved fields with zeros
instead of memset'ting the whole struct.  It makes sense as these
fields are explicitly marked as padding.  No more fields need zeroing.

KVM-Stable-Tag.
Signed-off-by: NVasiliy Kulikov <segooon@gmail.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

97e69aa6

KVM: MMU: fix rmap_remove on non present sptes · eb45fda4

由 Marcelo Tosatti 提交于 10月 25, 2010

drop_spte should not attempt to rmap_remove a non present shadow pte.

This fixes a BUG_ON seen on kvm-autotest.
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>
Reported-by: NLucas Meneghel Rodrigues <lmr@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

eb45fda4

KVM: Write protect memory after slot swap · edde99ce

由 Michael S. Tsirkin 提交于 10月 25, 2010

I have observed the following bug trigger:

1. userspace calls GET_DIRTY_LOG
2. kvm_mmu_slot_remove_write_access is called and makes a page ro
3. page fault happens and makes the page writeable
   fault is logged in the bitmap appropriately
4. kvm_vm_ioctl_get_dirty_log swaps slot pointers

a lot of time passes

5. guest writes into the page
6. userspace calls GET_DIRTY_LOG

At point (5), bitmap is clean and page is writeable,
thus, guest modification of memory is not logged
and GET_DIRTY_LOG returns an empty bitmap.

The rule is that all pages are either dirty in the current bitmap,
or write-protected, which is violated here.

It seems that just moving kvm_mmu_slot_remove_write_access down
to after the slot pointer swap should fix this bug.

KVM-Stable-Tag.
Signed-off-by: NMichael S. Tsirkin <mst@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

edde99ce

24 10月, 2010 7 次提交

KVM: MCE: Send SRAR SIGBUS directly · 77db5cbd

由 Huang Ying 提交于 10月 08, 2010

Originally, SRAR SIGBUS is sent to QEMU-KVM via touching the poisoned
page. But commit 96054569 prevents the
signal from being sent. So now the signal is sent via
force_sig_info_fault directly.

[marcelo: use send_sig_info instead]
Reported-by: NDean Nelson <dnelson@redhat.com>
Signed-off-by: NHuang Ying <ying.huang@intel.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

77db5cbd

KVM: MCE: Add MCG_SER_P into KVM_MCE_CAP_SUPPORTED · 5854dbca

由 Huang Ying 提交于 10月 08, 2010

Now we have MCG_SER_P (and corresponding SRAO/SRAR MCE) support in
kernel and QEMU-KVM, the MCG_SER_P should be added into
KVM_MCE_CAP_SUPPORTED to make all these code really works.
Reported-by: NDean Nelson <dnelson@redhat.com>
Signed-off-by: NHuang Ying <ying.huang@intel.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

5854dbca

KVM: fix typo in copyright notice · 9611c187

由 Nicolas Kaiser 提交于 10月 06, 2010

Fix typo in copyright notice.
Signed-off-by: NNicolas Kaiser <nikai@nikai.net>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

9611c187

KVM: Disable interrupts around get_kernel_ns() · 395c6b0a

由 Avi Kivity 提交于 10月 04, 2010

get_kernel_ns() wants preemption disabled.  It doesn't make a lot of sense
during the get/set ioctls (no way to make them non-racy) but the callee wants
it.
Signed-off-by: NAvi Kivity <avi@redhat.com>

395c6b0a

A
KVM: MMU: Avoid sign extension in mmu_alloc_direct_roots() pae root address · 7ebaf15e
由 Avi Kivity 提交于 10月 03, 2010
```
Signed-off-by: NAvi Kivity <avi@redhat.com>
```
7ebaf15e

KVM: MMU: move access code parsing to FNAME(walk_addr) function · 33770780

由 Xiao Guangrong 提交于 9月 28, 2010

Move access code parsing from caller site to FNAME(walk_addr) function
Signed-off-by: NXiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

33770780

KVM: MMU: audit: check whether have unsync sps after root sync · 6903074c

由 Xiao Guangrong 提交于 9月 27, 2010

After root synced, all unsync sps are synced, this patch add a check to make
sure it's no unsync sps in VCPU's page table
Signed-off-by: NXiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

6903074c