提交 · 862d31f788f9a249f7656d02d8d4006e306108ce · openeuler / Kernel

06 10月, 2012 17 次提交

KVM: PPC: E500: Implement MMU notifiers · 862d31f7

由 Alexander Graf 提交于 7月 31, 2012

The e500 target has lived without mmu notifiers ever since it got
introduced, but fails for the user space check on them with hugetlbfs.

So in order to get that one working, implement mmu notifiers in a
reasonably dumb fashion and be happy. On embedded hardware, we almost
never end up with mmu notifier calls, since most people don't overcommit.
Signed-off-by: NAlexander Graf <agraf@suse.de>

862d31f7

KVM: PPC: BookE: Add support for vcpu->mode · d69c6436

由 Alexander Graf 提交于 8月 08, 2012

Generic KVM code might want to know whether we are inside guest context
or outside. It also wants to be able to push us out of guest context.

Add support to the BookE code for the generic vcpu->mode field that describes
the above states.
Signed-off-by: NAlexander Graf <agraf@suse.de>

d69c6436

KVM: PPC: BookE: Add check_requests helper function · 4ffc6356

由 Alexander Graf 提交于 8月 08, 2012

We need a central place to check for pending requests in. Add one that
only does the timer check we already do in a different place.

Later, this central function can be extended by more checks.
Signed-off-by: NAlexander Graf <agraf@suse.de>

4ffc6356

powerpc/epapr: export epapr_hypercall_start · 8043e494

由 Scott Wood 提交于 8月 10, 2012

This fixes breakage introduced by the following commit:

  commit 6d2d82627f4f1e96a33664ace494fa363e0495cb
  Author: Liu Yu-B13201 <Yu.Liu@freescale.com>
  Date:   Tue Jul 3 05:48:56 2012 +0000

    PPC: Don't use hardcoded opcode for ePAPR hcall invocation

when a driver that uses ePAPR hypercalls is built as a module.
Reported-by: NGeert Uytterhoeven <geert@linux-m68k.org>
Signed-off-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NAlexander Graf <agraf@suse.de>

8043e494

KVM: PPC: Quieten message about allocating linear regions · 1340f3e8

由 Paul Mackerras 提交于 8月 06, 2012

This is printed once for every RMA or HPT region that get
preallocated.  If one preallocates hundreds of such regions
(in order to run hundreds of KVM guests), that gets rather
painful, so make it a bit quieter.
Signed-off-by: NPaul Mackerras <paulus@samba.org>
Signed-off-by: NAlexander Graf <agraf@suse.de>

1340f3e8

KVM: PPC: E500: Fix clear_tlb_refs · 2bb890f5

由 Alexander Graf 提交于 8月 02, 2012

Our mapping code assumes that TLB0 entries are always mapped. However, after
calling clear_tlb_refs() this is no longer the case.

Map them dynamically if we find an entry unmapped in TLB0.
Signed-off-by: NAlexander Graf <agraf@suse.de>

2bb890f5

KVM: PPC: BookE: Expose remote TLB flushes in debugfs · cf1c5ca4

由 Alexander Graf 提交于 8月 01, 2012

We're already counting remote TLB flushes in a variable, but don't export
it to user space yet. Do so, so we know what's going on.
Signed-off-by: NAlexander Graf <agraf@suse.de>

cf1c5ca4

KVM: PPC: Expose SYNC cap based on mmu notifiers · f4800b1f

由 Alexander Graf 提交于 8月 07, 2012

Semantically, the "SYNC" cap means that we have mmu notifiers available.
Express this in our #ifdef'ery around the feature, so that we can be sure
we don't miss out on ppc targets when they get their implementation.
Signed-off-by: NAlexander Graf <agraf@suse.de>

f4800b1f

KVM: PPC: PR: Use generic tracepoint for guest exit · 97c95059

由 Alexander Graf 提交于 8月 02, 2012

We want to have tracing information on guest exits for booke as well
as book3s. Since most information is identical, use a common trace point.
Signed-off-by: NAlexander Graf <agraf@suse.de>

97c95059

PPC: Don't use hardcoded opcode for ePAPR hcall invocation · 8e525d59

由 Liu Yu-B13201 提交于 7月 03, 2012

Signed-off-by: NLiu Yu <yu.liu@freescale.com>
Signed-off-by: NStuart Yoder <stuart.yoder@freescale.com>
Signed-off-by: NAlexander Graf <agraf@suse.de>

8e525d59

powerpc/fsl-soc: use CONFIG_EPAPR_PARAVIRT for hcalls · 305bcf26

由 Scott Wood 提交于 7月 03, 2012

Signed-off-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NStuart Yoder <stuart.yoder@freescale.com>
Signed-off-by: NAlexander Graf <agraf@suse.de>

305bcf26

S
PPC: select EPAPR_PARAVIRT for all users of epapr hcalls · 40656397
由 Stuart Yoder 提交于 7月 03, 2012
```
Signed-off-by: NStuart Yoder <stuart.yoder@freescale.com>
Signed-off-by: NAlexander Graf <agraf@suse.de>
```
40656397

KVM: PPC: ev_idle hcall support for e500 guests · 2f979de8

由 Liu Yu-B13201 提交于 7月 03, 2012

Signed-off-by: NLiu Yu <yu.liu@freescale.com>
[varun: 64-bit changes]
Signed-off-by: NVarun Sethi <Varun.Sethi@freescale.com>
Signed-off-by: NStuart Yoder <stuart.yoder@freescale.com>
Signed-off-by: NAlexander Graf <agraf@suse.de>

2f979de8

KVM: PPC: Add support for ePAPR idle hcall in host kernel · 9202e076

由 Liu Yu-B13201 提交于 7月 03, 2012

And add a new flag definition in kvm_ppc_pvinfo to indicate
whether the host supports the EV_IDLE hcall.
Signed-off-by: NLiu Yu <yu.liu@freescale.com>
[stuart.yoder@freescale.com: cleanup,fixes for conditions allowing idle]
Signed-off-by: NStuart Yoder <stuart.yoder@freescale.com>
[agraf: fix typo]
Signed-off-by: NAlexander Graf <agraf@suse.de>

9202e076

KVM: PPC: add pvinfo for hcall opcodes on e500mc/e5500 · 784bafac

由 Stuart Yoder 提交于 7月 03, 2012

Signed-off-by: NLiu Yu <yu.liu@freescale.com>
[stuart: factored this out from idle hcall support in host patch]
Signed-off-by: NStuart Yoder <stuart.yoder@freescale.com>
Signed-off-by: NAlexander Graf <agraf@suse.de>

784bafac

KVM: PPC: use definitions in epapr header for hcalls · fdcf8bd7

由 Stuart Yoder 提交于 7月 03, 2012

Signed-off-by: NStuart Yoder <stuart.yoder@freescale.com>
Signed-off-by: NAlexander Graf <agraf@suse.de>

fdcf8bd7

S
PPC: epapr: create define for return code value of success · e13dcc1a
由 Stuart Yoder 提交于 7月 03, 2012
```
Signed-off-by: NStuart Yoder <stuart.yoder@freescale.com>
Signed-off-by: NAlexander Graf <agraf@suse.de>
```
e13dcc1a

28 9月, 2012 1 次提交

KVM: s390: Fix vcpu_load handling in interrupt code · 3d11df7a

由 Christian Borntraeger 提交于 9月 27, 2012

Recent changes (KVM: make processes waiting on vcpu mutex killable)
now requires to check the return value of vcpu_load. This triggered
a warning in s390 specific kvm code. Turns out that we can actually
remove the put/load, since schedule will do the right thing via
the preempt notifiers.
Reported-by: NFengguang Wu <fengguang.wu@intel.com>
Signed-off-by: NChristian Borntraeger <borntraeger@de.ibm.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

3d11df7a

23 9月, 2012 2 次提交

KVM: x86: Fix guest debug across vcpu INIT reset · c8639010

由 Jan Kiszka 提交于 9月 21, 2012

If we reset a vcpu on INIT, we so far overwrote dr7 as provided by
KVM_SET_GUEST_DEBUG, and we also cleared switch_db_regs unconditionally.

Fix this by saving the dr7 used for guest debugging and calculating the
effective register value as well as switch_db_regs on any potential
change. This will change to focus of the set_guest_debug vendor op to
update_dp_bp_intercept.

Found while trying to stop on start_secondary.
Signed-off-by: NJan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

c8639010

KVM: Add resampling irqfds for level triggered interrupts · 7a84428a

由 Alex Williamson 提交于 9月 21, 2012

To emulate level triggered interrupts, add a resample option to
KVM_IRQFD.  When specified, a new resamplefd is provided that notifies
the user when the irqchip has been resampled by the VM.  This may, for
instance, indicate an EOI.  Also in this mode, posting of an interrupt
through an irqfd only asserts the interrupt.  On resampling, the
interrupt is automatically de-asserted prior to user notification.
This enables level triggered interrupts to be posted and re-enabled
from vfio with no userspace intervention.

All resampling irqfds can make use of a single irq source ID, so we
reserve a new one for this interface.
Signed-off-by: NAlex Williamson <alex.williamson@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

7a84428a

20 9月, 2012 12 次提交

KVM: optimize apic interrupt delivery · 1e08ec4a

由 Gleb Natapov 提交于 9月 13, 2012

Most interrupt are delivered to only one vcpu. Use pre-build tables to
find interrupt destination instead of looping through all vcpus. In case
of logical mode loop only through vcpus in a logical cluster irq is sent
to.
Signed-off-by: NGleb Natapov <gleb@redhat.com>
Acked-by: NMichael S. Tsirkin <mst@redhat.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

1e08ec4a

Merge branch 'queue' into next · 1d86b5cc

由 Avi Kivity 提交于 9月 20, 2012

* queue:
  KVM: MMU: Eliminate pointless temporary 'ac'
  KVM: MMU: Avoid access/dirty update loop if all is well
  KVM: MMU: Eliminate eperm temporary
  KVM: MMU: Optimize is_last_gpte()
  KVM: MMU: Simplify walk_addr_generic() loop
  KVM: MMU: Optimize pte permission checks
  KVM: MMU: Update accessed and dirty bits after guest pagetable walk
  KVM: MMU: Move gpte_access() out of paging_tmpl.h
  KVM: MMU: Optimize gpte_access() slightly
  KVM: MMU: Push clean gpte write protection out of gpte_access()
  KVM: clarify kvmclock documentation
  KVM: make processes waiting on vcpu mutex killable
  KVM: SVM: Make use of asm.h
  KVM: VMX: Make use of asm.h
  KVM: VMX: Make lto-friendly
Signed-off-by: NAvi Kivity <avi@redhat.com>

1d86b5cc

KVM: MMU: Eliminate pointless temporary 'ac' · c5421519

由 Avi Kivity 提交于 9月 19, 2012

'ac' essentially reconstructs the 'access' variable we already
have, except for the PFERR_PRESENT_MASK and PFERR_RSVD_MASK.  As
these are not used by callees, just use 'access' directly.
Reviewed-by: NXiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

c5421519

KVM: MMU: Avoid access/dirty update loop if all is well · b514c30f

由 Avi Kivity 提交于 9月 16, 2012

Keep track of accessed/dirty bits; if they are all set, do not
enter the accessed/dirty update loop.
Reviewed-by: NXiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

b514c30f

KVM: MMU: Eliminate eperm temporary · 71331a1d

由 Avi Kivity 提交于 9月 16, 2012

'eperm' is no longer used in the walker loop, so we can eliminate it.
Reviewed-by: NXiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

71331a1d

KVM: MMU: Optimize is_last_gpte() · 6fd01b71

由 Avi Kivity 提交于 9月 12, 2012

Instead of branchy code depending on level, gpte.ps, and mmu configuration,
prepare everything in a bitmap during mode changes and look it up during
runtime.
Reviewed-by: NXiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

6fd01b71

KVM: MMU: Simplify walk_addr_generic() loop · 13d22b6a

由 Avi Kivity 提交于 9月 12, 2012

The page table walk is coded as an infinite loop, with a special
case on the last pte.

Code it as an ordinary loop with a termination condition on the last
pte (large page or walk length exhausted), and put the last pte handling
code after the loop where it belongs.
Reviewed-by: NXiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

13d22b6a

KVM: MMU: Optimize pte permission checks · 97d64b78

由 Avi Kivity 提交于 9月 12, 2012

walk_addr_generic() permission checks are a maze of branchy code, which is
performed four times per lookup.  It depends on the type of access, efer.nxe,
cr0.wp, cr4.smep, and in the near future, cr4.smap.

Optimize this away by precalculating all variants and storing them in a
bitmap.  The bitmap is recalculated when rarely-changing variables change
(cr0, cr4) and is indexed by the often-changing variables (page fault error
code, pte access permissions).

The permission check is moved to the end of the loop, otherwise an SMEP
fault could be reported as a false positive, when PDE.U=1 but PTE.U=0.
Noted by Xiao Guangrong.

The result is short, branch-free code.
Reviewed-by: NXiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

97d64b78

KVM: MMU: Update accessed and dirty bits after guest pagetable walk · 8cbc7069

由 Avi Kivity 提交于 9月 16, 2012

While unspecified, the behaviour of Intel processors is to first
perform the page table walk, then, if the walk was successful, to
atomically update the accessed and dirty bits of walked paging elements.

While we are not required to follow this exactly, doing so will allow us
to perform the access permissions check after the walk is complete, rather
than after each walk step.

(the tricky case is SMEP: a zero in any pte's U bit makes the referenced
page a supervisor page, so we can't fault on a one bit during the walk
itself).
Reviewed-by: NXiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

8cbc7069

KVM: MMU: Move gpte_access() out of paging_tmpl.h · 3d34adec

由 Avi Kivity 提交于 9月 12, 2012

We no longer rely on paging_tmpl.h defines; so we can move the function
to mmu.c.

Rely on zero extension to 64 bits to get the correct nx behaviour.
Reviewed-by: NXiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

3d34adec

KVM: MMU: Optimize gpte_access() slightly · edc2ae84

由 Avi Kivity 提交于 9月 12, 2012

If nx is disabled, then is gpte[63] is set we will hit a reserved
bit set fault before checking permissions; so we can ignore the
setting of efer.nxe.
Reviewed-by: NXiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

edc2ae84

KVM: MMU: Push clean gpte write protection out of gpte_access() · 8ea667f2

由 Avi Kivity 提交于 9月 12, 2012

gpte_access() computes the access permissions of a guest pte and also
write-protects clean gptes.  This is wrong when we are servicing a
write fault (since we'll be setting the dirty bit momentarily) but
correct when instantiating a speculative spte, or when servicing a
read fault (since we'll want to trap a following write in order to
set the dirty bit).

It doesn't seem to hurt in practice, but in order to make the code
readable, push the write protection out of gpte_access() and into
a new protect_clean_gpte() which is called explicitly when needed.
Reviewed-by: NXiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

8ea667f2

18 9月, 2012 2 次提交

KVM: clarify kvmclock documentation · 879238fe

由 Stefan Fritsch 提交于 9月 16, 2012

- mention that system time needs to be added to wallclock time
- positive tsc_shift means left shift, not right
- mention additional 32bit right shift
Signed-off-by: NStefan Fritsch <sf@sfritsch.de>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

879238fe

KVM: make processes waiting on vcpu mutex killable · 9fc77441

由 Michael S. Tsirkin 提交于 9月 16, 2012

vcpu mutex can be held for unlimited time so
taking it with mutex_lock on an ioctl is wrong:
one process could be passed a vcpu fd and
call this ioctl on the vcpu used by another process,
it will then be unkillable until the owner exits.

Call mutex_lock_killable instead and return status.
Note: mutex_lock_interruptible would be even nicer,
but I am not sure all users are prepared to handle EINTR
from these ioctls. They might misinterpret it as an error.

Cleanup paths expect a vcpu that can't be used by
any userspace so this will always succeed - catch bugs
by calling BUG_ON.

Catch callers that don't check return state by adding
__must_check.
Signed-off-by: NMichael S. Tsirkin <mst@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

9fc77441

17 9月, 2012 3 次提交

KVM: SVM: Make use of asm.h · 7454766f

由 Avi Kivity 提交于 9月 16, 2012

Use macros for bitness-insensitive register names, instead of
rolling our own.
Signed-off-by: NAvi Kivity <avi@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

7454766f

KVM: VMX: Make use of asm.h · b188c81f

由 Avi Kivity 提交于 9月 16, 2012

Use macros for bitness-insensitive register names, instead of
rolling our own.
Signed-off-by: NAvi Kivity <avi@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

b188c81f

KVM: VMX: Make lto-friendly · 83287ea4

由 Avi Kivity 提交于 9月 16, 2012

LTO (link-time optimization) doesn't like local labels to be referred to
from a different function, since the two functions may be built in separate
compilation units.  Use an external variable instead.
Signed-off-by: NAvi Kivity <avi@redhat.com>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

83287ea4

13 9月, 2012 1 次提交

KVM: x86: lapic: Clean up find_highest_vector() and count_vectors() · ecba9a52

由 Takuya Yoshikawa 提交于 9月 05, 2012

find_highest_vector() and count_vectors():
 - Instead of using magic values, define and use proper macros.

find_highest_vector():
 - Remove likely() which is there only for historical reasons and not
   doing correct branch predictions anymore.  Using such heuristics
   to optimize this function is not worth it now.  Let CPUs predict
   things instead.

 - Stop checking word[0] separately.  This was only needed for doing
   likely() optimization.

 - Use for loop, not while, to iterate over the register array to make
   the code clearer.

Note that we actually confirmed that the likely() did wrong predictions
by inserting debug code.
Acked-by: NMichael S. Tsirkin <mst@redhat.com>
Signed-off-by: NTakuya Yoshikawa <yoshikawa.takuya@oss.ntt.co.jp>
Signed-off-by: NMarcelo Tosatti <mtosatti@redhat.com>

ecba9a52

10 9月, 2012 2 次提交

KVM: MMU: remove unnecessary check · 7de5bdc9

由 Xiao Guangrong 提交于 9月 07, 2012

Checking the return of kvm_mmu_get_page is unnecessary since it is
guaranteed by memory cache
Signed-off-by: NXiao Guangrong <xiaoguangrong@linux.vnet.ibm.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

7de5bdc9

KVM: Depend on HIGH_RES_TIMERS · 92b5265d

由 Liu, Jinsong 提交于 9月 10, 2012

KVM lapic timer and tsc deadline timer based on hrtimer,
setting a leftmost node to rb tree and then do hrtimer reprogram.
If hrtimer not configured as high resolution, hrtimer_enqueue_reprogram
do nothing and then make kvm lapic timer and tsc deadline timer fail.
Signed-off-by: NLiu, Jinsong <jinsong.liu@intel.com>
Signed-off-by: NAvi Kivity <avi@redhat.com>

92b5265d

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功