Commits · 5bd1cf118533aba41b3fbd4834e6362a9237db71 · openeuler / Kernel

06 Oct, 2012 40 commits

KVM: PPC: set IN_GUEST_MODE before checking requests · 5bd1cf11

Authored by Scott Wood on Aug 22, 2012

Avoid a race as described in the code comment.

Also remove a related smp_wmb() from booke's kvmppc_prepare_to_enter().
I can't see any reason for it, and the book3s_pr version doesn't have it.
Signed-off-by: Scott Wood <scottwood@freescale.com>
Signed-off-by: Alexander Graf <agraf@suse.de>

5bd1cf11
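
The race is easier to see with a toy model. The sketch below is not the kernel code: it assumes the old order was "check requests, then set IN_GUEST_MODE", assumes a requester that posts a request and then kicks the vcpu only if it observes IN_GUEST_MODE, and assumes sequentially consistent memory (the real code needs barriers for that). All step names are hypothetical. It enumerates every interleaving and reports whether a request can go unnoticed.

```python
from itertools import combinations


def interleavings(a, b):
    """Yield every merge of sequences a and b that keeps each one's order."""
    n = len(a) + len(b)
    for positions in combinations(range(n), len(a)):
        ai, bi = iter(a), iter(b)
        yield [next(ai) if i in positions else next(bi) for i in range(n)]


def request_can_be_lost(vcpu_steps):
    """Return True if some interleaving lets the vcpu enter the guest
    without seeing the request and without being kicked."""
    requester_steps = ["post_request", "read_mode"]
    for order in interleavings(vcpu_steps, requester_steps):
        mode, request = "OUTSIDE_GUEST_MODE", False
        saw_request = kicked = False
        for step in order:
            if step == "set_mode":
                mode = "IN_GUEST_MODE"
            elif step == "check_requests":
                saw_request = request
            elif step == "post_request":
                request = True
            elif step == "read_mode":
                # The requester only kicks a vcpu it believes is in the guest.
                kicked = mode == "IN_GUEST_MODE"
        if not saw_request and not kicked:
            return True
    return False


old_order_loses = request_can_be_lost(["check_requests", "set_mode"])
new_order_loses = request_can_be_lost(["set_mode", "check_requests"])
print(old_order_loses, new_order_loses)
```

In this model, setting the mode first means that either the vcpu sees the request or the requester sees IN_GUEST_MODE and kicks, whichever way the two sides interleave.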

KVM: PPC: e500: MMU API: fix leak of shared_tlb_pages · adbb48a8

Authored by Scott Wood on Aug 22, 2012

This was found by kmemleak.
Signed-off-by: Scott Wood <scottwood@freescale.com>
Signed-off-by: Alexander Graf <agraf@suse.de>

adbb48a8

KVM: PPC: e500: fix allocation size error on g2h_tlb1_map · e400e72f

Authored by Scott Wood on Aug 22, 2012

We were only allocating half the bytes we need, which was made more
obvious by a recent fix to the memset in clear_tlb1_bitmap().
Signed-off-by: Scott Wood <scottwood@freescale.com>
Signed-off-by: Alexander Graf <agraf@suse.de>
Cc: stable@vger.kernel.org

e400e72f
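
A rough illustration of how an allocation ends up at half the needed size. The commit message does not state the element types, so this sketch assumes the map holds 64-bit entries while the allocation was sized with a 32-bit element type; the entry count is hypothetical.

```python
import ctypes

TLB1_ENTRIES = 64  # hypothetical entry count, for illustration only

# Assumption: each map entry is a 64-bit value.
entry_size = ctypes.sizeof(ctypes.c_uint64)
# Assumption: the allocation was sized with a 32-bit element type.
wrong_elem_size = ctypes.sizeof(ctypes.c_uint32)

needed_bytes = TLB1_ENTRIES * entry_size
allocated_bytes = TLB1_ENTRIES * wrong_elem_size

# A memset over the full map would run past the end of the short buffer
# by this many bytes.
overrun_bytes = needed_bytes - allocated_bytes
print(needed_bytes, allocated_bytes, overrun_bytes)
```

Sizing the allocation from the element type of the array itself, rather than from a separately written type name, avoids this class of mismatch.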

KVM: PPC: Book3S HV: Fix calculation of guest phys address for MMIO emulation · 70bddfef

Authored by Paul Mackerras on Sep 20, 2012

In the case where the host kernel is using a 64kB base page size and
the guest uses a 4k HPTE (hashed page table entry) to map an emulated
MMIO device, we were calculating the guest physical address wrongly.
We were calculating a gfn as the guest physical address shifted right
16 bits (PAGE_SHIFT) but then only adding back in 12 bits from the
effective address, since the HPTE had a 4k page size.  Thus the gpa
reported to userspace was missing 4 bits.

Instead, we now compute the guest physical address from the HPTE
without reference to the host page size, and then compute the gfn
by shifting the gpa right PAGE_SHIFT bits.
Reported-by: Alexey Kardashevskiy <aik@ozlabs.ru>
Signed-off-by: Paul Mackerras <paulus@samba.org>
Signed-off-by: Alexander Graf <agraf@suse.de>

70bddfef
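
The lost bits can be reproduced with plain integer arithmetic. This is a sketch of the two calculations as the message describes them, not the kernel code; the function names, the HPTE base address and the effective address are made-up values.

```python
HOST_PAGE_SHIFT = 16        # 64 kB host base page size
GUEST_PAGE_SIZE = 1 << 12   # 4 kB page mapped by the guest HPTE


def gpa_old(hpte_phys_base, ea):
    """Old behaviour: derive the gfn with the host shift, then add back
    only the 12 offset bits of the 4 kB guest page."""
    gfn = hpte_phys_base >> HOST_PAGE_SHIFT
    return (gfn << HOST_PAGE_SHIFT) | (ea & (GUEST_PAGE_SIZE - 1))


def gpa_new(hpte_phys_base, ea):
    """Fixed behaviour: build the gpa from the HPTE without reference to
    the host page size, then derive the gfn from that gpa."""
    offset_mask = GUEST_PAGE_SIZE - 1
    gpa = (hpte_phys_base & ~offset_mask) | (ea & offset_mask)
    gfn = gpa >> HOST_PAGE_SHIFT
    return gpa, gfn


base = 0x12345000   # hypothetical 4 kB-aligned guest physical page from the HPTE
ea = 0xD0000ABC     # hypothetical effective address of the MMIO access

print(hex(gpa_old(base, ea)))              # 0x12340abc: bits 12..15 are lost
print([hex(v) for v in gpa_new(base, ea)])  # ['0x12345abc', '0x1234']
```

The old result drops the `5` nibble, which is exactly the 4 bits between the 12-bit guest page offset and the 16-bit host page shift.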

KVM: PPC: Book3S HV: Remove bogus update of physical thread IDs · 964ee98c

Authored by Paul Mackerras on Sep 20, 2012

When making a vcpu non-runnable we incorrectly changed the
thread IDs of all other threads on the core. Just remove that
code.
Signed-off-by: Benjamin Herrenschmidt <benh@kernel.crashing.org>
Signed-off-by: Paul Mackerras <paulus@samba.org>
Signed-off-by: Alexander Graf <agraf@suse.de>

964ee98c

KVM: PPC: Book3S HV: Fix updates of vcpu->cpu · a47d72f3

Authored by Paul Mackerras on Sep 20, 2012

This removes the powerpc "generic" updates of vcpu->cpu in load and
put, and moves them to the various backends.

The reason is that "HV" KVM does its own sauce with that field
and the generic updates might corrupt it. The field contains the
CPU# of the -first- HW CPU of the core always for all the VCPU
threads of a core (the one that's online from a host Linux
perspective).

However, the preempt notifiers are going to be called on the
threads VCPUs when they are running (due to them sleeping on our
private waitqueue) causing unload to be called, potentially
clobbering the value.
Signed-off-by: Benjamin Herrenschmidt <benh@kernel.crashing.org>
Signed-off-by: Paul Mackerras <paulus@samba.org>
Signed-off-by: Alexander Graf <agraf@suse.de>

a47d72f3

KVM: Move some PPC ioctl definitions to the correct place · ed7a8d7a

Authored by Paul Mackerras on Sep 13, 2012

This moves the definitions of KVM_CREATE_SPAPR_TCE and
KVM_ALLOCATE_RMA in include/linux/kvm.h from the section listing the
vcpu ioctls to the section listing VM ioctls, as these are both
implemented and documented as VM ioctls.

Fortunately there is no actual collision of ioctl numbers at this
point.  Moving these to the correct section will reduce the
probability of a future collision.  This does not change the
user/kernel ABI at all.
Signed-off-by: Paul Mackerras <paulus@samba.org>
Acked-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Alexander Graf <agraf@suse.de>

ed7a8d7a

KVM: PPC: Book3S HV: Handle memory slot deletion and modification correctly · dfe49dbd

Authored by Paul Mackerras on Sep 11, 2012

This adds an implementation of kvm_arch_flush_shadow_memslot for
Book3S HV, and arranges for kvmppc_core_commit_memory_region to
flush the dirty log when modifying an existing slot.  With this,
we can handle deletion and modification of memory slots.

kvm_arch_flush_shadow_memslot calls kvmppc_core_flush_memslot, which
on Book3S HV now traverses the reverse map chains to remove any HPT
(hashed page table) entries referring to pages in the memslot.  This
gets called by generic code whenever deleting a memslot or changing
the guest physical address for a memslot.

We flush the dirty log in kvmppc_core_commit_memory_region for
consistency with what x86 does.  We only need to flush when an
existing memslot is being modified, because for a new memslot the
rmap array (which stores the dirty bits) is all zero, meaning that
every page is considered clean already, and when deleting a memslot
we obviously don't care about the dirty bits any more.
Signed-off-by: Paul Mackerras <paulus@samba.org>
Signed-off-by: Alexander Graf <agraf@suse.de>

dfe49dbd
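
The reasoning about when the dirty log needs flushing reduces to a small decision rule. The helper below is a hypothetical sketch of that rule only; it assumes a slot with zero pages stands for "no slot", and it is not the kernel's function.

```python
def needs_dirty_log_flush(old_npages: int, new_npages: int) -> bool:
    """Decide whether committing a memslot change should flush the dirty log.

    Assumption: npages == 0 means the slot does not exist on that side.
    """
    creating = old_npages == 0   # new slot: rmap array is all zero, every page clean
    deleting = new_npages == 0   # deleted slot: dirty bits no longer matter
    return not creating and not deleting


print(needs_dirty_log_flush(0, 1024))     # False: creating a slot
print(needs_dirty_log_flush(1024, 0))     # False: deleting a slot
print(needs_dirty_log_flush(1024, 2048))  # True: modifying an existing slot
```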

KVM: PPC: Move kvm->arch.slot_phys into memslot.arch · a66b48c3

Authored by Paul Mackerras on Sep 11, 2012

Now that we have an architecture-specific field in the kvm_memory_slot
structure, we can use it to store the array of page physical addresses
that we need for Book3S HV KVM on PPC970 processors.  This reduces the
size of struct kvm_arch for Book3S HV, and also reduces the size of
struct kvm_arch_memory_slot for other PPC KVM variants since the fields
in it are now only compiled in for Book3S HV.

This necessitates making the kvm_arch_create_memslot and
kvm_arch_free_memslot operations specific to each PPC KVM variant.
That in turn means that we now don't allocate the rmap arrays on
Book3S PR and Book E.

Since we now unpin pages and free the slot_phys array in
kvmppc_core_free_memslot, we no longer need to do it in
kvmppc_core_destroy_vm, since the generic code takes care to free
all the memslots when destroying a VM.

We now need the new memslot to be passed in to
kvmppc_core_prepare_memory_region, since we need to initialize its
arch.slot_phys member on Book3S HV.
Signed-off-by: Paul Mackerras <paulus@samba.org>
Signed-off-by: Alexander Graf <agraf@suse.de>

a66b48c3

KVM: PPC: Book3S HV: Take the SRCU read lock before looking up memslots · 2c9097e4

Authored by Paul Mackerras on Sep 11, 2012

The generic KVM code uses SRCU (sleeping RCU) to protect accesses
to the memslots data structures against updates due to userspace
adding, modifying or removing memory slots.  We need to do that too,
both to avoid accessing stale copies of the memslots and to avoid
lockdep warnings.  This therefore adds srcu_read_lock/unlock pairs
around code that accesses and uses memslots.

Since the real-mode handlers for H_ENTER, H_REMOVE and H_BULK_REMOVE
need to access the memslots, and we don't want to call the SRCU code
in real mode (since we have no assurance that it would only access
the linear mapping), we hold the SRCU read lock for the VM while
in the guest.  This does mean that adding or removing memory slots
while some vcpus are executing in the guest will block for up to
two jiffies.  This tradeoff is acceptable since adding/removing
memory slots only happens rarely, while H_ENTER/H_REMOVE/H_BULK_REMOVE
are performance-critical hot paths.
Signed-off-by: Paul Mackerras <paulus@samba.org>
Signed-off-by: Alexander Graf <agraf@suse.de>

2c9097e4

KVM: PPC: bookehv: Allow duplicate calls of DO_KVM macro · d61966fc

Authored by Mihai Caraman on Sep 12, 2012

The current form of the DO_KVM macro restricts its use to one call per input
parameter set. This is caused by the kvmppc_resume_\intno\()_\srr1 symbol
definition.
Duplicate calls of DO_KVM are required by distinct implementations of
exception handlers which are delegated at runtime. Use a rare label number
to avoid conflicts with the calling contexts.
Signed-off-by: Mihai Caraman <mihai.caraman@freescale.com>
Signed-off-by: Alexander Graf <agraf@suse.de>

d61966fc

KVM: PPC: BookE: Support FPU on non-hv systems · 7a08c274

Authored by Alexander Graf on Aug 16, 2012

When running on HV aware hosts, we can not trap when the guest sets the FP
bit, so we just let it do so when it wants to, because it has full access to
MSR.

For non-HV aware hosts with an FPU (like 440), we need to also adjust the
shadow MSR though. Otherwise the guest gets an FP unavailable trap even when
it really enabled the FP bit in MSR.
Signed-off-by: Alexander Graf <agraf@suse.de>

7a08c274

KVM: PPC: 440: Implement mfdcrx · ceb985f9

Authored by Alexander Graf on Aug 16, 2012

We need mfdcrx to execute properly on 460 cores.
Signed-off-by: Alexander Graf <agraf@suse.de>

ceb985f9

KVM: PPC: 440: Implement mtdcrx · e4dcfe88

Authored by Alexander Graf on Aug 16, 2012

We need mtdcrx to execute properly on 460 cores.
Signed-off-by: Alexander Graf <agraf@suse.de>

e4dcfe88

Document IACx/DACx registers access using ONE_REG API · 2e232702

Authored by Bharat Bhushan on Aug 15, 2012

A patch to access the debug registers (IACx/DACx) using the ONE_REG API
was sent earlier, but it missed the respective documentation.

Also correct the index number references in section 4.69.
Signed-off-by: Bharat Bhushan <bharat.bhushan@freescale.com>
Signed-off-by: Alexander Graf <agraf@suse.de>

2e232702

KVM: PPC: E500: Remove E500_TLB_DIRTY flag · 430c7ff5

Authored by Alexander Graf on Aug 15, 2012

Since we always mark pages as dirty immediately when mapping them read/write
now, there's no need for the dirty flag in our cache.
Signed-off-by: Alexander Graf <agraf@suse.de>

430c7ff5

KVM: PPC: Use symbols for exit trace · 166a2b70

Authored by Alexander Graf on Aug 15, 2012

Exit traces are a lot easier to read when you don't have to remember
cryptic numbers for guest exit reasons. Symbolify them in our trace
output.
Signed-off-by: Alexander Graf <agraf@suse.de>

166a2b70

KVM: PPC: BookE: Add MCSR SPR support · 50c871ed

Authored by Alexander Graf on Aug 13, 2012

Add support for the MCSR SPR. This only implements the SPR storage
bits, not actual machine checks.
Signed-off-by: Alexander Graf <agraf@suse.de>

50c871ed

KVM: PPC: 44x: Initialize PVR · 491dd5b8

Authored by Alexander Graf on Aug 13, 2012

We need to make sure that vcpu->arch.pvr is initialized to a sane value,
so let's just take the host PVR.
Signed-off-by: Alexander Graf <agraf@suse.de>

491dd5b8

booke: Added ONE_REG interface for IAC/DAC debug registers · 6df8d3fc

Authored by Bharat Bhushan on Aug 08, 2012

IAC/DAC are defined as 32 bit while they are 64 bit wide. So a ONE_REG
interface is added to set/get them.
Signed-off-by: Bharat Bhushan <bharat.bhushan@freescale.com>
Signed-off-by: Alexander Graf <agraf@suse.de>

6df8d3fc

KVM: PPC: booke: Add watchdog emulation · f61c94bb

Authored by Bharat Bhushan on Aug 08, 2012

This patch adds watchdog emulation in KVM. The watchdog
emulation is enabled by the KVM_ENABLE_CAP(KVM_CAP_PPC_BOOKE_WATCHDOG) ioctl.
Kernel timers are used for the watchdog emulation and emulate the
h/w watchdog state machine. On watchdog timer expiry, KVM exits to QEMU
if TCR.WRC is non-zero. QEMU can then reset, shut down, etc., depending on how
it is configured.
Signed-off-by: Liu Yu <yu.liu@freescale.com>
Signed-off-by: Scott Wood <scottwood@freescale.com>
[bharat.bhushan@freescale.com: reworked patch]
Signed-off-by: Bharat Bhushan <bharat.bhushan@freescale.com>
[agraf: adjust to new request framework]
Signed-off-by: Alexander Graf <agraf@suse.de>

f61c94bb
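
A minimal sketch of a watchdog state machine of this kind. The commit message only says that the h/w state machine is emulated and that the final expiry exits to QEMU when TCR.WRC is non-zero; the three-stage progression through the TSR[ENW] and TSR[WIS] bits shown here is an assumption based on the Book E watchdog design, and the class and return strings are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Watchdog:
    wrc: int = 0        # stands in for TCR[WRC], the configured reset control
    enw: bool = False   # stands in for TSR[ENW], "enable next watchdog"
    wis: bool = False   # stands in for TSR[WIS], "watchdog interrupt status"

    def timeout(self) -> str:
        """Advance the state machine by one watchdog period expiry."""
        if not self.enw:
            self.enw = True           # first expiry: just arm the next stage
            return "armed"
        if not self.wis:
            self.wis = True           # second expiry: raise the watchdog interrupt
            return "interrupt"
        if self.wrc != 0:
            return "exit_to_userspace"  # final expiry: let QEMU decide what to do
        return "ignored"              # final expiry with WRC == 0: nothing to do


wd = Watchdog(wrc=2)
stages = [wd.timeout() for _ in range(3)]
print(stages)
```

In this model a guest that services the watchdog would clear `enw` and `wis` before the final stage is reached, so the exit to userspace only happens when the guest stops responding.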

KVM: PPC: Add return value to core_check_requests · 7c973a2e

Authored by Alexander Graf on Aug 13, 2012

Requests may want to tell us that we need to go back into host state,
so add a return value for the checks.
Signed-off-by: Alexander Graf <agraf@suse.de>

7c973a2e

KVM: PPC: Add return value in prepare_to_enter · 7ee78855

Authored by Alexander Graf on Aug 13, 2012

Our prepare_to_enter helper wants to be able to return in more circumstances
to the host than only when an interrupt is pending. Broaden the interface a
bit and move even more generic code to the generic helper.
Signed-off-by: Alexander Graf <agraf@suse.de>

7ee78855

KVM: PPC: Ignore EXITING_GUEST_MODE mode · 206c2ed7

Authored by Alexander Graf on Aug 13, 2012

We don't need to do anything when mode is EXITING_GUEST_MODE, because
we essentially are outside of guest mode and did everything it asked
us to do by the time we check it.
Signed-off-by: Alexander Graf <agraf@suse.de>

206c2ed7

KVM: PPC: Move kvm_guest_enter call into generic code · 3766a4c6

Authored by Alexander Graf on Aug 13, 2012

We need to call kvm_guest_enter in booke and book3s, so move its
call to generic code.
Signed-off-by: Alexander Graf <agraf@suse.de>

3766a4c6

KVM: PPC: Book3S: PR: Rework irq disabling · bd2be683

Authored by Alexander Graf on Aug 13, 2012

Today, we disable preemption while inside guest context, because we need
to expose to the world that we are not in a preemptible context. However,
during that time we already have interrupts disabled, which would indicate
that we are in a non-preemptible context.

The reason the checks for irqs_disabled() fail for us though is that we
manually control hard IRQs and ignore all the lazy EE framework. Let's
stop doing that. Instead, let's always use lazy EE to indicate when we
want to disable IRQs, but do a special final switch that gets us into
EE disabled, but soft enabled state. That way when we get back out of
guest state, we are immediately ready to process interrupts.

This simplifies the code drastically and reduces the time that we appear
as preempt disabled.
Signed-off-by: Alexander Graf <agraf@suse.de>

bd2be683

KVM: PPC: Consistentify vcpu exit path · 24afa37b

Authored by Alexander Graf on Aug 12, 2012

When getting out of __vcpu_run, let's be consistent about the state we
return in. We want to always

  * have IRQs enabled
  * have called kvm_guest_exit before
Signed-off-by: Alexander Graf <agraf@suse.de>

24afa37b

KVM: PPC: Book3S: PR: Indicate we're out of guest mode · 0652eaae

Authored by Alexander Graf on Aug 12, 2012

When going out of guest mode, indicate that in vcpu->mode. That way
requests from other CPUs don't need to kick us needlessly to process them,
because that will just happen the next time we enter the guest.
Signed-off-by: Alexander Graf <agraf@suse.de>

0652eaae

KVM: PPC: Exit guest context while handling exit · 706fb730

Authored by Alexander Graf on Aug 12, 2012

The x86 implementation of KVM accounts for host time while processing
guest exits. Do the same for us.
Signed-off-by: Alexander Graf <agraf@suse.de>

706fb730

KVM: PPC: Book3S: PR: Only do resched check once per exit · c63ddcb4

Authored by Alexander Graf on Aug 12, 2012

Now that we use our generic exit helper, we can safely drop our previous
kvm_resched that we used to trigger at the beginning of the exit handler
function.
Signed-off-by: Alexander Graf <agraf@suse.de>

c63ddcb4

KVM: PPC: BookE: Drop redundant vcpu->mode set · e85ad380

Authored by Alexander Graf on Aug 12, 2012

We only need to set vcpu->mode to outside once.
Signed-off-by: Alexander Graf <agraf@suse.de>

e85ad380

KVM: PPC: Book3s: PR: Add (dumb) MMU Notifier support · 9b0cb3c8

Authored by Alexander Graf on Aug 10, 2012

Now that we have very simple MMU Notifier support for e500 in place,
also add the same simple support to book3s. It gets us one step closer
to actual fast support.
Signed-off-by: Alexander Graf <agraf@suse.de>

9b0cb3c8

KVM: PPC: Use same kvmppc_prepare_to_enter code for booke and book3s_pr · 03d25c5b

Authored by Alexander Graf on Aug 10, 2012

We need to do the same things when preparing to enter a guest for booke and
book3s_pr cores. Fold the generic code into a generic function that both call.
Signed-off-by: Alexander Graf <agraf@suse.de>

03d25c5b

KVM: PPC: BookE: No duplicate request != 0 check · 2d8185d4

Authored by Alexander Graf on Aug 10, 2012

We only call kvmppc_check_requests() when vcpu->requests != 0, so drop
the redundant check in the function itself.
Signed-off-by: Alexander Graf <agraf@suse.de>

2d8185d4

KVM: PPC: BookE: Add some more trace points · 6346046c

Authored by Alexander Graf on Aug 08, 2012

Without trace points, debugging what exactly is going on inside guest
code can be very tricky. Add a few more trace points at places that
hopefully tell us more when things go wrong.
Signed-off-by: Alexander Graf <agraf@suse.de>

6346046c

KVM: PPC: E500: Implement MMU notifiers · 862d31f7

Authored by Alexander Graf on Jul 31, 2012

The e500 target has lived without mmu notifiers ever since it got
introduced, but fails for the user space check on them with hugetlbfs.

So in order to get that one working, implement mmu notifiers in a
reasonably dumb fashion and be happy. On embedded hardware, we almost
never end up with mmu notifier calls, since most people don't overcommit.
Signed-off-by: Alexander Graf <agraf@suse.de>

862d31f7

KVM: PPC: BookE: Add support for vcpu->mode · d69c6436

Authored by Alexander Graf on Aug 08, 2012

Generic KVM code might want to know whether we are inside guest context
or outside. It also wants to be able to push us out of guest context.

Add support to the BookE code for the generic vcpu->mode field that describes
the above states.
Signed-off-by: Alexander Graf <agraf@suse.de>

d69c6436

KVM: PPC: BookE: Add check_requests helper function · 4ffc6356

Authored by Alexander Graf on Aug 08, 2012

We need a central place to check for pending requests in. Add one that
only does the timer check we already do in a different place.

Later, this central function can be extended by more checks.
Signed-off-by: Alexander Graf <agraf@suse.de>

4ffc6356

powerpc/epapr: export epapr_hypercall_start · 8043e494

Authored by Scott Wood on Aug 10, 2012

This fixes breakage introduced by the following commit:

  commit 6d2d82627f4f1e96a33664ace494fa363e0495cb
  Author: Liu Yu-B13201 <Yu.Liu@freescale.com>
  Date:   Tue Jul 3 05:48:56 2012 +0000

    PPC: Don't use hardcoded opcode for ePAPR hcall invocation

when a driver that uses ePAPR hypercalls is built as a module.
Reported-by: Geert Uytterhoeven <geert@linux-m68k.org>
Signed-off-by: Scott Wood <scottwood@freescale.com>
Signed-off-by: Alexander Graf <agraf@suse.de>

8043e494

KVM: PPC: Quieten message about allocating linear regions · 1340f3e8

Authored by Paul Mackerras on Aug 06, 2012

This is printed once for every RMA or HPT region that gets
preallocated.  If one preallocates hundreds of such regions
(in order to run hundreds of KVM guests), that gets rather
painful, so make it a bit quieter.
Signed-off-by: Paul Mackerras <paulus@samba.org>
Signed-off-by: Alexander Graf <agraf@suse.de>

1340f3e8

openeuler / Kernel synced successfully almost 2 years ago