Commits · 8b6db3bc965c204db6868d4005808b4fdc9c46d7 · openanolis / cloud-kernel

24 Oct, 2010 40 commits

KVM: PPC: Implement correct SID mapping on Book3s_32 · 8b6db3bc

Committed by Alexander Graf on Aug 15, 2010

Up until now we were doing segment mappings wrong on Book3s_32. For Book3s_64
we were using a trick where we know that a single mmu_context gives us 16 bits
of context ids.

The mm system on Book3s_32 instead uses a clever algorithm to distribute VSIDs
across the available range, so a context id really only gives us 16 available
VSIDs.

To keep at least a few guest processes in the SID shadow, let's map a number of
contexts that we can use as VSID pool. This makes the code be actually correct
and shouldn't hurt performance too much.

Signed-off-by: Alexander Graf <agraf@suse.de>

8b6db3bc

KVM: PPC: Force enable nap on KVM · ad087376

Committed by Alexander Graf on Aug 17, 2010

There are some heuristics in the PPC power management code that try to find
out if the particular hardware we're running on supports proper power management
or just hangs the machine when going into nap mode.

Since we know that KVM is safe with nap, let's force enable it in the PV code
once we're certain that we are on a KVM VM.

Signed-off-by: Alexander Graf <agraf@suse.de>

ad087376

KVM: PPC: Make PV mtmsrd L=1 work with r30 and r31 · df08bd10

Committed by Alexander Graf on Aug 05, 2010

We had an arbitrary limitation in mtmsrd L=1 that kept us from using r30 and
r31 as input registers. Let's get rid of that and get more potential speedups!

Signed-off-by: Alexander Graf <agraf@suse.de>

df08bd10

KVM: PPC: Update int_pending also on dequeue · 9ee18b1e

Committed by Alexander Graf on Aug 05, 2010

When having a decrementer interrupt pending, the dequeuing happens manually
through an mtdec instruction. This instruction simply calls dequeue on that
interrupt, so the int_pending hint doesn't get updated.

This patch enables updating the int_pending hint also on dequeue, thus
correctly enabling guests to stay in guest contexts more often.

Signed-off-by: Alexander Graf <agraf@suse.de>

9ee18b1e
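
The effect of the fix above can be illustrated with a small model. This is only a sketch, not the kernel code: the class `VcpuIrqState` and its method names are hypothetical, and the real implementation works on bitmaps inside the vcpu structure and the shared page. The point it shows is that the hint is recomputed on both queue and dequeue.

```python
class VcpuIrqState:
    """Toy model of pending interrupts plus the guest-visible hint."""

    def __init__(self):
        self.pending = set()      # interrupts waiting for delivery
        self.int_pending = False  # hint the guest reads from the shared page

    def _update_hint(self):
        # The hint simply mirrors "is anything pending right now?".
        self.int_pending = bool(self.pending)

    def queue(self, irq):
        self.pending.add(irq)
        self._update_hint()

    def dequeue(self, irq):
        self.pending.discard(irq)
        # Before the fix, the hint was left untouched here, so it could
        # stay set after the last interrupt had been dequeued.
        self._update_hint()
```

With the hint cleared on dequeue, a guest that re-enables interrupts no longer sees a stale `int_pending` and has no reason to exit.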

KVM: PPC: Make PV mtmsr work with r30 and r31 · 512ba59e

Committed by Alexander Graf on Aug 05, 2010

So far we've been restricting ourselves to r0-r29 as registers an mtmsr
instruction could use. This was bad, as there are some code paths in
Linux actually using r30.

So let's instead handle all registers gracefully and get rid of that
stupid limitation.

Signed-off-by: Alexander Graf <agraf@suse.de>

512ba59e

KVM: PPC: Add mtsrin PV code · cbe487fa

Committed by Alexander Graf on Aug 03, 2010

This is the guest side of the mtsr acceleration. Using this a guest can now
call mtsrin with almost no overhead as long as it ensures that it only uses
it with (MSR_IR|MSR_DR) == 0. Linux does that, so we're good.

Signed-off-by: Alexander Graf <agraf@suse.de>

cbe487fa

KVM: PPC: Put segment registers in shared page · df1bfa25

Committed by Alexander Graf on Aug 03, 2010

Now that the actual mtsr doesn't do anything anymore, we can move the sr
contents over to the shared page, so a guest can directly read and write
its sr contents from guest context.

Signed-off-by: Alexander Graf <agraf@suse.de>

df1bfa25

KVM: PPC: Interpret SR registers on demand · 8e865178

Committed by Alexander Graf on Aug 03, 2010

Right now we're examining the contents of Book3s_32's segment registers when
the register is written and put the interpreted contents into a struct.

There are two reasons this is bad. For starters, the struct has worse real-time
performance, as it occupies more ram. But the more important part is that with
segment registers being interpreted from their raw values, we can put them in
the shared page, allowing guests to mess with them directly.

This patch makes the internal representation of SRs be u32s.

Signed-off-by: Alexander Graf <agraf@suse.de>

8e865178
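
To make "interpret on demand" concrete, here is a minimal sketch of decoding a raw 32-bit segment register value only when a field is needed. The bit positions follow the 32-bit PowerPC segment register layout for ordinary (T=0) segments; the constant and function names are chosen for this illustration and are not taken from the patch.

```python
SR_T = 0x80000000          # T: direct-store segment
SR_KS = 0x40000000         # Ks: supervisor-state protection key
SR_KP = 0x20000000         # Kp: user-state protection key
SR_N = 0x10000000          # N: no-execute
SR_VSID_MASK = 0x00FFFFFF  # low 24 bits: virtual segment id


def decode_sr(raw):
    """Interpret a raw u32 segment register value at the time of use."""
    return {
        "t": bool(raw & SR_T),
        "ks": bool(raw & SR_KS),
        "kp": bool(raw & SR_KP),
        "n": bool(raw & SR_N),
        "vsid": raw & SR_VSID_MASK,
    }
```

Because only the raw value is stored, the same 32-bit word can sit in a page the guest writes directly, and the host decodes it whenever it needs a field.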

KVM: PPC: Move BAT handling code into spr handler · c1c88e2f

Committed by Alexander Graf on Aug 02, 2010

The current approach duplicates the spr->bat finding logic and makes it harder
to reuse the actually used variables. So let's move everything down to the spr
handler.

Signed-off-by: Alexander Graf <agraf@suse.de>

c1c88e2f

KVM: PPC: Add feature bitmap for magic page · 7508e16c

Committed by Alexander Graf on Aug 03, 2010

We will soon add SR PV support to the shared page, so we need some
infrastructure that allows the guest to query for features KVM exports.

This patch adds a second return value to the magic mapping that
indicates to the guest which features are available.

Signed-off-by: Alexander Graf <agraf@suse.de>

7508e16c
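
A feature bitmap of this kind is typically consumed as shown below. This is a sketch under assumptions: `MAGIC_FEAT_SR`, `map_magic_page` and `guest_can_use_pv_sr` are hypothetical names, and the bit assignment is made up for the example rather than read from the patch.

```python
MAGIC_FEAT_SR = 1 << 0  # hypothetical bit: segment registers are in the shared page


def map_magic_page(host_features):
    """Stand-in for the magic page mapping call: returns (status, features)."""
    status = 0  # 0 means success in this sketch
    return status, host_features


def guest_can_use_pv_sr(host_features):
    """Guest side: only patch SR accesses when the host advertises the feature."""
    status, features = map_magic_page(host_features)
    return status == 0 and bool(features & MAGIC_FEAT_SR)
```

A guest running on an older host sees a bitmap without the bit and keeps using the unpatched instructions.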

KVM: PPC: Remove unused define · cb24c508

Committed by Alexander Graf on Aug 02, 2010

The define VSID_ALL is unused. Let's remove it.

Signed-off-by: Alexander Graf <agraf@suse.de>

cb24c508

KVM: PPC: Revert "KVM: PPC: Use kernel hash function" · b9877ce2

Committed by Alexander Graf on Aug 02, 2010

It turns out the in-kernel hash function is sub-optimal for our subtle
hash inputs where every bit is significant. So let's revert to the original
hash functions.

This reverts commit 05340ab4f9a6626f7a2e8f9fe5397c61d494f445.

Signed-off-by: Alexander Graf <agraf@suse.de>

b9877ce2

KVM: PPC: Move slb debugging to tracepoints · 928d78be

Committed by Alexander Graf on Aug 02, 2010

This patch moves debugging printks for shadow SLB debugging over to tracepoints.

Signed-off-by: Alexander Graf <agraf@suse.de>

928d78be

KVM: PPC: Make invalidation code more reliable · e7c1d14e

Committed by Alexander Graf on Aug 02, 2010

There is a race condition in the pte invalidation code path where we can't
be sure if a pte was invalidated already. So let's move the spin lock around
to get rid of the race.

Signed-off-by: Alexander Graf <agraf@suse.de>

e7c1d14e

KVM: PPC: Don't flush PTEs on NX/RO hit · 2e602847

Committed by Alexander Graf on Aug 02, 2010

When hitting a no-execute or read-only data/inst storage interrupt we were
flushing the respective PTE so we're sure it gets properly overwritten next.

According to the spec, this is unnecessary though. The guest issues a tlbie
anyways, so we're safe to just keep the PTE around and have it manually removed
from the guest, saving us a flush.

Signed-off-by: Alexander Graf <agraf@suse.de>

2e602847

KVM: PPC: Preload magic page when in kernel mode · 4cb6b7ea

Committed by Alexander Graf on Aug 02, 2010

When the guest jumps into kernel mode and has the magic page mapped, there's a
very high chance that it will also use it. So let's detect that scenario and
map the segment accordingly.

Signed-off-by: Alexander Graf <agraf@suse.de>

4cb6b7ea

KVM: PPC: Add tracepoints for generic spte flushes · c60b4cf7

Committed by Alexander Graf on Aug 02, 2010

The different ways of flushing shadow ptes have their own debug prints which use
stupid old printk.

Let's move them to tracepoints, making them easier available, faster and
possible to activate on demand.

Signed-off-by: Alexander Graf <agraf@suse.de>

c60b4cf7

KVM: PPC: Fix sid map search after flush · c22c3196

Committed by Alexander Graf on Aug 02, 2010

After a flush the sid map contained lots of entries with 0 for their gvsid and
hvsid value. Unfortunately, 0 can be a real value the guest searches for when
looking up a vsid so it would incorrectly find the host's 0 hvsid mapping which
doesn't belong to our sid space.

So let's also check for the valid bit that indicates that the sid we're
looking at actually contains useful data.

Signed-off-by: Alexander Graf <agraf@suse.de>

c22c3196
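
The bug and its fix can be reproduced in a few lines. The sketch below is not the kernel's data structure: `SidMapEntry` and `find_sid` are hypothetical names, and a plain list stands in for the hashed sid map. It shows why a lookup for guest VSID 0 must skip zeroed entries left behind by a flush.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class SidMapEntry:
    guest_vsid: int = 0
    host_vsid: int = 0
    valid: bool = False  # a flush leaves entries zeroed and not valid


def find_sid(sid_map: List[SidMapEntry], gvsid: int) -> Optional[SidMapEntry]:
    """Return the valid entry that maps gvsid, or None if there is none."""
    for entry in sid_map:
        # Without the valid check, any flushed entry would match gvsid == 0
        # and hand back host VSID 0, which is outside the guest's sid space.
        if entry.valid and entry.guest_vsid == gvsid:
            return entry
    return None
```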

KVM: PPC: Move pte invalidate debug code to tracepoint · 8696ee43

Committed by Alexander Graf on Aug 02, 2010

This patch moves the SPTE flush debug printk over to tracepoints.

Signed-off-by: Alexander Graf <agraf@suse.de>

8696ee43

KVM: PPC: Add tracepoint for generic mmu map · 4c4eea77

Committed by Alexander Graf on Aug 02, 2010

This patch moves the generic mmu map debugging over to tracepoints.

Signed-off-by: Alexander Graf <agraf@suse.de>

4c4eea77

KVM: PPC: Move book3s_64 mmu map debug print to trace point · 82fdee7b

Committed by Alexander Graf on Aug 02, 2010

This patch moves Book3s MMU debugging over to tracepoints.

Signed-off-by: Alexander Graf <agraf@suse.de>

82fdee7b

KVM: PPC: Move EXIT_DEBUG partially to tracepoints · bed1ed98

Committed by Alexander Graf on Aug 02, 2010

We have a debug printk on every exit that is usually #ifdef'ed out. Using
tracepoints makes a lot more sense here though, as they can be dynamically
enabled.

This patch converts the most commonly used debug printks of EXIT_DEBUG to
tracepoints.

Signed-off-by: Alexander Graf <agraf@suse.de>

bed1ed98

KVM: PPC: Fix CONFIG_KVM_GUEST && !CONFIG_KVM case · 989044ee

Committed by Alexander Graf on Aug 30, 2010

When CONFIG_KVM_GUEST is selected, but CONFIG_KVM is not, we were missing
some defines in asm-offsets.c and included too many headers at other places.

This patch makes above configuration work.

Reported-by: Stephen Rothwell <sfr@canb.auug.org.au>
Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

989044ee

KVM: PPC: fix leakage of error page in kvmppc_patch_dcbz() · 646bab55

Committed by Wei Yongjun on Aug 17, 2010

Add kvm_release_page_clean() after is_error_page() to avoid
leakage of error page.

Signed-off-by: Wei Yongjun <yjwei@cn.fujitsu.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

646bab55

KVM: PPC: Move KVM trampolines before __end_interrupts · a58ddea5

Committed by Alexander Graf on Jul 29, 2010

When using a relocatable kernel we need to make sure that the trampoline code
and the interrupt handlers are both copied to low memory. The only way to do
this reliably is to put them in the copied section.

This patch should make relocated kernels work with KVM.

KVM-Stable-Tag

Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

a58ddea5

KVM: PPC: Make long relocations be ulong · 2b05d71f

Committed by Alexander Graf on Jul 29, 2010

On Book3S KVM we directly expose some asm pointers to C code as
variables. These need to be relocated and thus break on relocatable
kernels.

To make sure we can at least build, let's mark them as long instead
of u32 where 64bit relocations don't work.

This fixes the following build error:

WARNING: 2 bad relocations
> c000000000008590 R_PPC64_ADDR32    .text+0x4000000000008460
> c000000000008594 R_PPC64_ADDR32    .text+0x4000000000008598

Please keep in mind that actually using KVM on a relocated kernel
might still break. This only fixes the compile problem.

Reported-by: Subrata Modak <subrata@linux.vnet.ibm.com>
Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

2b05d71f

KVM: PPC: Use MSR_DR for external load_up · 0e677903

Committed by Alexander Graf on Jul 29, 2010

Book3S_32 requires MSR_DR to be disabled during load_up_xxx while on Book3S_64
it's supposed to be enabled. I misread the code and disabled it in both cases,
potentially breaking the PS3 which has a really small RMA.

This patch makes KVM work on the PS3 again.

Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

0e677903

KVM: PPC: Add book3s_32 tlbie flush acceleration · 2d27fc5e

Committed by Alexander Graf on Jul 29, 2010

On Book3s_32 the tlbie instruction flushes effective addresses by the mask
0x0ffff000. This is pretty hard to reflect with a hash that hashes ~0xfff, so
to speed up that target we should also keep a special hash around for it.

Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

2d27fc5e
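
The idea of keeping a second index keyed by the tlbie mask can be sketched as follows. This is an illustration only: `ShadowPteCache` is a hypothetical name, Python dictionaries stand in for the kernel's hash lists, and only the two masks come from the commit message.

```python
from collections import defaultdict

PAGE_MASK = 0xFFFFF000   # ~0xfff on a 32-bit effective address
TLBIE_MASK = 0x0FFFF000  # bits tlbie compares on Book3s_32


class ShadowPteCache:
    """Toy cache of shadowed pages with an extra index for tlbie flushes."""

    def __init__(self):
        self.pages = set()                # pages currently shadowed
        self.by_tlbie = defaultdict(set)  # extra index keyed by masked address

    def add(self, eaddr):
        page = eaddr & PAGE_MASK
        self.pages.add(page)
        self.by_tlbie[eaddr & TLBIE_MASK].add(page)

    def flush_tlbie(self, eaddr):
        """Drop every cached page that matches eaddr under TLBIE_MASK."""
        victims = self.by_tlbie.pop(eaddr & TLBIE_MASK, set())
        self.pages -= victims
        return len(victims)
```

Because the top four address bits are ignored by the mask, one tlbie can hit the same page offset in all 16 segments; the extra index finds those entries in one lookup instead of 16 probes of the page-keyed hash.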

KVM: PPC: correctly check gfn_to_pfn() return value · 49451389

Committed by Gleb Natapov on Jul 29, 2010

On failure gfn_to_pfn returns bad_page so use correct function to check
for that.

Signed-off-by: Gleb Natapov <gleb@redhat.com>
Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

49451389

KVM: PPC: RCU'ify the Book3s MMU · 2e0908af

Committed by Alexander Graf on Jul 29, 2010

So far we've been running all code without locking of any sort. This wasn't
really an issue because I didn't see any parallel access to the shadow MMU
code coming.

But then I started to implement dirty bitmapping to MOL which has the video
code in its own thread, so suddenly we had the dirty bitmap code run in
parallel to the shadow mmu code. And with that came trouble.

So I went ahead and made the MMU modifying functions as parallelizable as
I could think of. I hope I didn't screw up too much RCU logic :-). If you
know your way around RCU and locking and what needs to be done when, please
take a look at this patch.

Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

2e0908af

KVM: PPC: Book3S_32 MMU debug compile fixes · 53021042

Committed by Alexander Graf on Jul 29, 2010

Due to previous changes, the Book3S_32 guest MMU code didn't compile properly
when enabling debugging.

This patch repairs the broken code paths, making it possible to define DEBUG_MMU
and friends again.

Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

53021042

KVM: PPC: Add get_pvinfo interface to query hypercall instructions · 15711e9c

Committed by Alexander Graf on Jul 29, 2010

We need to tell the guest the opcodes that make up a hypercall through
interfaces that are controlled by userspace. So we need to add a call
for userspace to allow it to query those opcodes so it can pass them
on.

This is required because the hypercall opcodes can change based on
the hypervisor conditions. If we're running in hardware accelerated
hypervisor mode, a hypercall looks different from when we're running
without hardware acceleration.

Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

15711e9c

KVM: PPC: PV wrteei · 644bfa01

Committed by Alexander Graf on Jul 29, 2010

On BookE the preferred way to write the EE bit is the wrteei instruction. It
already encodes the EE bit in the instruction.

So in order to get BookE some speedups as well, let's also PV'nize that
instruction.

Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

644bfa01
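
Since wrteei carries the EE value in its own encoding, the replacement code only needs to copy one bit into the MSR image in the shared page. The sketch below shows that step; the two opcode constants are the standard encodings of `wrteei 0` and `wrteei 1`, while the function names and the idea of returning the new MSR value are assumptions made for illustration.

```python
MSR_EE = 0x8000             # external interrupt enable, also the E field of wrteei
INST_WRTEEI_0 = 0x7C000146  # wrteei 0
INST_WRTEEI_1 = 0x7C008146  # wrteei 1


def wrteei_ee(inst):
    """Return the EE value encoded in a wrteei instruction, else None."""
    if inst & 0xFFFF7FFF != INST_WRTEEI_0:
        return None  # not a wrteei instruction
    return bool(inst & MSR_EE)


def pv_wrteei(shared_msr, inst):
    """Apply a wrteei instruction to the MSR copy kept in the shared page."""
    return (shared_msr & ~MSR_EE) | (inst & MSR_EE)
```

As with the mtmsrd L=1 case, setting EE=1 would additionally require a check for pending interrupts, which this sketch leaves out.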

KVM: PPC: PV mtmsrd L=0 and mtmsr · 78109277

Committed by Alexander Graf on Jul 29, 2010

There is also a form of mtmsr where all bits need to be addressed. While the
PPC64 Linux kernel behaves reasonably well here, on PPC32 we do not have an
L=1 form. It does mtmsr even for simple things like only changing EE.

So we need to hook into that one as well and check for a mask of bits that we
deem safe to change from within guest context.

Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

78109277
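
The mask check described above boils down to an XOR of the old and new MSR values. In this sketch the set of safe bits is an assumption (only EE and RI are treated as safe here; the patch may allow more), and `needs_hypervisor` is a hypothetical helper name.

```python
MSR_EE = 0x8000  # external interrupt enable
MSR_RI = 0x0002  # recoverable interrupt
SAFE_BITS = MSR_EE | MSR_RI  # assumption for this sketch


def needs_hypervisor(old_msr, new_msr):
    """True when the write touches a bit the guest may not change on its own."""
    changed = old_msr ^ new_msr
    return bool(changed & ~SAFE_BITS)
```

When the function returns False, the new value can simply be stored to the shared page; otherwise the original mtmsr is executed and traps to the hypervisor.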

KVM: PPC: PV mtmsrd L=1 · 819a63dc

Committed by Alexander Graf on Jul 29, 2010

The PowerPC ISA has a special instruction for mtmsr that only changes the EE
and RI bits, namely the L=1 form.

Since that one is reasonably often occurring and simple to implement, let's
go with this first. Writing EE=0 is always just a store. Doing EE=1 also
requires us to check for pending interrupts and if necessary exit back to the
hypervisor.

Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

819a63dc
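
The two cases described above (EE=0 is a plain store, EE=1 needs a pending check) can be modelled like this. It is a sketch of the logic only: `MagicPage` and `pv_mtmsrd_l1` are hypothetical names, and the real replacement is assembly that runs inside the guest.

```python
MSR_EE = 0x8000  # external interrupt enable
MSR_RI = 0x0002  # recoverable interrupt


class MagicPage:
    """The two shared-page fields this sketch needs."""

    def __init__(self, msr=0, int_pending=False):
        self.msr = msr
        self.int_pending = int_pending


def pv_mtmsrd_l1(page, reg):
    """Emulate `mtmsrd reg, 1`; return True if the guest must exit."""
    mask = MSR_EE | MSR_RI
    # Only EE and RI are taken from the source register; the rest is kept.
    page.msr = (page.msr & ~mask) | (reg & mask)
    # Enabling interrupts while one is pending needs the hypervisor.
    return bool(reg & MSR_EE) and page.int_pending
```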

KVM: PPC: PV assembler helpers · 92234722

Committed by Alexander Graf on Jul 29, 2010

When we hook an instruction we need to make sure we don't clobber any of
the registers at that point. So we write them out to scratch space in the
magic page. To make sure we don't fall into a race with another piece of
hooked code, we need to disable interrupts.

To make the later patches and code in general easier readable, let's introduce
a set of defines that save and restore r30, r31 and cr. Let's also define some
helpers to read the lower 32 bits of a 64 bit field on 32 bit systems.

Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

92234722

KVM: PPC: Introduce branch patching helper · 71ee8e34

Committed by Alexander Graf on Jul 29, 2010

We will need to patch several instruction streams over to a different
code path, so we need a way to patch a single instruction with a branch
somewhere else.

This patch adds a helper to facilitate this patching.

Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

71ee8e34
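
Patching in a branch means computing the encoding of a relative `b` instruction from the patch site to the target. The sketch below shows that computation using the PowerPC I-form layout (primary opcode 18, 24-bit word offset, AA=0, LK=0); `encode_branch` and the constant names are chosen for this illustration and are not the helper added by the patch.

```python
INST_B = 0x48000000        # primary opcode 18 with AA=0 and LK=0
INST_B_MASK = 0x03FFFFFC   # 24-bit LI field, stored shifted left by 2
BRANCH_REACH = 0x02000000  # relative branches reach 32 MB either way


def encode_branch(from_addr, to_addr):
    """Return the instruction word for `b to_addr` placed at from_addr."""
    offset = to_addr - from_addr
    if offset % 4 or not -BRANCH_REACH <= offset < BRANCH_REACH:
        raise ValueError("target not reachable with a relative branch")
    # Masking keeps the two's-complement offset inside the LI field.
    return INST_B | (offset & INST_B_MASK)
```

For example, a branch from 0x1000 to 0x2000 encodes as 0x48001000, and the reverse direction as 0x4BFFF000.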

KVM: PPC: Introduce kvm_tmp framework · 2d4f5671

Committed by Alexander Graf on Jul 29, 2010

We will soon require more sophisticated methods to replace single instructions
with multiple instructions. We do that by branching to a memory region where we
write replacement code for the instruction to.

This region needs to be within 32 MB of the patched instruction though, because
that's the furthest we can jump with immediate branches.

So we keep 1MB of free space around in bss. After we're done initing we can just
tell the mm system that the unused pages are free, but until then we have enough
space to fit all our code in.

Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

2d4f5671
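
A reserved region that is filled once during init and then trimmed behaves like a bump allocator. The following sketch models that behaviour; `TmpArea`, `alloc` and `unused` are hypothetical names, and a `bytearray` stands in for the 1 MB of bss.

```python
TMP_SIZE = 1024 * 1024  # 1 MB, as in the commit message


class TmpArea:
    """Bump allocator over a fixed buffer for replacement code."""

    def __init__(self, size=TMP_SIZE):
        self.buf = bytearray(size)
        self.index = 0  # first free byte

    def alloc(self, length):
        """Reserve length bytes; return their start offset, or None if full."""
        if self.index + length > len(self.buf):
            return None
        start = self.index
        self.index += length
        return start

    def unused(self):
        """Bytes that could be handed back to the mm system after init."""
        return len(self.buf) - self.index
```

Nothing is ever freed individually, which keeps allocation trivial; the only cleanup is returning the unused tail once patching is finished.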

KVM: PPC: PV tlbsync to nop · d1290b15

Committed by Alexander Graf on Jul 29, 2010

With our current MMU scheme we don't need to know about the tlbsync instruction.
So we can just nop it out.

Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

d1290b15

KVM: PPC: PV instructions to loads and stores · d1293c92

Committed by Alexander Graf on Jul 29, 2010

Some instructions can simply be replaced by load and store instructions to
or from the magic page.

This patch replaces often called instructions that fall into the above category.

Signed-off-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

d1293c92

openanolis / cloud-kernel synced successfully more than 1 year ago