- 12 Jul 2011, 14 commits
-
-
Authored by Paul Mackerras
This moves the slb field, which represents the state of the emulated SLB, from the kvmppc_vcpu_book3s struct to the kvm_vcpu_arch, and the hpte_hash_[v]pte[_long] fields from kvm_vcpu_arch to kvmppc_vcpu_book3s. This is in accord with the principle that the kvm_vcpu_arch struct represents the state of the emulated CPU, and the kvmppc_vcpu_book3s struct holds the auxiliary data structures used in the emulation. Signed-off-by: Paul Mackerras <paulus@samba.org> Signed-off-by: Alexander Graf <agraf@suse.de>
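A minimal illustration of the layering principle this commit describes (fields abbreviated; types and array sizes here are assumptions, not the exact upstream layout):

```c
/* Architectural state of the emulated CPU lives in kvm_vcpu_arch ... */
struct kvm_vcpu_arch {
	struct kvmppc_slb slb[64];	/* emulated SLB entries */
	/* ... */
};

/* ... while auxiliary emulation bookkeeping lives in kvmppc_vcpu_book3s. */
struct kvmppc_vcpu_book3s {
	struct hlist_head hpte_hash_pte[HPTEG_HASH_NUM_PTE];
	struct hlist_head hpte_hash_vpte[HPTEG_HASH_NUM_VPTE];
	struct hlist_head hpte_hash_vpte_long[HPTEG_HASH_NUM_VPTE_LONG];
	/* ... */
};
```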
-
Authored by Paul Mackerras
Commit 69acc0d3ba ("KVM: PPC: Resolve real-mode handlers through function exports") resulted in vcpu->arch.trampoline_lowmem and vcpu->arch.trampoline_enter ending up with kernel virtual addresses rather than physical addresses. This is OK on 64-bit Book3S machines, which ignore the top 4 bits of the effective address in real mode, but on 32-bit Book3S machines, accessing these addresses in real mode causes machine check interrupts, as the hardware uses the whole effective address as the physical address in real mode. This fixes the problem by using __pa() to convert these addresses to physical addresses. Signed-off-by: Paul Mackerras <paulus@samba.org> Signed-off-by: Alexander Graf <agraf@suse.de>
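A minimal sketch of the kind of conversion described, using the field names from the text above (the trampoline symbol names are assumptions, not the exact upstream diff):

```c
/* Store physical addresses, not kernel virtual ones, so that real-mode
 * accesses on 32-bit Book3S hit the intended memory. */
vcpu->arch.trampoline_lowmem = __pa(kvmppc_handler_lowmem_trampoline);
vcpu->arch.trampoline_enter  = __pa(kvmppc_handler_trampoline_enter);
```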
-
Authored by Scott Wood
Only look in the 4 entries that could possibly contain the entry we're looking for. Signed-off-by: Scott Wood <scottwood@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Liu Yu
Dynamically assign host PIDs to guest PIDs, splitting each guest PID into multiple host (shadow) PIDs based on kernel/user and MSR[IS/DS]. Use both PID0 and PID1 so that the shadow PIDs for the right mode can be selected, corresponding both to guest TID = zero and guest TID = guest PID. This allows us to significantly reduce how often the entire TLB needs to be invalidated. When the guest mode or PID changes, we just update the host PID0/PID1. And since the allocation of shadow PIDs is global, multiple guests can share the TLB without conflict. Note that KVM does not yet support the guest setting PID1 or PID2 to a value other than zero. This will need to be fixed for nested KVM to work. Until then, we enforce the requirement for guest PID1/PID2 to stay zero by failing the emulation if the guest tries to set them to something else. Signed-off-by: Liu Yu <yu.liu@freescale.com> Signed-off-by: Scott Wood <scottwood@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Liu Yu
Instead of a fully separate set of TLB entries, keep just the pfn and dirty status. Signed-off-by: Liu Yu <yu.liu@freescale.com> Signed-off-by: Scott Wood <scottwood@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Scott Wood
This is a shared page used for paravirtualization. It is always present in the guest kernel's effective address space at the address indicated by the hypercall that enables it. The physical address specified by the hypercall is not used, as e500 does not have real mode. Signed-off-by: Scott Wood <scottwood@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Scott Wood
This allows large pages to be used on guest mappings backed by things like /dev/mem, resulting in a significant speedup when guest memory is mapped this way (it's useful for directly-assigned MMIO, too). This is not a substitute for hugetlbfs integration, but is useful for configurations where devices are directly assigned on chips without an IOMMU -- in these cases, we need guest physical and true physical to match, and be contiguous, so static reservation and mapping via /dev/mem is the most straightforward way to set things up. Signed-off-by: Scott Wood <scottwood@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Scott Wood
This is in line with what other architectures do, and will allow us to map things other than ordinary, unreserved kernel pages -- such as dedicated devices, or large contiguous reserved regions. Signed-off-by: Scott Wood <scottwood@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Scott Wood
This avoids races. It also means that we use the shadow TLB way, rather than the hardware hint -- if this is a problem, we could do a tlbsx before inserting a TLB0 entry. Signed-off-by: Scott Wood <scottwood@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Scott Wood
Since TLB1 loading doesn't check the shadow TLB before allocating another entry, you can get duplicates. Once shadow PIDs are enabled in a later patch, we won't need to invalidate the TLB on every switch, so this optimization won't be needed anyway. Signed-off-by: Scott Wood <scottwood@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Scott Wood
This is done lazily. The SPE save will be done only if the guest has used SPE since the last preemption or heavyweight exit. Restore will be done only on demand, when enabling MSR_SPE in the shadow MSR, in response to an SPE fault or mtmsr emulation. For SPEFSCR, Linux already switches it on context switch (non-lazily), so the only remaining bit is to save it between qemu and the guest. Signed-off-by: Liu Yu <yu.liu@freescale.com> Signed-off-by: Scott Wood <scottwood@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Scott Wood
Keep the guest MSR and the guest-mode true MSR separate, rather than modifying the guest MSR on each guest entry to produce a true MSR. Any bits which should be modified based on the guest MSR must be explicitly propagated from vcpu->arch.shared->msr to vcpu->arch.shadow_msr in kvmppc_set_msr(). While we're modifying the guest entry code, reorder a few instructions to bury some load latencies. Signed-off-by: Scott Wood <scottwood@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
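A rough sketch of the propagation rule described above (the helper name and the set of propagated bits are assumptions for illustration, not the upstream kvmppc_set_msr()):

```c
/* Bits of the guest MSR that should also appear in the true (shadow) MSR;
 * the mask chosen here is only an example. */
#define EXAMPLE_MSR_FOLLOW_GUEST	(MSR_DE | MSR_SPE)

static void example_set_msr(struct kvm_vcpu *vcpu, u32 new_msr)
{
	vcpu->arch.shared->msr = new_msr;

	/* Explicitly propagate the guest-controlled bits into shadow_msr. */
	vcpu->arch.shadow_msr &= ~EXAMPLE_MSR_FOLLOW_GUEST;
	vcpu->arch.shadow_msr |= new_msr & EXAMPLE_MSR_FOLLOW_GUEST;
}
```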
-
Authored by Alexander Graf
Up until now, Book3S KVM had variables stored in the kernel that a kernel module or the kvm code in the kernel could read from to figure out where some real mode helper functions are located. This is all unnecessary. The high bits of the EA get ignored in real mode, so we can just use the pointer as is. Also, it's a lot easier on relocations when we use the normal way of resolving the address to a function, instead of jumping through hoops. This patch fixes compilation with CONFIG_RELOCATABLE=y. Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Stuart Yoder
When http://www.spinics.net/lists/kvm-ppc/msg02664.html was applied to produce commit b51e7aa7ed6d8d134d02df78300ab0f91cfff4d2, the removal of the conversion in add_exit_timing was left out. Signed-off-by: Stuart Yoder <stuart.yoder@freescale.com> Signed-off-by: Scott Wood <scottwood@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-
- 22 May 2011, 5 commits
-
-
Authored by Scott Wood
Signed-off-by: Scott Wood <scottwood@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Scott Wood
Linux doesn't use USPRG0 (now renamed VRSAVE in the architecture, even when Altivec isn't involved), but a guest might. Signed-off-by: Scott Wood <scottwood@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
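A minimal sketch of the save/restore pattern this implies around guest execution (field and variable names are assumptions, not the exact upstream code):

```c
/* Preserve the host's VRSAVE (USPRG0) across guest execution, since the
 * guest may use the register even though Linux itself does not. */
u32 host_vrsave = mfspr(SPRN_VRSAVE);

mtspr(SPRN_VRSAVE, vcpu->arch.vrsave);	/* load the guest's value */
/* ... enter and run the guest ... */
vcpu->arch.vrsave = mfspr(SPRN_VRSAVE);	/* save the guest's value back */
mtspr(SPRN_VRSAVE, host_vrsave);	/* restore the host value */
```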
-
Authored by Stuart Yoder
Convert to microseconds when displaying (with fix from Bharat Bhushan <Bharat.Bhushan@freescale.com>). This reduces rounding error with large quantities of short exits. Signed-off-by: Stuart Yoder <stuart.yoder@freescale.com> Signed-off-by: Scott Wood <scottwood@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
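A small self-contained illustration of why converting only at display time reduces rounding error when there are many short exits (the timebase rate and per-exit tick count are made-up numbers for the example):

```c
#include <stdio.h>

int main(void)
{
	const unsigned long tb_per_us = 512;		/* assumed timebase ticks per microsecond */
	const unsigned long ticks_per_exit = 700;	/* ~1.37 us per exit (assumption) */
	const unsigned long exits = 1000000;

	/* Converting each sample first truncates the sub-microsecond remainder
	 * a million times; converting the accumulated ticks once does not. */
	unsigned long per_sample_us = exits * (ticks_per_exit / tb_per_us);
	unsigned long at_display_us = (exits * ticks_per_exit) / tb_per_us;

	printf("converted per sample: %lu us\n", per_sample_us);	/* 1000000 */
	printf("converted at display: %lu us\n", at_display_us);	/* 1367187 */
	return 0;
}
```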
-
Authored by Scott Wood
The exit type setting for mfspr/mtspr is moved from 44x to toplevel SPR emulation. This enables it on e500, and makes sure that all SPRs are covered. Exit accounting for tlbwe and tlbsx is added to e500. Signed-off-by: Stuart Yoder <stuart.yoder@freescale.com> Signed-off-by: Scott Wood <scottwood@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Scott Wood
Return the actual host SVR for now, as we already do for PVR. Eventually we may support Qemu overriding PVR/SVR if the situation is appropriate, once we implement KVM_SET_SREGS on e500. Signed-off-by: Scott Wood <scottwood@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-
- 20 May 2011, 2 commits
-
-
Authored by Paul Mackerras
Commits a5d4f3ad ("powerpc: Base support for exceptions using HSRR0/1") and 673b189a ("powerpc: Always use SPRN_SPRG_HSCRATCH0 when running in HV mode") cause compile and link errors for 32-bit classic Book 3S processors when KVM is enabled. This fixes these errors. Signed-off-by: Paul Mackerras <paulus@samba.org> Signed-off-by: Benjamin Herrenschmidt <benh@kernel.crashing.org>
-
Authored by Paul Mackerras
The vcpu->arch.pending_exceptions field is a bitfield indexed by interrupt priority number as returned by kvmppc_book3s_vec2irqprio. However, kvmppc_core_pending_dec was using an interrupt vector shifted by 7 as the bit index. Fix it to use the irqprio value for the decrementer interrupt instead. This problem was found by code inspection. Signed-off-by: Paul Mackerras <paulus@samba.org> Signed-off-by: Benjamin Herrenschmidt <benh@kernel.crashing.org>
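A sketch of the corrected check as described above (not the verbatim upstream code; the irqprio constant name is taken from the Book3S headers and should be read as an assumption here):

```c
/* Test the decrementer's bit by its interrupt priority number, not by
 * the exception vector shifted right by 7. */
static int example_pending_dec(struct kvm_vcpu *vcpu)
{
	return test_bit(BOOK3S_IRQPRIO_DECREMENTER,
			&vcpu->arch.pending_exceptions);
}
```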
-
- 11 May 2011, 1 commit
-
-
Authored by Bharat Bhushan
The following dump is observed on the host when clearing the exit timing counters: [root@p1021mds kvm]# echo -n 'c' > vm1200_vcpu0_timing INFO: task echo:1276 blocked for more than 120 seconds. "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. echo D 0ff5bf94 0 1276 1190 0x00000000 Call Trace: [c2157e40] [c0007908] __switch_to+0x9c/0xc4 [c2157e50] [c040293c] schedule+0x1b4/0x3bc [c2157e90] [c04032dc] __mutex_lock_slowpath+0x74/0xc0 [c2157ec0] [c00369e4] kvmppc_init_timing_stats+0x20/0xb8 [c2157ed0] [c0036b00] kvmppc_exit_timing_write+0x84/0x98 [c2157ef0] [c00b9f90] vfs_write+0xc0/0x16c [c2157f10] [c00ba284] sys_write+0x4c/0x90 [c2157f40] [c000e320] ret_from_syscall+0x0/0x3c The vcpu->mutex is used by kvm_ioctl_* (KVM_RUN etc.), and the same mutex was taken when clearing the stats (in kvmppc_init_timing_stats()). When the guest is idle it holds vcpu->mutex, so the exit timing process waits for the guest to release vcpu->mutex and ends up hung. Now a separate lock is used for the exit timing stats. Signed-off-by: Bharat Bhushan <Bharat.Bhushan@freescale.com> Acked-by: Alexander Graf <agraf@suse.de> Signed-off-by: Avi Kivity <avi@redhat.com>
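A sketch of the approach described (field and helper names are assumptions; the point is only that the statistics get their own lock instead of reusing vcpu->mutex):

```c
/* Clearing the exit-timing counters no longer contends on vcpu->mutex,
 * which a vcpu sitting in KVM_RUN may hold for a long time. */
static void example_init_timing_stats(struct kvm_vcpu *vcpu)
{
	mutex_lock(&vcpu->arch.exit_timing_lock);	/* dedicated lock (assumed name) */
	memset(vcpu->arch.timing_count_type, 0,
	       sizeof(vcpu->arch.timing_count_type));
	mutex_unlock(&vcpu->arch.exit_timing_lock);
}
```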
-
- 20 Apr 2011, 3 commits
-
-
Authored by Paul Mackerras
This uses feature sections to arrange that we always use HSPRG1 as the scratch register in the interrupt entry code rather than SPRG2 when we're running in hypervisor mode on POWER7. This will ensure that we don't trash the guest's SPRG2 when we are running KVM guests. To simplify the code, we define GET_SCRATCH0() and SET_SCRATCH0() macros like the GET_PACA/SET_PACA macros. Signed-off-by: Paul Mackerras <paulus@samba.org> Signed-off-by: Benjamin Herrenschmidt <benh@kernel.crashing.org>
-
Authored by Benjamin Herrenschmidt
Pass the register type to the prolog; also provide an alternate "HV" version of the hardware interrupt (0x500) and adjust LPES accordingly. We tag those interrupts by setting bit 0x2 in the trap number. Signed-off-by: Benjamin Herrenschmidt <benh@kernel.crashing.org>
-
Authored by Benjamin Herrenschmidt
When running in Hypervisor mode (arch 2.06 or later), we store the PACA in HSPRG0 instead of SPRG1. The architecture specifies that SPRGs may be lost during a "nap" power management operation (though they aren't currently on POWER7), and this enables use of SPRG1 by KVM guests. Signed-off-by: Benjamin Herrenschmidt <benh@kernel.crashing.org>
-
- 18 Mar 2011, 1 commit
-
-
Authored by Peter Tyser
Previously SPRGs 4-7 were improperly read and written in kvm_arch_vcpu_ioctl_get_regs() and kvm_arch_vcpu_ioctl_set_regs(). Signed-off-by: Alexander Graf <agraf@suse.de> Signed-off-by: Peter Tyser <ptyser@xes-inc.com> Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
-
- 12 Jan 2011, 2 commits
-
-
Authored by Jan Kiszka
IA64 support forces us to abstract the allocation of the kvm structure. But instead of mixing this up with arch-specific initialization and doing the same on destruction, split both steps. This allows moving generic destruction calls into generic code. It also fixes error clean-up on failures of kvm_create_vm for IA64. Signed-off-by: Jan Kiszka <jan.kiszka@siemens.com> Signed-off-by: Avi Kivity <avi@redhat.com>
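A conceptual sketch of the split (hook names and signatures are assumptions for illustration; the point is that allocation, arch initialization, and freeing become separate steps that generic code can drive on every path):

```c
static struct kvm *example_create_vm(void)
{
	struct kvm *kvm = kvm_arch_alloc_vm();	/* allocation only */
	int r;

	if (!kvm)
		return ERR_PTR(-ENOMEM);

	r = kvm_arch_init_vm(kvm);		/* arch-specific setup, may fail */
	if (r) {
		kvm_arch_free_vm(kvm);		/* generic cleanup works on every error path */
		return ERR_PTR(r);
	}
	return kvm;
}
```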
-
Authored by Takuya Yoshikawa
Let's use the newly introduced vzalloc(). Signed-off-by: Takuya Yoshikawa <yoshikawa.takuya@oss.ntt.co.jp> Signed-off-by: Jesper Juhl <jj@chaosbits.net> Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
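The generic before/after pattern this kind of conversion follows (illustrative, not the specific call site):

```c
/* Before: allocate, then zero by hand. */
ptr = vmalloc(size);
if (ptr)
	memset(ptr, 0, size);

/* After: vzalloc() returns already-zeroed memory. */
ptr = vzalloc(size);
```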
-
- 06 Nov 2010, 4 commits
-
-
Authored by Scott Wood
This was preventing the guest from setting any bits in the hardware MSR which aren't forced on, such as MSR[SPE]. Signed-off-by: Scott Wood <scottwood@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Scott Wood
It is not legal to call mutex_lock() with interrupts disabled. This will assert with debug checks enabled. If there's a real need to disable interrupts here, it could be done after the mutex is acquired -- but I don't see why it's needed at all. Signed-off-by: Scott Wood <scottwood@freescale.com> Reviewed-by: Christian Ehrhardt <ehrhardt@linux.vnet.ibm.com> Signed-off-by: Alexander Graf <agraf@suse.de>
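A minimal illustration of the ordering rule being fixed (generic pattern, not the specific call site):

```c
/* Wrong: a mutex may sleep, which is not allowed with interrupts off. */
local_irq_disable();
mutex_lock(&some_lock);

/* If interrupts really must be disabled, take the mutex first. */
mutex_lock(&some_lock);
local_irq_disable();
```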
-
Authored by Scott Wood
The VCPU uninit calls some TLB functions, and the TLB uninit function frees the memory used by them. Signed-off-by: Scott Wood <scottwood@freescale.com> Acked-by: Liu Yu <yu.liu@freescale.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Vasiliy Kulikov
Structure kvm_ppc_pvinfo is copied to userland with the flags and pad fields uninitialized. This leaks the contents of kernel stack memory. Signed-off-by: Vasiliy Kulikov <segooon@gmail.com> Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
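The generic shape of the fix for this class of leak (a sketch, not the exact upstream diff): zero the whole on-stack structure, padding included, before filling it in and copying it out.

```c
struct kvm_ppc_pvinfo pvinfo;

/* Zero everything first so uninitialized flags/pad bytes never reach
 * user space. */
memset(&pvinfo, 0, sizeof(pvinfo));
/* ... fill in pvinfo.hcall[] ... */
if (copy_to_user(argp, &pvinfo, sizeof(pvinfo)))
	return -EFAULT;
```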
-
- 24 Oct 2010, 8 commits
-
-
Authored by Alexander Graf
The e500_tlb.c file didn't compile for me due to the following error: arch/powerpc/kvm/e500_tlb.c: In function ‘kvmppc_e500_shadow_map’: arch/powerpc/kvm/e500_tlb.c:300: error: format ‘%lx’ expects type ‘long unsigned int’, but argument 2 has type ‘gfn_t’. So let's explicitly cast the argument to make printk happy. Signed-off-by: Alexander Graf <agraf@suse.de>
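One way to silence that class of warning (an illustration, not the verbatim upstream change): cast the gfn explicitly so the format specifier matches regardless of how gfn_t is defined.

```c
/* gfn_t may be wider than unsigned long on 32-bit hosts, so cast for printk. */
printk(KERN_ERR "Couldn't map gfn 0x%llx!\n", (unsigned long long)gfn);
```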
-
Authored by Kyle Moffett
The kvmppc_e500_stlbe_invalidate() function was trying to pass too many parameters to trace_kvm_stlb_inval(). This appears to be a bad copy-paste from a call to trace_kvm_stlb_write(). Signed-off-by: Kyle Moffett <Kyle.D.Moffett@boeing.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Alexander Graf
BookE also wants to support level-based interrupts, so let's implement all the necessary logic there. We need to trick a bit here because the irqprios are 1:1 assigned to architecture-defined values. But since there is some space left there, we can just pick a random one and move it later on - it's internal anyway. Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Alexander Graf
Now that we have all the level interrupt magic in place, let's expose the capability to user space, so it can make use of it! Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Alexander Graf
The current interrupt logic is just completely broken. We get a notification from user space, telling us that an interrupt is there. But then user space expects that we just acknowledge the interrupt once we deliver it to the guest. This is not how real hardware works, though. On real hardware, the interrupt controller pulls the external interrupt line until it gets notified that the interrupt was received. So in reality we have two events: pulling and letting go of the interrupt line. To maintain backwards compatibility, I added a new request for the pulling part. The letting go part was implemented earlier already. With this in place, we can now finally start guests that do not randomly stall and stop working at random times. This patch implements the above logic for Book3S. Signed-off-by: Alexander Graf <agraf@suse.de>
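A sketch of how the two-event model can look on the ioctl side (the constant and helper names mirror the PPC KVM code of that era but should be read as assumptions here, not the exact upstream dispatch):

```c
/* Two distinct events: user space asserts the external interrupt line,
 * and later explicitly de-asserts it, mirroring a real interrupt controller. */
if (irq->irq == KVM_INTERRUPT_SET_LEVEL)
	kvmppc_core_queue_external(vcpu, irq);		/* line pulled */
else if (irq->irq == KVM_INTERRUPT_UNSET)
	kvmppc_core_dequeue_external(vcpu, irq);	/* line released */
```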
-
Authored by Hollis Blanchard
Match only the first part of cur_cpu_spec->platform. 440GP (the first 440 processor) is identified by the string "ppc440gp", while all later 440 processors use simply "ppc440". Signed-off-by: Hollis Blanchard <hollis_blanchard@mentor.com> Signed-off-by: Alexander Graf <agraf@suse.de>
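A sketch of the prefix match described above (an illustrative helper, not the upstream function):

```c
/* "ppc440gp" (the original 440GP) and plain "ppc440" (all later 440 cores)
 * both match on the leading six characters. */
static int example_is_440(void)
{
	return strncmp(cur_cpu_spec->platform, "ppc440", 6) == 0;
}
```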
-
Authored by Hollis Blanchard
Missing local variable. Signed-off-by: Hollis Blanchard <hollis_blanchard@mentor.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-
Authored by Hollis Blanchard
Developers can now tell at a glance the exact type of the premature interrupt, instead of just knowing that there was some premature interrupt. Signed-off-by: Hollis Blanchard <hollis_blanchard@mentor.com> Signed-off-by: Alexander Graf <agraf@suse.de>
-