提交 · 7cd01b08d35f1b7d55686ed8cd57c94d3406ec8f · openeuler / Kernel

20 10月, 2018 1 次提交

powerpc: Add support for function error injection · 7cd01b08

由 Naveen N. Rao 提交于 6月 07, 2018

We implement regs_set_return_value() and override_function_with_return()
for this purpose.

On powerpc, a return from a function (blr) just branches to the location
contained in the link register. So, we can just update pt_regs rather
than redirecting execution to a dummy function that returns.
Signed-off-by: NNaveen N. Rao <naveen.n.rao@linux.vnet.ibm.com>
Reviewed-by: NSamuel Mendoza-Jonas <sam@mendozajonas.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

7cd01b08

19 10月, 2018 3 次提交

powerpc/time: Fix clockevent_decrementer initalisation for PR KVM · b4d16ab5

由 Michael Ellerman 提交于 10月 17, 2018

In the recent commit 8b78fdb0 ("powerpc/time: Use
clockevents_register_device(), fixing an issue with large
decrementer") we changed the way we initialise the decrementer
clockevent(s).

We no longer initialise the mult & shift values of
decrementer_clockevent itself.

This has the effect of breaking PR KVM, because it uses those values
in kvmppc_emulate_dec(). The symptom is guest kernels spin forever
mid-way through boot.

For now fix it by assigning back to decrementer_clockevent the mult
and shift values.

Fixes: 8b78fdb0 ("powerpc/time: Use clockevents_register_device(), fixing an issue with large decrementer")
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

b4d16ab5

powerpc/aout: Fix struct user definition to use user_pt_regs · 6ce7bff0

由 Michael Ellerman 提交于 10月 15, 2018

I'm pretty sure this is dead code, it's only used by the a.out core
dump code, and we don't support a.out. We should remove it.

But while it's in the tree it should be using the ABI version of
pt_regs which is called user_pt_regs in the kernel, because the whole
struct is written to the core dump and so its size shouldn't change.

Note this isn't a uapi header so we don't need an ifdef.

Fixes: 002af939 ("powerpc: Split user/kernel definitions of struct pt_regs")
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

6ce7bff0

powerpc/uapi: Fix sigcontext definition to use user_pt_regs · 22a3d03d

由 Michael Ellerman 提交于 10月 15, 2018

My recent patch to split pt_regs between user and kernel missed
the usage in struct sigcontext.

Because this is a user visible struct it should be using the user
visible definition, which when we're building for the kernel is called
struct user_pt_regs.

As far as I can see this hasn't actually caused a bug (yet), because
we don't use the sizeof() the sigcontext->regs anywhere. But we should
still fix it to avoid confusion and future bugs.

Fixes: 002af939 ("powerpc: Split user/kernel definitions of struct pt_regs")
Reported-by: NMadhavan Srinivasan <maddy@linux.vnet.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

22a3d03d

18 10月, 2018 19 次提交

powerpc/io: remove old GCC version implementation · a0e10291

由 Christophe Leroy 提交于 10月 16, 2018

GCC 4.6 is the minimum supported now.
Signed-off-by: NChristophe Leroy <christophe.leroy@c-s.fr>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

a0e10291

powerpc: Add -Werror at arch/powerpc level · 23ad1a27

由 Michael Ellerman 提交于 10月 10, 2018

Back when I added -Werror in commit ba55bd74 ("powerpc: Add
configurable -Werror for arch/powerpc") I did it by adding it to most
of the arch Makefiles.

At the time we excluded math-emu, because apparently it didn't build
cleanly. But that seems to have been fixed somewhere in the interim.

So move the -Werror addition to the top-level of the arch, this saves
us from repeating it in every Makefile and means we won't forget to
add it to any new sub-dirs.
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

23ad1a27

powerpc: Move core kernel logic into arch/powerpc/Kbuild · c47ca98d

由 Michael Ellerman 提交于 10月 10, 2018

This is a nice cleanup, arch/powerpc/Makefile is long and messy so
moving this out helps a little.

It also allows us to do:

  $ make arch/powerpc

Which can be helpful if you just want to compile test some changes to
arch code and not link everything.

Finally it also gives us a single place to do subdir-cc-flags
assignments which affect the whole of arch/powerpc, which we will do
in a future patch.
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

c47ca98d

macintosh/windfarm_smu_sat: Fix debug output · fc0c8b36

由 Benjamin Herrenschmidt 提交于 10月 15, 2018

There's some antiquated debug output that's trying
to do a hand-made hexdump and turning into horrible
1-byte-per-line output these days.

Use print_hex_dump() instead
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

fc0c8b36

powerpc/traps: remove redundant in_interrupt panic in die() · bd03fd84

由 Christophe Leroy 提交于 10月 15, 2018

do_exit() already includes a test to panic() is in_interrupt()

This patch removes powerpc one which is redundant.
Signed-off-by: NChristophe Leroy <christophe.leroy@c-s.fr>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

bd03fd84

powerpc/prom_init: Generate "phandle" instead of "linux, phandle" · f1f208e5

由 Benjamin Herrenschmidt 提交于 10月 15, 2018

When creating the boot-time FDT from an actual Open Firmware live
tree, let's generate "phandle" properties for the phandles instead
of the old deprecated "linux,phandle".
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>
[mpe: Unsplit warning printf()]
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

f1f208e5

powerpc: Check prom_init for disallowed sections · 2c51d97e

由 Benjamin Herrenschmidt 提交于 10月 15, 2018

prom_init.c must not modify the kernel image outside
of the .bss.prominit section. Thus make sure that
prom_init.o doesn't have anything in any of these:

	.data
	.bss
	.init.data
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

2c51d97e

powerpc/prom_init: Move __prombss to it's own section and store it in .bss · 5f69e388

由 Benjamin Herrenschmidt 提交于 10月 15, 2018

This makes __prombss its own section, and for now store
it in .bss.

This will give us the ability later to store it elsewhere
and/or free it after boot (it's about 8KB).
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

5f69e388

B
powerpc/prom_init: Move a few remaining statics to appropriate sections · 8ca2d515
由 Benjamin Herrenschmidt 提交于 10月 15, 2018
```
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>
```
8ca2d515

powerpc/prom_init: Move const structures to __initconst · d00e34b9

由 Benjamin Herrenschmidt 提交于 10月 15, 2018

As they are no longer used past the end of prom_init
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

d00e34b9

powerpc/prom_init: Move ibm_arch_vec to __prombss · a614f52e

由 Benjamin Herrenschmidt 提交于 10月 15, 2018

Make the existing initialized definition constant and copy
it to a __prombss copy
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

a614f52e

powerpc/prom_init: Move prom_radix_disable to __prombss · c886087c

由 Benjamin Herrenschmidt 提交于 10月 15, 2018

Initialize it dynamically instead of statically
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

c886087c

powerpc/prom_init: Remove support for OPAL v2 · 11fdb309

由 Benjamin Herrenschmidt 提交于 10月 15, 2018

We removed support for running under any OPAL version
earlier than v3 in 2015 (they never saw the light of day
anyway), but we kept some leftovers of this support in
prom_init.c, so let's take it out.
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

11fdb309

powerpc/prom_init: Replace __initdata with __prombss when applicable · e63334e5

由 Benjamin Herrenschmidt 提交于 10月 15, 2018

This replaces all occurrences of __initdata for uninitialized
data with a new __prombss

Currently __promdata is defined to be __initdata but we'll
eventually change that.
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

e63334e5

powerpc/pseries: Add driver for PAPR SCM regions · b5beae5e

由 Oliver O'Halloran 提交于 10月 15, 2018

Adds a driver that implements support for enabling and accessing PAPR
SCM regions. Unfortunately due to how the PAPR interface works we can't
use the existing of_pmem driver (yet) because:

 a) The guest is required to use the H_SCM_BIND_MEM h-call to add
    add the SCM region to it's physical address space, and
 b) There is currently no mechanism for relating a bare of_pmem region
    to the backing DIMM (or not-a-DIMM for our case).

Both of these are easily handled by rolling the functionality into a
seperate driver so here we are...
Acked-by: NDan Williams <dan.j.williams@intel.com>
Signed-off-by: NOliver O'Halloran <oohall@gmail.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

b5beae5e

powerpc/pseries: PAPR persistent memory support · 4c5d87db

由 Oliver O'Halloran 提交于 10月 15, 2018

This patch implements support for discovering storage class memory
devices at boot and for handling hotplug of new regions via RTAS
hotplug events.
Signed-off-by: NOliver O'Halloran <oohall@gmail.com>
[mpe: Fix CONFIG_MEMORY_HOTPLUG=n build]
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

4c5d87db

powerpc/traps: fix machine check handlers to use pr_cont() · 422123cc

由 Christophe Leroy 提交于 10月 15, 2018

When printing the machine check cause, the cause appears on the
following line due to bad use of printk without \n:

[   33.663993] Machine check in kernel mode.
[   33.664011] Caused by (from SRR1=9032):
[   33.664036] Data access error at address c90c8000

This patch fixes it by using pr_cont() for the second part:

[  133.258131] Machine check in kernel mode.
[  133.258146] Caused by (from SRR1=9032): Data access error at address c90c8000
Signed-off-by: NChristophe Leroy <christophe.leroy@c-s.fr>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

422123cc

powerpc/book3e: redefine pte_mkprivileged() and pte_mkuser() · bde1a133

由 Christophe Leroy 提交于 10月 17, 2018

Book3e defines both _PAGE_USER and _PAGE_PRIVILEGED, so the nohash
default pte_mkprivileged() and pte_mkuser() are not usable.

This patch redefines them for book3e.

In theorie, only pte_mkprivileged() needs to be redefined because
_PAGE_USER includes _PAGE_PRIVILEGED, but it is less confusing
to redefine both.

Fixes: a0da4bc1 ("powerpc/mm: Allow platforms to redefine some helpers")
Signed-off-by: NChristophe Leroy <christophe.leroy@c-s.fr>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

bde1a133

powerpc/mm: Make pte_pgprot return all pte bits · b9fb4480

由 Aneesh Kumar K.V 提交于 10月 17, 2018

Other archs do the same and instead of adding required pte bits (which
got masked out) in __ioremap_at(), make sure we filter only pfn bits
out.

Fixes: 26973fa5 ("powerpc/mm: use pte helpers in generic code")
Reviewed-by: NChristophe Leroy <christophe.leroy@c-s.fr>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

b9fb4480

14 10月, 2018 17 次提交

powerpc/mm: Increase the max addressable memory to 2PB · 4ffe713b

由 Aneesh Kumar K.V 提交于 9月 20, 2018

Currently we limit the max addressable memory to 128TB. This patch increase the
limit to 2PB. We can have devices like nvdimm which adds memory above 512TB
limit.

We still don't support regular system ram above 512TB. One of the challenge with
that is the percpu allocator, that allocates per node memory and use the max
distance between them as the percpu offsets. This means with large gap in
address space ( system ram above 1PB) we will run out of vmalloc space to map
the percpu allocation.

In order to support addressable memory above 512TB, kernel should be able to
linear map this range. To do that with hash translation we now add 4 context
to kernel linear map region. Our per context addressable range is 512TB. We
still keep VMALLOC and VMEMMAP region to old size. SLB miss handlers is updated
to validate these limit.

We also limit this update to SPARSEMEM_VMEMMAP and SPARSEMEM_EXTREME
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

4ffe713b

powerpc/mm/hash: Rename get_ea_context to get_user_context · c9f80734

由 Aneesh Kumar K.V 提交于 9月 20, 2018

We will be adding get_kernel_context later. Update function name to indicate
this handle context allocation user space address.
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.ibm.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

c9f80734

powerpc/64s/hash: Add some SLB debugging tests · e15a4fea

由 Nicholas Piggin 提交于 10月 03, 2018

This adds CONFIG_DEBUG_VM checks to ensure:
  - The kernel stack is in the SLB after it's flushed and bolted.
  - We don't insert an SLB for an address that is aleady in the SLB.
  - The kernel SLB miss handler does not take an SLB miss.
Signed-off-by: NNicholas Piggin <npiggin@gmail.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

e15a4fea

powerpc/64s/hash: Simplify slb_flush_and_rebolt() · 94ee4272

由 Nicholas Piggin 提交于 10月 03, 2018

slb_flush_and_rebolt() is misleading, it is called in virtual mode, so
it can not possibly change the stack, so it should not be touching the
shadow area. And since vmalloc is no longer bolted, it should not
change any bolted mappings at all.

Change the name to slb_flush_and_restore_bolted(), and have it just
load the kernel stack from what's currently in the shadow SLB area.
Signed-off-by: NNicholas Piggin <npiggin@gmail.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

94ee4272

powerpc/64s/hash: Add a SLB preload cache · 5434ae74

由 Nicholas Piggin 提交于 9月 15, 2018

When switching processes, currently all user SLBEs are cleared, and a
few (exec_base, pc, and stack) are preloaded. In trivial testing with
small apps, this tends to miss the heap and low 256MB segments, and it
will also miss commonly accessed segments on large memory workloads.

Add a simple round-robin preload cache that just inserts the last SLB
miss into the head of the cache and preloads those at context switch
time. Every 256 context switches, the oldest entry is removed from the
cache to shrink the cache and require fewer slbmte if they are unused.

Much more could go into this, including into the SLB entry reclaim
side to track some LRU information etc, which would require a study of
large memory workloads. But this is a simple thing we can do now that
is an obvious win for common workloads.

With the full series, process switching speed on the context_switch
benchmark on POWER9/hash (with kernel speculation security masures
disabled) increases from 140K/s to 178K/s (27%).

POWER8 does not change much (within 1%), it's unclear why it does not
see a big gain like POWER9.

Booting to busybox init with 256MB segments has SLB misses go down
from 945 to 69, and with 1T segments 900 to 21. These could almost all
be eliminated by preloading a bit more carefully with ELF binary
loading.
Signed-off-by: NNicholas Piggin <npiggin@gmail.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

5434ae74

powerpc/64s/hash: Provide arch_setup_exec() hooks for hash slice setup · 425d3314

由 Nicholas Piggin 提交于 9月 15, 2018

This will be used by the SLB code in the next patch, but for now this
sets the slb_addr_limit to the correct size for 32-bit tasks.
Signed-off-by: NNicholas Piggin <npiggin@gmail.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

425d3314

powerpc/64s/hash: Add SLB allocation status bitmaps · 126b11b2

由 Nicholas Piggin 提交于 9月 15, 2018

Add 32-entry bitmaps to track the allocation status of the first 32
SLB entries, and whether they are user or kernel entries. These are
used to allocate free SLB entries first, before resorting to the round
robin allocator.
Signed-off-by: NNicholas Piggin <npiggin@gmail.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

126b11b2

powerpc/64s/hash: Convert SLB miss handlers to C · 48e7b769

由 Nicholas Piggin 提交于 9月 15, 2018

This patch moves SLB miss handlers completely to C, using the standard
exception handler macros to set up the stack and branch to C.

This can be done because the segment containing the kernel stack is
always bolted, so accessing it with relocation on will not cause an
SLB exception.

Arbitrary kernel memory must not be accessed when handling kernel
space SLB misses, so care should be taken there. However user SLB
misses can access any kernel memory, which can be used to move some
fields out of the paca (in later patches).

User SLB misses could quite easily reconcile IRQs and set up a first
class kernel environment and exit via ret_from_except, however that
doesn't seem to be necessary at the moment, so we only do that if a
bad fault is encountered.

[ Credit to Aneesh for bug fixes, error checks, and improvements to
  bad address handling, etc ]
Signed-off-by: NNicholas Piggin <npiggin@gmail.com>
[mpe: Disallow tracing for all of slb.c for now.]
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

48e7b769

powerpc/64: Interrupts save PPR on stack rather than thread_struct · 4c2de74c

由 Nicholas Piggin 提交于 10月 13, 2018

PPR is the odd register out when it comes to interrupt handling, it is
saved in current->thread.ppr while all others are saved on the stack.

The difficulty with this is that accessing thread.ppr can cause a SLB
fault, but the SLB fault handler implementation in C change had
assumed the normal exception entry handlers would not cause an SLB
fault.

Fix this by allocating room in the interrupt stack to save PPR.
Signed-off-by: NNicholas Piggin <npiggin@gmail.com>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

4c2de74c

powerpc/ptrace: Don't use sizeof(struct pt_regs) in ptrace code · 3eeacd9f

由 Michael Ellerman 提交于 10月 13, 2018

Now that we've split the user & kernel versions of pt_regs we need to
be more careful in the ptrace code.

For now we've ensured the location of the fields in both structs is
the same, so most of the ptrace code doesn't need updating.

But there are a few places where we use sizeof(pt_regs), and these
will be wrong as soon as we increase the size of the kernel structure.

So flip them all to use sizeof(user_pt_regs).
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

3eeacd9f

powerpc: Split user/kernel definitions of struct pt_regs · 002af939

由 Michael Ellerman 提交于 10月 12, 2018

We use a shared definition for struct pt_regs in uapi/asm/ptrace.h.
That means the layout of the structure is ABI, ie. we can't change it.

That would be fine if it was only used to describe the user-visible
register state of a process, but it's also the struct we use in the
kernel to describe the registers saved in an interrupt frame.

We'd like more flexibility in the content (and possibly layout) of the
kernel version of the struct, but currently that's not possible.

So split the definition into a user-visible definition which remains
unchanged, and a kernel internal one.

At the moment they're still identical, and we check that at build
time. That's because we have code (in ptrace etc.) that assumes that
they are the same. We will fix that code in future patches, and then
we can break the strict symmetry between the two structs.
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

002af939

powerpc/prom_init: Make "default_colors" const · 7f995d3b

由 Benjamin Herrenschmidt 提交于 5月 31, 2018

It's never modified.
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

7f995d3b

powerpc/prom_init: Make "fake_elf" const · 30c69ca0

由 Benjamin Herrenschmidt 提交于 5月 31, 2018

It is never modified
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

30c69ca0

powerpc/prom_init: Make of_workarounds static · 3bad719b

由 Benjamin Herrenschmidt 提交于 5月 31, 2018

It's not used anywhere else.
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

3bad719b

powerpc/book3s64: Avoid multiple endian conversion in pte helpers · 1b2443a5

由 Christophe Leroy 提交于 10月 09, 2018

In the same spirit as already done in pte query helpers,
this patch changes pte setting helpers to perform endian
conversions on the constants rather than on the pte value.

In the meantime, it changes pte_access_permitted() to use
pte helpers for the same reason.
Signed-off-by: NChristophe Leroy <christophe.leroy@c-s.fr>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

1b2443a5

powerpc/8xx: change name of a few page flags to avoid confusion · ff005525

由 Christophe Leroy 提交于 10月 09, 2018

_PAGE_PRIVILEGED corresponds to the SH bit which doesn't protect
against user access but only disables ASID verification on kernel
accesses. User access is controlled with _PMD_USER flag.

Name it _PAGE_SH instead of _PAGE_PRIVILEGED

_PAGE_HUGE corresponds to the SPS bit which doesn't really tells
that's it is a huge page but only that it is not a 4k page.

Name it _PAGE_SPS instead of _PAGE_HUGE
Reviewed-by: NAneesh Kumar K.V <aneesh.kumar@linux.ibm.com>
Signed-off-by: NChristophe Leroy <christophe.leroy@c-s.fr>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

ff005525

powerpc/mm: Get rid of pte-common.h · 56623153

由 Christophe Leroy 提交于 10月 09, 2018

Do not include pte-common.h in nohash/32/pgtable.h

As that was the last includer, get rid of pte-common.h
Reviewed-by: NAneesh Kumar K.V <aneesh.kumar@linux.ibm.com>
Signed-off-by: NChristophe Leroy <christophe.leroy@c-s.fr>
Signed-off-by: NMichael Ellerman <mpe@ellerman.id.au>

56623153

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功