提交 · 61305a96fad622ae0f0e78cb06f67ad721d378f9 · openeuler / raspberrypi-kernel

20 9月, 2011 40 次提交

powerpc/powernv: Add support for p5ioc2 PCI-X and PCIe · 61305a96

由 Benjamin Herrenschmidt 提交于 9月 19, 2011

This adds support for PCI-X and PCIe on the p5ioc2 IO hub using
OPAL. This includes allocating & setting up TCE tables and config
space access routines.

This also supports fallbacks via RTAS when OPAL is absent, using
legacy TCE format pre-allocated via the device-tree (BML style)
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

61305a96

powerpc/powernv: Machine check and other system interrupts · ed79ba9e

由 Benjamin Herrenschmidt 提交于 9月 19, 2011

OPAL can handle various interrupt for us such as Machine Checks (it
performs all sorts of recovery tasks and passes back control to us with
informations about the error), Hardware Management Interrupts and Softpatch
interrupts.

This wires up the mechanisms and prints out specific informations returned
by HAL when a machine check occurs.
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

ed79ba9e

powerpc/powernv: Register and handle OPAL interrupts · a125e092

由 Benjamin Herrenschmidt 提交于 9月 19, 2011

We do the minimum which is to "pass" interrupts to HAL, which
makes the console smoother and will allow us to implement
interrupt based completion and console.
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

a125e092

powerpc/powernv: Add OPAL ICS backend · 5c7c1e94

由 Benjamin Herrenschmidt 提交于 9月 19, 2011

OPAL handles HW access to the various ICS or equivalent chips
for us (with the exception of p5ioc2 based HEA which uses a

different backend) similarily to what RTAS does on pSeries.
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

5c7c1e94

powerpc/powernv: Add RTC and NVRAM support plus RTAS fallbacks · 628daa8d

由 Benjamin Herrenschmidt 提交于 9月 19, 2011

Implements OPAL RTC and NVRAM support and wire all that up to
the powernv platform.

We use RTAS for RTC as a fallback if available. Using RTAS for nvram
is not supported yet, pending some rework/cleanup and generalization
of the pSeries & CHRP code. We also use RTAS fallbacks for power off
and reboot
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

628daa8d

powerpc/powernv: Hookup reboot and poweroff functions · ec27329f

由 Benjamin Herrenschmidt 提交于 9月 19, 2011

This calls the respective HAL functions, and spin on hal_poll_event()
to ensure the HAL has a chance to communicate with the FSP to trigger
the reboot or shutdown operation
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

ec27329f

powerpc/powernv: Support for OPAL console · daea1175

由 Benjamin Herrenschmidt 提交于 9月 19, 2011

This adds a udbg and an hvc console backend for supporting a console
using the OPAL console interfaces.

On OPAL v1 we have hvc0 mapped to whatever console the system was
configured for (network or hvsi serial port) via the service
processor.

On OPAL v2 we have hvcN mapped to the Nth console provided by OPAL
which generally corresponds to:

	hvc0 : network console (raw protocol)
	hvc1 : serial port S1 (hvsi)
	hvc2 : serial port S2 (hvsi)

Note: At this point, early debug console only works with OPAL v1
and shouldn't be enabled in a normal kernel.
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

daea1175

powerpc/powernv: Add support for instanciating OPAL v2 from Open Firmware · 6e35d5da

由 Benjamin Herrenschmidt 提交于 9月 19, 2011

OPAL v2 is instantiated in a way similar to RTAS using Open Firmware
client interface calls, and the resulting address and entry point are
put in the device-tree
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

6e35d5da

powerpc/powernv: Basic support for OPAL · 14a43e69

由 Benjamin Herrenschmidt 提交于 9月 19, 2011

Add definition of OPAL interfaces along with  the wrappers to call
into OPAL runtime and the early device-tree parsing hook to locate
the OPAL runtime firmware.
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

14a43e69

powerpc/powernv: Get kernel command line accross OPAL takeover · 817c21ad

由 Benjamin Herrenschmidt 提交于 9月 19, 2011

We stash it in boot_command_line which isn't in BSS and so won't
be overwritten. We then use that as a default cmd_line before
we walk the device-tree.
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

817c21ad

powerpc/powernv: Add OPAL takeover from PowerVM · 27f44888

由 Benjamin Herrenschmidt 提交于 9月 19, 2011

On machines supporting the OPAL firmware version 1, the system
is initially booted under pHyp. We then use a special hypercall
to verify if OPAL is available and if it is, we then trigger
a "takeover" which disables pHyp and loads the OPAL runtime
firmware, giving control to the kernel in hypervisor mode.

This patch add the necessary code to detect that the OPAL takeover
capability is present when running under PowerVM (aka pHyp) and
perform said takeover to get hypervisor control of the processor.

To perform the takeover, we must first use RTAS (within Open
Firmware runtime environment) to start all processors & threads,
in order to give control to OPAL on all of them. We then call
the takeover hypercall on everybody, OPAL will re-enter the kernel
main entry point passing it a flat device-tree.
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

27f44888

powerpc/powernv: Add CPU hotplug support · 344eb010

由 Benjamin Herrenschmidt 提交于 9月 19, 2011

Unplugged CPU go into NAP mode in a loop until woken up
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

344eb010

powerpc: Add skeleton PowerNV platform · 55190f88

由 Benjamin Herrenschmidt 提交于 9月 19, 2011

This adds a skeletton for the new Power "Non Virtualized"
platform which will be used by machines supporting running
without an hypervisor, for example in order to run KVM.

These machines will be using a new firmware called OPAL
for which the support will be provided by later patches.

The PowerNV platform is intended to be also usable under
the BML environment used internally for early CPU bringup
which is why the code also supports using RTAS instead of
OPAL in various places.
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

55190f88

powerpc/powernv: Don't clobber r9 in relative_toc() · e550592e

由 Benjamin Herrenschmidt 提交于 9月 19, 2011

With OPAL, r8 and r9 will be used to pass the OPAL base and entry
for debugging purposes (those informations are also in the
device-tree). We don't want to clobber those registers that
early.
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

e550592e

powerpc/pci: Call pcie_bus_configure_settings() · 781fb7a3

由 Benjamin Herrenschmidt 提交于 9月 19, 2011

This new function is used to properly setup the PCI Express Max Payload Size
(and in some circumstances Max Read Request Size).

Some systems will not operate properly if these aren't set correctly and
the firmware doesn't always do it.
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

781fb7a3

powerpc/smp: More generic support for "soft hotplug" · fb82b839

由 Benjamin Herrenschmidt 提交于 9月 19, 2011

This adds more generic support for doing CPU hotplug with a simple
idle loop and no actual reset of the processors. The generic
smp_generic_kick_cpu() does the hotplug bringup trick if the PACA
shows that the CPU has already been started at boot and we provide
an accessor for the CPU state.
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

fb82b839

powerpc/udbg: Fix Kconfig entry for avoiding 44x early debug with KVM · b8bb922c

由 Benjamin Herrenschmidt 提交于 9月 19, 2011

It was preventing the global early debug selection whenever KVM was enabled
instead of only preventing the 440 specific one.
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

b8bb922c

powerpc: Fix deadlock in icswx code · 8bdafa39

由 Anton Blanchard 提交于 9月 14, 2011

The icswx code introduced an A-B B-A deadlock:

     CPU0                    CPU1
     ----                    ----
lock(&anon_vma->mutex);
                             lock(&mm->mmap_sem);
                             lock(&anon_vma->mutex);
lock(&mm->mmap_sem);

Instead of using the mmap_sem to keep mm_users constant, take the
page table spinlock.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Cc: <stable@kernel.org>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

8bdafa39

powerpc: Fix oops when echoing bad values to /sys/devices/system/memory/probe · a1194097

由 Anton Blanchard 提交于 8月 10, 2011

If we echo an address the hypervisor doesn't like to
/sys/devices/system/memory/probe we oops the box:

# echo 0x10000000000 > /sys/devices/system/memory/probe

kernel BUG at arch/powerpc/mm/hash_utils_64.c:541!

The backtrace is:

create_section_mapping
arch_add_memory
add_memory
memory_probe_store
sysdev_class_store
sysfs_write_file
vfs_write
SyS_write

In create_section_mapping we BUG if htab_bolt_mapping returned
an error. A better approach is to return an error which will
propagate back to userspace.

Rerunning the test with this patch applied:

# echo 0x10000000000 > /sys/devices/system/memory/probe
-bash: echo: write error: Invalid argument
Signed-off-by: NAnton Blanchard <anton@samba.org>
Cc: stable@kernel.org
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

a1194097

powerpc: Coding style cleanups · dfbe93a2

由 Anton Blanchard 提交于 8月 10, 2011

While converting code to use for_each_node_by_type I noticed a
number of coding style issues.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

dfbe93a2

powerpc: Use for_each_node_by_type instead of open coding it · 94db7c5e

由 Anton Blanchard 提交于 8月 10, 2011

Use for_each_node_by_type instead of open coding it.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

94db7c5e

powerpc/numa: Remove double of_node_put in hot_add_node_scn_to_nid · 60831842

由 Anton Blanchard 提交于 8月 10, 2011

During memory hotplug testing, I got the following warning:

ERROR: Bad of_node_put() on /memory@0

of_node_release
kref_put
of_node_put
of_find_node_by_type
hot_add_node_scn_to_nid
hot_add_scn_to_nid
memory_add_physaddr_to_nid
...

of_find_node_by_type() loop does the of_node_put for us so we only
need the handle the case where we terminate the loop early.

As suggested by Stephen Rothwell we can do the of_node_put
unconditionally outside of the loop since of_node_put handles a
NULL argument fine.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Cc: stable@kernel.org
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

60831842

powerpc/numa: Remove duplicate RECLAIM_DISTANCE definition · e377bc5d

由 Anton Blanchard 提交于 7月 24, 2011

We have two identical definitions of RECLAIM_DISTANCE, looks like
the patch got applied twice. Remove one.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

e377bc5d

powerpc/numa: Disable NEWIDLE balancing at node level · 7bebcf09

由 Anton Blanchard 提交于 7月 24, 2011

On big POWER7 boxes we see large amounts of CPU time in system
processes like workqueue and watchdog kernel threads.

We currently rebalance the entire machine each time a task goes
idle and this is very expensive on large machines. Disable newidle
balancing at the node level and rely on the scheduler tick to
rebalance across nodes.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

7bebcf09

powerpc/numa: Increase SD_NODES_PER_DOMAIN to 32. · d4761ad2

由 Anton Blanchard 提交于 7月 24, 2011

The largest POWER7 boxes have 32 nodes. SD_NODES_PER_DOMAIN groups
nodes into chunks of 16 and adds a global balancing domain
(SD_ALLNODES) above it.

If we bump SD_NODES_PER_DOMAIN to 32, then we avoid this extra
level of balancing on our largest boxes.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

d4761ad2

powerpc/numa: Enable SD_WAKE_AFFINE in node definition · a200d8e4

由 Anton Blanchard 提交于 7月 24, 2011

When chasing a performance issue on ppc64, I noticed tasks
communicating via a pipe would often end up on different nodes.

It turns out SD_WAKE_AFFINE is not set in our node defition. Commit
9fcd18c9 (sched: re-tune balancing) enabled SD_WAKE_AFFINE
in the node definition for x86 and we need a similar change for
ppc64.

I used lmbench lat_ctx and perf bench pipe to verify this fix. Each
benchmark was run 10 times and the average taken.

lmbench lat_ctx:

before:  66565 ops/sec
after:  204700 ops/sec

3.1x faster

perf bench pipe:

before: 5.6570 usecs
after:  1.3470 usecs

4.2x faster
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

a200d8e4

powerpc/ps3: Add gelic udbg driver · c26afe9e

由 Hector Martin 提交于 8月 31, 2011

Add a new udbg driver for the PS3 gelic Ehthernet device.

This driver shares only a few stucture and constant definitions with the
gelic Ethernet device driver, so is implemented as a stand-alone driver
with no dependencies on the gelic Ethernet device driver.
Signed-off-by: NHector Martin <hector@marcansoft.com>
Signed-off-by: NAndre Heider <a.heider@gmail.com>
Signed-off-by: NGeoff Levand <geoff@infradead.org>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

c26afe9e

powerpc/eeh: Fix /proc/ppc64/eeh creation · 8feaa434

由 Thadeu Lima de Souza Cascardo 提交于 8月 26, 2011

Since commit 188917e1, /proc/ppc64 is a
symlink to /proc/powerpc/. That means that creating /proc/ppc64/eeh will
end up with a unaccessible file, that is not listed under /proc/powerpc/
and, then, not listed under /proc/ppc64/.

Creating /proc/powerpc/eeh fixes that problem and maintain the
compatibility intended with the ppc64 symlink.
Signed-off-by: NThadeu Lima de Souza Cascardo <cascardo@linux.vnet.ibm.com>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>
Cc: <stable@kernel.org>	[3.x]

8feaa434

powerpc/xics: Add __init to marker icp_native_init() · cf01a404

由 Arnaud Lacombe 提交于 8月 25, 2011

This should fix the following warning:

 LD      arch/powerpc/sysdev/xics/built-in.o
WARNING: arch/powerpc/sysdev/xics/built-in.o(.text+0x1310): Section mismatch in
reference from the function .icp_native_init() to the function
.init.text:.icp_native_init_one_node()
The function .icp_native_init() references
the function __init .icp_native_init_one_node().
This is often because .icp_native_init lacks a __init
annotation or the annotation of .icp_native_init_one_node is wrong.

icp_native_init() is only referenced in `arch/powerpc/sysdev/xics/xics-common.c'
by xics_init() which is itself marked with __init.

= not built-tested =
Reported-by: NTimur Tabi <timur@freescale.com>
Signed-off-by: NArnaud Lacombe <lacombar@gmail.com>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

cf01a404

powerpc/pseries: Avoid spurious error during hotplug CPU add · 9c740025

由 Anton Blanchard 提交于 8月 14, 2011

During hotplug CPU add we get the following error:

Unexpected Error (0) returned from configure-connector

ibm,configure-connector returns 0 for configuration complete, so
catch this and avoid the error.
Signed-off-by: NAnton Blanchard <anton@samba.org>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>
Cc: <stable@kernel.org>

9c740025

powerpc/mm: Fix the call trace when resumed from hibernation · 0330581a

由 Tang Yuantian 提交于 8月 16, 2011

	In SMP mode, the kernel would produce call trace when resumed
	from hibernation. The reason is when the function destroy_context
	is called to drop the resuming mm context, the mm->context.active
	is 1 which is wrong and should be zero.
	We pass the current->active_mm as previous mm context to function
	switch_mmu_context to decrease the context.active by 1.

	In UP mode, there is no effect.
Signed-off-by: NTang Yuantian <b29983@freescale.com>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

0330581a

powerpc/4xx/pci: Add __init annotations for *init_port_hw() functions. · 9c57a32b

由 Tony Breeds 提交于 8月 10, 2011

The various port_init_hw methods of ppc4xx_pciex_hwops should have been
marked __init and when I added ppc4xx_pciex_port_reset_sdr(), which is
__init.  This added many section mismatch warnings like:

WARNING: arch/powerpc/sysdev/built-in.o(.text+0x5c68): Section mismatch in reference from the function ppc440spe_pciex_init_port_hw() to the function .init.text:ppc4xx_pciex_port_reset_sdr()
The function ppc440spe_pciex_init_port_hw() references
the function __init ppc4xx_pciex_port_reset_sdr().
This is often because ppc440spe_pciex_init_port_hw lacks a __init
annotation or the annotation of ppc4xx_pciex_port_reset_sdr is wrong.

Trivial patch to silence those warnings.
Reported-By: NStephen Rothwell <sfr@canb.auug.org.au>
Signed-off-by: NTony Breeds <tony@bakeyournoodle.com>

Yours Tony
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

9c57a32b

powerpc/wsp: Add MSI support for PCI on PowerEN · f9a71e0f

由 Michael Ellerman 提交于 8月 08, 2011

Based on a patch by Michael Ellerman <michael@ellerman.id.au>

Patch was simply forward ported upstream.

Jimi Xenidis <jimix@pobox.com>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

f9a71e0f

powerpc/wsp: Add PCIe Root support to PowerEN/WSP · f352c725

由 Benjamin Herrenschmidt 提交于 8月 08, 2011

Based on a patch by Benjamin Herrenschmidt <benh@kernel.crashing.org>

Modernized and slightly modified to not record erros into the nvram
log since we do not have that device driver just yet.

Jimi Xenidis <jimix@pobox.com>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

f352c725

powerpc/wsp: Fix Wire Speed Processor platform configs · 2fa3d9e5

由 Jimi Xenidis 提交于 8月 08, 2011

Some config selections were applied to the platform (reference board)
when they actuall apply to the chip.
Signed-off-by: NJimi Xenidis <jimix@pobox.com>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

2fa3d9e5

pseries/iommu: Add missing kfree · 7a19081f

由 Julia Lawall 提交于 8月 08, 2011

At this point, window has not been stored anywhere, so it has to be freed
before leaving the function.

A simplified version of the semantic match that finds this problem is as
follows: (http://coccinelle.lip6.fr/)

// <smpl>
@exists@
local idexpression x;
statement S,S1;
expression E;
identifier fl;
expression *ptr != NULL;
@@

x = \(kmalloc\|kzalloc\|kcalloc\)(...);
...
if (x == NULL) S
<... when != x
     when != if (...) { <+...kfree(x)...+> }
     when any
     when != true x == NULL
x->fl
...>
(
if (x == NULL) S1
|
if (...) { ... when != x
               when forall
(
 return \(0\|<+...x...+>\|ptr\);
|
* return ...;
)
}
)
// </smpl>
Signed-off-by: NJulia Lawall <julia@diku.dk>
Acked-by: NNishanth Aravamudan <nacc@us.ibm.com>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

7a19081f

powerpc/32: Pass device tree address as u64 to machine_init · 6dece0eb

由 Scott Wood 提交于 7月 25, 2011

u64 is used rather than phys_addr_t to keep things simple, as
this is called from assembly code.

Update callers to pass a 64-bit address in r3/r4.  Other unused
register assignments that were once parameters to machine_init
are dropped.

For FSL BookE, look up the physical address of the device tree from the
effective address passed in r3 by the loader.  This is required for
situations where memory does not start at zero (due to AMP or IOMMU-less
virtualization), and thus the IMA doesn't start at zero, and thus the
device tree effective address does not equal the physical address.
Signed-off-by: NScott Wood <scottwood@freescale.com>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

6dece0eb

powerpc/nvram: Add compression to fit more oops output into NVRAM · 6c493685

由 Jim Keniston 提交于 7月 25, 2011

Capture more than twice as much text from the printk buffer, and
compress it to fit it in the lnx,oops-log NVRAM partition.  You
can view the compressed text using the new (as of July 20) --unzip
option of the nvram command in the powerpc-utils package.

[BenH: Added select of ZLIB_DEFLATE]
Signed-off-by: NJim Keniston <jkenisto@us.ibm.com>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

6c493685

powerpc: Fix build dependencies for epapr.c which needs libfdt.h · 73927693

由 Matthew McClintock 提交于 7月 19, 2011

Currently, the build can (very rarely) fail to build because libfdt.h has
not been created or is in the process of being copied.
Signed-off-by: NMatthew McClintock <msm@freescale.com>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

73927693

powerpc/mpic: Add support for discontiguous cores · 14b92470

由 Timur Tabi 提交于 7月 08, 2011

There is one place in the MPIC driver that assumes that the cores are numbered
from 0 to n-1.  However, this is not true if the CPUs are not numbered
sequentially.  This can happen on a eight-core SOC where cores two and three
are removed in the device tree.  So instead of blindly looping, we iterate
over the discovered CPUs and use the SMP ID as the index.

This means that we no longer ask the MPIC how many CPUs there are, so
we also delete mpic->num_cpus.

We also catch if the number of CPUs in the SOC exceeds the number that the
MPIC supports.  This should never happen, of course, but it's good to be
sure.
Signed-off-by: NTimur Tabi <timur@freescale.com>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

14b92470