提交 · 53255c9a4dade6ff2162121430d13aaadb38a69c · openeuler / raspberrypi-kernel

09 10月, 2014 7 次提交

s390/ftrace: remove 31 bit ftrace support · 53255c9a

由 Heiko Carstens 提交于 10月 07, 2014

31 bit and 64 bit diverge more and more and it is rather painful
to keep both parts running.
To make things simpler just remove the 31 bit support which nobody
uses anyway.
Signed-off-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

53255c9a

s390/kdump: add support for vector extension · a62bc073

由 Michael Holzheu 提交于 10月 06, 2014

With this patch for kdump the s390 vector registers are stored into the
prepared save areas in the old kernel and into the REGSET_VX_LOW and
REGSET_VX_HIGH ELF notes for /proc/vmcore in the new kernel.

The NT_S390_VXRS_LOW note contains the lower halves of the first 16 vector
registers 0-15. The higher halves are stored in the floating point register
ELF note. The NT_S390_VXRS_HIGH contains the full vector registers 16-31.

The kernel provides a save area for storing vector register in case of
machine checks. A pointer to this save are is stored in the CPU lowcore
at offset 0x11b0. This save area is also used to save the registers for
kdump. In case of a dumped crashed kdump those areas are used to extract
the registers of the production system.

The vector registers for remote CPUs are stored using the "store additional
status at address" SIGP. For the dump CPU the vector registers are stored
with the VSTM instruction.

With this patch also zfcpdump stores the vector registers.
Reviewed-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NMichael Holzheu <holzheu@linux.vnet.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

a62bc073

s390/disassembler: add vector instructions · 3585cb02

由 Martin Schwidefsky 提交于 10月 06, 2014

Add the instruction introduced with the vector extension to the in-kernel
disassembler.
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

3585cb02

s390: add support for vector extension · 80703617

由 Martin Schwidefsky 提交于 10月 06, 2014

The vector extension introduces 32 128-bit vector registers and a set of
instruction to operate on the vector registers.

The kernel can control the use of vector registers for the problem state
program with a bit in control register 0. Once enabled for a process the
kernel needs to retain the content of the vector registers on context
switch. The signal frame is extended to include the vector registers.
Two new register sets NT_S390_VXRS_LOW and NT_S390_VXRS_HIGH are added
to the regset interface for the debugger and core dumps.
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

80703617

s390/idle: consolidate idle functions and definitions · b5f87f15

由 Martin Schwidefsky 提交于 10月 01, 2014

Move the C functions and definitions related to the idle state handling
to arch/s390/include/asm/idle.h and arch/s390/kernel/idle.c. The function
s390_get_idle_time is renamed to arch_cpu_idle_time and vtime_stop_cpu to
enabled_wait.
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

b5f87f15

s390/nohz: use a per-cpu flag for arch_needs_cpu · fe0f4976

由 Martin Schwidefsky 提交于 9月 30, 2014

Move the nohz_delay bit from the s390_idle data structure to the
per-cpu flags. Clear the nohz delay flag in __cpu_disable and
remove the cpu hotplug notifier that used to do this.
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

fe0f4976

s390/vtime: do not reset idle data on CPU hotplug · a9b16499

由 Martin Schwidefsky 提交于 10月 01, 2014

The sysfs attributes /sys/devices/system/cpu/cpu0/idle_count and
/sys/devices/system/cpu/cpu0/idle_time_us are reset to zero every
time a CPU is set online. The idle and iowait fields in /proc/stat
corresponding to idle_time_us are not reset. To make things
consistent do not reset the data for the sys attributes.
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

a9b16499

30 9月, 2014 1 次提交

s390/mm: make use of ipte range facility · cfb0b241

由 Heiko Carstens 提交于 9月 23, 2014

Invalidate several pte entries at once if the ipte range facility
is available. Currently this works only for DEBUG_PAGE_ALLOC where
several up to 2 ^ MAX_ORDER may be invalidated at once.
Signed-off-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

cfb0b241

26 9月, 2014 2 次提交

s390/setup: correct 4-level kernel page table detection · 242a112a

由 Martin Schwidefsky 提交于 9月 26, 2014

Fix calculation to decide if a 4-level kernel page table is required.
Git commit c972cc60 "s390/vmalloc: have separate modules area"
added the separate module area which reduces the size of the vmalloc
area but fails to take it into account for the 3 vs 4 level page table
decision.
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

242a112a

s390/topology: call set_sched_topology early · 48e9a6c1

由 Martin Schwidefsky 提交于 9月 24, 2014

The call to topology_init is too late for the set_sched_topology call.
The initial scheduling domain structure has already been established
with default topology array. Use the smp_cpus_done() call to get the
s390 specific topology array registered early enough.

Cc: stable@vger.kernel.org # v3.16+
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

48e9a6c1

25 9月, 2014 10 次提交

s390/uprobes: architecture backend for uprobes · 2a0a5b22

由 Jan Willeke 提交于 9月 22, 2014

Signed-off-by: NJan Willeke <willeke@de.ibm.com>
Signed-off-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

2a0a5b22

s390/uprobes: common library for kprobes and uprobes · 975fab17

由 Jan Willeke 提交于 9月 22, 2014

This patch moves common functions from kprobes.c to probes.c.
Thus its possible for uprobes to use them without enabling kprobes.
Signed-off-by: NJan Willeke <willeke@de.ibm.com>
Signed-off-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

975fab17

s390/rwlock: use the interlocked-access facility 1 instructions · bbae71bf

由 Martin Schwidefsky 提交于 9月 22, 2014

Make use of the load-and-add, load-and-or and load-and-and instructions
to atomically update the read-write lock without a compare-and-swap loop.
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

bbae71bf

s390/rwlock: improve writer fairness · 94232a43

由 Martin Schwidefsky 提交于 9月 22, 2014

Set the write-lock bit in the out-of-line rwlock code to indicate that
a writer is waiting. Additional readers will no be able to get the lock
until at least one writer got the lock. Additional writers have to wait
for the first writer to release the lock again.
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

94232a43

M
s390/rwlock: remove interrupt-enabling rwlock variant. · 2684e73a
由 Martin Schwidefsky 提交于 9月 22, 2014
```
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>
```
2684e73a

s390/mm: remove change bit override support · 6a5c1482

由 Heiko Carstens 提交于 9月 22, 2014

Signed-off-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

6a5c1482

s390/vmemmap: remove memset call from vmemmap_populate() · 70c9d296

由 Heiko Carstens 提交于 9月 20, 2014

If the vmemmap array gets filled with large pages we allocate those
pages with vmemmap_alloc_block(), which returns cleared pages.
Only for single 4k pages we call our own vmem_alloc_pages() which does
not return cleared pages. However we can also call vmemmap_alloc_block()
to allocate the 4k pages.
This way we can also make sure the vmemmap array is cleared after its
population.
Therefore we can remove the memset at the end of the function which
would clear the vmmemmap array a second time on machines which do
support EDAT1.

On very large configurations this can save us several seconds.
Signed-off-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

70c9d296

s390/head.s: use zero as address for stfl · b881dcfb

由 Christian Borntraeger 提交于 9月 19, 2014

The architecture suggests to use address 0 as parameter for stfl,
to allow for future extensions. Using __LC_STFL_FAC_LIST (0x200)
shows which address is used, but might be not future proof.
Signed-off-by: NChristian Borntraeger <borntraeger@de.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

b881dcfb

s390/rwlock: use directed yield for write-locked rwlocks · d59b93da

由 Martin Schwidefsky 提交于 9月 19, 2014

Add an owner field to the arch_rwlock_t to be able to pass the timeslice
of a virtual CPU with diagnose 0x9c to the lock owner in case the rwlock
is write-locked. The undirected yield in case the rwlock is acquired
writable but the lock is read-locked is removed.
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

d59b93da

s390/hmcdrv: HMC drive CD/DVD access · 8f933b10

由 Ralf Hoppe 提交于 4月 08, 2013

This device driver allows accessing a HMC drive CD/DVD-ROM.
It can be used in a LPAR and z/VM environment.
Reviewed-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>
Reviewed-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NRalf Hoppe <rhoppe@de.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

8f933b10

09 9月, 2014 10 次提交

s390/spinlock: optimize spin_unlock code · 44230282

由 Heiko Carstens 提交于 9月 08, 2014

Use a memory barrier + store sequence instead of a load + compare and swap
sequence to unlock a spinlock and an rw lock.
For the spinlock case this saves us two memory reads and a not needed cpu
serialization after the compare and swap instruction stored the new value.

The kernel size (performance_defconfig) gets reduced by ~14k.

Average execution time of a tight inlined spin_unlock loop drops from
5.8ns to 0.7ns on a zEC12 machine.

An artificial stress test case where several counters are protected with
a single spinlock and which are only incremented while holding the spinlock
shows ~30% improvement on a 4 cpu machine.
Signed-off-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

44230282

s390/ftrace: optimize mcount code · 3d1e220d

由 Heiko Carstens 提交于 9月 03, 2014

Reduce the number of executed instructions within the mcount block if
function tracing is enabled. We achieve that by using a non-standard
C function call ABI. Since the called function is also written in
assembler this is not a problem.
This also allows to replace the unconditional store at the beginning
of the mcount block with a larl instruction, which doesn't touch
memory.

In theory we could also patch the first instruction of the mcount block
to enable and disable function tracing. However this would break kprobes.
This could be fixed with implementing the "kprobes_on_ftrace" feature;
however keeping the odd jprobes working seems not to be possible without
a lot of code churn. Therefore keep the code easy and simply accept one
wasted 1-cycle "larl" instruction per function prologue.
Signed-off-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

3d1e220d

s390/kprobes: remove unused jprobe_return_end() · ea2f4769

由 Heiko Carstens 提交于 9月 03, 2014

Even if it has a __used annotation it is actually unused.
Signed-off-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

ea2f4769

s390/ftrace: enforce DYNAMIC_FTRACE if FUNCTION_TRACER is selected · 5d6a0163

由 Heiko Carstens 提交于 8月 15, 2014

We have too many combinations for function tracing. Lets simply stick to
the most advanced option, so we don't have to care of other combinations.

This means we always select DYNAMIC_FTRACE if FUNCTION_TRACER is selected.

In the s390 Makefile also remove CONFIG_FTRACE_SYSCALLS since that
functionality got moved to architecture independent code in the meantime.
Signed-off-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

5d6a0163

s390/ftrace: add HAVE_DYNAMIC_FTRACE_WITH_REGS support · 10dec7db

由 Heiko Carstens 提交于 8月 15, 2014

This code is based on a patch from Vojtech Pavlik.
http://marc.info/?l=linux-s390&m=140438885114413&w=2

The actual implementation now differs significantly:
Instead of adding a second function "ftrace_regs_caller" which would be nearly
identical to the existing ftrace_caller function, the current ftrace_caller
function is now an alias to ftrace_regs_caller and always passes the needed
pt_regs structure and function_trace_op parameters unconditionally.

Besides that also use asm offsets to correctly allocate and access the new
struct pt_regs on the stack.

While at it we can make use of new instruction to get rid of some indirect
loads if compiled for new machines.

The passed struct pt_regs can be changed by the called function and it's new
contents will replace the current contents.

Note: to change the return address the embedded psw member of the pt_regs
structure must be changed. The psw member is right now incomplete, since
the mask part is missing. For all current use cases this should be sufficent.
Providing and restoring a sane mask would mean we need to add an epsw/lpswe
pair to the mcount code. Only these two instruction would cost us ~120 cycles
which currently seems not necessary.

Cc: Vojtech Pavlik <vojtech@suse.cz>
Cc: Jiri Kosina <jkosina@suse.cz>
Cc: Jiri Slaby <jslaby@suse.cz>
Cc: Steven Rostedt <rostedt@goodmis.org>
Signed-off-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

10dec7db

s390/ftrace: optimize function graph caller code · 2481a87b

由 Heiko Carstens 提交于 8月 15, 2014

When the function graph tracer is disabled we can skip three additional
instructions. So let's just do this.

So if function tracing is enabled but function graph tracing is
runtime disabled, we get away with a single unconditional branch.
Signed-off-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

2481a87b

s390: pass march flag to assembly files as well · 0f1b1ff5

由 Heiko Carstens 提交于 8月 14, 2014

Currently the march flag gets only passed to C files, but not to
assembler files.
This means that we can't add new instructions like e.g. aghik to asm
files, since the assembler doesn't know of the new instructions if
the appropriate march flag isn't specified.

So also pass the march flag when compiling assembler files as well.
Signed-off-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

0f1b1ff5

s390/vdso: add vdso support for coarse clocks · b7eacb59

由 Martin Schwidefsky 提交于 8月 29, 2014

Add CLOCK_REALTIME_COARSE and CLOCK_MONOTONIC_COARSE optimization to
the 64-bit and 31-bit vdso.
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

b7eacb59

s390/vdso: replace stck with stcke · 070b7be6

由 Martin Schwidefsky 提交于 8月 29, 2014

If gettimeofday / clock_gettime are called multiple times in a row
the STCK instruction will stall until a difference in the result is
visible. This unnecessarily slows down the vdso calls, use stcke
instead of stck to get rid of the stall.
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

070b7be6

s390: remove unused MACHINE_FLAG_RRBM · b7d5006d

由 Heiko Carstens 提交于 8月 27, 2014

Signed-off-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

b7d5006d

02 9月, 2014 2 次提交

KVM: s390/mm: Fix guest storage key corruption in ptep_set_access_flags · 1951497d

由 Christian Borntraeger 提交于 8月 28, 2014

commit 0944fe3f ("s390/mm: implement software referenced bits")
triggered another paging/storage key corruption. There is an
unhandled invalid->valid pte change where we have to set the real
storage key from the pgste.
When doing paging a guest page might be swapcache or swap and when
faulted in it might be read-only and due to a parallel scan old.
An do_wp_page will make it writeable and young. Due to software
reference tracking this page was invalid and now becomes valid.
Signed-off-by: NChristian Borntraeger <borntraeger@de.ibm.com>
Acked-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>
Cc: stable@vger.kernel.org # v3.12+

1951497d

KVM: s390/mm: Fix storage key corruption during swapping · 3e03d4c4

由 Christian Borntraeger 提交于 8月 28, 2014

Since 3.12 or more precisely  commit 0944fe3f ("s390/mm:
implement software referenced bits") guest storage keys get
corrupted during paging. This commit added another valid->invalid
translation for page tables - namely ptep_test_and_clear_young.
We have to transfer the storage key into the pgste in that case.
Signed-off-by: NChristian Borntraeger <borntraeger@de.ibm.com>
Acked-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>
Cc: stable@vger.kernel.org # v3.12+

3e03d4c4

01 9月, 2014 2 次提交

s390/vdso: remove NULL pointer check from clock_gettime · 5da76157

由 Martin Schwidefsky 提交于 8月 29, 2014

The explicit NULL pointer check on the timespec argument is only
required for clock_getres but not for clock_gettime.
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

5da76157

s390/ipl: Add missing SCSI loadparm attributes to /sys/firmware · 69928601

由 Michael Holzheu 提交于 8月 26, 2014

Currently the loadparm is only supported for CCW IPL. But also for SCSI
IPL it can be specified either on the HMC load panel respectively
z/VM console or via diagnose 308.

So fix this for SCSI and add the required sysfs attributes for reading the
IPL loadparm and for setting the loadparm for re-IPL.

With this patch the following two sysfs attributes are introduced:

 - /sys/firmware/ipl/loadparm (for system that have been IPLed from SCSI)
 - /sys/firmware/reipl/fcp/loadparm

Because the loadparm is now available for SCSI and CCW it is moved
now from "struct ipl_block_ccw" to the generic "struct ipl_list_hdr".
Reviewed-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NMichael Holzheu <holzheu@linux.vnet.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

69928601

30 8月, 2014 1 次提交

kexec: remove CONFIG_KEXEC dependency on crypto · b41d34b4

由 Vivek Goyal 提交于 8月 29, 2014

New system call depends on crypto.  As it did not have a separate config
option, CONFIG_KEXEC was modified to select CRYPTO and CRYPTO_SHA256.

But now previous patch introduced a new config option for new syscall.
So CONFIG_KEXEC does not require crypto.  Remove that dependency.
Signed-off-by: NVivek Goyal <vgoyal@redhat.com>
Cc: Eric Biederman <ebiederm@xmission.com>
Cc: H. Peter Anvin <hpa@zytor.com>
Cc: Shaun Ruffell <sruffell@digium.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

b41d34b4

25 8月, 2014 2 次提交

KVM: s390/mm: try a cow on read only pages for key ops · ab3f285f

由 Christian Borntraeger 提交于 8月 19, 2014

The PFMF instruction handler  blindly wrote the storage key even if
the page was mapped R/O in the host. Lets try a COW before continuing
and bail out in case of errors.
Signed-off-by: NChristian Borntraeger <borntraeger@de.ibm.com>
Reviewed-by: NDominik Dingel <dingel@linux.vnet.ibm.com>
Cc: stable@vger.kernel.org

ab3f285f

KVM: s390: Fix user triggerable bug in dead code · 614a80e4

由 Christian Borntraeger 提交于 8月 06, 2014

In the early days, we had some special handling for the
KVM_EXIT_S390_SIEIC exit, but this was gone in 2009 with commit
d7b0b5eb (KVM: s390: Make psw available on all exits, not
just a subset).

Now this switch statement is just a sanity check for userspace
not messing with the kvm_run structure. Unfortunately, this
allows userspace to trigger a kernel BUG. Let's just remove
this switch statement.
Signed-off-by: NChristian Borntraeger <borntraeger@de.ibm.com>
Reviewed-by: NCornelia Huck <cornelia.huck@de.ibm.com>
Reviewed-by: NDavid Hildenbrand <dahi@linux.vnet.ibm.com>
Cc: stable@vger.kernel.org

614a80e4

12 8月, 2014 3 次提交

s390: wire up memfd_create syscall · 7bb1cdbf

由 Heiko Carstens 提交于 8月 11, 2014

Signed-off-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

7bb1cdbf

s390: add system information as device randomness · bcfcbb6b

由 Martin Schwidefsky 提交于 8月 11, 2014

The virtual-machine cpu information data block and the cpu-id of
the boot cpu can be used as source of device randomness.
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

bcfcbb6b

s390/kdump: Clear subchannel ID to signal non-CCW/SCSI IPL · 852ffd0f

由 Michael Holzheu 提交于 8月 08, 2014

For CCW and SCSI IPL the hardware sets the subchannel ID and number
correctly at 0xb8. For kdump at 0xb8 normally there is the data of
the previously IPLed system.

In order to be clean now for kdump and kexec always set the subchannel
ID and number to zero. This tells the next OS that no CCW/SCSI IPL
has been done.
Reviewed-by: NSebastian Ott <sebott@linux.vnet.ibm.com>
Signed-off-by: NMichael Holzheu <holzheu@linux.vnet.ibm.com>
Signed-off-by: NMartin Schwidefsky <schwidefsky@de.ibm.com>

852ffd0f