1. 25 Dec 2015, 1 commit
    • sparc64: fix FP corruption in user copy functions · a7c5724b
      Committed by Rob Gardner
      Short story: Exception handlers used by some copy_to_user() and
      copy_from_user() functions do not diligently clean up floating point
      register usage, and this can result in a user process seeing invalid
      values in floating point registers. This sometimes makes the process
      fail.
      
      Long story: Several cpu-specific (NG4, NG2, U1, U3) memcpy functions
      use floating point registers and VIS alignaddr/faligndata to
      accelerate data copying when source and dest addresses don't align
      well. Linux uses a lazy scheme for saving floating point registers; it
      is not done upon entering the kernel since it's a very expensive
      operation. Rather, it is done only when needed. If the kernel ends up
      not using FP regs during the course of some trap or system call, then
      it can return to user space without saving or restoring them.
      
      The various memcpy functions begin their FP code with VISEntry (or a
      variation thereof), which saves the FP regs. They conclude their FP
      code with VISExit (or a variation) which essentially marks the FP regs
      "clean", ie, they contain no unsaved values. fprs.FPRS_FEF is turned
      off so that a lazy restore will be triggered when/if the user process
      accesses floating point regs again.
      
      The bug is that the user copy variants of memcpy, copy_from_user() and
      copy_to_user(), employ an exception handling mechanism to detect faults
      when accessing user space addresses, and when this handler is invoked,
      an immediate return from the function is forced, and VISExit is not
      executed, thus leaving the fprs register in an indeterminate state,
      but often with fprs.FPRS_FEF set and one or more dirty bits. This
      results in a return to user space with invalid values in the FP regs,
      and since fprs.FPRS_FEF is on, no lazy restore occurs.
      
      This bug affects copy_to_user() and copy_from_user() for NG4, NG2,
      U3, and U1. All are fixed by using a new exception handler for those
      loads and stores that are done during the time between VISEntry and
      VISExit.
      
      n.b. In NG4memcpy, the problematic code can be triggered by a copy
      size greater than 128 bytes and an unaligned source address.  This bug
      is known to be the cause of random user process memory corruptions
      while perf is running with the callgraph option (ie, perf record -g).
      This occurs because perf uses copy_from_user() to read user stacks,
      and may fault when it follows a stack frame pointer off to an
      invalid page. Validation checks on the stack address just obscure
      the underlying problem.
      Signed-off-by: Rob Gardner <rob.gardner@oracle.com>
      Signed-off-by: Dave Aldridge <david.j.aldridge@oracle.com>
      Signed-off-by: David S. Miller <davem@davemloft.net>
      a7c5724b
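      
      A minimal C toy model of the failure mode (an editorial sketch, not
      kernel code; fprs here and the FPRS_FEF/FPRS_DU constants are
      stand-ins for the real sparc64 %fprs register and its bits), showing
      how an early return from a fault handler that skips VISExit leaves
      %fprs enabled and dirty:
      
          #include <stdio.h>
      
          #define FPRS_FEF 0x4   /* FP enable bit, as in sparc64 %fprs */
          #define FPRS_DU  0x2   /* a dirty bit */
      
          static unsigned fprs;  /* models the %fprs register */
      
          static void vis_entry(void) { fprs |= FPRS_FEF | FPRS_DU; } /* save regs, begin FP use */
          static void vis_exit(void)  { fprs &= ~(FPRS_FEF | FPRS_DU); } /* mark regs clean */
      
          /* copy_from_user()-style routine; returns nonzero on fault */
          static int copy_with_fp(int fault)
          {
              vis_entry();
              if (fault)
                  return 1;  /* BUG: the fault path returns without vis_exit() */
              vis_exit();
              return 0;
          }
      
          int main(void)
          {
              copy_with_fp(1);
              /* "Back in userspace": FPRS_FEF is still set, so no lazy restore
               * will fire, and the process sees the kernel's FP register junk. */
              printf("fprs after faulting copy: %#x (expected 0)\n", fprs);
              return 0;
          }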
  2. 08 Aug 2015, 1 commit
  3. 07 Aug 2015, 1 commit
    • sparc64: Fix userspace FPU register corruptions. · 44922150
      Committed by David S. Miller
      If we have a series of events from userspace, with %fprs=FPRS_FEF,
      like follows:
      
      ETRAP
      	ETRAP
      		VIS_ENTRY(fprs=0x4)
      		VIS_EXIT
      		RTRAP (kernel FPU restore with fpu_saved=0x4)
      	RTRAP
      
      We will not restore the user registers that were clobbered by the FPU
      using kernel code in the inner-most trap.
      
      Traps allocate FPU save slots in the thread struct, and FPU using
      sequences save the "dirty" FPU registers only.
      
      This works at the initial trap level because all of the registers
      get recorded into the top-level FPU save area, and we'll return
      to userspace with the FPU disabled so that any FPU use by the user
      will take an FPU disabled trap wherein we'll load the registers
      back up properly.
      
      But this is not how trap returns from kernel to kernel operate.
      
      The simplest fix for this bug is to always save all FPU register state
      for anything other than the top-most FPU save area.
      
      Getting rid of the optimized inner-slot FPU saving code ends up
      making VISEntryHalf degenerate into plain VISEntry.
      
      Longer term we need to do something smarter to reinstate the partial
      save optimizations.  Perhaps the fundamental error is having trap entry
      and exit allocate FPU save slots and restore register state.  Instead,
      the VISEntry et al. calls should be doing that work.
      
      This bug is about two decades old.
      Reported-by: James Y Knight <jyknight@google.com>
      Signed-off-by: David S. Miller <davem@davemloft.net>
      44922150
  4. 27 Jul 2015, 1 commit
  5. 24 Mar 2015, 1 commit
    • sparc64: Fix several bugs in memmove(). · 2077cef4
      Committed by David S. Miller
      Firstly, handle zero length calls properly.  Believe it or not there
      are a few of these happening during early boot.
      
      Next, we can't just drop to a memcpy() call in the forward copy case
      where dst <= src.  The reason is that the cache initializing stores
      used in the Niagara memcpy() implementations can end up clearing out
      cache lines before we've sourced their original contents completely.
      
      For example, considering NG4memcpy, the main unrolled loop begins like
      this:
      
           load   src + 0x00
           load   src + 0x08
           load   src + 0x10
           load   src + 0x18
           load   src + 0x20
           store  dst + 0x00
      
      Assume dst is 64 byte aligned and let's say that dst is src - 8 for
      this memcpy() call.  That store at the end is to the first word of
      the cache line, and the cache initializing store clears the whole
      line, which thus clobbers "src + 0x28" before it even gets loaded.
      
      To avoid this, just fall through to a simple copy only mildly
      optimized for the case where src and dst are 8 byte aligned and the
      length is a multiple of 8 as well.  We could get fancy and call
      GENmemcpy() but this is good enough for how this thing is actually
      used.
      Reported-by: David Ahern <david.ahern@oracle.com>
      Reported-by: Bob Picco <bpicco@meloft.net>
      Signed-off-by: David S. Miller <davem@davemloft.net>
      2077cef4
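      
      A minimal C sketch of such a forward copy (editorial illustration; the
      actual fix is sparc assembly in arch/sparc/lib/), mildly optimized for
      the 8-byte-aligned case and using only plain loads and stores, so no
      cache line is cleared before its old contents have been read:
      
          #include <stddef.h>
          #include <stdint.h>
          #include <stdio.h>
      
          static void *memmove_fwd(void *dst, const void *src, size_t n)
          {
              char *d = dst;
              const char *s = src;
      
              /* 8-byte chunks when both pointers are 8-byte aligned... */
              if ((((uintptr_t)d | (uintptr_t)s) & 7) == 0) {
                  while (n >= 8) {
                      *(uint64_t *)(void *)d = *(const uint64_t *)(const void *)s;
                      d += 8; s += 8; n -= 8;
                  }
              }
              /* ...and single bytes for the tail or the unaligned case. */
              while (n--)
                  *d++ = *s++;
              return dst;
          }
      
          int main(void)
          {
              char buf[32] = "XXXXXXXXabcdefghijklmnop";
              memmove_fwd(buf, buf + 8, 17);   /* overlapping, dst < src */
              printf("%s\n", buf);             /* abcdefghijklmnop */
              return 0;
          }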
  6. 08 Nov 2014, 1 commit
  7. 15 Oct 2014, 1 commit
    • sparc64: Fix FPU register corruption with AES crypto offload. · f4da3628
      Committed by David S. Miller
      The AES loops in arch/sparc/crypto/aes_glue.c use a scheme where the
      key material is preloaded into the FPU registers, and then we loop
      over and over doing the crypt operation, reusing those pre-cooked key
      registers.
      
      There are intervening blkcipher*() calls between the crypt operation
      calls.  And those might perform memcpy() and thus also try to use the
      FPU.
      
      The sparc64 kernel FPU usage mechanism is designed to allow such
      recursive uses, but with a catch.
      
      There has to be a trap between the two FPU using threads of control.
      
      The mechanism works by, when the FPU is already in use by the kernel,
      allocating a slot for FPU saving at trap time.  Then if, within the
      trap handler, we try to use the FPU registers, the pre-trap FPU
      register state is saved into the slot.  Then at trap return time we
      notice this and restore the pre-trap FPU state.
      
      Over the long term there are various more involved ways we can make
      this work, but for a quick fix let's take advantage of the fact that
      the situation where this happens is very limited.
      
      All sparc64 chips that support the crypto instructions also use
      the Niagara4 memcpy routine, and that routine only uses the FPU for
      large copies where we can't get the source aligned properly to a
      multiple of 8 bytes.
      
      We look to see if the FPU is already in use in this context, and if so
      we use the non-large copy path which only uses integer registers.
      
      Furthermore, we also limit this special logic to when we are doing
      kernel copy, rather than a user copy.
      Signed-off-by: David S. Miller <davem@davemloft.net>
      f4da3628
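      
      The shape of the quick fix, sketched in C (editorial illustration with
      hypothetical helper names; the real change is a %fprs check in the
      NG4memcpy assembly preamble):
      
          #include <stdbool.h>
          #include <string.h>
      
          /* Hypothetical stand-ins for the real state, which on sparc64 is
           * the FPRS_FEF bit read from the %fprs register. */
          static bool kernel_fpu_in_use(void) { return true; }
          static void *copy_integer_path(void *d, const void *s, size_t n) { return memcpy(d, s, n); }
          static void *copy_fpu_path(void *d, const void *s, size_t n)     { return memcpy(d, s, n); }
      
          void *ng4_style_memcpy(void *d, const void *s, size_t n, bool user_copy)
          {
              /* For kernel copies only: if the FPU already holds live state
               * (e.g. preloaded AES round keys), avoid the VIS/FPU
               * large-copy path and use integer registers instead. */
              if (!user_copy && kernel_fpu_in_use())
                  return copy_integer_path(d, s, n);
              return copy_fpu_path(d, s, n);
          }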
  8. 10 Sep 2014, 2 commits
  9. 14 Aug 2014, 1 commit
  10. 22 Jul 2014, 1 commit
    • sparc64: update IO access functions in PeeCeeI · 6b8b5507
      Committed by Sam Ravnborg
      The PeeCeeI.c code used in*() + out*() for IO access.
      But these are little-endian accessors, while the native (big)
      endian result was required, which meant byte-swapping the results.
      Shift the code over to use the __raw_*() variants all over.
      
      This simplifies the code as we can drop the calls
      to le16_to_cpu() and le32_to_cpu().
      And it should be a little faster too.
      
      With this change we now use the same type of IO access functions
      throughout the file.
      Signed-off-by: Sam Ravnborg <sam@ravnborg.org>
      Signed-off-by: David S. Miller <davem@davemloft.net>
      6b8b5507
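      
      For context, a small self-contained C illustration (editorial; inw()
      and __raw_readw() themselves are kernel APIs) of why a little-endian
      accessor needs a swap to recover a big-endian device value while a
      raw native read does not:
      
          #include <stdint.h>
          #include <stdio.h>
      
          int main(void)
          {
              /* Fake device register holding 0x1234 in big-endian order. */
              const uint8_t mmio[2] = { 0x12, 0x34 };
      
              /* On a big-endian CPU, a __raw_readw()-style access returns
               * the bytes in memory order: */
              uint16_t raw = (uint16_t)((mmio[0] << 8) | mmio[1]);   /* 0x1234 */
      
              /* inw() is defined to return little-endian data, so the same
               * two bytes come back swapped... */
              uint16_t le = (uint16_t)(mmio[0] | (mmio[1] << 8));    /* 0x3412 */
      
              /* ...and the old code had to swap them back (le16_to_cpu): */
              uint16_t fixed = (uint16_t)((le >> 8) | (le << 8));    /* 0x1234 */
      
              printf("raw=%#x le=%#x fixed=%#x\n", raw, le, fixed);
              return 0;
          }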
  11. 19 Jul 2014, 1 commit
  12. 18 May 2014, 1 commit
  13. 02 May 2014, 1 commit
  14. 13 Nov 2013, 1 commit
    • sparc64: Make PAGE_OFFSET variable. · b2d43834
      Committed by David S. Miller
      Choose PAGE_OFFSET dynamically based upon cpu type.
      
      Original UltraSPARC-I (spitfire) chips only supported a 44-bit
      virtual address space.
      
      Newer chips (T4 and later) support 52-bit virtual addresses
      and up to 47-bits of physical memory space.
      
      Therefore we have to adjust PAGE_OFFSET dynamically based upon
      the capabilities of the chip.
      
      Note that this change alone does not allow us to support > 43-bit
      physical memory, to do that we need to re-arrange our page table
      support.  The current encodings of the pmd_t and pgd_t pointers
      restrict us to "32 + 11" == 43 bits (32-bit table entries holding
      a physical address shifted right by 11 bits).
      
      This change can waste quite a bit of memory for the various tables.
      In particular, a future change should work to size and allocate
      kern_linear_bitmap[] and sparc64_valid_addr_bitmap[] dynamically.
      This isn't easy as we really cannot take a TLB miss when accessing
      kern_linear_bitmap[].  We'd have to lock it into the TLB or similar.
      Signed-off-by: David S. Miller <davem@davemloft.net>
      Acked-by: Bob Picco <bob.picco@oracle.com>
      b2d43834
  15. 06 Sep 2013, 1 commit
  16. 01 May 2013, 1 commit
    • Kconfig: consolidate CONFIG_DEBUG_STRICT_USER_COPY_CHECKS · 446f24d1
      Committed by Stephen Boyd
      The help text for this config is duplicated across the x86, parisc, and
      s390 Kconfig.debug files.  Arnd Bergmann noted that the help text was
      slightly misleading and should be fixed to state that enabling this
      option isn't a problem when using pre-4.4 gcc.
      
      To simplify the rewording, consolidate the text into lib/Kconfig.debug
      and modify it there to be more explicit about when you should say N to
      this config.
      
      Also, make the text a bit more generic by stating that this option
      enables compile time checks so we can cover architectures which emit
      warnings vs.  ones which emit errors.  The details of how an
      architecture decided to implement the checks isn't as important as the
      concept of compile time checking of copy_from_user() calls.
      
      While we're doing this, remove all the copy_from_user_overflow() code
      that's duplicated many times and place it into lib/ so that any
      architecture supporting this option can get the function for free.
      Signed-off-by: Stephen Boyd <sboyd@codeaurora.org>
      Acked-by: Arnd Bergmann <arnd@arndb.de>
      Acked-by: Ingo Molnar <mingo@kernel.org>
      Acked-by: H. Peter Anvin <hpa@zytor.com>
      Cc: Arjan van de Ven <arjan@linux.intel.com>
      Acked-by: Helge Deller <deller@gmx.de>
      Cc: Heiko Carstens <heiko.carstens@de.ibm.com>
      Cc: Stephen Rothwell <sfr@canb.auug.org.au>
      Cc: Chris Metcalf <cmetcalf@tilera.com>
      Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
      Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
      446f24d1
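      
      The consolidated helper that lands in lib/ is tiny; roughly (a sketch
      from memory of the lib/usercopy.c addition, details may differ):
      
          #include <linux/bug.h>
          #include <linux/export.h>
      
          void copy_from_user_overflow(void)
          {
                  WARN(1, "Buffer overflow detected!\n");
          }
          EXPORT_SYMBOL(copy_from_user_overflow);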
  17. 01 Apr 2013, 1 commit
    • sparc/srmmu: clear trailing edge of bitmap properly · 54df2db3
      Committed by Akinobu Mita
      srmmu_nocache_bitmap is cleared by bit_map_init().  But bit_map_init()
      attempts to clear by memset(), so it can't clear the trailing edge of
      the bitmap properly on big-endian architectures if the number of bits
      is not a multiple of BITS_PER_LONG.
      
      Actually, the number of bits in srmmu_nocache_bitmap is not always
      a multiple of BITS_PER_LONG.  It is calculated as below:
      
              bitmap_bits = srmmu_nocache_size >> SRMMU_NOCACHE_BITMAP_SHIFT;
      
      srmmu_nocache_size is decided proportionally by the amount of system RAM
      and it is rounded to a multiple of PAGE_SIZE.  SRMMU_NOCACHE_BITMAP_SHIFT
      is defined as (PAGE_SHIFT - 4).  So it can only be said that bitmap_bits
      is a multiple of 16.
      
      This fixes the problem by using bitmap_clear() instead of memset()
      in bit_map_init(), and also uses BITS_TO_LONGS() to calculate the
      correct size at bitmap allocation time.
      Signed-off-by: Akinobu Mita <akinobu.mita@gmail.com>
      Cc: "David S. Miller" <davem@davemloft.net>
      Cc: sparclinux@vger.kernel.org
      Signed-off-by: David S. Miller <davem@davemloft.net>
      54df2db3
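      
      A self-contained C sketch (editorial; bitmap_clear() and
      BITS_TO_LONGS() are the kernel's versions of these) of why bit-exact
      clearing plus BITS_TO_LONGS() sizing is the safe pattern:
      
          #include <limits.h>
          #include <stdio.h>
          #include <stdlib.h>
          #include <string.h>
      
          #define BITS_PER_LONG (CHAR_BIT * sizeof(long))
          /* Round up to whole longs, as the kernel's BITS_TO_LONGS() does. */
          #define BITS_TO_LONGS(nbits) (((nbits) + BITS_PER_LONG - 1) / BITS_PER_LONG)
      
          int main(void)
          {
              size_t nbits = 80;   /* a multiple of 16, not of BITS_PER_LONG */
              size_t nlongs = BITS_TO_LONGS(nbits);
      
              unsigned long *map = malloc(nlongs * sizeof(*map));
              memset(map, 0xff, nlongs * sizeof(*map));   /* simulate dirty memory */
      
              /* bitmap_clear()-style loop: clears exactly nbits bits.  A
               * byte-granular memset() of nbits/8 bytes clears the wrong
               * bits on big-endian machines, because bit 0 of each
               * unsigned long lives in that long's *last* byte there. */
              for (size_t i = 0; i < nbits; i++)
                  map[i / BITS_PER_LONG] &= ~(1UL << (i % BITS_PER_LONG));
      
              printf("last word after clear: %#lx\n", map[nlongs - 1]);
              free(map);
              return 0;
          }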
  18. 10 Nov 2012, 1 commit
  19. 06 Oct 2012, 1 commit
    • sparc64: Niagara-4 bzero/memset, plus use MRU stores in page copy. · 9f825962
      Committed by David S. Miller
      This adds optimized memset/bzero/page-clear routines for Niagara-4.
      
      We basically can do what powerpc has been able to do for a decade (via
      the "dcbz" instruction), which is use cache line clearing stores for
      bzero and memsets with a 'c' argument of zero.
      
      As long as we make the cache initializing store to each 32-byte
      subblock of the L2 cache line, it works.
      
      As with other Niagara-4 optimized routines, the key is to make sure to
      avoid any usage of the %asi register, as reads and writes to it cost
      at least 50 cycles.
      
      For the user clear cases, we don't use these new routines, we use the
      Niagara-1 variants instead.  Those have to use %asi in an unavoidable
      way.
      
      A Niagara-4 8K page clear costs just under 600 cycles.
      
      Add definitions of the MRU variants of the cache initializing store
      ASIs.  By default, cache initializing stores install the line as Least
      Recently Used.  If we know we're going to use the data immediately
      (which is true for page copies and clears) we can use the Most
      Recently Used variant, to decrease the likelihood of the lines being
      evicted before they get used.
      Signed-off-by: David S. Miller <davem@davemloft.net>
      9f825962
  20. 29 Sep 2012, 1 commit
  21. 28 Sep 2012, 1 commit
  22. 27 Sep 2012, 2 commits
  23. 21 Aug 2012, 1 commit
  24. 27 Jun 2012, 1 commit
  25. 27 May 2012, 1 commit
  26. 25 May 2012, 3 commits
  27. 24 May 2012, 1 commit
    • sparc: Optimize strncpy_from_user() zero byte search. · 4efcac3a
      Committed by David S. Miller
      Compute a mask that will only have 0x80 in the bytes which
      had a zero in them.  The formula is:
      
      	~(((x & 0x7f7f7f7f) + 0x7f7f7f7f) | x | 0x7f7f7f7f)
      
      In the inner word iteration, we have to compute the "x | 0x7f7f7f7f"
      part, so we can reuse that in the above calculation.
      
      Once we have this mask, we perform divide and conquer to find the
      highest 0x80 location.
      Signed-off-by: David S. Miller <davem@davemloft.net>
      4efcac3a
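      
      A runnable C illustration of the formula (editorial; the commit's real
      code is 64-bit sparc assembly, with the constant widened to eight
      0x7f bytes):
      
          #include <stdint.h>
          #include <stdio.h>
      
          /* 0x80 ends up only in the bytes of x that were zero: per byte,
           * (b & 0x7f) + 0x7f sets the high bit for any nonzero low bits,
           * OR-ing in x covers b == 0x80, and the complement keeps only
           * the high bits of all-zero bytes. */
          static uint64_t zero_byte_mask(uint64_t x)
          {
              const uint64_t m = 0x7f7f7f7f7f7f7f7fULL;
              return ~(((x & m) + m) | x | m);
          }
      
          int main(void)
          {
              /* "abc\0..." with the string's first byte in the MSB, as on
               * big-endian sparc. */
              uint64_t x = 0x6162630065666768ULL;
              uint64_t mask = zero_byte_mask(x);
      
              /* Divide and conquer for the highest set bit: the highest
               * 0x80 marks the first NUL byte in the word. */
              int hi = 0;
              for (int shift = 32; shift; shift >>= 1)
                  if (mask >> (hi + shift))
                      hi += shift;
      
              printf("mask=%#018llx, first NUL is byte %d from the MSB end\n",
                     (unsigned long long)mask, 7 - hi / 8);
              return 0;
          }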
  28. 23 May 2012, 1 commit
  29. 20 May 2012, 2 commits
  30. 16 May 2012, 1 commit
  31. 14 May 2012, 1 commit
  32. 12 May 2012, 3 commits
  33. 02 Feb 2012, 1 commit