Commits · 7f33306ee57bce9c79825e89c457a91025aa5aad · openeuler / raspberrypi-kernel

13 Jan, 2010 2 commits

sh: Don't perform an icbi on a P2 address · 6430a598

Committed by Matt Fleming on Jan 13, 2010

The legacy P2 area may not always be mapped (for example when using
PMB). So perform an icbi on an address that we know will always be
mapped.
Signed-off-by: Matt Fleming <matt@console-pimps.org>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: Move over to dynamically allocated FPU context. · 0ea820cf

Committed by Paul Mundt on Jan 13, 2010

This follows the x86 xstate changes and implements a task_xstate slab
cache that is dynamically sized to match one of hard FP/soft FP/FPU-less.

This also tidies up and consolidates some of the SH-2A/SH-4 FPU
fragmentation. Now fpu state restorers are commonly defined, with the
init_fpu()/fpu_init() mess reworked to follow the x86 convention.
The fpu_init() register initialization has been replaced by xstate setup
followed by writing out to hardware via the standard restore path.

As init_fpu() now performs a slab allocation, a secondary lighter-weight
restorer is also introduced for the context switch.

In the future the DSP state will be rolled in here, too.

More work remains for math emulation and the SH-5 FPU, which presently
uses its own special (UP-only) interfaces.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

12 Jan, 2010 6 commits

sh: Always provide thread_info allocators. · cbf6b1ba

Committed by Paul Mundt on Jan 12, 2010

Presently the thread_info allocators are special cased, depending on
THREAD_SHIFT < PAGE_SHIFT. This provides a sensible definition for them
regardless of configuration, in preparation for extended CPU state.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>
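
The special case described above can be pictured as a choice between two allocation strategies. The following is a minimal user-space sketch, assuming (the commit message does not say this) that thread areas smaller than a page come from a sub-page allocator and larger ones from whole pages. All `sketch_` names are hypothetical, and `aligned_alloc()` merely stands in for the kernel allocators.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdlib.h>

/* True when the thread area is smaller than one page. */
static bool sketch_thread_is_subpage(unsigned thread_shift, unsigned page_shift)
{
    return thread_shift < page_shift;
}

/* Page order needed for the thread area (0 means it fits in one page). */
static unsigned sketch_thread_size_order(unsigned thread_shift, unsigned page_shift)
{
    return sketch_thread_is_subpage(thread_shift, page_shift)
               ? 0u
               : thread_shift - page_shift;
}

/* One allocator entry point that exists regardless of configuration. */
static void *sketch_alloc_thread_info(unsigned thread_shift)
{
    size_t size = (size_t)1 << thread_shift;

    assert(thread_shift < 24);        /* keep the sketch's sizes sane */
    return aligned_alloc(size, size); /* size-aligned, like a thread area */
}
```

With 64KB pages and a 4KB thread area, the sub-page path applies. With 4KB pages and an 8KB thread area, the order is 1.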

sh: Move start_thread() out of line. · 70e068ee

Committed by Paul Mundt on Jan 12, 2010

start_thread() will become a bit heavier with the xstate freeing to be
added in, so move it out-of-line in preparation.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: Split out the unaligned counters and user bits. · a99eae54

Committed by Paul Mundt on Jan 12, 2010

This splits out the unaligned access counters and userspace bits in to
their own generic interface, which will allow them to be wired up on sh64
too.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: Consolidate the sh_bios earlyprintk code. · 776258df

Committed by Paul Mundt on Jan 12, 2010

Now that the sh-sci earlyprintk is taken care of by the sh-sci driver
directly, there's no longer any reason for having a split-out
early_printk framework. sh_bios is the only other thing that uses it, so
we just migrate the leftovers in to there. As it's possible to have
multiple early_param()'s for the same string, there's not much point in
having this split out anymore anyways, particularly since the sh_bios
dependencies are still special-cased within sh-sci itself.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: Kill off more unused sh_bios callbacks. · b9303a79

Committed by Paul Mundt on Jan 12, 2010

sh_bios_char_out() is not used by anything in-tree these days, so just
get rid of it.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: Tidy up the sh bios VBR handling. · 191d0d24

Committed by Paul Mundt on Jan 12, 2010

This moves the VBR handling out of the main trap handling code and in to
the sh-bios helper code. A couple of accessors are added in order to
permit other kernel code to get at the VBR value for state save/restore
paths.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

08 Jan, 2010 1 commit

sh: consolidate atomic_cmpxchg()/atomic_add_unless() definitions. · 8c0b8139

Committed by Paul Mundt on Jan 08, 2010

The LL/SC and IRQ versions were using generic stubs while the GRB version
was just reimplementing what it already had for the standard cmpxchg()
code. As we have optimized cmpxchg() implementations that are decoupled
from the atomic code, simply falling back on the generic wrapper does the
right thing. With this in place the GRB case is unaffected while the
LL/SC case gets to use its optimized cmpxchg().
Signed-off-by: Paul Mundt <lethal@linux-sh.org>
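
The idea of a generic wrapper falling back on cmpxchg() can be illustrated in user space. This is only a sketch, not the kernel code: C11 `<stdatomic.h>` stands in for the architecture's optimized cmpxchg(), and the `sketch_` names are hypothetical. It shows an atomic_cmpxchg()-style wrapper and an atomic_add_unless()-style retry loop built on top of it.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical stand-in for the kernel's atomic_t. */
typedef struct { atomic_int counter; } sketch_atomic_t;

/* Returns the value seen before the exchange attempt (cmpxchg convention). */
static int sketch_atomic_cmpxchg(sketch_atomic_t *v, int old, int new_val)
{
    int expected = old;

    assert(v);
    /* On failure, `expected` is overwritten with the current value. */
    atomic_compare_exchange_strong(&v->counter, &expected, new_val);
    return expected;
}

/* Adds `a` to `v` unless `v` equals `u`; returns true if the add happened. */
static bool sketch_atomic_add_unless(sketch_atomic_t *v, int a, int u)
{
    int c = atomic_load(&v->counter);

    while (c != u) {
        int old = sketch_atomic_cmpxchg(v, c, c + a);
        if (old == c)
            return true;  /* our exchange won */
        c = old;          /* someone else changed it; retry with the new value */
    }
    return false;
}
```

Because the add is expressed purely in terms of the compare-and-exchange primitive, any faster primitive underneath benefits the wrapper without further changes.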

05 Jan, 2010 4 commits

sh: Reclaim TIF_DEBUG. · 9fae4fb3

Committed by Paul Mundt on Jan 05, 2010

This was used by the old hw-breakpoints API, but nothing is using it
anymore, so just kill it off.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: Kill off dead UBC headers. · 7025bec9

Committed by Paul Mundt on Jan 05, 2010

Nothing is using these now, so kill them all off.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: Abstracted SH-4A UBC support on hw-breakpoint core. · 4352fc1b

Committed by Paul Mundt on Jan 05, 2010

This is the next big chunk of hw_breakpoint support. This decouples
the SH-4A support from the core and moves it out in to its own stub,
following many of the conventions established with the perf events
layering.

In addition to extending SH-4A support to encapsulate the remainder
of the UBC channels, clock framework support for handling the UBC
interface clock is added as well, allowing for dynamic clock gating.

This also fixes up a regression introduced by the SIGTRAP handling that
broke the ksym_tracer, to the extent that the current support works well
with all of the ksym_tracer/ptrace/kgdb. The kprobes singlestep code will
follow in turn.

With this in place, the remaining UBC variants (SH-2A and SH-4) can now
be trivially plugged in.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: Drop down to a single quicklist. · 0176bd3d

Committed by Paul Mundt on Jan 05, 2010

We previously had 2 quicklists, one for the PGD case and one for PTEs.
Now that the PGD/PMD cases are handled through slab caches due to the
multi-level configurability, only the PTE quicklist remains. As such,
reduce NR_QUICK to its appropriate size and bump down the PTE quicklist
index.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

02 Jan, 2010 2 commits

sh: Move page table allocation out of line · 2a5eacca

Committed by Matt Fleming on Dec 31, 2009

We also switched away from quicklists and instead moved to slab
caches. After benchmarking both implementations the difference is
negligible. The slab caches suit us better though because the size of a
pgd table is just 4 entries when we're using a 3-level page table layout
and quicklists always deal with pages.
Signed-off-by: Matt Fleming <matt@console-pimps.org>

sh: Correct the PTRS_PER_PMD and PMD_SHIFT values · 3f5ab768

Committed by Matt Fleming on Dec 24, 2009

The previous expressions were wrong, which made free_pmd_range() explode
when using anything other than 4KB pages (which is why 8KB and 64KB
pages were disabled with the 3-level page table layout).

The problem was that pmd_offset() was returning an index of non-zero
when it should have been returning 0. This non-zero offset was used to
calculate the address of the pmd table to free in free_pmd_range(),
which ended up trying to free an object that was not aligned on a page
boundary.

Now 3-level page tables should work with 4KB, 8KB and 64KB pages.
Signed-off-by: Matt Fleming <matt@console-pimps.org>
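
To see how mismatched constants can yield a non-zero PMD index where 0 is expected, here is a hedged sketch. The formulas are assumptions made for illustration and are not copied from the patch: 64-bit PTEs, page-sized PTE tables, and each PGD entry spanning 1GB. All `sketch_` names are hypothetical.

```c
#include <assert.h>
#include <stdint.h>

#define SKETCH_PTE_MAGNITUDE 3u  /* log2 of an 8-byte PTE */
#define SKETCH_PGDIR_SHIFT   30u /* assumption: one PGD entry spans 1GB */

/* Address bits consumed below the PMD level: page offset plus PTE index. */
static unsigned sketch_pmd_shift(unsigned page_shift)
{
    return page_shift + (page_shift - SKETCH_PTE_MAGNITUDE);
}

/* Number of PMD entries needed to fill one PGD entry. */
static uint32_t sketch_ptrs_per_pmd(unsigned page_shift)
{
    unsigned pmd_shift = sketch_pmd_shift(page_shift);

    assert(pmd_shift < SKETCH_PGDIR_SHIFT);
    return UINT32_C(1) << (SKETCH_PGDIR_SHIFT - pmd_shift);
}

/* Index into the PMD table for a virtual address. */
static uint32_t sketch_pmd_index(uint32_t addr, unsigned page_shift,
                                 uint32_t ptrs_per_pmd)
{
    return (addr >> sketch_pmd_shift(page_shift)) & (ptrs_per_pmd - 1u);
}
```

Under these assumptions, an address at the start of a PGD entry (for example `1 << 30`) has index 0 with 8KB pages when the matching entry count of 128 is used. If a count of 512, which only suits 4KB pages, were used instead, the same address would produce index 128.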

31 Dec, 2009 1 commit

sh: Remove unused functions · e591a517

Committed by Matt Fleming on Dec 13, 2009

Apply some TLC to the SH64 header files and remove some functions that
are not used anymore.
Signed-off-by: Matt Fleming <matt@console-pimps.org>

29 Dec, 2009 1 commit

sh: Only provide a PCLK definition for legacy CPG CPUs. · 8152a74b

Committed by Paul Mundt on Dec 29, 2009

As CPUs are migrated over to more fully-featured clock frameworks of
their own and off of the legacy CPG code, they no longer have any real
need for defining the PCLK value. The PCLK define in itself is already
fairly misleading, as many boards get their input clocks from different
sources, making this value fairly arbitrary anyways.

Outside of the legacy CPG clock framework, the only place where this
value is used is for deriving CLOCK_TICK_RATE, which we set back to the
legacy PIT value that it was before the PCLK definitions were added in
the first place.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

28 Dec, 2009 1 commit

sh: Convert ptrace to hw_breakpoint API. · 34d0b5af

Committed by Paul Mundt on Dec 28, 2009

This is the initial step for converting singlestep handling via ptrace
over to hw_breakpoints.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

17 Dec, 2009 3 commits

sh: Definitions for 3-level page table layout · 5d9b4b19

Committed by Matt Fleming on Dec 13, 2009

If using 64-bit PTEs and 4K pages then each page table has 512 entries
(as opposed to 1024 entries with 32-bit PTEs). Unlike MIPS, SH follows
the convention that all structures in the page table (pgd_t, pmd_t,
pgprot_t, etc) must be the same size. Therefore, 64-bit PTEs require
64-bit PGD entries, etc. Using 2-levels of page tables and 64-bit PTEs
it is only possible to map 1GB of virtual address space.

In order to map all 4GB of virtual address space we need to adopt a
3-level page table layout. This actually works out better for
CONFIG_SUPERH32 because we only waste 2 PGD entries on the P1 and P2
areas (which are untranslated) instead of 256.
Signed-off-by: Matt Fleming <matt@console-pimps.org>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>
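
The arithmetic behind the 1GB limit can be checked with a few lines of C. This is a sketch with hypothetical `sketch_` helper names. It assumes every table below the top level occupies exactly one page, and it takes the four-entry top-level table from the "Move page table allocation out of line" commit listed on this page.

```c
#include <assert.h>
#include <stdint.h>

/* Number of entries that fit in one page-sized table. */
static uint64_t sketch_entries_per_table(uint64_t page_size, uint64_t entry_size)
{
    assert(entry_size != 0 && page_size % entry_size == 0);
    return page_size / entry_size;
}

/* Virtual address space covered by two levels of page-sized tables. */
static uint64_t sketch_two_level_span(uint64_t page_size, uint64_t entry_size)
{
    uint64_t n = sketch_entries_per_table(page_size, entry_size);

    return n * n * page_size;
}

/* Span once a small top-level table with pgd_entries slots is stacked on top. */
static uint64_t sketch_three_level_span(uint64_t pgd_entries, uint64_t page_size,
                                        uint64_t entry_size)
{
    return pgd_entries * sketch_two_level_span(page_size, entry_size);
}
```

With 4KB pages, 8-byte entries give 512 entries per table and a two-level span of 512 * 512 * 4KB = 1GB, while 4-byte entries give 1024 entries and 4GB. A four-entry top level over the 64-bit layout brings the span back to 4GB.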

sh: Abstract the number of page table levels · b73c8063

Committed by Matt Fleming on Nov 25, 2009

Keep the dimensions of the page tables in a separate header file in
preparation for allowing a three level page table structure.
Signed-off-by: Matt Fleming <matt@console-pimps.org>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: Fix up MAX_DMA_CHANNELS definition when DMA is disabled. · 2f7bb2df

Committed by Paul Mundt on Dec 17, 2009

MAX_DMA_CHANNELS is tested for the total number of channels in order to
populate an IRQ map. Stub this out completely when no DMA support is
enabled -- as used to be the default behaviour before this was
generalized for use by the dmaengine code.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

16 Dec, 2009 1 commit

elf: kill USE_ELF_CORE_DUMP · 698ba7b5

Committed by Christoph Hellwig on Dec 15, 2009

Currently all architectures but microblaze unconditionally define
USE_ELF_CORE_DUMP.  The microblaze omission seems like an error to me, so
let's kill this ifdef and make sure we are the same everywhere.
Signed-off-by: Christoph Hellwig <hch@lst.de>
Acked-by: Hugh Dickins <hugh.dickins@tiscali.co.uk>
Cc: <linux-arch@vger.kernel.org>
Cc: Michal Simek <michal.simek@petalogix.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

15 Dec, 2009 5 commits

locking: Convert raw_rwlock functions to arch_rwlock · e5931943

Committed by Thomas Gleixner on Dec 03, 2009

Name space cleanup for rwlock functions. No functional change.
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Acked-by: Peter Zijlstra <peterz@infradead.org>
Acked-by: David S. Miller <davem@davemloft.net>
Acked-by: Ingo Molnar <mingo@elte.hu>
Cc: linux-arch@vger.kernel.org

locking: Convert raw_rwlock to arch_rwlock · fb3a6bbc

Committed by Thomas Gleixner on Dec 03, 2009

Not strictly necessary for -rt as -rt does not have non sleeping
rwlocks, but it's odd to not have a consistent naming convention.

No functional change.
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Acked-by: Peter Zijlstra <peterz@infradead.org>
Acked-by: David S. Miller <davem@davemloft.net>
Acked-by: Ingo Molnar <mingo@elte.hu>
Cc: linux-arch@vger.kernel.org

locking: Convert __raw_spin* functions to arch_spin* · 0199c4e6

Committed by Thomas Gleixner on Dec 02, 2009

Name space cleanup. No functional change.
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Acked-by: Peter Zijlstra <peterz@infradead.org>
Acked-by: David S. Miller <davem@davemloft.net>
Acked-by: Ingo Molnar <mingo@elte.hu>
Cc: linux-arch@vger.kernel.org

locking: Rename __RAW_SPIN_LOCK_UNLOCKED to __ARCH_SPIN_LOCK_UNLOCKED · edc35bd7

Committed by Thomas Gleixner on Dec 03, 2009

Further name space cleanup. No functional change.
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Acked-by: Peter Zijlstra <peterz@infradead.org>
Acked-by: David S. Miller <davem@davemloft.net>
Acked-by: Ingo Molnar <mingo@elte.hu>
Cc: linux-arch@vger.kernel.org

locking: Convert raw_spinlock to arch_spinlock · 445c8951

Committed by Thomas Gleixner on Dec 02, 2009

The raw_spin* namespace was taken by lockdep for the architecture
specific implementations. raw_spin_* would be the ideal name space for
the spinlocks which are not converted to sleeping locks in preempt-rt.

Linus suggested converting the raw_ locks to arch_ locks and cleaning up
the name space instead of using an artificial name like core_spin,
atomic_spin or whatever.

No functional change.
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Acked-by: Peter Zijlstra <peterz@infradead.org>
Acked-by: David S. Miller <davem@davemloft.net>
Acked-by: Ingo Molnar <mingo@elte.hu>
Cc: linux-arch@vger.kernel.org

14 Dec, 2009 3 commits

sh: Stub in P3 ioremap support for nommu parts. · 0eb37e26

Committed by Paul Mundt on Dec 14, 2009

p3_ioremap() references __ioremap() which is presently undefined on
nommu. This provides a trivial stub to fix the build up.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: wire up vmallocinfo support in ioremap() implementations. · bf3cdeda

Committed by Paul Mundt on Dec 14, 2009

This wires up the caller information for the ioremap VMA, which allows
for more helpful caller tracking via /proc/vmallocinfo. Follows the x86
and powerpc changes of the same nature.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: Couple kernel and user write page perm bits for CONFIG_X2TLB · fcb4ebd6

Committed by Matt Fleming on Dec 11, 2009

pte_write() should check whether the permissions include either the user
or kernel write permission bits. Likewise, pte_wrprotect() needs to
remove both the kernel and user write bits.

Without this patch handle_tlbmiss() doesn't handle faulting in pages
from the P3 area (our vmalloc space) because of a write. Mappings of the
P3 space have the _PAGE_EXT_KERN_WRITE bit but not _PAGE_EXT_USER_WRITE.
Signed-off-by: Matt Fleming <matt@console-pimps.org>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>
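
The coupling of the two write bits can be sketched as follows. The bit positions and the `sketch_` names are placeholders invented for this example. Only the behaviour comes from the commit message: a PTE counts as writable if either bit is set, and write-protecting clears both.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Placeholder bit positions; the real SH values are not reproduced here. */
#define SKETCH_KERN_WRITE (UINT64_C(1) << 0)
#define SKETCH_USER_WRITE (UINT64_C(1) << 1)
#define SKETCH_WRITE_MASK (SKETCH_KERN_WRITE | SKETCH_USER_WRITE)

static_assert((SKETCH_KERN_WRITE & SKETCH_USER_WRITE) == 0,
              "the two write bits must be distinct");

typedef struct { uint64_t bits; } sketch_pte_t;

/* Writable if either the kernel or the user write bit is set. */
static bool sketch_pte_write(sketch_pte_t pte)
{
    return (pte.bits & SKETCH_WRITE_MASK) != 0;
}

/* Write-protect by clearing both write bits together. */
static sketch_pte_t sketch_pte_wrprotect(sketch_pte_t pte)
{
    pte.bits &= ~SKETCH_WRITE_MASK;
    return pte;
}
```

A kernel-only mapping, like the P3 mappings described above, is then reported as writable even though the user write bit is clear.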

12 Dec, 2009 2 commits

sh: move machtypes.h to include/generated · 3252b11f

Committed by Sam Ravnborg on Oct 17, 2009

Signed-off-by: Sam Ravnborg <sam@ravnborg.org>
Cc: Paul Mundt <lethal@linux-sh.org>
Cc: Al Viro <viro@zeniv.linux.org.uk>
Signed-off-by: Michal Marek <mmarek@suse.cz>

kbuild: move asm-offsets.h to include/generated · 559df2e0

Committed by Sam Ravnborg on Apr 19, 2009

The simplest method was to add an extra asm-offsets.h
file in arch/$ARCH/include/asm that references the generated file.

We can now migrate the architectures one-by-one to reference
the generated file directly - and when done we can delete the
temporary arch/$ARCH/include/asm/asm-offsets.h file.
Signed-off-by: Sam Ravnborg <sam@ravnborg.org>
Cc: Al Viro <viro@zeniv.linux.org.uk>
Signed-off-by: Michal Marek <mmarek@suse.cz>

11 Dec, 2009 1 commit

sh: Wire up recvmmsg syscall. · c89fbd39

Committed by Paul Mundt on Dec 11, 2009

The stub already existed in the _64 syscall table, but was lacking a
__NR_recvmmsg definition, while it was absent entirely for _32 variants.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

08 Dec, 2009 1 commit

sh: hw-breakpoints: Add preliminary support for SH-4A UBC. · 09a07294

Committed by Paul Mundt on Nov 09, 2009

This adds preliminary support for the SH-4A UBC to the hw-breakpoints API.
Presently only a single channel is implemented, and the ptrace interface
still needs to be converted. This is the first step to cleaning up the
long-standing UBC mess, making the UBC more generally accessible, and
finally making it SMP safe.

An additional abstraction will be layered on top of this as with the perf
events code to permit the various CPU families to wire up support for
their own specific UBCs, as many variations exist.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

30 Nov, 2009 2 commits

sh: Break out SuperH PFC code · fae43399

Committed by Magnus Damm on Nov 27, 2009

This patch breaks out the SuperH PFC code from
arch/sh/kernel/gpio.c + arch/sh/include/asm/gpio.h
to drivers/sh/pfc.c + include/linux/sh_pfc.h.

Similar to the INTC stuff. The non-SuperH specific
file location makes it possible to share the code
between multiple architectures.
Signed-off-by: Magnus Damm <damm@opensource.se>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: Move KEYSC header file · fc1d003d

Committed by Magnus Damm on Nov 27, 2009

This patch moves the KEYSC header file from the
SuperH specific asm directory to a place where
it can be shared by multiple architectures.
Signed-off-by: Magnus Damm <damm@opensource.se>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

26 Nov, 2009 1 commit

block: add helpers to run flush_dcache_page() against a bio and a request's pages · 2d4dc890

Committed by Ilya Loginov on Nov 26, 2009

The mtdblock driver doesn't call flush_dcache_page() for the pages in a
request. This causes problems on architectures where the icache doesn't
fill from the dcache, or where the dcache has aliases. The patch fixes this.

The ARCH_IMPLEMENTS_FLUSH_DCACHE_PAGE symbol was introduced to avoid
pointless empty cache-thrashing loops on architectures for which
flush_dcache_page() is a no-op. Every architecture was provided with this
symbol: the helpers flush pages on architectures where
ARCH_IMPLEMENTS_FLUSH_DCACHE_PAGE is equal to 1 and do nothing otherwise.

See "fix mtd_blkdevs problem with caches on some architectures" discussion
on LKML for more information.
Signed-off-by: Ilya Loginov <isloginov@gmail.com>
Cc: Ingo Molnar <mingo@elte.hu>
Cc: David Woodhouse <dwmw2@infradead.org>
Cc: Peter Horton <phorton@bitbox.co.uk>
Cc: "Ed L. Cashin" <ecashin@coraid.com>
Signed-off-by: Jens Axboe <jens.axboe@oracle.com>
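
The compile-time switch described above might look roughly like the sketch below. Only the ARCH_IMPLEMENTS_FLUSH_DCACHE_PAGE symbol comes from the commit message. The `sketch_` types and functions are hypothetical stand-ins for the block-layer structures and for flush_dcache_page(), and the default value of 1 is an assumption made so that the example does something visible.

```c
#include <assert.h>
#include <stddef.h>

#ifndef ARCH_IMPLEMENTS_FLUSH_DCACHE_PAGE
#define ARCH_IMPLEMENTS_FLUSH_DCACHE_PAGE 1 /* assumption for this sketch */
#endif

/* Hypothetical page descriptor that just counts flushes. */
struct sketch_page { int flushed; };

/* Stand-in for the architecture's flush_dcache_page(). */
static void sketch_flush_dcache_page(struct sketch_page *page)
{
    page->flushed++;
}

#if ARCH_IMPLEMENTS_FLUSH_DCACHE_PAGE
/* Real work only where the architecture actually needs it. */
static void sketch_flush_pages(struct sketch_page *pages, size_t n)
{
    assert(pages != NULL || n == 0);
    for (size_t i = 0; i < n; i++)
        sketch_flush_dcache_page(&pages[i]);
}
#else
/* Empty inline stub: the caller's loop over pages disappears entirely. */
static inline void sketch_flush_pages(struct sketch_page *pages, size_t n)
{
    (void)pages;
    (void)n;
}
#endif
```

Callers invoke the helper unconditionally. On architectures where the symbol is 0, the stub compiles away, so no loop over the pages is executed at all.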

25 Nov, 2009 1 commit

sh: Fix up the FPU emulation build. · 6ba65383

Committed by Paul Mundt on Nov 25, 2009

Signed-off-by: Paul Mundt <lethal@linux-sh.org>

24 Nov, 2009 2 commits

sh: Minor optimisations to FPU handling · d3ea9fa0

Committed by Stuart Menefy on Sep 25, 2009

A number of small optimisations to FPU handling, in particular:

 - move the task USEDFPU flag from the thread_info flags field (which
   is accessed asynchronously to the thread) to a new status field,
   which is only accessed by the thread itself. This allows locking to
   be removed in most cases, or can be reduced to a preempt_lock().
   This mimics the i386 behaviour.

 - move the modification of regs->sr and thread_info->status flags out
   of save_fpu() to __unlazy_fpu(). This gives the compiler a better
   chance to optimise things, as well as making save_fpu() symmetrical
   with restore_fpu() and init_fpu().

 - implement prepare_to_copy(), so that when creating a thread, we can
   unlazy the FPU prior to copying the thread data structures.

Also make sure that the FPU is disabled while in the kernel, in
particular while booting, and for newly created kernel threads.

In a very artificial benchmark, the execution time for 2500000
context switches was reduced from 50 to 45 seconds.
Signed-off-by: Stuart Menefy <stuart.menefy@st.com>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: Improve performance of SH4 versions of copy/clear_user_highpage · 39ac11c1

Committed by Stuart Menefy on Oct 27, 2009

The previous implementation of clear_user_highpage and copy_user_highpage
checked to see if there was a D-cache aliasing issue between the user
and kernel mappings of a page, but if there was they always did a
flush with writeback on the dirtied kernel alias.

However as we now have the ability to map a page into kernel space
with the same cache colour as the user mapping, there is no need to
write back this data.

Currently we also invalidate the kernel alias as a precaution, however
I'm not sure if this is actually required.

Also correct the definition of FIX_CMAP_END so that the mappings created
by kmap_coherent() are actually at the correct colour.
Signed-off-by: Stuart Menefy <stuart.menefy@st.com>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>
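
The notion of mapping a page into kernel space "with the same cache colour" can be sketched as below. The geometry is an assumption chosen for illustration (4KB pages and a 16KB cache way, hence four colours), and the `sketch_` names and the window base address are hypothetical. This is not the kmap_coherent() implementation.

```c
#include <assert.h>
#include <stdint.h>

#define SKETCH_PAGE_SHIFT 12u /* assumption: 4KB pages */
#define SKETCH_WAY_SHIFT  14u /* assumption: 16KB cache way */
#define SKETCH_NR_COLOURS (1u << (SKETCH_WAY_SHIFT - SKETCH_PAGE_SHIFT))

/* Colour = the page-number bits that also take part in cache indexing. */
static unsigned sketch_cache_colour(uint32_t vaddr)
{
    return (vaddr >> SKETCH_PAGE_SHIFT) & (SKETCH_NR_COLOURS - 1u);
}

/* Pick the slot in a colour-aligned kernel window matching the user colour. */
static uint32_t sketch_coherent_alias(uint32_t window_base, uint32_t user_vaddr)
{
    /* The window must start at colour 0 for the slot arithmetic to hold. */
    assert(sketch_cache_colour(window_base) == 0);
    return window_base +
           ((uint32_t)sketch_cache_colour(user_vaddr) << SKETCH_PAGE_SHIFT);
}
```

Two mappings with the same colour index the same cache lines, which is why no writeback of the kernel alias is needed in that case. It also suggests why the end of the fixmap window has to be placed so that the slots land on the intended colours.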
