Commits · 44658dfb660c4744b862571a3d8e1dae65e8b75a · openeuler / raspberrypi-kernel

24 Nov, 2009 2 commits

sh: Minor optimisations to FPU handling · d3ea9fa0

Authored by Stuart Menefy on Sep 25, 2009

A number of small optimisations to FPU handling, in particular:

 - move the task USEDFPU flag from the thread_info flags field (which
   is accessed asynchronously to the thread) to a new status field,
   which is only accessed by the thread itself. This allows locking to
   be removed in most cases, or can be reduced to a preempt_lock().
   This mimics the i386 behaviour.

 - move the modification of regs->sr and thread_info->status flags out
   of save_fpu() to __unlazy_fpu(). This gives the compiler a better
   chance to optimise things, as well as making save_fpu() symmetrical
   with restore_fpu() and init_fpu().

 - implement prepare_to_copy(), so that when creating a thread, we can
   unlazy the FPU prior to copying the thread data structures.

Also make sure that the FPU is disabled while in the kernel, in
particular while booting, and for newly created kernel threads,

In a very artificial benchmark, the execution time for 2500000
context switches was reduced from 50 to 45 seconds.
Signed-off-by: Stuart Menefy <stuart.menefy@st.com>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: add sleazy FPU optimization · a0458b07

Authored by Giuseppe CAVALLARO on Jul 07, 2009

SH port of the sLeAZY-fpu feature currently implemented for some architectures
such as i386.

Right now the SH kernel has a 100% lazy fpu behaviour.
This is of course great for applications that have very sporadic or no FPU use.
However for very frequent FPU users...  you take an extra trap every context
switch.
The patch below adds a simple heuristic to this code: after 5 consecutive
context switches of FPU use, the lazy behavior is disabled and the context
gets restored every context switch.
After 256 switches, this is reset and the 100% lazy behavior is returned.
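
The heuristic described above could be modelled roughly as follows. This is a user-space sketch, not the patch itself: `struct task`, `account_switch_out()`, `restore_eagerly()` and `EAGER_THRESHOLD` are hypothetical names, and using an 8-bit counter whose overflow provides the reset after 256 switches is an assumption about how such a reset might be obtained.

```c
#include <stdbool.h>

/* Hypothetical, simplified per-task state; not the kernel's task_struct. */
struct task {
    unsigned char fpu_counter; /* consecutive switches with FPU use; wraps at 256 */
    bool used_fpu;             /* did the task touch the FPU in its last slice? */
};

#define EAGER_THRESHOLD 5      /* assumed from "after 5 consecutive context switches" */

/* Called when the task is switched out. */
static void account_switch_out(struct task *t)
{
    if (t->used_fpu)
        t->fpu_counter++;      /* unsigned char: 255 + 1 wraps back to 0 */
    else
        t->fpu_counter = 0;    /* streak broken, back to fully lazy */
    t->used_fpu = false;
}

/* Called when the task is switched in: restore FPU state up front? */
static bool restore_eagerly(const struct task *t)
{
    return t->fpu_counter > EAGER_THRESHOLD;
}
```

With this model a task that uses the FPU in every time slice stops taking the "FPU disabled" trap once its streak exceeds the threshold, and falls back to the lazy path when the counter wraps.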

Tests with LMbench showed no regression.
I saw a little improvement due to the prefetching (~2%).

The tests below also show that, with this sLeAZY patch, the number of
FPU exceptions is indeed reduced.
To test this, I hacked the lat_ctx LMbench test to use the FPU a little more.

   sLeAZY implementation
   ===========================================
   switch_to calls             |  79326
   sleazy calls                |  42577
   do_fpu_state_restore calls  |  59232
   restore_fpu calls           |  59032

   Exceptions: 0x800 (FPU disabled): 16604

   100% lazy (default implementation)
   ===========================================
   switch_to calls             |  79690
   do_fpu_state_restore calls  |  53299
   restore_fpu calls           |  53101

   Exceptions: 0x800 (FPU disabled): 53273
Signed-off-by: Giuseppe Cavallaro <peppe.cavallaro@st.com>
Signed-off-by: Stuart Menefy <stuart.menefy@st.com>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

12 Nov, 2009 1 commit

sh: Fix up the CONFIG_PERF_EVENTS=n build for SH-4. · c4e708dc

Authored by Paul Mundt on Nov 12, 2009

Signed-off-by: Paul Mundt <lethal@linux-sh.org>

05 Nov, 2009 1 commit

sh: perf events: Add support for SH7750-style counters. · 1d823323

Authored by Paul Mundt on Nov 05, 2009

This adds perf events support for the SH7750/SH7750S/SH7091 performance
counters.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

27 Aug, 2009 1 commit

sh: fix CPU_SH7723/7724 numbering bug · b37c7c66

Authored by Kuninori Morimoto on Aug 26, 2009

Signed-off-by: Kuninori Morimoto <morimoto.kuninori@renesas.com>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

21 Aug, 2009 1 commit

sh: Add initial support for SH7757 CPU subtype · c01f0f1a

Authored by Yoshihiro Shimoda on Aug 21, 2009

Signed-off-by: Yoshihiro Shimoda <shimoda.yoshihiro@renesas.com>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

15 Aug, 2009 2 commits

sh: Kill off the unhandled pvr case in SH-4 CPU probing. · eccee745

Authored by Paul Mundt on Aug 15, 2009

This is superfluous, as the default CPU type and family are already
established by the initial cpuinfo definition. Given that we are still
able to probe for the CPU family even if we are not able to detect the
subtype, it's preferable to let the probing code fill out what it can and
leave the rest.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: Track the CPU family in sh_cpuinfo. · e82da214

Authored by Paul Mundt on Aug 15, 2009

This adds a family member to struct sh_cpuinfo, which allows us to fall
back more on the probe routines to work out what sort of subtype we are
running on. This will be used by the CPU cache initialization code in
order to first do family-level initialization, followed by subtype-level
optimizations.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

13 Aug, 2009 1 commit

sh: convert processor device setup functions to arch_initcall() · ba9a6337

Authored by Magnus Damm on Jul 22, 2009

Convert the processor platform device setup
functions from __initcall() and sometimes
device_initcall() to arch_initcall().

This makes sure that the platform devices are
registered a bit earlier so the devices are
available when drivers register using initcall
levels earlier than device_initcall().

A good example is platform devices needed by
i2c-sh_mobile.c which registers a bit earlier
using subsys_initcall().
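
The ordering argument can be illustrated with a small user-space model. This is only a sketch: `struct initcall`, `run_initcalls()` and the entry names are hypothetical, while the relative order of the levels (arch_initcall() before subsys_initcall() before device_initcall()) follows the usual kernel initcall ordering.

```c
#include <stddef.h>

/* Initcall levels in execution order (lower runs first). */
enum initcall_level {
    LEVEL_ARCH   = 3,   /* arch_initcall() */
    LEVEL_SUBSYS = 4,   /* subsys_initcall() */
    LEVEL_DEVICE = 6    /* device_initcall(), which __initcall() maps to */
};

/* Hypothetical record of one registered initcall. */
struct initcall {
    enum initcall_level level;
    const char *name;
};

/*
 * Model of the boot-time walk: visit the levels in ascending order and
 * write the names of the calls, in the order they would run, into out[].
 * out[] must have room for n entries.
 */
static void run_initcalls(const struct initcall *calls, size_t n,
                          const char **out)
{
    size_t pos = 0;

    for (int level = 0; level <= 7; level++)
        for (size_t i = 0; i < n; i++)
            if ((int)calls[i].level == level)
                out[pos++] = calls[i].name;
}
```

In this model a driver registered at the subsys level runs before platform devices registered at the device level, but after platform devices registered at the arch level, which is the situation the conversion is meant to produce.
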
Signed-off-by: Magnus Damm <damm@igel.co.jp>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

23 Jul, 2009 1 commit

sh: convert processor device setup functions to arch_initcall() · 955c9863

Authored by Magnus Damm on Jul 22, 2009

Convert the processor platform device setup
functions from __initcall() and sometimes
device_initcall() to arch_initcall().

This makes sure that the platform devices are
registered a bit earlier so the devices are
available when drivers register using initcall
levels earlier than device_initcall().

A good example is platform devices needed by
i2c-sh_mobile.c which registers a bit earlier
using subsys_initcall().
Signed-off-by: Magnus Damm <damm@igel.co.jp>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

01 Jun, 2009 3 commits

sh: Tidy up SH-4A boot_cpu_data.flags probing. · 0bf8513e

Authored by Paul Mundt on Jun 01, 2009

This tidies up the boot_cpu_data.flags probing on SH-4A. All of them have
a few things in common, which we can blindly set, rather than having each
subtype have to set the same flags. We can also make assumptions about
cache ways and the validity of PTEA, so this also kills off CPU_HAS_PTEA
as a config option. There was also a bug in the FPU probing, which is now
tidied up.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: Tidy up the optional L2 probing, wire it up for SH7786. · 7863d3f7

Authored by Paul Mundt on Jun 01, 2009

This tidies up the L2 probing, as it may or may not be implemented on a
CPU, regardless of whether it is supported. This converts the cvr
validity checks from BUG_ON()'s to simply clearing the CPU_HAS_L2_CACHE
flag and moving on with life.
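
The change in failure handling might look roughly like the sketch below. Everything specific here is an assumption made for illustration: `struct cpuinfo` is a reduced stand-in for the real CPU info structure, the bit value of `CPU_HAS_L2_CACHE` is arbitrary, and the decoding of `cvr` (a 4-bit size code where zero means "not implemented") is hypothetical, since the commit message does not give the register layout.

```c
#include <stdint.h>

#define CPU_HAS_L2_CACHE (1u << 0)   /* hypothetical bit position */

/* Hypothetical, reduced stand-in for the CPU info structure. */
struct cpuinfo {
    unsigned int flags;
    unsigned int l2_size;   /* bytes; 0 when no usable L2 was found */
};

/*
 * Hypothetical decode: treat the low 4 bits of cvr as a size code and
 * regard a zero code as "L2 not implemented on this part".
 */
static void probe_l2(struct cpuinfo *c, uint32_t cvr)
{
    unsigned int code = cvr & 0xf;

    if (!(c->flags & CPU_HAS_L2_CACHE))
        return;                          /* subtype never claims an L2 */

    if (code == 0) {
        /* Formerly a fatal assertion; now drop the flag and carry on. */
        c->flags &= ~CPU_HAS_L2_CACHE;
        c->l2_size = 0;
        return;
    }

    c->l2_size = (128u * 1024u) << (code - 1);   /* hypothetical sizing */
}
```

The point of the sketch is only the control flow: an invalid or absent L2 description downgrades the CPU's capabilities instead of halting the boot.
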
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: SH7724 has an L2 cache. · 98fbe45b

Authored by Kuninori Morimoto on May 29, 2009

Add the CPU_HAS_L2_CACHE flag to SH7724.
Signed-off-by: Kuninori Morimoto <morimoto.kuninori@renesas.com>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

13 May, 2009 2 commits

sh: clkfwk: Rework legacy CPG clock handling. · 253b0887

Authored by Paul Mundt on May 13, 2009

This moves out the old legacy CPG clocks to their own file, and converts
over the existing users. With these clocks going away and each CPU
dealing with them on their own, CPUs can gradually move over to the new
interface.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: clkfwk: module_clk -> peripheral_clk rename. · af777ce4

Authored by Paul Mundt on May 13, 2009

For consistent naming, and to allow us to fix up some confusion in the
SH-Mobile clock framework, amongst other places.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

12 May, 2009 6 commits

sh: add sh4-202 INTC tables · 67d889bd

Authored by Magnus Damm on May 12, 2009

This patch adds INTC tables for sh4-202 with support
for HUDI, TMU0, TMU1, TMU2, RTC, SCIF and WDT.
Signed-off-by: Magnus Damm <damm@igel.co.jp>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: TMU platform data for sh4-202 · 5f8a29ba

Authored by Magnus Damm on May 12, 2009

This patch adds TMU platform data for sh4-202. Both clockevent
and clocksource support is enabled.
Signed-off-by: Magnus Damm <damm@igel.co.jp>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: clkfwk: Use arch_clk_init() for on-chip clock registration. · 9fe5ee0e

Authored by Paul Mundt on May 12, 2009

CPUs registering on-chip clocks should be using arch_clk_init() with the
new scheme so that the CPUs have the opportunity to establish the
topology prior to the initial root clock rate propagation. This ensures
that CPUs with on-chip clocks that use CLK_ENABLE_ON_INIT are properly
enabled at the initial propagation time, without having to further poke
the root clocks.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: clkfwk: Tidy up on-chip clock registration and rate propagation. · f5c84cf5

Authored by Paul Mundt on May 12, 2009

This tidies up the set_rate hack that the on-chip clocks were abusing to
trigger rate propagation, which is now handled generically.

Additionally, now that CLK_ENABLE_ON_INIT is wired up where it needs to
be for these clocks, the clk_enable() can go away. In some cases this was
bumping up the refcount higher than it should have been.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: clkfwk: Consolidate the ALWAYS_ENABLED / NEEDS_INIT mess. · 4ff29ff8

Authored by Paul Mundt on May 12, 2009

There is no real distinction here in behaviour: either a clock needs to
be enabled on initialization or not. The ALWAYS_ENABLED flag was always
intended to only apply to clocks that were physically always on and could
simply not be disabled at all from software. Unfortunately over time this
was abused and the meaning became a bit blurry.

So, we kill off all of those paths now, as well as the newer
NEEDS_INIT flag, and consolidate on a CLK_ENABLE_ON_INIT. Clocks that
need to be enabled on initialization can set this, and it will purposely
enable them and bump the refcount up.
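
A minimal model of the consolidated behaviour is sketched below. It is an illustration only: the reduced `struct clk`, the bit value chosen for `CLK_ENABLE_ON_INIT`, and the simplified `clk_enable()` and `clk_register()` bodies are assumptions, and the real framework does considerably more (locking, parent handling, hardware ops).

```c
#define CLK_ENABLE_ON_INIT (1UL << 0)   /* bit value is an assumption */

/* Hypothetical, reduced clock descriptor. */
struct clk {
    const char *name;
    unsigned long flags;
    int usecount;           /* reference count of active users */
};

/* Simplified enable: only the refcount is modelled here. */
static int clk_enable(struct clk *clk)
{
    clk->usecount++;
    return 0;
}

/*
 * Simplified registration: a single flag decides whether the clock is
 * enabled, and its refcount bumped, at initialization time.
 */
static int clk_register(struct clk *clk)
{
    clk->usecount = 0;
    if (clk->flags & CLK_ENABLE_ON_INIT)
        return clk_enable(clk);
    return 0;
}
```

Because registration already takes the reference, callers in this model have no reason to issue an extra `clk_enable()` afterwards, which is how a refcount can end up higher than intended.
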
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: clkfwk: Make recalc return an unsigned long. · b68d8201

Authored by Paul Mundt on May 12, 2009

This is prep work for cleaning up some of the rate propagation bits.
Trivial conversion.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

11 May, 2009 3 commits

sh: multiple vectors per irq - sh7760 · 3d6ad460

Authored by Magnus Damm on May 11, 2009

Update intc tables and platform data to use one linux irq
per maskable interrupt source instead of keeping the one-to-one
mapping between vectors and linux irqs.

This fixes potential irq masking issues for sh7760 hardware
blocks such as DMAC/TMU2/REF.
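
The difference between the two mappings can be sketched as a lookup table. The source names, vector values and `IRQ_BASE` below are illustrative assumptions, not the actual sh7760 tables; the only point is that several vectors resolve to the same Linux IRQ when they share one maskable source.

```c
#include <stddef.h>

/* Hypothetical maskable interrupt sources; one Linux IRQ per source. */
enum source { SRC_DMAC, SRC_TMU2, SRC_COUNT };

/* Hypothetical vector table: several vectors belong to one source. */
struct vector_map {
    unsigned int vector;
    enum source src;
};

static const struct vector_map vectors[] = {
    { 0x640, SRC_DMAC },    /* illustrative vector values */
    { 0x660, SRC_DMAC },
    { 0x440, SRC_TMU2 },
    { 0x460, SRC_TMU2 },
};

#define IRQ_BASE 16         /* hypothetical first Linux IRQ number */

/* Return the Linux IRQ for a vector, or -1 if the vector is unknown. */
static int irq_for_vector(unsigned int vector)
{
    for (size_t i = 0; i < sizeof(vectors) / sizeof(vectors[0]); i++)
        if (vectors[i].vector == vector)
            return IRQ_BASE + (int)vectors[i].src;
    return -1;
}
```

With a one-to-one mapping, masking the IRQ for one vector would leave the sibling vectors of the same hardware block looking unmasked to Linux even though the hardware mask bit covers them all; sharing one IRQ per source avoids that mismatch.
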
Signed-off-by: Magnus Damm <damm@igel.co.jp>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: TMU platform data for sh7760 · c42f32dc

Authored by Magnus Damm on May 11, 2009

This patch adds TMU platform data for sh7760. Both clockevent
and clocksource support is enabled.
Signed-off-by: Magnus Damm <damm@igel.co.jp>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: TMU platform data for sh775x · 03f408f1

Authored by Magnus Damm on May 11, 2009

This patch adds TMU platform data for sh775x. Both clockevent
and clocksource support is enabled.
Signed-off-by: Magnus Damm <damm@igel.co.jp>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

16 Apr, 2009 1 commit

sh: Add support for SH7724 (SH-Mobile R2R) CPU subtype. · 0207a2ef

Authored by Kuninori Morimoto on Apr 16, 2009

This implements initial support for the SH-Mobile R2R CPU.
Based on Rev 0.11 of the initial SH7724 hardware manual.
Signed-off-by: Kuninori Morimoto <morimoto.kuninori@renesas.com>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

02 Apr, 2009 1 commit

sh: Kill off broken direct-mapped cache mode. · e8208828

Authored by Paul Mundt on Apr 02, 2009

Forcing direct-mapped worked on certain older 2-way set associative
parts, but was always error prone on 4-way parts. As these are the
norm these days, there is not much point in continuing to support this
mode. Most of the folks that used direct-mapped mode generally just
wanted writethrough caching in the first place.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

17 Mar, 2009 1 commit

sh: Support for extended ASIDs on PTEAEX-capable SH-X3 cores. · 8263a67e

Authored by Paul Mundt on Mar 17, 2009

This adds support for extended ASIDs (up to 16 bits) on newer SH-X3 cores
that implement the PTEAEX register and respective functionality. Presently
this is only the 65nm SH7786 (the 90nm part only supports legacy 8-bit ASIDs).

The main change is in how the PTE is written out when loading the entry
into the TLB, as well as in how the TLB entry is selectively flushed.

While SH-X2 extended mode splits out the memory-mapped U and I-TLB data
arrays for extra bits, extended ASID mode splits out the address arrays.
While we don't use the memory-mapped data array access, the address
array accesses are necessary for selective TLB flushes, so these are
implemented newly and replace the generic SH-4 implementation.

With this, TLB flushes in switch_mm() are almost non-existent on newer
parts.
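
Why a wider ASID makes full flushes rare can be seen in a small allocator model. This is a sketch under stated assumptions: `struct asid_allocator` and `alloc_context()` are hypothetical, and the model only counts how often the ASID space wraps, which is the event that forces a full TLB flush in a rolling ASID scheme.

```c
/* Hypothetical rolling ASID allocator state. */
struct asid_allocator {
    unsigned long next;       /* last context value handed out */
    unsigned long asid_mask;  /* 0xff for 8-bit, 0xffff for 16-bit ASIDs */
    unsigned long flushes;    /* number of full TLB flushes so far */
};

/*
 * Hand out the next context value. When the ASID bits wrap to zero the
 * ASID space is exhausted and the whole TLB has to be flushed before
 * ASIDs can be reused.
 */
static unsigned long alloc_context(struct asid_allocator *a)
{
    a->next++;
    if ((a->next & a->asid_mask) == 0)
        a->flushes++;
    return a->next;
}
```

In this model, 1000 fresh contexts cost three full flushes with an 8-bit ASID (at values 256, 512 and 768) and none with a 16-bit ASID.
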
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

10 Mar, 2009 1 commit

sh: hibernation support · 2ef7f0da

Authored by Magnus Damm on Mar 06, 2009

Add Suspend-to-disk / swsusp / CONFIG_HIBERNATION support
to the SuperH architecture.

To suspend, use "swapon /dev/sda2; echo disk > /sys/power/state"
To resume, pass "resume=/dev/sda2" on the kernel command line.

The patch "pm: rework includes, remove arch ifdefs V2" is
needed to allow the generic swsusp code to build properly.

Hibernation is not enabled by this patch, though; a patch
setting ARCH_HIBERNATION_POSSIBLE will be submitted later.
Signed-off-by: Magnus Damm <damm@igel.co.jp>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

03 Mar, 2009 1 commit

sh: Add support for SH7786 CPU subtype. · 55ba99eb

Authored by Kuninori Morimoto on Mar 03, 2009

This adds preliminary support for the SH7786 CPU subtype.

While this is a dual-core CPU, only UP is supported for now. L2 cache
support is likewise not yet implemented.

More information on this particular CPU subtype is available at:

http://www.renesas.com/fmwk.jsp?cnt=sh7786_root.jsp&fp=/products/mpumcu/superh_family/sh7780_series/sh7786_group/
Signed-off-by: Kuninori Morimoto <morimoto.kuninori@renesas.com>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

27 Feb, 2009 1 commit

sh: multiple vectors per irq - sh7750 · 69977e7e

Authored by Magnus Damm on Feb 24, 2009

Update intc tables and platform data to use one linux irq
per maskable interrupt source instead of keeping the one-to-one
mapping between vectors and linux irqs.

This fixes potential irq masking issues for sh775x hardware
blocks such as SCI/SCIF/RTC/DMAC/TMU2/REF.
Signed-off-by: Magnus Damm <damm@igel.co.jp>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

29 Jan, 2009 1 commit

sh: fcnvds fix with denormalized numbers on SH-4 FPU. · 0f6dee23

Authored by Carmelo AMOROSO on Oct 15, 2008

This fixes a bug in the FPU exception handler for the FCNVDS instruction.
To get the register number the instruction is shifted right by 9,
though it should be shifted right by 8.
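
One way to read the fix is sketched below. The sketch assumes the instruction encoding `1111 mmm0 1011 1101` for FCNVDS DRm,FPUL and assumes that the handler indexes a flat array of single-precision registers, so it needs the number of the first register of the pair rather than the pair number. The helper names and the 4-bit mask are hypothetical and are not taken from the patch.

```c
#include <stdint.h>

/* Assumed encoding of FCNVDS DRm,FPUL: 1111 mmm0 1011 1101. */
static int is_fcnvds(uint16_t insn)
{
    return (insn & 0xf1ff) == 0xf0bd;
}

/*
 * Shift by 8: yields the (even) number of the first single-precision
 * register backing DRm, suitable as an index into a flat register array.
 */
static unsigned int fcnvds_src_reg(uint16_t insn)
{
    return (insn >> 8) & 0xf;
}

/* Shift by 9: yields only the pair number, i.e. half the wanted index. */
static unsigned int fcnvds_src_reg_shift9(uint16_t insn)
{
    return (insn >> 9) & 0x7;
}
```

Under these assumptions, the opcode `0xf4bd` (source DR4) decodes to register 4 with the shift by 8 but to 2 with the shift by 9, so the shift-by-9 variant would read the wrong register for every source other than DR0.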

More information at ST Linux bugzilla:

	https://bugzilla.stlinux.com/show_bug.cgi?id=4892
Signed-off-by: Giuseppe Di Giore <giuseppe.di-giore@st.com>
Signed-off-by: Carmelo Amoroso <carmelo.amoroso@st.com>
Signed-off-by: Stuart Menefy <stuart.menefy@st.com>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

22 Dec, 2008 1 commit

sh: __udivdi3 -> do_div() in softfloat lib. · e9bf51e5

Authored by Paul Mundt on Nov 18, 2008

Inhibit the generation of __udivdi3 for the softfloat lib, use do_div()
outright.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

08 Sep, 2008 2 commits

sh: Enable IRLM mode for SH7760 IRQ_MODE_IRQ. · 53abf911

Authored by Luca Santini on Sep 08, 2008

Follows the same setting as SH7750.
Signed-off-by: Luca Santini <luca.santini@spesonline.com>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: Subnormal double to float conversion · b6ad1e8c

Authored by Carl Shaw on Sep 05, 2008

This patch adds support for the SH4 to convert a subnormal double
into a float by catching the FPE and implementing the FCNVDS
instruction in software.
Signed-off-by: Carl Shaw <carl.shaw@st.com>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

04 Aug, 2008 1 commit

sh: Fix up broken kerneldoc comments. · 6a9545bd

Authored by Paul Mundt on Aug 04, 2008

These were completely unparseable, so fix them up.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

29 Jul, 2008 1 commit

sh: migrate to arch/sh/include/ · f15cbe6f

Authored by Paul Mundt on Jul 29, 2008

This follows the sparc changes a439fe51.

Most of the moving about was done with Sam's directions at:

http://marc.info/?l=linux-sh&m=121724823706062&w=2

with subsequent hacking and fixups entirely my fault.
Signed-off-by: Sam Ravnborg <sam@ravnborg.org>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

28 Jul, 2008 2 commits

sh: Record the major cut revision for probed SH-4A parts. · 068f5914

Authored by Paul Mundt on Jul 02, 2008

Signed-off-by: Paul Mundt <lethal@linux-sh.org>

sh: Stub in silicon cut in CPU info. · 3611ee7a

Authored by Stuart Menefy on Jul 02, 2008

Signed-off-by: Stuart Menefy <stuart.menefy@st.com>
Signed-off-by: Paul Mundt <lethal@linux-sh.org>

23 May, 2008 1 commit

sh: add probe support for new sh7723 cut · b76baf4c

Authored by Magnus Damm on May 22, 2008

This patch adds support for sh7723 silicon with a prr value of 0x51.
Signed-off-by: NMagnus Damm <damm@igel.co.jp>
Signed-off-by: NPaul Mundt <lethal@linux-sh.org>

b76baf4c

19 4月, 2008 1 次提交

sh: Fix up L2 cache probe. · 440fc172

由 Paul Mundt 提交于 4月 09, 2008

SH7723 is the first hard silicon to implement the L2, and unsurprisingly,
does the precise inverse of what the specification alleges. XOR the
URAM/L2 size bits to get back in line with the existing parsing logic.
Signed-off-by: Paul Mundt <lethal@linux-sh.org>