提交 · a70a93229943c177f0062490b4f8e44be4cef685 · openeuler / raspberrypi-kernel

10 11月, 2007 15 次提交

sched: proper prototype for kernel/sched.c:migration_init() · e6fe6649

由 Adrian Bunk 提交于 11月 09, 2007

This patch adds a proper prototype for migration_init() in
include/linux/sched.h

Since there's no point in always returning 0 to a caller that doesn't check
the return value it also changes the function to return void.
Signed-off-by: NAdrian Bunk <bunk@kernel.org>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

e6fe6649

sched: avoid large irq-latencies in smp-balancing · b82d9fdd

由 Peter Zijlstra 提交于 11月 09, 2007

SMP balancing is done with IRQs disabled and can iterate the full rq.
When rqs are large this can cause large irq-latencies. Limit the nr of
iterations on each run.

This fixes a scheduling latency regression reported by the -rt folks.
Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
Acked-by: NSteven Rostedt <rostedt@goodmis.org>
Tested-by: NGregory Haskins <ghaskins@novell.com>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

b82d9fdd

sched: fix copy_namespace() <-> sched_fork() dependency in do_fork · 3c90e6e9

由 Srivatsa Vaddagiri 提交于 11月 09, 2007

Sukadev Bhattiprolu reported a kernel crash with control groups.
There are couple of problems discovered by Suka's test:

- The test requires the cgroup filesystem to be mounted with
  atleast the cpu and ns options (i.e both namespace and cpu 
  controllers are active in the same hierarchy). 

	# mkdir /dev/cpuctl
	# mount -t cgroup -ocpu,ns none cpuctl
	(or simply)
	# mount -t cgroup none cpuctl -> Will activate all controllers
					 in same hierarchy.

- The test invokes clone() with CLONE_NEWNS set. This causes a a new child
  to be created, also a new group (do_fork->copy_namespaces->ns_cgroup_clone->
  cgroup_clone) and the child is attached to the new group (cgroup_clone->
  attach_task->sched_move_task). At this point in time, the child's scheduler 
  related fields are uninitialized (including its on_rq field, which it has
  inherited from parent). As a result sched_move_task thinks its on
  runqueue, when it isn't.

  As a solution to this problem, I moved sched_fork() call, which
  initializes scheduler related fields on a new task, before
  copy_namespaces(). I am not sure though whether moving up will
  cause other side-effects. Do you see any issue?

- The second problem exposed by this test is that task_new_fair()
  assumes that parent and child will be part of the same group (which 
  needn't be as this test shows). As a result, cfs_rq->curr can be NULL
  for the child.

  The solution is to test for curr pointer being NULL in
  task_new_fair().

With the patch below, I could run ns_exec() fine w/o a crash.
Reported-by: NSukadev Bhattiprolu <sukadev@us.ibm.com>
Signed-off-by: NSrivatsa Vaddagiri <vatsa@linux.vnet.ibm.com>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

3c90e6e9

sched: clean up the wakeup preempt check, #2 · 502d26b5

由 Ingo Molnar 提交于 11月 09, 2007

clean up the preemption check to not use unnecessary 64-bit
variables. This improves code size:

   text    data     bss     dec     hex filename
  44227    3326      36   47589    b9e5 sched.o.before
  44201    3326      36   47563    b9cb sched.o.after
Signed-off-by: NIngo Molnar <mingo@elte.hu>

502d26b5

sched: clean up the wakeup preempt check · 77d9cc44

由 Ingo Molnar 提交于 11月 09, 2007

clean up the wakeup preemption check. No code changed:

   text    data     bss     dec     hex filename
  44227    3326      36   47589    b9e5 sched.o.before
  44227    3326      36   47589    b9e5 sched.o.after
Signed-off-by: NIngo Molnar <mingo@elte.hu>

77d9cc44

sched: wakeup preemption fix · 8bc6767a

由 Ingo Molnar 提交于 11月 09, 2007

wakeup preemption fix: do not make it dependent on p->prio.
Preemption purely depends on ->vruntime.

This improves preemption in mixed-nice-level workloads.
Signed-off-by: NIngo Molnar <mingo@elte.hu>

8bc6767a

sched: remove PREEMPT_RESTRICT · 3e3e13f3

由 Ingo Molnar 提交于 11月 09, 2007

remove PREEMPT_RESTRICT. (this is a separate commit so that any
regression related to the removal itself is bisectable)
Signed-off-by: NIngo Molnar <mingo@elte.hu>

3e3e13f3

sched: turn off PREEMPT_RESTRICT · 52d3da1a

由 Ingo Molnar 提交于 11月 09, 2007

PREEMPT_RESTRICT was a method aimed at reducing the amount of wakeup
related preemption. It has a disadvantage though, it can prevent
legitimate wakeups if a task is 'unlucky' to be hit too early by a tick
that clears peer_preempt.

Now that the wakeup preemption has been cleaned up we dont seem to have
excessive preemptions anymore, so this feature can be turned off. (and
removed in the next patch)
Signed-off-by: NIngo Molnar <mingo@elte.hu>

52d3da1a

sched: cleanup, use NSEC_PER_MSEC and NSEC_PER_SEC · d6322faf

由 Eric Dumazet 提交于 11月 09, 2007

1) hardcoded 1000000000 value is used five times in places where
   NSEC_PER_SEC might be more readable.

2) A conversion from nsec to msec uses the hardcoded 1000000 value,
   which is a candidate for NSEC_PER_MSEC.

no code changed:

    text    data     bss     dec     hex filename
   44359    3326      36   47721    ba69 sched.o.before
   44359    3326      36   47721    ba69 sched.o.after
Signed-off-by: NEric Dumazet <dada1@cosmosbay.com>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

d6322faf

sched: reintroduce SMP tunings again · 19978ca6

由 Ingo Molnar 提交于 11月 09, 2007

Yanmin Zhang reported an aim7 regression and bisected it down to:

 |  commit 38ad464d
 |  Author: Ingo Molnar <mingo@elte.hu>
 |  Date:   Mon Oct 15 17:00:02 2007 +0200
 |
 |     sched: uniform tunings
 |
 |     use the same defaults on both UP and SMP.

fix this by reintroducing similar SMP tunings again. This resolves
the regression.

(also update the comments to match the ilog2(nr_cpus) tuning effect)
Signed-off-by: NIngo Molnar <mingo@elte.hu>

19978ca6

sched: restore deterministic CPU accounting on powerpc · fa13a5a1

由 Paul Mackerras 提交于 11月 09, 2007

Since powerpc started using CONFIG_GENERIC_CLOCKEVENTS, the
deterministic CPU accounting (CONFIG_VIRT_CPU_ACCOUNTING) has been
broken on powerpc, because we end up counting user time twice: once in
timer_interrupt() and once in update_process_times().

This fixes the problem by pulling the code in update_process_times
that updates utime and stime into a separate function called
account_process_tick.  If CONFIG_VIRT_CPU_ACCOUNTING is not defined,
there is a version of account_process_tick in kernel/timer.c that
simply accounts a whole tick to either utime or stime as before.  If
CONFIG_VIRT_CPU_ACCOUNTING is defined, then arch code gets to
implement account_process_tick.

This also lets us simplify the s390 code a bit; it means that the s390
timer interrupt can now call update_process_times even when
CONFIG_VIRT_CPU_ACCOUNTING is turned on, and can just implement a
suitable account_process_tick().

account_process_tick() now takes the task_struct * as an argument.
Tested both with and without CONFIG_VIRT_CPU_ACCOUNTING.
Signed-off-by: NPaul Mackerras <paulus@samba.org>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

fa13a5a1

sched: fix delay accounting regression · 9a41785c

由 Balbir Singh 提交于 11月 09, 2007

Fix the delay accounting regression introduced by commit
75d4ef16. rq no longer has sched_info
data associated with it. task_struct sched_info structure is used by delay
accounting to provide back statistics to user space.

also remove direct use of sched_clock() (which is not a valid thing to
do anymore) and use rq->clock instead.
Signed-off-by: NBalbir Singh <balbir@linux.vnet.ibm.com>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

9a41785c

sched: reintroduce the sched_min_granularity tunable · b2be5e96

由 Peter Zijlstra 提交于 11月 09, 2007

we lost the sched_min_granularity tunable to a clever optimization
that uses the sched_latency/min_granularity ratio - but the ratio
is quite unintuitive to users and can also crash the kernel if the
ratio is set to 0. So reintroduce the min_granularity tunable,
while keeping the ratio maintained internally.

no functionality changed.

[ mingo@elte.hu: some fixlets. ]
Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

b2be5e96

sched: documentation: place_entity() comments · 2cb8600e

由 Peter Zijlstra 提交于 11月 09, 2007

Add a few comments to place_entity(). No code changed.
Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

2cb8600e

sched: fix vslice · 10b77724

由 Peter Zijlstra 提交于 11月 09, 2007

vslice was missing a factor NICE_0_LOAD, as weight is in
weight*NICE_0_LOAD units.

the effect of this bug was larger initial slices and
thus latency-noisier forks.
Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

10b77724

06 11月, 2007 2 次提交

time: fix inconsistent function names in comments · 8dce39c2

由 Li Zefan 提交于 11月 05, 2007

Signed-off-by: NLi Zefan <lizf@cn.fujitsu.com>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: john stultz <johnstul@us.ibm.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

8dce39c2

Dump stack during sysctl registration failure · 5db6a4da

由 Alexey Dobriyan 提交于 11月 05, 2007

Let's make immediately obvious from where sysctl comes from and messages
itself more noticeable.
Signed-off-by: NAlexey Dobriyan <adobriyan@gmail.com>
Acked-by: N"Eric W. Biederman" <ebiederm@xmission.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

5db6a4da

05 11月, 2007 1 次提交

kernel/futex.c: make 3 functions static · fad23fc7

由 Adrian Bunk 提交于 11月 02, 2007

The following functions can now become static again:
- get_futex_key()
- get_futex_key_refs()
- drop_futex_key_refs()
Signed-off-by: NAdrian Bunk <bunk@kernel.org>
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

fad23fc7

31 10月, 2007 1 次提交
- D
  [COMPAT]: Fix build on COMPAT platforms when CONFIG_NET is disabled. · f3baa482
  由 David S. Miller 提交于 10月 29, 2007
```
Add some missing cond_syscall() entries for this case.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  f3baa482
30 10月, 2007 9 次提交

Freezer: do not allow freezing processes to clear TIF_SIGPENDING · cc5f916e

由 Rafael J. Wysocki 提交于 10月 29, 2007

Do not allow processes to clear their TIF_SIGPENDING if TIF_FREEZE is set,
so that they will not race with the freezer (like mysqld does, for example).
Signed-off-by: NRafael J. Wysocki <rjw@sisk.pl>
Acked-by: NNigel Cunningham <nigel@suspend2.net>
Acked-by: NPavel Machek <pavel@ucw.cz>
Cc: Oleg Nesterov <oleg@tv-sign.ru>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

cc5f916e

sched: fix /proc/<PID>/stat stime/utime monotonicity, part 2 · 9301899b

由 Balbir Singh 提交于 10月 30, 2007

Extend Peter's patch to fix accounting issues, by keeping stime
monotonic too.
Signed-off-by: NBalbir Singh <balbir@linux.vnet.ibm.com>
Signed-off-by: NIngo Molnar <mingo@elte.hu>
Tested-by: NFrans Pop <elendil@planet.nl>

9301899b

sched: fix style in kernel/sched.c · 38605cae

由 Ingo Molnar 提交于 10月 29, 2007

fallout of recent commits: small coding style fixes.
Signed-off-by: NIngo Molnar <mingo@elte.hu>

38605cae

sched: fix style of swap() macro in kernel/sched_fair.c · 8eb172d9

由 Ingo Molnar 提交于 10月 29, 2007

fix style of swap() macro in kernel/sched_fair.c.

( this macro should eventually move to a general header, as ext3 uses
  a similar construct too. )
Signed-off-by: NIngo Molnar <mingo@elte.hu>

8eb172d9

sched: report CPU usage in CFS cgroup directories · fe5c7cc2

由 Paul Menage 提交于 10月 29, 2007

Adds a cpu.usage file to the CFS cgroup that reports CPU usage in
milliseconds for that cgroup's tasks

[ mingo@elte.hu: style cleanups. ]
Signed-off-by: NPaul Menage <menage@google.com>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

fe5c7cc2

sched: move rcu_head to task_group struct · ae8393e5

由 Srivatsa Vaddagiri 提交于 10月 29, 2007

Peter Zijlstra noticed that the rcu_head object need not be present
in every cfs_rq of a group. Move it to the task_group structure
instead.
Signed-off-by: NSrivatsa Vaddagiri <vatsa@linux.vnet.ibm.com>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

ae8393e5

sched: fix incorrect assumption that cpu 0 exists · 7bae49d4

由 James Bottomley 提交于 10月 29, 2007

This patch:

commit 9b5b7751
Author: Srivatsa Vaddagiri <vatsa@linux.vnet.ibm.com>
Date:   Mon Oct 15 17:00:09 2007 +0200

    sched: clean up code under CONFIG_FAIR_GROUP_SCHED

Introduced an assumption of the existence of CPU0 via this line

cfs_rq = tg->cfs_rq[0];

If you have no CPU0, that will be NULL.  The fix seems to be just to
take whatever cfs_rq queue comes out of the for_each_possible_cpu()
loop, since they're all equally good for the destruction operation.
Signed-off-by: NJames Bottomley <James.Bottomley@SteelEye.com>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

7bae49d4

sched: keep utime/stime monotonic · 73a2bcb0

由 Peter Zijlstra 提交于 10月 29, 2007

keep utime/stime monotonic.

cpustats use utime/stime as a ratio against sum_exec_runtime, as a
consequence it can happen - when the ratio changes faster than time
accumulates - that either can be appear to go backwards.
Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

73a2bcb0

sched: make kernel/sched.c:account_guest_time() static · f7402e03

由 Adrian Bunk 提交于 10月 29, 2007

account_guest_time() can become static.
Signed-off-by: NAdrian Bunk <bunk@kernel.org>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

f7402e03

29 10月, 2007 5 次提交

x86 merge fallout: uml · ca5cd877

由 Al Viro 提交于 10月 29, 2007

Don't undef __i386__/__x86_64__ in uml anymore, make sure that (few) places
that required adjusting the ifdefs got those.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

ca5cd877

Quieten hrtimer printk: "Switched to high resolution mode .." · edfed66e

由 Michael Ellerman 提交于 10月 29, 2007

Change the hrtimer printk "Switched to high resolution mode .." to
be KERN_DEBUG, rather than KERN_INFO. If users need to see this they
can pass "loglevel" or "debug" on the command line, or check dmesg.
Signed-off-by: NMichael Ellerman <michael@ellerman.id.au>
Signed-off-by: NThomas Gleixner <tglx@linutronix.de>

 kernel/hrtimer.c |    2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

edfed66e

timer_list: Fix printk format strings · 129f1d2c

由 Vegard Nossum 提交于 10月 11, 2007

This makes sure printk format strings contain no more than a single
line.
Signed-off-by: NVegard Nossum <vegard.nossum@gmail.com>
Signed-off-by: NThomas Gleixner <tglx@linutronix.de>

129f1d2c

clockevents: unexport tick_nohz_get_sleep_length · 64e38eb0

由 Adrian Bunk 提交于 10月 24, 2007

This patch removes the unused 
EXPORT_SYMBOL_GPL(tick_nohz_get_sleep_length).
Signed-off-by: NAdrian Bunk <bunk@kernel.org>
Signed-off-by: NThomas Gleixner <tglx@linutronix.de>

64e38eb0

lockdep: fix a typo in the __lock_acquire comment · 17aacfb9

由 Gautham R Shenoy 提交于 10月 28, 2007

Fix a typo in the __lock_acquire comment.
Signed-off-by: NGautham R Shenoy <ego@in.ibm.com>
Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

17aacfb9

26 10月, 2007 1 次提交

cpuidle: unexport tick_nohz_get_sleep_length · 4d8b4e1e

由 Adrian Bunk 提交于 10月 24, 2007

This patch removes the unused
EXPORT_SYMBOL_GPL(tick_nohz_get_sleep_length),
which we no long user b/c we no longer build optional modules.
Signed-off-by: NAdrian Bunk <bunk@kernel.org>
Acked-by: NVenkatesh Pallipadi <venkatesh.pallipadi@intel.com>
Acked-by: NThomas Gleixner <tglx@linutronix.de>
Signed-off-by: NLen Brown <len.brown@intel.com>

4d8b4e1e

25 10月, 2007 6 次提交

sched: fix unconditional irq lock · ab63a633

由 Peter Zijlstra 提交于 10月 25, 2007

Lockdep noticed that this lock can also be taken from hardirq context, and can
thus not unconditionally disable/enable irqs.

 WARNING: at kernel/lockdep.c:2033 trace_hardirqs_on()
  [show_trace_log_lvl+26/48] show_trace_log_lvl+0x1a/0x30
  [show_trace+18/32] show_trace+0x12/0x20
  [dump_stack+22/32] dump_stack+0x16/0x20
  [trace_hardirqs_on+405/416] trace_hardirqs_on+0x195/0x1a0
  [_read_unlock_irq+34/48] _read_unlock_irq+0x22/0x30
  [sched_debug_show+2615/4224] sched_debug_show+0xa37/0x1080
  [show_state_filter+326/368] show_state_filter+0x146/0x170
  [sysrq_handle_showstate+10/16] sysrq_handle_showstate+0xa/0x10
  [__handle_sysrq+123/288] __handle_sysrq+0x7b/0x120
  [handle_sysrq+40/64] handle_sysrq+0x28/0x40
  [kbd_event+1045/1680] kbd_event+0x415/0x690
  [input_pass_event+206/208] input_pass_event+0xce/0xd0
  [input_handle_event+170/928] input_handle_event+0xaa/0x3a0
  [input_event+95/112] input_event+0x5f/0x70
  [atkbd_interrupt+434/1456] atkbd_interrupt+0x1b2/0x5b0
  [serio_interrupt+59/128] serio_interrupt+0x3b/0x80
  [i8042_interrupt+263/576] i8042_interrupt+0x107/0x240
  [handle_IRQ_event+40/96] handle_IRQ_event+0x28/0x60
  [handle_edge_irq+175/320] handle_edge_irq+0xaf/0x140
  [do_IRQ+64/128] do_IRQ+0x40/0x80
  [common_interrupt+46/52] common_interrupt+0x2e/0x34
Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

ab63a633

sched: isolate SMP balancing code a bit more · 681f3e68

由 Peter Williams 提交于 10月 24, 2007

At the moment, a lot of load balancing code that is irrelevant to non
SMP systems gets included during non SMP builds.

This patch addresses this issue and reduces the binary size on non
SMP systems:

   text    data     bss     dec     hex filename
  10983      28    1192   12203    2fab sched.o.before
  10739      28    1192   11959    2eb7 sched.o.after
Signed-off-by: NPeter Williams <pwil3058@bigpond.net.au>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

681f3e68

sched: reduce balance-tasks overhead · e1d1484f

由 Peter Williams 提交于 10月 24, 2007

At the moment, balance_tasks() provides low level functionality for both
  move_tasks() and move_one_task() (indirectly) via the load_balance()
function (in the sched_class interface) which also provides dual
functionality.  This dual functionality complicates the interfaces and
internal mechanisms and makes the run time overhead of operations that
are called with two run queue locks held.

This patch addresses this issue and reduces the overhead of these
operations.
Signed-off-by: NPeter Williams <pwil3058@bigpond.net.au>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

e1d1484f

sched: make cpu_shares_{show,store}() static · a0f846aa

由 Adrian Bunk 提交于 10月 24, 2007

cpu_shares_{show,store}() can become static.
Signed-off-by: NAdrian Bunk <bunk@kernel.org>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

a0f846aa

sched: clean up some control group code · 2b01dfe3

由 Paul Menage 提交于 10月 24, 2007

- replace "cont" with "cgrp" in a few places in the CFS cgroup code, 
- use write_uint rather than write for cpu.shares write function
Signed-off-by: NPaul Menage <menage@google.com>
Acked-by : Srivatsa Vaddagiri <vatsa@linux.vnet.ibm.com>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

2b01dfe3

sched: document profile=sleep requiring CONFIG_SCHEDSTATS · b3da2a73

由 Mel Gorman 提交于 10月 24, 2007

profile=sleep only works if CONFIG_SCHEDSTATS is set. This patch notes
the limitation in Documentation/kernel-parameters.txt and prints a
warning at boot-time if profile=sleep is used without CONFIG_SCHEDSTAT.
Signed-off-by: NMel Gorman <mel@csn.ul.ie>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

b3da2a73