- 09 Aug 2007, 9 commits
-
-
Committed by Ingo Molnar
Eliminate rq_clock() use by changing it to:

    update_rq_clock(rq)
    now = rq->clock;

Identity transformation - no change in behavior.
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Ingo Molnar
Add the [__]update_rq_clock(rq) functions. (No change in functionality, just reorganization to prepare for the elimination of the heavy 64-bit timestamp-passing in the scheduler.)
Signed-off-by: Ingo Molnar <mingo@elte.hu>
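As a rough illustration of the reorganization, here is a standalone sketch with simplified stand-in types and bodies (not the actual kernel code): call sites stop passing 64-bit timestamps around and instead refresh and read the per-runqueue clock locally.

    #include <stdint.h>

    struct rq { uint64_t clock; };

    /* stand-in for the architecture's sched_clock() */
    static uint64_t sched_clock(void) { return 0; }

    /* __update_rq_clock(): the unconditional update;
     * update_rq_clock(): the wrapper ordinary call sites use. */
    static void __update_rq_clock(struct rq *rq)
    {
        rq->clock = sched_clock();
    }

    static void update_rq_clock(struct rq *rq)
    {
        __update_rq_clock(rq);
    }

    /* A caller that used to receive a 'u64 now' parameter can instead do: */
    static void enqueue_example(struct rq *rq)
    {
        uint64_t now;

        update_rq_clock(rq);
        now = rq->clock;    /* local timestamp, nothing passed around */
        (void)now;
    }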
-
Committed by Peter Williams
There are two problems with balance_tasks() and how it is used:

1. The variables best_prio and best_prio_seen (inherited from the old move_tasks()) were only required to handle problems caused by the active/expired arrays, the order in which they were processed, and the possibility that the task with the highest priority could be on either. These issues are no longer present and the extra overhead associated with their use is unnecessary (and possibly wrong).

2. In the absence of CONFIG_FAIR_GROUP_SCHED being set, the same this_best_prio variable needs to be used by all scheduling classes or there is a risk of moving too much load. E.g. if the highest priority task on this_rq at the beginning is a fairly low priority task and the rt class migrates a task (during its turn), then that moved task becomes the new highest priority task on this_rq; but when the sched_fair class initializes its copy of this_best_prio it will get the priority of the original highest priority task because, due to the run queue locks being held, the reschedule triggered by pull_task() will not have taken place. This could result in inappropriate overriding of skip_for_load and excessive load being moved.

This patch addresses these problems by deleting all references to best_prio and best_prio_seen and making this_best_prio a reference parameter to the various functions involved (see the sketch below).

load_balance_fair() has also been modified so that this_best_prio is only reset (in the loop) if CONFIG_FAIR_GROUP_SCHED is set. This should preserve the effect of helping spread groups' higher priority tasks around the available CPUs while improving system performance when CONFIG_FAIR_GROUP_SCHED isn't set.

Signed-off-by: Peter Williams <pwil3058@bigpond.net.au>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
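A simplified sketch of the "reference parameter" idea, with hypothetical helper names (the real balance_tasks()/load_balance() signatures also carry runqueue and sched_domain arguments):

    /* Each scheduling class updates the same watermark through the
     * pointer, so a high-priority task migrated by an earlier class is
     * visible to the classes balanced after it. */
    static unsigned long balance_one_class(unsigned long max_load_move,
                                           int *this_best_prio)
    {
        unsigned long moved = 0;

        /* ... move tasks up to max_load_move; whenever a task with a
         * better (numerically lower) priority lands on this runqueue,
         * lower the shared watermark:
         *
         *     if (p->prio < *this_best_prio)
         *             *this_best_prio = p->prio;
         */
        return moved;
    }

    static unsigned long balance_all_classes(unsigned long max_load_move)
    {
        int this_best_prio = 140;       /* MAX_PRIO-style initial value */
        unsigned long moved = 0;

        /* every class reuses the same this_best_prio */
        moved += balance_one_class(max_load_move - moved, &this_best_prio);
        moved += balance_one_class(max_load_move - moved, &this_best_prio);
        return moved;
    }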
-
Committed by Alexey Dobriyan
The kernel.sched_domain hierarchy is under CTL_UNNUMBERED and thus unreachable via sysctl(2). Generating .ctl_name values in such a situation is not useful.
Signed-off-by: Alexey Dobriyan <adobriyan@sw.ru>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Ingo Molnar
Speed up schedule(): share the 'now' timestamp that deactivate_task() was calculating internally. (This also fixes the small accounting window between the deactivate call and the pick_next_task() call.)
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Ingo Molnar
Uninline rq_clock() to save 263 bytes of code:

    text    data     bss     dec     hex filename
   39561    3642      24   43227    a8db sched.o.before
   39298    3642      24   42964    a7d4 sched.o.after

Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Ulrich Drepper
Here's another tiny cleanup. The generated code is not affected (gcc is smart enough), but for people looking over the code it is just irritating to have the extra conditional.
Signed-off-by: Ulrich Drepper <drepper@redhat.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Peter Williams
The move_tasks() function currently multiplexes two distinct capabilities:

1. attempt to move a specified amount of weighted load from one run queue to another; and
2. attempt to move a specified number of tasks from one run queue to another.

The first of these capabilities is used in two places, load_balance() and load_balance_idle(), and in both of these cases the return value of move_tasks() is used purely to decide whether tasks/load were moved; no notice is taken of the actual number of tasks moved. The second capability is used in exactly one place, active_load_balance(), to attempt to move exactly one task and, as before, the return value is only used as an indicator of success or failure.

This multiplexing of move_tasks() was introduced, by me, as part of the smpnice patches and was motivated by the fact that the alternative, one function to move specified load and one to move a single task, would have led to two functions of roughly the same complexity as the old move_tasks() (or the new balance_tasks()). However, the modular design of the new CFS scheduler allows a simpler solution to be adopted and this patch does so by:

1. adding a new function, move_one_task(), to be used by active_load_balance(); and
2. making move_tasks() a single-purpose function that tries to move a specified weighted load and returns 1 for success and 0 for failure.

One consequence of these changes is that neither move_one_task() nor the new move_tasks() cares how many tasks sched_class.load_balance() moves, which allows its interface to be simplified by returning the amount of load moved as its result and removing the load_moved pointer from the argument list (see the sketch below). This helps simplify the new move_tasks() and slightly reduces the amount of work done in each of sched_class.load_balance()'s implementations.

Further simplification, e.g. changes to balance_tasks(), is possible but (slightly) complicated by the special needs of load_balance_fair(), so I've left that to a later patch (if this one gets accepted).

NB: since move_tasks() gets called with two run queue locks held, even small reductions in overhead are worthwhile.

[ mingo@elte.hu: this change also reduces code size nicely: ]

    text    data     bss     dec     hex filename
   39216    3618      24   42858    a76a sched.o.before
   39173    3618      24   42815    a73f sched.o.after

Signed-off-by: Peter Williams <pwil3058@bigpond.net.au>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
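The resulting interface split, sketched with illustrative prototypes and trivial bodies only (the real functions also take runqueue, sched_domain and idle-state parameters):

    /* Move up to max_load_move of weighted load towards this runqueue;
     * returns 1 if anything was moved, 0 otherwise. */
    static int move_tasks(unsigned long max_load_move)
    {
        unsigned long total_load_moved = 0;
        /* ... iterate over the scheduling classes ... */
        return total_load_moved > 0;
    }

    /* Move exactly one task, for active_load_balance();
     * returns 1 on success, 0 on failure. */
    static int move_one_task(void)
    {
        /* ... ask each class in turn to migrate a single task ... */
        return 0;
    }

    /* Each sched_class.load_balance() implementation now simply returns
     * the amount of load it moved, instead of filling in a
     * '*load_moved' out-parameter. */
    static unsigned long load_balance_fair_sketch(unsigned long max_load_move)
    {
        (void)max_load_move;
        return 0;
    }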
-
Committed by Ingo Molnar
Peter Williams suggested flipping the order of update_cpu_load(rq) with the ->task_tick() call. This is a NOP for the current scheduler (the two functions are independent of each other), but ->task_tick() might create some state for update_cpu_load() in the future (or in PlugSched).
Signed-off-by: Ingo Molnar <mingo@elte.hu>
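A sketch of the reordering inside the tick handler, with stand-in hooks for illustration only:

    struct rq;
    struct task;

    static void class_task_tick(struct rq *rq, struct task *curr) { (void)rq; (void)curr; }
    static void update_cpu_load(struct rq *rq) { (void)rq; }

    static void scheduler_tick_sketch(struct rq *rq, struct task *curr)
    {
        /* old order:  update_cpu_load(rq); then ->task_tick()       */
        /* new order:  ->task_tick() first, so any state it sets up  */
        /* is already visible when the load statistics are sampled:  */
        class_task_tick(rq, curr);
        update_cpu_load(rq);
    }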
-
- 02 Aug 2007, 8 commits
-
-
Committed by Ingo Molnar
Move the rest of the debugging/instrumentation code under CONFIG_SCHEDSTATS too. This reduces code size and speeds code up:

    text    data     bss     dec     hex filename
   33044    4122      28   37194    914a sched.o.before
   32708    4122      28   36858    8ffa sched.o.after

Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Ingo Molnar
Move the load-calculation functions so that they can use the per-policy declarations and methods.
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Ingo Molnar
Make sched_class.task_new == NULL a 'default method'; this allows the removal of task_rt_new.
Signed-off-by: Ingo Molnar <mingo@elte.hu>
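The "NULL method means use the default" convention, sketched with stand-in types (in the kernel the hook lives in struct sched_class and is invoked from the new-task wakeup path):

    struct rq;
    struct task;

    struct sched_class_sketch {
        /* may be NULL: the core then falls back to a generic enqueue */
        void (*task_new)(struct rq *rq, struct task *p);
    };

    static void generic_enqueue(struct rq *rq, struct task *p) { (void)rq; (void)p; }

    static void wake_up_new_task_sketch(const struct sched_class_sketch *class,
                                        struct rq *rq, struct task *p)
    {
        if (class->task_new)
            class->task_new(rq, p);     /* class-specific placement */
        else
            generic_enqueue(rq, p);     /* default method, e.g. for RT */
    }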
-
Committed by Ingo Molnar
Uninline inc_nr_running() and dec_nr_running():

    text    data     bss     dec     hex filename
   29039    4162      24   33225    81c9 sched.o.before
   29027    4162      24   33213    81bd sched.o.after

Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Ingo Molnar
Uninline calc_delta_mine():

    text    data     bss     dec     hex filename
   29162    4162      24   33348    8244 sched.o.before
   29039    4162      24   33225    81c9 sched.o.after

Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Ingo Molnar
Use a fixed limit in calc_delta_mine() - this saves an instruction :)
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Peter Williams
1. The only place that RTPRIO_TO_LOAD_WEIGHT() is used is in the call to move_tasks() in the function active_load_balance(), and its purpose there is just to make sure that the load to be moved is big enough to ensure that exactly one task is moved (if one is available). This can be accomplished by using ULONG_MAX instead, which allows RTPRIO_TO_LOAD_WEIGHT() to be deleted.
2. This, in turn, allows PRIO_TO_LOAD_WEIGHT() to be deleted.
3. This allows load_weight() to be deleted, which allows TIME_SLICE_NICE_ZERO to be deleted along with the comment above it.
Signed-off-by: Peter Williams <pwil3058@bigpond.net.au>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Ingo Molnar
Remove the last unused remains of cache_hot_time.
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
- 01 Aug 2007, 1 commit
-
-
Committed by Randy Dunlap
Fix kernel-doc warnings in sched.c:

    Warning(linux-2623-rc1g4//kernel/sched.c:1685): No description found for parameter 'notifier'
    Warning(linux-2623-rc1g4//kernel/sched.c:1696): No description found for parameter 'notifier'
    Warning(linux-2623-rc1g4//kernel/sched.c:1750): No description found for parameter 'prev'

Signed-off-by: Randy Dunlap <randy.dunlap@oracle.com>
Cc: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
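Such warnings are resolved by giving each parameter its own kernel-doc line; the comment below shows the format (the wording is illustrative, not necessarily the exact text that was merged):

    /**
     * preempt_notifier_register - register a notifier for preemption events
     * @notifier: notifier struct to hook into the scheduler's callbacks
     *
     * Without the '@notifier:' line above, scripts/kernel-doc reports
     * "No description found for parameter 'notifier'".
     */
    void preempt_notifier_register(struct preempt_notifier *notifier);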
-
- 26 Jul 2007, 4 commits
-
-
Committed by Nick Piggin
Debugging feature: make the sched-domains tree runtime-tweakable.
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
[ mingo@elte.hu: made it depend on CONFIG_SCHED_DEBUG & small updates ]
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Ingo Molnar
It is enough to disable interrupts to get the precise rq-clock of the local CPU. This also solves an NMI watchdog regression: the NMI watchdog calls touch_softlockup_watchdog(), which could deadlock on rq->lock if the NMI hits an rq-locked critical section.
Signed-off-by: Ingo Molnar <mingo@elte.hu>
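The gist of the change, sketched in kernel style (read_local_rq_clock() is a hypothetical stand-in for the actual rq-clock read): interrupts are disabled instead of taking rq->lock, so an NMI-context caller can no longer deadlock on that lock.

    static unsigned long long local_rq_clock_sketch(void)
    {
        unsigned long long now;
        unsigned long flags;

        local_irq_save(flags);          /* enough for the local CPU...   */
        now = read_local_rq_clock();    /* ...no rq->lock required       */
        local_irq_restore(flags);

        return now;
    }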
-
Committed by Satoru Takeuchi
Remove the unused rq->load_balance_class.
Signed-off-by: Satoru Takeuchi <takeuchi_satoru@jp.fujitsu.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Avi Kivity
This adds a general mechanism whereby a task can request the scheduler to notify it whenever it is preempted or scheduled back in. This allows the task to swap any special-purpose registers, like the FPU or Intel's VT registers.
Signed-off-by: Avi Kivity <avi@qumranet.com>
[ mingo@elte.hu: fixes, cleanups ]
Signed-off-by: Ingo Molnar <mingo@elte.hu>
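A sketch of how a client such as a hypervisor might use the notifier mechanism; the callback signatures follow the preempt_notifier API this patch introduces, while the callback bodies and names here are illustrative:

    #include <linux/sched.h>
    #include <linux/preempt.h>

    static void my_sched_in(struct preempt_notifier *pn, int cpu)
    {
        /* we are being scheduled back in: reload FPU/VT state, etc. */
    }

    static void my_sched_out(struct preempt_notifier *pn,
                             struct task_struct *next)
    {
        /* we are being preempted: save special-purpose registers */
    }

    static struct preempt_ops my_preempt_ops = {
        .sched_in  = my_sched_in,
        .sched_out = my_sched_out,
    };

    static struct preempt_notifier my_notifier;

    static void hook_current_task(void)
    {
        preempt_notifier_init(&my_notifier, &my_preempt_ops);
        preempt_notifier_register(&my_notifier);   /* applies to 'current' */
    }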
-
- 20 Jul 2007, 4 commits
-
-
Committed by Ingo Molnar
Implement the cpu_clock(cpu) interface for kernel-internal use: a high-speed (but slightly imprecise) per-CPU clock constructed from sched_clock(). This API, unused at the moment, will be used in the future by blktrace, by the softlockup watchdog, by printk and by lockstat.
Signed-off-by: Ingo Molnar <mingo@elte.hu>
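A minimal usage sketch (illustrative only; it assumes the caller pins itself to a CPU while sampling):

    #include <linux/sched.h>
    #include <linux/smp.h>
    #include <linux/kernel.h>

    static void time_a_section(void)
    {
        unsigned long long t0, t1;
        int cpu = get_cpu();            /* disable preemption while sampling */

        t0 = cpu_clock(cpu);
        /* ... code being instrumented ... */
        t1 = cpu_clock(cpu);

        put_cpu();
        printk(KERN_DEBUG "section took %llu ns\n", t1 - t0);
    }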
-
Committed by Suresh Siddha
nr_moved is not the correct check for triggering the all-pinned logic. Fix the all-pinned logic in the case of load_balance_newidle().
Signed-off-by: Suresh Siddha <suresh.b.siddha@intel.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Suresh Siddha
In the presence of SMT, newly idle balance was never happening for multi-core and SMP domains (even when both of the logical siblings are idle). If thread 0 is already idle and thread 1 is about to go idle, newly idle load balance always thinks that one of the threads is not idle and skips the newly idle load balance for multi-core and SMP domains. This is because of the idle_cpu() check, which tests whether the current process on a CPU is the idle process. That is not the case for the thread doing load_balance_newidle().

Fix this by using the runqueue's nr_running field instead of idle_cpu(), and also skip the 'only one idle cpu in the group will be doing load balancing' logic during the newly idle case.

Signed-off-by: Suresh Siddha <suresh.b.siddha@intel.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
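The shape of the fix, reduced to a hypothetical helper (in the kernel the test sits inside the newidle balancing path and uses the runqueue accessors directly):

    /* Treat a CPU as idle for newidle balancing when its runqueue is
     * empty, rather than when it is currently running the idle task. */
    static int cpu_idle_for_newidle_balance(int cpu)
    {
        return cpu_rq(cpu)->nr_running == 0;    /* was: idle_cpu(cpu) */
    }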
-
Committed by Fenghua Yu
Currently most of the per-cpu data that is accessed by different CPUs has a ____cacheline_aligned_in_smp attribute. Move all of this data to the new per-cpu shared data section: .data.percpu.shared_aligned. This separates the per-cpu data that is referenced frequently by other CPUs from the local-only per-cpu data.
Signed-off-by: Fenghua Yu <fenghua.yu@intel.com>
Acked-by: Suresh Siddha <suresh.b.siddha@intel.com>
Cc: Rusty Russell <rusty@rustcorp.com.au>
Cc: Christoph Lameter <clameter@sgi.com>
Cc: "Luck, Tony" <tony.luck@intel.com>
Cc: Andi Kleen <ak@suse.de>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
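An illustrative declaration using the new section; the structure here is made up, and the macro is assumed to be DEFINE_PER_CPU_SHARED_ALIGNED, the per-cpu helper introduced alongside this section:

    #include <linux/percpu.h>

    struct shared_stats {
        unsigned long remote_wakeups;   /* example field, for illustration */
    };

    /* lands in .data.percpu.shared_aligned instead of the default
     * per-cpu section, keeping cross-CPU traffic off local-only data */
    static DEFINE_PER_CPU_SHARED_ALIGNED(struct shared_stats, shared_stats);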
-
- 18 Jul 2007, 1 commit
-
-
Committed by Rafael J. Wysocki
Currently, the freezer treats all tasks as freezable, except for the kernel threads that explicitly set the PF_NOFREEZE flag for themselves. This approach is problematic, since it requires every kernel thread to either set PF_NOFREEZE explicitly or call try_to_freeze(), even if it doesn't care about the freezing of tasks at all.

It seems better to only require the kernel threads that want or need to be frozen to use some freezer-related code, and to remove any freezer-related code from the other (nonfreezable) kernel threads, which is what this patch does. The patch makes all kernel threads nonfreezable by default (i.e. they have PF_NOFREEZE set by default) and introduces the set_freezable() function, which should be called by freezable kernel threads in order to unset PF_NOFREEZE. It also makes all of the currently freezable kernel threads call set_freezable(), so it shouldn't cause any (intentional) change of behaviour. Additionally, it updates the documentation to describe the freezing of tasks more accurately.

[akpm@linux-foundation.org: build fixes]
Signed-off-by: Rafael J. Wysocki <rjw@sisk.pl>
Acked-by: Nigel Cunningham <nigel@nigel.suspend2.net>
Cc: Pavel Machek <pavel@ucw.cz>
Cc: Oleg Nesterov <oleg@tv-sign.ru>
Cc: Gautham R Shenoy <ego@in.ibm.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
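A sketch of a freezable kernel thread under the new scheme; the thread body and names are illustrative, while set_freezable() and try_to_freeze() are the freezer calls described above:

    #include <linux/kthread.h>
    #include <linux/freezer.h>
    #include <linux/sched.h>

    static int my_worker_thread(void *unused)
    {
        set_freezable();        /* opt in: clear the default PF_NOFREEZE */

        while (!kthread_should_stop()) {
            try_to_freeze();    /* park here during suspend/hibernation */

            /* ... do one unit of work, then sleep ... */
            schedule_timeout_interruptible(HZ);
        }
        return 0;
    }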
-
- 16 Jul 2007, 3 commits
-
-
Committed by Ingo Molnar
Prettify the prio_to_wmult[] array. (This could have saved us from the typos.)
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Ingo Molnar
Document prio_to_wmult[].
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Ingo Molnar
Improve the comments around the wmult array (which controls the weight of niced tasks). Clarify that to achieve a 10% difference in CPU utilization, a weight multiplier of 1.25 has to be used.
Signed-off-by: Ingo Molnar <mingo@elte.hu>
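A worked check of the 1.25 figure: with two runnable tasks whose weights differ by one nice level, the CPU shares come out roughly ten percentage points apart.

    \[
      \frac{w}{w + w/1.25} \;=\; \frac{1.25}{2.25} \;\approx\; 55.6\%,
      \qquad
      \frac{w/1.25}{w + w/1.25} \;=\; \frac{1}{2.25} \;\approx\; 44.4\%
    \]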
-
- 14 Jul 2007, 4 commits
-
-
Committed by Thomas Gleixner
Roman Zippel noticed another inconsistency in the wmult table: wmult[16] has a missing digit.
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
-
Committed by Ingo Molnar
Fix show_task()/show_tasks() output:
- there's no sibling info anymore
- the fields were not aligned properly with the description
- get rid of the lazy-TLB output: it's been quite some time since we last had a bug there, and when we did have a bug it wasn't helped a bit by this debug output
Signed-off-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
-
Committed by Ingo Molnar
Allow granularity up to 100 msecs, instead of 10 msecs (needed on larger boxes).
Signed-off-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
-
Committed by Mike Galbraith
There is a typo in the values in prio_to_wmult[] for nice level 1. While it did not cause bad CPU distribution, it caused more rescheduling between nice-0 and nice-1 tasks than necessary.
Signed-off-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
-
- 10 Jul 2007, 6 commits
-
-
Committed by Ingo Molnar
Add credits for recent major scheduler contributions:

  Con Kolivas, for pioneering the fair-scheduling approach
  Peter Williams, for smpnice
  Mike Galbraith, for interactivity tuning of CFS
  Srivatsa Vaddagiri, for group scheduling enhancements

Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Ingo Molnar
Clean up the sleep_on() APIs:
- do not use fastcall
- replace fragile macro magic with proper inline functions
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Ingo Molnar
Four small style cleanups to sched.c: checkpatch.pl is now happy about the totality of sched.c [ignoring false positives] - yay! ;-)
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Ingo Molnar
Remove unused rq types from sched.c, now that we have switched over to CFS.
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Ingo Molnar
Remove the now unused interactivity-heuristics related defines and types of the old scheduler.
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-
Committed by Ingo Molnar
Clean up the include files in sched.c; they were still old-style <asm/> includes.
Signed-off-by: Ingo Molnar <mingo@elte.hu>
-