1. 11 4月, 2011 7 次提交
    • P
      sched: Change NODE sched_domain group creation · cd4ea6ae
      Peter Zijlstra 提交于
      The NODE sched_domain is 'special' in that it allocates sched_groups
      per CPU, instead of sharing the sched_groups between all CPUs.
      
      While this might have some benefits on large NUMA and avoid remote
      memory accesses when iterating the sched_groups, this does break
      current code that assumes sched_groups are shared between all
      sched_domains (since the dynamic cpu_power patches).
      
      So refactor the NODE groups to behave like all other groups.
      
      (The ALLNODES domain again shared its groups across the CPUs for some
      reason).
      
      If someone does measure a performance decrease due to this change we
      need to revisit this and come up with another way to have both dynamic
      cpu_power and NUMA work nice together.
      Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
      Cc: Mike Galbraith <efault@gmx.de>
      Cc: Nick Piggin <npiggin@kernel.dk>
      Cc: Linus Torvalds <torvalds@linux-foundation.org>
      Cc: Andrew Morton <akpm@linux-foundation.org>
      Link: http://lkml.kernel.org/r/20110407122941.978111700@chello.nlSigned-off-by: NIngo Molnar <mingo@elte.hu>
      cd4ea6ae
    • P
      sched: Simplify build_sched_groups() · a06dadbe
      Peter Zijlstra 提交于
      Notice that the mask being computed is the same as the domain span we
      just computed. By using the domain_span we can avoid some mask
      allocations and computations.
      Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
      Cc: Mike Galbraith <efault@gmx.de>
      Cc: Nick Piggin <npiggin@kernel.dk>
      Cc: Linus Torvalds <torvalds@linux-foundation.org>
      Cc: Andrew Morton <akpm@linux-foundation.org>
      Link: http://lkml.kernel.org/r/20110407122941.925028189@chello.nlSigned-off-by: NIngo Molnar <mingo@elte.hu>
      a06dadbe
    • P
      sched: Simplify ->cpu_power initialization · d274cb30
      Peter Zijlstra 提交于
      The code in update_group_power() does what init_sched_groups_power()
      does and more, so remove the special init_ code and call the generic
      code instead.
      
      Also move the sd->span_weight initialization because
      update_group_power() needs it.
      Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
      Cc: Mike Galbraith <efault@gmx.de>
      Cc: Nick Piggin <npiggin@kernel.dk>
      Cc: Linus Torvalds <torvalds@linux-foundation.org>
      Cc: Andrew Morton <akpm@linux-foundation.org>
      Link: http://lkml.kernel.org/r/20110407122941.875856012@chello.nlSigned-off-by: NIngo Molnar <mingo@elte.hu>
      d274cb30
    • P
      sched: Remove obsolete arch_ prefixes · c4a8849a
      Peter Zijlstra 提交于
      Non weak static functions clearly are not arch specific, so remove the
      arch_ prefix.
      Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
      Cc: Mike Galbraith <efault@gmx.de>
      Cc: Nick Piggin <npiggin@kernel.dk>
      Cc: Linus Torvalds <torvalds@linux-foundation.org>
      Cc: Andrew Morton <akpm@linux-foundation.org>
      Link: http://lkml.kernel.org/r/20110407122941.820460566@chello.nlSigned-off-by: NIngo Molnar <mingo@elte.hu>
      c4a8849a
    • S
      sched: Eliminate dead code from wakeup_gran() · f4ad9bd2
      Shaohua Li 提交于
      calc_delta_fair() checks NICE_0_LOAD already, delete duplicate check.
      
      Signed-off-by: Shaohua Li<shaohua.li@intel.com>
      Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
      Cc: Mike Galbraith <efault@gmx.de>
      Link: http://lkml.kernel.org/r/1302238389.3981.92.camel@sli10-conroeSigned-off-by: NIngo Molnar <mingo@elte.hu>
      f4ad9bd2
    • K
      sched: Fix erroneous all_pinned logic · b30aef17
      Ken Chen 提交于
      The scheduler load balancer has specific code to deal with cases of
      unbalanced system due to lots of unmovable tasks (for example because of
      hard CPU affinity). In those situation, it excludes the busiest CPU that
      has pinned tasks for load balance consideration such that it can perform
      second 2nd load balance pass on the rest of the system.
      
      This all works as designed if there is only one cgroup in the system.
      
      However, when we have multiple cgroups, this logic has false positives and
      triggers multiple load balance passes despite there are actually no pinned
      tasks at all.
      
      The reason it has false positives is that the all pinned logic is deep in
      the lowest function of can_migrate_task() and is too low level:
      
      load_balance_fair() iterates each task group and calls balance_tasks() to
      migrate target load. Along the way, balance_tasks() will also set a
      all_pinned variable. Given that task-groups are iterated, this all_pinned
      variable is essentially the status of last group in the scanning process.
      Task group can have number of reasons that no load being migrated, none
      due to cpu affinity. However, this status bit is being propagated back up
      to the higher level load_balance(), which incorrectly think that no tasks
      were moved.  It kick off the all pinned logic and start multiple passes
      attempt to move load onto puller CPU.
      
      To fix this, move the all_pinned aggregation up at the iterator level.
      This ensures that the status is aggregated over all task-groups, not just
      last one in the list.
      Signed-off-by: NKen Chen <kenchen@google.com>
      Cc: stable@kernel.org
      Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
      Link: http://lkml.kernel.org/r/BANLkTi=ernzNawaR5tJZEsV_QVnfxqXmsQ@mail.gmail.comSigned-off-by: NIngo Molnar <mingo@elte.hu>
      b30aef17
    • K
      sched: Fix sched-domain avg_load calculation · b0432d8f
      Ken Chen 提交于
      In function find_busiest_group(), the sched-domain avg_load isn't
      calculated at all if there is a group imbalance within the domain. This
      will cause erroneous imbalance calculation.
      
      The reason is that calculate_imbalance() sees sds->avg_load = 0 and it
      will dump entire sds->max_load into imbalance variable, which is used
      later on to migrate entire load from busiest CPU to the puller CPU.
      
      This has two really bad effect:
      
      1. stampede of task migration, and they won't be able to break out
         of the bad state because of positive feedback loop: large load
         delta -> heavier load migration -> larger imbalance and the cycle
         goes on.
      
      2. severe imbalance in CPU queue depth.  This causes really long
         scheduling latency blip which affects badly on application that
         has tight latency requirement.
      
      The fix is to have kernel calculate domain avg_load in both cases. This
      will ensure that imbalance calculation is always sensible and the target
      is usually half way between busiest and puller CPU.
      Signed-off-by: NKen Chen <kenchen@google.com>
      Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
      Cc: <stable@kernel.org>
      Link: http://lkml.kernel.org/r/20110408002322.3A0D812217F@elm.corp.google.comSigned-off-by: NIngo Molnar <mingo@elte.hu>
      b0432d8f
  2. 09 4月, 2011 1 次提交
  3. 05 4月, 2011 3 次提交
  4. 04 4月, 2011 1 次提交
  5. 03 4月, 2011 1 次提交
  6. 01 4月, 2011 1 次提交
  7. 31 3月, 2011 5 次提交
  8. 30 3月, 2011 3 次提交
  9. 29 3月, 2011 8 次提交
  10. 28 3月, 2011 3 次提交
  11. 27 3月, 2011 5 次提交
  12. 26 3月, 2011 2 次提交