1. 18 1月, 2008 1 次提交
    • J
      cpufreq: Initialise default governor before use · 6915719b
      Johannes Weiner 提交于
      When the cpufreq driver starts up at boot time, it calls into the default
      governor which might not be initialised yet.  This hurts when the
      governor's worker function relies on memory that is not yet set up by its
      init function.
      
      This migrates all governors from module_init() to fs_initcall() when being
      the default, as was already done in cpufreq_performance when it was the
      only possible choice.  The performance governor is always initialized early
      because it might be used as fallback even when not being the default.
      
      Fixes at least one actual oops where ondemand is the default governor and
      cpufreq_governor_dbs() uses the uninitialised kondemand_wq work-queue
      during boot-time.
      Signed-off-by: NJohannes Weiner <hannes@saeurebad.de>
      Cc: Dave Jones <davej@codemonkey.org.uk>
      Cc: "Rafael J. Wysocki" <rjw@sisk.pl>
      Cc: Venkatesh Pallipadi <venkatesh.pallipadi@intel.com>
      Acked-by: NIngo Molnar <mingo@elte.hu>
      Cc: Thomas Gleixner <tglx@linutronix.de>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      6915719b
  2. 05 10月, 2007 1 次提交
    • T
      [CPUFREQ] allow ondemand and conservative cpufreq governors to be used as default · 1c256245
      Thomas Renninger 提交于
      Depending on the transition latency of the HW for cpufreq switches, the
      ondemand or conservative governor cannot be used with certain cpufreq
      drivers.  Still the ondemand should be the default governor on a wide range
      of systems.  This patch allows this and lets the governor fallback to the
      performance governor at cpufreq driver load time, if the driver does not
      support fast enough frequency switching.
      
      Main benefit is that on e.g.  installation or other systems without
      userspace support a working dynamic cpufreq support can be achieved on most
      systems by simply loading the cpufreq driver.  This is especially essential
      for recent x86(_64) laptop hardware which may rely on working dynamic
      cpufreq OS support.
      Signed-off-by: NThomas Renninger <trenn@suse.de>
      Signed-off-by: NVenkatesh Pallipadi <venkatesh.pallipadi@intel.com>
      Cc: Russell King <rmk@arm.linux.org.uk>
      Cc: Bryan Wu <bryan.wu@analog.com>
      Cc: Andi Kleen <ak@suse.de>
      Cc: "Luck, Tony" <tony.luck@intel.com>
      Cc: Paul Mackerras <paulus@samba.org>
      Cc: Benjamin Herrenschmidt <benh@kernel.crashing.org>
      Cc: Paul Mundt <lethal@linux-sh.org>
      Cc: "David S. Miller" <davem@davemloft.net>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Signed-off-by: NDave Jones <davej@redhat.com>
      1c256245
  3. 22 6月, 2007 2 次提交
  4. 09 5月, 2007 1 次提交
  5. 21 2月, 2007 1 次提交
  6. 11 2月, 2007 4 次提交
  7. 22 11月, 2006 1 次提交
  8. 07 11月, 2006 1 次提交
  9. 21 10月, 2006 1 次提交
  10. 16 10月, 2006 1 次提交
    • V
      [CPUFREQ][8/8] acpi-cpufreq: Add support for freq feedback from hardware · dfde5d62
      Venkatesh Pallipadi 提交于
      Enable ondemand governor and acpi-cpufreq to use IA32_APERF and IA32_MPERF MSR
      to get active frequency feedback for the last sampling interval. This will
      make ondemand take right frequency decisions when hardware coordination of
      frequency is going on.
      
      Without APERF/MPERF, ondemand can take wrong decision at times due
      to underlying hardware coordination or TM2.
      Example:
      * CPU 0 and CPU 1 are hardware cooridnated.
      * CPU 1 running at highest frequency.
      * CPU 0 was running at highest freq. Now ondemand reduces it to
        some intermediate frequency based on utilization.
      * Due to underlying hardware coordination with other CPU 1, CPU 0 continues to
        run at highest frequency (as long as other CPU is at highest).
      * When ondemand samples CPU 0 again next time, without actual frequency
        feedback from APERF/MPERF, it will think that previous frequency change
        was successful and can go to wrong target frequency. This is because it
        thinks that utilization it has got this sampling interval is when running at
        intermediate frequency, rather than actual highest frequency.
      
      More information about IA32_APERF IA32_MPERF MSR:
      Refer to IA-32 Intel® Architecture Software Developer's Manual at
      http://developer.intel.comSigned-off-by: NVenkatesh Pallipadi <venkatesh.pallipadi@intel.com>
      Signed-off-by: NDave Jones <davej@redhat.com>
      dfde5d62
  11. 06 9月, 2006 1 次提交
  12. 14 8月, 2006 1 次提交
  13. 12 8月, 2006 2 次提交
    • A
      [CPUFREQ][2/2] ondemand: updated add powersave_bias tunable · 05ca0350
      Alexey Starikovskiy 提交于
      ondemand selects the minimum frequency that can retire
      a workload with negligible idle time -- ideally resulting in the highest
      performance/power efficiency with negligible performance impact.
      
      But on some systems and some workloads, this algorithm
      is more performance biased than necessary, and
      de-tuning it a bit to allow some performance impact
      can save measurable power.
      
      This patch adds a "powersave_bias" tunable to ondemand
      to allow it to reduce its target frequency by a specified percent.
      
      By default, the powersave_bias is 0 and has no effect.
      powersave_bias is in units of 0.1%, so it has an effective range
      of 1 through 1000, resulting in 0.1% to 100% impact.
      
      In practice, users will not be able to detect a difference between
      0.1% increments, but 1.0% increments turned out to be too large.
      Also, the max value of 1000 (100%) would simply peg the system
      in its deepest power saving P-state, unless the processor really has
      a hardware P-state at 0Hz:-)
      
      For example, If ondemand requests 2.0GHz based on utilization,
      and powersave_bias=100, this code will knock 10% off the target
      and seek  a target of 1.8GHz instead of 2.0GHz until the
      next sampling.  If 1.8 is an exact match with an hardware frequency
      we use it, otherwise we average our time between the frequency
      next higher than 1.8 and next lower than 1.8.
      
      Note that a user or administrative program can change powersave_bias
      at run-time depending on how they expect the system to be used.
      
      Signed-off-by: Venkatesh Pallipadi <venkatesh.pallipadi at intel.com>
      Signed-off-by: Alexey Starikovskiy <alexey.y.starikovskiy at intel.com>
      Signed-off-by: NDave Jones <davej@redhat.com>
      05ca0350
    • A
      [CPUFREQ][1/2] ondemand: updated tune for hardware coordination · 1ce28d6b
      Alexey Starikovskiy 提交于
      Try to make dbs_check_cpu() call on all CPUs at the same jiffy.
      This will help when multiple cores share P-states via Hardware Coordination.
      
      Signed-off-by: Venkatesh Pallipadi <venkatesh.pallipadi at intel.com>
      Signed-off-by: Alexey Starikovskiy <alexey.y.starikovskiy at intel.com>
      Signed-off-by: NDave Jones <davej@redhat.com>
      1ce28d6b
  14. 26 7月, 2006 1 次提交
    • A
      [PATCH] Reorganize the cpufreq cpu hotplug locking to not be totally bizare · 153d7f3f
      Arjan van de Ven 提交于
      The patch below moves the cpu hotplugging higher up in the cpufreq
      layering; this is needed to avoid recursive taking of the cpu hotplug
      lock and to otherwise detangle the mess.
      
      The new rules are:
      1. you must do lock_cpu_hotplug() around the following functions:
         __cpufreq_driver_target
         __cpufreq_governor (for CPUFREQ_GOV_LIMITS operation only)
         __cpufreq_set_policy
      2. governer methods (.governer) must NOT take the lock_cpu_hotplug()
         lock in any way; they are called with the lock taken already
      3. if your governer spawns a thread that does things, like calling
         __cpufreq_driver_target, your thread must honor rule #1.
      4. the policy lock and other cpufreq internal locks nest within
         the lock_cpu_hotplug() lock.
      
      I'm not entirely happy about how the __cpufreq_governor rule ended up
      (conditional locking rule depending on the argument) but basically all
      callers pass this as a constant so it's not too horrible.
      
      The patch also removes the cpufreq_governor() function since during the
      locking audit it turned out to be entirely unused (so no need to fix it)
      
      The patch works on my testbox, but it could use more testing
      (otoh... it can't be much worse than the current code)
      Signed-off-by: NArjan van de Ven <arjan@linux.intel.com>
      Signed-off-by: NLinus Torvalds <torvalds@osdl.org>
      153d7f3f
  15. 24 7月, 2006 1 次提交
  16. 30 6月, 2006 3 次提交
  17. 23 6月, 2006 1 次提交
    • A
      [PATCH] cpufreq build fix · 138a0128
      Andrew Morton 提交于
      drivers/cpufreq/cpufreq_ondemand.c: In function 'do_dbs_timer':
      drivers/cpufreq/cpufreq_ondemand.c:374: warning: implicit declaration of function 'lock_cpu_hotplug'
      drivers/cpufreq/cpufreq_ondemand.c:381: warning: implicit declaration of function 'unlock_cpu_hotplug'
      drivers/cpufreq/cpufreq_conservative.c: In function 'do_dbs_timer':
      drivers/cpufreq/cpufreq_conservative.c:425: warning: implicit declaration of function 'lock_cpu_hotplug'
      drivers/cpufreq/cpufreq_conservative.c:432: warning: implicit declaration of function 'unlock_cpu_hotplug'
      
      Cc: Dave Jones <davej@codemonkey.org.uk>
      Signed-off-by: NAndrew Morton <akpm@osdl.org>
      Signed-off-by: NLinus Torvalds <torvalds@osdl.org>
      138a0128
  18. 22 6月, 2006 1 次提交
    • V
      [CPUFREQ] Fix ondemand vs suspend deadlock · 4ec223d0
      Venkatesh Pallipadi 提交于
      Rootcaused the bug to a deadlock in cpufreq and ondemand. Due to non-existent
      ordering between cpu_hotplug lock and dbs_mutex. Basically a race condition
      between cpu_down() and do_dbs_timer().
      
      cpu_down() flow:
      * cpu_down() call for CPU 1
      * Takes hot plug lock
      * Calls pre down notifier
      *     cpufreq notifier handler calls cpufreq_driver_target() which takes
            cpu_hotplug lock again. OK as cpu_hotplug lock is recursive in same
            process context
      * CPU 1 goes down
      * Calls post down notifier
      *     cpufreq notifier handler calls ondemand event stop which takes dbs_mutex
      
      So, cpu_hotplug lock is taken before dbs_mutex in this flow.
      
      do_dbs_timer is triggerred by a periodic timer event.
      It first takes dbs_mutex and then takes cpu_hotplug lock in
      cpufreq_driver_target().
      Note the reverse order here compared to above. So, if this timer event happens
      at right moment during cpu_down, system will deadlok.
      
      Attached patch fixes the issue for both ondemand and conservative.
      Signed-off-by: NVenkatesh Pallipadi <venkatesh.pallipadi@intel.com>
      Signed-off-by: NDave Jones <davej@redhat.com>
      4ec223d0
  19. 09 5月, 2006 1 次提交
  20. 26 3月, 2006 3 次提交
  21. 28 2月, 2006 1 次提交
  22. 19 1月, 2006 1 次提交
  23. 01 12月, 2005 1 次提交
    • A
      [PATCH] cpufreq_conservative/ondemand: invert meaning of 'ignore nice' · 001893cd
      Alexander Clouter 提交于
      The use of the 'ignore_nice' sysfs file is confusing to anyone using it.
      This removes the sysfs file 'ignore_nice' and in its place creates a
      'ignore_nice_load' entry that defaults to '0'; meaning nice'd processes
      _are_ counted towards the 'business' calculation.
      
      WARNING: this obvious breaks any userland tools that expected ignore_nice'
      to exist, to draw attention to this fact it was concluded on the mailing
      list that the entry should be removed altogether so the userland app breaks
      and so the author can build simple to detect workaround.  Having said that
      it seems currently very few tools even make use of this functionality; all
      I could find was a Gentoo Wiki entry.
      Signed-off-by: NAlexander Clouter <alex-kernel@digriz.org.uk>
      Signed-off-by: NAndrew Morton <akpm@osdl.org>
      Signed-off-by: NDave Jones <davej@redhat.com>
      001893cd
  24. 21 9月, 2005 1 次提交
    • D
      [CPUFREQ] Avoid the ondemand cpufreq governor to use a too high frequency for stats. · df8b59be
      Dave Jones 提交于
      The problem is in the ondemand governor, there is a periodic measurement
      of the CPU usage. This CPU usage is updated by the scheduler after every
      tick (basically, by adding 1 either to "idle" or to "user" or to
      "system"). So if the frequency of the governor is too high, the stat
      will be meaningless (as mostly no number have changed).
      
      So this patch checks that the measurements are separated by at least 10
      ticks. It means that by default, stats will have about 5% error (20
      ticks). Of course those numbers can be argued but, IMHO, they look sane.
      The patch also includes a small clean-up to check more explictly the
      result of the conversion from ns to µs being null.
      
      Let's note that (on x86) this has never been really needed before 2.6.13
      because HZ was always 1000. Now that HZ can be 100, some CPU might be
      affected by this problem. For instance when HZ=100, the centrino ,which
      has a 10µs transition latency, would lead to the governor allowing to
      read stats every tick (10ms)!
      Signed-off-by: NEric Piel <eric.piel@tremplin-utc.net>
      Signed-off-by: NDave Jones <davej@redhat.com>
      df8b59be
  25. 01 6月, 2005 7 次提交