1. 31 3月, 2017 1 次提交
  2. 24 3月, 2017 9 次提交
    • T
      sysrq: Reset the watchdog timers while displaying high-resolution timers · 01070427
      Tom Hromatka 提交于
      On systems with a large number of CPUs, running sysrq-<q> can cause
      watchdog timeouts.  There are two slow sections of code in the sysrq-<q>
      path in timer_list.c.
      
      1. print_active_timers() - This function is called by print_cpu() and
         contains a slow goto loop.  On a machine with hundreds of CPUs, this
         loop took approximately 100ms for the first CPU in a NUMA node.
         (Subsequent CPUs in the same node ran much quicker.)  The total time
         to print all of the CPUs is ultimately long enough to trigger the
         soft lockup watchdog.
      
      2. print_tickdevice() - This function outputs a large amount of textual
         information.  This function also took approximately 100ms per CPU.
      
      Since sysrq-<q> is not a performance critical path, there should be no
      harm in touching the nmi watchdog in both slow sections above.  Touching
      it in just one location was insufficient on systems with hundreds of
      CPUs as occasional timeouts were still observed during testing.
      
      This issue was observed on an Oracle T7 machine with 128 CPUs, but I
      anticipate it may affect other systems with similarly large numbers of
      CPUs.
      Signed-off-by: NTom Hromatka <tom.hromatka@oracle.com>
      Reviewed-by: NRob Gardner <rob.gardner@oracle.com>
      Signed-off-by: NJohn Stultz <john.stultz@linaro.org>
      01070427
    • D
      timers, sched_clock: Update timeout for clock wrap · 1b8955bc
      David Engraf 提交于
      The scheduler clock framework may not use the correct timeout for the clock
      wrap. This happens when a new clock driver calls sched_clock_register()
      after the kernel called sched_clock_postinit(). In this case the clock wrap
      timeout is too long thus sched_clock_poll() is called too late and the clock
      already wrapped.
      
      On my ARM system the scheduler was no longer scheduling any other task than
      the idle task because the sched_clock() wrapped.
      Signed-off-by: NDavid Engraf <david.engraf@sysgo.com>
      Signed-off-by: NJohn Stultz <john.stultz@linaro.org>
      1b8955bc
    • J
      MAINTAINERS: Add Stephen Boyd as timekeeping reviewer · e1c09219
      John Stultz 提交于
      After showing expertise and presenting on the timekeeping
      subsystem at ELC[1], Stephen clearly should be included in
      the maintainer list.
      
      [1] https://www.youtube.com/watch?v=Puv4mW55bF8Acked-by: NStephen Boyd <sboyd@codeaurora.org>
      Signed-off-by: NJohn Stultz <john.stultz@linaro.org>
      e1c09219
    • N
      clockevents: Make clockevents_config() static · 0695bd99
      Nicolai Stange 提交于
      A clockevent device's rate should be configured before or at registration
      and changed afterwards through clockevents_update_freq() only.
      
      For the configuration at registration, we already have
      clockevents_config_and_register().
      
      Right now, there are no clockevents_config() users outside of the
      clockevents core.
      
      To mitigiate the risk of drivers errorneously reconfiguring their rates
      through clockevents_config() *after* device registration, make
      clockevents_config() static.
      Signed-off-by: NNicolai Stange <nicstange@gmail.com>
      Signed-off-by: NJohn Stultz <john.stultz@linaro.org>
      0695bd99
    • N
      clocksource: h8300_timer8: Don't reset rate in ->set_state_oneshot() · a17e0178
      Nicolai Stange 提交于
      With the upcoming NTP correction related rate adjustments to be implemented
      in the clockevents core, the latter needs to get informed about every rate
      change of a clockevent device made after its registration.
      
      Currently, h8300_timer8 violates this requirement in that it registers its
      clockevent device with the correct rate, but resets its ->mult and ->rate
      values in timer8_clock_event_start(), called from its ->set_state_oneshot()
      function.
      
      It seems like
        commit 4633f4ca ("clocksource/drivers/h8300: Cleanup startup and
                              remove module code."),
      which introduced the rate initialization at registration, missed to remove
      the manual setting of ->mult and ->shift from timer8_clock_event_start().
      
      Purge the setting of ->mult, ->shift, ->min_delta_ns and ->max_delta_ns
      from timer8_clock_event_start().
      Signed-off-by: NNicolai Stange <nicstange@gmail.com>
      Signed-off-by: NJohn Stultz <john.stultz@linaro.org>
      a17e0178
    • N
      clocksource: em_sti: Compute rate before registration · 4e53aa2f
      Nicolai Stange 提交于
      With the upcoming NTP correction related rate adjustments to be implemented
      in the clockevents core, the latter needs to get informed about every rate
      change of a clockevent device made after its registration.
      
      Currently, em_sti violates this requirement in that it registers its
      clockevent device with a dummy rate and sets its final rate through
      clockevents_config() called from its ->set_state_oneshot().
      
      This patch moves the setting of the clockevent device's rate to its
      registration.
      
      I checked all current em_sti users in arch/arm/mach-shmobile and right now,
      none of them changes any rate in any clock tree relevant to em_sti after
      their respective time_init(). Since all em_sti instances are created after
      time_init(), none of them should ever observe any clock rate changes.
      
      - Determine the ->rate value in em_sti_probe() at device probing rather
        than at first usage.
      - Set the clockevent device's rate at its registration.
      - Although not strictly necessary for the upcoming clockevent core changes,
        set the clocksource's rate at its registration for consistency.
      Signed-off-by: NNicolai Stange <nicstange@gmail.com>
      Signed-off-by: NJohn Stultz <john.stultz@linaro.org>
      4e53aa2f
    • N
      clocksource: em_sti: Split clock prepare and enable steps · 3814ae09
      Nicolai Stange 提交于
      Currently, the em_sti driver prepares and enables the needed clock in
      em_sti_enable(), potentially called through its clockevent device's
      ->set_state_oneshot().
      
      However, the clk_prepare() step may sleep whereas tick_program_event() and
      thus, ->set_state_oneshot(), can be called in atomic context.
      
      Split the clk_prepare_enable() in em_sti_enable() into two steps:
      - prepare the clock at device probing via clk_prepare()
      - and enable it in em_sti_enable() via clk_enable().
      Slightly reorder resource initialization in em_sti_probe() in order to
      facilitate error handling in later patches.
      Signed-off-by: NNicolai Stange <nicstange@gmail.com>
      Signed-off-by: NJohn Stultz <john.stultz@linaro.org>
      3814ae09
    • N
      clocksource: sh_tmu: Compute rate before registration again · c3c0a20d
      Nicolai Stange 提交于
      With the upcoming NTP correction related rate adjustments to be implemented
      in the clockevents core, the latter needs to get informed about every rate
      change of a clockevent device made after its registration.
      
      Currently, sh_tmu violates this requirement in that it registers its
      clockevent device with a dummy rate and sets its final rate through
      clockevents_config() called from its ->set_state_oneshot() and
      ->set_state_periodic() functions respectively.
      
      This patch moves the setting of the clockevent device's rate to its
      registration.
      
      Note that there has been some back and forth regarding this question with
      respect to the clocksource also provided by this driver:
        commit 66f49121 ("clocksource: sh_tmu: compute mult and shift before
                              registration")
      moves the rate determination from the clocksource's ->enable() function to
      before its registration. OTOH, the later
        commit 0aeac458 ("clocksource: sh_tmu: __clocksource_updatefreq_hz()
                              update")
      basically reverts this, saying
        "Without this patch the old code uses clocksource_register() together
         with a hack that assumes a never changing clock rate."
      
      However, I checked all current sh_tmu users in arch/sh as well as in
      arch/arm/mach-shmobile carefully and right now, none of them changes any
      rate in any clock tree relevant to sh_tmu after their respective
      time_init(). Since all sh_tmu instances are created after time_init(), none
      of them should ever observe any clock rate changes.
      
      What's more, both, a clocksource as well as a clockevent device, can
      immediately get selected for use at their registration and thus, enabled
      at this point already. So it's probably safer to assume a "never changing
      clock rate" here.
      
      - Move the struct sh_tmu_channel's ->rate member to struct sh_tmu_device:
        it's a property of the underlying clock which is in turn specific to
        the sh_tmu_device.
      - Determine the ->rate value in sh_tmu_setup() at device probing rather
        than at first usage.
      - Set the clockevent device's rate at its registration.
      - Although not strictly necessary for the upcoming clockevent core changes,
        set the clocksource's rate at its registration for consistency.
      Signed-off-by: NNicolai Stange <nicstange@gmail.com>
      Signed-off-by: NJohn Stultz <john.stultz@linaro.org>
      c3c0a20d
    • N
      clocksource: sh_cmt: Compute rate before registration again · 890f423b
      Nicolai Stange 提交于
      With the upcoming NTP correction related rate adjustments to be implemented
      in the clockevents core, the latter needs to get informed about every rate
      change of a clockevent device made after its registration.
      
      Currently, sh_cmt violates this requirement in that it registers its
      clockevent device with a dummy rate and sets its final ->mult and ->shift
      values from its ->set_state_oneshot() and ->set_state_periodic() functions
      respectively.
      
      This patch moves the setting of the clockevent device's ->mult and ->shift
      values to before its registration.
      
      Note that there has been some back and forth regarding this question with
      respect to the clocksource also provided by this driver:
        commit f4d7c356 ("clocksource: sh_cmt: compute mult and shift before
                              registration")
      moves the rate determination from the clocksource's ->enable() function to
      before its registration. OTOH, the later
        commit 3593f5fe ("clocksource: sh_cmt: __clocksource_updatefreq_hz()
                              update")
      basically reverts this, saying
        "Without this patch the old code uses clocksource_register() together
         with a hack that assumes a never changing clock rate."
      
      However, I checked all current sh_cmt users in arch/sh as well as in
      arch/arm/mach-shmobile carefully and right now, none of them changes any
      rate in any clock tree relevant to sh_cmt after their respective
      time_init(). Since all sh_cmt instances are created after time_init(), none
      of them should ever observe any clock rate changes.
      
      What's more, both, a clocksource as well as a clockevent device, can
      immediately get selected for use at their registration and thus, enabled
      at this point already. So it's probably safer to assume a "never changing
      clock rate" here.
      
      - Move the struct sh_cmt_channel's ->rate member to struct sh_cmt_device:
        it's a property of the underlying clock which is in turn specific to
        the sh_cmt_device.
      - Determine the ->rate value in sh_cmt_setup() at device probing rather
        than at first usage.
      - Set the clockevent device's ->mult and ->shift values right before its
        registration.
      - Although not strictly necessary for the upcoming clockevent core changes,
        set the clocksource's rate at its registration for consistency.
      Signed-off-by: NNicolai Stange <nicstange@gmail.com>
      Signed-off-by: NJohn Stultz <john.stultz@linaro.org>
      890f423b
  3. 20 3月, 2017 6 次提交
    • L
      Linux 4.11-rc3 · 97da3854
      Linus Torvalds 提交于
      97da3854
    • L
      mm/swap: don't BUG_ON() due to uninitialized swap slot cache · 452b94b8
      Linus Torvalds 提交于
      This BUG_ON() triggered for me once at shutdown, and I don't see a
      reason for the check.  The code correctly checks whether the swap slot
      cache is usable or not, so an uninitialized swap slot cache is not
      actually problematic afaik.
      
      I've temporarily just switched the BUG_ON() to a WARN_ON_ONCE(), since
      I'm not sure why that seemingly pointless check was there.  I suspect
      the real fix is to just remove it entirely, but for now we'll warn about
      it but not bring the machine down.
      
      Cc: "Huang, Ying" <ying.huang@intel.com>
      Cc: Tim Chen <tim.c.chen@linux.intel.com>
      Cc: Michal Hocko <mhocko@suse.com>
      Cc: Andrew Morton <akpm@linux-foundation.org>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      452b94b8
    • L
      Merge tag 'powerpc-4.11-5' of git://git.kernel.org/pub/scm/linux/kernel/git/powerpc/linux · a07a6e41
      Linus Torvalds 提交于
      Pull more powerpc fixes from Michael Ellerman:
       "A couple of minor powerpc fixes for 4.11:
      
         - wire up statx() syscall
      
         - don't print a warning on memory hotplug when HPT resizing isn't
           available
      
        Thanks to: David Gibson, Chandan Rajendra"
      
      * tag 'powerpc-4.11-5' of git://git.kernel.org/pub/scm/linux/kernel/git/powerpc/linux:
        powerpc/pseries: Don't give a warning when HPT resizing isn't available
        powerpc: Wire up statx() syscall
      a07a6e41
    • L
      Merge branch 'parisc-4.11-2' of git://git.kernel.org/pub/scm/linux/kernel/git/deller/parisc-linux · 4571bc5a
      Linus Torvalds 提交于
      Pull parisc fixes from Helge Deller:
      
       - Mikulas Patocka added support for R_PARISC_SECREL32 relocations in
         modules with CONFIG_MODVERSIONS.
      
       - Dave Anglin optimized the cache flushing for vmap ranges.
      
       - Arvind Yadav provided a fix for a potential NULL pointer dereference
         in the parisc perf code (and some code cleanups).
      
       - I wired up the new statx system call, fixed some compiler warnings
         with the access_ok() macro and fixed shutdown code to really halt a
         system at shutdown instead of crashing & rebooting.
      
      * 'parisc-4.11-2' of git://git.kernel.org/pub/scm/linux/kernel/git/deller/parisc-linux:
        parisc: Fix system shutdown halt
        parisc: perf: Fix potential NULL pointer dereference
        parisc: Avoid compiler warnings with access_ok()
        parisc: Wire up statx system call
        parisc: Optimize flush_kernel_vmap_range and invalidate_kernel_vmap_range
        parisc: support R_PARISC_SECREL32 relocation in modules
      4571bc5a
    • L
      Merge git://git.kernel.org/pub/scm/linux/kernel/git/nab/target-pending · 8aa34172
      Linus Torvalds 提交于
      Pull SCSI target fixes from Nicholas Bellinger:
       "The bulk of the changes are in qla2xxx target driver code to address
        various issues found during Cavium/QLogic's internal testing (stable
        CC's included), along with a few other stability and smaller
        miscellaneous improvements.
      
        There are also a couple of different patch sets from Mike Christie,
        which have been a result of his work to use target-core ALUA logic
        together with tcm-user backend driver.
      
        Finally, a patch to address some long standing issues with
        pass-through SCSI export of TYPE_TAPE + TYPE_MEDIUM_CHANGER devices,
        which will make folks using physical (or virtual) magnetic tape happy"
      
      * git://git.kernel.org/pub/scm/linux/kernel/git/nab/target-pending: (28 commits)
        qla2xxx: Update driver version to 9.00.00.00-k
        qla2xxx: Fix delayed response to command for loop mode/direct connect.
        qla2xxx: Change scsi host lookup method.
        qla2xxx: Add DebugFS node to display Port Database
        qla2xxx: Use IOCB interface to submit non-critical MBX.
        qla2xxx: Add async new target notification
        qla2xxx: Export DIF stats via debugfs
        qla2xxx: Improve T10-DIF/PI handling in driver.
        qla2xxx: Allow relogin to proceed if remote login did not finish
        qla2xxx: Fix sess_lock & hardware_lock lock order problem.
        qla2xxx: Fix inadequate lock protection for ABTS.
        qla2xxx: Fix request queue corruption.
        qla2xxx: Fix memory leak for abts processing
        qla2xxx: Allow vref count to timeout on vport delete.
        tcmu: Convert cmd_time_out into backend device attribute
        tcmu: make cmd timeout configurable
        tcmu: add helper to check if dev was configured
        target: fix race during implicit transition work flushes
        target: allow userspace to set state to transitioning
        target: fix ALUA transition timeout handling
        ...
      8aa34172
    • L
      Merge branch 'libnvdimm-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/nvdimm/nvdimm · 1b8df619
      Linus Torvalds 提交于
      Pull device-dax fixes from Dan Williams:
       "The device-dax driver was not being careful to handle falling back to
        smaller fault-granularity sizes.
      
        The driver already fails fault attempts that are smaller than the
        device's alignment, but it also needs to handle the cases where a
        larger page mapping could be established. For simplicity of the
        immediate fix the implementation just signals VM_FAULT_FALLBACK until
        fault-size == device-alignment.
      
        One fix is for -stable to address pmd-to-pte fallback from the
        original implementation, another fix is for the new (introduced in
        4.11-rc1) pud-to-pmd regression, and a typo fix comes along for the
        ride.
      
        These have received a build success notification from the kbuild
        robot"
      
      * 'libnvdimm-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/nvdimm/nvdimm:
        device-dax: fix debug output typo
        device-dax: fix pud fault fallback handling
        device-dax: fix pmd/pte fault fallback handling
      1b8df619
  4. 19 3月, 2017 24 次提交