1. 26 Jun, 2015 13 commits
  2. 25 Jun, 2015 2 commits
    •
      perf bench numa: Fix to show proper convergence stats · 2b42b09b
      Srikar Dronamraju committed
      Since commit e1e455f4 ("perf tools: Work around lack of sched_getcpu
      in glibc < 2.6"), 'perf bench numa mem' with the -c or -m option is not
      able to calculate convergence correctly.
      
      With that commit applied, sched_getcpu always seems to return -1. The
      intention of commit e1e455f4 was to provide a sched_getcpu only for
      glibc < 2.6. Hence keep the sched_getcpu definition under an ifdef.
      
      This regression occurred between v4.0 and v4.1.
      
      Signed-off-by: Srikar Dronamraju <srikar@linux.vnet.ibm.com>
      Acked-by: Ingo Molnar <mingo@kernel.org>
      Cc: Jiri Olsa <jolsa@kernel.org>
      Cc: Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>
      Cc: Namhyung Kim <namhyung@kernel.org>
      Cc: Vinson Lee <vlee@twitter.com>
      Fixes: e1e455f4 ("perf tools: Work around lack of sched_getcpu in glibc < 2.6")
      Link: http://lkml.kernel.org/r/20150624111004.GA5220@linux.vnet.ibm.com
      Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
      2b42b09b
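The fix described in this commit keeps the fallback definition of sched_getcpu behind a preprocessor guard, so the stub is compiled only where glibc does not provide the real function. A minimal sketch of that idea follows. It assumes a glibc-style `__GLIBC_PREREQ` version check; the exact guard used by the commit is not shown in the log above, and the helper name `current_cpu_or_error` is hypothetical.

```c
#ifndef _GNU_SOURCE
#define _GNU_SOURCE /* asks glibc's <sched.h> to declare sched_getcpu() */
#endif
#include <errno.h>
#include <sched.h>

/* Explicit prototype; harmless if <sched.h> has already declared it. */
int sched_getcpu(void);

#if defined(__GLIBC_PREREQ)
# if !__GLIBC_PREREQ(2, 6)
/*
 * Only glibc older than 2.6 lacks sched_getcpu(), so only then do we
 * supply a stub that reports "not supported". Without this guard the
 * stub could shadow the real function and always return -1.
 */
int sched_getcpu(void)
{
	errno = ENOSYS;
	return -1;
}
# endif
#endif

/* Hypothetical helper: the CPU the caller runs on, or -1 on failure. */
int current_cpu_or_error(void)
{
	return sched_getcpu();
}
```

On a current glibc the guard compiles the stub out, so `current_cpu_or_error()` reaches the real library call.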
    •
      Merge tag 'perf-core-for-mingo' of... · 27451700
      Ingo Molnar committed
      Merge tag 'perf-core-for-mingo' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core
      
      Pull perf/core improvements and fixes from Arnaldo Carvalho de Melo:
      
      User visible changes:
      
        - Move the event toggling logic from 'perf top' into the hists browser,
          allowing freeze/unfreeze with event lists with more than one entry (Namhyung Kim)
      
        - Add missing newlines when dumping PERF_RECORD_FINISHED_ROUND and
          showing the Aggregated stats in 'perf report -D' (Adrian Hunter)
      
      Infrastructure changes:
      
        - Allow auxtrace data alignment (Adrian Hunter)
      
        - Allow events with dot (Andi Kleen)
      
        - Fix failure to 'perf probe' events on arm (He Kuang)
      
        - Add testing for Makefile.perf (Jiri Olsa)
      
        - Add test for make install with prefix (Jiri Olsa)
      
        - Fix single target build dependency check (Jiri Olsa)
      
        - Access thread_map entries via accessors, prep patch to hold more info per
          entry, for ongoing 'perf stat --per-thread' work (Jiri Olsa)
      
        - Use __weak definition from compiler.h (Sukadev Bhattiprolu)
      
        - Split perf_pmu__new_alias() (Sukadev Bhattiprolu)
      Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
      Signed-off-by: Ingo Molnar <mingo@kernel.org>
      27451700
  3. 24 Jun, 2015 9 commits
  4. 23 Jun, 2015 16 commits
    •
      Merge branch 'irq-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 407a2c72
      Linus Torvalds committed
      Pull irq updates from Thomas Gleixner:
       "The irq department delivers:
      
         - plug a potential race related to chained interrupt handlers
      
         - core updates which address the needs of the x86 irqdomain conversion
      
         - new irqchip callback to support affinity settings for VCPUs
      
         - the usual pile of updates to interrupt chip drivers
      
         - a few helper functions to allow further cleanups and
           simplifications
      
        I have a largish pile of coccinelle scripted/verified cleanups and
        simplifications pending on top of that, but I prefer to send that
        towards the end of the merge window when the arch/driver changes have
        hit your tree to avoid API change wreckage as far as possible"
      
      * 'irq-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (34 commits)
        genirq: Remove bogus restriction in irq_move_mask_irq()
        irqchip: atmel-aic5: Add sama5d2 support
        irq: spear-shirq: Fix race in installing chained IRQ handler
        irq: irq-keystone: Fix race in installing chained IRQ handler
        gpio: gpio-tegra: Fix race in installing chained IRQ handler
        gpio: gpio-mxs: Fix race in installing chained IRQ handler
        gpio: gpio-mxc: Fix race in installing chained IRQ handler
        ARM: gemini: Fix race in installing GPIO chained IRQ handler
        GPU: ipu: Fix race in installing IPU chained IRQ handler
        ARM: sa1100: convert SA11x0 related code to use new chained handler helper
        irq: Add irq_set_chained_handler_and_data()
        irqchip: exynos-combiner: Save IRQ enable set on suspend
        genirq: Introduce helper function irq_data_get_affinity_mask()
        genirq: Introduce helper function irq_data_get_node()
        genirq: Introduce struct irq_common_data to host shared irq data
        genirq: Prevent crash in irq_move_irq()
        genirq: Enhance irq_data_to_desc() to support hierarchy irqdomain
        irqchip: gic: Simplify gic_configure_irq by using IRQCHIP_SET_TYPE_MASKED
        irqchip: renesas: intc-irqpin: Improve binding documentation
        genirq: Set IRQCHIP_SKIP_SET_WAKE for no_irq_chip
        ...
      407a2c72
    •
      Merge branch 'timers-nohz-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 3a95398f
      Linus Torvalds committed
      Pull NOHZ updates from Thomas Gleixner:
       "A few updates to the nohz infrastructure:
      
         - recursion protection for context tracking
      
         - make the TIF_NOHZ inheritance smarter
      
         - isolate cpus which belong to the NOHZ full set"
      
      * 'timers-nohz-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        nohz: Set isolcpus when nohz_full is set
        nohz: Add tick_nohz_full_add_cpus_to() API
        context_tracking: Inherit TIF_NOHZ through forks instead of context switches
        context_tracking: Protect against recursion
      3a95398f
    •
      Merge branch 'timers-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 43224b96
      Linus Torvalds committed
      Pull timer updates from Thomas Gleixner:
       "A rather largish update for everything time and timer related:
      
         - Cache footprint optimizations for both hrtimers and timer wheel
      
         - Lower the NOHZ impact on systems which have NOHZ or timer migration
           disabled at runtime.
      
         - Optimize run time overhead of hrtimer interrupt by making the clock
           offset updates smarter
      
         - hrtimer cleanups and removal of restrictions to tackle some
           problems in sched/perf
      
         - Some more leap second tweaks
      
         - Another round of changes addressing the 2038 problem
      
         - First step to change the internals of clock event devices by
           introducing the necessary infrastructure
      
         - Allow constant folding for usecs/msecs_to_jiffies()
      
         - The usual pile of clockevent/clocksource driver updates
      
        The hrtimer changes contain updates to sched, perf and x86 as they
        depend on them plus changes all over the tree to cleanup API changes
        and redundant code, which got copied all over the place.  The y2038
        changes touch s390 to remove the last non 2038 safe code related to
        boot/persistent clock"
      
      * 'timers-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (114 commits)
        clocksource: Increase dependencies of timer-stm32 to limit build wreckage
        timer: Minimize nohz off overhead
        timer: Reduce timer migration overhead if disabled
        timer: Stats: Simplify the flags handling
        timer: Replace timer base by a cpu index
        timer: Use hlist for the timer wheel hash buckets
        timer: Remove FIFO "guarantee"
        timers: Sanitize catchup_timer_jiffies() usage
        hrtimer: Allow hrtimer::function() to free the timer
        seqcount: Introduce raw_write_seqcount_barrier()
        seqcount: Rename write_seqcount_barrier()
        hrtimer: Fix hrtimer_is_queued() hole
        hrtimer: Remove HRTIMER_STATE_MIGRATE
        selftest: Timers: Avoid signal deadlock in leap-a-day
        timekeeping: Copy the shadow-timekeeper over the real timekeeper last
        clockevents: Check state instead of mode in suspend/resume path
        selftests: timers: Add leap-second timer edge testing to leap-a-day.c
        ntp: Do leapsecond adjustment in adjtimex read path
        time: Prevent early expiry of hrtimers[CLOCK_REALTIME] at the leap second edge
        ntp: Introduce and use SECS_PER_DAY macro instead of 86400
        ...
      43224b96
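Constant folding for usecs/msecs_to_jiffies() means that a call with a compile-time constant argument can be reduced to a constant by the compiler instead of going through a function call at run time. A rough userspace sketch of the technique follows. It assumes GCC or Clang (for `__builtin_constant_p`) and a made-up tick rate of 250 Hz; every name in it is hypothetical, and it is not the kernel's implementation.

```c
/* Assumed tick rate for this sketch only: 250 ticks per second. */
#define HZ_SKETCH 250u
#define MSEC_PER_SEC_SKETCH 1000u
#define MSEC_PER_JIFFY_SKETCH (MSEC_PER_SEC_SKETCH / HZ_SKETCH)

/* Out-of-line path, used when the argument is only known at run time. */
unsigned long msecs_to_jiffies_runtime(unsigned int ms)
{
	/* Round up so that a partial tick still counts as one jiffy. */
	return (ms + MSEC_PER_JIFFY_SKETCH - 1) / MSEC_PER_JIFFY_SKETCH;
}

/*
 * If the compiler can prove the argument is a constant, expand the
 * arithmetic inline so it folds to a constant; otherwise call the
 * out-of-line function.
 */
#define msecs_to_jiffies_sketch(ms)                                        \
	(__builtin_constant_p(ms)                                           \
		? (unsigned long)(((ms) + MSEC_PER_JIFFY_SKETCH - 1) /      \
				  MSEC_PER_JIFFY_SKETCH)                    \
		: msecs_to_jiffies_runtime(ms))
```

Both paths compute the same value, so the choice affects only the generated code, not the result.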
    •
      Merge branch 'x86-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · d70b3ef5
      Linus Torvalds committed
      Pull x86 core updates from Ingo Molnar:
       "There were so many changes in the x86/asm, x86/apic and x86/mm topics
        in this cycle that the topical separation of -tip broke down somewhat -
        so the result is a more traditional architecture pull request,
        collected into the 'x86/core' topic.
      
        The topics were still maintained separately as far as possible, so
        bisectability and conceptual separation should still be pretty good -
        but there were a handful of merge points to avoid excessive
        dependencies (and conflicts) that would have been poorly tested in the
        end.
      
        The next cycle will hopefully be much more quiet (or at least will
        have fewer dependencies).
      
        The main changes in this cycle were:
      
         * x86/apic changes, with related IRQ core changes: (Jiang Liu, Thomas
           Gleixner)
      
           - This is the second and most intrusive part of changes to the x86
             interrupt handling - full conversion to hierarchical interrupt
             domains:
      
                [IOAPIC domain]   -----
                                       |
                [MSI domain]      --------[Remapping domain] ----- [ Vector domain ]
                                       |   (optional)          |
                [HPET MSI domain] -----                        |
                                                               |
                [DMAR domain]     -----------------------------
                                                               |
                [Legacy domain]   -----------------------------
      
             This now reflects the actual hardware and allowed us to disentangle
             the domain specific code from the underlying parent domain, which
             can be optional in the case of interrupt remapping.  It's a clear
             separation of functionality and removes quite some duct tape
             constructs which plugged the remap code between ioapic/msi/hpet
             and the vector management.
      
           - Intel IOMMU IRQ remapping enhancements, to allow direct interrupt
             injection into guests (Feng Wu)
      
         * x86/asm changes:
      
           - Tons of cleanups and small speedups, micro-optimizations.  This
             is in preparation to move a good chunk of the low level entry
             code from assembly to C code (Denys Vlasenko, Andy Lutomirski,
             Brian Gerst)
      
           - Moved all system entry related code to a new home under
             arch/x86/entry/ (Ingo Molnar)
      
           - Removal of the fragile and ugly CFI dwarf debuginfo annotations.
             Conversion to C will reintroduce many of them - but meanwhile
             they are only getting in the way, and the upstream kernel does
             not rely on them (Ingo Molnar)
      
           - NOP handling refinements. (Borislav Petkov)
      
         * x86/mm changes:
      
           - Big PAT and MTRR rework: making the code more robust and
             preparing to phase out exposing direct MTRR interfaces to drivers -
             in favor of using PAT driven interfaces (Toshi Kani, Luis R
             Rodriguez, Borislav Petkov)
      
           - New ioremap_wt()/set_memory_wt() interfaces to support
             Write-Through cached memory mappings.  This is especially
             important for good performance on NVDIMM hardware (Toshi Kani)
      
         * x86/ras changes:
      
           - Add support for deferred errors on AMD (Aravind Gopalakrishnan)
      
             This is an important RAS feature which adds hardware support for
             poisoned data.  That means roughly that the hardware marks data
             which it has detected as corrupted but wasn't able to correct, as
             poisoned data and raises an APIC interrupt to signal that in the
             form of a deferred error.  It is the OS's responsibility then to
             take proper recovery action and thus prolong system lifetime as
             far as possible.
      
           - Add support for Intel "Local MCE"s: upcoming CPUs will support
             CPU-local MCE interrupts, as opposed to the traditional system-
             wide broadcasted MCE interrupts (Ashok Raj)
      
           - Misc cleanups (Borislav Petkov)
      
         * x86/platform changes:
      
           - Intel Atom SoC updates
      
        ... and lots of other cleanups, fixlets and other changes - see the
        shortlog and the Git log for details"
      
      * 'x86-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (222 commits)
        x86/hpet: Use proper hpet device number for MSI allocation
        x86/hpet: Check for irq==0 when allocating hpet MSI interrupts
        x86/mm/pat, drivers/infiniband/ipath: Use arch_phys_wc_add() and require PAT disabled
        x86/mm/pat, drivers/media/ivtv: Use arch_phys_wc_add() and require PAT disabled
        x86/platform/intel/baytrail: Add comments about why we disabled HPET on Baytrail
        genirq: Prevent crash in irq_move_irq()
        genirq: Enhance irq_data_to_desc() to support hierarchy irqdomain
        iommu, x86: Properly handle posted interrupts for IOMMU hotplug
        iommu, x86: Provide irq_remapping_cap() interface
        iommu, x86: Setup Posted-Interrupts capability for Intel iommu
        iommu, x86: Add cap_pi_support() to detect VT-d PI capability
        iommu, x86: Avoid migrating VT-d posted interrupts
        iommu, x86: Save the mode (posted or remapped) of an IRTE
        iommu, x86: Implement irq_set_vcpu_affinity for intel_ir_chip
        iommu: dmar: Provide helper to copy shared irte fields
        iommu: dmar: Extend struct irte for VT-d Posted-Interrupts
        iommu: Add new member capability to struct irq_remap_ops
        x86/asm/entry/64: Disentangle error_entry/exit gsbase/ebx/usermode code
        x86/asm/entry/32: Shorten __audit_syscall_entry() args preparation
        x86/asm/entry/32: Explain reloading of registers after __audit_syscall_entry()
        ...
      d70b3ef5
    •
      Merge branch 'x86-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 650ec5a6
      Linus Torvalds committed
      Pull x86 warning fixlet from Ingo Molnar:
       "A build fix for certain (rare) variants of binutils that did not make
        it into v4.1"
      
      * 'x86-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/boot: Fix overflow warning with 32-bit binutils
      650ec5a6
    •
      Merge branch 'x86-microcode-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 35ffccdb
      Linus Torvalds committed
      Pull x86 microcode updates from Ingo Molnar:
       "x86 microcode loader updates from Borislav Petkov:
      
         - early parsing of the built-in microcode
      
         - cleanups
      
         - misc smaller fixes"
      
      * 'x86-microcode-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/microcode: Correct CPU family related variable types
        x86/microcode: Disable builtin microcode loading on 32-bit for now
        x86/microcode/intel: Rename get_matching_sig()
        x86/microcode/intel: Simplify get_matching_sig()
        x86/microcode/intel: Simplify update_match_cpu()
        x86/microcode/intel: Rename get_matching_microcode
        x86/cpu/microcode: Zap changelog
        x86/microcode: Parse built-in microcode early
        x86/microcode/intel: Remove unused @rev arg of get_matching_sig()
        x86/microcode/intel: Get rid of revision_is_newer()
      35ffccdb
    •
      Merge branch 'x86-kdump-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · e2172d8f
      Linus Torvalds committed
      Pull x86 kdump updates from Ingo Molnar:
       "Three kdump robustness related improvements (Joerg Roedel)"
      
      * 'x86-kdump-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/crash: Allocate enough low memory when crashkernel=high
        x86/swiotlb: Try coherent allocations with __GFP_NOWARN
        swiotlb: Warn on allocation failure in swiotlb_alloc_coherent()
      e2172d8f
    •
      Merge branch 'x86-fpu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · e75c73ad
      Linus Torvalds committed
      Pull x86 FPU updates from Ingo Molnar:
       "This tree contains two main changes:
      
         - The big FPU code rewrite: wide reaching cleanups and reorganization
           that pulls all the FPU code together into a clean base in
           arch/x86/fpu/.
      
           The resulting code is leaner and faster, and much easier to
           understand.  This enables future work to further simplify the FPU
           code (such as removing lazy FPU restores).
      
           By its nature these changes have a substantial regression risk: FPU
           code related bugs are long lived, because races are often subtle
           and bugs mask as user-space failures that are difficult to track
           back to kernel side bugs.  I'm aware of no unfixed (or even
           suspected) FPU related regression so far.
      
         - MPX support rework/fixes.  As this is still not a released CPU
           feature, there were some buglets in the code - should be much more
           robust now (Dave Hansen)"
      
      * 'x86-fpu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (250 commits)
        x86/fpu: Fix double-increment in setup_xstate_features()
        x86/mpx: Allow 32-bit binaries on 64-bit kernels again
        x86/mpx: Do not count MPX VMAs as neighbors when unmapping
        x86/mpx: Rewrite the unmap code
        x86/mpx: Support 32-bit binaries on 64-bit kernels
        x86/mpx: Use 32-bit-only cmpxchg() for 32-bit apps
        x86/mpx: Introduce new 'directory entry' to 'addr' helper function
        x86/mpx: Add temporary variable to reduce masking
        x86: Make is_64bit_mm() widely available
        x86/mpx: Trace allocation of new bounds tables
        x86/mpx: Trace the attempts to find bounds tables
        x86/mpx: Trace entry to bounds exception paths
        x86/mpx: Trace #BR exceptions
        x86/mpx: Introduce a boot-time disable flag
        x86/mpx: Restrict the mmap() size check to bounds tables
        x86/mpx: Remove redundant MPX_BNDCFG_ADDR_MASK
        x86/mpx: Clean up the code by not passing a task pointer around when unnecessary
        x86/mpx: Use the new get_xsave_field_ptr()API
        x86/fpu/xstate: Wrap get_xsave_addr() to make it safer
        x86/fpu/xstate: Fix up bad get_xsave_addr() assumptions
        ...
      e75c73ad
    •
      Merge branch 'x86-efi-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · cfe3eceb
      Linus Torvalds committed
      Pull x86 EFI updates from Ingo Molnar:
       "EFI changes:
      
         - Use idiomatic negative error values in efivar_create_sysfs_entry()
           instead of returning '1' to indicate error (Dan Carpenter)
      
         - Implement new support to expose the EFI System Resource Tables in
           sysfs, which provides information for performing firmware updates
           (Peter Jones)
      
         - Documentation cleanup in the EFI handover protocol section which
           falsely claimed that 'cmdline_size' needed to be filled out by the
           boot loader (Alex Smith)
      
         - Align the order of SMBIOS tables in /sys/firmware/efi/systab to
           match the way that we do things for ACPI and add documentation to
           Documentation/ABI (Jean Delvare)"
      
      * 'x86-efi-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        efi: Work around ia64 build problem with ESRT driver
        efi: Add 'systab' information to Documentation/ABI
        efi: dmi: List SMBIOS3 table before SMBIOS table
        efi/esrt: Fix some compiler warnings
        x86, doc: Remove cmdline_size from list of fields to be filled in for EFI handover
        efi: Add esrt support
        efi: efivar_create_sysfs_entry() should return negative error codes
      cfe3eceb
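The efivar_create_sysfs_entry() change above follows the kernel convention that a function returns 0 on success and a negative errno value on failure, so callers can tell what went wrong instead of seeing a bare '1'. A small userspace sketch of that convention follows. The function `create_entry_sketch` and its parameters are hypothetical and unrelated to the real EFI code.

```c
#include <errno.h>
#include <stdlib.h>
#include <string.h>

/*
 * Hypothetical example: duplicate an entry name into *out.
 * Returns 0 on success or a negative errno value on failure.
 */
int create_entry_sketch(const char *name, char **out)
{
	if (name == NULL || out == NULL || name[0] == '\0')
		return -EINVAL;	/* bad argument, not a generic "1" */

	size_t len = strlen(name) + 1;
	char *copy = malloc(len);
	if (copy == NULL)
		return -ENOMEM;	/* allocation failure is distinguishable */

	memcpy(copy, name, len);
	*out = copy;
	return 0;
}
```

A caller can then test `if (ret < 0)` and pass the code on unchanged.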
    •
      Merge branch 'x86-debug-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 5ef6ca4f
      Linus Torvalds committed
      Pull x86 debugging documentation updates from Ingo Molnar:
       "Documentation updates about x86 kernel stacks"
      
      * 'x86-debug-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/Documentation: Adapt Ingo's explanation on printing backtraces
        x86/Documentation: Remove STACKFAULT_STACK bulletpoint
        x86/Documentation: Move kernel-stacks doc one level up
      5ef6ca4f
    •
      Merge branch 'x86-cpu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · b3ba283d
      Linus Torvalds committed
      Pull x86 CPU features from Ingo Molnar:
       "Various CPU feature support related changes: in particular the
        /proc/cpuinfo model name sanitization change should be monitored, it
        has a chance to break stuff.  (but really shouldn't and there are no
        regression reports)"
      
      * 'x86-cpu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/cpu/amd: Give access to the number of nodes in a physical package
        x86/cpu: Trim model ID whitespace
        x86/cpu: Strip any /proc/cpuinfo model name field whitespace
        x86/cpu/amd: Set X86_FEATURE_EXTD_APICID for future processors
        x86/gart: Check for GART support before accessing GART registers
      b3ba283d
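The model name sanitization mentioned above removes stray whitespace from the string shown in /proc/cpuinfo, which is why tools that parse that field should be watched. The summary does not spell out the exact rules, so the sketch below only assumes that leading and trailing whitespace are removed; `trim_model_name` is a hypothetical helper, not the kernel function.

```c
#include <ctype.h>
#include <string.h>

/* Hypothetical helper: strip leading and trailing whitespace in place. */
char *trim_model_name(char *s)
{
	char *start = s;

	/* Skip leading whitespace. */
	while (*start && isspace((unsigned char)*start))
		start++;

	/* Find the length without trailing whitespace. */
	size_t len = strlen(start);
	while (len > 0 && isspace((unsigned char)start[len - 1]))
		len--;

	/* Shift the kept part to the front and terminate it. */
	memmove(s, start, len);
	s[len] = '\0';
	return s;
}
```

Whitespace inside the name is left alone in this sketch; whether the kernel also collapses it is not stated here.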
    •
      Merge branch 'x86-cleanups-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · d43e4f44
      Linus Torvalds committed
      Pull x86 cleanups from Ingo Molnar:
       "Misc cleanups"
      
      * 'x86-cleanups-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/mm: Clean up types in xlate_dev_mem_ptr() some more
        x86: Deinline dma_free_attrs()
        x86: Deinline dma_alloc_attrs()
        x86: Remove unused TI_cpu
        x86: Merge common 32-bit values in asm-offsets.c
      d43e4f44
    •
      Merge branch 'sched-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 23b77762
      Linus Torvalds committed
      Pull scheduler updates from Ingo Molnar:
       "The main changes are:
      
         - lockless wakeup support for futexes and IPC message queues
           (Davidlohr Bueso, Peter Zijlstra)
      
         - Replace spinlocks with atomics in thread_group_cputimer(), to
           improve scalability (Jason Low)
      
         - NUMA balancing improvements (Rik van Riel)
      
         - SCHED_DEADLINE improvements (Wanpeng Li)
      
         - clean up and reorganize preemption helpers (Frederic Weisbecker)
      
         - decouple page fault disabling machinery from the preemption
           counter, to improve debuggability and robustness (David
           Hildenbrand)
      
         - SCHED_DEADLINE documentation updates (Luca Abeni)
      
         - topology CPU masks cleanups (Bartosz Golaszewski)
      
         - /proc/sched_debug improvements (Srikar Dronamraju)"
      
      * 'sched-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (79 commits)
        sched/deadline: Remove needless parameter in dl_runtime_exceeded()
        sched: Remove superfluous resetting of the p->dl_throttled flag
        sched/deadline: Drop duplicate init_sched_dl_class() declaration
        sched/deadline: Reduce rq lock contention by eliminating locking of non-feasible target
        sched/deadline: Make init_sched_dl_class() __init
        sched/deadline: Optimize pull_dl_task()
        sched/preempt: Add static_key() to preempt_notifiers
        sched/preempt: Fix preempt notifiers documentation about hlist_del() within unsafe iteration
        sched/stop_machine: Fix deadlock between multiple stop_two_cpus()
        sched/debug: Add sum_sleep_runtime to /proc/<pid>/sched
        sched/debug: Replace vruntime with wait_sum in /proc/sched_debug
        sched/debug: Properly format runnable tasks in /proc/sched_debug
        sched/numa: Only consider less busy nodes as numa balancing destinations
        Revert 095bebf6 ("sched/numa: Do not move past the balance point if unbalanced")
        sched/fair: Prevent throttling in early pick_next_task_fair()
        preempt: Reorganize the notrace definitions a bit
        preempt: Use preempt_schedule_context() as the official tracing preemption point
        sched: Make preempt_schedule_context() function-tracing safe
        x86: Remove cpu_sibling_mask() and cpu_core_mask()
        x86: Replace cpu_**_mask() with topology_**_cpumask()
        ...
      23b77762
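One item in the summary above replaces spinlocks with atomics in thread_group_cputimer() to improve scalability. The general idea is that counters shared by many threads can be updated with atomic operations instead of taking a lock around each update. A minimal C11 sketch follows. The struct and function names are hypothetical, and the real kernel code uses its own atomic64 primitives rather than `<stdatomic.h>`.

```c
#include <stdatomic.h>
#include <stdint.h>

/* Hypothetical shared accounting state, updated by many threads. */
struct cputimer_sketch {
	_Atomic uint64_t utime;
	_Atomic uint64_t stime;
};

/* Lock-free accumulation: a single atomic add, no spinlock needed. */
static inline void account_utime(struct cputimer_sketch *t, uint64_t delta)
{
	atomic_fetch_add_explicit(&t->utime, delta, memory_order_relaxed);
}

/* Raise *field to value unless another thread already stored more. */
static inline void update_max(_Atomic uint64_t *field, uint64_t value)
{
	uint64_t cur = atomic_load_explicit(field, memory_order_relaxed);

	while (cur < value &&
	       !atomic_compare_exchange_weak_explicit(field, &cur, value,
						      memory_order_relaxed,
						      memory_order_relaxed))
		; /* cur was refreshed by the failed exchange; retry */
}
```

Relaxed ordering is enough in this sketch because each field is an independent counter; code that needs a consistent snapshot of several fields would need more care.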
    •
      Merge branch 'perf-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 6bc4c3ad
      Linus Torvalds committed
      Pull perf fixes from Ingo Molnar:
       "These are the left over fixes from the v4.1 cycle"
      
      * 'perf-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        perf tools: Fix build breakage if prefix= is specified
        perf/x86: Honor the architectural performance monitoring version
        perf/x86/intel: Fix PMI handling for Intel PT
        perf/x86/intel/bts: Fix DS area sharing with x86_pmu events
        perf/x86: Add more Broadwell model numbers
        perf: Fix ring_buffer_attach() RCU sync, again
      6bc4c3ad
    •
      Merge branch 'perf-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · c58267e9
      Linus Torvalds committed
      Pull perf updates from Ingo Molnar:
       "Kernel side changes mostly consist of work on x86 PMU drivers:
      
         - x86 Intel PT (hardware CPU tracer) improvements (Alexander
           Shishkin)
      
         - x86 Intel CQM (cache quality monitoring) improvements (Thomas
           Gleixner)
      
         - x86 Intel PEBSv3 support (Peter Zijlstra)
      
         - x86 Intel PEBS interrupt batching support for lower overhead
           sampling (Zheng Yan, Kan Liang)
      
         - x86 PMU scheduler fixes and improvements (Peter Zijlstra)
      
        There are too many tooling improvements to list them all - here are a
        few select highlights:
      
        'perf bench':
      
            - Introduce new 'perf bench futex' benchmark: 'wake-parallel', to
              measure parallel waker threads generating contention for kernel
              locks (hb->lock). (Davidlohr Bueso)
      
        'perf top', 'perf report':
      
            - Allow disabling/enabling events dynamically in 'perf top':
              a 'perf top' session can instantly become a 'perf report'
              one, i.e. go from dynamic analysis to a static one, and
              returning to the dynamic one is possible. To toggle the
              modes, just press 'f' to 'freeze/unfreeze' the sampling. (Arnaldo Carvalho de Melo)
      
            - Make Ctrl-C stop processing on TUI, allowing interrupting the load of big
              perf.data files (Namhyung Kim)
      
        'perf probe': (Masami Hiramatsu)
      
            - Support glob wildcards for function name
            - Support $params special probe argument: Collect all function arguments
            - Make --line checks validate C-style function name.
            - Add --no-inlines option to avoid searching inline functions
            - Greatly speed up 'perf probe --list' by caching debuginfo.
            - Improve --filter support for 'perf probe', allowing using its arguments
              on other commands, as --add, --del, etc.
      
        'perf sched':
      
            - Add option in 'perf sched' to merge like comms to lat output (Josef Bacik)
      
        Plus tons of infrastructure work - in particular preparation for
        upcoming threaded perf report support, but also lots of other work -
        and fixes and other improvements.  See (much) more details in the
        shortlog and in the git log"
      
      * 'perf-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (305 commits)
        perf tools: Configurable per thread proc map processing time out
        perf tools: Add time out to force stop proc map processing
        perf report: Fix sort__sym_cmp to also compare end of symbol
        perf hists browser: React to unassigned hotkey pressing
        perf top: Tell the user how to unfreeze events after pressing 'f'
        perf hists browser: Honour the help line provided by builtin-{top,report}.c
        perf hists browser: Do not exit when 'f' is pressed in 'report' mode
        perf top: Replace CTRL+z with 'f' as hotkey for enable/disable events
        perf annotate: Rename source_line_percent to source_line_samples
        perf annotate: Display total number of samples with --show-total-period
        perf tools: Ensure thread-stack is flushed
        perf top: Allow disabling/enabling events dynamicly
        perf evlist: Add toggle_enable() method
        perf trace: Fix race condition at the end of started workloads
        perf probe: Speed up perf probe --list by caching debuginfo
        perf probe: Show usage even if the last event is skipped
        perf tools: Move libtraceevent dynamic list to separated LDFLAGS variable
        perf tools: Fix a problem when opening old perf.data with different byte order
        perf tools: Ignore .config-detected in .gitignore
        perf probe: Fix to return error if no probe is added
        ...
      c58267e9
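The 'perf probe' item above adds glob wildcards for function names, so one pattern can select several functions. The sketch below illustrates that matching with POSIX `fnmatch()` as a stand-in; perf's own matcher may behave differently, and `count_glob_matches` and the symbol names in the usage note are made up for illustration.

```c
#include <fnmatch.h>
#include <stddef.h>

/* Hypothetical helper: count how many names match a glob pattern. */
size_t count_glob_matches(const char *pattern,
			  const char *const *names, size_t n)
{
	size_t hits = 0;

	for (size_t i = 0; i < n; i++) {
		/* fnmatch() returns 0 when the name matches the pattern. */
		if (fnmatch(pattern, names[i], 0) == 0)
			hits++;
	}
	return hits;
}
```

Given the names `vfs_read`, `vfs_write` and `do_sys_open`, the pattern `vfs_*` selects the first two.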
    •
      Merge branch 'locking-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 1bf7067c
      Linus Torvalds committed
      Pull locking updates from Ingo Molnar:
       "The main changes are:
      
         - 'qspinlock' support, enabled on x86: queued spinlocks - these are
           now the spinlock variant used by x86 as they outperform ticket
           spinlocks in every category.  (Waiman Long)
      
         - 'pvqspinlock' support on x86: paravirtualized variant of queued
           spinlocks.  (Waiman Long, Peter Zijlstra)
      
         - 'qrwlock' support, enabled on x86: queued rwlocks.  Similar to
           queued spinlocks, they are now the variant used by x86:
      
             CONFIG_ARCH_USE_QUEUED_SPINLOCKS=y
             CONFIG_QUEUED_SPINLOCKS=y
             CONFIG_ARCH_USE_QUEUED_RWLOCKS=y
             CONFIG_QUEUED_RWLOCKS=y
      
         - various lockdep fixlets
      
         - various locking primitives cleanups, further WRITE_ONCE()
           propagation"
      
      * 'locking-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (24 commits)
        locking/lockdep: Remove hard coded array size dependency
        locking/qrwlock: Don't contend with readers when setting _QW_WAITING
        lockdep: Do not break user-visible string
        locking/arch: Rename set_mb() to smp_store_mb()
        locking/arch: Add WRITE_ONCE() to set_mb()
        rtmutex: Warn if trylock is called from hard/softirq context
        arch: Remove __ARCH_HAVE_CMPXCHG
        locking/rtmutex: Drop usage of __HAVE_ARCH_CMPXCHG
        locking/qrwlock: Rename QUEUE_RWLOCK to QUEUED_RWLOCKS
        locking/pvqspinlock: Rename QUEUED_SPINLOCK to QUEUED_SPINLOCKS
        locking/pvqspinlock: Replace xchg() by the more descriptive set_mb()
        locking/pvqspinlock, x86: Enable PV qspinlock for Xen
        locking/pvqspinlock, x86: Enable PV qspinlock for KVM
        locking/pvqspinlock, x86: Implement the paravirt qspinlock call patching
        locking/pvqspinlock: Implement simple paravirt support for the qspinlock
        locking/qspinlock: Revert to test-and-set on hypervisors
        locking/qspinlock: Use a simple write to grab the lock
        locking/qspinlock: Optimize for smaller NR_CPUS
        locking/qspinlock: Extract out code snippets for the next patch
        locking/qspinlock: Add pending bit
        ...
      1bf7067c
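The shortlog above includes "locking/qspinlock: Revert to test-and-set on hypervisors". For orientation, here is a minimal sketch of a plain test-and-set spinlock in C11. It is not the queued spinlock algorithm and not kernel code; all names are hypothetical.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical test-and-set lock: a single flag, no waiter queue. */
struct tas_lock {
	atomic_flag held;
};

#define TAS_LOCK_INIT { ATOMIC_FLAG_INIT }

/* Try once: returns true if the caller now owns the lock. */
static inline bool tas_trylock(struct tas_lock *l)
{
	return !atomic_flag_test_and_set_explicit(&l->held,
						  memory_order_acquire);
}

/* Spin until the flag is observed clear and set by this caller. */
static inline void tas_lock_acquire(struct tas_lock *l)
{
	while (!tas_trylock(l))
		; /* busy-wait; waiters are not served in arrival order */
}

static inline void tas_unlock(struct tas_lock *l)
{
	atomic_flag_clear_explicit(&l->held, memory_order_release);
}
```

A lock like this has no notion of queue position, which is the main structural difference from the queued variants described in the pull message.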