1. 05 9月, 2017 8 次提交
    • L
      Merge branch 'x86-microcode-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 0098410d
      Linus Torvalds 提交于
      Pull x86 microcode loading updates from Ingo Molnar:
       "Update documentation, improve robustness and fix a memory leak"
      
      * 'x86-microcode-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/microcode/intel: Improve microcode patches saving flow
        x86/microcode: Document the three loading methods
        x86/microcode/AMD: Free unneeded patch before exit from update_cache()
      0098410d
    • L
      Merge branch 'x86-debug-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · f29139bf
      Linus Torvalds 提交于
      Pull x86 debug updates from Ingo Molnar:
       "Various fixes to the NUMA emulation code"
      
      * 'x86-debug-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/numa_emulation: Recalculate numa_nodes_parsed from emulated nodes
        x86/numa_emulation: Assign physnode_mask directly from numa_nodes_parsed
        x86/numa_emulation: Refine the calculation of max_emu_nid and dfl_phys_nid
      f29139bf
    • L
      Merge branch 'x86-cpu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · a1400cdb
      Linus Torvalds 提交于
      Pull x86 cpuid updates from Ingo Molnar:
       "AMD F17h related updates"
      
      * 'x86-cpu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/cpu/amd: Hide unused legacy_fixup_core_id() function
        x86/cpu/amd: Derive L3 shared_cpu_map from cpu_llc_shared_mask
        x86/cpu/amd: Limit cpu_core_id fixup to families older than F17h
      a1400cdb
    • L
      Merge branch 'x86-build-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · d5e54c4e
      Linus Torvalds 提交于
      Pull x86 build updates from Ingo Molnar:
       "More Clang support related fixes"
      
      * 'x86-build-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/build: Use cc-option to validate stack alignment parameter
        x86/build: Fix stack alignment for CLang
        x86/build: Drop unused mflags-y
      d5e54c4e
    • L
      Merge branch 'x86-boot-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 45153920
      Linus Torvalds 提交于
      Pull x86 boot updates from Ingo Molnar:
       "The main changes are KASL related fixes and cleanups: in particular we
        now exclude certain physical memory ranges as KASLR randomization
        targets that have proven to be unreliable (early-)RAM on some firmware
        versions"
      
      * 'x86-boot-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/boot/KASLR: Work around firmware bugs by excluding EFI_BOOT_SERVICES_* and EFI_LOADER_* from KASLR's choice
        x86/boot/KASLR: Prefer mirrored memory regions for the kernel physical address
        efi: Introduce efi_early_memdesc_ptr to get pointer to memmap descriptor
        x86/boot/KASLR: Rename process_e820_entry() into process_mem_region()
        x86/boot/KASLR: Switch to pass struct mem_vector to process_e820_entry()
        x86/boot/KASLR: Wrap e820 entries walking code into new function process_e820_entries()
      45153920
    • L
      Merge branch 'x86-asm-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · b0c79f49
      Linus Torvalds 提交于
      Pull x86 asm updates from Ingo Molnar:
      
       - Introduce the ORC unwinder, which can be enabled via
         CONFIG_ORC_UNWINDER=y.
      
         The ORC unwinder is a lightweight, Linux kernel specific debuginfo
         implementation, which aims to be DWARF done right for unwinding.
         Objtool is used to generate the ORC unwinder tables during build, so
         the data format is flexible and kernel internal: there's no
         dependency on debuginfo created by an external toolchain.
      
         The ORC unwinder is almost two orders of magnitude faster than the
         (out of tree) DWARF unwinder - which is important for perf call graph
         profiling. It is also significantly simpler and is coded defensively:
         there has not been a single ORC related kernel crash so far, even
         with early versions. (knock on wood!)
      
         But the main advantage is that enabling the ORC unwinder allows
         CONFIG_FRAME_POINTERS to be turned off - which speeds up the kernel
         measurably:
      
         With frame pointers disabled, GCC does not have to add frame pointer
         instrumentation code to every function in the kernel. The kernel's
         .text size decreases by about 3.2%, resulting in better cache
         utilization and fewer instructions executed, resulting in a broad
         kernel-wide speedup. Average speedup of system calls should be
         roughly in the 1-3% range - measurements by Mel Gorman [1] have shown
         a speedup of 5-10% for some function execution intense workloads.
      
         The main cost of the unwinder is that the unwinder data has to be
         stored in RAM: the memory cost is 2-4MB of RAM, depending on kernel
         config - which is a modest cost on modern x86 systems.
      
         Given how young the ORC unwinder code is it's not enabled by default
         - but given the performance advantages the plan is to eventually make
         it the default unwinder on x86.
      
         See Documentation/x86/orc-unwinder.txt for more details.
      
       - Remove lguest support: its intended role was that of a temporary
         proof of concept for virtualization, plus its removal will enable the
         reduction (removal) of the paravirt API as well, so Rusty agreed to
         its removal. (Juergen Gross)
      
       - Clean up and fix FSGS related functionality (Andy Lutomirski)
      
       - Clean up IO access APIs (Andy Shevchenko)
      
       - Enhance the symbol namespace (Jiri Slaby)
      
      * 'x86-asm-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (47 commits)
        objtool: Handle GCC stack pointer adjustment bug
        x86/entry/64: Use ENTRY() instead of ALIGN+GLOBAL for stub32_clone()
        x86/fpu/math-emu: Add ENDPROC to functions
        x86/boot/64: Extract efi_pe_entry() from startup_64()
        x86/boot/32: Extract efi_pe_entry() from startup_32()
        x86/lguest: Remove lguest support
        x86/paravirt/xen: Remove xen_patch()
        objtool: Fix objtool fallthrough detection with function padding
        x86/xen/64: Fix the reported SS and CS in SYSCALL
        objtool: Track DRAP separately from callee-saved registers
        objtool: Fix validate_branch() return codes
        x86: Clarify/fix no-op barriers for text_poke_bp()
        x86/switch_to/64: Rewrite FS/GS switching yet again to fix AMD CPUs
        selftests/x86/fsgsbase: Test selectors 1, 2, and 3
        x86/fsgsbase/64: Report FSBASE and GSBASE correctly in core dumps
        x86/fsgsbase/64: Fully initialize FS and GS state in start_thread_common
        x86/asm: Fix UNWIND_HINT_REGS macro for older binutils
        x86/asm/32: Fix regs_get_register() on segment registers
        x86/xen/64: Rearrange the SYSCALL entries
        x86/asm/32: Remove a bunch of '& 0xffff' from pt_regs segment reads
        ...
      b0c79f49
    • L
      Merge branch 'sched-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · f213a6c8
      Linus Torvalds 提交于
      Pull scheduler updates from Ingo Molnar:
       "The main changes in this cycle were:
      
         - fix affine wakeups (Peter Zijlstra)
      
         - improve CPU onlining (and general bootup) scalability on systems
           with ridiculous number (thousands) of CPUs (Peter Zijlstra)
      
         - sched/numa updates (Rik van Riel)
      
         - sched/deadline updates (Byungchul Park)
      
         - sched/cpufreq enhancements and related cleanups (Viresh Kumar)
      
         - sched/debug enhancements (Xie XiuQi)
      
         - various fixes"
      
      * 'sched-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (27 commits)
        sched/debug: Optimize sched_domain sysctl generation
        sched/topology: Avoid pointless rebuild
        sched/topology, cpuset: Avoid spurious/wrong domain rebuilds
        sched/topology: Improve comments
        sched/topology: Fix memory leak in __sdt_alloc()
        sched/completion: Document that reinit_completion() must be called after complete_all()
        sched/autogroup: Fix error reporting printk text in autogroup_create()
        sched/fair: Fix wake_affine() for !NUMA_BALANCING
        sched/debug: Intruduce task_state_to_char() helper function
        sched/debug: Show task state in /proc/sched_debug
        sched/debug: Use task_pid_nr_ns in /proc/$pid/sched
        sched/core: Remove unnecessary initialization init_idle_bootup_task()
        sched/deadline: Change return value of cpudl_find()
        sched/deadline: Make find_later_rq() choose a closer CPU in topology
        sched/numa: Scale scan period with tasks in group and shared/private
        sched/numa: Slow down scan rate if shared faults dominate
        sched/pelt: Fix false running accounting
        sched: Mark pick_next_task_dl() and build_sched_domain() as static
        sched/cpupri: Don't re-initialize 'struct cpupri'
        sched/deadline: Don't re-initialize 'struct cpudl'
        ...
      f213a6c8
    • L
      Merge branch 'ras-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 621bee34
      Linus Torvalds 提交于
      Pull RAS fix from Ingo Molnar:
       "A single change fixing SMCA bank initialization on systems that don't
        have CPU0 enabled"
      
      * 'ras-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/mce/AMD: Allow any CPU to initialize the smca_banks array
      621bee34
  2. 04 9月, 2017 13 次提交
    • L
      Merge branch 'perf-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 9657752c
      Linus Torvalds 提交于
      Pull perf updates from Ingo Molnar:
       "Kernel side changes:
      
         - Add branch type profiling/tracing support. (Jin Yao)
      
         - Add the PERF_SAMPLE_PHYS_ADDR ABI to allow the tracing/profiling of
           physical memory addresses, where the PMU supports it. (Kan Liang)
      
         - Export some PMU capability details in the new
           /sys/bus/event_source/devices/cpu/caps/ sysfs directory. (Andi
           Kleen)
      
         - Aux data fixes and updates (Will Deacon)
      
         - kprobes fixes and updates (Masami Hiramatsu)
      
         - AMD uncore PMU driver fixes and updates (Janakarajan Natarajan)
      
        On the tooling side, here's a (limited!) list of highlights - there
        were many other changes that I could not list, see the shortlog and
        git history for details:
      
        UI improvements:
      
         - Implement a visual marker for fused x86 instructions in the
           annotate TUI browser, available now in 'perf report', more work
           needed to have it available as well in 'perf top' (Jin Yao)
      
           Further explanation from one of Jin's patches:
      
                   │   ┌──cmpl   $0x0,argp_program_version_hook
             81.93 │   ├──je     20
                   │   │  lock   cmpxchg %esi,0x38a9a4(%rip)
                   │   │↓ jne    29
                   │   │↓ jmp    43
             11.47 │20:└─→cmpxch %esi,0x38a999(%rip)
      
           That means the cmpl+je is a fused instruction pair and they should
           be considered together.
      
         - Record the branch type and then show statistics and info about in
           callchain entries (Jin Yao)
      
           Example from one of Jin's patches:
      
              # perf record -g -j any,save_type
              # perf report --branch-history --stdio --no-children
      
              38.50%  div.c:45                [.] main                    div
                      |
                      ---main div.c:42 (RET CROSS_2M cycles:2)
                         compute_flag div.c:28 (cycles:2)
                         compute_flag div.c:27 (RET CROSS_2M cycles:1)
                         rand rand.c:28 (cycles:1)
                         rand rand.c:28 (RET CROSS_2M cycles:1)
                         __random random.c:298 (cycles:1)
                         __random random.c:297 (COND_BWD CROSS_2M cycles:1)
                         __random random.c:295 (cycles:1)
                         __random random.c:295 (COND_BWD CROSS_2M cycles:1)
                         __random random.c:295 (cycles:1)
                         __random random.c:295 (RET CROSS_2M cycles:9)
      
        namespaces support:
      
         - Add initial support for namespaces, using setns to access files in
           namespaces, grabbing their build-ids, etc. (Krister Johansen)
      
        perf trace enhancements:
      
         - Beautify pkey_{alloc,free,mprotect} arguments in 'perf trace'
           (Arnaldo Carvalho de Melo)
      
         - Add initial 'clone' syscall args beautifier in 'perf trace'
           (Arnaldo Carvalho de Melo)
      
         - Ignore 'fd' and 'offset' args for MAP_ANONYMOUS in 'perf trace'
           (Arnaldo Carvalho de Melo)
      
         - Beautifiers for the 'cmd' arg of several ioctl types, including:
           sound, DRM, KVM, vhost virtio and perf_events. (Arnaldo Carvalho de
           Melo)
      
         - Add PERF_SAMPLE_CALLCHAIN and PERF_RECORD_MMAP[2] to 'perf data'
           CTF conversion, allowing CTF trace visualization tools to show
           callchains and to resolve symbols (Geneviève Bastien)
      
         - Beautify the fcntl syscall, which is an interesting one in the
           sense that infrastructure had to be put in place to change the
           formatters of some arguments according to the value in a previous
           one, i.e. cmd dictates how arg and the syscall return will be
           formatted. (Arnaldo Carvalho de Melo
      
        perf stat enhancements:
      
         - Use group read for event groups in 'perf stat', reducing overhead
           when groups are defined in the event specification, i.e. when using
           {} to enclose a list of events, asking them to be read at the same
           time, e.g.: "perf stat -e '{cycles,instructions}'" (Jiri Olsa)
      
        pipe mode improvements:
      
         - Process tracing data in 'perf annotate' pipe mode (David
           Carrillo-Cisneros)
      
         - Add header record types to pipe-mode, now this command:
      
              $ perf record -o - -e cycles sleep 1 | perf report --stdio --header
      
           Will show the same as in non-pipe mode, i.e. involving a perf.data
           file (David Carrillo-Cisneros)
      
        Vendor specific hardware event support updates/enhancements:
      
         - Update POWER9 vendor events tables (Sukadev Bhattiprolu)
      
         - Add POWER9 PMU events Sukadev (Bhattiprolu)
      
         - Support additional POWER8+ PVR in PMU mapfile (Shriya)
      
         - Add Skylake server uncore JSON vendor events (Andi Kleen)
      
         - Support exporting Intel PT data to sqlite3 with python perf
           scripts, this is in addition to the postgresql support that was
           already there (Adrian Hunter)"
      
      * 'perf-core-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (253 commits)
        perf symbols: Fix plt entry calculation for ARM and AARCH64
        perf probe: Fix kprobe blacklist checking condition
        perf/x86: Fix caps/ for !Intel
        perf/core, x86: Add PERF_SAMPLE_PHYS_ADDR
        perf/core, pt, bts: Get rid of itrace_started
        perf trace beauty: Beautify pkey_{alloc,free,mprotect} arguments
        tools headers: Sync cpu features kernel ABI headers with tooling headers
        perf tools: Pass full path of FEATURES_DUMP
        perf tools: Robustify detection of clang binary
        tools lib: Allow external definition of CC, AR and LD
        perf tools: Allow external definition of flex and bison binary names
        tools build tests: Don't hardcode gcc name
        perf report: Group stat values on global event id
        perf values: Zero value buffers
        perf values: Fix allocation check
        perf values: Fix thread index bug
        perf report: Add dump_read function
        perf record: Set read_format for inherit_stat
        perf c2c: Fix remote HITM detection for Skylake
        perf tools: Fix static build with newer toolchains
        ...
      9657752c
    • L
      Merge branch 'core-rcu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 0081a0ce
      Linus Torvalds 提交于
      Pull RCU updates from Ingo Molnad:
       "The main RCU related changes in this cycle were:
      
         - Removal of spin_unlock_wait()
         - SRCU updates
         - RCU torture-test updates
         - RCU Documentation updates
         - Extend the sys_membarrier() ABI with the MEMBARRIER_CMD_PRIVATE_EXPEDITED variant
         - Miscellaneous RCU fixes
         - CPU-hotplug fixes"
      
      * 'core-rcu-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (63 commits)
        arch: Remove spin_unlock_wait() arch-specific definitions
        locking: Remove spin_unlock_wait() generic definitions
        drivers/ata: Replace spin_unlock_wait() with lock/unlock pair
        ipc: Replace spin_unlock_wait() with lock/unlock pair
        exit: Replace spin_unlock_wait() with lock/unlock pair
        completion: Replace spin_unlock_wait() with lock/unlock pair
        doc: Set down RCU's scheduling-clock-interrupt needs
        doc: No longer allowed to use rcu_dereference on non-pointers
        doc: Add RCU files to docbook-generation files
        doc: Update memory-barriers.txt for read-to-write dependencies
        doc: Update RCU documentation
        membarrier: Provide expedited private command
        rcu: Remove exports from rcu_idle_exit() and rcu_idle_enter()
        rcu: Add warning to rcu_idle_enter() for irqs enabled
        rcu: Make rcu_idle_enter() rely on callers disabling irqs
        rcu: Add assertions verifying blocked-tasks list
        rcu/tracing: Set disable_rcu_irq_enter on rcu_eqs_exit()
        rcu: Add TPS() protection for _rcu_barrier_trace strings
        rcu: Use idle versions of swait to make idle-hack clear
        swait: Add idle variants which don't contribute to load average
        ...
      0081a0ce
    • L
      Merge branch 'core-debugobjects-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · fea15437
      Linus Torvalds 提交于
      Pull debugobjects fix from Ingo Molnar:
       "A single commit making debugobjects interact better with kmemleak"
      
      * 'core-debugobjects-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        debugobjects: Make kmemleak ignore debug objects
      fea15437
    • L
      Merge branch 'docs-next' of git://git.lwn.net/linux · 81a84ad3
      Linus Torvalds 提交于
      Pull documentation updates from Jonathan Corbet:
       "After a fair amount of churn in the last couple of cycles, docs are
        taking it easier this time around. Lots of fixes and some new
        documentation, but nothing all that radical. Perhaps the most
        interesting change for many is the scripts/sphinx-pre-install tool
        from Mauro; it will tell you exactly which packages you need to
        install to get a working docs toolchain on your system.
      
        There are two little patches reaching outside of Documentation/; both
        just tweak kerneldoc comments to eliminate warnings and fix some
        dangling doc pointers"
      
      * 'docs-next' of git://git.lwn.net/linux: (52 commits)
        Documentation/sphinx: fix kernel-doc decode for non-utf-8 locale
        genalloc: Fix an incorrect kerneldoc comment
        doc: Add documentation for the genalloc subsystem
        assoc_array: fix path to assoc_array documentation
        kernel-doc parser mishandles declarations split into lines
        docs: ReSTify table of contents in core.rst
        docs: process: drop git snapshots from applying-patches.rst
        Documentation:input: fix typo
        swap: Remove obsolete sentence
        sphinx.rst: Allow Sphinx version 1.6 at the docs
        docs-rst: fix verbatim font size on tables
        Documentation: stable-kernel-rules: fix broken git urls
        rtmutex: update rt-mutex
        rtmutex: update rt-mutex-design
        docs: fix minimal sphinx version in conf.py
        docs: fix nested numbering in the TOC
        NVMEM documentation fix: A minor typo
        docs-rst: pdf: use same vertical margin on all Sphinx versions
        doc: Makefile: if sphinx is not found, run a check script
        docs: Fix paths in security/keys
        ...
      81a84ad3
    • L
      Merge tag 'hwmon-for-linus-v4.14' of... · fe91f281
      Linus Torvalds 提交于
      Merge tag 'hwmon-for-linus-v4.14' of git://git.kernel.org/pub/scm/linux/kernel/git/groeck/linux-staging
      
      Pull hwmon updates from Guenter Roeck:
      
       - new drivers:
         - Lantiq CPU temperature sensor
         - IBM CFF power supply
         - TPS53679 PMBus driver
      
       - new support:
         - LM5066I (lm25066 PMBus driver)
         - Intel VID protocol VR13 (PMBus drivers)
         - CAT34TS02C, GT30TS00, GT34TS02, and CAT34TS04 (jc42 driver)
      
       - cleanup and minor improvements in several drivers
      
      * tag 'hwmon-for-linus-v4.14' of git://git.kernel.org/pub/scm/linux/kernel/git/groeck/linux-staging: (36 commits)
        hwmon: (ltq-cputemp) add cpu temp sensor driver
        hwmon: (ltq-cputemp) add devicetree bindings documentation
        hwmon: (pmbus) Add support for Texas Instruments tps53679 device
        hwmon: (asc7621) make several arrays static const
        hwmon: (pmbus/lm25066) Add support for TI LM5066I
        hwmon: (pmbus/lm25066) Offset coefficient depends on CL
        hwmon: (pmbus) Add support for Intel VID protocol VR13
        Documentation: hwmon: Document the IBM CFF power supply
        hwmon: (pmbus) Add IBM Common Form Factor (CFF) power supply driver
        dt-bindings: hwmon: Document the IBM CCF power supply version 1
        hwmon: (ftsteutates) constify i2c_device_id
        hwmon: da9052: Add support for TSI channel
        mfd: da9052: Make touchscreen registration optional
        hwmon: da9052: Replace S_IRUGO with 0444
        mfd: da9052: Add register details for TSI
        hwmon: (aspeed-pwm) add THERMAL dependency
        hwmon: (pmbus) Add debugfs for status registers
        hwmon: (aspeed-pwm-tacho) cooling device support.
        Documentation: dt-bindings: aspeed-pwm-tacho cooling device.
        hwmon: (pmbus): Add generic alarm bit for iin and pin
        ...
      fe91f281
    • L
      Merge tag 'for-linus-ioctl' of git://git.kernel.org/pub/scm/linux/kernel/git/dledford/rdma · aa9d4648
      Linus Torvalds 提交于
      Pull rdma updates from Doug Ledford:
       "This is a big pull request.
      
        Of note is that I'm sending you the new ioctl API for the rdma
        subsystem. We put it up on linux-api@, but didn't get much response.
        The API is complex, but it solves two different problems in one go:
      
         1) The bi-directional nature of the RDMA file write calls, which
            created the security hole we had to handle (and for which the fix
            is now causing problems for systems in production, we were a bit
            over zealous in the fix and the ability to open a device, then
            fork, then create new queue pairs on the device and use them is
            broken).
      
         2) The bloat caused by different vendors implementing extensions to
            the base verbs API. Each vendor's hardware is slightly different,
            and the hardware might be suitable for one extension but not
            another.
      
            By the time we add generic extensions for all the different ways
            that the different hardware can offload things, the API becomes
            bloated. Things like our completion structs have started to exceed
            a cache line in size because of all the elements needed to support
            this. That in turn shows up heavily in the performance graphs with
            a noticable drop in performance on 100Gigabit links as our
            completion structs go from occupying one cache line to 1+.
      
            This API makes things like the completion structs modular in a
            very similar way to netlink so that your structs can only include
            the items needed for the offloads/features you are actually using
            on a given queue pair. In that way we support everything, but only
            use what we need, and our structs stay smaller.
      
        The ioctl API is better explained by the posting on linux-api@ than I
        can explain it here, so I'll just leave it at that.
      
        The rest of the pull request is typical stuff.
      
        Updates for 4.14 kernel merge window
      
         - Lots of hfi1 driver updates (mixed with a few qib and core updates
           as well)
      
         - rxe updates
      
         - various mlx updates
      
         - Set default roce type to RoCEv2
      
         - Several larger fixes for bnxt_re that were too big for -rc
      
         - Several larger fixes for qedr that, likewise, were too big for -rc
      
         - Misc core changes
      
         - Make the hns_roce driver compilable on arches other than aarch64 so
           we can more easily debug build issues related to it
      
         - Add rdma-netlink infrastructure updates
      
         - Add automatic IRQ affinity infrastructure
      
         - Add 32bit lid support
      
         - Lots of misc fixes across the subsystem from random people
      
         - Autoloading of RDMA netlink modules
      
         - PCI pool cleanups from Romain Perier
      
         - mlx5 driver feature additions and fixes
      
         - Hardware tag matchine feature
      
         - Fix sleeping in atomic when resolving roce ah
      
         - Add experimental ioctl interface as posted to linux-api@"
      
      * tag 'for-linus-ioctl' of git://git.kernel.org/pub/scm/linux/kernel/git/dledford/rdma: (328 commits)
        IB/core: Expose ioctl interface through experimental Kconfig
        IB/core: Assign root to all drivers
        IB/core: Add completion queue (cq) object actions
        IB/core: Add legacy driver's user-data
        IB/core: Export ioctl enum types to user-space
        IB/core: Explicitly destroy an object while keeping uobject
        IB/core: Add macros for declaring methods and attributes
        IB/core: Add uverbs merge trees functionality
        IB/core: Add DEVICE object and root tree structure
        IB/core: Declare an object instead of declaring only type attributes
        IB/core: Add new ioctl interface
        RDMA/vmw_pvrdma: Fix a signedness
        RDMA/vmw_pvrdma: Report network header type in WC
        IB/core: Add might_sleep() annotation to ib_init_ah_from_wc()
        IB/cm: Fix sleeping in atomic when RoCE is used
        IB/core: Add support to finalize objects in one transaction
        IB/core: Add a generic way to execute an operation on a uobject
        Documentation: Hardware tag matching
        IB/mlx5: Support IB_SRQT_TM
        net/mlx5: Add XRQ support
        ...
      aa9d4648
    • L
      Merge tag 'drm-for-v4.14' of git://people.freedesktop.org/~airlied/linux · 906dde0f
      Linus Torvalds 提交于
      Pull drm updates from Dave Airlie:
       "This is the main drm pull request for 4.14 merge window.
      
        I'm sending this early, as my continuing journey into fatherhood is
        occurring really soon now, I'm going to be mostly useless for the next
        couple of weeks, though I may be able to read email, I doubt I'll be
        doing much patch applications or git sending. If anything urgent pops
        up I've asked Daniel/Jani/Alex/Sean to try and direct stuff towards
        you.
      
        Outside drm changes:
      
        Some rcar-du updates that touch the V4L tree, all acks should be in
        place. It adds one export to the radix tree code for new i915 use
        case. There are some minor AGP cleanups (don't see that too often).
        Changes to the vbox driver in staging to avoid breaking compilation.
      
        Summary:
      
        core:
         - Atomic helper fixes
         - Atomic UAPI fixes
         - Add YCBCR 4:2:0 support
         - Drop set_busid hook
         - Refactor fb_helper locking
         - Remove a bunch of internal APIs
         - Add a bunch of better default handlers
         - Format modifier/blob plane property added
         - More internal header refactoring
         - Make more internal API names consistent
         - Enhanced syncobj APIs (wait/signal/reset/create signalled)
      
        bridge:
         - Add Synopsys Designware MIPI DSI host bridge driver
      
        tiny:
         - Add Pervasive Displays RePaper displays
         - Add support for LEGO MINDSTORMS EV3 LCD
      
        i915:
         - Lots of GEN10/CNL  support patches
         - drm syncobj support
         - Skylake+ watermark refactoring
         - GVT vGPU 48-bit ppgtt support
         - GVT performance improvements
         - NOA change ioctl
         - CCS (color compression) scanout support
         - GPU reset improvements
      
        amdgpu:
         - Initial hugepage support
         - BO migration logic rework
         - Vega10 improvements
         - Powerplay fixes
         - Stop reprogramming the MC
         - Fixes for ACP audio on stoney
         - SR-IOV fixes/improvements
         - Command submission overhead improvements
      
        amdkfd:
         - Non-dGPU upstreaming patches
         - Scratch VA ioctl
         - Image tiling modes
         - Update PM4 headers for new firmware
         - Drop all BUG_ONs.
      
        nouveau:
         - GP108 modesetting support.
         - Disable MSI on big endian.
      
        vmwgfx:
         - Add fence fd support.
      
        msm:
         - Runtime PM improvements
      
        exynos:
         - NV12MT support
         - Refactor KMS drivers
      
        imx-drm:
         - Lock scanout channel to improve memory bw
         - Cleanups
      
        etnaviv:
         - GEM object population fixes
      
        tegra:
         - Prep work for Tegra186 support
         - PRIME mmap support
      
        sunxi:
         - HDMI support improvements
         - HDMI CEC support
      
        omapdrm:
         - HDMI hotplug IRQ support
         - Big driver cleanup
         - OMAP5 DSI support
      
        rcar-du:
         - vblank fixes
         - VSP1 updates
      
        arcgpu:
         - Minor fixes
      
        stm:
         - Add STM32 DSI controller driver
      
        dw_hdmi:
         - Add support for Rockchip RK3399
         - HDMI CEC support
      
        atmel-hlcdc:
         - Add 8-bit color support
      
        vc4:
         - Atomic fixes
         - New ioctl to attach a label to a buffer object
         - HDMI CEC support
         - Allow userspace to dictate rendering order on submit ioctl"
      
      * tag 'drm-for-v4.14' of git://people.freedesktop.org/~airlied/linux: (1074 commits)
        drm/syncobj: Add a signal ioctl (v3)
        drm/syncobj: Add a reset ioctl (v3)
        drm/syncobj: Add a syncobj_array_find helper
        drm/syncobj: Allow wait for submit and signal behavior (v5)
        drm/syncobj: Add a CREATE_SIGNALED flag
        drm/syncobj: Add a callback mechanism for replace_fence (v3)
        drm/syncobj: add sync obj wait interface. (v8)
        i915: Use drm_syncobj_fence_get
        drm/syncobj: Add a race-free drm_syncobj_fence_get helper (v2)
        drm/syncobj: Rename fence_get to find_fence
        drm: kirin: Add mode_valid logic to avoid mode clocks we can't generate
        drm/vmwgfx: Bump the version for fence FD support
        drm/vmwgfx: Add export fence to file descriptor support
        drm/vmwgfx: Add support for imported Fence File Descriptor
        drm/vmwgfx: Prepare to support fence fd
        drm/vmwgfx: Fix incorrect command header offset at restart
        drm/vmwgfx: Support the NOP_ERROR command
        drm/vmwgfx: Restart command buffers after errors
        drm/vmwgfx: Move irq bottom half processing to threads
        drm/vmwgfx: Don't use drm_irq_[un]install
        ...
      906dde0f
    • L
      Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs · 69c0067a
      Linus Torvalds 提交于
      Pull misc fixes from Al Viro:
       "Loose ends and regressions from the last merge window.
      
        Strictly speaking, only binfmt_flat thing is a build regression per
        se - the rest is 'only sparse cares about that' stuff"
      
      [ This came in before the 4.13 release and could have gone there, but it
        was late in the release and nothing seemed critical enough to care, so
        I'm pulling it in the 4.14 merge window instead  - Linus ]
      
      * 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs:
        binfmt_flat: fix arch/m32r and arch/microblaze flat_put_addr_at_rp()
        compat_hdio_ioctl: Fix a declaration
        <linux/uaccess.h>: Fix copy_in_user() declaration
        annotate RWF_... flags
        teach SYSCALL_DEFINE/COMPAT_SYSCALL_DEFINE to handle __bitwise arguments
      69c0067a
    • L
      Linux 4.13 · 569dbb88
      Linus Torvalds 提交于
      569dbb88
    • L
      Merge branch 'upstream' of git://git.linux-mips.org/pub/scm/ralf/upstream-linus · 5e3b19d8
      Linus Torvalds 提交于
      Pull MIPS fixes from Ralf Baechle:
       "The two indirect syscall fixes have sat in linux-next for a few days.
        I did check back with a hardware designer to ensure a SYNC is really
        what's required for the GIC fix and so the GIC fix didn't make it into
        to linux-next in time for this final pull request.
      
        It builds in local build tests and passes Imagination's test system"
      
      * 'upstream' of git://git.linux-mips.org/pub/scm/ralf/upstream-linus:
        irqchip: mips-gic: SYNC after enabling GIC region
        MIPS: Remove pt_regs adjustments in indirect syscall handler
        MIPS: seccomp: Fix indirect syscall args
      5e3b19d8
    • L
      Merge branch 'x86-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · d0fa6ea1
      Linus Torvalds 提交于
      Pull x86 fixes from Thomas Gleixner:
      
       - Expand the space for uncompressing as the LZ4 worst case does not fit
         into the currently reserved space
      
       - Validate boot parameters more strictly to prevent out of bound access
         in the decompressor/boot code
      
       - Fix off by one errors in get_segment_base()
      
      * 'x86-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        x86/boot: Prevent faulty bootparams.screeninfo from causing harm
        x86/boot: Provide more slack space during decompression
        x86/ldt: Fix off by one in get_segment_base()
      d0fa6ea1
    • L
      Merge branch 'timers-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · 3b62dc6c
      Linus Torvalds 提交于
      Pull timer fix from Thomas Gleixner:
       "A single fix for a thinko in the raw timekeeper update which causes
        clock MONOTONIC_RAW to run with erratically increased frequency"
      
      * 'timers-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        time: Fix ktime_get_raw() incorrect base accumulation
      3b62dc6c
    • L
      Merge branch 'perf-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip · e92d51af
      Linus Torvalds 提交于
      Pull perf fixes from Thomas Gleixner:
      
       - Prevent a potential inconistency in the perf user space access which
         might lead to evading sanity checks.
      
       - Prevent perf recording function trace entries twice
      
      * 'perf-urgent-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip:
        perf/ftrace: Fix double traces of perf on ftrace:function
        perf/core: Fix potential double-fetch bug
      e92d51af
  3. 02 9月, 2017 15 次提交
    • L
      Merge branch 'for-next' of git://git.samba.org/sfrench/cifs-2.6 · d0d6ab53
      Linus Torvalds 提交于
      Pull cifs version warning fix from Steve French:
       "As requested, additional kernel warning messages to clarify the
        default dialect changes"
      
      [ There is still some discussion about exactly which version should be
        the new default.  Longer-term we have auto-negotiation coming, but
        that's not there yet..  - Linus ]
      
      * 'for-next' of git://git.samba.org/sfrench/cifs-2.6:
        Fix warning messages when mounting to older servers
      d0d6ab53
    • L
      Merge tag 'armsoc-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/arm/arm-soc · 54f70f52
      Linus Torvalds 提交于
      Pull ARM SoC fixes from Olof Johansson:
       "A couple of late-arriving fixes before final 4.13:
      
         - A few reverts of DT bindings on Allwinner for their ethernet
           driver. Discussion didn't converge, and since bindings are
           considered ABI it makes sense to revert instead of having to
           support two bindings long-term.
      
         - A fix to enumerate GPIOs properly on Marvell Armada AP806"
      
      * tag 'armsoc-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/arm/arm-soc:
        arm64: dts: marvell: fix number of GPIOs in Armada AP806 description
        arm: dts: sunxi: Revert EMAC changes
        arm64: dts: allwinner: Revert EMAC changes
        dt-bindings: net: Revert sun8i dwmac binding
      54f70f52
    • O
      Merge tag 'mvebu-fixes-4.13-3' of git://git.infradead.org/linux-mvebu into fixes · 6f71a925
      Olof Johansson 提交于
      mvebu fixes for 4.13 (part 3)
      
      Fix number of GPIOs in AP806 description for Armada 7K/8K
      
      * tag 'mvebu-fixes-4.13-3' of git://git.infradead.org/linux-mvebu:
        arm64: dts: marvell: fix number of GPIOs in Armada AP806 description
      Signed-off-by: NOlof Johansson <olof@lixom.net>
      6f71a925
    • L
      Merge branch 'i2c/for-current' of git://git.kernel.org/pub/scm/linux/kernel/git/wsa/linux · f8c6d724
      Linus Torvalds 提交于
      Pull i2c fixes from Wolfram Sang:
       "The ismt driver had a problem with a rarely used transaction type and
        the designware driver was made even more robust against non standard
        ACPI tables"
      
      * 'i2c/for-current' of git://git.kernel.org/pub/scm/linux/kernel/git/wsa/linux:
        i2c: designware: Round down ACPI provided clk to nearest supported clk
        i2c: ismt: Return EMSGSIZE for block reads with bogus length
        i2c: ismt: Don't duplicate the receive length for block reads
      f8c6d724
    • O
      epoll: fix race between ep_poll_callback(POLLFREE) and ep_free()/ep_remove() · 138e4ad6
      Oleg Nesterov 提交于
      The race was introduced by me in commit 971316f0 ("epoll:
      ep_unregister_pollwait() can use the freed pwq->whead").  I did not
      realize that nothing can protect eventpoll after ep_poll_callback() sets
      ->whead = NULL, only whead->lock can save us from the race with
      ep_free() or ep_remove().
      
      Move ->whead = NULL to the end of ep_poll_callback() and add the
      necessary barriers.
      
      TODO: cleanup the ewake/EPOLLEXCLUSIVE logic, it was confusing even
      before this patch.
      
      Hopefully this explains use-after-free reported by syzcaller:
      
      	BUG: KASAN: use-after-free in debug_spin_lock_before
      	...
      	 _raw_spin_lock_irqsave+0x4a/0x60 kernel/locking/spinlock.c:159
      	 ep_poll_callback+0x29f/0xff0 fs/eventpoll.c:1148
      
      this is spin_lock(eventpoll->lock),
      
      	...
      	Freed by task 17774:
      	...
      	 kfree+0xe8/0x2c0 mm/slub.c:3883
      	 ep_free+0x22c/0x2a0 fs/eventpoll.c:865
      
      Fixes: 971316f0 ("epoll: ep_unregister_pollwait() can use the freed pwq->whead")
      Reported-by: N范龙飞 <long7573@126.com>
      Cc: stable@vger.kernel.org
      Signed-off-by: NOleg Nesterov <oleg@redhat.com>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      138e4ad6
    • L
      Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net · 8cf9f2a2
      Linus Torvalds 提交于
      Pull networking fixes from David Miller:
      
       1) Fix handling of pinned BPF map nodes in hash of maps, from Daniel
          Borkmann.
      
       2) IPSEC ESP error paths leak memory, from Steffen Klassert.
      
       3) We need an RCU grace period before freeing fib6_node objects, from
          Wei Wang.
      
       4) Must check skb_put_padto() return value in HSR driver, from FLorian
          Fainelli.
      
       5) Fix oops on PHY probe failure in ftgmac100 driver, from Andrew
          Jeffery.
      
       6) Fix infinite loop in UDP queue when using SO_PEEK_OFF, from Eric
          Dumazet.
      
       7) Use after free when tcf_chain_destroy() called multiple times, from
          Jiri Pirko.
      
       8) Fix KSZ DSA tag layer multiple free of SKBS, from Florian Fainelli.
      
       9) Fix leak of uninitialized memory in sctp_get_sctp_info(),
          inet_diag_msg_sctpladdrs_fill() and inet_diag_msg_sctpaddrs_fill().
          From Stefano Brivio.
      
      10) L2TP tunnel refcount fixes from Guillaume Nault.
      
      11) Don't leak UDP secpath in udp_set_dev_scratch(), from Yossi
          Kauperman.
      
      12) Revert a PHY layer change wrt. handling of PHY_HALTED state in
          phy_stop_machine(), it causes regressions for multiple people. From
          Florian Fainelli.
      
      13) When packets are sent out of br0 we have to clear the
          offload_fwdq_mark value.
      
      14) Several NULL pointer deref fixes in packet schedulers when their
          ->init() routine fails. From Nikolay Aleksandrov.
      
      15) Aquantium devices cannot checksum offload correctly when the packet
          is <= 60 bytes. From Pavel Belous.
      
      16) Fix vnet header access past end of buffer in AF_PACKET, from
          Benjamin Poirier.
      
      17) Double free in probe error paths of nfp driver, from Dan Carpenter.
      
      18) QOS capability not checked properly in DCB init paths of mlx5
          driver, from Huy Nguyen.
      
      19) Fix conflicts between firmware load failure and health_care timer in
          mlx5, also from Huy Nguyen.
      
      20) Fix dangling page pointer when DMA mapping errors occur in mlx5,
          from Eran Ben ELisha.
      
      21) ->ndo_setup_tc() in bnxt_en driver doesn't count rings properly,
          from Michael Chan.
      
      22) Missing MSIX vector free in bnxt_en, also from Michael Chan.
      
      23) Refcount leak in xfrm layer when using sk_policy, from Lorenzo
          Colitti.
      
      24) Fix copy of uninitialized data in qlge driver, from Arnd Bergmann.
      
      25) bpf_setsockopts() erroneously always returns -EINVAL even on
          success. Fix from Yuchung Cheng.
      
      26) tipc_rcv() needs to linearize the SKB before parsing the inner
          headers, from Parthasarathy Bhuvaragan.
      
      27) Fix deadlock between link status updates and link removal in netvsc
          driver, from Stephen Hemminger.
      
      28) Missed locking of page fragment handling in ESP output, from Steffen
          Klassert.
      
      29) Fix refcnt leak in ebpf congestion control code, from Sabrina
          Dubroca.
      
      30) sxgbe_probe_config_dt() doesn't check devm_kzalloc()'s return value,
          from Christophe Jaillet.
      
      31) Fix missing ipv6 rx_dst_cookie update when rx_dst is updated during
          early demux, from Paolo Abeni.
      
      32) Several info leaks in xfrm_user layer, from Mathias Krause.
      
      33) Fix out of bounds read in cxgb4 driver, from Stefano Brivio.
      
      34) Properly propagate obsolete state of route upwards in ipv6 so that
          upper holders like xfrm can see it. From Xin Long.
      
      * git://git.kernel.org/pub/scm/linux/kernel/git/davem/net: (118 commits)
        udp: fix secpath leak
        bridge: switchdev: Clear forward mark when transmitting packet
        mlxsw: spectrum: Forbid linking to devices that have uppers
        wl1251: add a missing spin_lock_init()
        Revert "net: phy: Correctly process PHY_HALTED in phy_stop_machine()"
        net: dsa: bcm_sf2: Fix number of CFP entries for BCM7278
        kcm: do not attach PF_KCM sockets to avoid deadlock
        sch_tbf: fix two null pointer dereferences on init failure
        sch_sfq: fix null pointer dereference on init failure
        sch_netem: avoid null pointer deref on init failure
        sch_fq_codel: avoid double free on init failure
        sch_cbq: fix null pointer dereferences on init failure
        sch_hfsc: fix null pointer deref and double free on init failure
        sch_hhf: fix null pointer dereference on init failure
        sch_multiq: fix double free on init failure
        sch_htb: fix crash on init failure
        net/mlx5e: Fix CQ moderation mode not set properly
        net/mlx5e: Fix inline header size for small packets
        net/mlx5: E-Switch, Unload the representors in the correct order
        net/mlx5e: Properly resolve TC offloaded ipv6 vxlan tunnel source address
        ...
      8cf9f2a2
    • L
      Merge tag 'ceph-for-4.13-rc8' of git://github.com/ceph/ceph-client · b8a78bb4
      Linus Torvalds 提交于
      Pull ceph fix from Ilya Dryomov:
       "ceph fscache page locking fix from Zheng, marked for stable"
      
      * tag 'ceph-for-4.13-rc8' of git://github.com/ceph/ceph-client:
        ceph: fix readpage from fscache
      b8a78bb4
    • L
      Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/dtor/input · 3e1d79c8
      Linus Torvalds 提交于
      Pull input fixes from Dmitry Torokhov:
       "Just a couple drivers fixes (Synaptics PS/2, Xpad)"
      
      * 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/dtor/input:
        Input: xpad - fix PowerA init quirk for some gamepad models
        Input: synaptics - fix device info appearing different on reconnect
      3e1d79c8
    • L
      Merge tag 'mmc-v4.13-rc7' of git://git.kernel.org/pub/scm/linux/kernel/git/ulfh/mmc · d7e44b86
      Linus Torvalds 提交于
      Pull two more MMC fixes from Ulf Hansson:
       "MMC core:
         - Fix block status codes
      
        MMC host:
         - sdhci-xenon: Fix SD bus voltage select"
      
      * tag 'mmc-v4.13-rc7' of git://git.kernel.org/pub/scm/linux/kernel/git/ulfh/mmc:
        mmc: sdhci-xenon: add set_power callback
        mmc: block: Fix block status codes
      d7e44b86
    • L
      Merge tag 'sound-4.13-rc8' of git://git.kernel.org/pub/scm/linux/kernel/git/tiwai/sound · 381cce59
      Linus Torvalds 提交于
      Pull sound fixes from Takashi Iwai:
       "Three regression fixes that should be addressed before the final
        release: a missing mutex call in OSS PCM emulation ioctl, ASoC rt5670
        headset detection breakage, and a regression in simple-card parser
        code"
      
      * tag 'sound-4.13-rc8' of git://git.kernel.org/pub/scm/linux/kernel/git/tiwai/sound:
        ASoC: simple_card_utils: fix fallback when "label" property isn't present
        ALSA: pcm: Fix power lock unbalance via OSS emulation
        ASoC: rt5670: Fix GPIO headset detection regression
      381cce59
    • L
      Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/s390/linux · bba2a5b8
      Linus Torvalds 提交于
      Pull s390 fixes from Martin Schwidefsky:
       "Three more bug fixes for v4.13.
      
        The two memory management related fixes are quite new, they fix kernel
        crashes that can be triggered by user space.
      
        The third commit fixes a bug in the vfio ccw translation code"
      
      * 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/s390/linux:
        s390/mm: fix BUG_ON in crst_table_upgrade
        s390/mm: fork vs. 5 level page tabel
        vfio: ccw: fix bad ptr math for TIC cda translation
      bba2a5b8
    • L
      Merge branch 'linus' of git://git.kernel.org/pub/scm/linux/kernel/git/herbert/crypto-2.6 · a1c516a6
      Linus Torvalds 提交于
      Pull crypto fixes from Herbert Xu:
       "This fixes the following issues:
      
         - Regression in chacha20 handling of chunked input
      
         - Crash in algif_skcipher when used with async io
      
         - Potential bogus pointer dereference in lib/mpi"
      
      * 'linus' of git://git.kernel.org/pub/scm/linux/kernel/git/herbert/crypto-2.6:
        crypto: algif_skcipher - only call put_page on referenced and used pages
        crypto: testmgr - add chunked test cases for chacha20
        crypto: chacha20 - fix handling of chunked input
        lib/mpi: kunmap after finishing accessing buffer
      a1c516a6
    • Y
      udp: fix secpath leak · e8a732d1
      Yossi Kuperman 提交于
      After commit dce4551c ("udp: preserve head state for IP_CMSG_PASSSEC")
      we preserve the secpath for the whole skb lifecycle, but we also
      end up leaking a reference to it.
      
      We must clear the head state on skb reception, if secpath is
      present.
      
      Fixes: dce4551c ("udp: preserve head state for IP_CMSG_PASSSEC")
      Signed-off-by: NYossi Kuperman <yossiku@mellanox.com>
      Signed-off-by: NPaolo Abeni <pabeni@redhat.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      e8a732d1
    • I
      bridge: switchdev: Clear forward mark when transmitting packet · 79e99bdd
      Ido Schimmel 提交于
      Commit 6bc506b4 ("bridge: switchdev: Add forward mark support for
      stacked devices") added the 'offload_fwd_mark' bit to the skb in order
      to allow drivers to indicate to the bridge driver that they already
      forwarded the packet in L2.
      
      In case the bit is set, before transmitting the packet from each port,
      the port's mark is compared with the mark stored in the skb's control
      block. If both marks are equal, we know the packet arrived from a switch
      device that already forwarded the packet and it's not re-transmitted.
      
      However, if the packet is transmitted from the bridge device itself
      (e.g., br0), we should clear the 'offload_fwd_mark' bit as the mark
      stored in the skb's control block isn't valid.
      
      This scenario can happen in rare cases where a packet was trapped during
      L3 forwarding and forwarded by the kernel to a bridge device.
      
      Fixes: 6bc506b4 ("bridge: switchdev: Add forward mark support for stacked devices")
      Signed-off-by: NIdo Schimmel <idosch@mellanox.com>
      Reported-by: NYotam Gigi <yotamg@mellanox.com>
      Tested-by: NYotam Gigi <yotamg@mellanox.com>
      Reviewed-by: NJiri Pirko <jiri@mellanox.com>
      Acked-by: NNikolay Aleksandrov <nikolay@cumulusnetworks.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      79e99bdd
    • I
      mlxsw: spectrum: Forbid linking to devices that have uppers · 25cc72a3
      Ido Schimmel 提交于
      The mlxsw driver relies on NETDEV_CHANGEUPPER events to configure the
      device in case a port is enslaved to a master netdev such as bridge or
      bond.
      
      Since the driver ignores events unrelated to its ports and their
      uppers, it's possible to engineer situations in which the device's data
      path differs from the kernel's.
      
      One example to such a situation is when a port is enslaved to a bond
      that is already enslaved to a bridge. When the bond was enslaved the
      driver ignored the event - as the bond wasn't one of its uppers - and
      therefore a bridge port instance isn't created in the device.
      
      Until such configurations are supported forbid them by checking that the
      upper device doesn't have uppers of its own.
      
      Fixes: 0d65fc13 ("mlxsw: spectrum: Implement LAG port join/leave")
      Signed-off-by: NIdo Schimmel <idosch@mellanox.com>
      Reported-by: NNogah Frankel <nogahf@mellanox.com>
      Tested-by: NNogah Frankel <nogahf@mellanox.com>
      Signed-off-by: NJiri Pirko <jiri@mellanox.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      25cc72a3
  4. 01 9月, 2017 4 次提交