1. 15 11月, 2020 5 次提交
    • L
      mm/slub: fix panic in slab_alloc_node() · 22e4663e
      Laurent Dufour 提交于
      While doing memory hot-unplug operation on a PowerPC VM running 1024 CPUs
      with 11TB of ram, I hit the following panic:
      
          BUG: Kernel NULL pointer dereference on read at 0x00000007
          Faulting instruction address: 0xc000000000456048
          Oops: Kernel access of bad area, sig: 11 [#2]
          LE PAGE_SIZE=64K MMU=Hash SMP NR_CPUS= 2048 NUMA pSeries
          Modules linked in: rpadlpar_io rpaphp
          CPU: 160 PID: 1 Comm: systemd Tainted: G      D           5.9.0 #1
          NIP:  c000000000456048 LR: c000000000455fd4 CTR: c00000000047b350
          REGS: c00006028d1b77a0 TRAP: 0300   Tainted: G      D            (5.9.0)
          MSR:  8000000000009033 <SF,EE,ME,IR,DR,RI,LE>  CR: 24004228  XER: 00000000
          CFAR: c00000000000f1b0 DAR: 0000000000000007 DSISR: 40000000 IRQMASK: 0
          GPR00: c000000000455fd4 c00006028d1b7a30 c000000001bec800 0000000000000000
          GPR04: 0000000000000dc0 0000000000000000 00000000000374ef c00007c53df99320
          GPR08: 000007c53c980000 0000000000000000 000007c53c980000 0000000000000000
          GPR12: 0000000000004400 c00000001e8e4400 0000000000000000 0000000000000f6a
          GPR16: 0000000000000000 c000000001c25930 c000000001d62528 00000000000000c1
          GPR20: c000000001d62538 c00006be469e9000 0000000fffffffe0 c0000000003c0ff8
          GPR24: 0000000000000018 0000000000000000 0000000000000dc0 0000000000000000
          GPR28: c00007c513755700 c000000001c236a4 c00007bc4001f800 0000000000000001
          NIP [c000000000456048] __kmalloc_node+0x108/0x790
          LR [c000000000455fd4] __kmalloc_node+0x94/0x790
          Call Trace:
            kvmalloc_node+0x58/0x110
            mem_cgroup_css_online+0x10c/0x270
            online_css+0x48/0xd0
            cgroup_apply_control_enable+0x2c4/0x470
            cgroup_mkdir+0x408/0x5f0
            kernfs_iop_mkdir+0x90/0x100
            vfs_mkdir+0x138/0x250
            do_mkdirat+0x154/0x1c0
            system_call_exception+0xf8/0x200
            system_call_common+0xf0/0x27c
          Instruction dump:
          e93e0000 e90d0030 39290008 7cc9402a e94d0030 e93e0000 7ce95214 7f89502a
          2fbc0000 419e0018 41920230 e9270010 <89290007> 7f994800 419e0220 7ee6bb78
      
      This pointing to the following code:
      
          mm/slub.c:2851
                  if (unlikely(!object || !node_match(page, node))) {
          c000000000456038:       00 00 bc 2f     cmpdi   cr7,r28,0
          c00000000045603c:       18 00 9e 41     beq     cr7,c000000000456054 <__kmalloc_node+0x114>
          node_match():
          mm/slub.c:2491
                  if (node != NUMA_NO_NODE && page_to_nid(page) != node)
          c000000000456040:       30 02 92 41     beq     cr4,c000000000456270 <__kmalloc_node+0x330>
          page_to_nid():
          include/linux/mm.h:1294
          c000000000456044:       10 00 27 e9     ld      r9,16(r7)
          c000000000456048:       07 00 29 89     lbz     r9,7(r9)	<<<< r9 = NULL
          node_match():
          mm/slub.c:2491
          c00000000045604c:       00 48 99 7f     cmpw    cr7,r25,r9
          c000000000456050:       20 02 9e 41     beq     cr7,c000000000456270 <__kmalloc_node+0x330>
      
      The panic occurred in slab_alloc_node() when checking for the page's node:
      
      	object = c->freelist;
      	page = c->page;
      	if (unlikely(!object || !node_match(page, node))) {
      		object = __slab_alloc(s, gfpflags, node, addr, c);
      		stat(s, ALLOC_SLOWPATH);
      
      The issue is that object is not NULL while page is NULL which is odd but
      may happen if the cache flush happened after loading object but before
      loading page.  Thus checking for the page pointer is required too.
      
      The cache flush is done through an inter processor interrupt when a
      piece of memory is off-lined.  That interrupt is triggered when a memory
      hot-unplug operation is initiated and offline_pages() is calling the
      slub's MEM_GOING_OFFLINE callback slab_mem_going_offline_callback()
      which is calling flush_cpu_slab().  If that interrupt is caught between
      the reading of c->freelist and the reading of c->page, this could lead
      to such a situation.  That situation is expected and the later call to
      this_cpu_cmpxchg_double() will detect the change to c->freelist and redo
      the whole operation.
      
      In commit 6159d0f5 ("mm/slub.c: page is always non-NULL in
      node_match()") check on the page pointer has been removed assuming that
      page is always valid when it is called.  It happens that this is not
      true in that particular case, so check for page before calling
      node_match() here.
      
      Fixes: 6159d0f5 ("mm/slub.c: page is always non-NULL in node_match()")
      Signed-off-by: NLaurent Dufour <ldufour@linux.ibm.com>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Acked-by: NVlastimil Babka <vbabka@suse.cz>
      Acked-by: NChristoph Lameter <cl@linux.com>
      Cc: Wei Yang <richard.weiyang@gmail.com>
      Cc: Pekka Enberg <penberg@kernel.org>
      Cc: David Rientjes <rientjes@google.com>
      Cc: Joonsoo Kim <iamjoonsoo.kim@lge.com>
      Cc: Nathan Lynch <nathanl@linux.ibm.com>
      Cc: Scott Cheloha <cheloha@linux.ibm.com>
      Cc: Michal Hocko <mhocko@suse.com>
      Cc: <stable@vger.kernel.org>
      Link: https://lkml.kernel.org/r/20201027190406.33283-1-ldufour@linux.ibm.comSigned-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      22e4663e
    • D
      mailmap: fix entry for Dmitry Baryshkov/Eremin-Solenikov · 044747e9
      Dmitry Baryshkov 提交于
      Change back surname to new (old) one.  Dmitry Baryshkov -> Dmitry
      Eremin-Solenikov -> Dmitry Baryshkov.  Map several odd entries to main
      identity.
      Signed-off-by: NDmitry Baryshkov <dbaryshkov@gmail.com>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Link: https://lkml.kernel.org/r/20201103005158.1181426-1-dmitry.baryshkov@linaro.orgSigned-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      044747e9
    • N
      mm/vmscan: fix NR_ISOLATED_FILE corruption on 64-bit · 2da9f630
      Nicholas Piggin 提交于
      Previously the negated unsigned long would be cast back to signed long
      which would have the correct negative value.  After commit 730ec8c0
      ("mm/vmscan.c: change prototype for shrink_page_list"), the large
      unsigned int converts to a large positive signed long.
      
      Symptoms include CMA allocations hanging forever holding the cma_mutex
      due to alloc_contig_range->...->isolate_migratepages_block waiting
      forever in "while (unlikely(too_many_isolated(pgdat)))".
      
      [akpm@linux-foundation.org: fix -stat.nr_lazyfree_fail as well, per Michal]
      
      Fixes: 730ec8c0 ("mm/vmscan.c: change prototype for shrink_page_list")
      Signed-off-by: NNicholas Piggin <npiggin@gmail.com>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Acked-by: NMichal Hocko <mhocko@suse.com>
      Cc: Vaneet Narang <v.narang@samsung.com>
      Cc: Maninder Singh <maninder1.s@samsung.com>
      Cc: Amit Sahrawat <a.sahrawat@samsung.com>
      Cc: Mel Gorman <mgorman@suse.de>
      Cc: Vlastimil Babka <vbabka@suse.cz>
      Cc: <stable@vger.kernel.org>
      Link: https://lkml.kernel.org/r/20201029032320.1448441-1-npiggin@gmail.comSigned-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      2da9f630
    • Z
      mm/compaction: stop isolation if too many pages are isolated and we have pages to migrate · d20bdd57
      Zi Yan 提交于
      In isolate_migratepages_block, if we have too many isolated pages and
      nr_migratepages is not zero, we should try to migrate what we have
      without wasting time on isolating.
      
      In theory it's possible that multiple parallel compactions will cause
      too_many_isolated() to become true even if each has isolated less than
      COMPACT_CLUSTER_MAX, and loop forever in the while loop.  Bailing
      immediately prevents that.
      
      [vbabka@suse.cz: changelog addition]
      
      Fixes: 1da2f328 (“mm,thp,compaction,cma: allow THP migration for CMA allocations”)
      Suggested-by: NVlastimil Babka <vbabka@suse.cz>
      Signed-off-by: NZi Yan <ziy@nvidia.com>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Cc: <stable@vger.kernel.org>
      Cc: Mel Gorman <mgorman@techsingularity.net>
      Cc: Michal Hocko <mhocko@kernel.org>
      Cc: Rik van Riel <riel@surriel.com>
      Cc: Yang Shi <shy828301@gmail.com>
      Link: https://lkml.kernel.org/r/20201030183809.3616803-2-zi.yan@sent.comSigned-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      d20bdd57
    • Z
      mm/compaction: count pages and stop correctly during page isolation · 38935861
      Zi Yan 提交于
      In isolate_migratepages_block, when cc->alloc_contig is true, we are
      able to isolate compound pages.  But nr_migratepages and nr_isolated did
      not count compound pages correctly, causing us to isolate more pages
      than we thought.
      
      So count compound pages as the number of base pages they contain.
      Otherwise, we might be trapped in too_many_isolated while loop, since
      the actual isolated pages can go up to COMPACT_CLUSTER_MAX*512=16384,
      where COMPACT_CLUSTER_MAX is 32, since we stop isolation after
      cc->nr_migratepages reaches to COMPACT_CLUSTER_MAX.
      
      In addition, after we fix the issue above, cc->nr_migratepages could
      never be equal to COMPACT_CLUSTER_MAX if compound pages are isolated,
      thus page isolation could not stop as we intended.  Change the isolation
      stop condition to '>='.
      
      The issue can be triggered as follows:
      
      In a system with 16GB memory and an 8GB CMA region reserved by
      hugetlb_cma, if we first allocate 10GB THPs and mlock them (so some THPs
      are allocated in the CMA region and mlocked), reserving 6 1GB hugetlb
      pages via /sys/kernel/mm/hugepages/hugepages-1048576kB/nr_hugepages will
      get stuck (looping in too_many_isolated function) until we kill either
      task.  With the patch applied, oom will kill the application with 10GB
      THPs and let hugetlb page reservation finish.
      
      [ziy@nvidia.com: v3]
      
      Link: https://lkml.kernel.org/r/20201030183809.3616803-1-zi.yan@sent.com
      Fixes: 1da2f328 ("cmm,thp,compaction,cma: allow THP migration for CMA allocations")
      Signed-off-by: NZi Yan <ziy@nvidia.com>
      Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
      Reviewed-by: NYang Shi <shy828301@gmail.com>
      Acked-by: NVlastimil Babka <vbabka@suse.cz>
      Cc: Rik van Riel <riel@surriel.com>
      Cc: Michal Hocko <mhocko@kernel.org>
      Cc: Mel Gorman <mgorman@techsingularity.net>
      Cc: <stable@vger.kernel.org>
      Link: https://lkml.kernel.org/r/20201029200435.3386066-1-zi.yan@sent.comSigned-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      38935861
  2. 14 11月, 2020 9 次提交
    • L
      Merge tag 'devicetree-fixes-for-5.10-2' of git://git.kernel.org/pub/scm/linux/kernel/git/robh/linux · 9e6a39ea
      Linus Torvalds 提交于
      Pull devicetree fixes from Rob Herring:
      
       - fix Flexcan binding schema errors introduced in rc3
      
       - fix an of_node ref counting error in of_dma_is_coherent
      
      * tag 'devicetree-fixes-for-5.10-2' of git://git.kernel.org/pub/scm/linux/kernel/git/robh/linux:
        dt-bindings: clock: imx5: fix example
        dt-bindings: can: fsl,flexcan.yaml: fix compatible for i.MX35 and i.MX53
        dt-bindings: can: fsl,flexcan.yaml: fix fsl,stop-mode
        of/address: Fix of_node memory leak in of_dma_is_coherent
      9e6a39ea
    • L
      Merge tag 'pinctrl-v5.10-2' of git://git.kernel.org/pub/scm/linux/kernel/git/linusw/linux-pinctrl · 4de010e2
      Linus Torvalds 提交于
      Pull pin control fixes from Linus Walleij:
       "A bunch of pin control fixes for the v5.10 kernel series.
      
        Nothing in particular to say about it, because they are all driver
        fixes.
      
        I'm happy that some AMD driver fixes are appearing, it's been an
        undermaintained driver, and laptops have suffered.
      
        Summary:
      
         - Two fixes to the Intel pin controller drivers: fixing pull
           resistance bias.
      
         - Fix some invalid SSI pins on the Ingenic pin controller.
      
         - Make sure the clock is enabled when requesting interrupts from the
           Rockchip GPIO controller.
      
         - Make sure IRQs are mapped when looking up the IRQ for a GPIO line
           on the Rockchip GPIO Write.
      
         - Two regmap initialization fixes for the MCP23s08.
      
         - Fix a GPI-only prefix function problem on the Aspeed pin
           controller.
      
         - Disable the debounce filter correctly on the AMD pin controller.
      
         - Correct the timer clock setting for the AMD debounce timer.
      
         - Make the Qualcomm pin controller more cautious around the handling
           of PDC-related GPIO interrupts.
      
         - Fix the interrupt map in the Qualcomm SM8250 pin controller"
      
      * tag 'pinctrl-v5.10-2' of git://git.kernel.org/pub/scm/linux/kernel/git/linusw/linux-pinctrl:
        pinctrl: qcom: sm8250: Specify PDC map
        pinctrl: qcom: Move clearing pending IRQ to .irq_request_resources callback
        pinctrl: amd: use higher precision for 512 RtcClk
        pinctrl: amd: fix incorrect way to disable debounce filter
        pinctrl: aspeed: Fix GPI only function problem.
        pinctrl: mcp23s08: Print error message when regmap init fails
        pinctrl: mcp23s08: Use full chunk of memory for regmap configuration
        pinctrl: rockchip: create irq mapping in gpio_to_irq
        pinctrl: rockchip: enable gpio pclk for rockchip_gpio_to_irq
        pinctrl: ingenic: Fix invalid SSI pins
        pinctrl: intel: Set default bias in case no particular value given
        pinctrl: intel: Fix 2 kOhm bias which is 833 Ohm
      4de010e2
    • L
      Merge tag 'gpio-v5.10-2' of git://git.kernel.org/pub/scm/linux/kernel/git/linusw/linux-gpio · 29eb6b7d
      Linus Torvalds 提交于
      Pull GPIO fixes from Linus Walleij:
       "Some GPIO fixes I've collected with the help of Bartosz.
      
        Nothing special about them: all are driver and kbuild fixes + some
        documentation fixes:
      
         - Tidy up a missed function call in the designware driver when
           converting to gpiolib irqchip
      
         - Fix some bitmasks in the Aspeed driver
      
         - Fix some kerneldoc warnings and minor bugs in the improved
           userspace API documentation
      
         - Revert the revert of the OMAP fix for lost edge wakeup interrupts:
           the fix needs to stay in
      
         - Fix a compile error when deselecting the character device
      
         - A bunch of IRQ fixes on the idio GPIO drivers
      
         - Fix an off-by-one error in the SiFive GPIO driver"
      
      * tag 'gpio-v5.10-2' of git://git.kernel.org/pub/scm/linux/kernel/git/linusw/linux-gpio:
        gpio: sifive: Fix SiFive gpio probe
        gpio: pcie-idio-24: Enable PEX8311 interrupts
        gpio: pcie-idio-24: Fix IRQ Enable Register value
        gpio: pcie-idio-24: Fix irq mask when masking
        gpiolib: fix sysfs when cdev is not selected
        Revert "Revert "gpio: omap: Fix lost edge wake-up interrupts""
        gpio: uapi: clarify the meaning of 'empty' char arrays
        gpio: uapi: remove whitespace
        gpio: uapi: kernel-doc formatting improvements
        gpio: uapi: comment consistency
        gpio: uapi: fix kernel-doc warnings
        gpio: aspeed: fix ast2600 bank properties
        gpio: dwapb: Fix missing conversion to GPIO-lib-based IRQ-chip
      29eb6b7d
    • L
      Merge tag 'mmc-v5.10-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/ulfh/mmc · 673cb932
      Linus Torvalds 提交于
      Pull MMC fixes from Ulf Hansson:
      
       - tmio: Fixup support for reset
      
       - sdhci-of-esdhc: Extend erratum for pulse width to more broken HWs
      
       - renesas_sdhi: Fix re-binding of drivers
      
      * tag 'mmc-v5.10-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/ulfh/mmc:
        Revert "mmc: renesas_sdhi: workaround a regression when reinserting SD cards"
        mmc: tmio: bring tuning HW to a sane state with MMC_POWER_OFF
        mmc: tmio: when resetting, reset DMA controller, too
        mmc: sdhci-of-esdhc: Handle pulse width detection erratum for more SoCs
        mmc: renesas_sdhi_core: Add missing tmio_mmc_host_free() at remove
      673cb932
    • L
      Merge tag 'drm-fixes-2020-11-13' of git://anongit.freedesktop.org/drm/drm · e627c255
      Linus Torvalds 提交于
      Pull drm fixes from Dave Airlie:
       "Nearly didn't send you a PR this week at all, but a few things
        trickled in over the day, not a huge amount here, some i915, amdgpu
        and a bunch of misc fixes. I have a couple of nouveau fixes
        outstanding due to the PR having the wrong base, I'll figure it out
        next week.
      
        amdgpu:
         - Pageflip fix for DCN3
         - Declare TA firmware for green sardine
         - Headless navi fix
      
        i915:
         - Pull phys pread/pwrite implementations to the backend
         - Correctly set SFC capability for video engines
      
        bridge:
         - cdns Kconfig fix
      
        hyperv_fb:
         - fix missing include
      
        gma500:
         - oob access fix
      
        mcde:
         - unbalanced regulator fix"
      
      * tag 'drm-fixes-2020-11-13' of git://anongit.freedesktop.org/drm/drm:
        drm/amdgpu: enable DCN for navi10 headless SKU
        drm/amdgpu: add ta firmware load for green-sardine
        drm/i915: Correctly set SFC capability for video engines
        drm/i915/gem: Pull phys pread/pwrite implementations to the backend
        drm/i915/gem: Allow backends to override pread implementation
        drm/mcde: Fix unbalanced regulator
        drm/gma500: Fix out-of-bounds access to struct drm_device.vblank[]
        video: hyperv_fb: include vmalloc.h
        drm: bridge: cdns: Kconfig: Switch over dependency to ARCH_K3
        drm/amd/display: Add missing pflip irq
      e627c255
    • L
      Merge tag 'trace-v5.10-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/rostedt/linux-trace · 6186313d
      Linus Torvalds 提交于
      Pull bootconfig fix from Steven Rostedt:
       "Fix alignment of bootconfig
      
        GRUB may align the init ramdisk size to 4 bytes, the magic number at
        the end of the init ramdisk that denotes bootconfig is attached may
        not be at the exact end of the ramdisk. The kernel needs to check back
        at least 4 bytes"
      
      * tag 'trace-v5.10-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/rostedt/linux-trace:
        bootconfig: Extend the magic check range to the preceding 3 bytes
      6186313d
    • L
      Merge tag 'for-linus' of git://git.armlinux.org.uk/~rmk/linux-arm · e45f90fc
      Linus Torvalds 提交于
      Pull ARM fix from Russell King:
       "Just one bug fix: avoid a fortify panic when copying optprobe template"
      
      * tag 'for-linus' of git://git.armlinux.org.uk/~rmk/linux-arm:
        ARM: 9019/1: kprobes: Avoid fortify_panic() when copying optprobe template
      e45f90fc
    • L
      Merge tag 'arm64-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/arm64/linux · 88b31f07
      Linus Torvalds 提交于
      Pull arm64 fixes from Will Deacon:
      
       - Spectre/Meltdown safelisting for some Qualcomm KRYO cores
      
       - Fix RCU splat when failing to online a CPU due to a feature mismatch
      
       - Fix a recently introduced sparse warning in kexec()
      
       - Fix handling of CPU erratum 1418040 for late CPUs
      
       - Ensure hot-added memory falls within linear-mapped region
      
      * tag 'arm64-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/arm64/linux:
        arm64: cpu_errata: Apply Erratum 845719 to KRYO2XX Silver
        arm64: proton-pack: Add KRYO2XX silver CPUs to spectre-v2 safe-list
        arm64: kpti: Add KRYO2XX gold/silver CPU cores to kpti safelist
        arm64: Add MIDR value for KRYO2XX gold/silver CPU cores
        arm64/mm: Validate hotplug range before creating linear mapping
        arm64: smp: Tell RCU about CPUs that fail to come online
        arm64: psci: Avoid printing in cpu_psci_cpu_die()
        arm64: kexec_file: Fix sparse warning
        arm64: errata: Fix handling of 1418040 with late CPU onlining
      88b31f07
    • L
      Merge tag 'ext4_for_linus_bugfixes' of git://git.kernel.org/pub/scm/linux/kernel/git/tytso/ext4 · d3ba7afc
      Linus Torvalds 提交于
      Pull ext4 fixes from Ted Ts'o:
       "Two ext4 bug fixes, one being a revert of a commit sent during the
        merge window"
      
      * tag 'ext4_for_linus_bugfixes' of git://git.kernel.org/pub/scm/linux/kernel/git/tytso/ext4:
        Revert "ext4: fix superblock checksum calculation race"
        ext4: handle dax mount option collision
      d3ba7afc
  3. 13 11月, 2020 26 次提交