1. 13 1月, 2021 8 次提交
    • P
      bnxt_en: Clear DEFRAG flag in firmware message when retry flashing. · 68748775
      Pavan Chebbi 提交于
      When the FW tells the driver to retry the INSTALL_UPDATE command after
      it has cleared the NVM area, the driver is not clearing the previously
      used ALLOWED_TO_DEFRAG flag. As a result the FW tries to defrag the NVM
      area a second time in a loop and can fail the request.
      
      Fixes: 1432c3f6 ("bnxt_en: Retry installing FW package under NO_SPACE error condition.")
      Signed-off-by: NPavan Chebbi <pavan.chebbi@broadcom.com>
      Signed-off-by: NMichael Chan <michael.chan@broadcom.com>
      Signed-off-by: NJakub Kicinski <kuba@kernel.org>
      68748775
    • M
      bnxt_en: Improve stats context resource accounting with RDMA driver loaded. · 869c4d5e
      Michael Chan 提交于
      The function bnxt_get_ulp_stat_ctxs() does not count the stats contexts
      used by the RDMA driver correctly when the RDMA driver is freeing the
      MSIX vectors.  It assumes that if the RDMA driver is registered, the
      additional stats contexts will be needed.  This is not true when the
      RDMA driver is about to unregister and frees the MSIX vectors.
      
      This slight error leads to over accouting of the stats contexts needed
      after the RDMA driver has unloaded.  This will cause some firmware
      warning and error messages in dmesg during subsequent config. changes
      or ifdown/ifup.
      
      Fix it by properly accouting for extra stats contexts only if the
      RDMA driver is registered and MSIX vectors have been successfully
      requested.
      
      Fixes: c027c6b4 ("bnxt_en: get rid of num_stat_ctxs variable")
      Reviewed-by: NYongping Zhang <yongping.zhang@broadcom.com>
      Reviewed-by: NPavan Chebbi <pavan.chebbi@broadcom.com>
      Signed-off-by: NMichael Chan <michael.chan@broadcom.com>
      Signed-off-by: NJakub Kicinski <kuba@kernel.org>
      869c4d5e
    • L
      r8153_ecm: Add Lenovo Powered USB-C Hub as a fallback of r8152 · 2284bbd0
      Leon Schuermann 提交于
      This commit enables the use of the r8153_ecm driver, introduced with
      commit c1aedf01 ("net/usb/r8153_ecm: support ECM mode for
      RTL8153") for the Lenovo Powered USB-C Hub (17ef:721e) based on the
      Realtek RTL8153B chip.
      
      This results in the following driver preference:
      
      - if r8152 is available, use the r8152 driver
      - if r8152 is not available, use the r8153_ecm driver
      
      This is done to prevent the NIC from constantly sending pause frames
      when the host system enters standby (fixed by using the r8152 driver
      in "r8152: Add Lenovo Powered USB-C Travel Hub"), while still allowing
      the device to work with the r8153_ecm driver as a fallback.
      Signed-off-by: NLeon Schuermann <leon@is.currently.online>
      Tested-by: NLeon Schuermann <leon@is.currently.online>
      Link: https://lore.kernel.org/r/20210111190312.12589-3-leon@is.currently.onlineSigned-off-by: NJakub Kicinski <kuba@kernel.org>
      2284bbd0
    • L
      r8152: Add Lenovo Powered USB-C Travel Hub · cb82a549
      Leon Schuermann 提交于
      This USB-C Hub (17ef:721e) based on the Realtek RTL8153B chip used to
      use the cdc_ether driver. However, using this driver, with the system
      suspended the device constantly sends pause-frames as soon as the
      receive buffer fills up. This causes issues with other devices, where
      some Ethernet switches stop forwarding packets altogether.
      
      Using the Realtek driver (r8152) fixes this issue. Pause frames are no
      longer sent while the host system is suspended.
      Signed-off-by: NLeon Schuermann <leon@is.currently.online>
      Tested-by: NLeon Schuermann <leon@is.currently.online>
      Link: https://lore.kernel.org/r/20210111190312.12589-2-leon@is.currently.onlineSigned-off-by: NJakub Kicinski <kuba@kernel.org>
      cb82a549
    • V
      net: dsa: clear devlink port type before unregistering slave netdevs · 91158e16
      Vladimir Oltean 提交于
      Florian reported a use-after-free bug in devlink_nl_port_fill found with
      KASAN:
      
      (devlink_nl_port_fill)
      (devlink_port_notify)
      (devlink_port_unregister)
      (dsa_switch_teardown.part.3)
      (dsa_tree_teardown_switches)
      (dsa_unregister_switch)
      (bcm_sf2_sw_remove)
      (platform_remove)
      (device_release_driver_internal)
      (device_links_unbind_consumers)
      (device_release_driver_internal)
      (device_driver_detach)
      (unbind_store)
      
      Allocated by task 31:
       alloc_netdev_mqs+0x5c/0x50c
       dsa_slave_create+0x110/0x9c8
       dsa_register_switch+0xdb0/0x13a4
       b53_switch_register+0x47c/0x6dc
       bcm_sf2_sw_probe+0xaa4/0xc98
       platform_probe+0x90/0xf4
       really_probe+0x184/0x728
       driver_probe_device+0xa4/0x278
       __device_attach_driver+0xe8/0x148
       bus_for_each_drv+0x108/0x158
      
      Freed by task 249:
       free_netdev+0x170/0x194
       dsa_slave_destroy+0xac/0xb0
       dsa_port_teardown.part.2+0xa0/0xb4
       dsa_tree_teardown_switches+0x50/0xc4
       dsa_unregister_switch+0x124/0x250
       bcm_sf2_sw_remove+0x98/0x13c
       platform_remove+0x44/0x5c
       device_release_driver_internal+0x150/0x254
       device_links_unbind_consumers+0xf8/0x12c
       device_release_driver_internal+0x84/0x254
       device_driver_detach+0x30/0x34
       unbind_store+0x90/0x134
      
      What happens is that devlink_port_unregister emits a netlink
      DEVLINK_CMD_PORT_DEL message which associates the devlink port that is
      getting unregistered with the ifindex of its corresponding net_device.
      Only trouble is, the net_device has already been unregistered.
      
      It looks like we can stub out the search for a corresponding net_device
      if we clear the devlink_port's type. This looks like a bit of a hack,
      but also seems to be the reason why the devlink_port_type_clear function
      exists in the first place.
      
      Fixes: 3122433e ("net: dsa: Register devlink ports before calling DSA driver setup()")
      Signed-off-by: NVladimir Oltean <vladimir.oltean@nxp.com>
      Reviewed-by: NFlorian Fainelli <f.fainelli@gmail.com>
      Tested-by: NFlorian fainelli <f.fainelli@gmail.com>
      Reported-by: NFlorian Fainelli <f.fainelli@gmail.com>
      Link: https://lore.kernel.org/r/20210112004831.3778323-1-olteanv@gmail.comSigned-off-by: NJakub Kicinski <kuba@kernel.org>
      91158e16
    • V
      net: dsa: unbind all switches from tree when DSA master unbinds · 07b90056
      Vladimir Oltean 提交于
      Currently the following happens when a DSA master driver unbinds while
      there are DSA switches attached to it:
      
      $ echo 0000:00:00.5 > /sys/bus/pci/drivers/mscc_felix/unbind
      ------------[ cut here ]------------
      WARNING: CPU: 0 PID: 392 at net/core/dev.c:9507
      Call trace:
       rollback_registered_many+0x5fc/0x688
       unregister_netdevice_queue+0x98/0x120
       dsa_slave_destroy+0x4c/0x88
       dsa_port_teardown.part.16+0x78/0xb0
       dsa_tree_teardown_switches+0x58/0xc0
       dsa_unregister_switch+0x104/0x1b8
       felix_pci_remove+0x24/0x48
       pci_device_remove+0x48/0xf0
       device_release_driver_internal+0x118/0x1e8
       device_driver_detach+0x28/0x38
       unbind_store+0xd0/0x100
      
      Located at the above location is this WARN_ON:
      
      	/* Notifier chain MUST detach us all upper devices. */
      	WARN_ON(netdev_has_any_upper_dev(dev));
      
      Other stacked interfaces, like VLAN, do indeed listen for
      NETDEV_UNREGISTER on the real_dev and also unregister themselves at that
      time, which is clearly the behavior that rollback_registered_many
      expects. But DSA interfaces are not VLAN. They have backing hardware
      (platform devices, PCI devices, MDIO, SPI etc) which have a life cycle
      of their own and we can't just trigger an unregister from the DSA
      framework when we receive a netdev notifier that the master unregisters.
      
      Luckily, there is something we can do, and that is to inform the driver
      core that we have a runtime dependency to the DSA master interface's
      device, and create a device link where that is the supplier and we are
      the consumer. Having this device link will make the DSA switch unbind
      before the DSA master unbinds, which is enough to avoid the WARN_ON from
      rollback_registered_many.
      
      Note that even before the blamed commit, DSA did nothing intelligent
      when the master interface got unregistered either. See the discussion
      here:
      https://lore.kernel.org/netdev/20200505210253.20311-1-f.fainelli@gmail.com/
      But this time, at least the WARN_ON is loud enough that the
      upper_dev_link commit can be blamed.
      
      The advantage with this approach vs dev_hold(master) in the attached
      link is that the latter is not meant for long term reference counting.
      With dev_hold, the only thing that will happen is that when the user
      attempts an unbind of the DSA master, netdev_wait_allrefs will keep
      waiting and waiting, due to DSA keeping the refcount forever. DSA would
      not access freed memory corresponding to the master interface, but the
      unbind would still result in a freeze. Whereas with device links,
      graceful teardown is ensured. It even works with cascaded DSA trees.
      
      $ echo 0000:00:00.2 > /sys/bus/pci/drivers/fsl_enetc/unbind
      [ 1818.797546] device swp0 left promiscuous mode
      [ 1819.301112] sja1105 spi2.0: Link is Down
      [ 1819.307981] DSA: tree 1 torn down
      [ 1819.312408] device eno2 left promiscuous mode
      [ 1819.656803] mscc_felix 0000:00:00.5: Link is Down
      [ 1819.667194] DSA: tree 0 torn down
      [ 1819.711557] fsl_enetc 0000:00:00.2 eno2: Link is Down
      
      This approach allows us to keep the DSA framework absolutely unchanged,
      and the driver core will just know to unbind us first when the master
      goes away - as opposed to the large (and probably impossible) rework
      required if attempting to listen for NETDEV_UNREGISTER.
      
      As per the documentation at Documentation/driver-api/device_link.rst,
      specifying the DL_FLAG_AUTOREMOVE_CONSUMER flag causes the device link
      to be automatically purged when the consumer fails to probe or later
      unbinds. So we don't need to keep the consumer_link variable in struct
      dsa_switch.
      
      Fixes: 2f1e8ea7 ("net: dsa: link interfaces with the DSA master to get rid of lockdep warnings")
      Signed-off-by: NVladimir Oltean <vladimir.oltean@nxp.com>
      Reviewed-by: NFlorian Fainelli <f.fainelli@gmail.com>
      Tested-by: NFlorian Fainelli <f.fainelli@gmail.com>
      Link: https://lore.kernel.org/r/20210111230943.3701806-1-olteanv@gmail.comSigned-off-by: NJakub Kicinski <kuba@kernel.org>
      07b90056
    • M
      net: phy: smsc: fix clk error handling · a18caa97
      Marco Felsch 提交于
      Commit bedd8d78 ("net: phy: smsc: LAN8710/20: add phy refclk in
      support") added the phy clk support. The commit already checks if
      clk_get_optional() throw an error but instead of returning the error it
      ignores it.
      
      Fixes: bedd8d78 ("net: phy: smsc: LAN8710/20: add phy refclk in support")
      Suggested-by: NJakub Kicinski <kuba@kernel.org>
      Signed-off-by: NMarco Felsch <m.felsch@pengutronix.de>
      Reviewed-by: NAndrew Lunn <andrew@lunn.ch>
      Link: https://lore.kernel.org/r/20210111085932.28680-1-m.felsch@pengutronix.deSigned-off-by: NJakub Kicinski <kuba@kernel.org>
      a18caa97
    • P
      net: dcb: Accept RTM_GETDCB messages carrying set-like DCB commands · df85bc14
      Petr Machata 提交于
      In commit 826f328e ("net: dcb: Validate netlink message in DCB
      handler"), Linux started rejecting RTM_GETDCB netlink messages if they
      contained a set-like DCB_CMD_ command.
      
      The reason was that privileges were only verified for RTM_SETDCB messages,
      but the value that determined the action to be taken is the command, not
      the message type. And validation of message type against the DCB command
      was the obvious missing piece.
      
      Unfortunately it turns out that mlnx_qos, a somewhat widely deployed tool
      for configuration of DCB, accesses the DCB set-like APIs through
      RTM_GETDCB.
      
      Therefore do not bounce the discrepancy between message type and command.
      Instead, in addition to validating privileges based on the actual message
      type, validate them also based on the expected message type. This closes
      the loophole of allowing DCB configuration on non-admin accounts, while
      maintaining backward compatibility.
      
      Fixes: 2f90b865 ("ixgbe: this patch adds support for DCB to the kernel and ixgbe driver")
      Fixes: 826f328e ("net: dcb: Validate netlink message in DCB handler")
      Signed-off-by: NPetr Machata <petrm@nvidia.com>
      Link: https://lore.kernel.org/r/a3edcfda0825f2aa2591801c5232f2bbf2d8a554.1610384801.git.me@pmachata.orgSigned-off-by: NJakub Kicinski <kuba@kernel.org>
      df85bc14
  2. 12 1月, 2021 7 次提交
  3. 10 1月, 2021 8 次提交
  4. 09 1月, 2021 12 次提交
    • J
      Merge branch 'net-fix-issues-around-register_netdevice-failures' · c49243e8
      Jakub Kicinski 提交于
      Jakub Kicinski says:
      
      ====================
      net: fix issues around register_netdevice() failures
      
      This series attempts to clean up the life cycle of struct
      net_device. Dave has added dev->needs_free_netdev in the
      past to fix double frees, we can lean on that mechanism
      a little more to fix remaining issues with register_netdevice().
      
      This is the next chapter of the saga which already includes:
      commit 0e0eee24 ("net: correct error path in rtnl_newlink()")
      commit e51fb152 ("rtnetlink: fix a memory leak when ->newlink fails")
      commit cf124db5 ("net: Fix inconsistent teardown and release of private netdev state.")
      commit 93ee31f1 ("[NET]: Fix free_netdev on register_netdev failure.")
      commit 814152a8 ("net: fix memleak in register_netdevice()")
      commit 10cc514f ("net: Fix null de-reference of device refcount")
      
      The immediate problem which gets fixed here is that calling
      free_netdev() right after unregister_netdevice() is illegal
      because we need to release rtnl_lock first, to let the
      unregistration finish. Note that unregister_netdevice() is
      just a wrapper of unregister_netdevice_queue(), it only
      does half of the job.
      
      Where this limitation becomes most problematic is in failure
      modes of register_netdevice(). There is a notifier call right
      at the end of it, which lets other subsystems veto the entire
      thing. At which point we should really go through a full
      unregister_netdevice(), but we can't because callers may
      go straight to free_netdev() after the failure, and that's
      no bueno (see the previous paragraph).
      
      This set makes free_netdev() more lenient, when device
      is still being unregistered free_netdev() will simply set
      dev->needs_free_netdev and let the unregister process do
      the freeing.
      
      With the free_netdev() problem out of the way failures in
      register_netdevice() can make use of net_todo, again.
      Users are still expected to call free_netdev() right after
      failure but that will only set dev->needs_free_netdev.
      
      To prevent the pathological case of:
      
       dev->needs_free_netdev = true;
       if (register_netdevice(dev)) {
         rtnl_unlock();
         free_netdev(dev);
       }
      
      make register_netdevice()'s failure clear dev->needs_free_netdev.
      
      Problems described above are only present with register_netdevice() /
      unregister_netdevice(). We have two parallel APIs for registration
      of devices:
       - those called outside rtnl_lock (register_netdev(), and
         unregister_netdev());
       - and those to be used under rtnl_lock - register_netdevice()
         and unregister_netdevice().
      The former is trivial and has no problems. The alternative
      approach to fix the latter would be to also separate the
      freeing functions - i.e. add free_netdevice(). This has been
      implemented (incl. converting all relevant calls in the tree)
      but it feels a little unnecessary to put the burden of choosing
      the right free_netdev{,ice}() call on the programmer when we
      can "just do the right thing" by default.
      ====================
      
      Link: https://lore.kernel.org/r/20210106184007.1821480-1-kuba@kernel.orgSigned-off-by: NJakub Kicinski <kuba@kernel.org>
      c49243e8
    • J
      net: make sure devices go through netdev_wait_all_refs · 766b0515
      Jakub Kicinski 提交于
      If register_netdevice() fails at the very last stage - the
      notifier call - some subsystems may have already seen it and
      grabbed a reference. struct net_device can't be freed right
      away without calling netdev_wait_all_refs().
      
      Now that we have a clean interface in form of dev->needs_free_netdev
      and lenient free_netdev() we can undo what commit 93ee31f1 ("[NET]:
      Fix free_netdev on register_netdev failure.") has done and complete
      the unregistration path by bringing the net_set_todo() call back.
      
      After registration fails user is still expected to explicitly
      free the net_device, so make sure ->needs_free_netdev is cleared,
      otherwise rolling back the registration will cause the old double
      free for callers who release rtnl_lock before the free.
      
      This also solves the problem of priv_destructor not being called
      on notifier error.
      
      net_set_todo() will be moved back into unregister_netdevice_queue()
      in a follow up.
      Reported-by: NHulk Robot <hulkci@huawei.com>
      Reported-by: NYang Yingliang <yangyingliang@huawei.com>
      Signed-off-by: NJakub Kicinski <kuba@kernel.org>
      766b0515
    • J
      net: make free_netdev() more lenient with unregistering devices · c269a24c
      Jakub Kicinski 提交于
      There are two flavors of handling netdev registration:
       - ones called without holding rtnl_lock: register_netdev() and
         unregister_netdev(); and
       - those called with rtnl_lock held: register_netdevice() and
         unregister_netdevice().
      
      While the semantics of the former are pretty clear, the same can't
      be said about the latter. The netdev_todo mechanism is utilized to
      perform some of the device unregistering tasks and it hooks into
      rtnl_unlock() so the locked variants can't actually finish the work.
      In general free_netdev() does not mix well with locked calls. Most
      drivers operating under rtnl_lock set dev->needs_free_netdev to true
      and expect core to make the free_netdev() call some time later.
      
      The part where this becomes most problematic is error paths. There is
      no way to unwind the state cleanly after a call to register_netdevice(),
      since unreg can't be performed fully without dropping locks.
      
      Make free_netdev() more lenient, and defer the freeing if device
      is being unregistered. This allows error paths to simply call
      free_netdev() both after register_netdevice() failed, and after
      a call to unregister_netdevice() but before dropping rtnl_lock.
      
      Simplify the error paths which are currently doing gymnastics
      around free_netdev() handling.
      Signed-off-by: NJakub Kicinski <kuba@kernel.org>
      c269a24c
    • J
      docs: net: explain struct net_device lifetime · 2b446e65
      Jakub Kicinski 提交于
      Explain the two basic flows of struct net_device's operation.
      Signed-off-by: NJakub Kicinski <kuba@kernel.org>
      2b446e65
    • T
      ppp: fix refcount underflow on channel unbridge · c1787ffd
      Tom Parkin 提交于
      When setting up a channel bridge, ppp_bridge_channels sets the
      pch->bridge field before taking the associated reference on the bridge
      file instance.
      
      This opens up a refcount underflow bug if ppp_bridge_channels called
      via. iotcl runs concurrently with ppp_unbridge_channels executing via.
      file release.
      
      The bug is triggered by ppp_bridge_channels taking the error path
      through the 'err_unset' label.  In this scenario, pch->bridge is set,
      but the reference on the bridged channel will not be taken because
      the function errors out.  If ppp_unbridge_channels observes pch->bridge
      before it is unset by the error path, it will erroneously drop the
      reference on the bridged channel and cause a refcount underflow.
      
      To avoid this, ensure that ppp_bridge_channels holds a reference on
      each channel in advance of setting the bridge pointers.
      Signed-off-by: NTom Parkin <tparkin@katalix.com>
      Fixes: 4cf476ce ("ppp: add PPPIOCBRIDGECHAN and PPPIOCUNBRIDGECHAN ioctls")
      Acked-by: NGuillaume Nault <gnault@redhat.com>
      Link: https://lore.kernel.org/r/20210107181315.3128-1-tparkin@katalix.comSigned-off-by: NJakub Kicinski <kuba@kernel.org>
      c1787ffd
    • B
      udp: Prevent reuseport_select_sock from reading uninitialized socks · fd2ddef0
      Baptiste Lepers 提交于
      reuse->socks[] is modified concurrently by reuseport_add_sock. To
      prevent reading values that have not been fully initialized, only read
      the array up until the last known safe index instead of incorrectly
      re-reading the last index of the array.
      
      Fixes: acdcecc6 ("udp: correct reuseport selection with connected sockets")
      Signed-off-by: NBaptiste Lepers <baptiste.lepers@gmail.com>
      Acked-by: NWillem de Bruijn <willemb@google.com>
      Link: https://lore.kernel.org/r/20210107051110.12247-1-baptiste.lepers@gmail.comSigned-off-by: NJakub Kicinski <kuba@kernel.org>
      fd2ddef0
    • D
      net: fix use-after-free when UDP GRO with shared fraglist · 53475c5d
      Dongseok Yi 提交于
      skbs in fraglist could be shared by a BPF filter loaded at TC. If TC
      writes, it will call skb_ensure_writable -> pskb_expand_head to create
      a private linear section for the head_skb. And then call
      skb_clone_fraglist -> skb_get on each skb in the fraglist.
      
      skb_segment_list overwrites part of the skb linear section of each
      fragment itself. Even after skb_clone, the frag_skbs share their
      linear section with their clone in PF_PACKET.
      
      Both sk_receive_queue of PF_PACKET and PF_INET (or PF_INET6) can have
      a link for the same frag_skbs chain. If a new skb (not frags) is
      queued to one of the sk_receive_queue, multiple ptypes can see and
      release this. It causes use-after-free.
      
      [ 4443.426215] ------------[ cut here ]------------
      [ 4443.426222] refcount_t: underflow; use-after-free.
      [ 4443.426291] WARNING: CPU: 7 PID: 28161 at lib/refcount.c:190
      refcount_dec_and_test_checked+0xa4/0xc8
      [ 4443.426726] pstate: 60400005 (nZCv daif +PAN -UAO)
      [ 4443.426732] pc : refcount_dec_and_test_checked+0xa4/0xc8
      [ 4443.426737] lr : refcount_dec_and_test_checked+0xa0/0xc8
      [ 4443.426808] Call trace:
      [ 4443.426813]  refcount_dec_and_test_checked+0xa4/0xc8
      [ 4443.426823]  skb_release_data+0x144/0x264
      [ 4443.426828]  kfree_skb+0x58/0xc4
      [ 4443.426832]  skb_queue_purge+0x64/0x9c
      [ 4443.426844]  packet_set_ring+0x5f0/0x820
      [ 4443.426849]  packet_setsockopt+0x5a4/0xcd0
      [ 4443.426853]  __sys_setsockopt+0x188/0x278
      [ 4443.426858]  __arm64_sys_setsockopt+0x28/0x38
      [ 4443.426869]  el0_svc_common+0xf0/0x1d0
      [ 4443.426873]  el0_svc_handler+0x74/0x98
      [ 4443.426880]  el0_svc+0x8/0xc
      
      Fixes: 3a1296a3 (net: Support GRO/GSO fraglist chaining.)
      Signed-off-by: NDongseok Yi <dseok.yi@samsung.com>
      Acked-by: NWillem de Bruijn <willemb@google.com>
      Acked-by: NDaniel Borkmann <daniel@iogearbox.net>
      Link: https://lore.kernel.org/r/1610072918-174177-1-git-send-email-dseok.yi@samsung.comSigned-off-by: NJakub Kicinski <kuba@kernel.org>
      53475c5d
    • S
      net: ipa: modem: add missing SET_NETDEV_DEV() for proper sysfs links · afba9dc1
      Stephan Gerhold 提交于
      At the moment it is quite hard to identify the network interface
      provided by IPA in userspace components: The network interface is
      created as virtual device, without any link to the IPA device.
      The interface name ("rmnet_ipa%d") is the only indication that the
      network interface belongs to IPA, but this is not very reliable.
      
      Add SET_NETDEV_DEV() to associate the network interface with the
      IPA parent device. This allows userspace services like ModemManager
      to properly identify that this network interface is provided by IPA
      and belongs to the modem.
      
      Cc: Alex Elder <elder@kernel.org>
      Fixes: a646d6ec ("soc: qcom: ipa: modem and microcontroller")
      Signed-off-by: NStephan Gerhold <stephan@gerhold.net>
      Link: https://lore.kernel.org/r/20210106100755.56800-1-stephan@gerhold.netSigned-off-by: NJakub Kicinski <kuba@kernel.org>
      afba9dc1
    • L
      Merge tag 'net-5.11-rc3-2' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net · 6279d812
      Linus Torvalds 提交于
      Pull more networking fixes from Jakub Kicinski:
       "Slightly lighter pull request to get back into the Thursday cadence.
      
        Current release - always broken:
      
         - can: mcp251xfd: fix Tx/Rx ring buffer driver race conditions
      
         - dsa: hellcreek: fix led_classdev build errors
      
        Previous releases - regressions:
      
         - ipv6: fib: flush exceptions when purging route to avoid netdev
           reference leak
      
         - ip_tunnels: fix pmtu check in nopmtudisc mode
      
         - ip: always refragment ip defragmented packets to avoid MTU issues
           when forwarding through tunnels, correct "packet too big" message
           is prohibitively tricky to generate
      
         - s390/qeth: fix locking for discipline setup / removal and during
           recovery to prevent both deadlocks and races
      
         - mlx5: Use port_num 1 instead of 0 when delete a RoCE address
      
        Previous releases - always broken:
      
         - cdc_ncm: correct overhead calculation in delayed_ndp_size to
           prevent out of bound accesses with Huawei 909s-120 LTE module
      
         - fix stmmac dwmac-sun8i suspend/resume:
                 - PHY being left powered off
                 - MAC syscon configuration being reset
                 - reference to the reset controller being improperly dropped
      
         - qrtr: fix null-ptr-deref in qrtr_ns_remove
      
         - can: tcan4x5x: fix bittiming const, use common bittiming from m_can
           driver
      
         - mlx5e: CT: Use per flow counter when CT flow accounting is enabled
      
         - mlx5e: Fix SWP offsets when vlan inserted by driver
      
        Misc:
      
         - bpf: Fix a task_iter bug caused by a bpf -> net merge conflict
           resolution
      
        And the usual many fixes to various error paths"
      
      * tag 'net-5.11-rc3-2' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net: (69 commits)
        net: dsa: lantiq_gswip: Exclude RMII from modes that report 1 GbE
        s390/qeth: fix L2 header access in qeth_l3_osa_features_check()
        s390/qeth: fix locking for discipline setup / removal
        s390/qeth: fix deadlock during recovery
        selftests: fib_nexthops: Fix wrong mausezahn invocation
        nexthop: Bounce NHA_GATEWAY in FDB nexthop groups
        nexthop: Unlink nexthop group entry in error path
        nexthop: Fix off-by-one error in error path
        octeontx2-af: fix memory leak of lmac and lmac->name
        chtls: Fix chtls resources release sequence
        chtls: Added a check to avoid NULL pointer dereference
        chtls: Replace skb_dequeue with skb_peek
        chtls: Avoid unnecessary freeing of oreq pointer
        chtls: Fix panic when route to peer not configured
        chtls: Remove invalid set_tcb call
        chtls: Fix hardware tid leak
        net: ip: always refragment ip defragmented packets
        net: fix pmtu check in nopmtudisc mode
        selftests: netfilter: add selftest for ipip pmtu discovery with enabled connection tracking
        docs: octeontx2: tune rst markup
        ...
      6279d812
    • L
      Merge branch 'linus' of git://git.kernel.org/pub/scm/linux/kernel/git/herbert/crypto-2.6 · ea1c87c1
      Linus Torvalds 提交于
      Pull crypto fixes from Herbert Xu:
       "This fixes a functional bug in arm/chacha-neon as well as a potential
        buffer overflow in ecdh"
      
      * 'linus' of git://git.kernel.org/pub/scm/linux/kernel/git/herbert/crypto-2.6:
        crypto: ecdh - avoid buffer overflow in ecdh_set_secret()
        crypto: arm/chacha-neon - add missing counter increment
      ea1c87c1
    • L
      poll: fix performance regression due to out-of-line __put_user() · ef0ba055
      Linus Torvalds 提交于
      The kernel test robot reported a -5.8% performance regression on the
      "poll2" test of will-it-scale, and bisected it to commit d55564cf
      ("x86: Make __put_user() generate an out-of-line call").
      
      I didn't expect an out-of-line __put_user() to matter, because no normal
      core code should use that non-checking legacy version of user access any
      more.  But I had overlooked the very odd poll() usage, which does a
      __put_user() to update the 'revents' values of the poll array.
      
      Now, Al Viro correctly points out that instead of updating just the
      'revents' field, it would be much simpler to just copy the _whole_
      pollfd entry, and then we could just use "copy_to_user()" on the whole
      array of entries, the same way we use "copy_from_user()" a few lines
      earlier to get the original values.
      
      But that is not what we've traditionally done, and I worry that threaded
      applications might be concurrently modifying the other fields of the
      pollfd array.  So while Al's suggestion is simpler - and perhaps worth
      trying in the future - this instead keeps the "just update revents"
      model.
      
      To fix the performance regression, use the modern "unsafe_put_user()"
      instead of __put_user(), with the proper "user_write_access_begin()"
      guarding in place. This improves code generation enormously.
      
      Link: https://lore.kernel.org/lkml/20210107134723.GA28532@xsang-OptiPlex-9020/Reported-by: Nkernel test robot <oliver.sang@intel.com>
      Tested-by: NOliver Sang <oliver.sang@intel.com>
      Cc: Al Viro <viro@zeniv.linux.org.uk>
      Cc: David Laight <David.Laight@aculab.com>
      Cc: Peter Zijlstra <peterz@infradead.org>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      ef0ba055
    • P
      Revert "init/console: Use ttynull as a fallback when there is no console" · a91bd622
      Petr Mladek 提交于
      This reverts commit 757055ae.
      
      The commit caused that ttynull was used as the default console
      on several systems[1][2][3]. As a result, the console was
      blank even when a better alternative existed.
      
      It happened when there was no console configured
      on the command line and ttynull_init() was the first initcall
      calling register_console().
      
      Or it happened when /dev/ did not exist when console_on_rootfs()
      was called. It was not able to open /dev/console even though
      a console driver was registered. It tried to add ttynull console
      but it obviously did not help. But ttynull became the preferred
      console and was used by /dev/console when it was available later.
      
      The commit tried to fix a historical problem that have been there
      for ages. The primary motivation was the commit 3cffa06a
      ("printk/console: Allow to disable console output by using console=""
       or console=null"). It provided a clean solution for a workaround
       that was widely used and worked only by chance.
      
      This revert causes that the console="" or console=null command line
      options will again work only by chance. These options will cause that
      a particular console will be preferred and the default (tty) ones
      will not get enabled. There will be no console registered at
      all. As a result there won't be stdin, stdout, and stderr for
      the init process. But it worked exactly this way even before.
      
      The proper solution has to fulfill many conditions:
      
        + Register ttynull only when explicitly required or as
          the ultimate fallback.
      
        + ttynull should get associated with /dev/console but it must
          not become preferred console when used as a fallback.
          Especially, it must still be possible to replace it
          by a better console later.
      
      Such a change requires clean up of the register_console() code.
      Otherwise, it would be even harder to follow. Especially, the use
      of has_preferred_console and CON_CONSDEV flag is tricky. The clean
      up is risky. The ordering of consoles is not well defined. And
      any changes tend to break existing user settings.
      
      Do the revert at the least risky solution for now.
      
      [1] https://lore.kernel.org/linux-kselftest/20201221144302.GR4077@smile.fi.intel.com/
      [2] https://lore.kernel.org/lkml/d2a3b3c0-e548-7dd1-730f-59bc5c04e191@synopsys.com/
      [3] https://patchwork.ozlabs.org/project/linux-um/patch/20210105120128.10854-1-thomas@m3y3r.de/Reported-by: NAndy Shevchenko <andriy.shevchenko@linux.intel.com>
      Reported-by: NVineet Gupta <vgupta@synopsys.com>
      Reported-by: NThomas Meyer <thomas@m3y3r.de>
      Signed-off-by: NPetr Mladek <pmladek@suse.com>
      Acked-by: NGreg Kroah-Hartman <gregkh@linuxfoundation.org>
      Acked-by: NSergey Senozhatsky <sergey.senozhatsky@gmail.com>
      Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
      a91bd622
  5. 08 1月, 2021 5 次提交