1. 04 6月, 2021 6 次提交
    • V
      net: pcs: xpcs: convert to phylink_pcs_ops · 11059740
      Vladimir Oltean 提交于
      Since all the remaining members of struct mdio_xpcs_ops have direct
      equivalents in struct phylink_pcs_ops, it is about time we remove it
      altogether.
      
      Since the phylink ops return void, we need to remove the error
      propagation from the various xpcs methods and simply print an error
      message where appropriate.
      
      Since xpcs_get_state_c73() detects link faults and attempts to reset the
      link on its own by calling xpcs_config(), but xpcs_config() now has a
      lot of phylink arguments which are not needed and cannot be simply
      fabricated by anybody else except phylink, the actual implementation has
      been moved into a smaller xpcs_do_config().
      
      The const struct mdio_xpcs_ops *priv->hw->xpcs has been removed, so we
      need to look at the struct mdio_xpcs_args pointer now as an indication
      whether the port has an XPCS or not.
      Signed-off-by: NVladimir Oltean <vladimir.oltean@nxp.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      11059740
    • V
      net: pcs: xpcs: convert to mdio_device · 2cac15da
      Vladimir Oltean 提交于
      Unify the 2 existing PCS drivers (lynx and xpcs) by doing a similar
      thing on probe, which is to have a *_create function that takes a
      struct mdio_device * given by the caller, and builds a private PCS
      structure around that.
      
      This changes stmmac to hold only a pointer to the xpcs, as opposed to
      the full structure. This will be used in the next patch when struct
      mdio_xpcs_ops is removed. Currently a pointer to struct mdio_xpcs_ops
      is used as a shorthand to determine whether the port has an XPCS or not.
      We can do the same now with the mdio_xpcs_args pointer.
      Signed-off-by: NVladimir Oltean <vladimir.oltean@nxp.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      2cac15da
    • V
      net: pcs: xpcs: export xpcs_probe · 8e2bb956
      Vladimir Oltean 提交于
      Similar to the other recently functions, it is not necessary for
      xpcs_probe to be a function pointer, so export it so that it can be
      called directly.
      Signed-off-by: NVladimir Oltean <vladimir.oltean@nxp.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      8e2bb956
    • V
      net: pcs: xpcs: export xpcs_config_eee · 14b517cb
      Vladimir Oltean 提交于
      There is no good reason why we need to go through:
      
      stmmac_xpcs_config_eee
      -> stmmac_do_callback
         -> mdio_xpcs_ops->config_eee
            -> xpcs_config_eee
      
      when we can simply call xpcs_config_eee.
      
      priv->hw->xpcs is of the type "const struct mdio_xpcs_ops *" and is used
      as a placeholder/synonym for priv->plat->mdio_bus_data->has_xpcs. It is
      done that way because the mdio_bus_data pointer might or might not be
      populated in all stmmac instantiations.
      Signed-off-by: NVladimir Oltean <vladimir.oltean@nxp.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      14b517cb
    • V
      net: pcs: xpcs: export xpcs_validate · a1a753ed
      Vladimir Oltean 提交于
      Calling a function pointer with a single implementation through
      struct mdio_xpcs_ops is clunky, and the stmmac_do_callback system forces
      this to return int, even though it always returns zero.
      
      Simply remove the "validate" function pointer from struct mdio_xpcs_ops
      and replace it with an exported xpcs_validate symbol which is called
      directly by stmmac.
      
      priv->hw->xpcs is of the type "const struct mdio_xpcs_ops *" and is used
      as a placeholder/synonym for priv->plat->mdio_bus_data->has_xpcs. It is
      done that way because the mdio_bus_data pointer might or might not be
      populated in all stmmac instantiations.
      Signed-off-by: NVladimir Oltean <vladimir.oltean@nxp.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      a1a753ed
    • V
      net: pcs: xpcs: make the checks related to the PHY interface mode stateless · 9900074e
      Vladimir Oltean 提交于
      The operating mode of the driver is currently to populate its
      struct mdio_xpcs_args::supported and struct mdio_xpcs_args::an_mode
      statically in xpcs_probe(), based on the passed phy_interface_t,
      and work with those.
      
      However this is not the operation that phylink expects from a PCS
      driver, because the port might be attached to an SFP cage that triggers
      changes of the phy_interface_t dynamically as one SFP module is
      unpluggged and another is plugged.
      
      To migrate towards that model, the struct mdio_xpcs_args should not
      cache anything related to the phy_interface_t, but just look up the
      statically defined, const struct xpcs_compat structure corresponding to
      the detected PCS OUI/model number.
      
      So we delete the "supported" and "an_mode" members of struct
      mdio_xpcs_args, and add the "id" structure there (since the ID is not
      expected to change at runtime).
      
      Since xpcs->supported is used deep in the code in _xpcs_config_aneg_c73(),
      we need to modify some function headers to pass the xpcs_compat from all
      callers. In turn, the xpcs_compat is always supplied externally to the
      xpcs module:
      - Most of the time by phylink
      - In xpcs_probe() it is needed because xpcs_soft_reset() writes to
        MDIO_MMD_PCS or to MDIO_MMD_VEND2 depending on whether an_mode is clause
        37 or clause 73. In order to not introduce functional changes related
        to when the soft reset is issued, we continue to require the initial
        phy_interface_t argument to be passed to xpcs_probe() so we can pass
        this on to xpcs_soft_reset().
      - stmmac_open() wants to know whether to call stmmac_init_phy() or not,
        and for that it looks inside xpcs->an_mode, because the clause 73
        (backplane) AN modes supposedly do not have a PHY. Because we moved
        an_mode outside of struct mdio_xpcs_args, this is now no longer
        directly possible, so we introduce a helper function xpcs_get_an_mode()
        which protects the data encapsulation of the xpcs module and requires
        a phy_interface_t to be passed as argument. This function can look up
        the appropriate compat based on the phy_interface_t.
      Signed-off-by: NVladimir Oltean <vladimir.oltean@nxp.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      9900074e
  2. 03 6月, 2021 1 次提交
    • W
      net: stmmac: fix issue where clk is being unprepared twice · ab00f3e0
      Wong Vee Khee 提交于
      In the case of MDIO bus registration failure due to no external PHY
      devices is connected to the MAC, clk_disable_unprepare() is called in
      stmmac_bus_clk_config() and intel_eth_pci_probe() respectively.
      
      The second call in intel_eth_pci_probe() will caused the following:-
      
      [   16.578605] intel-eth-pci 0000:00:1e.5: No PHY found
      [   16.583778] intel-eth-pci 0000:00:1e.5: stmmac_dvr_probe: MDIO bus (id: 2) registration failed
      [   16.680181] ------------[ cut here ]------------
      [   16.684861] stmmac-0000:00:1e.5 already disabled
      [   16.689547] WARNING: CPU: 13 PID: 2053 at drivers/clk/clk.c:952 clk_core_disable+0x96/0x1b0
      [   16.697963] Modules linked in: dwc3 iTCO_wdt mei_hdcp iTCO_vendor_support udc_core x86_pkg_temp_thermal kvm_intel marvell10g kvm sch_fq_codel nfsd irqbypass dwmac_intel(+) stmmac uio ax88179_178a pcs_xpcs phylink uhid spi_pxa2xx_platform usbnet mei_me pcspkr tpm_crb mii i2c_i801 dw_dmac dwc3_pci thermal dw_dmac_core intel_rapl_msr libphy i2c_smbus mei tpm_tis intel_th_gth tpm_tis_core tpm intel_th_acpi intel_pmc_core intel_th i915 fuse configfs snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi snd_hda_codec snd_hda_core snd_pcm snd_timer snd soundcore
      [   16.746785] CPU: 13 PID: 2053 Comm: systemd-udevd Tainted: G     U            5.13.0-rc3-intel-lts #76
      [   16.756134] Hardware name: Intel Corporation Alder Lake Client Platform/AlderLake-S ADP-S DRR4 CRB, BIOS ADLIFSI1.R00.1494.B00.2012031421 12/03/2020
      [   16.769465] RIP: 0010:clk_core_disable+0x96/0x1b0
      [   16.774222] Code: 00 8b 05 45 96 17 01 85 c0 7f 24 48 8b 5b 30 48 85 db 74 a5 8b 43 7c 85 c0 75 93 48 8b 33 48 c7 c7 6e 32 cc b7 e8 b2 5d 52 00 <0f> 0b 5b 5d c3 65 8b 05 76 31 18 49 89 c0 48 0f a3 05 bc 92 1a 01
      [   16.793016] RSP: 0018:ffffa44580523aa0 EFLAGS: 00010086
      [   16.798287] RAX: 0000000000000000 RBX: ffff8d7d0eb70a00 RCX: 0000000000000000
      [   16.805435] RDX: 0000000000000002 RSI: ffffffffb7c62d5f RDI: 00000000ffffffff
      [   16.812610] RBP: 0000000000000287 R08: 0000000000000000 R09: ffffa445805238d0
      [   16.819759] R10: 0000000000000001 R11: 0000000000000001 R12: ffff8d7d0eb70a00
      [   16.826904] R13: ffff8d7d027370c8 R14: 0000000000000006 R15: ffffa44580523ad0
      [   16.834047] FS:  00007f9882fa2600(0000) GS:ffff8d80a0940000(0000) knlGS:0000000000000000
      [   16.842177] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      [   16.847966] CR2: 00007f9882bea3d8 CR3: 000000010b126001 CR4: 0000000000370ee0
      [   16.855144] Call Trace:
      [   16.857614]  clk_core_disable_lock+0x1b/0x30
      [   16.861941]  intel_eth_pci_probe.cold+0x11d/0x136 [dwmac_intel]
      [   16.867913]  pci_device_probe+0xcf/0x150
      [   16.871890]  really_probe+0xf5/0x3e0
      [   16.875526]  driver_probe_device+0x64/0x150
      [   16.879763]  device_driver_attach+0x53/0x60
      [   16.883998]  __driver_attach+0x9f/0x150
      [   16.887883]  ? device_driver_attach+0x60/0x60
      [   16.892288]  ? device_driver_attach+0x60/0x60
      [   16.896698]  bus_for_each_dev+0x77/0xc0
      [   16.900583]  bus_add_driver+0x184/0x1f0
      [   16.904469]  driver_register+0x6c/0xc0
      [   16.908268]  ? 0xffffffffc07ae000
      [   16.911598]  do_one_initcall+0x4a/0x210
      [   16.915489]  ? kmem_cache_alloc_trace+0x305/0x4e0
      [   16.920247]  do_init_module+0x5c/0x230
      [   16.924057]  load_module+0x2894/0x2b70
      [   16.927857]  ? __do_sys_finit_module+0xb5/0x120
      [   16.932441]  __do_sys_finit_module+0xb5/0x120
      [   16.936845]  do_syscall_64+0x42/0x80
      [   16.940476]  entry_SYSCALL_64_after_hwframe+0x44/0xae
      [   16.945586] RIP: 0033:0x7f98830e5ccd
      [   16.949177] Code: 00 c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 93 31 0c 00 f7 d8 64 89 01 48
      [   16.967970] RSP: 002b:00007ffc66b60168 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
      [   16.975583] RAX: ffffffffffffffda RBX: 000055885de35ef0 RCX: 00007f98830e5ccd
      [   16.982725] RDX: 0000000000000000 RSI: 00007f98832541e3 RDI: 0000000000000012
      [   16.989868] RBP: 0000000000020000 R08: 0000000000000000 R09: 0000000000000000
      [   16.997042] R10: 0000000000000012 R11: 0000000000000246 R12: 00007f98832541e3
      [   17.004222] R13: 0000000000000000 R14: 0000000000000000 R15: 00007ffc66b60328
      [   17.011369] ---[ end trace df06a3dab26b988c ]---
      [   17.016062] ------------[ cut here ]------------
      [   17.020701] stmmac-0000:00:1e.5 already unprepared
      
      Removing the stmmac_bus_clks_config() call in stmmac_dvr_probe and let
      dwmac-intel to handle the unprepare and disable of the clk device.
      
      Fixes: 5ec55823 ("net: stmmac: add clocks management for gmac driver")
      Cc: Joakim Zhang <qiangqing.zhang@nxp.com>
      Signed-off-by: NWong Vee Khee <vee.khee.wong@linux.intel.com>
      Reviewed-by: NJoakim Zhang <qiangqing.zhang@nxp.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      ab00f3e0
  3. 02 6月, 2021 1 次提交
  4. 31 5月, 2021 1 次提交
    • S
      net: stmmac: fix kernel panic due to NULL pointer dereference of mdio_bus_data · 593f555f
      Sriranjani P 提交于
      Fixed link does not need mdio bus and in that case mdio_bus_data will
      not be allocated. Before using mdio_bus_data we should check for NULL.
      
      This patch fix the kernel panic due to NULL pointer dereference of
      mdio_bus_data when it is not allocated.
      
      Without this patch we do see following kernel crash caused due to kernel
      NULL pointer dereference.
      
      Call trace:
      stmmac_dvr_probe+0x3c/0x10b0
      dwc_eth_dwmac_probe+0x224/0x378
      platform_probe+0x68/0xe0
      really_probe+0x130/0x3d8
      driver_probe_device+0x68/0xd0
      device_driver_attach+0x74/0x80
      __driver_attach+0x58/0xf8
      bus_for_each_dev+0x7c/0xd8
      driver_attach+0x24/0x30
      bus_add_driver+0x148/0x1f0
      driver_register+0x64/0x120
      __platform_driver_register+0x28/0x38
      dwc_eth_dwmac_driver_init+0x1c/0x28
      do_one_initcall+0x78/0x158
      kernel_init_freeable+0x1f0/0x244
      kernel_init+0x14/0x118
      ret_from_fork+0x10/0x30
      Code: f9002bfb 9113e2d9 910e6273 aa0003f7 (f9405c78)
      ---[ end trace 32d9d41562ddc081 ]---
      
      Fixes: e5e5b771 ("net: stmmac: make in-band AN mode parsing is supported for non-DT")
      Signed-off-by: NSriranjani P <sriranjani.p@samsung.com>
      Signed-off-by: NPankaj Dubey <pankaj.dubey@samsung.com>
      Link: https://lore.kernel.org/r/20210528071056.35252-1-sriranjani.p@samsung.comSigned-off-by: NJakub Kicinski <kuba@kernel.org>
      593f555f
  5. 29 5月, 2021 1 次提交
    • V
      net: stmmac: the XPCS obscures a potential "PHY not found" error · 4751d2aa
      Vladimir Oltean 提交于
      stmmac_mdio_register() has logic to search for PHYs on the MDIO bus and
      assign them IRQ lines, as well as to set priv->plat->phy_addr.
      
      If no PHY is found, the "found" variable remains set to 0 and the
      function errors out.
      
      After the introduction of commit f213bbe8 ("net: stmmac: Integrate
      it with DesignWare XPCS"), the "found" variable was immediately reused
      for searching for a PCS on the same MDIO bus.
      
      This can result in 2 types of potential problems (none of them seems to
      be seen on the only Intel system that sets has_xpcs = true, otherwise it
      would have been reported):
      
      1. If a PCS is found but a PHY is not, then the code happily exits with
         no error. One might say "yes, but this is not possible, because
         of_mdiobus_register will probe a PHY for all MDIO addresses,
         including for the XPCS, so if an XPCS exists, then a PHY certainly
         exists too". Well, that is not true, see intel_mgbe_common_data():
      
      	/* Ensure mdio bus scan skips intel serdes and pcs-xpcs */
      	plat->mdio_bus_data->phy_mask = 1 << INTEL_MGBE_ADHOC_ADDR;
      	plat->mdio_bus_data->phy_mask |= 1 << INTEL_MGBE_XPCS_ADDR;
      
      2. A PHY is found but an MDIO device with the XPCS PHY ID isn't, and in
         that case, the error message will be "No PHY found". Confusing.
      Signed-off-by: NVladimir Oltean <vladimir.oltean@nxp.com>
      Link: https://lore.kernel.org/r/20210527155959.3270478-1-olteanv@gmail.comSigned-off-by: NJakub Kicinski <kuba@kernel.org>
      4751d2aa
  6. 21 5月, 2021 2 次提交
  7. 18 5月, 2021 4 次提交
  8. 15 5月, 2021 2 次提交
  9. 13 5月, 2021 2 次提交
  10. 12 5月, 2021 1 次提交
  11. 11 5月, 2021 1 次提交
  12. 08 5月, 2021 1 次提交
    • Y
      net: stmmac: Do not enable RX FIFO overflow interrupts · 8a7cb245
      Yannick Vignon 提交于
      The RX FIFO overflows when the system is not able to process all received
      packets and they start accumulating (first in the DMA queue in memory,
      then in the FIFO). An interrupt is then raised for each overflowing packet
      and handled in stmmac_interrupt(). This is counter-productive, since it
      brings the system (or more likely, one CPU core) to its knees to process
      the FIFO overflow interrupts.
      
      stmmac_interrupt() handles overflow interrupts by writing the rx tail ptr
      into the corresponding hardware register (according to the MAC spec, this
      has the effect of restarting the MAC DMA). However, without freeing any rx
      descriptors, the DMA stops right away, and another overflow interrupt is
      raised as the FIFO overflows again. Since the DMA is already restarted at
      the end of stmmac_rx_refill() after freeing descriptors, disabling FIFO
      overflow interrupts and the corresponding handling code has no side effect,
      and eliminates the interrupt storm when the RX FIFO overflows.
      Signed-off-by: NYannick Vignon <yannick.vignon@nxp.com>
      Link: https://lore.kernel.org/r/20210506143312.20784-1-yannick.vignon@oss.nxp.comSigned-off-by: NJakub Kicinski <kuba@kernel.org>
      8a7cb245
  13. 05 5月, 2021 1 次提交
  14. 01 5月, 2021 2 次提交
  15. 23 4月, 2021 2 次提交
    • M
      stmmac: intel: Enable HW descriptor prefetch by default · 676b7ec6
      Mohammad Athari Bin Ismail 提交于
      Enable HW descriptor prefetch by default by setting plat->dma_cfg->dche =
      true in intel_mgbe_common_data(). Need to be noted that this capability
      only be supported in DWMAC core version 5.20 onwards. In stmmac, there is
      a checking to check the core version. If the core version is below 5.20,
      this capability wouldn`t be configured.
      
      Below is the iperf result comparison between HW descriptor prefetch
      disabled(DCHE=0b) and enabled(DCHE=1b). Tested on Intel Elkhartlake
      platform with DWMAC Core 5.20. Observed line rate performance
      improvement with HW descriptor prefetch enabled.
      
      DCHE = 0b
      [  5] local 169.254.1.162 port 42123 connected to 169.254.244.142 port 5201
      [ ID] Interval           Transfer     Bitrate         Total Datagrams
      [  5]   0.00-1.00   sec  96.7 MBytes   811 Mbits/sec  70050
      [  5]   1.00-2.00   sec  96.5 MBytes   809 Mbits/sec  69850
      [  5]   2.00-3.00   sec  96.3 MBytes   808 Mbits/sec  69720
      [  5]   3.00-4.00   sec  95.9 MBytes   804 Mbits/sec  69450
      [  5]   4.00-5.00   sec  96.0 MBytes   806 Mbits/sec  69530
      [  5]   5.00-6.00   sec  96.8 MBytes   812 Mbits/sec  70080
      [  5]   6.00-7.00   sec  96.9 MBytes   813 Mbits/sec  70140
      [  5]   7.00-8.00   sec  96.8 MBytes   812 Mbits/sec  70080
      [  5]   8.00-9.00   sec  97.0 MBytes   814 Mbits/sec  70230
      [  5]   9.00-10.00  sec  96.9 MBytes   813 Mbits/sec  70170
      - - - - - - - - - - - - - - - - - - - - - - - - -
      [ ID] Interval           Transfer     Bitrate         Jitter    Lost/Total Datagrams
      [  5]   0.00-10.00  sec   966 MBytes   810 Mbits/sec  0.000 ms  0/699300 (0%)  sender
      [  5]   0.00-10.00  sec   966 MBytes   810 Mbits/sec  0.011 ms  0/699265 (0%)  receiver
      
      DCHE = 1b
      [  5] local 169.254.1.162 port 49740 connected to 169.254.244.142 port 5201
      [ ID] Interval           Transfer     Bitrate         Total Datagrams
      [  5]   0.00-1.00   sec  97.9 MBytes   821 Mbits/sec  70880
      [  5]   1.00-2.00   sec  98.1 MBytes   823 Mbits/sec  71060
      [  5]   2.00-3.00   sec  98.2 MBytes   824 Mbits/sec  71140
      [  5]   3.00-4.00   sec  98.2 MBytes   824 Mbits/sec  71090
      [  5]   4.00-5.00   sec  98.1 MBytes   823 Mbits/sec  71050
      [  5]   5.00-6.00   sec  98.1 MBytes   823 Mbits/sec  71040
      [  5]   6.00-7.00   sec  98.1 MBytes   823 Mbits/sec  71050
      [  5]   7.00-8.00   sec  98.2 MBytes   824 Mbits/sec  71140
      [  5]   8.00-9.00   sec  98.2 MBytes   824 Mbits/sec  71120
      [  5]   9.00-10.00  sec  98.3 MBytes   824 Mbits/sec  71150
      - - - - - - - - - - - - - - - - - - - - - - - - -
      [ ID] Interval           Transfer     Bitrate         Jitter    Lost/Total Datagrams
      [  5]   0.00-10.00  sec   981 MBytes   823 Mbits/sec  0.000 ms  0/710720 (0%)  sender
      [  5]   0.00-10.00  sec   981 MBytes   823 Mbits/sec  0.041 ms  0/710650 (0%) receiver
      Signed-off-by: NMohammad Athari Bin Ismail <mohammad.athari.ismail@intel.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      676b7ec6
    • M
      net: stmmac: Add HW descriptor prefetch setting for DWMAC Core 5.20 onwards · 96874c61
      Mohammad Athari Bin Ismail 提交于
      DWMAC Core 5.20 onwards supports HW descriptor prefetching.
      Additionally, it also depends on platform specific RTL configuration.
      This capability could be enabled by setting DMA_Mode bit-19 (DCHE).
      
      So, to enable this cability, platform must set plat->dma_cfg->dche = true
      and the DWMAC core version must be 5.20 onwards. Else, this capability
      wouldn`t be configured
      Signed-off-by: NMohammad Athari Bin Ismail <mohammad.athari.ismail@intel.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      96874c61
  16. 22 4月, 2021 3 次提交
  17. 20 4月, 2021 1 次提交
    • W
      net: stmmac: fix memory leak during driver probe · d7f576dc
      Wong Vee Khee 提交于
      On driver probe, kmemleak reported the following memory leak which was
      due to allocated bitmap that was not being freed in stmmac_dvr_probe().
      
      unreferenced object 0xffff9276014b13c0 (size 8):
        comm "systemd-udevd", pid 2143, jiffies 4294681112 (age 116.720s)
        hex dump (first 8 bytes):
          00 00 00 00 00 00 00 00                          ........
        backtrace:
          [<00000000c51e34b2>] stmmac_dvr_probe+0x1c0/0x440 [stmmac]
          [<00000000b530eb41>] intel_eth_pci_probe.cold+0x2b/0x14e [dwmac_intel]
          [<00000000b10f8929>] pci_device_probe+0xd2/0x150
          [<00000000fb254c74>] really_probe+0xf8/0x410
          [<0000000034128a59>] driver_probe_device+0x5d/0x150
          [<00000000016104d5>] device_driver_attach+0x53/0x60
          [<00000000cb18cd07>] __driver_attach+0x96/0x140
          [<00000000da9ffd5c>] bus_for_each_dev+0x7a/0xc0
          [<00000000af061a88>] bus_add_driver+0x184/0x1f0
          [<000000008be5c1c5>] driver_register+0x6c/0xc0
          [<0000000052b18a9e>] do_one_initcall+0x4d/0x210
          [<00000000154d4f07>] do_init_module+0x5c/0x230
          [<000000009b648d09>] load_module+0x2a5a/0x2d40
          [<000000000d86b76d>] __do_sys_finit_module+0xb5/0x120
          [<000000002b0cef95>] do_syscall_64+0x33/0x40
          [<0000000067b45bbb>] entry_SYSCALL_64_after_hwframe+0x44/0xa9
      
      Fixes: bba2556e ("net: stmmac: Enable RX via AF_XDP zero-copy")
      Cc: Ong Boon Leong <boon.leong.ong@intel.com>
      Signed-off-by: NWong Vee Khee <vee.khee.wong@linux.intel.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      d7f576dc
  18. 15 4月, 2021 2 次提交
  19. 14 4月, 2021 6 次提交
    • O
      net: stmmac: Add TX via XDP zero-copy socket · 132c32ee
      Ong Boon Leong 提交于
      We add the support of XDP ZC TX submission and cleaning into
      stmmac_tx_clean(). The function is made to clean as many TX complete
      frames as possible, i.e. limit by priv->dma_tx_size instead of NAPI
      budget. For TX ring that is associated with XSK pool, the function
      stmmac_xdp_xmit_zc() is introduced to TX frame buffers from XSK pool by
      using xsk_tx_peek_desc(). To make stmmac_tx_clean() support the cleaning
      of XSK TX frames, STMMAC_TXBUF_T_XSK_TX TX buffer type is introduced.
      
      As stmmac_tx_clean() uses the return value to cue whether NAPI function
      should continue to poll, we augment the caller of stmmac_tx_clean() to
      pass NAPI budget instead of priv->dma_tx_size through 'budget' input and
      made stmmac_tx_clean() to always clean up-to the TX ring size instead.
      This allows us to use the return boolean status of stmmac_xdp_xmit_zc()
      to decide if XSK TX work is done or not: If true, set 'xmits' to return
      'budget - 1' so that NAPI poll may exit. Else, set 'xmits' to return
      'budget' to make NAPI poll continue to poll since XSK TX work is not
      done. Finally, at the end of stmmac_tx_clean(), the function now take
      a maximum value between 'count' and 'xmits' so that status from both
      TX cleaning and XSK TX (only for XDP ZC) is considered.
      
      This patch adds a new NAPI poll called stmmac_napi_poll_rxtx() that is
      meant to be enabled/disabled for RX and TX ring that are bound to XSK
      pool. This NAPI poll function starts with cleaning TX ring, then submits
      XSK TX frames to TX ring before proceed to perform RX operations, i.e.
      , receiving RX frames and replenishing RX ring with RX free buffers
      obtained from XSK pool. Therefore, during XSK RX and TX setup, the driver
      enables stmmac_napi_poll_rxtx() for RX and TX operations, then during
      XSK RX and TX pool tear-down, the driver reenables the exisiting
      independent NAPI poll functions accordingly: stmmac_napi_poll_rx() and
      stmmac_napi_poll_tx().
      Signed-off-by: NOng Boon Leong <boon.leong.ong@intel.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      132c32ee
    • O
      net: stmmac: Enable RX via AF_XDP zero-copy · bba2556e
      Ong Boon Leong 提交于
      This patch adds the support for receiving packet via AF_XDP zero-copy
      mechanism.
      
      XDP ZC uses 1:1 mapping of XDP buffer to receive packet, therefore the
      use of split header is not used currently. The 'xdp_buff' is declared as
      union together with a struct that contains 'page', 'addr' and
      'page_offset' that are associated with primary buffer.
      
      RX buffers are now allocated either via page_pool or xsk pool. For RX
      buffers from xsk_pool they are allocated and deallocated using below
      functions:
      
       * stmmac_alloc_rx_buffers_zc(struct stmmac_priv *priv, u32 queue)
       * dma_free_rx_xskbufs(struct stmmac_priv *priv, u32 queue)
      
      With above functions now available, we then extend the following driver
      functions to support XDP ZC:
       * stmmac_reinit_rx_buffers()
       * __init_dma_rx_desc_rings()
       * init_dma_rx_desc_rings()
       * __free_dma_rx_desc_resources()
      
      Note: stmmac_alloc_rx_buffers_zc() may return -ENOMEM due to RX XDP
      buffer pool is not allocated (e.g. samples/bpf/xdpsock TX-only). But,
      it is still ok to let TX XDP ZC to continue, therefore, the -ENOMEM
      is silently ignored to let the driver succcessfully transition to XDP
      ZC mode for the said RX and TX queue.
      
      As XDP ZC buffer size is different, the DMA buffer size is required
      to be reprogrammed accordingly for RX DMA/Queue that is populated with
      XDP buffer from XSK pool.
      
      Next, to add or remove per-queue XSK pool, stmmac_xdp_setup_pool()
      will call stmmac_xdp_enable_pool() or stmmac_xdp_disable_pool()
      that in-turn coordinates the tearing down and setting up RX ring via
      RX buffers and descriptors removal and reallocation through
      stmmac_disable_rx_queue() and stmmac_enable_rx_queue(). In addition,
      stmmac_xsk_wakeup() is added to initiate XDP RX buffer replenishing
      by signalling user application to add available XDP frames back to
      FILL queue.
      
      For RX processing using XDP zero-copy buffer, stmmac_rx_zc() is
      introduced which is implemented with the assumption that RX split
      header is disabled. For XDP verdict is XDP_PASS, the XDP buffer is
      copied into a sk_buff allocated through stmmac_construct_skb_zc()
      and sent to Linux network GRO inside stmmac_dispatch_skb_zc(). Free RX
      buffers are then replenished using stmmac_rx_refill_zc()
      
      v2: introduce __stmmac_disable_all_queues() to contain the original code
          that does napi_disable() and then make stmmac_setup_tc_block_cb()
          to use it. Move synchronize_rcu() into stmmac_disable_all_queues()
          that eventually calls __stmmac_disable_all_queues(). Then,
          make both stmmac_release() and stmmac_suspend() to use
          stmmac_disable_all_queues(). Thanks David Miller for spotting the
          synchronize_rcu() issue in v1 patch.
      Signed-off-by: NOng Boon Leong <boon.leong.ong@intel.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      bba2556e
    • O
      net: stmmac: Refactor __stmmac_xdp_run_prog for XDP ZC · bba71cac
      Ong Boon Leong 提交于
      Prepare stmmac_xdp_run_prog() for AF_XDP zero-copy support which will be
      added by upcoming patches by splitting out the XDP verdict processing
      into __stmmac_xdp_run_prog() and it callable for XDP ZC path which does
      not need to verify bpf_prog is not NULL.
      
      The stmmac_xdp_run_prog() is used for regular XDP Rx path which requires
      bpf_prog to be verified.
      Signed-off-by: NOng Boon Leong <boon.leong.ong@intel.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      bba71cac
    • O
      net: stmmac: rearrange RX and TX desc init into per-queue basis · de0b90e5
      Ong Boon Leong 提交于
      Below functions are made to be per-queue in preparation of XDP ZC:
      
       __init_dma_rx_desc_rings(struct stmmac_priv *priv, u32 queue, gfp_t flags)
       __init_dma_tx_desc_rings(struct stmmac_priv *priv, u32 queue)
      
      The original functions below are stay maintained for all queue usage:
      
       init_dma_rx_desc_rings(struct net_device *dev, gfp_t flags)
       init_dma_tx_desc_rings(struct net_device *dev)
      Signed-off-by: NOng Boon Leong <boon.leong.ong@intel.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      de0b90e5
    • O
      net: stmmac: refactor stmmac_init_rx_buffers for stmmac_reinit_rx_buffers · da5ec7f2
      Ong Boon Leong 提交于
      The per-queue RX buffer allocation in stmmac_reinit_rx_buffers() can be
      made to use stmmac_alloc_rx_buffers() by merging the page_pool alloc
      checks for "buf->page" and "buf->sec_page" in stmmac_init_rx_buffers().
      
      This is in preparation for XSK pool allocation later.
      Signed-off-by: NOng Boon Leong <boon.leong.ong@intel.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      da5ec7f2
    • O
      net: stmmac: introduce dma_recycle_rx_skbufs for stmmac_reinit_rx_buffers · 80f573c9
      Ong Boon Leong 提交于
      Rearrange RX buffer page_pool recycling logics into dma_recycle_rx_skbufs,
      so that we prepare stmmac_reinit_rx_buffers() for XSK pool expansion.
      Signed-off-by: NOng Boon Leong <boon.leong.ong@intel.com>
      Signed-off-by: NDavid S. Miller <davem@davemloft.net>
      80f573c9