1. 30 10月, 2021 1 次提交
  2. 28 9月, 2021 1 次提交
  3. 25 6月, 2021 1 次提交
    • T
      intel: Remove rcu_read_lock() around XDP program invocation · 49589b23
      Toke Høiland-Jørgensen 提交于
      The Intel drivers all have rcu_read_lock()/rcu_read_unlock() pairs around
      XDP program invocations. However, the actual lifetime of the objects
      referred by the XDP program invocation is longer, all the way through to
      the call to xdp_do_flush(), making the scope of the rcu_read_lock() too
      small. This turns out to be harmless because it all happens in a single
      NAPI poll cycle (and thus under local_bh_disable()), but it makes the
      rcu_read_lock() misleading.
      
      Rather than extend the scope of the rcu_read_lock(), just get rid of it
      entirely. With the addition of RCU annotations to the XDP_REDIRECT map
      types that take bh execution into account, lockdep even understands this to
      be safe, so there's really no reason to keep it around.
      Signed-off-by: NToke Høiland-Jørgensen <toke@redhat.com>
      Signed-off-by: NDaniel Borkmann <daniel@iogearbox.net>
      Tested-by: Jesper Dangaard Brouer <brouer@redhat.com> # i40e
      Cc: Jesse Brandeburg <jesse.brandeburg@intel.com>
      Cc: Tony Nguyen <anthony.l.nguyen@intel.com>
      Cc: intel-wired-lan@lists.osuosl.org
      Link: https://lore.kernel.org/bpf/20210624160609.292325-12-toke@redhat.com
      49589b23
  4. 03 6月, 2021 1 次提交
  5. 02 4月, 2021 1 次提交
    • M
      i40e: fix receiving of single packets in xsk zero-copy mode · 528060ef
      Magnus Karlsson 提交于
      Fix so that single packets are received immediately instead of in
      batches of 8. If you sent 1 pps to a system, you received 8 packets
      every 8 seconds instead of 1 packet every second. The problem behind
      this was that the work_done reporting from the Tx part of the driver
      was broken. The work_done reporting in i40e controls not only the
      reporting back to the napi logic but also the setting of the interrupt
      throttling logic. When Tx or Rx reports that it has more to do,
      interrupts are throttled or coalesced and when they both report that
      they are done, interrupts are armed right away. If the wrong work_done
      value is returned, the logic will start to throttle interrupts in a
      situation where it should have just enabled them. This leads to the
      undesired batching behavior seen in user-space.
      
      Fix this by returning the correct boolean value from the Tx xsk
      zero-copy path. Return true if there is nothing to do or if we got
      fewer packets to process than we asked for. Return false if we got as
      many packets as the budget since there might be more packets we can
      process.
      
      Fixes: 3106c580 ("i40e: Use batched xsk Tx interfaces to increase performance")
      Reported-by: NSreedevi Joshi <sreedevi.joshi@intel.com>
      Signed-off-by: NMagnus Karlsson <magnus.karlsson@intel.com>
      Acked-by: NMaciej Fijalkowski <maciej.fijalkowski@intel.com>
      Tested-by: NKiran Bhandare <kiranx.bhandare@intel.com>
      Signed-off-by: NTony Nguyen <anthony.l.nguyen@intel.com>
      528060ef
  6. 24 3月, 2021 1 次提交
  7. 16 3月, 2021 1 次提交
  8. 20 2月, 2021 1 次提交
    • N
      i40e: Fix endianness conversions · b32cddd2
      Norbert Ciosek 提交于
      Fixes the following sparse warnings:
      i40e_main.c:5953:32: warning: cast from restricted __le16
      i40e_main.c:8008:29: warning: incorrect type in assignment (different base types)
      i40e_main.c:8008:29:    expected unsigned int [assigned] [usertype] ipa
      i40e_main.c:8008:29:    got restricted __le32 [usertype]
      i40e_main.c:8008:29: warning: incorrect type in assignment (different base types)
      i40e_main.c:8008:29:    expected unsigned int [assigned] [usertype] ipa
      i40e_main.c:8008:29:    got restricted __le32 [usertype]
      i40e_txrx.c:1950:59: warning: incorrect type in initializer (different base types)
      i40e_txrx.c:1950:59:    expected unsigned short [usertype] vlan_tag
      i40e_txrx.c:1950:59:    got restricted __le16 [usertype] l2tag1
      i40e_txrx.c:1953:40: warning: cast to restricted __le16
      i40e_xsk.c:448:38: warning: invalid assignment: |=
      i40e_xsk.c:448:38:    left side has type restricted __le64
      i40e_xsk.c:448:38:    right side has type int
      
      Fixes: 2f4b411a ("i40e: Enable cloud filters via tc-flower")
      Fixes: 2a508c64 ("i40e: fix VLAN.TCI == 0 RX HW offload")
      Fixes: 3106c580 ("i40e: Use batched xsk Tx interfaces to increase performance")
      Fixes: 8f88b303 ("i40e: Add infrastructure for queue channel support")
      Signed-off-by: NNorbert Ciosek <norbertx.ciosek@intel.com>
      Tested-by: NTony Brelinski <tonyx.brelinski@intel.com>
      Signed-off-by: NTony Nguyen <anthony.l.nguyen@intel.com>
      b32cddd2
  9. 13 2月, 2021 1 次提交
  10. 09 2月, 2021 4 次提交
  11. 14 1月, 2021 1 次提交
  12. 17 12月, 2020 1 次提交
    • B
      i40e, xsk: clear the status bits for the next_to_use descriptor · 64050b5b
      Björn Töpel 提交于
      On the Rx side, the next_to_use index points to the next item in the
      HW ring to be refilled/allocated, and next_to_clean points to the next
      item to potentially be processed.
      
      When the HW Rx ring is fully refilled, i.e. no packets has been
      processed, the next_to_use will be next_to_clean - 1. When the ring is
      fully processed next_to_clean will be equal to next_to_use. The latter
      case is where a bug is triggered.
      
      If the next_to_use bits are not cleared, and the "fully processed"
      state is entered, a stale descriptor can be processed.
      
      The skb-path correctly clear the status bit for the next_to_use
      descriptor, but the AF_XDP zero-copy path did not do that.
      
      This change adds the status bits clearing of the next_to_use
      descriptor.
      
      Fixes: 3b4f0b66 ("i40e, xsk: Migrate to new MEM_TYPE_XSK_BUFF_POOL")
      Signed-off-by: NBjörn Töpel <bjorn.topel@intel.com>
      Signed-off-by: NJakub Kicinski <kuba@kernel.org>
      64050b5b
  13. 25 11月, 2020 1 次提交
  14. 18 11月, 2020 2 次提交
  15. 11 11月, 2020 1 次提交
  16. 15 9月, 2020 2 次提交
  17. 01 9月, 2020 3 次提交
  18. 02 7月, 2020 3 次提交
  19. 29 5月, 2020 1 次提交
  20. 22 5月, 2020 4 次提交
  21. 15 5月, 2020 1 次提交
  22. 06 2月, 2020 1 次提交
  23. 21 12月, 2019 1 次提交
  24. 19 12月, 2019 1 次提交
  25. 09 11月, 2019 1 次提交
    • M
      i40e: need_wakeup flag might not be set for Tx · 70563957
      Magnus Karlsson 提交于
      The need_wakeup flag for Tx might not be set for AF_XDP sockets that
      are only used to send packets. This happens if there is at least one
      outstanding packet that has not been completed by the hardware and we
      get that corresponding completion (which will not generate an
      interrupt since interrupts are disabled in the napi poll loop) between
      the time we stopped processing the Tx completions and interrupts are
      enabled again. In this case, the need_wakeup flag will have been
      cleared at the end of the Tx completion processing as we believe we
      will get an interrupt from the outstanding completion at a later point
      in time. But if this completion interrupt occurs before interrupts
      are enable, we lose it and should at that point really have set the
      need_wakeup flag since there are no more outstanding completions that
      can generate an interrupt to continue the processing. When this
      happens, user space will see a Tx queue need_wakeup of 0 and skip
      issuing a syscall, which means will never get into the Tx processing
      again and we have a deadlock.
      
      This patch introduces a quick fix for this issue by just setting the
      need_wakeup flag for Tx to 1 all the time. I am working on a proper
      fix for this that will toggle the flag appropriately, but it is more
      challenging than I anticipated and I am afraid that this patch will
      not be completed before the merge window closes, therefore this easier
      fix for now. This fix has a negative performance impact in the range
      of 0% to 4%. Towards the higher end of the scale if you have driver
      and application on the same core and issue a lot of packets, and
      towards no negative impact if you use two cores, lower transmission
      speeds and/or a workload that also receives packets.
      Signed-off-by: NMagnus Karlsson <magnus.karlsson@intel.com>
      Tested-by: NAndrew Bowers <andrewx.bowers@intel.com>
      Signed-off-by: NJeff Kirsher <jeffrey.t.kirsher@intel.com>
      70563957
  26. 02 11月, 2019 1 次提交
  27. 16 9月, 2019 1 次提交
  28. 12 9月, 2019 1 次提交
    • M
      i40e: fix potential RX buffer starvation for AF_XDP · 1f459bdc
      Magnus Karlsson 提交于
      When the RX rings are created they are also populated with buffers
      so that packets can be received. Usually these are kernel buffers,
      but for AF_XDP in zero-copy mode, these are user-space buffers and
      in this case the application might not have sent down any buffers
      to the driver at this point. And if no buffers are allocated at ring
      creation time, no packets can be received and no interrupts will be
      generated so the NAPI poll function that allocates buffers to the
      rings will never get executed.
      
      To rectify this, we kick the NAPI context of any queue with an
      attached AF_XDP zero-copy socket in two places in the code. Once
      after an XDP program has loaded and once after the umem is registered.
      This take care of both cases: XDP program gets loaded first then AF_XDP
      socket is created, and the reverse, AF_XDP socket is created first,
      then XDP program is loaded.
      
      Fixes: 0a714186 ("i40e: add AF_XDP zero-copy Rx support")
      Signed-off-by: NMagnus Karlsson <magnus.karlsson@intel.com>
      Tested-by: NAndrew Bowers <andrewx.bowers@intel.com>
      Signed-off-by: NJeff Kirsher <jeffrey.t.kirsher@intel.com>
      1f459bdc