1. 15 2月, 2017 1 次提交
  2. 14 2月, 2017 4 次提交
  3. 13 2月, 2017 23 次提交
  4. 12 2月, 2017 12 次提交
    • H
      i40e: Save more link abilities when using ethtool · b7eaf8f1
      Henry Tieman 提交于
      Ethtool support needs to save more PHY information. The
      added information includes FEC capabilities and 25G link
      types. Without this change it is possible to lose 25G or
      FEC settings by using ethtool.
      
      Change-ID: Ie42255b1e901ffbf9583b8c46466a54894114280
      Signed-off-by: NHenry Tieman <henry.w.tieman@intel.com>
      Tested-by: NAndrew Bowers <andrewx.bowers@intel.com>
      Signed-off-by: NJeff Kirsher <jeffrey.t.kirsher@intel.com>
      b7eaf8f1
    • J
      i40e: avoid race condition when sending filters to firmware for addition · 671889e6
      Jacob Keller 提交于
      Refactor how we add new filters to firmware to avoid a race condition
      that can occur due to removing filters from the hash temporarily.
      
      To understand the race condition, suppose that you have a number of MAC
      filters, but have not yet added any VLANs. Now, add two VLANs in rapid
      succession. A possible resulting flow would look something like the
      following:
      
      (1) lock hash for add VLAN
      (2) add the new MAC/VLAN combos for each current MAC filter
      (3) unlock hash
      (4) lock hash for filter sync
      (5) notice that we have a VLAN, so prepare to update all MAC filters
          with VLAN=-1 to be VLAN=0.
      (6) move NEW and REMOVE filters to temporary list
      (7) unlock hash
      (8) lock hash for add VLAN
      (9) add new MAC/VLAN combos. Notice that no MAC filters are currently in
          the hash list, so we don't add any VLANs <--- BUG!
      (10) unlock hash
      (11) sync the temporary lists to firmware
      (12) lock hash for post-sync
      (13) move the temporary elements back to the main list
      ....
      
      Because we take filters out of the main hash into temporary lists, we
      introduce a narrow window where it is possible that other callers to the
      list will not see some of the filters which were previously added but
      have not yet been finalized. This results in sometimes dropping VLAN
      additions, and could also result in failing to add a MAC address on the
      newly added VLAN.
      
      One obvious way to avoid this race condition would be to lock the entire
      firmware process. Unfortunately this does not work because adminq
      firmware commands take a mutex which results in a sleep while atomic
      BUG(). So, we can't use the simplest approach.
      
      An alternative approach is to simply not remove the filters from the
      hash list while adding. Instead, add an i40e_new_mac_filter structure
      which we will use to track added filters. This avoids the need to remove
      the filter from the hash list. We'll store a pointer to the original
      i40e_mac_filter, along with our own copy of the state.
      
      We won't update the state directly, so as to avoid race with other code
      that may modify the state while under the lock. We are safe to read
      f->macaddr and f->vlan since these only change in two locations. The
      first is on filter creation, which must have already occurred. The
      second is inside i40e_correct_vlan_filters which was previously run
      after creation of this object and can't be run again until after. Thus,
      we should be safe to read the MAC address and VLAN while outside the
      lock.
      
      We also aren't going to run into a use-after-free issue because the only
      place where we free filters is when they are marked FAILED or when we
      remove them inside the sync subtask. Since the subtask has its own
      critical flag to prevent duplicate runs, we know this won't happen. We
      also know that the only location to transition a filter from NEW to
      FAILED is inside the subtask also, so we aren't worried about that
      either.
      
      Use the wrapper i40e_new_mac_filter for additions, and once we've
      finalized the addition to firmware, we will update the filter state
      inside a lock, and then free the wrapper structure.
      
      In order to avoid a possible race condition with filter deletion, we
      won't update the original filter state unless it is still
      I40E_FILTER_NEW when we finish the firmware sync.
      
      This approach is more complex, but avoids race conditions related to
      filters being temporarily removed from the list. We do not need the same
      behavior for deletion because we always unconditionally removed the
      filters from the list regardless of the firmware status.
      
      Change-Id: I14b74bc2301f8e69433fbe77ebca532db20c5317
      Signed-off-by: NJacob Keller <jacob.e.keller@intel.com>
      Tested-by: NAndrew Bowers <andrewx.bowers@intel.com>
      Signed-off-by: NJeff Kirsher <jeffrey.t.kirsher@intel.com>
      671889e6
    • J
      i40e: allow i40e_update_filter_state to skip broadcast filters · d88d40b0
      Jacob Keller 提交于
      Fix a bug where we modified the mac_filter_hash while outside a lock,
      when handling addition of broadcast filters.
      
      Normally, we add filters to firmware by batching the additions into
      lists and issuing 1 update for every few filters. Broadcast filters are
      handled differently, by instead setting the broadcast promiscuous mode
      flags. In order to make sure the 1<->1 mapping of filters in our
      addition array lined up with filters in the hlist tmp_add_list, we had
      to remove the filter and move it back to the main hash. However, we
      didn't do this under lock, which could cause consistency problems for
      the list.
      
      Fix this by updating i40e_update_filter_state logic so that it knows to
      avoid broadcast filters. This ensures that we don't have to remove the
      filter separately, and can put it back using the normal flow.
      
      Change-ID: Id288fade80b3e3a9a54b68cc249188cb95147518
      Signed-off-by: NJacob Keller <jacob.e.keller@intel.com>
      Tested-by: NAndrew Bowers <andrewx.bowers@intel.com>
      Signed-off-by: NJeff Kirsher <jeffrey.t.kirsher@intel.com>
      d88d40b0
    • J
      i40e: don't warn every time we clear an Rx timestamp register · e6e3fc2b
      Jacob Keller 提交于
      The intent of this message was to indicate to a user that we might have
      missed a timestamp event for a valid packet. The original method of
      detecting the missed events relied on waiting until all 4 registers were
      filled.
      
      A recent commit d55458c0cd7a5 ("i40e: replace PTP Rx timestamp hang
      logic") replaced this logic with much better detection
      scheme that could detect a stalled Rx timestamp register even when other
      registers were still functional.
      
      The new logic means that a message will be displayed almost as soon as
      a timestamp for a dropped frame occurs. This new logic highlights that
      the hardware will attempt timestamp for frames which it later decides to
      drop. The most prominent example is when a multicast PTP frame is
      received on a multicast address that we are not subscribed to.
      
      Because the hardware initiates the Rx timestamp as soon as possible, it
      will latch an RXTIME register, but then drop the packet.
      
      This results in users being confused by the message as they are not
      expecting to see dropped timestamp messages unless their application
      also indicates that timestamps were missing.
      
      Resolve this by reducing the severity and frequency of the displayed
      message. We now only print the message if 3 or 4 of the RXTIME registers
      are stalled and get cleared within the same watchdog event. This ensures
      that the common case does not constantly display the message.
      Additionally, since the message is likely not as meaningful to most
      users, reduce the message to a dev_dbg instead of a dev_warn.
      
      Users can still get a count of the number of timestamps dropped by
      reading the ethtool statistics value, if necessary.
      
      Change-ID: I35494442226a444c418dfb4f91a3070d06c8435c
      Signed-off-by: NJacob Keller <jacob.e.keller@intel.com>
      Tested-by: NAndrew Bowers <andrewx.bowers@intel.com>
      Signed-off-by: NJeff Kirsher <jeffrey.t.kirsher@intel.com>
      e6e3fc2b
    • H
      i40e: Save link FEC info from link up event · 3e03d7cc
      Henry Tieman 提交于
      Store the FEC status bits from the link up event into the
      hw_link_info structure.
      
      Change-ID: I9a7b256f6dfb0dce89c2f503075d0d383526832e
      Signed-off-by: NHenry Tieman <henry.w.tieman@intel.com>
      Tested-by: NAndrew Bowers <andrewx.bowers@intel.com>
      Signed-off-by: NJeff Kirsher <jeffrey.t.kirsher@intel.com>
      3e03d7cc
    • S
      i40e: Add bus number info to i40e_bus_info struct · b3f028fc
      Sudheer Mogilappagari 提交于
      Currently i40e_bus_info has PCI device and function info only and log
      messages print device number as bus number. Added field to provide bus
      number info and modified log statements to print bus, device and
      function information.
      
      Change-ID: I811617cee2714cc0d6bade8d369f57040990756f
      Signed-off-by: NSudheer Mogilappagari <sudheer.mogilappagari@intel.com>
      Tested-by: NAndrew Bowers <andrewx.bowers@intel.com>
      Signed-off-by: NJeff Kirsher <jeffrey.t.kirsher@intel.com>
      b3f028fc
    • M
      i40e: Clean up dead code · 3bb83baf
      Mitch Williams 提交于
      The function i40e_client_prepare() can never return an error. So make it
      void and quit checking its return value.
      
      Change-ID: I9ff311e2324dde329eb68648efb2c94aaff856db
      Signed-off-by: NMitch Williams <mitch.a.williams@intel.com>
      Tested-by: NAndrew Bowers <andrewx.bowers@intel.com>
      Signed-off-by: NJeff Kirsher <jeffrey.t.kirsher@intel.com>
      3bb83baf
    • B
    • J
      i40e: update comment explaining where FDIR buffers are freed · a158aeaf
      Jacob Keller 提交于
      The original comment implies that the only location where the raw_packet
      buffer will be freed is in i40e_clean_tx_ring() which is incorrect. In
      fact this isn't even the normal case. Update the comment explaining
      where the memory is freed.
      
      Change-ID: Ie0defc35ed1c3af183f81fdc60b6d783707a5595
      Signed-off-by: NJacob Keller <jacob.e.keller@intel.com>
      Tested-by: NAndrew Bowers <andrewx.bowers@intel.com>
      Signed-off-by: NJeff Kirsher <jeffrey.t.kirsher@intel.com>
      a158aeaf
    • S
      i40e/i40evf: eliminate i40e_pull_tail() · 9b37c937
      Scott Peterson 提交于
      Reorganize the i40e_pull_tail() logic, doing it in i40e_add_rx_frag()
      where it's cheaper.  The igb driver does this the same way.
      
      Also renames i40e_page_is_reserved() to reflect what it actually
      tests.
      
      Change-ID: Icd9cc507aae1fcdc02308b3a09034111b4c24071
      Signed-off-by: NScott Peterson <scott.d.peterson@intel.com>
      Tested-by: NAndrew Bowers <andrewx.bowers@intel.com>
      Signed-off-by: NJeff Kirsher <jeffrey.t.kirsher@intel.com>
      9b37c937
    • S
      i40e/i40evf: Moves skb from i40e_rx_buffer to i40e_ring · e72e5659
      Scott Peterson 提交于
      This patch reduces the size of struct i40e_rx_buffer by one pointer,
      and makes the i40e driver a little more consistent with the igb driver
      in terms of packets that span buffers.
      
      We do this by moving the skb field from struct i40e_rx_buffer to
      struct i40e_ring. We pass the skb we already have (or NULL if we
      don't) to i40e_fetch_rx_buffer(), which skips the skb allocation if we
      already have one for this packet.
      
      Change-ID: I4ad48a531844494ba0c5d8e1a62209a057f661b0
      Signed-off-by: NScott Peterson <scott.d.peterson@intel.com>
      Tested-by: NAndrew Bowers <andrewx.bowers@intel.com>
      Signed-off-by: NJeff Kirsher <jeffrey.t.kirsher@intel.com>
      e72e5659
    • S
      i40e/i40evf: Limit DMA sync of RX buffers to actual packet size · 7987dcd7
      Scott Peterson 提交于
      On packet RX, we perform a DMA sync for CPU before passing the
      packet up.  Here we limit that sync to the actual length of the
      incoming packet, rather than always syncing the entire buffer.
      
      Change-ID: I626aaf6c37275a8ce9e81efcaa773f327b331487
      Signed-off-by: NScott Peterson <scott.d.peterson@intel.com>
      Tested-by: NAndrew Bowers <andrewx.bowers@intel.com>
      Signed-off-by: NJeff Kirsher <jeffrey.t.kirsher@intel.com>
      7987dcd7