提交 · bd9a265db26cdbfe74a303111381d90e66f56877 · openeuler / raspberrypi-kernel

13 12月, 2013 3 次提交

sfc: Add RX packet timestamping for EF10 · bd9a265d

由 Jon Cooper 提交于 11月 18, 2013

The EF10 firmware can optionally insert RX timestamps in the packet
prefix.  These only include the clock minor value.  We must also
enable periodic time sync events on each event queue which provide
the high bits of the clock value.

[bwh: Combined and rebased several changes.
 Added the above description and some sanity checks for inline vs
 separate timestamps.
 Changed efx_rx_skb_attach_timestamp() to read the packet prefix
 from the skb head area.]
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

bd9a265d

sfc: Copy RX prefix into skb head area in efx_rx_mk_skb() · 2ccd0b19

由 Ben Hutchings 提交于 11月 28, 2013

We can potentially pull the entire packet contents into the head area
and then free the page it was in.  In order to read an inline
timestamp safely, we need to copy the prefix into the head area as
well.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

2ccd0b19

J
sfc: Make initial fill of RX descriptors synchronous · cce28794
由 Jon Cooper 提交于 10月 02, 2013
```
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>
```
cce28794

07 12月, 2013 1 次提交

sfc: RX buffer allocation takes prefix size into account in IP header alignment · 2ec03014

由 Andrew Rybchenko 提交于 11月 16, 2013

rx_prefix_size is 4-bytes aligned on Falcon/Siena (16 bytes), but it is equal
to 14 on EF10. So, it should be taken into account if arch requires IP header
to be 4-bytes aligned (via NET_IP_ALIGN).

Fixes: 8127d661 ('sfc: Add support for Solarflare SFC9100 family')
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

2ec03014

21 9月, 2013 1 次提交

sfc: Support ARFS for IPv6 flows · c47b2d9d

由 Ben Hutchings 提交于 9月 03, 2013

Extend efx_filter_rfs() to map TCP/IPv6 and UDP/IPv6 flows into
efx_filter_spec. These are only supported on EF10; on Falcon and
Siena they will be rejected by efx_farch_filter_from_gen_spec().
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

c47b2d9d

30 8月, 2013 3 次提交

sfc: Update copyright banners · f7a6d2c4

由 Ben Hutchings 提交于 8月 29, 2013

Update the dates for files that have been added to in 2012-2013.
Drop the 'Solarstorm' brand name that's still lingering here.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

f7a6d2c4

sfc: Prepare for RX scatter on EF10 · e8c68c0a

由 Jon Cooper 提交于 3月 08, 2013

RX DMA scatter is always enabled on EF10.  Adjust the common RX
completion handling to allow for this.

RX completion events on EF10 include the length used from a single
descriptor, not the cumulative length used.  Add a field to struct
efx_rx_queue to hold the cumulative length.

[bwh: Also fix a related comment]
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

e8c68c0a

sfc: Document conditions for multicast replication vs filter replacement · b883d0bd

由 Ben Hutchings 提交于 1月 15, 2013

Add the efx_filter_is_mc_recip() function to decide whether a filter
is for a multicast recipient and can coexist with other filters with
the same match values.  Update efx_filter_insert_filter() kernel-doc
to explain the conditions for this.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

b883d0bd

28 8月, 2013 2 次提交

sfc: Add support for reading packet length from prefix · 3dced740

由 Ben Hutchings 提交于 4月 27, 2013

Define a flag for struct efx_rx_buffer and efx_rx_packet() that
indicates packet length must be read from the prefix.  If this
is set, read the length in __efx_rx_packet() (when the prefix
should have arrived in cache).
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

3dced740

sfc: Generalise packet hash lookup to support EF10 RX prefix · 43a3739d

由 Jon Cooper 提交于 10月 18, 2012

EF10 uses an entirely different RX prefix format from Falcon-arch.
Extend struct efx_nic_type to describe this.

[bwh: Also replace the magic numbers used for the Falcon-arch RX prefix]
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

43a3739d

23 8月, 2013 1 次提交

sfc: Make most filter operations NIC-type-specific · add72477

由 Ben Hutchings 提交于 11月 08, 2012

Aside from accelerated RFS, there is almost nothing that can be shared
between the filter table implementations for the Falcon architecture
and EF10.

Move the few shared functions into efx.c and rx.c and the rest into
farch.c.  Introduce efx_nic_type operations for the implementation and
inline wrapper functions that call these.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

add72477

22 8月, 2013 2 次提交

sfc: Refactor queue teardown sequence to allow for EF10 flush behaviour · e42c3d85

由 Ben Hutchings 提交于 5月 27, 2013

Currently efx_stop_datapath() will try to flush our DMA queues (if DMA
is enabled), then finalise software and hardware state for each queue.
However, for EF10 we must ask the MC to finalise each queue, which
implicitly starts flushing it, and then wait for the flush events.
We therefore need to delegate more of this to the NIC type.

Combine all the hardware operations into a new NIC-type operation
efx_nic_type::fini_dmaq, and call this before tearing down the
software state and buffers for all the DMA queues.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

e42c3d85

sfc: Stop RX refill before flushing RX queues · d8aec745

由 Ben Hutchings 提交于 5月 27, 2013

rx_queue::enabled guards refill, so rename it to reflect that.  Clear
it at the start of the queue teardown process rather than waiting for
the RX queue to be flushed.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

d8aec745

05 7月, 2013 1 次提交

sfc: Fix memory leak when discarding scattered packets · 734d4e15

由 Ben Hutchings 提交于 7月 04, 2013

Commit 2768935a ('sfc: reuse pages to avoid DMA mapping/unmapping
costs') did not fully take account of DMA scattering which was
introduced immediately before.  If a received packet is invalid and
must be discarded, we only drop a reference to the first buffer's
page, but we need to drop a reference for each buffer the packet
used.

I think this bug was missed partly because efx_recycle_rx_buffers()
was not renamed and so no longer does what its name says.  It does not
change the state of buffers, but only prepares the underlying pages
for recycling.  Rename it accordingly.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

734d4e15

25 6月, 2013 4 次提交

sfc: Improve test for IOMMU in use · 636d73da

由 Ben Hutchings 提交于 6月 12, 2013

The device::iommu_group field may be set even if no IOMMU is in use.
iommu_present() is still a better indicator, although it doesn't tell
us whether *our* device is affected.
Reported-by: NAlex Williamson <alex.williamson@redhat.com>
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

636d73da

sfc: Do not pass non-TCP packets into GRO code · e79255de

由 Ben Hutchings 提交于 5月 16, 2013

GRO can handle non-TCP packets and pass them up without coalescing,
but it has to do some extra work to parse the packet which we can
bypass using the hardware parse result.  (This condition yields a
false negative for TCP/IPv6 packets received by Falcon, but its
performance is already poor in that case due to lack of checksum
offload.)
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

e79255de

sfc: Increase size of RX SKB header area · d4ef5b6f

由 Jon Cooper 提交于 4月 08, 2013

This allows the SKB to hold the headers without reallocation more often.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

d4ef5b6f

J
sfc: Enable RX checksum offload for packets not handled by GRO · c99dffc4
由 Jon Cooper 提交于 4月 08, 2013
```
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>
```
c99dffc4

15 5月, 2013 2 次提交

sfc: Reduce RX scatter buffer size, and reduce alignment if appropriate · 950c54df

由 Ben Hutchings 提交于 5月 13, 2013

efx_start_datapath() asserts that we can fit 2 RX scatter buffers plus
a software structure, each appropriately aligned, into a single page.
Where L1_CACHE_BYTES == 256 and PAGE_SIZE == 4096, which is the case
on s390, this assertion fails.

The current scatter buffer size is also not a multiple of 64 or 128,
which are more common cache line sizes.  If we can make both the start
and end of a scatter buffer cache-aligned, this will reduce the need
for read-modify-write operations on inter- processor links.

Fix the alignment by reducing EFX_RX_USR_BUF_SIZE to 2048 - 256 ==
1792.  (We could use 2048 - L1_CACHE_BYTES, but EFX_RX_USR_BUF_SIZE
also affects user-level networking where a larger amount of
housekeeping data may be needed.  Although this version of the driver
does not support user-level networking, I prefer to keep scattering
behaviour consistent with the out-of-tree version.)

This still doesn't fix the s390 build because like most architectures
it has NET_IP_ALIGN == 2.  When NET_IP_ALIGN != 0 we cannot achieve
cache line alignment at either the start or end of a scatter buffer,
so there is actually no point in padding the buffers to a multiple of
the cache line size.  All we need is 4-byte alignment of the network
header, so do that.

Adjust the assertions accordingly.
Reported-by: NGeert Uytterhoeven <geert@linux-m68k.org>
Reported-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>
Acked-by: NGeert Uytterhoeven <geert@linux-m68k.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

950c54df

sfc: Delete EFX_PAGE_IP_ALIGN, equivalent to NET_IP_ALIGN · c14ff2ea

由 Ben Hutchings 提交于 5月 13, 2013

The two architectures that define CONFIG_HAVE_EFFICIENT_UNALIGNED_ACCESS
(powerpc and x86) now both define NET_IP_ALIGN as 0, so there is no
need for this optimisation any more.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

c14ff2ea

18 3月, 2013 1 次提交

sfc: make local functions static · debd0034

由 stephen hemminger 提交于 3月 16, 2013

Trivial sparse detected functions that should be static.
Signed-off-by: NStephen Hemminger <stephen@networkplumber.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

debd0034

08 3月, 2013 11 次提交

sfc: allocate more RX buffers per page · 1648a23f

由 Daniel Pieczko 提交于 2月 13, 2013

Allocating 2 buffers per page is insanely inefficient when MTU is 1500
and PAGE_SIZE is 64K (as it usually is on POWER).  Allocate as many as
we can fit, and choose the refill batch size at run-time so that we
still always use a whole page at once.

[bwh: Fix loop condition to allow for compound pages; rebase]
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

1648a23f

sfc: Replace efx_rx_is_last_buffer() with a flag · 179ea7f0

由 Ben Hutchings 提交于 3月 07, 2013

This condition is brittle and we have lots of flags to spare.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

179ea7f0

sfc: reuse pages to avoid DMA mapping/unmapping costs · 2768935a

由 Daniel Pieczko 提交于 2月 13, 2013

On POWER systems, DMA mapping/unmapping operations are very expensive.
These changes reduce these costs by trying to reuse DMA mapped pages.

After all the buffers associated with a page have been processed and
passed up, the page is placed into a ring (if there is room).  For
each page that is required for a refill operation, a page in the ring
is examined to determine if its page count has fallen to 1, ie. the
kernel has released its reference to these packets.  If this is the
case, the page can be immediately added back into the RX descriptor
ring, without having to re-map it for DMA.

If the kernel is still holding a reference to this page, it is removed
from the ring and unmapped for DMA.  Then a new page, which can
immediately be used by RX buffers in the descriptor ring, is allocated
and DMA mapped.

The time a page needs to spend in the recycle ring before the kernel
has released its page references is based on the number of buffers
that use this page.  As large pages can hold more RX buffers, the RX
recycle ring can be shorter.  This reduces memory usage on POWER
systems, while maintaining the performance gain achieved by recycling
pages, following the driver change to pack more than two RX buffers
into large pages.

When an IOMMU is not present, the recycle ring can be small to reduce
memory usage, since DMA mapping operations are inexpensive.

With a small recycle ring, attempting to refill the descriptor queue
with more buffers than the equivalent size of the recycle ring could
ultimately lead to memory leaks if page entries in the recycle ring
were overwritten.  To prevent this, the check to see if the recycle
ring is full is changed to check if the next entry to be written is
NULL.

[bwh: Combine and rebase several commits so this is complete
 before the following buffer-packing changes.  Remove module
 parameter.]
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

2768935a

sfc: Enable RX DMA scattering where possible · 85740cdf

由 Ben Hutchings 提交于 1月 29, 2013

Enable RX DMA scattering iff an RX buffer large enough for the current
MTU will not fit into a single page and the NIC supports DMA
scattering for kernel-mode RX queues.

On Falcon and Siena, the RX_USR_BUF_SIZE field is used as the DMA
limit for both all RX queues with scatter enabled.  Set it to 1824,
matching what Onload uses now.

Maintain a statistic for frames truncated due to lack of descriptors
(rx_nodesc_trunc).  This is distinct from rx_frm_trunc which may be
incremented when scattering is disabled and implies an over-length
frame.

Whenever an MTU change causes scattering to be turned on or off,
update filters that point to the PF queues, but leave others
unchanged, as VF drivers assume scattering is off.

Add n_frags parameters to various functions, and make them iterate:
- efx_rx_packet()
- efx_recycle_rx_buffers()
- efx_rx_mk_skb()
- efx_rx_deliver()

Make efx_handle_rx_event() responsible for updating
efx_rx_queue::removed_count.

Change the RX pipeline state to a starting ring index and number of
fragments, and make __efx_rx_packet() responsible for clearing it.

Based on earlier versions by David Riddoch and Jon Cooper.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

85740cdf

sfc: Update RX buffer address together with length · b74e3e8c

由 Ben Hutchings 提交于 1月 29, 2013

Adjust rx_buf->page_offset when we eat the RX hash prefix.  Remove
efx_rx_buf_offset(), which is now redundant.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

b74e3e8c

sfc: Explicitly prefetch RX hash prefix, not just Ethernet heade · 5036b7c7

由 Ben Hutchings 提交于 1月 29, 2013

Currently we prefetch from the Ethernet header, but we will also read
the hash prefix.  In practice they should be in the same cache line
and this won't hurt, but it is still pointless to add on the hash
prefix size.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

5036b7c7

sfc: Replace efx_rx_buf_eh() with simpler efx_rx_buf_va() · b184f16b

由 Ben Hutchings 提交于 1月 29, 2013

efx_rx_buf_va() returns the virtual address of the current start of
the buffer. The callers must add the hash prefix size themselves.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

b184f16b

sfc: Wrap __efx_rx_packet() with efx_rx_flush_packet() · ff734ef4

由 Ben Hutchings 提交于 1月 29, 2013

The pipeline mechanism will need to change a bit for scattered
packets. Add a wrapper to insulate efx_process_channel() from this.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

ff734ef4

sfc: Properly distinguish RX buffer and DMA lengths · 272baeeb

由 Ben Hutchings 提交于 1月 29, 2013

Replace efx_nic::rx_buffer_len with efx_nic::rx_dma_len, the maximum
RX DMA length.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

272baeeb

sfc: Remove rx_alloc_method SKB · 97d48a10

由 Alexandre Rames 提交于 1月 11, 2013

[bwh: Remove more dead code, and make efx_ptp_rx() pull the data it
 needs into the header area.]
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

97d48a10

sfc: Allow efx_channel_type::receive_skb() to reject a packet · 4a74dc65

由 Ben Hutchings 提交于 3月 05, 2013

Instead of having efx_ptp_rx() call netif_receive_skb() for an invalid
PTP packet, make it return false for rejected packets and have
efx_rx_deliver() pass them up.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

4a74dc65

07 3月, 2013 1 次提交

sfc: Correct efx_rx_buffer::page_offset when EFX_PAGE_IP_ALIGN != 0 · c73e787a

由 Ben Hutchings 提交于 3月 05, 2013

RX DMA buffers start at an offset of EFX_PAGE_IP_ALIGN bytes from the
start of a cache line.  This offset obviously needs to be included in
the virtual address, but this was missed in commit b590ace0
('sfc: Fix efx_rx_buf_offset() in the presence of swiotlb') since
EFX_PAGE_IP_ALIGN is equal to 0 on both x86 and powerpc.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

c73e787a

26 2月, 2013 2 次提交

sfc: Fix efx_rx_buf_offset() in the presence of swiotlb · b590ace0

由 Ben Hutchings 提交于 1月 10, 2013

We assume that the mapping between DMA and virtual addresses is done
on whole pages, so we can find the page offset of an RX buffer using
the lower bits of the DMA address.  However, swiotlb maps in units of
2K, breaking this assumption.

Add an explicit page_offset field to struct efx_rx_buffer.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

b590ace0

sfc: Properly sync RX DMA buffer when it is not the last in the page · 3a68f19d

由 Ben Hutchings 提交于 12月 20, 2012

We may currently allocate two RX DMA buffers to a page, and only unmap
the page when the second is completed. We do not sync the first RX
buffer to be completed; this can result in packet loss or corruption
if the last RX buffer completed in a NAPI poll is the first in a page
and is not DMA-coherent. (In the middle of a NAPI poll, we will
handle the following RX completion and unmap the page *before* looking
at the content of the first buffer.)
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

3a68f19d

01 12月, 2012 1 次提交
- B
  sfc: Delete redundant page_addr variable from efx_init_rx_buffers_page() · b8e02517
  由 Ben Hutchings 提交于 9月 06, 2012
```
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>
```
  b8e02517
08 9月, 2012 2 次提交

sfc: Add channel specific receive_skb handler and post_remove callback · c31e5f9f

由 Stuart Hodgson 提交于 7月 18, 2012

Allows an extra channel to override the standard receive_skb handler
and also for extra non generic operations to be performed on remove.

Also set default rx strategy so only skbs can be delivered to the
PTP receive function.
Signed-off-by: NStuart Hodgson <smhodgson@solarflare.com>
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

c31e5f9f

sfc: Add explicit RX queue flag to channel · 79d68b37

由 Stuart Hodgson 提交于 7月 16, 2012

The PTP channel will have its own RX queue even though it's not
a regular traffic channel.

Original work by Ben Hutchings <bhutchings@solarflare.com>
Signed-off-by: NStuart Hodgson <smhodgson@solarflare.com>
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

79d68b37

17 7月, 2012 1 次提交
- B
  sfc: Use generic DMA API, not PCI-DMA API · 0e33d870
  由 Ben Hutchings 提交于 5月 17, 2012
```
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>
```
  0e33d870
11 7月, 2012 1 次提交

drivers/net/ethernet: Fix (nearly-)kernel-doc comments for various functions · 49ce9c2c

由 Ben Hutchings 提交于 7月 10, 2012

Fix incorrect start markers, wrapped summary lines, missing section
breaks, incorrect separators, and some name mismatches.  Delete
a few that are content-free.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>
Acked-by: NJeff Kirsher <jeffrey.t.kirsher@intel.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

49ce9c2c