提交 · 1648a23fa159e5c433aac06dc5e0d9db36146016 · openanolis / cloud-kernel

08 3月, 2013 11 次提交

sfc: allocate more RX buffers per page · 1648a23f

由 Daniel Pieczko 提交于 2月 13, 2013

Allocating 2 buffers per page is insanely inefficient when MTU is 1500
and PAGE_SIZE is 64K (as it usually is on POWER).  Allocate as many as
we can fit, and choose the refill batch size at run-time so that we
still always use a whole page at once.

[bwh: Fix loop condition to allow for compound pages; rebase]
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

1648a23f

sfc: Replace efx_rx_is_last_buffer() with a flag · 179ea7f0

由 Ben Hutchings 提交于 3月 07, 2013

This condition is brittle and we have lots of flags to spare.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

179ea7f0

sfc: reuse pages to avoid DMA mapping/unmapping costs · 2768935a

由 Daniel Pieczko 提交于 2月 13, 2013

On POWER systems, DMA mapping/unmapping operations are very expensive.
These changes reduce these costs by trying to reuse DMA mapped pages.

After all the buffers associated with a page have been processed and
passed up, the page is placed into a ring (if there is room).  For
each page that is required for a refill operation, a page in the ring
is examined to determine if its page count has fallen to 1, ie. the
kernel has released its reference to these packets.  If this is the
case, the page can be immediately added back into the RX descriptor
ring, without having to re-map it for DMA.

If the kernel is still holding a reference to this page, it is removed
from the ring and unmapped for DMA.  Then a new page, which can
immediately be used by RX buffers in the descriptor ring, is allocated
and DMA mapped.

The time a page needs to spend in the recycle ring before the kernel
has released its page references is based on the number of buffers
that use this page.  As large pages can hold more RX buffers, the RX
recycle ring can be shorter.  This reduces memory usage on POWER
systems, while maintaining the performance gain achieved by recycling
pages, following the driver change to pack more than two RX buffers
into large pages.

When an IOMMU is not present, the recycle ring can be small to reduce
memory usage, since DMA mapping operations are inexpensive.

With a small recycle ring, attempting to refill the descriptor queue
with more buffers than the equivalent size of the recycle ring could
ultimately lead to memory leaks if page entries in the recycle ring
were overwritten.  To prevent this, the check to see if the recycle
ring is full is changed to check if the next entry to be written is
NULL.

[bwh: Combine and rebase several commits so this is complete
 before the following buffer-packing changes.  Remove module
 parameter.]
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

2768935a

sfc: Enable RX DMA scattering where possible · 85740cdf

由 Ben Hutchings 提交于 1月 29, 2013

Enable RX DMA scattering iff an RX buffer large enough for the current
MTU will not fit into a single page and the NIC supports DMA
scattering for kernel-mode RX queues.

On Falcon and Siena, the RX_USR_BUF_SIZE field is used as the DMA
limit for both all RX queues with scatter enabled.  Set it to 1824,
matching what Onload uses now.

Maintain a statistic for frames truncated due to lack of descriptors
(rx_nodesc_trunc).  This is distinct from rx_frm_trunc which may be
incremented when scattering is disabled and implies an over-length
frame.

Whenever an MTU change causes scattering to be turned on or off,
update filters that point to the PF queues, but leave others
unchanged, as VF drivers assume scattering is off.

Add n_frags parameters to various functions, and make them iterate:
- efx_rx_packet()
- efx_recycle_rx_buffers()
- efx_rx_mk_skb()
- efx_rx_deliver()

Make efx_handle_rx_event() responsible for updating
efx_rx_queue::removed_count.

Change the RX pipeline state to a starting ring index and number of
fragments, and make __efx_rx_packet() responsible for clearing it.

Based on earlier versions by David Riddoch and Jon Cooper.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

85740cdf

sfc: Update RX buffer address together with length · b74e3e8c

由 Ben Hutchings 提交于 1月 29, 2013

Adjust rx_buf->page_offset when we eat the RX hash prefix.  Remove
efx_rx_buf_offset(), which is now redundant.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

b74e3e8c

sfc: Explicitly prefetch RX hash prefix, not just Ethernet heade · 5036b7c7

由 Ben Hutchings 提交于 1月 29, 2013

Currently we prefetch from the Ethernet header, but we will also read
the hash prefix.  In practice they should be in the same cache line
and this won't hurt, but it is still pointless to add on the hash
prefix size.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

5036b7c7

sfc: Replace efx_rx_buf_eh() with simpler efx_rx_buf_va() · b184f16b

由 Ben Hutchings 提交于 1月 29, 2013

efx_rx_buf_va() returns the virtual address of the current start of
the buffer. The callers must add the hash prefix size themselves.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

b184f16b

sfc: Wrap __efx_rx_packet() with efx_rx_flush_packet() · ff734ef4

由 Ben Hutchings 提交于 1月 29, 2013

The pipeline mechanism will need to change a bit for scattered
packets. Add a wrapper to insulate efx_process_channel() from this.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

ff734ef4

sfc: Properly distinguish RX buffer and DMA lengths · 272baeeb

由 Ben Hutchings 提交于 1月 29, 2013

Replace efx_nic::rx_buffer_len with efx_nic::rx_dma_len, the maximum
RX DMA length.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

272baeeb

sfc: Remove rx_alloc_method SKB · 97d48a10

由 Alexandre Rames 提交于 1月 11, 2013

[bwh: Remove more dead code, and make efx_ptp_rx() pull the data it
 needs into the header area.]
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

97d48a10

sfc: Allow efx_channel_type::receive_skb() to reject a packet · 4a74dc65

由 Ben Hutchings 提交于 3月 05, 2013

Instead of having efx_ptp_rx() call netif_receive_skb() for an invalid
PTP packet, make it return false for rejected packets and have
efx_rx_deliver() pass them up.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

4a74dc65

07 3月, 2013 1 次提交

sfc: Correct efx_rx_buffer::page_offset when EFX_PAGE_IP_ALIGN != 0 · c73e787a

由 Ben Hutchings 提交于 3月 05, 2013

RX DMA buffers start at an offset of EFX_PAGE_IP_ALIGN bytes from the
start of a cache line.  This offset obviously needs to be included in
the virtual address, but this was missed in commit b590ace0
('sfc: Fix efx_rx_buf_offset() in the presence of swiotlb') since
EFX_PAGE_IP_ALIGN is equal to 0 on both x86 and powerpc.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

c73e787a

26 2月, 2013 2 次提交

sfc: Fix efx_rx_buf_offset() in the presence of swiotlb · b590ace0

由 Ben Hutchings 提交于 1月 10, 2013

We assume that the mapping between DMA and virtual addresses is done
on whole pages, so we can find the page offset of an RX buffer using
the lower bits of the DMA address.  However, swiotlb maps in units of
2K, breaking this assumption.

Add an explicit page_offset field to struct efx_rx_buffer.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

b590ace0

sfc: Properly sync RX DMA buffer when it is not the last in the page · 3a68f19d

由 Ben Hutchings 提交于 12月 20, 2012

We may currently allocate two RX DMA buffers to a page, and only unmap
the page when the second is completed. We do not sync the first RX
buffer to be completed; this can result in packet loss or corruption
if the last RX buffer completed in a NAPI poll is the first in a page
and is not DMA-coherent. (In the middle of a NAPI poll, we will
handle the following RX completion and unmap the page *before* looking
at the content of the first buffer.)
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

3a68f19d

01 12月, 2012 1 次提交
- B
  sfc: Delete redundant page_addr variable from efx_init_rx_buffers_page() · b8e02517
  由 Ben Hutchings 提交于 9月 06, 2012
```
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>
```
  b8e02517
08 9月, 2012 2 次提交

sfc: Add channel specific receive_skb handler and post_remove callback · c31e5f9f

由 Stuart Hodgson 提交于 7月 18, 2012

Allows an extra channel to override the standard receive_skb handler
and also for extra non generic operations to be performed on remove.

Also set default rx strategy so only skbs can be delivered to the
PTP receive function.
Signed-off-by: NStuart Hodgson <smhodgson@solarflare.com>
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

c31e5f9f

sfc: Add explicit RX queue flag to channel · 79d68b37

由 Stuart Hodgson 提交于 7月 16, 2012

The PTP channel will have its own RX queue even though it's not
a regular traffic channel.

Original work by Ben Hutchings <bhutchings@solarflare.com>
Signed-off-by: NStuart Hodgson <smhodgson@solarflare.com>
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

79d68b37

17 7月, 2012 1 次提交
- B
  sfc: Use generic DMA API, not PCI-DMA API · 0e33d870
  由 Ben Hutchings 提交于 5月 17, 2012
```
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>
```
  0e33d870
11 7月, 2012 1 次提交

drivers/net/ethernet: Fix (nearly-)kernel-doc comments for various functions · 49ce9c2c

由 Ben Hutchings 提交于 7月 10, 2012

Fix incorrect start markers, wrapped summary lines, missing section
breaks, incorrect separators, and some name mismatches.  Delete
a few that are content-free.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>
Acked-by: NJeff Kirsher <jeffrey.t.kirsher@intel.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

49ce9c2c

10 5月, 2012 2 次提交

sfc: By default refill RX rings as soon as space for a batch · 64235187

由 David Riddoch 提交于 4月 11, 2012

Previously we refilled with much larger batches, which caused large latency
spikes. We now have many more much much smaller spikes!
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

64235187

sfc: Fill RX rings completely full, rather than to 95% full · da9ca505

由 David Riddoch 提交于 4月 11, 2012

There was no runtime control of the fast_fill_limit in any case, so purged
that field.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

da9ca505

07 3月, 2012 1 次提交

sfc: Update comments on efx_rx_packet_gro() · 61321d92

由 Ben Hutchings 提交于 2月 25, 2012

The in-tree driver has never supported Driverlink.  The rest of the
comments are rather redundant, but we can usefully state what the
requirements are on the buffer state.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

61321d92

25 2月, 2012 1 次提交

sfc: Fix assignment of ip_summed for pre-allocated skbs · ff3bc1e7

由 Ben Hutchings 提交于 2月 25, 2012

When pre-allocating skbs for received packets, we set ip_summed =
CHECKSUM_UNNCESSARY.  We used to change it back to CHECKSUM_NONE when
the received packet had an incorrect checksum or unhandled protocol.

Commit bc8acf2c ('drivers/net: avoid
some skb->ip_summed initializations') mistakenly replaced the latter
assignment with a DEBUG-only assertion that ip_summed ==
CHECKSUM_NONE.  This assertion is always false, but it seems no-one
has exercised this code path in a DEBUG build.

Fix this by moving our assignment of CHECKSUM_UNNECESSARY into
efx_rx_packet_gro().
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

ff3bc1e7

16 2月, 2012 2 次提交

sfc: Leave interrupts and event queues enabled whenever we can · 9f2cb71c

由 Ben Hutchings 提交于 2月 08, 2012

When SR-IOV is enabled we may receive FLR (Function-Level Reset)
events, associated queue flush events and requests from VF drivers at
any time.  Therefore we need to keep event queues and interrupts
enabled whenever possible.

Currently we stop interrupt-driven event processing before flushing RX
and TX queues; efx_nic_flush_queues() then polls event queues for
flush events and discards any others it finds.  Change it to work with
the regular event handling functions.

Currently efx_start_channel() fills RX queues synchronously when a
device is brought up.  This could now race with NAPI, so change it to
send fill events.

This was almost entirely written by Steve Hodgson, formerly
shodgson@solarflare.com.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

9f2cb71c

sfc: Generate RX fill events based on RX queues, not channels · 2ae75dac

由 Ben Hutchings 提交于 2月 07, 2012

This makes it harder to accidentally send such events to TX-only
channels.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

2ae75dac

31 1月, 2012 1 次提交

sfc: Use a more sensible cast in efx_rx_buf_offset() · 06e63c57

由 Ben Hutchings 提交于 1月 30, 2012

This function returns the page offset of the buffer, which can be
calculated based on either its DMA address or its virtual address. It
used to use the virtual address and we would cast that to unsigned
long, as anything smaller would result in a compiler warning. Now
that it's using the DMA address we should use unsigned int, matching
the return type. It is also unnecessary to use __force.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

06e63c57

27 1月, 2012 2 次提交
- B
  sfc: Replace efx_rx_buffer::is_page and other booleans with a flags field · db339569
  由 Ben Hutchings 提交于 8月 26, 2011
```
Replace checksummed and discard booleans from efx_handle_rx_event()
with a bitmask, added to the flags field.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>
```
  db339569
- B
  sfc: Move the end of the non-GRO RX path into its own function · 1ddceb4c
  由 Ben Hutchings 提交于 1月 23, 2012
```
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>
```
  1ddceb4c
06 1月, 2012 1 次提交

sfc: Remove parentheses around return expressions, reported by checkpatch · 0beaca2c

由 Ben Hutchings 提交于 1月 05, 2012

Fix the following error:

ERROR: return is not a function, parentheses are not required
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

0beaca2c

17 12月, 2011 1 次提交
- B
  sfc: Use skb_fill_page_desc() to simplify passing of page buffers to GRO · 70350b06
  由 Ben Hutchings 提交于 12月 16, 2011
```
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>
```
  70350b06
04 12月, 2011 1 次提交

sfc: Use kcalloc instead of kzalloc to allocate array · c2e4e25a

由 Thomas Meyer 提交于 12月 02, 2011

The advantage of kcalloc is, that will prevent integer overflows which could
result from the multiplication of number of elements and size and it is also
a bit nicer to read.

The semantic patch that makes this change is available
in https://lkml.org/lkml/2011/11/25/107Signed-off-by: NThomas Meyer <thomas@m3y3r.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

c2e4e25a

01 11月, 2011 1 次提交

drivers/net: Add moduleparam.h to drivers as required. · 6eb07caf

由 Paul Gortmaker 提交于 9月 15, 2011

These files were using moduleparam infrastructure, but were not
including anything for it -- which is fine when module.h is being
implicitly included in all files, but that is going away.
Signed-off-by: NPaul Gortmaker <paul.gortmaker@windriver.com>

6eb07caf

19 10月, 2011 1 次提交

net: add skb frag size accessors · 9e903e08

由 Eric Dumazet 提交于 10月 18, 2011

To ease skb->truesize sanitization, its better to be able to localize
all references to skb frags size.

Define accessors : skb_frag_size() to fetch frag size, and
skb_frag_size_{set|add|sub}() to manipulate it.
Signed-off-by: NEric Dumazet <eric.dumazet@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

9e903e08

23 9月, 2011 1 次提交

sfc: convert to SKB paged frag API. · 4a22c4c9

由 Ian Campbell 提交于 9月 21, 2011

Signed-off-by: NIan Campbell <ian.campbell@citrix.com>
Cc: Solarflare linux maintainers <linux-net-drivers@solarflare.com>
Cc: Steve Hodgson <shodgson@solarflare.com>
Cc: Ben Hutchings <bhutchings@solarflare.com>
Cc: netdev@vger.kernel.org
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

4a22c4c9

11 8月, 2011 1 次提交

sfc: Move the Solarflare drivers · 874aeea5

由 Jeff Kirsher 提交于 5月 13, 2011

Moves the Solarflare drivers into drivers/net/ethernet/sfc/ and
make the necessary Kconfig and Makefile changes.

CC: Steve Hodgson <shodgson@solarflare.com>
CC: Ben Hutchings <bhutchings@solarflare.com>
Signed-off-by: NJeff Kirsher <jeffrey.t.kirsher@intel.com>

874aeea5

23 5月, 2011 2 次提交

Add appropriate <linux/prefetch.h> include for prefetch users · 70c71606

由 Paul Gortmaker 提交于 5月 22, 2011

After discovering that wide use of prefetch on modern CPUs
could be a net loss instead of a win, net drivers which were
relying on the implicit inclusion of prefetch.h via the list
headers showed up in the resulting cleanup fallout.  Give
them an explicit include via the following $0.02 script.

 =========================================
 #!/bin/bash
 MANUAL=""
 for i in `git grep -l 'prefetch(.*)' .` ; do
 	grep -q '<linux/prefetch.h>' $i
 	if [ $? = 0 ] ; then
 		continue
 	fi

 	(	echo '?^#include <linux/?a'
 		echo '#include <linux/prefetch.h>'
 		echo .
 		echo w
 		echo q
 	) | ed -s $i > /dev/null 2>&1
 	if [ $? != 0 ]; then
 		echo $i needs manual fixup
 		MANUAL="$i $MANUAL"
 	fi
 done
 echo ------------------- 8\<----------------------
 echo vi $MANUAL
 =========================================
Signed-off-by: NPaul <paul.gortmaker@windriver.com>
[ Fixed up some incorrect #include placements, and added some
  non-network drivers and the fib_trie.c case    - Linus ]
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

70c71606

drivers/net: add prefetch header for prefetch users · c0cba59e

由 Paul Gortmaker 提交于 5月 22, 2011

After discovering that wide use of prefetch on modern CPUs
could be a net loss instead of a win, net drivers which were
relying on the implicit inclusion of prefetch.h via the list
headers showed up in the resulting cleanup fallout.  Give
them an explicit include via the following $0.02 script.

 =========================================
 #!/bin/bash
 MANUAL=""
 for i in `git grep -l 'prefetch(.*)' .` ; do
 	grep -q '<linux/prefetch.h>' $i
 	if [ $? = 0 ] ; then
 		continue
 	fi

 	(	echo '?^#include <linux/?a'
 		echo '#include <linux/prefetch.h>'
 		echo .
 		echo w
 		echo q
 	) | ed -s $i > /dev/null 2>&1
 	if [ $? != 0 ]; then
 		echo $i needs manual fixup
 		MANUAL="$i $MANUAL"
 	fi
 done
 echo ------------------- 8\<----------------------
 echo vi $MANUAL
 =========================================
Signed-off-by: NPaul <paul.gortmaker@windriver.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

c0cba59e

05 4月, 2011 1 次提交
- B
  sfc: Implement generic features interface · abfe9039
  由 Ben Hutchings 提交于 4月 05, 2011
```
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>
```
  abfe9039
02 4月, 2011 1 次提交

sfc: Move test of rx_checksum_enabled from nic.c to rx.c · ab3cf6d0

由 Ben Hutchings 提交于 4月 01, 2011

This is preparation for using the generic netdev features interface,
and should have no effect in itself.
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>

ab3cf6d0

01 3月, 2011 1 次提交
- B
  sfc: Update copyright dates · 0a6f40c6
  由 Ben Hutchings 提交于 2月 25, 2011
```
Signed-off-by: NBen Hutchings <bhutchings@solarflare.com>
```
  0a6f40c6

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功