提交 · a754a8fcaf383be3c5fcc6c3c08e36d9f3005988 · openeuler / raspberrypi-kernel

05 8月, 2016 1 次提交

NTB: allocate number transport entries depending on size of ring size · a754a8fc

由 Dave Jiang 提交于 4月 08, 2016

Currently we only allocate a fixed default number of descriptors for the tx
and rx side. We should dynamically resize it to the number of descriptors
resides in the transport rings. We should know the number of transmit
descriptors at initializaiton. We will allocate the default number of
descriptors for receive side and allocate additional ones when we know the
actual max entries for receive.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Acked-by: NAllen Hubbe <allen.hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

a754a8fc

18 3月, 2016 2 次提交

ntb: stop link work when we do not have memory · 84f76685

由 Dave Jiang 提交于 2月 29, 2016

Instead of keep trying to go through the init routine when we aren't able
to allocate memory, we should just stop and go down.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

84f76685

ntb: stop tasklet from spinning forever during shutdown. · e9021331

由 Dave Jiang 提交于 2月 23, 2016

We can leave tasklet spinning forever if we disable the tasklet during
qp shutdown and the tasklets are still being kicked off. This hopefully
should avoid that race condition.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Reported-by: NAlex Depoutovitch <alex@pernixdata.com>
Tested-by: NAlex Depoutovitch <alex@pernixdata.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

e9021331

11 1月, 2016 2 次提交

NTB: Address out of DMA descriptor issue with NTB · 8c874cc1

由 Dave Jiang 提交于 1月 08, 2016

The transport right now does not handle the case where we run out of DMA
descriptors. We just fail when we do not succeed. Adding code to retry for
a bit attempting to use the DMA engine instead of instantly fail to CPU
copy.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Reviewed-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

8c874cc1

NTB: ntb_process_tx error path bug · 179f912a

由 Jon Mason 提交于 12月 18, 2015

The transmit overrun avoidance error path in ntb_process_tx accidentally
swapped the first two values being passed to the tx_handler client.
This could result in crashes in the ntb_netdev (or other out-of-tree NTB
clients).
Reported-by: NAlex Depoutovitch <alex@pernixdata.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

179f912a

09 11月, 2015 5 次提交

NTB: fix 32-bit compiler warning · fdcb4b2e

由 Arnd Bergmann 提交于 10月 07, 2015

resource_size_t may be 32-bit wide on some architectures, which causes
this warning when building the NTB code:

drivers/ntb/ntb_transport.c: In function 'ntb_transport_link_work':
drivers/ntb/ntb_transport.c:828:46: warning: right shift count >= width of type [-Wshift-count-overflow]

The warning is harmless but can be avoided by using the upper_32_bits()
macro.
Signed-off-by: NArnd Bergmann <arnd@arndb.de>
Fixes: e26a5843 ("NTB: Split ntb_hw_intel and ntb_transport drivers")
Signed-off-by: NJon Mason <jdmason@kudzu.us>

fdcb4b2e

NTB: invalid buf pointer in multi-MW setups · c92ba3c5

由 Jon Mason 提交于 10月 04, 2015

Order of operations issue with the QP Num and MW count, which would
result in the receive buffer pointer being invalid if there are more
than 1 MW.  Corrected with parenthesis to enforce the proper order of
operations.
Reported-by: NJohn I. Kading <John.Kading@gd-ms.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

c92ba3c5

NTB: remove unused variable · 70d4687d

由 Sudip Mukherjee 提交于 10月 03, 2015

These variables were not used anywhere. So remove them.
Signed-off-by: NSudip Mukherjee <sudip@vectorindia.org>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

70d4687d

NTB: fix access of free-ed pointer · d4adee09

由 Sudip Mukherjee 提交于 10月 03, 2015

We were accessing nt->mw_vec after freeing it. Fix the error path so
that we free nt->mw_vec after we have finished using it.
Signed-off-by: NSudip Mukherjee <sudip@vectorindia.org>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

d4adee09

NTB: Fix issue where we may be accessing NULL ptr · 04afde45

由 Dave Jiang 提交于 9月 17, 2015

smatch detected an issue in the function ntb_transport_max_size() where
we could be dereferencing a dma channel pointer when it is NULL.
Reported-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

04afde45

08 9月, 2015 5 次提交

NTB: Use unique DMA channels for TX and RX · 569410ca

由 Dave Jiang 提交于 7月 13, 2015

Allocate two DMA channels, one for TX operation and one for RX
operation, instead of having one DMA channel for everything. This
provides slightly better performance, and also will make error handling
cleaner later on.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

569410ca

NTB: Remove dma_sync_wait from ntb_async_rx · 905921e7

由 Allen Hubbe 提交于 7月 13, 2015

The dma_sync_wait can hurt the performance of workloads mixed with both
large and small frames. Large frames will be copied using the dma
engine. Small frames will be copied by the cpu. The dma_sync_wait
prevents the cpu and dma engine copying in parallel.

In the period where the cpu is copying, the dma engine is stopped. The
dma engine is not doing any useful work to copy large frames during that
time, and the additional time to restart the dma engine for the next
large frame. This will decrease the throughput for the portion of a
workload with large frames.

In the period where the dma engine is copying, the cpu is held up
waiting for dma to complete. The small frames processing will be
delayed until the dma is complete. The RX frames are completed
in-order, and the processing of small frames takes very little time, so
dma_sync_wait may have an insignificant impact on the respose time of
frames. The more significant impact is to the system, because the delay
in dma_sync_wait is implemented as busy non-blocking wait. This can
prevent the delayed core from doing any useful work, even if it could be
processing work for other drivers, unrelated to transport RX processing.

After applying the earlier patch to fix out-of-order RX acknoledgement,
the dma_sync_wait is no longer necessary. Remove it, so that cpu memcpy
will proceed immediately for small frames, in parallel with ongoing dma
for large frames. Do not hold up the cpu from doing work while dma is
in progress. The prior fix will continue to ensure in-order completion
of the RX frames to the upper layer, and in-order delivery of the RX
acknoledgement.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

905921e7

NTB: Clean up QP stats info · d98ef99e

由 Dave Jiang 提交于 7月 13, 2015

Make QP stats info more readable for debugging purposes.  Also add an
entry to indicate whether DMA is being used.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

d98ef99e

NTB: Make the transport list in order of discovery · 31510000

由 Dave Jiang 提交于 7月 13, 2015

The list should be added from the bottom and not the top in order to
ensure the transport is provided in the same order to clients as ntb
devices are discovered.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

31510000

NTB: Add flow control to the ntb_netdev · e74bfeed

由 Dave Jiang 提交于 7月 13, 2015

Right now if we push the NTB really hard, we start dropping packets due
to not able to process the packets fast enough. We need to st:qop the
upper layer from flooding us when that happens.

A timer is necessary in order to restart the queue once the resource has
been processed on the receive side. Due to the way NTB is setup, the
resources on the tx side are tied to the processing of the rx side and
there's no async way to know when the rx side has released those
resources.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

e74bfeed

10 8月, 2015 6 次提交

NTB: Fix dereference before check · 30a4bb1e

由 Allen Hubbe 提交于 7月 13, 2015

Remove early dereference of a pointer that is checked later in the code.
Reported-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

30a4bb1e

NTB: Fix zero size or integer overflow in ntb_set_mw · 8c9edf63

由 Allen Hubbe 提交于 7月 13, 2015

A plain 32 bit integer will overflow for values over 4GiB.

Change the plain integer size to the appropriate size type in
ntb_set_mw.  Change the type of the size parameter and two local
variables used for size.

Even if there is no overflow, a size of zero is invalid here.
Reported-by: NJuyoung Jung <jjung@micron.com>
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

8c9edf63

NTB: Schedule to receive on QP link up · 8b5a22d8

由 Allen Hubbe 提交于 7月 13, 2015

Schedule to receive on QP link up, to make sure that the doorbell is
properly cleared for interrupts.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

8b5a22d8

NTB: Fix oops in debugfs when transport is half-up · 260bee94

由 Dave Jiang 提交于 7月 13, 2015

When the remote side is not up, we do not have all the context for the
transport, and that causes NULL ptr access. Have the debugfs reads check
to see if transport is up before we make access.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

260bee94

NTB: Fix transport stats for multiple devices · c8650fd0

由 Dave Jiang 提交于 7月 13, 2015

Currently the debugfs does not have files for all NTB transport queue
pairs.  When there are multiple NTBs present in a system, the QP names
of the last transport clobber the names of previously added transport
QPs.  Only the last added QPs can be observed via debugfs.

Create a directory per NTB transport to associate the QPs with that
transport.  Name the directory the same as the PCI device.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

c8650fd0

NTB: Fix ntb_transport out-of-order RX update · da2e5ae5

由 Allen Hubbe 提交于 7月 13, 2015

It was possible for a synchronous update of the RX index in the error
case to get ahead of the asynchronous RX index update in the normal
case. Change the RX processing to preserve an RX completion order.

There were two error cases. First, if a buffer is not present to
receive data, there would be no queue entry to preserve the RX
completion order. Instead of dropping the RX frame, leave the RX frame
in the ring. Schedule RX processing when RX entries are enqueued, in
case there are RX frames waiting in the ring to be received.

Second, if a buffer is too small to receive data, drop the frame in the
ring, mark the RX entry as done, and indicate the error in the RX entry
length. Check for a negative length in the receive callback in
ntb_netdev, and count occurrences as rx_length_errors.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

da2e5ae5

05 7月, 2015 11 次提交

NTB: Print driver name and version in module init · 7eb38781

由 Dave Jiang 提交于 6月 15, 2015

Printouts driver name and version to indicate what is being loaded.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

7eb38781

NTB: Increase transport MTU to 64k from 16k · 9891417d

由 Dave Jiang 提交于 6月 03, 2015

Benchmarking showed a significant performance increase with the MTU size
to 64k instead of 16k.  Change the driver default to 64k.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

9891417d

NTB: Default to CPU memcpy for performance · a41ef053

由 Dave Jiang 提交于 5月 19, 2015

Disable DMA usage by default, since the CPU provides much better
performance with write combining.  Provide a module parameter to enable
DMA usage when offloading the memcpy is preferred.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

a41ef053

NTB: Improve performance with write combining · 06917f75

由 Dave Jiang 提交于 5月 19, 2015

Changing the memory window BAR mappings to write combining significantly
boosts the performance.  We will also use memcpy that uses non-temporal
store, which showed performance improvement when doing non-cached
memcpys.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

06917f75

NTB: Use NUMA memory and DMA chan in transport · 1199aa61

由 Allen Hubbe 提交于 5月 18, 2015

Allocate memory and request the DMA channel for the same NUMA node as
the NTB device.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

1199aa61

NTB: Rate limit ntb_qp_link_work · 28762289

由 Allen Hubbe 提交于 5月 11, 2015

When the ntb transport is connecting and waiting for the peer, the debug
console receives lots of debug level messages about the remote qp link
status being down.  Rate limit those messages.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

28762289

NTB: Reset transport QP link stats on down · 2849b5d7

由 Allen Hubbe 提交于 5月 12, 2015

Reset the link stats when the link goes down.  In particular, the TX and
RX index and count must be reset, or else the TX side will be sending
packets to the RX side where the RX side is not expecting them.  Reset
all the stats, to be consistent.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

2849b5d7

NTB: Do not advance transport RX on link down · c0900b33

由 Allen Hubbe 提交于 5月 12, 2015

On link down, don't advance RX index to the next entry.  The next entry
should never be valid after receiving the link down flag.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

c0900b33

NTB: Differentiate transport link down messages · e22e0b9d

由 Allen Hubbe 提交于 5月 12, 2015

The same message "qp %d: Link Down\n" was printed at two locations in
ntb_transport.  Change the messages so they are distinct.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

e22e0b9d

NTB: Read peer info from local SPAD in transport · 0f69a7df

由 Dave Jiang 提交于 6月 02, 2015

The transport was writing and then reading the peer scratch pad,
essentially reading what it just wrote instead of exchanging any
information with the peer.  The transport expects the peer values to be
the same as the local values, so this issue was not obvious.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

0f69a7df

NTB: Split ntb_hw_intel and ntb_transport drivers · e26a5843

由 Allen Hubbe 提交于 4月 09, 2015

Change ntb_hw_intel to use the new NTB hardware abstraction layer.

Split ntb_transport into its own driver.  Change it to use the new NTB
hardware abstraction layer.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

e26a5843

02 7月, 2015 1 次提交

NTB: Move files in preparation for NTB abstraction · ec110bc7

由 Allen Hubbe 提交于 5月 07, 2015

This patch only moves files to their new locations, before applying the
next two patches adding the NTB Abstraction layer.  Splitting this patch
from the next is intended make distinct which code is changed only due
to moving the files, versus which are substantial code changes in adding
the NTB Abstraction layer.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

ec110bc7

14 9月, 2014 2 次提交

ntb: Add alignment check to meet hardware requirement · 3cc5ba19

由 Dave Jiang 提交于 8月 28, 2014

The NTB translate register must have the value to be BAR size aligned.
This alignment check make sure that the DMA memory allocated has the
proper alignment. Another requirement for NTB to function properly with
memory window BAR size greater or equal to 4M is to use the CMA feature
in 3.16 kernel with the appropriate CONFIG_CMA_ALIGNMENT and
CONFIG_CMA_SIZE_MBYTES set.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

3cc5ba19

NTB: correct the spread of queues over mw's · a1413cfb

由 Jon Mason 提交于 6月 19, 2014

The detection of an uneven number of queues on the given memory windows
was not correct.  The mw_num is zero based and the mod should be
division to spread them evenly over the mw's.
Signed-off-by: NJon Mason <jon.mason@intel.com>

a1413cfb

08 4月, 2014 2 次提交

NTB: Code Style Clean-up · 53ca4fea

由 Jon Mason 提交于 11月 26, 2013

Some white space and 80 char overruns corrected.
Signed-off-by: NJon Mason <jon.mason@intel.com>

53ca4fea

NTB: client event cleanup · 403c63cb

由 Jon Mason 提交于 7月 29, 2013

Provide a better event interface between the client and transport
Signed-off-by: NJon Mason <jon.mason@intel.com>

403c63cb

21 11月, 2013 3 次提交

J
NTB: Disable interrupts and poll under high load · e8aeb60c
由 Jon Mason 提交于 4月 18, 2013
```
Disable interrupts and poll under high load
Signed-off-by: NJon Mason <jon.mason@intel.com>
```
e8aeb60c

NTB: correct dmaengine_get/put usage · 94681194

由 Jon Mason 提交于 11月 19, 2013

dmaengine_get() causes the initialization of the per-cpu channel tables.
It needs to be called prior to dma_find_channel().

Initial version by Dan Williams <dan.j.williams@intel.com>
Signed-off-by: NJon Mason <jon.mason@intel.com>

94681194

NTB: Fix ntb_transport link down race · fca4d518

由 Jon Mason 提交于 9月 09, 2013

A WARN_ON is being hit in ntb_qp_link_work due to the NTB transport link
being down while the ntb qp link is still active. This is caused by the
transport link being brought down prior to the qp link worker thread
being terminated. To correct this, shutdown the qp's prior to bringing
the transport link down. Also, only call the qp worker thread if it is
in interrupt context, otherwise call the function directly.
Signed-off-by: NJon Mason <jon.mason@intel.com>

fca4d518