提交 · 865be78022e9ae8151c755d01201012ccf5e3232 · openanolis / cloud-kernel

20 6月, 2017 3 次提交

ntb: no sleep in ntb_async_tx_submit · 88931ec3

由 Allen Hubbe 提交于 6月 09, 2017

Do not sleep in ntb_async_tx_submit, which could deadlock.
This reverts commit "8c874cc1"

Fixes: 8c874cc1 ("NTB: Address out of DMA descriptor issue with NTB")
Reported-by: NJia-Ju Bai <baijiaju1990@163.com>
Signed-off-by: NAllen Hubbe <Allen.Hubbe@dell.com>
Acked-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

88931ec3

ntb_transport: fix bug calculating num_qps_mw · 8e8496e0

由 Logan Gunthorpe 提交于 6月 05, 2017

A divide by zero error occurs if qp_count is less than mw_count because
num_qps_mw is calculated to be zero. The calculation appears to be
incorrect.

The requirement is for num_qps_mw to be set to qp_count / mw_count
with any remainder divided among the earlier mws.

For example, if mw_count is 5 and qp_count is 12 then mws 0 and 1
will have 3 qps per window and mws 2 through 4 will have 2 qps per window.
Thus, when mw_num < qp_count % mw_count, num_qps_mw is 1 higher
than when mw_num >= qp_count.
Signed-off-by: NLogan Gunthorpe <logang@deltatee.com>
Fixes: e26a5843 ("NTB: Split ntb_hw_intel and ntb_transport drivers")
Acked-by: NAllen Hubbe <Allen.Hubbe@dell.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

8e8496e0

ntb_transport: fix qp count bug · cb827ee6

由 Logan Gunthorpe 提交于 6月 05, 2017

In cases where there are more mw's than spads/2-2, the mw count gets
reduced to match the limitation. ntb_transport also tries to ensure that
there are fewer qps than mws but uses the full mw count instead of
the reduced one. When this happens, the math in
'ntb_transport_setup_qp_mw' will get confused and result in a kernel
paging request bug.

This patch fixes the bug by reducing qp_count to the reduced mw count
instead of the full mw count.
Signed-off-by: NLogan Gunthorpe <logang@deltatee.com>
Fixes: e26a5843 ("NTB: Split ntb_hw_intel and ntb_transport drivers")
Acked-by: NAllen Hubbe <Allen.Hubbe@dell.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

cb827ee6

17 2月, 2017 2 次提交

ntb_transport: Pick an unused queue · 8fcd0950

由 Thomas VanSelus 提交于 2月 13, 2017

Fix typo causing ntb_transport_create_queue to select the first
queue every time, instead of using the next free queue.
Signed-off-by: NThomas VanSelus <tvanselus@xes-inc.com>
Signed-off-by: NAaron Sierra <asierra@xes-inc.com>
Acked-by: NAllen Hubbe <Allen.Hubbe@dell.com>
Fixes: fce8a7bb ("PCI-Express Non-Transparent Bridge Support")
Signed-off-by: NJon Mason <jdmason@kudzu.us>

8fcd0950

NTB: ntb_transport: fix debugfs_remove_recursive · dd62245e

由 Allen Hubbe 提交于 12月 27, 2016

The call to debugfs_remove_recursive(qp->debugfs_dir) of the sub-level
directory must not be later than
debugfs_remove_recursive(nt_debugfs_dir) of the top-level directory.
Otherwise, the sub-level directory will not exist, and it would be
invalid (panic) to attempt to remove it.  This removes the top-level
directory last, after sub-level directories have been cleaned up.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@dell.com>
Fixes: e26a5843 ("NTB: Split ntb_hw_intel and ntb_transport drivers")
Signed-off-by: NJon Mason <jdmason@kudzu.us>

dd62245e

24 12月, 2016 2 次提交

ntb_transport: Remove unnecessary call to ntb_peer_spad_read · dfb7d24c

由 Steve Wahl 提交于 12月 21, 2016

The results were previously ignored, anyway.
Signed-off-by: NSteve Wahl <Steve.Wahl@dell.com>
Fixes: e26a5843Acked-by: NAllen Hubbe <Allen.Hubbe@dell.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

dfb7d24c

ntb_transport: Limit memory windows based on available, scratchpads · b17faba0

由 Shyam Sundar S K 提交于 12月 07, 2016

When the underlying NTB H/W driver advertises more memory windows
than the number of scratchpads available to setup MW's, it is likely
that we may end up filling the remaining memory windows with garbage.
So to avoid that, lets limit the memory windows that transport driver
can setup based on the available scratchpads.
Signed-off-by: NShyam Sundar S K <Shyam-sundar.S-k@amd.com>
Acked-by: NAllen Hubbe <Allen.Hubbe@dell.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

b17faba0

14 11月, 2016 1 次提交

ntb_transport: make DMA_OUT_RESOURCE_TO HZ independent · c0a88032

由 Nicholas Mc Guire 提交于 8月 22, 2016

schedule_timeout_* takes a timeout in jiffies but the code currently is
passing in a constant which makes this timeout HZ dependent, so pass it
through msecs_to_jiffies() to fix this up.
Signed-off-by: NNicholas Mc Guire <hofrat@osadl.org>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

c0a88032

08 8月, 2016 2 次提交

ntb: add DMA error handling for RX DMA · 72203572

由 Dave Jiang 提交于 7月 20, 2016

Adding support on the rx DMA path to allow recovery of errors when
DMA responds with error status and abort all the subsequent ops.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Acked-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Cc: Jon Mason <jdmason@kudzu.us>
Cc: linux-ntb@googlegroups.com
Signed-off-by: NVinod Koul <vinod.koul@intel.com>

72203572

ntb: add DMA error handling for TX DMA · 9cabc269

由 Dave Jiang 提交于 7月 20, 2016

Adding support on the tx DMA path to allow recovery of errors when
DMA responds with error status and abort all the subsequent ops.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Acked-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Cc: Jon Mason <jdmason@kudzu.us>
Cc: linux-ntb@googlegroups.com
Signed-off-by: NVinod Koul <vinod.koul@intel.com>

9cabc269

05 8月, 2016 2 次提交

ntb_transport: Check the number of spads the hardware supports · 19645a07

由 Logan Gunthorpe 提交于 6月 07, 2016

I'm working on hardware that currently has a limited number of
scratchpad registers and ntb_ndev fails with no clue as to why. I
feel it is better to fail early and provide a reasonable error message
then to fail later on.

The same is done to ntb_perf, but it doesn't currently require enough
spads to actually fail. I've also removed the unused SPAD_MSG and
SPAD_ACK enums so that MAX_SPAD accurately reflects the number of
spads used.
Signed-off-by: NLogan Gunthorpe <logang@deltatee.com>
Acked-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

19645a07

NTB: allocate number transport entries depending on size of ring size · a754a8fc

由 Dave Jiang 提交于 4月 08, 2016

Currently we only allocate a fixed default number of descriptors for the tx
and rx side. We should dynamically resize it to the number of descriptors
resides in the transport rings. We should know the number of transmit
descriptors at initializaiton. We will allocate the default number of
descriptors for receive side and allocate additional ones when we know the
actual max entries for receive.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Acked-by: NAllen Hubbe <allen.hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

a754a8fc

18 3月, 2016 2 次提交

ntb: stop link work when we do not have memory · 84f76685

由 Dave Jiang 提交于 2月 29, 2016

Instead of keep trying to go through the init routine when we aren't able
to allocate memory, we should just stop and go down.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

84f76685

ntb: stop tasklet from spinning forever during shutdown. · e9021331

由 Dave Jiang 提交于 2月 23, 2016

We can leave tasklet spinning forever if we disable the tasklet during
qp shutdown and the tasklets are still being kicked off. This hopefully
should avoid that race condition.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Reported-by: NAlex Depoutovitch <alex@pernixdata.com>
Tested-by: NAlex Depoutovitch <alex@pernixdata.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

e9021331

11 1月, 2016 2 次提交

NTB: Address out of DMA descriptor issue with NTB · 8c874cc1

由 Dave Jiang 提交于 1月 08, 2016

The transport right now does not handle the case where we run out of DMA
descriptors. We just fail when we do not succeed. Adding code to retry for
a bit attempting to use the DMA engine instead of instantly fail to CPU
copy.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Reviewed-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

8c874cc1

NTB: ntb_process_tx error path bug · 179f912a

由 Jon Mason 提交于 12月 18, 2015

The transmit overrun avoidance error path in ntb_process_tx accidentally
swapped the first two values being passed to the tx_handler client.
This could result in crashes in the ntb_netdev (or other out-of-tree NTB
clients).
Reported-by: NAlex Depoutovitch <alex@pernixdata.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

179f912a

09 11月, 2015 5 次提交

NTB: fix 32-bit compiler warning · fdcb4b2e

由 Arnd Bergmann 提交于 10月 07, 2015

resource_size_t may be 32-bit wide on some architectures, which causes
this warning when building the NTB code:

drivers/ntb/ntb_transport.c: In function 'ntb_transport_link_work':
drivers/ntb/ntb_transport.c:828:46: warning: right shift count >= width of type [-Wshift-count-overflow]

The warning is harmless but can be avoided by using the upper_32_bits()
macro.
Signed-off-by: NArnd Bergmann <arnd@arndb.de>
Fixes: e26a5843 ("NTB: Split ntb_hw_intel and ntb_transport drivers")
Signed-off-by: NJon Mason <jdmason@kudzu.us>

fdcb4b2e

NTB: invalid buf pointer in multi-MW setups · c92ba3c5

由 Jon Mason 提交于 10月 04, 2015

Order of operations issue with the QP Num and MW count, which would
result in the receive buffer pointer being invalid if there are more
than 1 MW.  Corrected with parenthesis to enforce the proper order of
operations.
Reported-by: NJohn I. Kading <John.Kading@gd-ms.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

c92ba3c5

NTB: remove unused variable · 70d4687d

由 Sudip Mukherjee 提交于 10月 03, 2015

These variables were not used anywhere. So remove them.
Signed-off-by: NSudip Mukherjee <sudip@vectorindia.org>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

70d4687d

NTB: fix access of free-ed pointer · d4adee09

由 Sudip Mukherjee 提交于 10月 03, 2015

We were accessing nt->mw_vec after freeing it. Fix the error path so
that we free nt->mw_vec after we have finished using it.
Signed-off-by: NSudip Mukherjee <sudip@vectorindia.org>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

d4adee09

NTB: Fix issue where we may be accessing NULL ptr · 04afde45

由 Dave Jiang 提交于 9月 17, 2015

smatch detected an issue in the function ntb_transport_max_size() where
we could be dereferencing a dma channel pointer when it is NULL.
Reported-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

04afde45

08 9月, 2015 5 次提交

NTB: Use unique DMA channels for TX and RX · 569410ca

由 Dave Jiang 提交于 7月 13, 2015

Allocate two DMA channels, one for TX operation and one for RX
operation, instead of having one DMA channel for everything. This
provides slightly better performance, and also will make error handling
cleaner later on.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

569410ca

NTB: Remove dma_sync_wait from ntb_async_rx · 905921e7

由 Allen Hubbe 提交于 7月 13, 2015

The dma_sync_wait can hurt the performance of workloads mixed with both
large and small frames. Large frames will be copied using the dma
engine. Small frames will be copied by the cpu. The dma_sync_wait
prevents the cpu and dma engine copying in parallel.

In the period where the cpu is copying, the dma engine is stopped. The
dma engine is not doing any useful work to copy large frames during that
time, and the additional time to restart the dma engine for the next
large frame. This will decrease the throughput for the portion of a
workload with large frames.

In the period where the dma engine is copying, the cpu is held up
waiting for dma to complete. The small frames processing will be
delayed until the dma is complete. The RX frames are completed
in-order, and the processing of small frames takes very little time, so
dma_sync_wait may have an insignificant impact on the respose time of
frames. The more significant impact is to the system, because the delay
in dma_sync_wait is implemented as busy non-blocking wait. This can
prevent the delayed core from doing any useful work, even if it could be
processing work for other drivers, unrelated to transport RX processing.

After applying the earlier patch to fix out-of-order RX acknoledgement,
the dma_sync_wait is no longer necessary. Remove it, so that cpu memcpy
will proceed immediately for small frames, in parallel with ongoing dma
for large frames. Do not hold up the cpu from doing work while dma is
in progress. The prior fix will continue to ensure in-order completion
of the RX frames to the upper layer, and in-order delivery of the RX
acknoledgement.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

905921e7

NTB: Clean up QP stats info · d98ef99e

由 Dave Jiang 提交于 7月 13, 2015

Make QP stats info more readable for debugging purposes.  Also add an
entry to indicate whether DMA is being used.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

d98ef99e

NTB: Make the transport list in order of discovery · 31510000

由 Dave Jiang 提交于 7月 13, 2015

The list should be added from the bottom and not the top in order to
ensure the transport is provided in the same order to clients as ntb
devices are discovered.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

31510000

NTB: Add flow control to the ntb_netdev · e74bfeed

由 Dave Jiang 提交于 7月 13, 2015

Right now if we push the NTB really hard, we start dropping packets due
to not able to process the packets fast enough. We need to st:qop the
upper layer from flooding us when that happens.

A timer is necessary in order to restart the queue once the resource has
been processed on the receive side. Due to the way NTB is setup, the
resources on the tx side are tied to the processing of the rx side and
there's no async way to know when the rx side has released those
resources.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

e74bfeed

10 8月, 2015 6 次提交

NTB: Fix dereference before check · 30a4bb1e

由 Allen Hubbe 提交于 7月 13, 2015

Remove early dereference of a pointer that is checked later in the code.
Reported-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

30a4bb1e

NTB: Fix zero size or integer overflow in ntb_set_mw · 8c9edf63

由 Allen Hubbe 提交于 7月 13, 2015

A plain 32 bit integer will overflow for values over 4GiB.

Change the plain integer size to the appropriate size type in
ntb_set_mw.  Change the type of the size parameter and two local
variables used for size.

Even if there is no overflow, a size of zero is invalid here.
Reported-by: NJuyoung Jung <jjung@micron.com>
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

8c9edf63

NTB: Schedule to receive on QP link up · 8b5a22d8

由 Allen Hubbe 提交于 7月 13, 2015

Schedule to receive on QP link up, to make sure that the doorbell is
properly cleared for interrupts.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

8b5a22d8

NTB: Fix oops in debugfs when transport is half-up · 260bee94

由 Dave Jiang 提交于 7月 13, 2015

When the remote side is not up, we do not have all the context for the
transport, and that causes NULL ptr access. Have the debugfs reads check
to see if transport is up before we make access.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

260bee94

NTB: Fix transport stats for multiple devices · c8650fd0

由 Dave Jiang 提交于 7月 13, 2015

Currently the debugfs does not have files for all NTB transport queue
pairs.  When there are multiple NTBs present in a system, the QP names
of the last transport clobber the names of previously added transport
QPs.  Only the last added QPs can be observed via debugfs.

Create a directory per NTB transport to associate the QPs with that
transport.  Name the directory the same as the PCI device.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

c8650fd0

NTB: Fix ntb_transport out-of-order RX update · da2e5ae5

由 Allen Hubbe 提交于 7月 13, 2015

It was possible for a synchronous update of the RX index in the error
case to get ahead of the asynchronous RX index update in the normal
case. Change the RX processing to preserve an RX completion order.

There were two error cases. First, if a buffer is not present to
receive data, there would be no queue entry to preserve the RX
completion order. Instead of dropping the RX frame, leave the RX frame
in the ring. Schedule RX processing when RX entries are enqueued, in
case there are RX frames waiting in the ring to be received.

Second, if a buffer is too small to receive data, drop the frame in the
ring, mark the RX entry as done, and indicate the error in the RX entry
length. Check for a negative length in the receive callback in
ntb_netdev, and count occurrences as rx_length_errors.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

da2e5ae5

05 7月, 2015 8 次提交

NTB: Print driver name and version in module init · 7eb38781

由 Dave Jiang 提交于 6月 15, 2015

Printouts driver name and version to indicate what is being loaded.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

7eb38781

NTB: Increase transport MTU to 64k from 16k · 9891417d

由 Dave Jiang 提交于 6月 03, 2015

Benchmarking showed a significant performance increase with the MTU size
to 64k instead of 16k.  Change the driver default to 64k.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

9891417d

NTB: Default to CPU memcpy for performance · a41ef053

由 Dave Jiang 提交于 5月 19, 2015

Disable DMA usage by default, since the CPU provides much better
performance with write combining.  Provide a module parameter to enable
DMA usage when offloading the memcpy is preferred.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

a41ef053

NTB: Improve performance with write combining · 06917f75

由 Dave Jiang 提交于 5月 19, 2015

Changing the memory window BAR mappings to write combining significantly
boosts the performance.  We will also use memcpy that uses non-temporal
store, which showed performance improvement when doing non-cached
memcpys.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

06917f75

NTB: Use NUMA memory and DMA chan in transport · 1199aa61

由 Allen Hubbe 提交于 5月 18, 2015

Allocate memory and request the DMA channel for the same NUMA node as
the NTB device.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

1199aa61

NTB: Rate limit ntb_qp_link_work · 28762289

由 Allen Hubbe 提交于 5月 11, 2015

When the ntb transport is connecting and waiting for the peer, the debug
console receives lots of debug level messages about the remote qp link
status being down.  Rate limit those messages.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

28762289

NTB: Reset transport QP link stats on down · 2849b5d7

由 Allen Hubbe 提交于 5月 12, 2015

Reset the link stats when the link goes down.  In particular, the TX and
RX index and count must be reset, or else the TX side will be sending
packets to the RX side where the RX side is not expecting them.  Reset
all the stats, to be consistent.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

2849b5d7

NTB: Do not advance transport RX on link down · c0900b33

由 Allen Hubbe 提交于 5月 12, 2015

On link down, don't advance RX index to the next entry.  The next entry
should never be valid after receiving the link down flag.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

c0900b33

openanolis / cloud-kernel 接近 2 年 前同步成功

openanolis / cloud-kernel
接近 2 年前同步成功