提交 · 04afde45e096201f8fd74c1db848a5d85d1aa57d · openeuler / raspberrypi-kernel

09 11月, 2015 1 次提交

NTB: Fix issue where we may be accessing NULL ptr · 04afde45

由 Dave Jiang 提交于 9月 17, 2015

smatch detected an issue in the function ntb_transport_max_size() where
we could be dereferencing a dma channel pointer when it is NULL.
Reported-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

04afde45

08 9月, 2015 5 次提交

NTB: Use unique DMA channels for TX and RX · 569410ca

由 Dave Jiang 提交于 7月 13, 2015

Allocate two DMA channels, one for TX operation and one for RX
operation, instead of having one DMA channel for everything. This
provides slightly better performance, and also will make error handling
cleaner later on.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

569410ca

NTB: Remove dma_sync_wait from ntb_async_rx · 905921e7

由 Allen Hubbe 提交于 7月 13, 2015

The dma_sync_wait can hurt the performance of workloads mixed with both
large and small frames. Large frames will be copied using the dma
engine. Small frames will be copied by the cpu. The dma_sync_wait
prevents the cpu and dma engine copying in parallel.

In the period where the cpu is copying, the dma engine is stopped. The
dma engine is not doing any useful work to copy large frames during that
time, and the additional time to restart the dma engine for the next
large frame. This will decrease the throughput for the portion of a
workload with large frames.

In the period where the dma engine is copying, the cpu is held up
waiting for dma to complete. The small frames processing will be
delayed until the dma is complete. The RX frames are completed
in-order, and the processing of small frames takes very little time, so
dma_sync_wait may have an insignificant impact on the respose time of
frames. The more significant impact is to the system, because the delay
in dma_sync_wait is implemented as busy non-blocking wait. This can
prevent the delayed core from doing any useful work, even if it could be
processing work for other drivers, unrelated to transport RX processing.

After applying the earlier patch to fix out-of-order RX acknoledgement,
the dma_sync_wait is no longer necessary. Remove it, so that cpu memcpy
will proceed immediately for small frames, in parallel with ongoing dma
for large frames. Do not hold up the cpu from doing work while dma is
in progress. The prior fix will continue to ensure in-order completion
of the RX frames to the upper layer, and in-order delivery of the RX
acknoledgement.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

905921e7

NTB: Clean up QP stats info · d98ef99e

由 Dave Jiang 提交于 7月 13, 2015

Make QP stats info more readable for debugging purposes.  Also add an
entry to indicate whether DMA is being used.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

d98ef99e

NTB: Make the transport list in order of discovery · 31510000

由 Dave Jiang 提交于 7月 13, 2015

The list should be added from the bottom and not the top in order to
ensure the transport is provided in the same order to clients as ntb
devices are discovered.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

31510000

NTB: Add flow control to the ntb_netdev · e74bfeed

由 Dave Jiang 提交于 7月 13, 2015

Right now if we push the NTB really hard, we start dropping packets due
to not able to process the packets fast enough. We need to st:qop the
upper layer from flooding us when that happens.

A timer is necessary in order to restart the queue once the resource has
been processed on the receive side. Due to the way NTB is setup, the
resources on the tx side are tied to the processing of the rx side and
there's no async way to know when the rx side has released those
resources.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

e74bfeed

10 8月, 2015 6 次提交

NTB: Fix dereference before check · 30a4bb1e

由 Allen Hubbe 提交于 7月 13, 2015

Remove early dereference of a pointer that is checked later in the code.
Reported-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

30a4bb1e

NTB: Fix zero size or integer overflow in ntb_set_mw · 8c9edf63

由 Allen Hubbe 提交于 7月 13, 2015

A plain 32 bit integer will overflow for values over 4GiB.

Change the plain integer size to the appropriate size type in
ntb_set_mw.  Change the type of the size parameter and two local
variables used for size.

Even if there is no overflow, a size of zero is invalid here.
Reported-by: NJuyoung Jung <jjung@micron.com>
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

8c9edf63

NTB: Schedule to receive on QP link up · 8b5a22d8

由 Allen Hubbe 提交于 7月 13, 2015

Schedule to receive on QP link up, to make sure that the doorbell is
properly cleared for interrupts.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

8b5a22d8

NTB: Fix oops in debugfs when transport is half-up · 260bee94

由 Dave Jiang 提交于 7月 13, 2015

When the remote side is not up, we do not have all the context for the
transport, and that causes NULL ptr access. Have the debugfs reads check
to see if transport is up before we make access.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

260bee94

NTB: Fix transport stats for multiple devices · c8650fd0

由 Dave Jiang 提交于 7月 13, 2015

Currently the debugfs does not have files for all NTB transport queue
pairs.  When there are multiple NTBs present in a system, the QP names
of the last transport clobber the names of previously added transport
QPs.  Only the last added QPs can be observed via debugfs.

Create a directory per NTB transport to associate the QPs with that
transport.  Name the directory the same as the PCI device.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

c8650fd0

NTB: Fix ntb_transport out-of-order RX update · da2e5ae5

由 Allen Hubbe 提交于 7月 13, 2015

It was possible for a synchronous update of the RX index in the error
case to get ahead of the asynchronous RX index update in the normal
case. Change the RX processing to preserve an RX completion order.

There were two error cases. First, if a buffer is not present to
receive data, there would be no queue entry to preserve the RX
completion order. Instead of dropping the RX frame, leave the RX frame
in the ring. Schedule RX processing when RX entries are enqueued, in
case there are RX frames waiting in the ring to be received.

Second, if a buffer is too small to receive data, drop the frame in the
ring, mark the RX entry as done, and indicate the error in the RX entry
length. Check for a negative length in the receive callback in
ntb_netdev, and count occurrences as rx_length_errors.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

da2e5ae5

05 7月, 2015 11 次提交

NTB: Print driver name and version in module init · 7eb38781

由 Dave Jiang 提交于 6月 15, 2015

Printouts driver name and version to indicate what is being loaded.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

7eb38781

NTB: Increase transport MTU to 64k from 16k · 9891417d

由 Dave Jiang 提交于 6月 03, 2015

Benchmarking showed a significant performance increase with the MTU size
to 64k instead of 16k.  Change the driver default to 64k.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

9891417d

NTB: Default to CPU memcpy for performance · a41ef053

由 Dave Jiang 提交于 5月 19, 2015

Disable DMA usage by default, since the CPU provides much better
performance with write combining.  Provide a module parameter to enable
DMA usage when offloading the memcpy is preferred.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

a41ef053

NTB: Improve performance with write combining · 06917f75

由 Dave Jiang 提交于 5月 19, 2015

Changing the memory window BAR mappings to write combining significantly
boosts the performance.  We will also use memcpy that uses non-temporal
store, which showed performance improvement when doing non-cached
memcpys.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

06917f75

NTB: Use NUMA memory and DMA chan in transport · 1199aa61

由 Allen Hubbe 提交于 5月 18, 2015

Allocate memory and request the DMA channel for the same NUMA node as
the NTB device.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

1199aa61

NTB: Rate limit ntb_qp_link_work · 28762289

由 Allen Hubbe 提交于 5月 11, 2015

When the ntb transport is connecting and waiting for the peer, the debug
console receives lots of debug level messages about the remote qp link
status being down.  Rate limit those messages.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

28762289

NTB: Reset transport QP link stats on down · 2849b5d7

由 Allen Hubbe 提交于 5月 12, 2015

Reset the link stats when the link goes down.  In particular, the TX and
RX index and count must be reset, or else the TX side will be sending
packets to the RX side where the RX side is not expecting them.  Reset
all the stats, to be consistent.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

2849b5d7

NTB: Do not advance transport RX on link down · c0900b33

由 Allen Hubbe 提交于 5月 12, 2015

On link down, don't advance RX index to the next entry.  The next entry
should never be valid after receiving the link down flag.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

c0900b33

NTB: Differentiate transport link down messages · e22e0b9d

由 Allen Hubbe 提交于 5月 12, 2015

The same message "qp %d: Link Down\n" was printed at two locations in
ntb_transport.  Change the messages so they are distinct.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

e22e0b9d

NTB: Read peer info from local SPAD in transport · 0f69a7df

由 Dave Jiang 提交于 6月 02, 2015

The transport was writing and then reading the peer scratch pad,
essentially reading what it just wrote instead of exchanging any
information with the peer.  The transport expects the peer values to be
the same as the local values, so this issue was not obvious.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

0f69a7df

NTB: Split ntb_hw_intel and ntb_transport drivers · e26a5843

由 Allen Hubbe 提交于 4月 09, 2015

Change ntb_hw_intel to use the new NTB hardware abstraction layer.

Split ntb_transport into its own driver.  Change it to use the new NTB
hardware abstraction layer.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

e26a5843

02 7月, 2015 1 次提交

NTB: Move files in preparation for NTB abstraction · ec110bc7

由 Allen Hubbe 提交于 5月 07, 2015

This patch only moves files to their new locations, before applying the
next two patches adding the NTB Abstraction layer.  Splitting this patch
from the next is intended make distinct which code is changed only due
to moving the files, versus which are substantial code changes in adding
the NTB Abstraction layer.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

ec110bc7

14 9月, 2014 2 次提交

ntb: Add alignment check to meet hardware requirement · 3cc5ba19

由 Dave Jiang 提交于 8月 28, 2014

The NTB translate register must have the value to be BAR size aligned.
This alignment check make sure that the DMA memory allocated has the
proper alignment. Another requirement for NTB to function properly with
memory window BAR size greater or equal to 4M is to use the CMA feature
in 3.16 kernel with the appropriate CONFIG_CMA_ALIGNMENT and
CONFIG_CMA_SIZE_MBYTES set.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

3cc5ba19

NTB: correct the spread of queues over mw's · a1413cfb

由 Jon Mason 提交于 6月 19, 2014

The detection of an uneven number of queues on the given memory windows
was not correct.  The mw_num is zero based and the mod should be
division to spread them evenly over the mw's.
Signed-off-by: NJon Mason <jon.mason@intel.com>

a1413cfb

08 4月, 2014 2 次提交

NTB: Code Style Clean-up · 53ca4fea

由 Jon Mason 提交于 11月 26, 2013

Some white space and 80 char overruns corrected.
Signed-off-by: NJon Mason <jon.mason@intel.com>

53ca4fea

NTB: client event cleanup · 403c63cb

由 Jon Mason 提交于 7月 29, 2013

Provide a better event interface between the client and transport
Signed-off-by: NJon Mason <jon.mason@intel.com>

403c63cb

21 11月, 2013 3 次提交

J
NTB: Disable interrupts and poll under high load · e8aeb60c
由 Jon Mason 提交于 4月 18, 2013
```
Disable interrupts and poll under high load
Signed-off-by: NJon Mason <jon.mason@intel.com>
```
e8aeb60c

NTB: correct dmaengine_get/put usage · 94681194

由 Jon Mason 提交于 11月 19, 2013

dmaengine_get() causes the initialization of the per-cpu channel tables.
It needs to be called prior to dma_find_channel().

Initial version by Dan Williams <dan.j.williams@intel.com>
Signed-off-by: NJon Mason <jon.mason@intel.com>

94681194

NTB: Fix ntb_transport link down race · fca4d518

由 Jon Mason 提交于 9月 09, 2013

A WARN_ON is being hit in ntb_qp_link_work due to the NTB transport link
being down while the ntb qp link is still active. This is caused by the
transport link being brought down prior to the qp link worker thread
being terminated. To correct this, shutdown the qp's prior to bringing
the transport link down. Also, only call the qp worker thread if it is
in interrupt context, otherwise call the function directly.
Signed-off-by: NJon Mason <jon.mason@intel.com>

fca4d518

15 11月, 2013 2 次提交

dmaengine: remove DMA unmap flags · 0776ae7b

由 Bartlomiej Zolnierkiewicz 提交于 10月 18, 2013

Remove no longer needed DMA unmap flags:
- DMA_COMPL_SKIP_SRC_UNMAP
- DMA_COMPL_SKIP_DEST_UNMAP
- DMA_COMPL_SRC_UNMAP_SINGLE
- DMA_COMPL_DEST_UNMAP_SINGLE

Cc: Vinod Koul <vinod.koul@intel.com>
Cc: Tomasz Figa <t.figa@samsung.com>
Cc: Dave Jiang <dave.jiang@intel.com>
Signed-off-by: NBartlomiej Zolnierkiewicz <b.zolnierkie@samsung.com>
Signed-off-by: NKyungmin Park <kyungmin.park@samsung.com>
Acked-by: NJon Mason <jon.mason@intel.com>
Acked-by: NMark Brown <broonie@linaro.org>
[djbw: clean up straggling skip unmap flags in ntb]
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

0776ae7b

NTB: convert to dmaengine_unmap_data · 6f57fd05

由 Bartlomiej Zolnierkiewicz 提交于 10月 18, 2013

Use the generic unmap object to unmap dma buffers.

Cc: Vinod Koul <vinod.koul@intel.com>
Cc: Tomasz Figa <t.figa@samsung.com>
Cc: Dave Jiang <dave.jiang@intel.com>
Signed-off-by: NBartlomiej Zolnierkiewicz <b.zolnierkie@samsung.com>
Signed-off-by: NKyungmin Park <kyungmin.park@samsung.com>
Acked-by: NJon Mason <jon.mason@intel.com>
[djbw: fix up unmap len, and GFP flags]
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

6f57fd05

06 9月, 2013 4 次提交

NTB: Comment Fix · f9a2cf89

由 Jon Mason 提交于 7月 29, 2013

Add "data" ntb_register_db_callback parameter description comment and
correct poor spelling.
Signed-off-by: NJon Mason <jon.mason@intel.com>

f9a2cf89

NTB: Remove unused variable · 3daa3a07

由 Jon Mason 提交于 9月 03, 2013

Remove unused pci_dev variable from ntb_transport_free()
Signed-off-by: NJon Mason <jon.mason@intel.com>

3daa3a07

NTB: Rename Variables for NTB-RP · 49793889

由 Jon Mason 提交于 7月 15, 2013

Many variable names in the NTB driver refer to the primary or secondary
side.  However, these variables will be used to access the reverse case
when in NTB-RP mode.  Make these names more generic in anticipation of
NTB-RP support.
Signed-off-by: NJon Mason <jon.mason@intel.com>

49793889

NTB: Use DMA Engine to Transmit and Receive · 282a2fee

由 Jon Mason 提交于 2月 12, 2013

Allocate and use a DMA engine channel to transmit and receive data over
NTB.  If none is allocated, fall back to using the CPU to transfer data.
Signed-off-by: NJon Mason <jon.mason@intel.com>
Reviewed-by: NDan Williams <dan.j.williams@intel.com>
Reviewed-by: NDave Jiang <dave.jiang@intel.com>

282a2fee

04 9月, 2013 2 次提交

NTB: Xeon Errata Workaround · 948d3a65

由 Jon Mason 提交于 4月 18, 2013

There is a Xeon hardware errata related to writes to SDOORBELL or
B2BDOORBELL in conjunction with inbound access to NTB MMIO Space, which
may hang the system. To workaround this issue, use one of the memory
windows to access the interrupt and scratch pad registers on the remote
system. This bypasses the issue, but removes one of the memory windows
from use by the transport. This reduction of MWs necessitates adding
some logic to determine the number of available MWs.

Since some NTB usage methodologies may have unidirectional traffic, the
ability to disable the workaround via modparm has been added.

See BF113 in
http://www.intel.com/content/dam/www/public/us/en/documents/specification-updates/xeon-c5500-c3500-spec-update.pdf
See BT119 in
http://www.intel.com/content/dam/www/public/us/en/documents/specification-updates/xeon-e5-family-spec-update.pdfSigned-off-by: NJon Mason <jon.mason@intel.com>

948d3a65

NTB: Correct debugfs to work with more than 1 NTB Device · 1517a3f2

由 Jon Mason 提交于 7月 30, 2013

Debugfs was setup in NTB to only have a single debugfs directory. This
resulted in the leaking of debugfs directories and files when multiple
NTB devices were present, due to each device stomping on the variables
containing the previous device's values (thus preventing them from being
freed on cleanup). Correct this by creating a secondary directory of
the PCI BDF for each device present, and nesting the previously existing
information in those directories.
Signed-off-by: NJon Mason <jon.mason@intel.com>

1517a3f2

16 5月, 2013 1 次提交

NTB: Multiple NTB client fix · 8b19d450

由 Jon Mason 提交于 4月 26, 2013

Fix issue with adding multiple ntb client devices to the ntb virtual
bus.  Previously, multiple devices would be added with the same name,
resulting in crashes.  To get around this issue, add a unique number to
the device when it is added.
Signed-off-by: NJon Mason <jon.mason@intel.com>

8b19d450