提交 · 19645a077120c6417e9dc5ad469c45194cf78a82 · openeuler / raspberrypi-kernel

05 8月, 2016 6 次提交

ntb_transport: Check the number of spads the hardware supports · 19645a07

由 Logan Gunthorpe 提交于 6月 07, 2016

I'm working on hardware that currently has a limited number of
scratchpad registers and ntb_ndev fails with no clue as to why. I
feel it is better to fail early and provide a reasonable error message
then to fail later on.

The same is done to ntb_perf, but it doesn't currently require enough
spads to actually fail. I've also removed the unused SPAD_MSG and
SPAD_ACK enums so that MAX_SPAD accurately reflects the number of
spads used.
Signed-off-by: NLogan Gunthorpe <logang@deltatee.com>
Acked-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

19645a07

ntb_tool: Add memory window debug support · 8b71d285

由 Logan Gunthorpe 提交于 6月 03, 2016

We allocate some memory window buffers when the link comes up, then we
provide debugfs files to read/write each side of the link.

This is useful for debugging the mapping when writing new drivers.
Signed-off-by: NLogan Gunthorpe <logang@deltatee.com>
Acked-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

8b71d285

ntb_perf: Allow limiting the size of the memory windows · 4aae9777

由 Logan Gunthorpe 提交于 6月 03, 2016

On my system, dma_alloc_coherent won't produce memory anywhere
near the size of the BAR. So I needed a way to limit this.

It's pretty much copied straight from ntb_transport.
Signed-off-by: NLogan Gunthorpe <logang@deltatee.com>
Acked-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

4aae9777

NTB: allocate number transport entries depending on size of ring size · a754a8fc

由 Dave Jiang 提交于 4月 08, 2016

Currently we only allocate a fixed default number of descriptors for the tx
and rx side. We should dynamically resize it to the number of descriptors
resides in the transport rings. We should know the number of transmit
descriptors at initializaiton. We will allocate the default number of
descriptors for receive side and allocate additional ones when we know the
actual max entries for receive.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Acked-by: NAllen Hubbe <allen.hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

a754a8fc

ntb_tool: BUG: Ensure the buffer size is large enough to return all spads · 625f0802

由 Logan Gunthorpe 提交于 6月 20, 2016

On hardware with 32 scratchpad registers the spad field in ntb tool
could chop off the end. The maximum buffer size is increased from
256 to 15 times the number or scratchpads.
Signed-off-by: NLogan Gunthorpe <logang@deltatee.com>
Acked-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

625f0802

ntb_tool: Fix infinite loop bug when writing spad/peer_spad file · c792eba1

由 Logan Gunthorpe 提交于 5月 27, 2016

If you tried to write two spads in one line, as per the example:

root@peer# echo '0 0x01010101 1 0x7f7f7f7f' > $DBG_DIR/peer_spad

then the CPU would freeze in an infinite loop.

This wasn't immediately obvious but 'pos' was not incrementing the
buffer, so after reading the second pair of values, 'pos' would once
again be 3 and it would re-read the second pair of values ad infinitum.
Signed-off-by: NLogan Gunthorpe <logang@deltatee.com>
Acked-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

c792eba1

26 3月, 2016 1 次提交

NTB: Remove _addr functions from ntb_hw_amd · 4f1b50c3

由 Allen Hubbe 提交于 3月 21, 2016

Kernel zero day testing warned about address space confusion.  A virtual
iomem address was used where a physical address is expected.  The
offending functions implement an optional part of the api, so they are
removed.  They can be added later, after testing.

Fixes: a1b36958Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Acked-by: NXiangliang Yu <Xiangliang.Yu@amd.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

4f1b50c3

22 3月, 2016 2 次提交

NTB: Fix incorrect clean up routine in ntb_perf · 838850ee

由 Dave Jiang 提交于 3月 18, 2016

The clean up routine when we failed to allocate kthread is not cleaning
up all the threads, only the same one over and over again.
Reported-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Acked-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

838850ee

NTB: Fix incorrect return check in ntb_perf · ddc8f6fe

由 Dave Jiang 提交于 3月 18, 2016

kthread_create_no_node() returns error pointers, never NULL. Fix check so
it handles error correctly.
Reported-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

ddc8f6fe

18 3月, 2016 5 次提交

ntb: fix possible NULL dereference · 2572c7fb

由 Sudip Mukherjee 提交于 3月 10, 2016

kmalloc can fail and we should check for NULL before using the pointer
returned by kmalloc.
Signed-off-by: NSudip Mukherjee <sudip.mukherjee@codethink.co.uk>
Acked-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

2572c7fb

ntb: add missing setup of translation window · ee5f750f

由 Dave Jiang 提交于 3月 07, 2016

The perf tool is missing the setup of translation window. Adding call to
setup the translation window for backed memory.
Signed-off-by: NJohn Kading <john.kading@gd-ms.com>
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

ee5f750f

ntb: stop link work when we do not have memory · 84f76685

由 Dave Jiang 提交于 2月 29, 2016

Instead of keep trying to go through the init routine when we aren't able
to allocate memory, we should just stop and go down.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

84f76685

ntb: stop tasklet from spinning forever during shutdown. · e9021331

由 Dave Jiang 提交于 2月 23, 2016

We can leave tasklet spinning forever if we disable the tasklet during
qp shutdown and the tasklets are still being kicked off. This hopefully
should avoid that race condition.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Reported-by: NAlex Depoutovitch <alex@pernixdata.com>
Tested-by: NAlex Depoutovitch <alex@pernixdata.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

e9021331

ntb: perf test: fix address space confusion · 1985a881

由 Arnd Bergmann 提交于 1月 26, 2016

The ntb driver assigns between pointers an __iomem tokens, and
also casts them to 64-bit integers, which results in compiler
warnings on 32-bit systems:

drivers/ntb/test/ntb_perf.c: In function 'perf_copy':
drivers/ntb/test/ntb_perf.c:213:10: error: cast from pointer to integer of different size [-Werror=pointer-to-int-cast]
  vbase = (u64)(u64 *)mw->vbase;
          ^
drivers/ntb/test/ntb_perf.c:214:14: error: cast from pointer to integer of different size [-Werror=pointer-to-int-cast]
  dst_vaddr = (u64)(u64 *)dst;
              ^

This adds __iomem annotations where needed and changes the temporary
variables to iomem pointers to avoid casting them to u64. I did not
see the problem in linux-next earlier, but it show showed up in
4.5-rc1.
Signed-off-by: NArnd Bergmann <arnd@arndb.de>
Acked-by: NDave Jiang <dave.jiang@intel.com>
Fixes: 8a7b6a77 ("ntb: ntb perf tool")
Signed-off-by: NJon Mason <jdmason@kudzu.us>

1985a881

22 1月, 2016 2 次提交

NTB: Fix macro parameter conflict with field name · 03beaec8

由 Allen Hubbe 提交于 1月 21, 2016

If the parameter given to the macro is replaced throughout the macro as
it is evaluated.  The intent is that the macro parameter should replace
the only the first parameter to container_of().  However, the way the
macro was written, it would also inadvertantly replace a structure field
name.  If a parameter of any other name is given to the macro, it will
fail to compile, if the structure does not contain a field of the same
name.  At worst, it will compile, and hide improper access of an
unintended field in the structure.

Change the macro parameter name, so it does not conflict with the
structure field name.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Acked-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

03beaec8

NTB: Add support for AMD PCI-Express Non-Transparent Bridge · a1b36958

由 Xiangliang Yu 提交于 1月 21, 2016

This adds support for AMD's PCI-Express Non-Transparent Bridge
(NTB) device on the Zeppelin platform. The driver connnects to the
standard NTB sub-system interface, with modification to add hooks
for power management in a separate patch. The AMD NTB device has 3
memory windows, 16 doorbell, 16 scratch-pad registers, and supports
up to 16 PCIe lanes running a Gen3 speeds.
Signed-off-by: NXiangliang Yu <Xiangliang.Yu@amd.com>
Reviewed-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

a1b36958

18 1月, 2016 1 次提交

ntb: ntb perf tool · 8a7b6a77

由 Dave Jiang 提交于 1月 13, 2016

Providing raw performance data via a tool that directly access data from
NTB w/o any software overhead. This allows measurement of the hardware
performance limit. In revision one we are only doing single direction
CPU and DMA writes. Eventually we will provide bi-directional writes.

The measurement using DMA engine for NTB performance measure does
not measure the raw performance of DMA engine over NTB due to software
overhead. But it should provide the peak performance through the Linux DMA
driver.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Tested-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

8a7b6a77

11 1月, 2016 3 次提交

NTB: Address out of DMA descriptor issue with NTB · 8c874cc1

由 Dave Jiang 提交于 1月 08, 2016

The transport right now does not handle the case where we run out of DMA
descriptors. We just fail when we do not succeed. Adding code to retry for
a bit attempting to use the DMA engine instead of instantly fail to CPU
copy.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Reviewed-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

8c874cc1

NTB: Clear property bits in BAR value · 703872c2

由 Dave Jiang 提交于 11月 19, 2015

The lower bits read from a BAR register will contain property bits
that we do not care about. Clear those so that we can use the BAR
values for limit and xlat registers.
Reported-by: NConrad Meyer <cem@freebsd.org>
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

703872c2

NTB: ntb_process_tx error path bug · 179f912a

由 Jon Mason 提交于 12月 18, 2015

The transmit overrun avoidance error path in ntb_process_tx accidentally
swapped the first two values being passed to the tx_handler client.
This could result in crashes in the ntb_netdev (or other out-of-tree NTB
clients).
Reported-by: NAlex Depoutovitch <alex@pernixdata.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

179f912a

09 11月, 2015 6 次提交

NTB: fix 32-bit compiler warning · fdcb4b2e

由 Arnd Bergmann 提交于 10月 07, 2015

resource_size_t may be 32-bit wide on some architectures, which causes
this warning when building the NTB code:

drivers/ntb/ntb_transport.c: In function 'ntb_transport_link_work':
drivers/ntb/ntb_transport.c:828:46: warning: right shift count >= width of type [-Wshift-count-overflow]

The warning is harmless but can be avoided by using the upper_32_bits()
macro.
Signed-off-by: NArnd Bergmann <arnd@arndb.de>
Fixes: e26a5843 ("NTB: Split ntb_hw_intel and ntb_transport drivers")
Signed-off-by: NJon Mason <jdmason@kudzu.us>

fdcb4b2e

NTB: unify translation addresses · 8b782fab

由 Dave Jiang 提交于 9月 24, 2015

There is no need for the upstream and downstream addresses to be different
for the NTB configs. Go to using a single set of address. It is still
possible to configure them differently using module parameter override
however.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Acked and Tested-by: Allen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

8b782fab

NTB: invalid buf pointer in multi-MW setups · c92ba3c5

由 Jon Mason 提交于 10月 04, 2015

Order of operations issue with the QP Num and MW count, which would
result in the receive buffer pointer being invalid if there are more
than 1 MW.  Corrected with parenthesis to enforce the proper order of
operations.
Reported-by: NJohn I. Kading <John.Kading@gd-ms.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

c92ba3c5

NTB: remove unused variable · 70d4687d

由 Sudip Mukherjee 提交于 10月 03, 2015

These variables were not used anywhere. So remove them.
Signed-off-by: NSudip Mukherjee <sudip@vectorindia.org>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

70d4687d

NTB: fix access of free-ed pointer · d4adee09

由 Sudip Mukherjee 提交于 10月 03, 2015

We were accessing nt->mw_vec after freeing it. Fix the error path so
that we free nt->mw_vec after we have finished using it.
Signed-off-by: NSudip Mukherjee <sudip@vectorindia.org>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

d4adee09

NTB: Fix issue where we may be accessing NULL ptr · 04afde45

由 Dave Jiang 提交于 9月 17, 2015

smatch detected an issue in the function ntb_transport_max_size() where
we could be dereferencing a dma channel pointer when it is NULL.
Reported-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

04afde45

08 9月, 2015 8 次提交

NTB: Fix range check on memory window index · 9a07826f

由 Allen Hubbe 提交于 8月 31, 2015

The range check must exclude the upper bound.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

9a07826f

NTB: Improve index handling in B2B MW workaround · 2aa2a77a

由 Allen Hubbe 提交于 8月 31, 2015

Check that b2b_mw_idx is in range of the number of memory windows when
initializing the device.  The workaround is considered to be in effect
only if the device b2b_idx is exactly UINT_MAX, instead of any index
past the last memory window.

Only print B2B MW workaround information in debugfs if the workaround is
in effect.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

2aa2a77a

NTB: Use unique DMA channels for TX and RX · 569410ca

由 Dave Jiang 提交于 7月 13, 2015

Allocate two DMA channels, one for TX operation and one for RX
operation, instead of having one DMA channel for everything. This
provides slightly better performance, and also will make error handling
cleaner later on.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

569410ca

NTB: Remove dma_sync_wait from ntb_async_rx · 905921e7

由 Allen Hubbe 提交于 7月 13, 2015

The dma_sync_wait can hurt the performance of workloads mixed with both
large and small frames. Large frames will be copied using the dma
engine. Small frames will be copied by the cpu. The dma_sync_wait
prevents the cpu and dma engine copying in parallel.

In the period where the cpu is copying, the dma engine is stopped. The
dma engine is not doing any useful work to copy large frames during that
time, and the additional time to restart the dma engine for the next
large frame. This will decrease the throughput for the portion of a
workload with large frames.

In the period where the dma engine is copying, the cpu is held up
waiting for dma to complete. The small frames processing will be
delayed until the dma is complete. The RX frames are completed
in-order, and the processing of small frames takes very little time, so
dma_sync_wait may have an insignificant impact on the respose time of
frames. The more significant impact is to the system, because the delay
in dma_sync_wait is implemented as busy non-blocking wait. This can
prevent the delayed core from doing any useful work, even if it could be
processing work for other drivers, unrelated to transport RX processing.

After applying the earlier patch to fix out-of-order RX acknoledgement,
the dma_sync_wait is no longer necessary. Remove it, so that cpu memcpy
will proceed immediately for small frames, in parallel with ongoing dma
for large frames. Do not hold up the cpu from doing work while dma is
in progress. The prior fix will continue to ensure in-order completion
of the RX frames to the upper layer, and in-order delivery of the RX
acknoledgement.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

905921e7

NTB: Clean up QP stats info · d98ef99e

由 Dave Jiang 提交于 7月 13, 2015

Make QP stats info more readable for debugging purposes.  Also add an
entry to indicate whether DMA is being used.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

d98ef99e

NTB: Make the transport list in order of discovery · 31510000

由 Dave Jiang 提交于 7月 13, 2015

The list should be added from the bottom and not the top in order to
ensure the transport is provided in the same order to clients as ntb
devices are discovered.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

31510000

NTB: Add PCI Device IDs for Broadwell Xeon · 0a5d19d9

由 Dave Jiang 提交于 7月 13, 2015

Adding PCI Device IDs for B2B (back to back), RP (root port, primary),
and TB (transparent bridge, secondary) devices.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

0a5d19d9

NTB: Add flow control to the ntb_netdev · e74bfeed

由 Dave Jiang 提交于 7月 13, 2015

Right now if we push the NTB really hard, we start dropping packets due
to not able to process the packets fast enough. We need to st:qop the
upper layer from flooding us when that happens.

A timer is necessary in order to restart the queue once the resource has
been processed on the receive side. Due to the way NTB is setup, the
resources on the tx side are tied to the processing of the rx side and
there's no async way to know when the rx side has released those
resources.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

e74bfeed

10 8月, 2015 6 次提交

ntb: avoid format string in dev_set_name · e15f9409

由 Kees Cook 提交于 7月 24, 2015

Avoid any chance of format string expansion when calling dev_set_name.
Signed-off-by: NKees Cook <keescook@chromium.org>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

e15f9409

NTB: Fix dereference before check · 30a4bb1e

由 Allen Hubbe 提交于 7月 13, 2015

Remove early dereference of a pointer that is checked later in the code.
Reported-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

30a4bb1e

NTB: Fix zero size or integer overflow in ntb_set_mw · 8c9edf63

由 Allen Hubbe 提交于 7月 13, 2015

A plain 32 bit integer will overflow for values over 4GiB.

Change the plain integer size to the appropriate size type in
ntb_set_mw.  Change the type of the size parameter and two local
variables used for size.

Even if there is no overflow, a size of zero is invalid here.
Reported-by: NJuyoung Jung <jjung@micron.com>
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

8c9edf63

NTB: Schedule to receive on QP link up · 8b5a22d8

由 Allen Hubbe 提交于 7月 13, 2015

Schedule to receive on QP link up, to make sure that the doorbell is
properly cleared for interrupts.
Signed-off-by: NAllen Hubbe <Allen.Hubbe@emc.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

8b5a22d8

NTB: Fix oops in debugfs when transport is half-up · 260bee94

由 Dave Jiang 提交于 7月 13, 2015

When the remote side is not up, we do not have all the context for the
transport, and that causes NULL ptr access. Have the debugfs reads check
to see if transport is up before we make access.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

260bee94

NTB: Fix transport stats for multiple devices · c8650fd0

由 Dave Jiang 提交于 7月 13, 2015

Currently the debugfs does not have files for all NTB transport queue
pairs.  When there are multiple NTBs present in a system, the QP names
of the last transport clobber the names of previously added transport
QPs.  Only the last added QPs can be observed via debugfs.

Create a directory per NTB transport to associate the QPs with that
transport.  Name the directory the same as the PCI device.
Signed-off-by: NDave Jiang <dave.jiang@intel.com>
Signed-off-by: NJon Mason <jdmason@kudzu.us>

c8650fd0