提交 · 301c2c3f039a1f9478f6cbef60f2ccd4da9bd4a1 · openeuler / Kernel

18 6月, 2011 2 次提交

RDMA/cxgb4: Don't truncate MR lengths · 301c2c3f

由 Steve Wise 提交于 6月 14, 2011

Remove left-over code from T3 that limited MR sizes to 32b.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

301c2c3f

RDMA/cxgb4: Don't exceed hw IQ depth limit for user CQs · 2ff7d09a

由 Steve Wise 提交于 6月 01, 2011

Memory allocated for user CQs gets rounded up to the next page
boundary. And after rounding, we recalculate the resulting IQ depth
and we need to make sure we don't exceed the HW limits.

This bug can result a much smaller CQ allocated than was expected if
the HW size field is exceeded, resulting in CQ overflow failures.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

2ff7d09a

25 5月, 2011 1 次提交

RDMA/cxgb4: Use completion objects for event blocking · c337374b

由 Steve Wise 提交于 5月 20, 2011

There exists a race condition when using wait_queue_head_t objects
that are declared on the stack.  This was being done in a few places
where we are sending work requests to the FW and awaiting replies, but
we don't have an endpoint structure with an embedded c4iw_wr_wait
struct.  So the code was allocating it locally on the stack.  Bad
design.  The race is:

  1) thread on cpuX declares the wait_queue_head_t on the stack, then
     posts a firmware WR with that wait object ptr as the cookie to be
     returned in the WR reply.  This thread will proceed to block in
     wait_event_timeout() but before it does:

  2) An interrupt runs on cpuY with the WR reply.  fw6_msg() handles
     this and calls c4iw_wake_up().  c4iw_wake_up() sets the condition
     variable in the c4iw_wr_wait object to TRUE and will call
     wake_up(), but before it calls wake_up():

  3) The thread on cpuX calls c4iw_wait_for_reply(), which calls
     wait_event_timeout().  The wait_event_timeout() macro checks the
     condition variable and returns immediately since it is TRUE.  So
     this thread never blocks/sleeps. The function then returns
     effectively deallocating the c4iw_wr_wait object that was on the
     stack.

  4) So at this point cpuY has a pointer to the c4iw_wr_wait object
     that is no longer valid.  Further its pointing to a stack frame
     that might now be in use by some other context/thread.  So cpuY
     continues execution and calls wake_up() on a ptr to a wait object
     that as been effectively deallocated.

This race, when it hits, can cause a crash in wake_up(), which I've
seen under heavy stress. It can also corrupt the referenced stack
which can cause any number of failures.

The fix:

Use struct completion, which supports on-stack declarations.
Completions use a spinlock around setting the condition to true and
the wake up so that steps 2 and 4 above are atomic and step 3 can
never happen in-between.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>

c337374b

10 5月, 2011 5 次提交

RDMA/cxgb4: EEH errors can hang the driver · 2f25e9a5

由 Steve Wise 提交于 5月 09, 2011

A few more EEH fixes:

c4iw_wait_for_reply(): detect fatal EEH condition on timeout and
return an error.

The iw_cxgb4 driver was only calling ib_deregister_device() on an EEH
event followed by a ib_register_device() when the device was
reinitialized.  However, the RDMA core doesn't allow multiple
iterations of register/deregister by the provider. See
drivers/infiniband/core/sysfs.c: ib_device_unregister_sysfs() where
the kobject ref is held until the device is deallocated in
ib_deallocate_device().  Calling deregister adds this kobj reference,
and then a subsequent register call will generate a WARN_ON() from the
kobject subsystem because the kobject is being initialized but is
already initialized with the ref held.

So the provider must deregister and dealloc when resetting for an EEH
event, then alloc/register to re-initialize.  To do this, we cannot
use the device ptr as our ULD handle since it will change with each
reallocation.  This commit adds a ULD context struct which is used as
the ULD handle, and then contains the device pointer and other state
needed.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

2f25e9a5

RDMA/cxgb4: Reset wait condition atomically · d9594d99

由 Steve Wise 提交于 5月 09, 2011

The driver was never really waiting for RDMA_WR/FINI completions
because the condition variable used to determine if the completion
happened was never reset, and this condition variable is reused for
both connection setup and teardown.  This causes various driver
crashes under heavy loads due to releasing resources too early.

The fix is to use atomic bits to correctly reset the condition
immediately after the completion is detected.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

d9594d99

RDMA/cxgb4: Fix missing parentheses · 85d215b0

由 Roel Kluin 提交于 5月 09, 2011

Parens are missing: '|' has a higher presedence than '?'.
Signed-off-by: NRoel Kluin <roel.kluin@gmail.com>
Acked-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

85d215b0

RDMA/cxgb4: Initialization errors can cause crash · bbe9a0a2

由 Steve Wise 提交于 5月 09, 2011

c4iw_uld_add() must return ERR_PTR() values instead of NULL on failure.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

bbe9a0a2

RDMA/cxgb4: Don't change QP state outside EP lock · 30c95c2d

由 Steve Wise 提交于 5月 09, 2011

Concurrent ingress CLOSE and ULP ABORT operations causes a crash due
to a race condition where the close path releases the EP lock and then
tries to move the QP state to CLOSED.  This must be done inside the EP
lock to avoid the race.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

30c95c2d

04 5月, 2011 1 次提交
- D
  ipv4: Make caller provide on-stack flow key to ip_route_output_ports(). · 31e4543d
  由 David S. Miller 提交于 5月 03, 2011
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  31e4543d
27 4月, 2011 1 次提交

cxgb4: use pgprot_writecombine() on powerpc · e297d9dd

由 Nishanth Aravamudan 提交于 3月 14, 2011

Commit fe3cc0d9 ("powerpc: Add
pgprot_writecombine") in benh's tree exposes the pgprot_writecombine()
API to drivers on powerpc. cxgb4 has an open-coded version of the same,
so use the common API now that it's available.
Signed-off-by: NNishanth Aravamudan <nacc@us.ibm.com>
Cc: Steve Wise <swise@opengridcomputing.com>
Cc: Anton Blanchard <anton@samba.org>
Acked-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

e297d9dd

15 3月, 2011 7 次提交

RDMA/cxgb4: Debugfs dump_qp() updates · db5d040d

由 Steve Wise 提交于 3月 11, 2011

- Show whether the SQ is in onchip memory or not.
- Dump both SQ and RQ QIDs.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

db5d040d

RDMA/cxgb4: Dispatch FATAL event on EEH errors · 767fbe81

由 Steve Wise 提交于 3月 11, 2011

This at least kicks the user mode applications that are watching for
device events.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

767fbe81

RDMA/cxgb4: Use ULP_MODE_TCPDDP · b48f3b9c

由 Steve Wise 提交于 3月 11, 2011

Set the ULP mode for initial RDMA connection setup to the proper DDP
mode. This avoids wasting some HW resources while in streaming mode.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

b48f3b9c

RDMA/cxgb4: Enable on-chip SQ support by default · a9c77198

由 Steve Wise 提交于 3月 11, 2011

Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

a9c77198

RDMA/cxgb4: Do CIDX_INC updates every 1/16 CQ depth CQE reaps · ffc3f748

由 Steve Wise 提交于 3月 11, 2011

This avoids the CIDX_INC overflow issue with T4A2 when running
kernel RDMA applications.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

ffc3f748

RDMA/cxgb4: Remove db_drop_task · 29428137

由 Steve Wise 提交于 3月 11, 2011

Unloading iw_cxgb4 can crash due to the unload code trying to use
db_drop_task, which is uninitialized.  So remove this dead code.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

29428137

RDMA/cxgb4: Turn on delayed ACK · b52fe09e

由 Steve Wise 提交于 3月 11, 2011

Set the default to on.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

b52fe09e

13 3月, 2011 1 次提交

ipv4: Create and use route lookup helpers. · 78fbfd8a

由 David S. Miller 提交于 3月 12, 2011

The idea here is this minimizes the number of places one has to edit
in order to make changes to how flows are defined and used.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

78fbfd8a

03 3月, 2011 1 次提交
- D
  ipv4: Make output route lookup return rtable directly. · b23dd4fe
  由 David S. Miller 提交于 3月 02, 2011
```
Instead of on the stack.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  b23dd4fe
02 3月, 2011 2 次提交
- D
  ipv4: Kill can_sleep arg to ip_route_output_flow() · 273447b3
  由 David S. Miller 提交于 3月 01, 2011
```
This boolean state is now available in the flow flags.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  273447b3
- D
  ipv4: Make final arg to ip_route_output_flow to be boolean "can_sleep" · 420d44da
  由 David S. Miller 提交于 3月 01, 2011
```
Since that is what the current vague "flags" argument means.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  420d44da
29 1月, 2011 2 次提交

RDMA/cxgb4: Set the correct device physical function for iWARP connections · 94788657

由 Steve Wise 提交于 1月 21, 2011

The PF passed to FW was 0, causing PCI failures in an SR-IOV environment.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Cc: <stable@kernel.org>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

94788657

RDMA/cxgb4: Limit MAXBURST EQ context field to 256B · 6a09a9d6

由 Steve Wise 提交于 1月 21, 2011

MAXBURST cannot exceed 256B for on-chip queues.  With a 512B MAXBURST,
we can lock up the chip.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Cc: <stable@kernel.org>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

6a09a9d6

11 1月, 2011 2 次提交

RDMA/cxgb4: Don't re-init wait object in init/fini paths · db8b1016

由 Steve Wise 提交于 1月 10, 2011

Re-initializing the wait object in rdma_init()/rdma_fini() causes a
timing window which can lead to a deadlock during close.  Once this
deadlock hits, all RDMA activity over the T4 device will be stuck.

There's no need to re-init the wait object, so remove it.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Cc: <stable@kernel.org>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

db8b1016

RDMA/cxgb3,cxgb4: Remove dead code · c9431091

由 Stephen Hemminger 提交于 1月 10, 2011

This removes unused code found by running 'make namespacecheck';
compile tested only.
Signed-off-by: NStephen Hemminger <shemminger@vyatta.com>
Acked-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

c9431091

15 11月, 2010 1 次提交

infiniband: Only include mutex.h once in drivers/infiniband/hw/cxgb4/iw_cxgb4.h · e987fa35

由 Jesper Juhl 提交于 11月 07, 2010

Only include the header linux/mutex.h once inside
drivers/infiniband/hw/cxgb4/iw_cxgb4.h
Signed-off-by: NJesper Juhl <jj@chaosbits.net>
Signed-off-by: NJiri Kosina <jkosina@suse.cz>

e987fa35

27 10月, 2010 1 次提交

RDMA/cxgb4: Remove unnecessary KERN_<level> use · aa1ad260

由 Joe Perches 提交于 10月 25, 2010

Signed-off-by: NJoe Perches <joe@perches.com>
Acked-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

aa1ad260

24 10月, 2010 1 次提交

IB: Replace EXTRA_CFLAGS with ccflags-y · 7454159d

由 matt mooney 提交于 9月 24, 2010

Signed-off-by: Nmatt mooney <mfm@muteddisk.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

7454159d

23 10月, 2010 2 次提交

RDMA/cxgb4: Use cxgb4 service for packet gl to skb · da411ba1

由 Steve Wise 提交于 10月 18, 2010

Remove the local service t4_pktgl_to_skb() and use cxgb4_pktgl_to_skb()
exported by cxgb4.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

da411ba1

RDMA/cxgb4: Export T4 TCP MIB · de5dd81b

由 Steve Wise 提交于 10月 18, 2010

Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

de5dd81b

18 10月, 2010 1 次提交

Update broken web addresses in the kernel. · 631dd1a8

由 Justin P. Mattock 提交于 10月 18, 2010

The patch below updates broken web addresses in the kernel
Signed-off-by: NJustin P. Mattock <justinmattock@gmail.com>
Cc: Maciej W. Rozycki <macro@linux-mips.org>
Cc: Geert Uytterhoeven <geert@linux-m68k.org>
Cc: Finn Thain <fthain@telegraphics.com.au>
Cc: Randy Dunlap <rdunlap@xenotime.net>
Cc: Matt Turner <mattst88@gmail.com>
Cc: Dimitry Torokhov <dmitry.torokhov@gmail.com>
Cc: Mike Frysinger <vapier.adi@gmail.com>
Acked-by: NBen Pfaff <blp@cs.stanford.edu>
Acked-by: NHans J. Koch <hjk@linutronix.de>
Reviewed-by: NFinn Thain <fthain@telegraphics.com.au>
Signed-off-by: NJiri Kosina <jkosina@suse.cz>

631dd1a8

12 10月, 2010 2 次提交

RDMA/cxgb4: Use simple_read_from_buffer() for debugfs handlers · 3160977a

由 Steve Wise 提交于 9月 29, 2010

We can replace our equivalent open-coded version.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

3160977a

RDMA/cxgb4: Add default_llseek to debugfs files · 8bbac892

由 Steve Wise 提交于 9月 29, 2010

Incorporate BKL removal changes.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

8bbac892

29 9月, 2010 7 次提交

RDMA/cxgb4: Fastreg NSMR fixes · 40dbf6ee

由 Steve Wise 提交于 9月 17, 2010

- Remove dsgl support - doesn't work in T4.
- Wrap the immediate PBL as needed when building it in the wr.
- Adjust max pbl depth allowed based on ulptx alignment requirements.
- Bump the slots per SQ to 5 to allow up to 128MB fast registers.
- Advertise fastreg support by default.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

40dbf6ee

RDMA/cxgb4: Don't set completion flag for read requests · 410ade4c

由 Steve Wise 提交于 9月 17, 2010

Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

410ade4c

RDMA/cxgb4: Set the default TCP send window to 128KB · 98ae68b7

由 Steve Wise 提交于 9月 10, 2010

This helps with large IO throughput.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

98ae68b7

RDMA/cxgb4: Use a mutex for QP and EP state transitions · 2f5b48c3

由 Steve Wise 提交于 9月 10, 2010

Move the connection setup/teardown paths to the workq thread removing
spin lock/irq disable requirements for these paths.  This allows calls
down to the LLD for EP and QP state transition actions to be atomic
with respect to processing CPL messages coming up from the HW.
Namely, calls to rdma_init() and rdma_fini() can now be called with
the mutex held avoiding many race conditions with the abort path.

The QP spinlock is still used but only to manipulate the qp state.  This
allows the fastpaths, poll, post_send, and pos_recv, to run in the
irq context.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

2f5b48c3

RDMA/cxgb4: Support on-chip SQs · c6d7b267

由 Steve Wise 提交于 9月 13, 2010

T4 support on-chip SQs to reduce latency.  This patch adds support for
this in iw_cxgb4:

 - Manage ocqp memory like other adapter mem resources.
 - Allocate user mode SQs from ocqp mem if available.
 - Map ocqp mem to user process using write combining.
 - Map PCIE_MA_SYNC reg to user process.

Bump uverbs ABI.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

c6d7b267

RDMA/cxgb4: Centralize the wait logic · aadc4df3

由 Steve Wise 提交于 9月 10, 2010

Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

aadc4df3

RDMA/cxgb4: debugfs files for dumping active stags · 9e8d1fa3

由 Steve Wise 提交于 9月 10, 2010

Add "stags" debugfs file.  This is useful for examining the TPTE and
PBL entries in adapter memory.  It allows scripts to dump just the
active entries.

Also clean up the "qps" file handlers and shared common code.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

9e8d1fa3

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功