提交 · 3786cf189f8b39cac870193368f9ad9f95fff9a4 · openanolis / cloud-kernel

06 12月, 2011 2 次提交

infiniband: cxgb4: Consolidate 3 copies of the same operation into 1 helper function. · 3786cf18

由 David Miller 提交于 12月 02, 2011

Three pieces of code do the same thing, create a l2t entry and then
import this information into the c4iw_ep object.

Create a helper function and call it from these 3 locations instead.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
Acked-by: NRoland Dreier <roland@purestorage.com>

3786cf18

net: Rename dst_get_neighbour{, _raw} to dst_get_neighbour_noref{, _raw}. · 27217455

由 David Miller 提交于 12月 02, 2011

To reflect the fact that a refrence is not obtained to the
resulting neighbour entry.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
Acked-by: NRoland Dreier <roland@purestorage.com>

27217455

30 11月, 2011 1 次提交

IB: Fix RCU lockdep splats · 580da35a

由 Eric Dumazet 提交于 11月 29, 2011

Commit f2c31e32 ("net: fix NULL dereferences in check_peer_redir()")
forgot to take care of infiniband uses of dst neighbours.

Many thanks to Marc Aurele who provided a nice bug report and feedback.
Reported-by: NMarc Aurele La France <tsi@ualberta.ca>
Signed-off-by: NEric Dumazet <eric.dumazet@gmail.com>
Cc: David Miller <davem@davemloft.net>
Cc: <stable@kernel.org>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

580da35a

29 11月, 2011 2 次提交

RDMA/cxgb4: Fix retry with MPAv1 logic for MPAv2 · 01b225e1

由 Kumar Sanghvi 提交于 11月 28, 2011

Fix logic so that we don't retry with MPAv1 once we have done that
already. Otherwise, we end up retrying with MPAv1 even when its not
needed on getting peer aborts - and this could lead to kernel panic.
Signed-off-by: NKumar Sanghvi <kumaras@chelsio.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

01b225e1

RDMA/cxgb4: Fix iw_cxgb4 count_rcqes() logic · c34c97ad

由 Jonathan Lallinger 提交于 10月 20, 2011

Fix another place in the code where logic dealing with the t4_cqe was
using the wrong QID.  This fixes the counting logic so that it tests
against the SQ QID instead of the RQ QID when counting RCQES.

Signed-off by: Jonathan Lallinger <jonathan@ogc.us>
Signed-off by: Steve Wise <swise@ogc.us>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

c34c97ad

01 11月, 2011 3 次提交

infiniband: Fix up module files that need to include module.h · e4dd23d7

由 Paul Gortmaker 提交于 5月 27, 2011

They had been getting it implicitly via device.h but we can't
rely on that for the future, due to a pending cleanup so fix
it now.
Signed-off-by: NPaul Gortmaker <paul.gortmaker@windriver.com>

e4dd23d7

RDMA/cxgb4: Mark QP in error before disabling the queue in firmware · d32ae393

由 Tom Tucker 提交于 10月 25, 2011

QPs need to be moved to error before telling the firwmare to shutdown
the queue.  Otherwise, the application can submit WRs that will never
get fetched by the hardware and never flushed by the driver.
Signed-off-by: NKumar Sanghvi <kumaras@chelsio.com>
Acked-by: NSteve Wise <swsie@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

d32ae393

RDMA/cxgb4: Serialize calls to CQ's comp_handler · 581bbe2c

由 Kumar Sanghvi 提交于 10月 24, 2011

Commit 01e7da6b ("RDMA/cxgb4: Make sure flush CQ entries are
collected on connection close") introduced a potential problem where a
CQ's comp_handler can get called simultaneously from different places
in the iw_cxgb4 driver.  This does not comply with
Documentation/infiniband/core_locking.txt, which states that at a
given point of time, there should be only one callback per CQ should
be active.

This problem was reported by Parav Pandit <Parav.Pandit@Emulex.Com>.
Based on discussion between Parav Pandit and Steve Wise, this patch
fixes the above problem by serializing the calls to a CQ's
comp_handler using a spin_lock.
Reported-by: NParav Pandit <Parav.Pandit@Emulex.Com>
Signed-off-by: NKumar Sanghvi <kumaras@chelsio.com>
Acked-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

581bbe2c

15 10月, 2011 2 次提交

RDMA/cxgb4: Use correct QID in insert_recv_cqe() · e14d62c0

由 Jonathan Lallinger 提交于 10月 13, 2011

When creating flushed receive CQEs, set the QPID field in the t4_cqe
to the SQ QID and not the RQ QID.  Otherwise the poll code will not
find the correct QP context.

Signed-off by: Jonathan Lallinger <jonathan@ogc.us>
Signed-off by: Steve Wise <swise@ogc.us>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

e14d62c0

RDMA/cxgb4: Make sure flush CQ entries are collected on connection close · 01e7da6b

由 Kumar Sanghvi 提交于 10月 13, 2011

At the time when a peer closes the connection, iw_cxgb4 will not send
a cq event if ibqp.uobject exists.  In that case, its possible for a
user application to get blocked in ibv_get_cq_event().

To resolve this, call the cq's comp_handler to unblock any read from
ibv_get_cq_event().  This will trigger userspace to poll the cq and
collect flush status completions for any pending work requests.
Signed-off-by: NKumar Sanghvi <kumaras@chelsio.com>
Acked-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

01e7da6b

07 10月, 2011 2 次提交

RDMA/cxgb4: Add support for MPAv2 Enhanced RDMA Negotiation · d2fe99e8

由 Kumar Sanghvi 提交于 9月 25, 2011

This patch adds support for Enhanced RDMA Connection Establishment
(draft-ietf-storm-mpa-peer-connect-06), aka MPAv2.  Details of draft
can be obtained from:
<http://www.ietf.org/id/draft-ietf-storm-mpa-peer-connect-06.txt>

The patch updates the following functions for initiator perspective:
 - send_mpa_request
 - process_mpa_reply
 - post_terminate for TERM error codes
 - destroy_qp for TERM related change
 - adds layer/etype/ecode to c4iw_qp_attrs for sending with TERM
 - peer_abort for retrying connection attempt with MPA_v1 message
 - added c4iw_reconnect function

The patch updates the following functions for responder perspective:
 - process_mpa_request
 - send_mpa_reply
 - c4iw_accept_cr
 - passes ird/ord to upper layers
Signed-off-by: NKumar Sanghvi <kumaras@chelsio.com>
Reviewed-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

d2fe99e8

RDMA/cxgb4: Fail RDMA initialization for unsupported cards · 9efe10a1

由 Steve Wise 提交于 10月 06, 2011

The iw_cxgb4 module crashes at init time if the T4 card does not
support RDMA.  So clean up the init logic to correctly deal with
non-RDMA cards.

 - If any RDMA resources are not available, then fail the initialization
   logging an info message.
 - Clean up properly on initialization failures.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

9efe10a1

11 8月, 2011 1 次提交

chelsio: Move the Chelsio drivers · f7917c00

由 Jeff Kirsher 提交于 4月 07, 2011

Moves the drivers for the Chelsio chipsets into
drivers/net/ethernet/chelsio/ and the necessary Kconfig and Makefile
changes.

CC: Divy Le Ray <divy@chelsio.com>
CC: Dimitris Michailidis <dm@chelsio.com>
CC: Casey Leedom <leedom@chelsio.com>
Signed-off-by: NJeff Kirsher <jeffrey.t.kirsher@intel.com>

f7917c00

27 7月, 2011 1 次提交

atomic: use <linux/atomic.h> · 60063497

由 Arun Sharma 提交于 7月 26, 2011

This allows us to move duplicated code in <asm/atomic.h>
(atomic_inc_not_zero() for now) to <linux/atomic.h>
Signed-off-by: NArun Sharma <asharma@fb.com>
Reviewed-by: NEric Dumazet <eric.dumazet@gmail.com>
Cc: Ingo Molnar <mingo@elte.hu>
Cc: David Miller <davem@davemloft.net>
Cc: Eric Dumazet <eric.dumazet@gmail.com>
Acked-by: NMike Frysinger <vapier@gentoo.org>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

60063497

19 7月, 2011 2 次提交

RDMA/cxgb4: Use printk_ratelimited() instead of printk_ratelimit() · 3cbe182a

由 Manuel Zerpies 提交于 6月 16, 2011

Since printk_ratelimit() shouldn't be used anymore (see comment in
include/linux/printk.h), replace it with printk_ratelimited().
Signed-off-by: NManuel Zerpies <manuel.f.zerpies@ww.stud.uni-erlangen.de>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

3cbe182a

RDMA: Allow for NULL .modify_device() and .modify_port() methods · 10e1b54b

由 Bart Van Assche 提交于 6月 18, 2011

These methods don't make sense for iWARP devices, so rather than
forcing them to implement stubs, just return -ENOSYS in the core if
the hardware driver doesn't set .modify_device and/or .modify_port.
Signed-off-by: NRoland Dreier <roland@purestorage.com>

10e1b54b

18 7月, 2011 1 次提交
- D
  net: Abstract dst->neighbour accesses behind helpers. · 69cce1d1
  由 David S. Miller 提交于 7月 17, 2011
```
dst_{get,set}_neighbour()
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  69cce1d1
18 6月, 2011 3 次提交

RDMA/cxgb4: Couple of abort fixes · 8da7e7a5

由 Steve Wise 提交于 6月 14, 2011

- fix a race where the driver could end up sending a close_con_req
  after an abort_rpl.  In c4iw_ep_disconnect(), send abort or close
  request with the ep mutex held.

- fix a hang where driver fails to wake up when a connection is reset
  during a normal close.  Wake up any waiters in the interrupt path,
  and correctly cleanup after rdma_fini() failures.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

8da7e7a5

RDMA/cxgb4: Don't truncate MR lengths · 301c2c3f

由 Steve Wise 提交于 6月 14, 2011

Remove left-over code from T3 that limited MR sizes to 32b.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

301c2c3f

RDMA/cxgb4: Don't exceed hw IQ depth limit for user CQs · 2ff7d09a

由 Steve Wise 提交于 6月 01, 2011

Memory allocated for user CQs gets rounded up to the next page
boundary. And after rounding, we recalculate the resulting IQ depth
and we need to make sure we don't exceed the HW limits.

This bug can result a much smaller CQ allocated than was expected if
the HW size field is exceeded, resulting in CQ overflow failures.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

2ff7d09a

25 5月, 2011 1 次提交

RDMA/cxgb4: Use completion objects for event blocking · c337374b

由 Steve Wise 提交于 5月 20, 2011

There exists a race condition when using wait_queue_head_t objects
that are declared on the stack.  This was being done in a few places
where we are sending work requests to the FW and awaiting replies, but
we don't have an endpoint structure with an embedded c4iw_wr_wait
struct.  So the code was allocating it locally on the stack.  Bad
design.  The race is:

  1) thread on cpuX declares the wait_queue_head_t on the stack, then
     posts a firmware WR with that wait object ptr as the cookie to be
     returned in the WR reply.  This thread will proceed to block in
     wait_event_timeout() but before it does:

  2) An interrupt runs on cpuY with the WR reply.  fw6_msg() handles
     this and calls c4iw_wake_up().  c4iw_wake_up() sets the condition
     variable in the c4iw_wr_wait object to TRUE and will call
     wake_up(), but before it calls wake_up():

  3) The thread on cpuX calls c4iw_wait_for_reply(), which calls
     wait_event_timeout().  The wait_event_timeout() macro checks the
     condition variable and returns immediately since it is TRUE.  So
     this thread never blocks/sleeps. The function then returns
     effectively deallocating the c4iw_wr_wait object that was on the
     stack.

  4) So at this point cpuY has a pointer to the c4iw_wr_wait object
     that is no longer valid.  Further its pointing to a stack frame
     that might now be in use by some other context/thread.  So cpuY
     continues execution and calls wake_up() on a ptr to a wait object
     that as been effectively deallocated.

This race, when it hits, can cause a crash in wake_up(), which I've
seen under heavy stress. It can also corrupt the referenced stack
which can cause any number of failures.

The fix:

Use struct completion, which supports on-stack declarations.
Completions use a spinlock around setting the condition to true and
the wake up so that steps 2 and 4 above are atomic and step 3 can
never happen in-between.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>

c337374b

10 5月, 2011 5 次提交

RDMA/cxgb4: EEH errors can hang the driver · 2f25e9a5

由 Steve Wise 提交于 5月 09, 2011

A few more EEH fixes:

c4iw_wait_for_reply(): detect fatal EEH condition on timeout and
return an error.

The iw_cxgb4 driver was only calling ib_deregister_device() on an EEH
event followed by a ib_register_device() when the device was
reinitialized.  However, the RDMA core doesn't allow multiple
iterations of register/deregister by the provider. See
drivers/infiniband/core/sysfs.c: ib_device_unregister_sysfs() where
the kobject ref is held until the device is deallocated in
ib_deallocate_device().  Calling deregister adds this kobj reference,
and then a subsequent register call will generate a WARN_ON() from the
kobject subsystem because the kobject is being initialized but is
already initialized with the ref held.

So the provider must deregister and dealloc when resetting for an EEH
event, then alloc/register to re-initialize.  To do this, we cannot
use the device ptr as our ULD handle since it will change with each
reallocation.  This commit adds a ULD context struct which is used as
the ULD handle, and then contains the device pointer and other state
needed.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

2f25e9a5

RDMA/cxgb4: Reset wait condition atomically · d9594d99

由 Steve Wise 提交于 5月 09, 2011

The driver was never really waiting for RDMA_WR/FINI completions
because the condition variable used to determine if the completion
happened was never reset, and this condition variable is reused for
both connection setup and teardown.  This causes various driver
crashes under heavy loads due to releasing resources too early.

The fix is to use atomic bits to correctly reset the condition
immediately after the completion is detected.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

d9594d99

RDMA/cxgb4: Fix missing parentheses · 85d215b0

由 Roel Kluin 提交于 5月 09, 2011

Parens are missing: '|' has a higher presedence than '?'.
Signed-off-by: NRoel Kluin <roel.kluin@gmail.com>
Acked-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

85d215b0

RDMA/cxgb4: Initialization errors can cause crash · bbe9a0a2

由 Steve Wise 提交于 5月 09, 2011

c4iw_uld_add() must return ERR_PTR() values instead of NULL on failure.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

bbe9a0a2

RDMA/cxgb4: Don't change QP state outside EP lock · 30c95c2d

由 Steve Wise 提交于 5月 09, 2011

Concurrent ingress CLOSE and ULP ABORT operations causes a crash due
to a race condition where the close path releases the EP lock and then
tries to move the QP state to CLOSED.  This must be done inside the EP
lock to avoid the race.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

30c95c2d

04 5月, 2011 1 次提交
- D
  ipv4: Make caller provide on-stack flow key to ip_route_output_ports(). · 31e4543d
  由 David S. Miller 提交于 5月 03, 2011
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  31e4543d
27 4月, 2011 1 次提交

cxgb4: use pgprot_writecombine() on powerpc · e297d9dd

由 Nishanth Aravamudan 提交于 3月 14, 2011

Commit fe3cc0d9 ("powerpc: Add
pgprot_writecombine") in benh's tree exposes the pgprot_writecombine()
API to drivers on powerpc. cxgb4 has an open-coded version of the same,
so use the common API now that it's available.
Signed-off-by: NNishanth Aravamudan <nacc@us.ibm.com>
Cc: Steve Wise <swise@opengridcomputing.com>
Cc: Anton Blanchard <anton@samba.org>
Acked-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NBenjamin Herrenschmidt <benh@kernel.crashing.org>

e297d9dd

15 3月, 2011 7 次提交

RDMA/cxgb4: Debugfs dump_qp() updates · db5d040d

由 Steve Wise 提交于 3月 11, 2011

- Show whether the SQ is in onchip memory or not.
- Dump both SQ and RQ QIDs.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

db5d040d

RDMA/cxgb4: Dispatch FATAL event on EEH errors · 767fbe81

由 Steve Wise 提交于 3月 11, 2011

This at least kicks the user mode applications that are watching for
device events.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

767fbe81

RDMA/cxgb4: Use ULP_MODE_TCPDDP · b48f3b9c

由 Steve Wise 提交于 3月 11, 2011

Set the ULP mode for initial RDMA connection setup to the proper DDP
mode. This avoids wasting some HW resources while in streaming mode.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

b48f3b9c

RDMA/cxgb4: Enable on-chip SQ support by default · a9c77198

由 Steve Wise 提交于 3月 11, 2011

Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

a9c77198

RDMA/cxgb4: Do CIDX_INC updates every 1/16 CQ depth CQE reaps · ffc3f748

由 Steve Wise 提交于 3月 11, 2011

This avoids the CIDX_INC overflow issue with T4A2 when running
kernel RDMA applications.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

ffc3f748

RDMA/cxgb4: Remove db_drop_task · 29428137

由 Steve Wise 提交于 3月 11, 2011

Unloading iw_cxgb4 can crash due to the unload code trying to use
db_drop_task, which is uninitialized.  So remove this dead code.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

29428137

RDMA/cxgb4: Turn on delayed ACK · b52fe09e

由 Steve Wise 提交于 3月 11, 2011

Set the default to on.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

b52fe09e

13 3月, 2011 1 次提交

ipv4: Create and use route lookup helpers. · 78fbfd8a

由 David S. Miller 提交于 3月 12, 2011

The idea here is this minimizes the number of places one has to edit
in order to make changes to how flows are defined and used.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

78fbfd8a

03 3月, 2011 1 次提交
- D
  ipv4: Make output route lookup return rtable directly. · b23dd4fe
  由 David S. Miller 提交于 3月 02, 2011
```
Instead of on the stack.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  b23dd4fe
02 3月, 2011 2 次提交
- D
  ipv4: Kill can_sleep arg to ip_route_output_flow() · 273447b3
  由 David S. Miller 提交于 3月 01, 2011
```
This boolean state is now available in the flow flags.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  273447b3
- D
  ipv4: Make final arg to ip_route_output_flow to be boolean "can_sleep" · 420d44da
  由 David S. Miller 提交于 3月 01, 2011
```
Since that is what the current vague "flags" argument means.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  420d44da
29 1月, 2011 1 次提交

RDMA/cxgb4: Set the correct device physical function for iWARP connections · 94788657

由 Steve Wise 提交于 1月 21, 2011

The PF passed to FW was 0, causing PCI failures in an SR-IOV environment.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Cc: <stable@kernel.org>
Signed-off-by: NRoland Dreier <roland@purestorage.com>

94788657

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功