提交 · 9e8d1fa3420f489da8a5da47c026511aa71fa50b · openanolis / cloud-kernel

29 9月, 2010 7 次提交

RDMA/cxgb4: debugfs files for dumping active stags · 9e8d1fa3

由 Steve Wise 提交于 9月 10, 2010

Add "stags" debugfs file.  This is useful for examining the TPTE and
PBL entries in adapter memory.  It allows scripts to dump just the
active entries.

Also clean up the "qps" file handlers and shared common code.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

9e8d1fa3

RDMA/cxgb4: Log HW lack-of-resource errors · 05fb9629

由 Steve Wise 提交于 9月 10, 2010

This helps debug cases where HW resources are depleted.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

05fb9629

RDMA/cxgb4: Handle CPL_RDMA_TERMINATE messages · 0e42c1f4

由 Steve Wise 提交于 9月 10, 2010

T4 FW sends up CPL_RDMA_TERMINATE to indicate a peer TERM.  This
triggers the QP moving to TERMINATE state.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

0e42c1f4

RDMA/cxgb4: Ignore TERMINATE CQEs · 6ff0e343

由 Steve Wise 提交于 9月 10, 2010

T4 incorrectly inserts TERM CQEs into the CQ.  Silently ignore them.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

6ff0e343

RDMA/cxgb4: Ignore positive return values from cxgb4_*_send() functions · 74594861

由 Steve Wise 提交于 9月 10, 2010

The cxgb4_*_send() functions return NET_XMIT_ values, which are
positive integers or negative errno values.  So don't treat positive
return values as an error.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

74594861

RDMA/cxgb4: Zero out ISGL padding · 13fecb83

由 Steve Wise 提交于 9月 10, 2010

The HW design requires zeroing any pad in SGLs.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

13fecb83

RDMA/cxgb4: Don't use null ep ptr · af93fb5d

由 Steve Wise 提交于 9月 10, 2010

In c4iw_modify_qp() error path, only use qhp->ep if ep is not already set.
Otherwise qhp->ep can be NULL and we crash.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

af93fb5d

28 9月, 2010 2 次提交

RDMA/cxgb4: Fix warnings about casts to/from pointers of different sizes · c8e081a1

由 Roland Dreier 提交于 9月 27, 2010

Fix:

drivers/infiniband/hw/cxgb4/qp.c: In function ‘create_qp’:
drivers/infiniband/hw/cxgb4/qp.c:147: warning: cast from pointer to integer of different size
drivers/infiniband/hw/cxgb4/qp.c: In function ‘rdma_fini’:
drivers/infiniband/hw/cxgb4/qp.c:988: warning: cast from pointer to integer of different size
drivers/infiniband/hw/cxgb4/qp.c: In function ‘rdma_init’:
drivers/infiniband/hw/cxgb4/qp.c:1063: warning: cast from pointer to integer of different size
drivers/infiniband/hw/cxgb4/mem.c: In function ‘write_adapter_mem’:
drivers/infiniband/hw/cxgb4/mem.c:74: warning: cast from pointer to integer of different size
drivers/infiniband/hw/cxgb4/cq.c: In function ‘destroy_cq’:
drivers/infiniband/hw/cxgb4/cq.c:58: warning: cast from pointer to integer of different size
drivers/infiniband/hw/cxgb4/cq.c: In function ‘create_cq’:
drivers/infiniband/hw/cxgb4/cq.c:135: warning: cast from pointer to integer of different size
drivers/infiniband/hw/cxgb4/cm.c: In function ‘fw6_msg’:
drivers/infiniband/hw/cxgb4/cm.c:2326: warning: cast to pointer from integer of different size

by casting pointers to unsigned long instead of u64.
Reported-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

c8e081a1

RDMA/cxgb3: Turn off RX coalescing for iWARP connections · bec658ff

由 Steve Wise 提交于 9月 18, 2010

The HW by default has RX coalescing on.  For iWARP connections, this
causes a 100ms delay in connection establishement due to the ingress
MPA Start message being stalled in HW.  So explicitly turn RX
coalescing off when setting up iWARP connections.

This was causing very bad performance for NP64 gather operations using
Open MPI, due to the way it sets up connections on larger jobs.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Cc: <stable@kernel.org>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

bec658ff

09 9月, 2010 4 次提交

RDMA/nes: Fix hang with modified FIN handling on A0 cards · 29da03b9

由 Faisal Latif 提交于 9月 01, 2010

Changing state to CLOSING when FIN is received causes A0 cards to
hang.  Fix this by checking for A0 cards in FIN handling.
Signed-off-by: NFaisal Latif <faisal.latif@intel.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

29da03b9

RDMA/nes: Change state to closing after FIN · 67d70721

由 Faisal Latif 提交于 8月 14, 2010

When the driver receives an AE for FIN received, it closes the
connection without changing the state of the connection in the
hardware to closing.  By changing the state to closing, hardware will
do a normal close sequence.
Signed-off-by: NFaisal Latif <faisal.latif@intel.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

67d70721

RDMA/nes: Fix double CLOSE event indication crash · dae58728

由 Faisal Latif 提交于 8月 14, 2010

During a stress testing in a large cluster, multiple close event are
detected and BUG() is hit in the iWARP core. The cause is that the
active node gave up while waiting for an MPA response from the peer
and tried to close the connection by sending RST. The passive node
driver receives the RST but is waiting for MPA response from the user.
When the MPA accept is received, the driver offloads the connection
and sends a CLOSE event. The driver gets an AE indicating RESET
received and also sends a CLOSE event, hitting a BUG().

Fix this by correcting RESET handling and sending CLOSE events.
Signed-off-by: NFaisal Latif <faisal.latif@intel.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

dae58728

RDMA/nes: Write correct register write to set TX pause param · 70c9db0f

由 Chien Tung 提交于 9月 07, 2010

Setting TX pause param writes to the wrong register location causing
the adapter to hang. Correct the define used to write the reigster.

Addresses: https://bugs.openfabrics.org/show_bug.cgi?id=2116Reported-by: NShiri Franchi <shirif@voltaire.com>
Signed-off-by: NChien Tung <chien.tin.tung@intel.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

70c9db0f

03 9月, 2010 1 次提交

RDMA/cxgb3: Don't exceed the max HW CQ depth · dc4e96ce

由 Steve Wise 提交于 8月 28, 2010

The max depth supported by T3 is 64K entries.  This fixes a bug
introduced in commit 9918b28d ("RDMA/cxgb3: Increase the max CQ
depth") that causes stalls and possibly crashes in large MPI clusters.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Cc: <stable@kernel.org>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

dc4e96ce

08 8月, 2010 1 次提交

RDMA/cxgb4: Obtain RDMA QID ranges from LLD/FW · 93fb72e4

由 Steve Wise 提交于 6月 23, 2010

Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

93fb72e4

06 8月, 2010 4 次提交

of/device: Replace struct of_device with struct platform_device · 2dc11581

由 Grant Likely 提交于 8月 06, 2010

of_device is just an alias for platform_device, so remove it entirely.  Also
replace to_of_device() with to_platform_device() and update comment blocks.

This patch was initially generated from the following semantic patch, and then
edited by hand to pick up the bits that coccinelle didn't catch.

@@
@@
-struct of_device
+struct platform_device
Signed-off-by: NGrant Likely <grant.likely@secretlab.ca>
Reviewed-by: NDavid S. Miller <davem@davemloft.net>

2dc11581

IB/qib: Add missing <linux/slab.h> include · ba818afd

由 David Miller 提交于 8月 05, 2010

Fix build failure on sparc64 which is missing the include of
<linux/slab.h> via <asm/pci.h> that x86, powerpc, ia64, etc. have.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

ba818afd

IB/ehca: Drop unnecessary NULL test · 2db00321

由 Julia Lawall 提交于 8月 03, 2010

list_for_each_entry binds its first argument to a non-null value, and thus
any null test on the value of that argument is superfluous.

The semantic patch that makes this change is as follows:
(http://coccinelle.lip6.fr/)

// <smpl>
@@
iterator I;
expression x;
statement S,S1,S2;
@@

I(x,...) { <...
- if (x == NULL && ...) S
  ...> }
// </smpl>
Signed-off-by: NJulia Lawall <julia@diku.dk>
Acked-by: NAlexander Schmidt <alexs@linux.vnet.ibm.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

2db00321

RDMA/nes: Fix confusing if statement indentation · 817979ac

由 Roland Dreier 提交于 8月 05, 2010

Fix confusing indentation that makes a statement look as if it's part of
an if statement when in fact it isn't.
Reported-by: NJulia Lawall <julia@diku.dk>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

817979ac

05 8月, 2010 13 次提交

IB/ehca: Init irq tasklet before irq can happen · bd5d0ccb

由 Alexander Schmidt 提交于 7月 05, 2010

Initialize tasklet before interrupts are requested to prevent
scheduling of an uninitialized tasklet.
Signed-off-by: NAlexander Schmidt <alexs@linux.vnet.ibm.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

bd5d0ccb

RDMA/nes: Fix misindented code · b2a899ea

由 Roland Dreier 提交于 8月 04, 2010

In nes_probe(), a bit of code is indented one tab stop too far.  Fix this.
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

b2a899ea

RDMA/nes: Fix showing wqm_quanta · df924f83

由 Roland Dreier 提交于 8月 04, 2010

In nes_show_wqm_quanta(), the wrong value is printed.  Fix this.
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

df924f83

RDMA/nes: Get rid of "set but not used" variables · 69d51023

由 Roland Dreier 提交于 8月 04, 2010

Delete dead code in various places that is shown by gcc 4.6's new
-Wunused-but-set-variable warnings.
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

69d51023

RDMA/nes: Read firmware version from correct place · ff0380ce

由 Miroslaw Walukiewicz 提交于 7月 15, 2010

Signed-off-by: NMirek Walukiewicz <miroslaw.walukiewicz@intel.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

ff0380ce

IB/srp: Export req_lim via sysfs · 89de7486

由 Bart Van Assche 提交于 8月 03, 2010

Export req_lim via sysfs for debugging.
Signed-off-by: NBart Van Assche <bart.vanassche@gmail.com>
Acked-by: NDavid Dillow <dave@thedillows.org>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

89de7486

IB/srp: Make receive buffer handling more robust · c996bb47

由 Bart Van Assche 提交于 7月 30, 2010

The current strategy in ib_srp for posting receive buffers is:

 * Post one buffer after channel establishment.
 * Post one buffer before sending an SRP_CMD or SRP_TSK_MGMT to the target.

As a result, only the first non-SRP_RSP information unit from the
target will be processed.  If that first information unit is an
SRP_T_LOGOUT, it will be processed.  On the other hand, if the
initiator receives an SRP_CRED_REQ or SRP_AER_REQ before it receives a
SRP_T_LOGOUT, the SRP_T_LOGOUT won't be processed.

We can fix this inconsistency by changing the strategy for posting
receive buffers to:

 * Post all receive buffers after channel establishment.
 * After a receive buffer has been consumed and processed, post it again.

A side effect is that the ib_post_recv() call is moved out of the SCSI
command processing path.  Since __srp_post_recv() is not called
directly any more, get rid of it and move the code directly into
srp_post_recv().  Also, move srp_post_recv() up in the file to avoid a
forward declaration.
Signed-off-by: NBart Van Assche <bart.vanassche@gmail.com>
Acked-by: NDavid Dillow <dave@thedillows.org>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

c996bb47

IB/srp: Use print_hex_dump() · 7a700811

由 Bart Van Assche 提交于 7月 29, 2010

Replace an open-coded dump of the receive buffer with a call to
print_hex_dump().
Signed-off-by: NBart Van Assche <bart.vanassche@gmail.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

7a700811

IB: Rename RAW_ETY to RAW_ETHERTYPE · a2ebf07a

由 Aleksey Senin 提交于 7月 04, 2010

Change abbreviated IB_QPT_RAW_ETY to IB_QPT_RAW_ETHERTYPE to make
the special QP type easier to understand.

cf http://www.mail-archive.com/linux-rdma@vger.kernel.org/msg04530.htmlSigned-off-by: NAleksey Senin <alekseys@voltaire.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

a2ebf07a

RDMA/nes: Fix two sparse warnings · 812d8672

由 Or Gerlitz 提交于 7月 20, 2010

Simple changes to fix warnings:

      CHECK   drivers/infiniband/hw/nes/nes_verbs.c
    nes_verbs.c:1944:45: warning: Using plain integer as NULL pointer
    nes_verbs.c:1944:48: warning: Using plain integer as NULL pointer
      CHECK   drivers/infiniband/hw/nes/nes_cm.c
    nes_cm.c:2645:43: warning: mixing different enum types
    nes_cm.c:2645:43:     int enum iw_cm_event_type  versus
    nes_cm.c:2645:43:     int enum iw_cm_event_status
Signed-off-by: NOr Gerlitz <ogerlitz@voltaire.com>
Acked-by: NChien Tung <chien.tin.tung@intel.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

812d8672

RDMA/cxgb3: Make needlessly global iwch_l2t_send() static · 18199f57

由 Or Gerlitz 提交于 7月 20, 2010

Signed-off-by: NOr Gerlitz <ogerlitz@voltaire.com>
Acked-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

18199f57

O
IB/iser: Make needlessly global iser_alloc_rx_descriptors() static · 48d8fceb
由 Or Gerlitz 提交于 7月 20, 2010
```
Signed-off-by: NOr Gerlitz <ogerlitz@voltaire.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>
```
48d8fceb

RDMA/cxgb4: Add timeouts when waiting for FW responses · a5f4a078

由 Steve Wise 提交于 7月 23, 2010

Don't hang a host thread if the FW stops responding.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

a5f4a078

04 8月, 2010 4 次提交

IB/qib: Fix race between qib_error_qp() and receive packet processing · a5210c12

由 Ralph Campbell 提交于 8月 02, 2010

When transitioning a QP to the error state, in progress RWQEs need to
be marked complete.  This also involves releasing the reference count
to the memory regions referenced in the SGEs.  The locking in the
receive packet processing wasn't sufficient to prevent qib_error_qp()
from modifying the r_sge state at the same time, thus leading to
kernel panics.
Signed-off-by: NRalph Campbell <ralph.campbell@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

a5210c12

IB/qib: Limit the number of packets processed per interrupt · 3e3aed0b

由 Ralph Campbell 提交于 8月 02, 2010

Don't processes too many packets without allowing other IRQ functions
a chance to run. Otherwise, there is a chance of getting a "soft
lockup" messages and poor application response times.
Signed-off-by: NRalph Campbell <ralph.campbell@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

3e3aed0b

IB/qib: Allow writes to the diag_counters to be able to clear them · 4c6931f5

由 Ira Weiny 提交于 7月 14, 2010

Signed-off-by: NIra Weiny <weiny2@llnl.gov>
Acked-by: NRalph Campbell <ralph.campbell@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

4c6931f5

IB/qib: Set cfgctxts to number of CPUs by default · 0502f94c

由 Ralph Campbell 提交于 7月 21, 2010

Up to now, we have set the number of available user contexts based on
the number of hardware contexts which is set according to the number
of available CPUs. This was fine since most CPUs had a power of two
number of cores and the chip supported 4, 8, or 16 user contexts. Now
that some systems have 12 cores, the default isn't optimal and should
be set to 12 even though 16 hardware contexts need to be enabled.
Signed-off-by: NRalph Campbell <ralph.campbell@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

0502f94c

03 8月, 2010 3 次提交

RDMA/cxgb4: Set/reset the EP timer inside EP lock · ca5a2202

由 Steve Wise 提交于 7月 23, 2010

Endpoint timer manipulation needs to be done inside the lock.  Otherwise
we can get into a situation where a timer is stopped before it is started,
which hits the WARN_ON() in stop_ep_timer().
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

ca5a2202

RDMA/cxgb4: Use correct control txq · d4f1a5c6

由 Steve Wise 提交于 7月 23, 2010

There is only one control txq per tx channel.  So use the port number
as the queue index when sending.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

d4f1a5c6

RDMA/cxgb4: Fix race in fini path · 73d6fcad

由 Steve Wise 提交于 7月 23, 2010

There exists a race condition where the app disconnects, which
initiates an orderly close (via rdma_fini()), concurrently with an
ingress abort condition, which initiates an abortive close operation.
Since rdma_fini() must be called without IRQs disabled, the fini can
be called after the QP has been transitioned to ERROR. This is ok,
but we need to protect against qp->ep getting NULLed.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

73d6fcad

29 7月, 2010 1 次提交

IB/cm: Check LAP state before sending an MRA · 50a025c6

由 Sean Hefty 提交于 7月 21, 2010

NULL pointer dereferences in ib_cm_init_qp_attr() were seen by some
users.  From a crash dump, I determined that we died in
cm_init_qp_rts_attr() (it's inlined, so it doesn't show up in the
traceback) on the line labeled below:

static int cm_init_qp_rts_attr(struct cm_id_private *cm_id_priv,
                               struct ib_qp_attr *qp_attr,
                               int *qp_attr_mask)
{
        ........
        if (cm_id_priv->id.lap_state == IB_CM_LAP_UNINIT) {
                .....
        } else {
               *qp_attr_mask = IB_QP_ALT_PATH | IB_QP_PATH_MIG_STATE;
               qp_attr->alt_port_num = cm_id_priv->alt_av.port->port_num; <-die


The problem is that the rdma_cm can call ib_send_cm_mra() after a
connection has been established.  The ib_cm incorrectly assumes that
the MRA is in response to a LAP (load alternate path) message, even
though no LAP message has been received.  The ib_cm needs to check the
lap_state before sending an MRA if the cm_id state is established.
Reported-by: NArthur Kepner <akepner@sgi.com>
Reported-by: NJosh England <jjengla@gmail.com>
Signed-off-by: NSean Hefty <sean.hefty@intel.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

50a025c6

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功