提交 · 542869a17eee2edf389273f40f757aa4e662b3da · openeuler / Kernel

10 10月, 2007 40 次提交

IB/ipath: Remove duplicate copy of LMC · 542869a1

由 Ralph Campbell 提交于 9月 13, 2007

The LMC value was being saved by the SMA in two places. This patch
cleans it up so only one copy is kept.
Signed-off-by: NRalph Campbell <ralph.campbell@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

542869a1

IB/ipath: Add ability to set the LMC via the sysfs debugging interface · 15cba26f

由 Ralph Campbell 提交于 9月 12, 2007

This patch adds the ability to set the LMC via a sysfs file as if the SM
sent a SubnSet(PortInfo) MAD.  It is useful for debugging when no SM is
running.
Signed-off-by: NRalph Campbell <ralph.campbell@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

15cba26f

IB/ipath: Optimize completion queue entry insertion and polling · 6cff2faa

由 Ralph Campbell 提交于 9月 07, 2007

The code to add an entry to the completion queue stored the QPN which is
needed for the user level verbs view of the completion queue entry but
the kernel struct ib_wc contains a pointer to the QP instead of a QPN.
When the kernel polled for a completion queue entry, the QPN was lookup
up and the QP pointer recovered. This patch stores the CQE differently
based on whether the CQ is a kernel CQ or a user CQ thus avoiding the
QPN to QP lookup overhead.
Signed-off-by: NRalph Campbell <ralph.campbell@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

6cff2faa

IB/ipath: Implement IB_EVENT_QP_LAST_WQE_REACHED · d42b01b5

由 Ralph Campbell 提交于 8月 25, 2007

This patch implements the IB_EVENT_QP_LAST_WQE_REACHED event which is
needed by ib_ipoib to destroy the QP when used in connected mode.
Signed-off-by: NRalph Campbell <ralph.campbell@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

d42b01b5

IB/ipath: Generate flush CQE when QP is in error state · c9cf7db2

由 Ralph Campbell 提交于 8月 25, 2007

Follow the IB spec. (C10-96) for post send which states that a flushed 
completion event should be generated for work requests posted when a QP
is in the error state.
Signed-off-by: NRalph Campbell <ralph.campbell@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

c9cf7db2

IB/ipath: Remove redundant code · 036be09c

由 Ralph Campbell 提交于 8月 20, 2007

This patch removes some redundant initialization code.
Signed-off-by: NRalph Campbell <ralph.campbell@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

036be09c

IB/ipath: Future proof eeprom checksum code (contents reading) · d29cc6ef

由 Dave Olson 提交于 8月 17, 2007

In an earlier change, the amount of data read from the flash was
mistakenly limited to the size known to the current driver.  This causes
problems when the length is increased, and written with the new longer
version; the checksum would fail because not enough data was read.
Always read the full 128 byte length to prevent this.
Signed-off-by: NDave Olson <dave.olson@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

d29cc6ef

IB/ipath: UC RDMA WRITE with IMMEDIATE doesn't send the immediate · 55046698

由 Ralph Campbell 提交于 8月 17, 2007

This patch fixes a bug in the receive processing for UC RDMA WRITE with
immediate which caused the last packet to be dropped.
Signed-off-by: NRalph Campbell <ralph.campbell@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

55046698

IB/ipath: Correctly describe workaround for TID write chip bug · 9ef8617a

由 Dave Olson 提交于 8月 09, 2007

This is a comment change, only, correcting the comment to match the
implemented workaround, rather than the original workaround, and
clarifying why it's needed.
Signed-off-by: NDave Olson <dave.olson@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

9ef8617a

IB/ipath: Remove unneeded code for ipathfs · 1793b477

由 Ralph Campbell 提交于 8月 07, 2007

The ipathfs file system is used to export binary data verses ASCII data
such as through /sys. This patch removes some unneeded files since the
data is available through other /sys files.
Signed-off-by: NRalph Campbell <ralph.campbell@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

1793b477

IB/ipath: Verify host bus bandwidth to chip will not limit performance · 9bec3992

由 Dave Olson 提交于 6月 01, 2007

There have been a number of issues where host bandwidth via HT or PCIe
to the InfiniPath chip has been limited in some fashion (BIOS,
configuration, etc.), resulting in user confusion.  This check gives a
clear warning that something is wrong and needs to be resolved.
Signed-off-by: NDave Olson <dave.olson@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

9bec3992

IB/ipath: Change UD to queue work requests like RC & UC · 4ee97180

由 Ralph Campbell 提交于 7月 25, 2007

The code to post UD sends tried to process work requests at the time
ib_post_send() is called without using a WQE queue. This was fine as
long as HW resources were available for sending a packet. This patch
changes UD to be handled more like RC and UC and shares more code.
Signed-off-by: NRalph Campbell <ralph.campbell@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

4ee97180

IB/ipath: Performance optimization for CPU differences · 210d6ca3

由 Ralph Campbell 提交于 7月 24, 2007

Different processors have different ordering restrictions for write
combining.  By taking advantage of this, we can eliminate some write
barriers when writing to the send buffers.
Signed-off-by: NRalph Campbell <ralph.campbell@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

210d6ca3

IB/ipath: iba6110 rev4 GPIO counters support · 327a338d

由 Arthur Jones 提交于 8月 02, 2007

On iba6110 rev4, support for three more IB counters were added.  The
LocalLinkIntegrityError counter, the ExcessiveBufferOverrunErrors
counter and support for error counting of flow control packets on an
invalid VL.  These counters trigger GPIO interrupts and the sw keeps
track of the counts.  Since we also use GPIO interrupts to signal packet
reception, we need to turn off the fast interrupts, or we risk losing a
GPIO interrupt.
Signed-off-by: NArthur Jones <arthur.jones@qlogic.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

327a338d

IB/ehca: Fix clipping of device limits to INT_MAX · 76dea3bc

由 Roland Dreier 提交于 10月 09, 2007

Doing min_t(int, foo, INT_MAX) doesn't work correctly, because if foo
is bigger than INT_MAX, then when treated as a signed integer, it will
become negative and hence such an expression is just an elaborate NOP.

Fix such cases in ehca to do min_t(unsigned, foo, INT_MAX) instead.
This fixes negative reported values for max_cqe, max_pd and max_ah:

Before:

        max_cqe:                        -64
        max_pd:                         -1
        max_ah:                         -1

After:
        max_cqe:                        2147483647
        max_pd:                         2147483647
        max_ah:                         2147483647

Based on a bug report and fix from Anton Blanchard <anton@samba.org>.
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

76dea3bc

IPoIB/cm: Clean up initialization of QP attr in ipoib_cm_create_tx_qp() · ede6bc04

由 Dotan Barak 提交于 10月 07, 2007

Make the way QP is being created in ipoib_cm_create_tx_qp()
consistent with ipoib_cm_create_rx_qp().
Signed-off-by: NDotan Barak <dotanb@dev.mellanox.co.il>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

ede6bc04

mlx4_core: Use mmiowb() to avoid firmware commands getting jumbled up · 2e61c646

由 Roland Dreier 提交于 10月 09, 2007

Firmware commands are sent to the HCA by writing multiple words to a
command register block. Access to this block of registers is
serialized with a mutex. However, on large SGI systems writes to the
register block may be reordered within the system interconnect and
reach the HCA in a different order than they were issued (even with
the mutex). Fix this by adding an mmiowb() before dropping the mutex.

This bug was observed with real workloads with the similar FW command
code in the mthca driver, and adding the mmiowb() as in commit
66547550 ("IB/mthca: Use mmiowb() to avoid firmware commands getting
jumbled up") was confirmed to fix the problems, so we should add the
same fix to mlx4.
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

2e61c646

IB/mthca: Use mmiowb() to avoid firmware commands getting jumbled up · 76d7cc03

由 Roland Dreier 提交于 10月 09, 2007

Firmware commands are sent to the HCA by writing multiple words to a
command register block.  Access to this block of registers is
serialized with a mutex.  However, on large SGI systems, problems were
seen with multiple CPUs issuing FW commands at the same time, because
the writes to the register block may be reordered within the system
interconnect and reach the HCA in a different order than they were
issued (even with the mutex).  Fix this by adding an mmiowb() before
dropping the mutex.
Tested-by: NArthur Kepner <akepner@sgi.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

76d7cc03

RDMA/cma: Queue IB CM MRAs to avoid unnecessary remote retries · dcb3f974

由 Sean Hefty 提交于 8月 01, 2007

Automatically queue MRA message to decrease the number of retries sent
by the remote side during connection establishment.  This also has the
effect of increasing the overall connection timeout without using a
longer retry time in the case of dropped packets.
Signed-off-by: NSean Hefty <sean.hefty@intel.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

dcb3f974

IB/cm: Modify interface to send MRAs in response to duplicate messages · de98b693

由 Sean Hefty 提交于 8月 01, 2007

The IB CM provides a message received acknowledged (MRA) message that
can be sent to indicate that a REQ or REP message has been received, but
will require more time to process than the timeout specified by those
messages.  In many cases, the application may not know how long it will
take to respond to a CM message, but the majority of the time, it will
usually respond before a retry has been sent.  Rather than sending an
MRA in response to all messages just to handle the case where a longer
timeout is needed, it is more efficient to queue the MRA for sending in
case a duplicate message is received.

This avoids sending an MRA when it is not needed, but limits the number
of times that a REQ or REP will be resent.  It also provides for a
simpler implementation than generating the MRA based on a timer event.
(That is, trying to send the MRA after receiving the first REQ or REP if
a response has not been generated, so that it is received at the remote
side before a duplicate REQ or REP has been received)
Signed-off-by: NSean Hefty <sean.hefty@intel.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

de98b693

IB/mthca: Increase max number of QPs per multicast group to 56 · 1a1eb6a6

由 Roland Dreier 提交于 10月 09, 2007

Increase the number of QPs allowed per multicast group from 8 to 56.
This allows for one QP per core on 16-core systems, which are now
quite common, and allows some space for future growth.

This is basically the same patch that Jack Morgenstein
<jackm@dev.mellanox.co.il> just supplied for mlx4.
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

1a1eb6a6

mlx4_core: Increase max number of QPs per multicast group to 56 · e57ac0c2

由 Jack Morgenstein 提交于 10月 02, 2007

Increase the number of QPs allowed per multicast group from 8 to 56.
This allows for one QP per core on 16-core systems, which are now
quite common, and allows some space for future growth.
Signed-off-by: NJack Morgenstein <jackm@dev.mellanox.co.il>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

e57ac0c2

IB/mlx4: Implement FMRs · 8ad11fb6

由 Jack Morgenstein 提交于 8月 01, 2007

Implement FMRs for mlx4.  This is an adaptation of code from mthca.
Signed-off-by: NJack Morgenstein <jackm@dev.mellanox.co.il>
Signed-off-by: NMichael S. Tsirkin <mst@dev.mellanox.co.il>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

8ad11fb6

mlx4_core: Write MTTs from CPU instead with of WRITE_MTT FW command · d7bb58fb

由 Jack Morgenstein 提交于 8月 01, 2007

Write MTT entries directly to ICM from the driver (eliminating use of
WRITE_MTT command).  This reduces the number of FW commands needed to
register an MR by at least a factor of 2 and speeds up memory
registration significantly.  This code will also be used to implement
FMRs.
Signed-off-by: NJack Morgenstein <jackm@dev.mellanox.co.il>
Signed-off-by: NMichael S. Tsirkin <mst@dev.mellanox.co.il>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

d7bb58fb

mlx4_core: Fix meaning of dev->caps.reserved_mtts · 121964ec

由 Roland Dreier 提交于 10月 09, 2007

Everything that uses caps.reserved_mtts expects it to be a count of MTT
segments, not MTT entries.  So convert the value that the FW gives us to
a count of segments.
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

121964ec

mlx4_core: Reserve the correct number of MTT segments · cf78237d

由 Roland Dreier 提交于 10月 09, 2007

Taking ilog2(dev->caps.reserved_mtts) to find out the order to pass to
the MTT buddy allocator will do the wrong thing if reserved_mtts is ever
not a power of 2. Be safe and use fls(dev->caps.reserved_mtts - 1).
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

cf78237d

mlx4_core: Support ICM tables in coherent memory · 5b0bf5e2

由 Jack Morgenstein 提交于 8月 01, 2007

Enable having ICM tables in coherent memory, and use coherent memory
for the dMPT table. This will allow writing MPT entries for MRs both
via the SW2HW_MPT command and also directly by the driver for FMR
remapping without needing to flush or worry about cacheline boundaries.
Signed-off-by: NJack Morgenstein <jackm@dev.mellanox.co.il>
Signed-off-by: NMichael S. Tsirkin <mst@dev.mellanox.co.il>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

5b0bf5e2

IB/uverbs: Make ib_uverbs_release_event_file() static · 04d29b0e

由 Roland Dreier 提交于 10月 09, 2007

ib_uverbs_release_event_file() is only used in uverbs_main.c, so make it
static to that file.  Also move the definition before the first use, so
a forward declaration is not needed.
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

04d29b0e

IB/umad: Fix bit ordering and 32-on-64 problems on big endian systems · a394f83b

由 Roland Dreier 提交于 10月 09, 2007

The declaration of struct ib_user_mad_reg_req.method_mask[] exported
to userspace was an array of __u32, but the kernel internally treated
it as a bitmap made up of longs.  This makes a difference for 64-bit
big-endian kernels, where numbering the bits in an array of__u32 gives:

    |31.....0|63....31|95....64|127...96|

while numbering the bits in an array of longs gives:

    |63..............0|127............64|

64-bit userspace can handle this by just treating method_mask[] as an
array of longs, but 32-bit userspace is really stuck: the meaning of
the bits in method_mask[] depends on whether the kernel is 32-bit or
64-bit, and there's no sane way for userspace to know that.

Fix this by updating <rdma/ib_user_mad.h> to make it clear that
method_mask[] is an array of longs, and using a compat_ioctl method to
convert to an array of 64-bit longs to handle the 32-on-64 problem.
This fixes the interface description to match existing behavior (so
working binaries continue to work) in almost all situations, and gives
consistent semantics in the case of 32-bit userspace that can run on
either a 32-bit or 64-bit kernel, so that the same binary can work for
both 32-on-32 and 32-on-64 systems.
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

a394f83b

IB/umad: Add P_Key index support · 2be8e3ee

由 Roland Dreier 提交于 10月 09, 2007

Add support for setting the P_Key index of sent MADs and getting the
P_Key index of received MADs.  This requires a change to the layout of
the ABI structure struct ib_user_mad_hdr, so to avoid breaking
compatibility, we default to the old (unchanged) ABI and add a new
ioctl IB_USER_MAD_ENABLE_PKEY that allows applications that are aware
of the new ABI to opt into using it.

We plan on switching to the new ABI by default in a year or so, and
this patch adds a warning that is printed when an application uses the
old ABI, to push people towards converting to the new ABI.
Signed-off-by: NRoland Dreier <rolandd@cisco.com>
Reviewed-by: NSean Hefty <sean.hefty@intel.com>
Reviewed-by: NHal Rosenstock <hal@xsigo.com>

2be8e3ee

IB/ehca: Return srq_attr->max_sge in ehca_query_srq() · c01759ce

由 Joachim Fenkes 提交于 9月 28, 2007

Totally forgot this.
Signed-off-by: NJoachim Fenkes <fenkes@de.ibm.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

c01759ce

H
IB/ehca: Adjust 64-bit alignment of create QP response for userspace · a6607223
由 Hoang-Nam Nguyen 提交于 9月 28, 2007
```
Signed-off-by: NHoang-Nam Nguyen <hnguyen@de.ibm.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>
```
a6607223
H
IB/ehca: Fix mem leak of firmware ctrlblock in ehca_create_srq() · 03f72a51
由 Hoang-Nam Nguyen 提交于 9月 28, 2007
```
Signed-off-by: NHoang-Nam Nguyen <hnguyen@de.ibm.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>
```
03f72a51

IB/mlx4: Display misc device information under /sys/class/infiniband/ · cd9281d8

由 Jack Morgenstein 提交于 9月 18, 2007

display the following device information under /sys/class/infiniband/mlx4_X:
board_id, fw_ver, hw_rev, hca_type.

This patch makes this information available to userspace utilities
such as ibstat and ibv_devinfo.
Signed-off-by: NJack Morgenstein <jackm@dev.mellanox.co.il>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

cd9281d8

IB/core: Fix handling of multicast response failures · 57cb61d5

由 Ralph Campbell 提交于 9月 20, 2007

I was looking at the code for multicast.c and noticed that
ib_sa_join_multicast() calls queue_join() which puts the
request at the front of the group->pending_list.  If this
is a second request, it seems like it would interfere with
process_join_error() since group->last_join won't point
to the member at the head of the pending_list. The sequence
would thus be:

1. ib_sa_join_multicast()
   puts member1 on head of pending_list and starts work thread
2. mcast_work_handler()
   calls send_join() which sets group->last_join to member1
3. ib_sa_join_multicast()
   puts member2 on head of pending_list
4. join operation for member1 receives failures response from SA.
5. join_handler() is called with error status
6. process_join_error() fails to process member1 since
   it doesn't match the first entry in the group->pending_list.

The impact is that the failed join request is tossed.  The second
request is processed, and after it completes, the original request ends
up being retried.

This change also results in join requests being processed in FIFO
order.
Signed-off-by: NRalph Campbell <ralph.campbell@qlogic.com>
Signed-off-by: NSean Hefty <sean.hefty@intel.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

57cb61d5

IB/ehca: Misc cpuinit section annotations and #ifdef cleanups · 9faa559c

由 Satyam Sharma 提交于 8月 23, 2007

* Replace {un}register_cpu_notifier with {un}register_hotcpu_notifier
  thereby losing a couple of #ifdef HOTPLUG_CPU pairs.
* Move comp_pool_callback_nb declaration to below that of callback
  function so that initialization of .notifier_call and .priority can
  occur at build time itself and not runtime.
* Mark the notifier_block (and callback function, and another static
  function used by it) as __cpuinit{data} for the sake of consistency
  and remove enclosing #ifdef. (This may increase size for modular
  build of this module, however, because these are no longer dropped
  unconditionally now.)
Signed-off-by: NSatyam Sharma <satyam@infradead.org>
Acked-by: NJoachim Fenkes <fenkes@de.ibm.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

9faa559c

mlx4_core: Change capability decoding: SRC->XRC · ea98054f

由 Roland Dreier 提交于 10月 09, 2007

The SRC ("scalable RC") transport has been renamed to XRC ("extended 
RC"), to avoid having an abbreviation that is so easily confused with an 
abbreviation for "source."  Update the HCA capability decoding output to 
use the new name.
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

ea98054f

IB/iser: Remove unnecessary includes · ec2a1344

由 Roland Dreier 提交于 10月 09, 2007

<asm/scatterlist.h> is not needed because everyplace it appears,
<linux/scatterlist.h> also appears.  <asm/io.h> is not needed because
nothing seems to be using device IO anyway.
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

ec2a1344

RDMA/cma: Use neigh_event_send() to start neighbour discovery · 935ef2d7

由 Steve Wise 提交于 9月 12, 2007

Calling arp_send() to initiate neighbour discovery (ND) doesn't do the
full ND protocol.  Namely, it doesn't handle retransmitting the arp
request if it is dropped. The function neigh_event_send() does all
this.  Without doing full ND, RDMA address resolution fails in the
presence of dropped ARP broadcast packets.
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Acked-by: NSean Hefty <sean.hefty@intel.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

935ef2d7

IB/ehca: Only use MR large pages for hugetlb regions · 3a31c419

由 Joachim Fenkes 提交于 9月 13, 2007

...because, on virtualized hardware like System p, we can't be sure
that the physical pages behind them are contiguous otherwise.
Signed-off-by: NJoachim Fenkes <fenkes@de.ibm.com>
Signed-off-by: NRoland Dreier <rolandd@cisco.com>

3a31c419

openeuler / Kernel 大约 1 年 前同步成功

openeuler / Kernel
大约 1 年前同步成功