Commits · d84106477733cb155c5dcaea664ddf120bf69eb7 · openeuler / raspberrypi-kernel

06 Sep, 2009 · 5 commits

IB/mthca: Don't allow userspace open while recovering from catastrophic error · d8410647

Authored by Jack Morgenstein on Sep 05, 2009

Userspace apps are supposed to release all ib device resources if they
receive a fatal async event (IBV_EVENT_DEVICE_FATAL).  However, the
app has no way of knowing when the device has come back up, except to
repeatedly attempt ibv_open_device() until it succeeds.

However, currently there is no protection against the open succeeding
while the device is being removed following the fatal event.  In
this case, the open will succeed, but as a result the device waits in
the middle of its removal until the new app releases its resources --
and the new app will not do so, since the open succeeded at a point
following the fatal event generation.

This patch adds an "active" flag to the device. The active flag is set
to false (in the fatal event flow) before the "fatal" event is
generated, so any subsequent ibv_open_device() call to the device will
fail until the device comes back up, thus preventing the above
deadlock.
Signed-off-by: Jack Morgenstein <jackm@dev.mellanox.co.il>
Signed-off-by: Roland Dreier <rolandd@cisco.com>
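The "active" flag described in this commit message can be sketched in plain C as follows. This is a minimal userspace illustration, not the mthca code: `struct demo_dev`, the `demo_*` functions, and the `-EAGAIN` return value are all assumptions made for the example, and a real driver would also need locking around the flag.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>

/* Hypothetical stand-in for per-device state; not the mthca structure. */
struct demo_dev {
    bool active;      /* false from the fatal event until the device is back up */
    int  open_count;  /* userspace contexts currently holding the device */
};

/* Open path: refuse new users while recovery is in progress. */
static int demo_open(struct demo_dev *dev)
{
    if (!dev->active)
        return -EAGAIN;   /* assumed error code; the caller retries later */
    dev->open_count++;
    return 0;
}

static void demo_close(struct demo_dev *dev)
{
    assert(dev->open_count > 0);
    dev->open_count--;
}

/* Fatal-event path: clear the flag BEFORE the event is announced, so an
 * application reacting to the event can never win the race and reopen. */
static void demo_fatal_event(struct demo_dev *dev)
{
    dev->active = false;
    /* the fatal async event would be dispatched to userspace here */
}

/* Set the flag again only once the device has fully come back up. */
static void demo_recovered(struct demo_dev *dev)
{
    dev->active = true;
}
```

With this ordering, an application that polls the open call after a fatal event keeps failing until recovery completes, so removal never waits on a context that was opened after the event.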

IB/mthca: Distinguish multiple devices in /proc/interrupts · d94a8689

Authored by Arputham Benjamin on Sep 05, 2009

The mthca driver uses the same name for interrupts for every
device in the system.  This can make it very confusing trying to work
out exactly which device MSI-X interrupts are for.  Change the driver
to add the PCI name of the device to the interrupt name.
Signed-off-by: Arputham Benjamin <abenjamin@sgi.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>
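As a rough illustration of the naming change, the sketch below builds a per-device interrupt name by appending the PCI name to the event-queue name. The `name@pci:address` format, the buffer size, and the function name are assumptions chosen for the example. The commit message only says that the PCI name is added.

```c
#include <assert.h>
#include <stdio.h>

#define DEMO_IRQ_NAME_LEN 64   /* hypothetical buffer size */

/* Combine an event-queue name with the device's PCI name so that each
 * device's vectors show up under a distinct name in /proc/interrupts.
 * Returns what snprintf() returns: the length the full name needs. */
static int demo_irq_name(char *buf, size_t len,
                         const char *eq_name, const char *pci_name)
{
    assert(buf != NULL && len > 0);
    return snprintf(buf, len, "%s@pci:%s", eq_name, pci_name);
}
```

For example, calling it with `"mthca-comp"` and `"0000:03:00.0"` yields `mthca-comp@pci:0000:03:00.0`, while a second adapter at a different PCI address gets a different string.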

IB/mthca: Annotate CQ locking · ffe063f3

Authored by Roland Dreier on Sep 05, 2009

mthca_ib_lock_cqs()/mthca_ib_unlock_cqs() are helper functions that
lock/unlock both CQs attached to a QP in the proper order to avoid
AB-BA deadlocks.  Annotate this so sparse can understand what's going
on (and warn us if we misuse these functions).
Signed-off-by: Roland Dreier <rolandd@cisco.com>
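The ordered double-lock pattern and its annotations can be sketched in userspace C as below. This is an illustration under assumptions: pthread mutexes stand in for kernel spinlocks, the `demo_*` names are hypothetical, and ordering by CQ number is one plausible choice of a fixed global order. The `__acquires`/`__releases` macros mirror the kernel's sparse annotations and expand to nothing outside a sparse run.

```c
#include <assert.h>
#include <pthread.h>

/* Sparse sees context attributes; an ordinary compiler sees nothing. */
#ifdef __CHECKER__
# define __acquires(x) __attribute__((context(x, 0, 1)))
# define __releases(x) __attribute__((context(x, 1, 0)))
#else
# define __acquires(x)
# define __releases(x)
#endif

struct demo_cq {
    unsigned int    cqn;   /* CQ number, used as the lock-ordering key */
    pthread_mutex_t lock;
};

/* Pick which CQ is locked first, independent of argument order. */
static struct demo_cq *demo_first_cq(struct demo_cq *a, struct demo_cq *b)
{
    return a->cqn <= b->cqn ? a : b;
}

static void demo_lock_cqs(struct demo_cq *send_cq, struct demo_cq *recv_cq)
    __acquires(&send_cq->lock) __acquires(&recv_cq->lock)
{
    struct demo_cq *first = demo_first_cq(send_cq, recv_cq);
    struct demo_cq *second = (first == send_cq) ? recv_cq : send_cq;

    /* Distinct CQs must have distinct numbers for the order to be total. */
    assert(send_cq == recv_cq || send_cq->cqn != recv_cq->cqn);
    pthread_mutex_lock(&first->lock);
    if (second != first)          /* a QP may use one CQ for both queues */
        pthread_mutex_lock(&second->lock);
}

static void demo_unlock_cqs(struct demo_cq *send_cq, struct demo_cq *recv_cq)
    __releases(&send_cq->lock) __releases(&recv_cq->lock)
{
    struct demo_cq *first = demo_first_cq(send_cq, recv_cq);
    struct demo_cq *second = (first == send_cq) ? recv_cq : send_cq;

    if (second != first)
        pthread_mutex_unlock(&second->lock);   /* release in reverse order */
    pthread_mutex_unlock(&first->lock);
}
```

Because every caller takes the lower-numbered CQ first, two threads locking the same pair in opposite argument order cannot each hold one lock while waiting for the other.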

IB/mthca: Remove unnecessary include of <linux/init.h> · deecb5d6

Authored by Roland Dreier on Sep 05, 2009

mthca_reset.c doesn't have any function annotations, so there's no
reason to include <linux/init.h>.
Signed-off-by: Roland Dreier <rolandd@cisco.com>

IB/mthca: Remove unnecessary include of <asm/page.h> · fc128558

Authored by Roland Dreier on Sep 05, 2009

mthca_config_reg.h was including <asm/page.h> for no reason -- the whole
file is just defines of constants, so it's entirely self-contained.
Signed-off-by: Roland Dreier <rolandd@cisco.com>

24 Jun, 2009 · 1 commit

IB/ehca: Bump version number · 1d4d6da5

Authored by Alexander Schmidt on Jun 23, 2009

Increment version number for DMEM toleration.
Signed-off-by: Alexander Schmidt <alexs@linux.vnet.ibm.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

23 Jun, 2009 · 5 commits

IB/mthca: Replace dma_sync_single() use with proper functions · 99987bea

Authored by Roland Dreier on Jun 22, 2009

dma_sync_single() is deprecated now, and the use in mthca is wrong:
there should be a dma_sync_single_for_cpu() before touching the memory
from the CPU, and a dma_sync_single_for_device() afterwards.  Fix
this, prompted by a kick in the pants from a patch from FUJITA
Tomonori <fujita.tomonori@lab.ntt.co.jp>.
Signed-off-by: Roland Dreier <rolandd@cisco.com>

RDMA/nes: Fix FIN state handling under error conditions · 68237a0f

Authored by Faisal Latif on Jun 22, 2009

During cluster testing, one QP was not closed, as FIN is not handled
properly when its rexmit count expires or in some cases when RST is
received after sending FIN.  The reason is that the reference count on
the cm_id does not get decremented under these conditions.
Signed-off-by: Faisal Latif <faisal.latif@intel.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

RDMA/nes: Fix max_qp_init_rd_atom returned from query device · 66388d67

Authored by Faisal Latif on Jun 22, 2009

In nes_query_device(), max_qp_init_rd_atom is incorrectly set to
max_qp_wr.  This was found when a test application had a dapl async
event error.
Signed-off-by: Faisal Latif <faisal.latif@intel.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

IB/ehca: Ensure that guid_entry index is not negative · af04662b

Authored by Roel Kluin on Jun 22, 2009

This prevents the memcpy() of a guid_entries element using a negative index.
Signed-off-by: Roel Kluin <roel.kluin@gmail.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

IB/ehca: Tolerate dynamic memory operations before driver load · 0cf89dcd

Authored by Hannes Hering on Jun 22, 2009

Implement toleration of dynamic memory operations and 16 GB gigantic
pages, where "toleration" means that the driver can cope with dynamic
memory operations that happen before the driver is loaded.  While the
ehca driver is loaded, dynamic memory operations are still prohibited
by returning NOTIFY_BAD from the memory notifier.

On module load the driver walks through available system memory,
checks for available memory ranges and then registers the kernel
internal memory region accordingly.  The translation of address ranges
is implemented via a 3-level busmap.
Signed-off-by: Hannes Hering <hering2@de.ibm.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

16 Jun, 2009 · 1 commit

infiniband: ehca: remove driver_data direct access of struct device · f899c2dd

Authored by Greg Kroah-Hartman on May 04, 2009

In the near future, the driver core is going to not allow direct access
to the driver_data pointer in struct device.  Instead, the functions
dev_get_drvdata() and dev_set_drvdata() should be used.  These functions
have been around since the beginning, so are backwards compatible with
all older kernel versions.

Cc: Sean Hefty <sean.hefty@intel.com>
Cc: Roland Dreier <rolandd@cisco.com>
Cc: Hal Rosenstock <hal.rosenstock@gmail.com>
Cc: general@lists.openfabrics.org
Cc: Christoph Raisch <raisch@de.ibm.com>
Acked-by: Hoang-Nam Nguyen <hnguyen@de.ibm.com>
Signed-off-by: Greg Kroah-Hartman <gregkh@suse.de>

14 Jun, 2009 · 1 commit

IB/mthca: Don't double-free IRQs when falling back from MSI-X to INTx · 9aa0a489

Authored by Roland Dreier on Jun 13, 2009

When both MSI-X and legacy INTx fail to generate an interrupt, the
driver frees the MSI-X interrupts twice.  Fix this by clearing the
have_irq flag for the MSI-X interrupts when they are freed the first
time.
Reported-by: Yinghai Lu <yhlu.kernel@gmail.com>
Tested-by: Yinghai Lu <yhlu.kernel@gmail.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>
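The double-free fix amounts to making the free path idempotent. The sketch below shows the idea with hypothetical names (`struct demo_eq`, `demo_free_irqs`) and a counter standing in for the real `free_irq()` calls. It is not the mthca code.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define DEMO_NUM_EQ 3   /* hypothetical number of event queues */

struct demo_eq {
    int  irq;        /* vector assigned to this event queue */
    bool have_irq;   /* true only while the vector is requested */
};

static int demo_free_calls;   /* counts simulated free_irq() calls */

/* Free every vector that is still held, and clear the flag so that a
 * second pass (for example during the INTx fallback cleanup) is a no-op. */
static void demo_free_irqs(struct demo_eq *eq, size_t n)
{
    assert(eq != NULL);
    for (size_t i = 0; i < n; i++) {
        if (!eq[i].have_irq)
            continue;
        demo_free_calls++;        /* free_irq() would be called here */
        eq[i].have_irq = false;   /* the step the fix adds */
    }
}
```

Calling `demo_free_irqs()` twice on the same array releases each vector exactly once, which is the behavior the commit restores for the MSI-X failure path.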

06 Jun, 2009 · 1 commit

IB/mlx4: Add strong ordering to local inval and fast reg work requests · 2ac6bf4d

Authored by Jack Morgenstein on Jun 05, 2009

The ConnectX Programmer's Reference Manual states that the "SO" bit
must be set when posting Fast Register and Local Invalidate send work
requests.  When this bit is set, the work request will be executed
only after all previous work requests on the send queue have been
executed.  (If the bit is not set, Fast Register and Local Invalidate
WQEs may begin execution too early, which violates the defined
semantics for these operations.)

This fixes the issue with NFS/RDMA reported in
<http://lists.openfabrics.org/pipermail/general/2009-April/059253.html>
Signed-off-by: Jack Morgenstein <jackm@dev.mellanox.co.il>
Cc: <stable@kernel.org>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

04 Jun, 2009 · 1 commit

IB/ehca: Remove superfluous bitmasks from QP control block · 25a52393

Authored by Joachim Fenkes on Jun 03, 2009

All the fields in the control block are nicely right-aligned, so no
masking is necessary.
Signed-off-by: Joachim Fenkes <fenkes@de.ibm.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

28 May, 2009 · 3 commits

RDMA/cxgb3: Limit fast register size based on T3 limitations · 3026c19a

Authored by Steve Wise on May 27, 2009

T3 firmware only supports one WR's worth of page list for fast register
work requests.  The driver currently allows two WRs' worth, which
doesn't work for T3, so reduce the limit in the driver.
Signed-off-by: Steve Wise <swise@opengridcomputing.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

RDMA/cxgb3: Report correct port state and MTU · 7ab1a2b3

Authored by Steve Wise on May 27, 2009

Signed-off-by: Steve Wise <swise@opengridcomputing.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

IB/mthca: Add module parameter for number of MTTs per segment · c1f67a88

Authored by Eli Cohen on May 27, 2009

The current MTT allocator uses kmalloc() to allocate a buffer for its
buddy allocator, and thus is limited in the amount of MTT segments
that it can control.  As a result, the size of memory that can be
registered is limited too.  This patch uses a module parameter to
control the number of MTT entries that each segment represents,
allowing more memory to be registered with the same number of
segments.
Signed-off-by: Eli Cohen <eli@mellanox.co.il>
Signed-off-by: Roland Dreier <rolandd@cisco.com>
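The effect of the parameter can be worked through with some simple arithmetic. The sketch below assumes that each MTT entry maps one page and that the parameter is expressed as a log2 value. The function name, the segment count, and the page size used in the example are illustrative assumptions, not values taken from the driver.

```c
#include <assert.h>
#include <stdint.h>

/* Upper bound on registrable memory for a fixed number of segments:
 *   bytes = segments * (MTT entries per segment) * (page size)
 * Both the entries-per-segment and the page size are passed as log2.
 * The caller must pick values whose product fits in 64 bits. */
static uint64_t demo_max_registrable_bytes(uint64_t num_segs,
                                           unsigned int log_mtts_per_seg,
                                           unsigned int page_shift)
{
    assert(log_mtts_per_seg + page_shift < 64);
    return (num_segs << log_mtts_per_seg) << page_shift;
}
```

With a hypothetical 2^20 segments and 4 KiB pages, 8 entries per segment (log2 = 3) covers 2^35 bytes (32 GiB), and 32 entries per segment (log2 = 5) covers 2^37 bytes (128 GiB). The buddy allocator tracks the same number of segments in both cases.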

16 May, 2009 · 1 commit

RDMA/nes: Fix off-by-one bugs in reset_adapter_ne020() and init_serdes() · 28e43a51

Authored by Roel Kluin on May 15, 2009

With a postfix increment, i is incremented one past 10K/5K before the
loop ends, so the error messages will be displayed too soon if the
test succeeds on the last iteration.  Fix the comparisons to be >
instead of >=.
Signed-off-by: Roel Kluin <roel.kluin@gmail.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>
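The off-by-one is easiest to see by tracing the counter. The sketch below reproduces the loop shape in userspace with a simulated status register. `struct demo_hw` and the `demo_*` functions are hypothetical, and the real loops use limits of 10000 and 5000 rather than the small limit used when testing.

```c
#include <assert.h>
#include <stdbool.h>

/* Simulated hardware: becomes ready on poll number `ready_on_poll`
 * (1-based), or never if that field is 0. */
struct demo_hw {
    unsigned int polls;
    unsigned int ready_on_poll;
};

static bool demo_ready(struct demo_hw *hw)
{
    hw->polls++;
    return hw->ready_on_poll != 0 && hw->polls >= hw->ready_on_poll;
}

/* Same shape as the driver loops: the postfix increment runs even on the
 * comparison that fails, so a timeout leaves i at limit + 1, while
 * success on the last allowed iteration leaves i at exactly limit. */
static unsigned int demo_poll(struct demo_hw *hw, unsigned int limit)
{
    unsigned int i = 0;

    assert(limit > 0);
    while (!demo_ready(hw) && i++ < limit)
        ;   /* the driver would delay here */
    return i;
}

/* Correct check: ">" rather than ">=", which would misreport a success
 * on the final iteration as a timeout. */
static bool demo_timed_out(unsigned int i, unsigned int limit)
{
    return i > limit;
}
```

With a limit of 3, hardware that becomes ready on the fourth poll exits with `i == 3` and is not a timeout, while hardware that never becomes ready exits with `i == 4`.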

14 May, 2009 · 5 commits

infiniband: Remove void casts · 5b891a93

Authored by Jack Stone on May 13, 2009

Remove unneeded casts of void *.
Signed-off-by: Jack Stone <jwjstone@fastmail.fm>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

IB/ehca: Increment version number · bde2cfaf

Authored by Stefan Roscher on May 13, 2009

Signed-off-by: Stefan Roscher <stefan.roscher@de.ibm.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

IB/ehca: Remove unnecessary memory operations for userspace queue pairs · 1988d1fa

Authored by Stefan Roscher on May 13, 2009

The queue map for flush completion circumvention is only used for
kernel space queue pairs.  This patch skips the allocation of the
queue maps in case the QP is created for userspace.  In addition, this
patch does not iomap the galpas for kernel usage if the queue pair is
only used in userspace.  These changes will improve the performance of
creation of userspace queue pairs.
Signed-off-by: Stefan Roscher <stefan.roscher@de.ibm.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

IB/ehca: Fall back to vmalloc() for big allocations · c94f156f

Authored by Stefan Roscher on May 13, 2009

In case of large queue pairs there is the possibility of allocation
failures due to memory fragmentation when using kmalloc(). To ensure
the memory is allocated even if kmalloc() cannot find chunks which
are big enough, we fall back to allocating the memory with vmalloc().
Signed-off-by: Stefan Roscher <stefan.roscher@de.ibm.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

IB/ehca: Replace vmalloc() with kmalloc() for queue allocation · bf31a1a0

Authored by Anton Blanchard on May 13, 2009

To improve performance of driver resource allocation, replace
vmalloc() calls with kmalloc().
Signed-off-by: Stefan Roscher <stefan.roscher@de.ibm.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

09 May, 2009 · 1 commit

Fix deadlock in ipathfs ->get_sb() · 265e771e

Authored by Al Viro on May 06, 2009

forgot to unlock superblock before calling deactivate_super()...
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

08 May, 2009 · 1 commit

IB/mlx4: Don't overwrite fast registration page list when posting work request · 2b6b7d4b

Authored by Jack Morgenstein on May 07, 2009

The low-level mlx4 driver converted the page-list addresses of fast
register work requests to big-endian in place when posting the send,
and set a "present" bit in each.  This caused problems later when the
consumer attempted to unmap the pages using the page list, whose
addresses it assumed were still in CPU-endian order.  Fix the mlx4
driver to allocate two buffers and use a private buffer for the
hardware-format bus addresses.

This patch fixes <https://bugs.openfabrics.org/show_bug.cgi?id=1571>,
an NFS/RDMA server crash.  The cause of the crash was found by Vu Pham
of Mellanox.  The fix is along the lines suggested by Steve Wise in
comment #21 in bug 1571.
Signed-off-by: Jack Morgenstein <jackm@dev.mellanox.co.il>
Signed-off-by: Roland Dreier <rolandd@cisco.com>
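The two-buffer approach can be sketched as follows: the consumer's page list is only read, and the big-endian, flag-tagged copy goes into a separate buffer owned by the driver. The names and the position of the "present" bit are assumptions for illustration. This is not the mlx4 implementation.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define DEMO_FLAG_PRESENT 1ULL   /* hypothetical "present" bit (bit 0) */

/* Store v in big-endian byte order regardless of host endianness. */
static void demo_put_be64(uint8_t out[8], uint64_t v)
{
    for (int i = 0; i < 8; i++)
        out[i] = (uint8_t)(v >> (56 - 8 * i));
}

/* Fill the driver-private, hardware-format buffer (8 bytes per page).
 * The consumer's page_list is const here: it stays in CPU byte order,
 * so a later unmap based on it still sees the original addresses. */
static void demo_build_hw_page_list(uint8_t *hw_list,
                                    const uint64_t *page_list,
                                    size_t npages)
{
    assert(hw_list != NULL && page_list != NULL);
    for (size_t i = 0; i < npages; i++)
        demo_put_be64(hw_list + 8 * i, page_list[i] | DEMO_FLAG_PRESENT);
}
```

Keeping the conversion out of the consumer-visible list costs one extra buffer per page list, but it removes the hidden side effect that led to the crash described above.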

30 Apr, 2009 · 1 commit

RDMA/cxgb3: Don't complete flushed send work requests twice · ec6995dd

Authored by Steve Wise on Apr 29, 2009

When the SQ is flushed, mark the flushed entries as not signaled so
the poll logic doesn't re-insert the CQ entry, thinking it's an
out-of-order completion.

The bug can cause the NFS/RDMA server to crash due to processing the
same completed work request twice.
Signed-off-by: Steve Wise <swise@opengridcomputing.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

28 Apr, 2009 · 12 commits

RDMA/nes: Update iw_nes version · 26cc5e57

Authored by Chien Tung on Apr 27, 2009

Update version number to 1.5.0.0
Signed-off-by: Chien Tung <chien.tin.tung@intel.com>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

RDMA/nes: Fix error path in nes_accept() · 9256b251