Commits · d84106477733cb155c5dcaea664ddf120bf69eb7 · openeuler / raspberrypi-kernel

06 Sep, 2009 1 commit

IB/mthca: Don't allow userspace open while recovering from catastrophic error · d8410647

Committed by Jack Morgenstein on Sep 05, 2009

Userspace apps are supposed to release all ib device resources if they
receive a fatal async event (IBV_EVENT_DEVICE_FATAL).  However, the
app has no way of knowing when the device has come back up, except to
repeatedly attempt ibv_open_device() until it succeeds.

However, currently there is no protection against the open succeeding
while the device is being removed following the fatal event.  In
this case, the open will succeed, but as a result the device waits in
the middle of its removal until the new app releases its resources --
and the new app will not do so, since the open succeeded at a point
following the fatal event generation.

This patch adds an "active" flag to the device. The active flag is set
to false (in the fatal event flow) before the "fatal" event is
generated, so any subsequent ibv_open_device() call to the device will
fail until the device comes back up, thus preventing the above
deadlock.
Signed-off-by: Jack Morgenstein <jackm@dev.mellanox.co.il>
Signed-off-by: Roland Dreier <rolandd@cisco.com>
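The ordering described in this message can be modelled in a few lines of plain C. This is only a userspace sketch under stated assumptions, not the driver's code: `struct sketch_dev` and every `sketch_*` function are hypothetical names invented for illustration, and the real patch works on the driver's own device structure.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in for per-device state; not the driver's real struct. */
struct sketch_dev {
    bool active;        /* cleared before the fatal event is generated */
    int  open_contexts; /* userspace contexts currently holding the device */
};

/* Open path: refuse new users while the device is going through recovery. */
static int sketch_open(struct sketch_dev *dev)
{
    if (!dev->active)
        return -1;      /* caller is expected to retry later */
    dev->open_contexts++;
    return 0;
}

/* Release path: drop one context that was previously opened. */
static void sketch_close(struct sketch_dev *dev)
{
    assert(dev->open_contexts > 0);
    dev->open_contexts--;
}

/* Fatal-event flow: clear the flag first, then notify existing users. */
static void sketch_fatal_event(struct sketch_dev *dev)
{
    dev->active = false;
    /* ... the fatal async event would be dispatched to open contexts here ... */
}

/* Removal can only finish once every context has been released. */
static bool sketch_can_finish_removal(const struct sketch_dev *dev)
{
    return !dev->active && dev->open_contexts == 0;
}

/* Re-add after reset: the device accepts opens again. */
static void sketch_device_back_up(struct sketch_dev *dev)
{
    dev->active = true;
}
```

Because the flag is cleared before the event goes out, an application that reacts to the event by closing and immediately reopening gets a failure from `sketch_open()` rather than a fresh context that would stall the removal.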

30 Sep, 2008 1 commit

IB/mthca: Use pci_request_regions() · 208dde28

Committed by Roland Dreier on Sep 29, 2008

Back in prehistoric (pre-git!) days, the kernel's MSI-X support did
request_mem_region() on a device's MSI-X tables, which meant that a
driver that enabled MSI-X couldn't use pci_request_regions() (since
that would clash with the PCI layer's MSI-X request).

However, that was removed (by me!) years ago, so mthca can just use
pci_request_regions() and pci_release_regions() instead of its own
much more complicated code that avoids requesting the MSI-X tables.
Signed-off-by: Roland Dreier <rolandd@cisco.com>

15 Jul, 2008 3 commits

IB/mthca: Use round_jiffies() for catastrophic error polling timer · c036925a

Committed by Roland Dreier on Jul 14, 2008

Exactly when the catastrophic error polling timer function runs is not
important, so use round_jiffies() to save unnecessary wakeups.
Signed-off-by: Roland Dreier <rolandd@cisco.com>
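The idea behind rounding a timer expiry can be shown with a small userspace model. This is a simplified sketch, not the kernel's round_jiffies() implementation: `SKETCH_HZ`, the quarter-second threshold, and `sketch_round_to_second()` are assumptions made for illustration, and any per-CPU adjustment the kernel may apply is not modelled.

```c
/* Assumed tick rate, for illustration only. */
#define SKETCH_HZ 1000UL

/*
 * Move an expiry time j to a whole-second boundary so that timers which do
 * not care about exact timing tend to fire together in one wakeup.
 * 'now' is the current tick count; the result is never at or before 'now'.
 */
static unsigned long sketch_round_to_second(unsigned long j, unsigned long now)
{
    unsigned long rem = j % SKETCH_HZ;
    unsigned long rounded;

    if (rem < SKETCH_HZ / 4)
        rounded = j - rem;              /* just past a boundary: round down */
    else
        rounded = j - rem + SKETCH_HZ;  /* otherwise round up */

    /* If rounding would put the expiry in the past, keep the original. */
    return rounded > now ? rounded : j;
}
```

A polling timer such as the one in this commit only needs to run "about every so often", so trading a fraction of a second of accuracy for fewer separate wakeups costs nothing.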

IB/mthca: Remove "stop" flag for catastrophic error polling timer · 4522e08c

Committed by Roland Dreier on Jul 14, 2008

Since we use del_timer_sync() anyway, there's no need for an
additional flag to tell the timer not to rearm.
Signed-off-by: Roland Dreier <rolandd@cisco.com>

RDMA: Remove subversion $Id tags · f3781d2e

Committed by Roland Dreier on Jul 14, 2008

They don't get updated by git and so they're worse than useless.
Signed-off-by: Roland Dreier <rolandd@cisco.com>

22 Nov, 2006 1 commit

WorkStruct: make allyesconfig · c4028958

Committed by David Howells on Nov 22, 2006

Fix up for make allyesconfig.
Signed-Off-By: David Howells <dhowells@redhat.com>

23 Sep, 2006 1 commit

IB/mthca: Recover from catastrophic errors · b3b30f5e

Committed by Jack Morgenstein on Aug 15, 2006

Trigger device remove and then add when a catastrophic error is
detected in hardware.  This, in turn, will cause a device reset, which
we hope will recover from the catastrophic condition.

Since this might interfere with debugging the root cause, add a
module option to suppress this behaviour.
Signed-off-by: Jack Morgenstein <jackm@mellanox.co.il>
Signed-off-by: Michael S. Tsirkin <mst@mellanox.co.il>
Signed-off-by: Roland Dreier <rolandd@cisco.com>

11 Nov, 2005 1 commit

[IB] mthca: fix typo in catastrophic error polling · 0b4ff2c0

Committed by Roland Dreier on Nov 07, 2005

Fix a typo in the rearming of the catastrophic error polling timer: we
should rearm the timer as long as the stop flag is _not_ set.
Signed-off-by: Roland Dreier <rolandd@cisco.com>
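The corrected rearm condition can be sketched as follows. This is a userspace model, not the driver's timer function: `struct sketch_poller` and `sketch_poll_timer()` are hypothetical names, and a counter stands in for actually rearming a kernel timer. The message does not quote the faulty line, so the sketch shows only the intended logic.

```c
#include <stdbool.h>

/* Hypothetical polling state; a counter replaces the real timer. */
struct sketch_poller {
    bool stop;     /* set when polling should end */
    int  rearmed;  /* how many times the timer was rearmed */
};

/* Body of the timer callback: poll, then rearm only while stop is NOT set. */
static void sketch_poll_timer(struct sketch_poller *p)
{
    /* ... the error buffer would be polled here ... */
    if (!p->stop)
        p->rearmed++;  /* stands in for rearming the timer */
}
```

With the condition inverted, the timer would fire once and never again during normal operation, so error polling would silently stop.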

07 Nov, 2005 1 commit

[PATCH] fix remaining missing includes · 8c65b4a6

Committed by Tim Schmielau on Nov 07, 2005

Fix more include file problems that surfaced since I submitted the previous
fix-missing-includes.patch.  This should now make it possible to stop
including sched.h from module.h, which is done by a followup patch.
Signed-off-by: Tim Schmielau <tim@physik3.uni-rostock.de>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

28 Oct, 2005 1 commit

[IB] mthca: first pass at catastrophic error reporting · 3d155f8c

Committed by Roland Dreier on Oct 27, 2005

Add some initial support for detecting and reporting catastrophic
errors reported by Mellanox HCAs.  We start a periodic timer which
polls the catastrophic error reporting buffer in device memory.  If an
error is detected, we dump the contents of the buffer for post-mortem
debugging, and report a fatal asynchronous error to higher levels.

In the future we can try to recover from these errors by resetting the
device, but this will require some work in higher-level code as well.
Let's get this in now, so that we at least get catastrophic errors
reported in logs.
Signed-off-by: Roland Dreier <rolandd@cisco.com>
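The poll, dump, and report sequence described above might look roughly like this in plain C. It is a userspace sketch under stated assumptions: the buffer is modelled as an ordinary array rather than mapped device memory, all `sketch_*` names are hypothetical, and treating any nonzero word as an error is an assumption, since the message does not say how the hardware marks the buffer.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

/* Assumption: any nonzero word in the buffer means an error was recorded. */
static bool sketch_catas_detected(const uint32_t *buf, size_t words)
{
    for (size_t i = 0; i < words; ++i)
        if (buf[i] != 0)
            return true;
    return false;
}

/* Dump every word of the buffer for post-mortem debugging. */
static void sketch_dump(FILE *out, const uint32_t *buf, size_t words)
{
    for (size_t i = 0; i < words; ++i)
        fprintf(out, "  buf[%02zu]: %08x\n", i, (unsigned)buf[i]);
}

/* One timer tick: returns true if a fatal error should be reported upward. */
static bool sketch_poll_once(FILE *out, const uint32_t *buf, size_t words)
{
    if (!sketch_catas_detected(buf, words))
        return false;
    fprintf(out, "Catastrophic error detected\n");
    sketch_dump(out, buf, words);
    return true;
}
```

In the driver, a true result would correspond to raising the fatal asynchronous event to higher levels; here the caller simply gets a boolean back.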
