提交 · 7d8a804c594b61a05c698126165b5dc417d94a0f · openeuler / raspberrypi-kernel

24 12月, 2008 3 次提交

dlm: add time stamp of blocking callback · e3a84ad4

由 David Teigland 提交于 12月 09, 2008

Record the time the latest blocking callback was queued for
a lock.  This will be used for debugging in combination with
lock queue timestamp changes in the previous patch.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

e3a84ad4

dlm: change lock time stamping · eeda418d

由 David Teigland 提交于 12月 09, 2008

Use ktime instead of jiffies for timestamping lkb's.  Also stamp the
time on every lkb whenever it's added to a resource queue, instead of
just stamping locks subject to timeouts.  This will allow us to use
timestamps more widely for debugging all locks.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

eeda418d

dlm: improve how bast mode handling · fd22a51b

由 David Teigland 提交于 12月 09, 2008

The lkb bastmode value is set in the context of processing the
lock, and read by the dlm_astd thread.  Because it's accessed
in these two separate contexts, the writing/reading ought to
be done under a lock.  This is simple to do by setting it and
reading it when the lkb is added to and removed from dlm_astd's
callback list which is properly locked.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

fd22a51b

15 7月, 2008 2 次提交

dlm: fix uninitialized variable for search_rsb_list callers · 18c60c0a

由 Benny Halevy 提交于 6月 30, 2008

gcc 4.3.0 correctly emits the following warning.
search_rsb_list does not *r_ret if no dlm_rsb is found
and _search_rsb may pass the uninitialized value upstream
on the error path when both calls to search_rsb_list
return non-zero error.

The fix sets *r_ret to NULL on search_rsb_list's not-found path.
Signed-off-by: NBenny Halevy <bhalevy@panasas.com>
Signed-off-by: NDavid Teigland <teigland@redhat.com>

18c60c0a

dlm: fix basts for granted CW waiting PR/CW · 329fc4c3

由 David Teigland 提交于 5月 20, 2008

The fix in commit 36509258 was addressing
the case of a granted PR lock with waiting PR and CW locks.  It's a
special case that requires forcing a CW bast.  However, that forced CW
bast was incorrectly applying to a second condition where the granted
lock was CW.  So, the holder of a CW lock could receive an extraneous CW
bast instead of a PR bast.  This fix narrows the original special case to
what was intended.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

329fc4c3

22 4月, 2008 2 次提交

dlm: save master info after failed no-queue request · 761b9d3f

由 David Teigland 提交于 2月 21, 2008

When a NOQUEUE request fails, the rsb res_master field is unnecessarily
reset to -1, instead of leaving the valid master setting in place. We
want to save the looked-up master values while the rsb is on the "toss
list" so that another lookup can be avoided if the rsb is soon reused.
The fix is to simply leave res_master value alone.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

761b9d3f

dlm: make dlm_print_rsb() static · 170e19ab

由 Adrian Bunk 提交于 2月 13, 2008

dlm_print_rsb() can now become static.
Signed-off-by: NAdrian Bunk <bunk@kernel.org>
Signed-off-by: NDavid Teigland <teigland@redhat.com>

170e19ab

07 2月, 2008 1 次提交

dlm: eliminate astparam type casting · d292c0cc

由 David Teigland 提交于 2月 06, 2008

Put lkb_astparam in a union with a dlm_user_args pointer to
eliminate a lot of type casting.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

d292c0cc

06 2月, 2008 1 次提交

dlm: proper types for asts and basts · e5dae548

由 David Teigland 提交于 2月 06, 2008

Use proper types for ast and bast functions, and use
consistent type for ast param.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

e5dae548

04 2月, 2008 7 次提交

dlm: fix overflows when copying from ->m_extra to lvb · a9cc9159

由 Al Viro 提交于 1月 26, 2008

Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NDavid Teigland <teigland@redhat.com>

a9cc9159

dlm: make find_rsb() fail gracefully when namelen is too large · ef58bcca

由 Al Viro 提交于 1月 25, 2008

We *can* get there from receive_request() and dlm_recover_master_copy()
with namelen too large if incoming request is invalid; BUG() from
DLM_ASSERT() in allocate_rsb() is a bit excessive reaction to that
and in case of dlm_recover_master_copy() we would actually oops before
that while calculating hash of up to 64Kb worth of data - with data
actually being 64 _bytes_ in kmalloc()'ed struct.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NDavid Teigland <teigland@redhat.com>

ef58bcca

dlm: receive_rcom_lock_args() overflow check · a5dd0631

由 Al Viro 提交于 1月 25, 2008

Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NDavid Teigland <teigland@redhat.com>

a5dd0631

A
dlm: verify that places expecting rcom_lock have packet long enough · ae773d0b
由 Al Viro 提交于 1月 25, 2008
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NDavid Teigland <teigland@redhat.com>
```
ae773d0b

dlm: do not byteswap rcom_lock · 163a1859

由 Al Viro 提交于 1月 25, 2008

Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NDavid Teigland <teigland@redhat.com>

163a1859

dlm: dlm_process_incoming_buffer() fixes · eef7d739

由 Al Viro 提交于 1月 25, 2008

* check that length is large enough to cover the non-variable part of message or
  rcom resp. (after checking that it's large enough to cover the header, of
  course).

* kill more pointless casts
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NDavid Teigland <teigland@redhat.com>

eef7d739

dlm: use proper C for dlm/requestqueue stuff (and fix alignment bug) · 8b0d8e03

由 Al Viro 提交于 1月 25, 2008

a) don't cast the pointer to dlm_header *, we use it as dlm_message *
   anyway.
b) we copy the message into a queue element, then pass the pointer to
   copy to dlm_receive_message_saved(); declare it properly to make sure
   that we have the right alignment.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NDavid Teigland <teigland@redhat.com>

8b0d8e03

31 1月, 2008 9 次提交

dlm: keep cached master rsbs during recovery · 85f0379a

由 David Teigland 提交于 1月 16, 2008

To prevent the master of an rsb from changing rapidly, an unused rsb is kept
on the "toss list" for a period of time to be reused. The toss list was
being cleared completely for each recovery, which is unnecessary. Much of
the benefit of the toss list can be maintained if nodes keep rsb's in their
toss list that they are the master of. These rsb's need to be included
when the resource directory is rebuilt during recovery.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

85f0379a

dlm: change error message to debug · 594199eb

由 David Teigland 提交于 1月 16, 2008

The invalid lockspace messages are normal and can appear relatively
often.  They should be suppressed without debugging enabled.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

594199eb

dlm: limit dir lookup loop · 755b5eb8

由 David Teigland 提交于 1月 09, 2008

In a rare case we may need to repeat a local resource directory lookup
due to a race with removing the rsb and removing the resdir record.
We'll never need to do more than a single additional lookup, though,
so the infinite loop around the lookup can be removed. In addition
to being unnecessary, the infinite loop is dangerous since some other
unknown condition may appear causing the loop to never break.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

755b5eb8

dlm: reject normal unlock when lock is waiting for lookup · 42dc1601

由 David Teigland 提交于 1月 09, 2008

Non-forced unlocks should be rejected if the lock is waiting on the
rsb_lookup list for another lock to establish the master node.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

42dc1601

dlm: validate messages before processing · c54e04b0

由 David Teigland 提交于 1月 09, 2008

There was some hit and miss validation of messages that has now been
cleaned up and unified.  Before processing a message, the new
validate_message() function checks that the lkb is the appropriate type,
process-copy or master-copy, and that the message is from the correct
nodeid for the the given lkb.  Other checks and assertions on the
lkb type and nodeid have been removed.  The assertions were particularly
bad since they would panic the machine instead of just ignoring the bad
message.

Although other recent patches have made processing old message unlikely,
it still may be possible for an old message to be processed and caught
by these checks.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

c54e04b0

dlm: reject messages from non-members · 46b43eed

由 David Teigland 提交于 1月 08, 2008

Messages from nodes that are no longer members of the lockspace should be
ignored.  When nodes are removed from the lockspace, recovery can
sometimes complete quickly enough that messages arrive from a removed node
after recovery has completed.  When processed, these messages would often
cause an error message, and could in some cases change some state, causing
problems.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

46b43eed

dlm: another call to confirm_master in receive_request_reply · aec64e1b

由 David Teigland 提交于 1月 08, 2008

When a failed request (EBADR or ENOTBLK) is unlocked/canceled instead of
retried, there may be other lkb's waiting on the rsb_lookup list for it
to complete. A call to confirm_master() is needed to move on to the next
waiting lkb since the current one won't be retried.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

aec64e1b

dlm: recover locks waiting for overlap replies · 601342ce

由 David Teigland 提交于 1月 07, 2008

When recovery looks at locks waiting for replies, it fails to consider
locks that have already received a reply for their first remote operation,
but not received a reply for secondary, overlapping unlock/cancel.  The
appropriate stub reply needs to be called for these waiters.

Appears when we start doing recovery in the presence of a many overlapping
unlock/cancel ops.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

601342ce

dlm: clear ast_type when removing from astqueue · 8a358ca8

由 David Teigland 提交于 1月 07, 2008

The lkb_ast_type field indicates whether the lkb is on the astqueue list.
When clearing locks for a process, lkb's were being removed from the astqueue
list without clearing the field.  If release_lockspace then happened
immediately afterward, it could try to remove the lkb from the list a second
time.

Appears when process calls libdlm dlm_release_lockspace() which first
closes the ls dev triggering clear_proc_locks, and then removes the ls
(a write to control dev) causing release_lockspace().
Signed-off-by: NDavid Teigland <teigland@redhat.com>

8a358ca8

30 1月, 2008 3 次提交

dlm: use dlm prefix on alloc and free functions · 52bda2b5

由 David Teigland 提交于 11月 07, 2007

The dlm functions in memory.c should use the dlm_ prefix. Also, use
kzalloc/kfree directly for dlm_direntry's, removing the wrapper functions.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

52bda2b5

dlm: don't print common non-errors · 11b2498b

由 David Teigland 提交于 11月 07, 2007

Change log_error() to log_debug() for conditions that can occur in
large number in normal operation.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

11b2498b

dlm: proper prototypes · e028398d

由 Adrian Bunk 提交于 11月 03, 2007

This patch adds a proper prototype for some functions in
fs/dlm/dlm_internal.h
Signed-off-by: NAdrian Bunk <bunk@kernel.org>
Signed-off-by: NDavid Teigland <teigland@redhat.com>

e028398d

10 10月, 2007 2 次提交

[DLM] block dlm_recv in recovery transition · c36258b5

由 David Teigland 提交于 9月 27, 2007

Introduce a per-lockspace rwsem that's held in read mode by dlm_recv
threads while working in the dlm.  This allows dlm_recv activity to be
suspended when the lockspace transitions to, from and between recovery
cycles.

The specific bug prompting this change is one where an in-progress
recovery cycle is aborted by a new recovery cycle.  While dlm_recv was
processing a recovery message, the recovery cycle was aborted and
dlm_recoverd began cleaning up.  dlm_recv decremented recover_locks_count
on an rsb after dlm_recoverd had reset it to zero.  This is fixed by
suspending dlm_recv (taking write lock on the rwsem) before aborting the
current recovery.

The transitions to/from normal and recovery modes are simplified by using
this new ability to block dlm_recv.  The switch from normal to recovery
mode means dlm_recv goes from processing locking messages, to saving them
for later, and vice versa.  Races are avoided by blocking dlm_recv when
setting the flag that switches between modes.
Signed-off-by: NDavid Teigland <teigland@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

c36258b5

[DLM] don't overwrite castparam if it's NULL · b434eda6

由 Patrick Caulfield 提交于 10月 01, 2007

If the castaddr passed to the userland API is NULL then don't overwrite the
existing castparam. This allows a different thread to cancel a lock request and
the CANCEL AST gets delivered to the original thread.

bz#306391 (for RHEL4) refers.
Signed-Off-By: NPatrick Caulfield <pcaulfie@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

b434eda6

14 8月, 2007 1 次提交

[DLM] fix basts for granted PR waiting CW · 36509258

由 David Teigland 提交于 8月 07, 2007

Fix a long standing bug where a blocking callback would be missed
when there's a granted lock in PR mode and waiting locks in both
PR and CW modes (and the PR lock was added to the waiting queue
before the CW lock). The logic simply compared the numerical values
of the modes to determine if a blocking callback was required, but in
the one case of PR and CW, the lower valued CW mode blocks the higher
valued PR mode. We just need to add a special check for this PR/CW
case in the tests that decide when a blocking callback is needed.
Signed-off-by: NDavid Teigland <teigland@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

36509258

09 7月, 2007 9 次提交

[DLM] variable allocation · 44f487a5

由 Patrick Caulfield 提交于 6月 06, 2007

Add a new flag, DLM_LSFL_FS, to be used when a file system creates a lockspace.
This flag causes the dlm to use GFP_NOFS for allocations instead of GFP_KERNEL.
(This updated version of the patch uses gfp_t for ls_allocation.)
Signed-Off-By: NPatrick Caulfield <pcaulfie@redhat.com>
Signed-Off-By: NDavid Teigland <teigland@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

44f487a5

[DLM] canceling deadlocked lock · 8b4021fa

由 David Teigland 提交于 5月 29, 2007

Add a function that can be used through libdlm by a system daemon to cancel
another process's deadlocked lock.  A completion ast with EDEADLK is returned
to the process waiting for the lock.
Signed-off-by: NDavid Teigland <teigland@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

8b4021fa

[DLM] timeout fixes · 84d8cd69

由 David Teigland 提交于 5月 29, 2007

Various fixes related to the new timeout feature:
- add_timeout() missed setting TIMEWARN flag on lkb's when the
  TIMEOUT flag was already set
- clear_proc_locks should remove a dead process's locks from the
  timeout list
- the end-of-life calculation for user locks needs to consider that
  ETIMEDOUT is equivalent to -DLM_ECANCEL
- make initial default timewarn_cs config value visible in configfs
- change bit position of TIMEOUT_CANCEL flag so it's not copied to
  a remote master node
- set timestamp on remote lkb's so a lock dump will display the time
  they've been waiting
Signed-off-by: NDavid Teigland <teigland@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

84d8cd69

[DLM] Compile fix · b3cab7b9

由 Steven Whitehouse 提交于 5月 29, 2007

A one liner fix which got missed from the earlier patches.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>
Cc: Fabio Massimo Di Nitto <fabbione@ubuntu.com>
Cc: David Teigland <teigland@redhat.com>

b3cab7b9

[DLM] fix compile breakage · 639aca41

由 David Teigland 提交于 5月 18, 2007

In the rush to get the previous patch set sent, a compilation bug I fixed
shortly before sending somehow got clobbered, probably by a missed quilt
refresh or something.
Signed-off-by: NDavid Teigland <teigland@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

639aca41

[DLM] cancel in conversion deadlock [4/6] · c85d65e9

由 David Teigland 提交于 5月 18, 2007

When conversion deadlock is detected, cancel the conversion and return
EDEADLK to the application. This is a new default behavior where before
the dlm would allow the deadlock to exist indefinately.

The DLM_LKF_NODLCKWT flag can now be used in a conversion to prevent the
dlm from performing conversion deadlock detection/cancelation on it.
The DLM_LKF_CONVDEADLK flag can continue to be used as before to tell the
dlm to demote the granted mode of the lock being converted if it gets into
a conversion deadlock.
Signed-off-by: NDavid Teigland <teigland@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

c85d65e9

[DLM] dlm_device interface changes [3/6] · d7db923e

由 David Teigland 提交于 5月 18, 2007

Change the user/kernel device interface used by libdlm:
- Add ability for userspace to check the version of the interface.  libdlm
  can now adapt to different versions of the kernel interface.
- Increase the size of the flags passed in a lock request so all possible
  flags can be used from userspace.
- Add an opaque "xid" value for each lock.  This "transaction id" will be
  used later to associate locks with each other during deadlock detection.
- Add a "timeout" value for each lock.  This is used along with the
  DLM_LKF_TIMEOUT flag.

Also, remove a fragment of unused code in device_read().

This patch requires updating libdlm which is backward compatible with
older kernels.
Signed-off-by: NDavid Teigland <teigland@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

d7db923e

[DLM] add lock timeouts and warnings [2/6] · 3ae1acf9

由 David Teigland 提交于 5月 18, 2007

New features: lock timeouts and time warnings.  If the DLM_LKF_TIMEOUT
flag is set, then the request/conversion will be canceled after waiting
the specified number of centiseconds (specified per lock).  This feature
is only available for locks requested through libdlm (can be enabled for
kernel dlm users if there's a use for it.)

If the new DLM_LSFL_TIMEWARN flag is set when creating the lockspace, then
a warning message will be sent to userspace (using genetlink) after a
request/conversion has been waiting for a given number of centiseconds
(configurable per node).  The time warnings will be used in the future
to do deadlock detection in userspace.
Signed-off-by: NDavid Teigland <teigland@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

3ae1acf9

[DLM] block scand during recovery [1/6] · 85e86edf

由 David Teigland 提交于 5月 18, 2007

Don't let dlm_scand run during recovery since it may try to do a resource
directory removal while the directory nodes are changing.
Signed-off-by: NDavid Teigland <teigland@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

85e86edf