提交 · da8c66638ae684c99abcb30e89d2803402e7ca20 · openanolis / cloud-kernel

17 11月, 2012 1 次提交

dlm: fix lvb invalidation conditions · da8c6663

由 David Teigland 提交于 11月 15, 2012

When a node is removed that held a PW/EX lock, the
existing master node should invalidate the lvb on the
resource due to the purged lock.

Previously, the existing master node was invalidating
the lvb if it found only NL/CR locks on the resource
during recovery for the removed node.  This could lead
to cases where it invalidated the lvb and shouldn't
have, or cases where it should have invalidated and
didn't.

When recovery selects a *new* master node for a
resource, and that new master finds only NL/CR locks
on the resource after lock recovery, it should
invalidate the lvb.  This case was handled correctly
(but was incorrectly applied to the existing master
case also.)

When a process exits while holding a PW/EX lock,
the lvb on the resource should be invalidated.
This was not happening.

The lvb contents and VALNOTVALID flag should be
recovered before granting locks in recovery so that
the recovered lvb state is provided in the callback.
The lvb was being recovered after the lock was granted.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

da8c6663

09 8月, 2012 1 次提交

dlm: fix unlock balance warnings · 475f230c

由 David Teigland 提交于 8月 02, 2012

The in_recovery rw_semaphore has always been acquired and
released by different threads by design.  To work around
the "BUG: bad unlock balance detected!" messages, adjust
things so the dlm_recoverd thread always does both down_write
and up_write.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

475f230c

17 7月, 2012 3 次提交

dlm: fix race between remove and lookup · 05c32f47

由 David Teigland 提交于 6月 14, 2012

It was possible for a remove message on an old
rsb to be sent after a lookup message on a new
rsb, where the rsbs were for the same resource
name.  This could lead to a missing directory
entry for the new rsb.

It is fixed by keeping a copy of the resource
name being removed until after the remove has
been sent.  A lookup checks if this in-progress
remove matches the name it is looking up.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

05c32f47

dlm: use idr instead of list for recovered rsbs · 1d7c484e

由 David Teigland 提交于 5月 15, 2012

When a large number of resources are being recovered,
a linear search of the recover_list takes a long time.
Use an idr in place of a list.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

1d7c484e

dlm: use rsbtbl as resource directory · c04fecb4

由 David Teigland 提交于 5月 10, 2012

Remove the dir hash table (dirtbl), and use
the rsb hash table (rsbtbl) as the resource
directory.  It has always been an unnecessary
duplication of information.

This improves efficiency by using a single rsbtbl
lookup in many cases where both rsbtbl and dirtbl
lookups were needed previously.

This eliminates the need to handle cases of rsbtbl
and dirtbl being out of sync.

In many cases there will be memory savings because
the dir hash table no longer exists.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

c04fecb4

03 5月, 2012 1 次提交

dlm: fixes for nodir mode · 4875647a

由 David Teigland 提交于 4月 26, 2012

The "nodir" mode (statically assign master nodes instead
of using the resource directory) has always been highly
experimental, and never seriously used.  This commit
fixes a number of problems, making nodir much more usable.

- Major change to recovery: recover all locks and restart
  all in-progress operations after recovery.  In some
  cases it's not possible to know which in-progess locks
  to recover, so recover all.  (Most require recovery
  in nodir mode anyway since rehashing changes most
  master nodes.)

- Change the way nodir mode is enabled, from a command
  line mount arg passed through gfs2, into a sysfs
  file managed by dlm_controld, consistent with the
  other config settings.

- Allow recovering MSTCPY locks on an rsb that has not
  yet been turned into a master copy.

- Ignore RCOM_LOCK and RCOM_LOCK_REPLY recovery messages
  from a previous, aborted recovery cycle.  Base this
  on the local recovery status not being in the state
  where any nodes should be sending LOCK messages for the
  current recovery cycle.

- Hold rsb lock around dlm_purge_mstcpy_locks() because it
  may run concurrently with dlm_recover_master_copy().

- Maintain highbast on process-copy lkb's (in addition to
  the master as is usual), because the lkb can switch
  back and forth between being a master and being a
  process copy as the master node changes in recovery.

- When recovering MSTCPY locks, flag rsb's that have
  non-empty convert or waiting queues for granting
  at the end of recovery.  (Rename flag from LOCKS_PURGED
  to RECOVER_GRANT and similar for the recovery function,
  because it's not only resources with purged locks
  that need grant a grant attempt.)

- Replace a couple of unnecessary assertion panics with
  error messages.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

4875647a

27 4月, 2012 1 次提交

dlm: limit rcom debug messages · d6e24788

由 David Teigland 提交于 4月 23, 2012

Unify the checking for both types of ignored
rcom messages, and replace the two log_debug
statements with a single, rate limited debug
message.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

d6e24788

04 1月, 2012 2 次提交

dlm: add recovery callbacks · 60f98d18

由 David Teigland 提交于 11月 02, 2011

These new callbacks notify the dlm user about lock recovery.
GFS2, and possibly others, need to be aware of when the dlm
will be doing lock recovery for a failed lockspace member.

In the past, this coordination has been done between dlm and
file system daemons in userspace, which then direct their
kernel counterparts.  These callbacks allow the same
coordination directly, and more simply.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

60f98d18

dlm: add node slots and generation · 757a4271

由 David Teigland 提交于 10月 20, 2011

Slot numbers are assigned to nodes when they join the lockspace.
The slot number chosen is the minimum unused value starting at 1.
Once a node is assigned a slot, that slot number will not change
while the node remains a lockspace member.  If the node leaves
and rejoins it can be assigned a new slot number.

A new generation number is also added to a lockspace.  It is
set and incremented during each recovery along with the slot
collection/assignment.

The slot numbers will be passed to gfs2 which will use them as
journal id's.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

757a4271

19 11月, 2011 1 次提交

dlm: convert rsb list to rb_tree · 9beb3bf5

由 Bob Peterson 提交于 10月 26, 2011

Change the linked lists to rb_tree's in the rsb
hash table to speed up searches.  Slow rsb searches
were having a large impact on gfs2 performance due
to the large number of dlm locks gfs2 uses.
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NDavid Teigland <teigland@redhat.com>

9beb3bf5

16 7月, 2011 1 次提交

dlm: use workqueue for callbacks · 23e8e1aa

由 David Teigland 提交于 4月 05, 2011

Instead of creating our own kthread (dlm_astd) to deliver
callbacks for all lockspaces, use a per-lockspace workqueue
to deliver the callbacks.  This eliminates complications and
slowdowns from many lockspaces sharing the same thread.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

23e8e1aa

13 7月, 2011 1 次提交

dlm: improve rsb searches · 3881ac04

由 David Teigland 提交于 7月 07, 2011

By pre-allocating rsb structs before searching the hash
table, they can be inserted immediately.  This avoids
always having to repeat the search when adding the struct
to hash list.

This also adds space to the rsb struct for a max resource
name, so an rsb allocation can be used by any request.
The constant size also allows us to finally use a slab
for the rsb structs.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

3881ac04

11 7月, 2011 1 次提交

dlm: keep lkbs in idr · 3d6aa675

由 David Teigland 提交于 7月 06, 2011

This is simpler and quicker than the hash table, and
avoids needing to search the hash list for every new
lkid to check if it's used.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

3d6aa675

05 4月, 2011 1 次提交

dlm: remove shared message stub for recovery · 2a7ce0ed

由 David Teigland 提交于 4月 04, 2011

kmalloc a stub message struct during recovery instead of sharing the
struct in the lockspace.  This leaves the lockspace stub_ms only for
faking downconvert replies, where it is never modified and sharing
is not a problem.

Also improve the debug messages in the same recovery function.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

2a7ce0ed

02 4月, 2011 1 次提交

dlm: delayed reply message warning · c6ff669b

由 David Teigland 提交于 3月 28, 2011

Add an option (disabled by default) to print a warning message
when a lock has been waiting a configurable amount of time for
a reply message from another node.  This is mainly for debugging.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

c6ff669b

11 3月, 2011 1 次提交

dlm: record full callback state · 8304d6f2

由 David Teigland 提交于 2月 21, 2011

Change how callbacks are recorded for locks. Previously, information
about multiple callbacks was combined into a couple of variables that
indicated what the end result should be. In some situations, we
could not tell from this combined state what the exact sequence of
callbacks were, and would end up either delivering the callbacks in
the wrong order, or suppress redundant callbacks incorrectly. This
new approach records all the data for each callback, leaving no
uncertainty about what needs to be delivered.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

8304d6f2

25 2月, 2010 1 次提交

dlm: fix ordering of bast and cast · 7fe2b319

由 David Teigland 提交于 2月 24, 2010

When both blocking and completion callbacks are queued for lock,
the dlm would always deliver the completion callback (cast) first.
In some cases the blocking callback (bast) is queued before the
cast, though, and should be delivered first.  This patch keeps
track of the order in which they were queued and delivers them
in that order.

This patch also keeps track of the granted mode in the last cast
and eliminates the following bast if the bast mode is compatible
with the preceding cast mode.  This happens when a remotely mastered
lock is demoted, e.g. EX->NL, in which case the local node queues
a cast immediately after sending the demote message.  In this way
a cast can be queued for a mode, e.g. NL, that makes an in-transit
bast extraneous.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

7fe2b319

01 12月, 2009 1 次提交

dlm: always use GFP_NOFS · 573c24c4

由 David Teigland 提交于 11月 30, 2009

Replace all GFP_KERNEL and ls_allocation with GFP_NOFS.
ls_allocation would be GFP_KERNEL for userland lockspaces
and GFP_NOFS for file system lockspaces.

It was discovered that any lockspaces on the system can
affect all others by triggering memory reclaim in the
file system which could in turn call back into the dlm
to acquire locks, deadlocking dlm threads that were
shared by all lockspaces, like dlm_recv.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

573c24c4

29 1月, 2009 1 次提交

dlm: Change rwlock which is only used in write mode to a spinlock · 305a47b1

由 Steven Whitehouse 提交于 1月 16, 2009

The ls_dirtbl[].lock was an rwlock, but since it was only used in write
mode a spinlock will suffice.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>
Signed-off-by: NDavid Teigland <teigland@redhat.com>

305a47b1

09 1月, 2009 1 次提交

dlm: change rsbtbl rwlock to spinlock · c7be761a

由 David Teigland 提交于 1月 07, 2009

The rwlock is almost always used in write mode, so there's no reason
to not use a spinlock instead.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

c7be761a

24 12月, 2008 3 次提交

dlm: add new debugfs entry · d022509d

由 David Teigland 提交于 12月 16, 2008

The new debugfs entry dumps all rsb and lkb structures, and includes
a lot more information than has been available before.  This includes
the new timestamps added by a previous patch for debugging callback
issues.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

d022509d

dlm: add time stamp of blocking callback · e3a84ad4

由 David Teigland 提交于 12月 09, 2008

Record the time the latest blocking callback was queued for
a lock.  This will be used for debugging in combination with
lock queue timestamp changes in the previous patch.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

e3a84ad4

dlm: change lock time stamping · eeda418d

由 David Teigland 提交于 12月 09, 2008

Use ktime instead of jiffies for timestamping lkb's.  Also stamp the
time on every lkb whenever it's added to a resource queue, instead of
just stamping locks subject to timeouts.  This will allow us to use
timestamps more widely for debugging all locks.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

eeda418d

29 8月, 2008 2 次提交

dlm: fix locking of lockspace list in dlm_scand · c1dcf65f

由 David Teigland 提交于 8月 18, 2008

The dlm_scand thread needs to lock the list of lockspaces
when going through it.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

c1dcf65f

dlm: allow multiple lockspace creates · 0f8e0d9a

由 David Teigland 提交于 8月 06, 2008

Add a count for lockspace create and release so that create can
be called multiple times to use the lockspace from different places.
Also add the new flag DLM_LSFL_NEWEXCL to create a lockspace with
the previous behavior of returning -EEXIST if the lockspace already
exists.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

0f8e0d9a

22 4月, 2008 3 次提交

dlm: common max length definitions · 3d564fa3

由 David Teigland 提交于 4月 14, 2008

Add central definitions for max lockspace name length and max resource
name length.  The lack of central definitions has resulted in scattered
private definitions which we can now clean up, including an unused one
in dlm_device.h.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

3d564fa3

dlm: move plock code from gfs2 · 2402211a

由 David Teigland 提交于 3月 14, 2008

Move the code that handles cluster posix locks from gfs2 into the dlm
so that it can be used by both gfs2 and ocfs2.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

2402211a

dlm: recover nodes that are removed and re-added · d44e0fc7

由 David Teigland 提交于 3月 18, 2008

If a node is removed from a lockspace, and then added back before the
dlm is notified of the removal, the dlm will not detect the removal
and won't clear the old state from the node. This is fixed by using a
list of added nodes so the membership recovery can detect when a newly
added node is already in the member list.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

d44e0fc7

19 4月, 2008 1 次提交

fs: Remove unnecessary inclusions of asm/semaphore.h · cb688371

由 Matthew Wilcox 提交于 2月 26, 2008

None of these files use any of the functionality promised by
asm/semaphore.h.
Signed-off-by: NMatthew Wilcox <willy@linux.intel.com>

cb688371

07 2月, 2008 1 次提交

dlm: eliminate astparam type casting · d292c0cc

由 David Teigland 提交于 2月 06, 2008

Put lkb_astparam in a union with a dlm_user_args pointer to
eliminate a lot of type casting.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

d292c0cc

06 2月, 2008 1 次提交

dlm: proper types for asts and basts · e5dae548

由 David Teigland 提交于 2月 06, 2008

Use proper types for ast and bast functions, and use
consistent type for ast param.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

e5dae548

04 2月, 2008 4 次提交

dlm: use proper type for ->ls_recover_buf · 4007685c

由 Al Viro 提交于 1月 25, 2008

Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NDavid Teigland <teigland@redhat.com>

4007685c

dlm: do not byteswap rcom_config · 93ff2971

由 Al Viro 提交于 1月 25, 2008

Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NDavid Teigland <teigland@redhat.com>

93ff2971

dlm: do not byteswap rcom_lock · 163a1859

由 Al Viro 提交于 1月 25, 2008

Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NDavid Teigland <teigland@redhat.com>

163a1859

dlm: dlm_process_incoming_buffer() fixes · eef7d739

由 Al Viro 提交于 1月 25, 2008

* check that length is large enough to cover the non-variable part of message or
  rcom resp. (after checking that it's large enough to cover the header, of
  course).

* kill more pointless casts
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NDavid Teigland <teigland@redhat.com>

eef7d739

30 1月, 2008 1 次提交

dlm: proper prototypes · e028398d

由 Adrian Bunk 提交于 11月 03, 2007

This patch adds a proper prototype for some functions in
fs/dlm/dlm_internal.h
Signed-off-by: NAdrian Bunk <bunk@kernel.org>
Signed-off-by: NDavid Teigland <teigland@redhat.com>

e028398d

10 10月, 2007 1 次提交

[DLM] block dlm_recv in recovery transition · c36258b5

由 David Teigland 提交于 9月 27, 2007

Introduce a per-lockspace rwsem that's held in read mode by dlm_recv
threads while working in the dlm.  This allows dlm_recv activity to be
suspended when the lockspace transitions to, from and between recovery
cycles.

The specific bug prompting this change is one where an in-progress
recovery cycle is aborted by a new recovery cycle.  While dlm_recv was
processing a recovery message, the recovery cycle was aborted and
dlm_recoverd began cleaning up.  dlm_recv decremented recover_locks_count
on an rsb after dlm_recoverd had reset it to zero.  This is fixed by
suspending dlm_recv (taking write lock on the rwsem) before aborting the
current recovery.

The transitions to/from normal and recovery modes are simplified by using
this new ability to block dlm_recv.  The switch from normal to recovery
mode means dlm_recv goes from processing locking messages, to saving them
for later, and vice versa.  Races are avoided by blocking dlm_recv when
setting the flag that switches between modes.
Signed-off-by: NDavid Teigland <teigland@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

c36258b5

09 7月, 2007 3 次提交

[DLM] dump more lock values · ac90a255

由 David Teigland 提交于 7月 06, 2007

Add two more output fields (lkb_flags and rsb nodeid) to the new debugfs
file that dumps one lock per line. Also, dump all locks instead of just
mastered locks. Accordingly, use a suffix of _locks instead of _master.
Signed-off-by: NDavid Teigland <teigland@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

ac90a255

[DLM] variable allocation · 44f487a5

由 Patrick Caulfield 提交于 6月 06, 2007

Add a new flag, DLM_LSFL_FS, to be used when a file system creates a lockspace.
This flag causes the dlm to use GFP_NOFS for allocations instead of GFP_KERNEL.
(This updated version of the patch uses gfp_t for ls_allocation.)
Signed-Off-By: NPatrick Caulfield <pcaulfie@redhat.com>
Signed-Off-By: NDavid Teigland <teigland@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

44f487a5

[DLM] dumping master locks · 9dd592d7

由 David Teigland 提交于 5月 29, 2007

Add a new debugfs file that dumps a compact list of mastered locks.
This will be used by a userland daemon to collect state for deadlock
detection.

Also, for the existing function that prints all lock state, lock the rsb
before going through the lock lists since they can be changing in the
course of normal dlm activity.
Signed-off-by: NDavid Teigland <teigland@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

9dd592d7

openanolis / cloud-kernel 大约 1 年 前同步成功

openanolis / cloud-kernel
大约 1 年前同步成功