1. 17 Nov 2012, 1 commit
    • dlm: fix lvb invalidation conditions · da8c6663
      David Teigland authored
      When a node is removed that held a PW/EX lock, the
      existing master node should invalidate the lvb on the
      resource due to the purged lock.
      
      Previously, the existing master node was invalidating
      the lvb if it found only NL/CR locks on the resource
      during recovery for the removed node.  This could lead
      to cases where it invalidated the lvb and shouldn't
      have, or cases where it should have invalidated and
      didn't.
      
      When recovery selects a *new* master node for a
      resource, and that new master finds only NL/CR locks
      on the resource after lock recovery, it should
      invalidate the lvb.  This case was handled correctly
      (but was incorrectly applied to the existing master
      case also.)
      
      When a process exits while holding a PW/EX lock,
      the lvb on the resource should be invalidated.
      This was not happening.
      
      The lvb contents and VALNOTVALID flag should be
      recovered before granting locks in recovery so that
      the recovered lvb state is provided in the callback.
      The lvb was being recovered after the lock was granted.
      Signed-off-by: David Teigland <teigland@redhat.com>
  2. 02 Nov 2012, 2 commits
  3. 11 Sep 2012, 1 commit
  4. 10 Sep 2012, 1 commit
  5. 13 Aug 2012, 1 commit
  6. 10 Aug 2012, 2 commits
  7. 09 Aug 2012, 3 commits
    • dlm: fix unlock balance warnings · 475f230c
      David Teigland authored
      The in_recovery rw_semaphore has always been acquired and
      released by different threads by design.  To work around
      the "BUG: bad unlock balance detected!" messages, adjust
      things so the dlm_recoverd thread always does both down_write
      and up_write.
      Signed-off-by: David Teigland <teigland@redhat.com>
    • dlm: fix uninitialized spinlock · 6ad22916
      David Teigland authored
      Use DEFINE_SPINLOCK for global dlm_cb_seq_spin.
      Signed-off-by: David Teigland <teigland@redhat.com>
    • dlm: fix deadlock between dlm_send and dlm_controld · 36b71a8b
      David Teigland authored
      A deadlock sometimes occurs between dlm_controld closing
      a lowcomms connection through configfs and dlm_send looking
      up the address for a new connection in configfs.
      
      dlm_controld does a configfs rmdir which calls
      dlm_lowcomms_close which waits for dlm_send to
      cancel work on the workqueues.
      
      The dlm_send workqueue thread has called
      tcp_connect_to_sock which calls dlm_nodeid_to_addr
      which does a configfs lookup and blocks on a lock
      held by dlm_controld in the rmdir path.
      
      The solution here is to save the node addresses within
      the lowcomms code so that the lowcomms workqueue does
      not need to step through configfs to get a node address.
      
      dlm_controld:
      wait_for_completion+0x1d/0x20
      __cancel_work_timer+0x1b3/0x1e0
      cancel_work_sync+0x10/0x20
      dlm_lowcomms_close+0x4c/0xb0 [dlm]
      drop_comm+0x22/0x60 [dlm]
      client_drop_item+0x26/0x50 [configfs]
      configfs_rmdir+0x180/0x230 [configfs]
      vfs_rmdir+0xbd/0xf0
      do_rmdir+0x103/0x120
      sys_rmdir+0x16/0x20
      
      dlm_send:
      mutex_lock+0x2b/0x50
      get_comm+0x34/0x140 [dlm]
      dlm_nodeid_to_addr+0x18/0xd0 [dlm]
      tcp_connect_to_sock+0xf4/0x2d0 [dlm]
      process_send_sockets+0x1d2/0x260 [dlm]
      worker_thread+0x170/0x2a0
      Signed-off-by: David Teigland <teigland@redhat.com>
  8. 17 Jul 2012, 6 commits
    • dlm: fix missing dir remove · 96006ea6
      David Teigland authored
      I don't know exactly how, but in some cases, a dir
      record is not removed, or a new one is created when
      it shouldn't be.  The result is that the dir node
      lookup returns a master node where the rsb does not
      exist.  In this case, the master node will repeatedly
      return -EBADR for requests, and the lock requests will
      be stuck.
      
      Until all possible ways for this to happen can be
      eliminated, a simple and effective way to recover from
      this situation is for the supposed master node to send
      a standard remove message to the dir node when it
      receives a request for a resource it has no rsb for.
      Signed-off-by: David Teigland <teigland@redhat.com>
    • dlm: fix conversion deadlock from recovery · c503a621
      David Teigland authored
      The process of rebuilding locks on a new master during
      recovery could re-order the locks on the convert queue,
      creating an "in place" conversion deadlock that would
      not be resolved.  Fix this by not considering queue
      order when granting conversions after recovery.
      Signed-off-by: David Teigland <teigland@redhat.com>
    • dlm: use wait_event_timeout · 6d768177
      David Teigland authored
      Use wait_event_timeout to avoid using a timer
      directly.
      Signed-off-by: David Teigland <teigland@redhat.com>
    • dlm: fix race between remove and lookup · 05c32f47
      David Teigland authored
      It was possible for a remove message on an old
      rsb to be sent after a lookup message on a new
      rsb, where the rsbs were for the same resource
      name.  This could lead to a missing directory
      entry for the new rsb.
      
      It is fixed by keeping a copy of the resource
      name being removed until after the remove has
      been sent.  A lookup checks if this in-progress
      remove matches the name it is looking up.
      Signed-off-by: David Teigland <teigland@redhat.com>
    • dlm: use idr instead of list for recovered rsbs · 1d7c484e
      David Teigland authored
      When a large number of resources are being recovered,
      a linear search of the recover_list takes a long time.
      Use an idr in place of a list.
      Signed-off-by: David Teigland <teigland@redhat.com>
    • dlm: use rsbtbl as resource directory · c04fecb4
      David Teigland authored
      Remove the dir hash table (dirtbl), and use
      the rsb hash table (rsbtbl) as the resource
      directory.  It has always been an unnecessary
      duplication of information.
      
      This improves efficiency by using a single rsbtbl
      lookup in many cases where both rsbtbl and dirtbl
      lookups were needed previously.
      
      This eliminates the need to handle cases of rsbtbl
      and dirtbl being out of sync.
      
      In many cases there will be memory savings because
      the dir hash table no longer exists.
      Signed-off-by: David Teigland <teigland@redhat.com>
  9. 15 May 2012, 1 commit
  10. 03 May 2012, 1 commit
    • dlm: fixes for nodir mode · 4875647a
      David Teigland authored
      The "nodir" mode (statically assign master nodes instead
      of using the resource directory) has always been highly
      experimental, and never seriously used.  This commit
      fixes a number of problems, making nodir much more usable.
      
      - Major change to recovery: recover all locks and restart
        all in-progress operations after recovery.  In some
        cases it's not possible to know which in-progress
        to recover, so recover all.  (Most require recovery
        in nodir mode anyway since rehashing changes most
        master nodes.)
      
      - Change the way nodir mode is enabled, from a command
        line mount arg passed through gfs2, into a sysfs
        file managed by dlm_controld, consistent with the
        other config settings.
      
      - Allow recovering MSTCPY locks on an rsb that has not
        yet been turned into a master copy.
      
      - Ignore RCOM_LOCK and RCOM_LOCK_REPLY recovery messages
        from a previous, aborted recovery cycle.  Base this
        on the local recovery status not being in the state
        where any nodes should be sending LOCK messages for the
        current recovery cycle.
      
      - Hold rsb lock around dlm_purge_mstcpy_locks() because it
        may run concurrently with dlm_recover_master_copy().
      
      - Maintain highbast on process-copy lkb's (in addition to
        the master as is usual), because the lkb can switch
        back and forth between being a master and being a
        process copy as the master node changes in recovery.
      
      - When recovering MSTCPY locks, flag rsb's that have
        non-empty convert or waiting queues for granting
        at the end of recovery.  (Rename flag from LOCKS_PURGED
        to RECOVER_GRANT and similar for the recovery function,
        because it's not only resources with purged locks
        that need a grant attempt.)
      
      - Replace a couple of unnecessary assertion panics with
        error messages.
      Signed-off-by: David Teigland <teigland@redhat.com>
  11. 27 Apr 2012, 5 commits
  12. 24 Apr 2012, 1 commit
  13. 06 Apr 2012, 1 commit
    • simple_open: automatically convert to simple_open() · 234e3405
      Stephen Boyd authored
      Many users of debugfs copy the implementation of default_open() when
      they want to support a custom read/write function op.  This leads to a
      proliferation of the default_open() implementation across the entire
      tree.
      
      Now that the common implementation has been consolidated into libfs we
      can replace all the users of this function with simple_open().
      
      This replacement was done with the following semantic patch:
      
      <smpl>
      @ open @
      identifier open_f != simple_open;
      identifier i, f;
      @@
      -int open_f(struct inode *i, struct file *f)
      -{
      (
      -if (i->i_private)
      -f->private_data = i->i_private;
      |
      -f->private_data = i->i_private;
      )
      -return 0;
      -}
      
      @ has_open depends on open @
      identifier fops;
      identifier open.open_f;
      @@
      struct file_operations fops = {
      ...
      -.open = open_f,
      +.open = simple_open,
      ...
      };
      </smpl>
      
      [akpm@linux-foundation.org: checkpatch fixes]
      Signed-off-by: Stephen Boyd <sboyd@codeaurora.org>
      Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
      Cc: Al Viro <viro@zeniv.linux.org.uk>
      Cc: Julia Lawall <Julia.Lawall@lip6.fr>
      Acked-by: Ingo Molnar <mingo@elte.hu>
      Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
      Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
  14. 21 Mar 2012, 1 commit
  15. 09 Mar 2012, 2 commits
  16. 04 Jan 2012, 3 commits
    • dlm: add recovery callbacks · 60f98d18
      David Teigland authored
      These new callbacks notify the dlm user about lock recovery.
      GFS2, and possibly others, need to be aware of when the dlm
      will be doing lock recovery for a failed lockspace member.
      
      In the past, this coordination has been done between dlm and
      file system daemons in userspace, which then direct their
      kernel counterparts.  These callbacks allow the same
      coordination directly, and more simply.
      Signed-off-by: David Teigland <teigland@redhat.com>
    • dlm: add node slots and generation · 757a4271
      David Teigland authored
      Slot numbers are assigned to nodes when they join the lockspace.
      The slot number chosen is the minimum unused value starting at 1.
      Once a node is assigned a slot, that slot number will not change
      while the node remains a lockspace member.  If the node leaves
      and rejoins it can be assigned a new slot number.
      
      A new generation number is also added to a lockspace.  It is
      set and incremented during each recovery along with the slot
      collection/assignment.
      
      The slot numbers will be passed to gfs2 which will use them as
      journal id's.
      Signed-off-by: David Teigland <teigland@redhat.com>
    • dlm: move recovery barrier calls · f95a34c6
      David Teigland authored
      Put all the calls to recovery barriers in the same function
      to clarify where each happens.  This should not change any behavior.
      Also modify some recovery debug lines to make them consistent.
      Signed-off-by: David Teigland <teigland@redhat.com>
  17. 23 Nov 2011, 1 commit
  18. 19 Nov 2011, 1 commit
  19. 21 Jul 2011, 1 commit
    • locks: rename lock-manager ops · 8fb47a4f
      J. Bruce Fields authored
      Both the filesystem and the lock manager can associate operations with a
      lock.  Confusingly, one of them (fl_release_private) actually has the
      same name in both operation structures.
      
      It would save some confusion to give the lock-manager ops different
      names.
      Signed-off-by: J. Bruce Fields <bfields@redhat.com>
  20. 20 Jul 2011, 1 commit
  21. 16 Jul 2011, 1 commit
  22. 15 Jul 2011, 1 commit
  23. 13 Jul 2011, 1 commit
    • dlm: improve rsb searches · 3881ac04
      David Teigland authored
      By pre-allocating rsb structs before searching the hash
      table, they can be inserted immediately.  This avoids
      always having to repeat the search when adding the struct
      to the hash list.
      
      This also adds space to the rsb struct for a max resource
      name, so an rsb allocation can be used by any request.
      The constant size also allows us to finally use a slab
      for the rsb structs.
      Signed-off-by: David Teigland <teigland@redhat.com>
  24. 11 Jul 2011, 1 commit