提交 · 1bdad606338debc6384b2844f1b53cc436b3ac90 · openanolis / cloud-kernel

27 6月, 2008 8 次提交

[GFS2] Remove remote lock dropping code · 1bdad606

由 Steven Whitehouse 提交于 6月 03, 2008

There are several reasons why this is undesirable:

 1. It never happens during normal operation anyway
 2. If it does happen it causes performance to be very, very poor
 3. It isn't likely to solve the original problem (memory shortage
    on remote DLM node) it was supposed to solve
 4. It uses a bunch of arbitrary constants which are unlikely to be
    correct for any particular situation and for which the tuning seems
    to be a black art.
 5. In an N node cluster, only 1/N of the dropped locked will actually
    contribute to solving the problem on average.

So all in all we are better off without it. This also makes merging
the lock_dlm module into GFS2 a bit easier.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

1bdad606

[GFS2] kernel panic mounting volume · 9171f5a9

由 Bob Peterson 提交于 6月 09, 2008

This patch fixes Red Hat bugzilla bug 450156.

This started with a not-too-improbable mount failure because the
locking protocol was never set back to its proper "lock_dlm" after the
system was rebooted in the middle of a gfs2_fsck.  That left a
(purposely) invalid locking protocol in the superblock, which caused an
error when the file system was mounted the next time.

When there's an error mounting, vfs calls DQUOT_OFF, which calls
vfs_quota_off which calls gfs2_sync_fs.  Next, gfs2_sync_fs calls
gfs2_log_flush passing s_fs_info.  But due to the error, s_fs_info
had been previously set to NULL, and so we have the kernel oops.

My solution in this patch is to test for the NULL value before passing
it.  I tested this patch and it fixes the problem.
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

9171f5a9

[GFS2] Revise readpage locking · 01b7c7ae

由 Steven Whitehouse 提交于 6月 02, 2008

The previous attempt to fix the locking in readpage failed due
to the use of a "try lock" which resulted in occasional high
cpu usage during testing (due to repeated tries) and also it
did not resolve all the ordering problems wrt the transaction
lock (although it did solve all the inode lock ordering problems).

This patch avoids the problem by unlocking the page and getting the
locks in the correct order. This means that we have to retest the
page to ensure that it hasn't changed when we relock the page.

This now passes the tests which were previously failing.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

01b7c7ae

[GFS2] Fix ordering of args for list_add · 80274737

由 Steven Whitehouse 提交于 6月 02, 2008

The patch to remove lock_nolock managed to get the arguments
of this list_add backwards. This fixes it.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

80274737

[GFS2] trivial sparse lock annotations · 2d81afb8

由 Harvey Harrison 提交于 5月 29, 2008

Annotate the &sdp->sd_log_lock.
Signed-off-by: NHarvey Harrison <harvey.harrison@gmail.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

2d81afb8

[GFS2] No lock_nolock · 048bca22

由 Steven Whitehouse 提交于 5月 23, 2008

This patch merges the lock_nolock module into GFS2 itself. As well as removing
some of the overhead of the module, it also means that its now impossible to
build GFS2 without a lock module (which would be a pointless thing to do
anyway).

We also plan to merge lock_dlm into GFS2 in the future, but that is a more
tricky task, and will therefore be a separate patch.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>
Cc: David Teigland <teigland@redhat.com>

048bca22

[GFS2] Fix ordering bug in lock_dlm · f3c9d38a

由 Steven Whitehouse 提交于 5月 21, 2008

This looks like a lot of change, but in fact its not. Mostly its
things moving from one file to another. The change is just that
instead of queuing lock completions and callbacks from the DLM
we now pass them directly to GFS2.

This gives us a net loss of two list heads per glock (a fair
saving in memory) plus a reduction in the latency of delivering
the messages to GFS2, plus we now have one thread fewer as well.
There was a bug where callbacks and completions could be delivered
in the wrong order due to this unnecessary queuing which is fixed
by this patch.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>
Cc: Bob Peterson <rpeterso@redhat.com>

f3c9d38a

[GFS2] Clean up the glock core · 6802e340

由 Steven Whitehouse 提交于 5月 21, 2008

This patch implements a number of cleanups to the core of the
GFS2 glock code. As a result a lot of code is removed. It looks
like a really big change, but actually a large part of this patch
is either removing or moving existing code.

There are some new bits too though, such as the new run_queue()
function which is considerably streamlined. Highlights of this
patch include:

 o Fixes a cluster coherency bug during SH -> EX lock conversions
 o Removes the "glmutex" code in favour of a single bit lock
 o Removes the ->go_xmote_bh() for inodes since it was duplicating
   ->go_lock()
 o We now only use the ->lm_lock() function for both locks and
   unlocks (i.e. unlock is a lock with target mode LM_ST_UNLOCKED)
 o The fast path is considerably shortly, giving performance gains
   especially with lock_nolock
 o The glock_workqueue is now used for all the callbacks from the DLM
   which allows us to simplify the lock_dlm module (see following patch)
 o The way is now open to make further changes such as eliminating the two
   threads (gfs2_glockd and gfs2_scand) in favour of a more efficient
   scheme.

This patch has undergone extensive testing with various test suites
so it should be pretty stable by now.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>
Cc: Bob Peterson <rpeterso@redhat.com>

6802e340

25 6月, 2008 1 次提交

[GFS2] fix gfs2 block allocation (cleaned up) · 5af4e7a0

由 Benjamin Marzinski 提交于 6月 24, 2008

This patch fixes bz 450641.

This patch changes the computation for zero_metapath_length(), which it
renames to metapath_branch_start(). When you are extending the metadata
tree, The indirect blocks that point to the new data block must either
diverge from the existing tree either at the inode, or at the first
indirect block. They can diverge at the first indirect block because the
inode has room for 483 pointers while the indirect blocks have room for
509 pointers, so when the tree is grown, there is some free space in the
first indirect block. What metapath_branch_start() now computes is the
height where the first indirect block for the new data block is located.
It can either be 1 (if the indirect block diverges from the inode) or 2
(if it diverges from the first indirect block).
Signed-off-by: NBenjamin Marzinski <bmarzins@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

5af4e7a0

24 6月, 2008 1 次提交

[GFS2] BUG: unable to handle kernel paging request at ffff81002690e000 · 17c15da0

由 Bob Peterson 提交于 6月 18, 2008

This patch fixes bugzilla bug bz448866: gfs2: BUG: unable to
handle kernel paging request at ffff81002690e000.
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

17c15da0

12 5月, 2008 3 次提交

[GFS2] Prefer strlcpy() over snprintf() · 00377d8e

由 Jean Delvare 提交于 5月 09, 2008

strlcpy is faster than snprintf when you don't use the returned value.
Signed-off-by: NJean Delvare <khali@linux-fr.org>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

00377d8e

[GFS2] Fix cast from unsigned int to s64 · ad99f777

由 Andrew Price 提交于 5月 01, 2008

This fixes bz 444829 where allocating a new block caused gfs2 file systems to
report 0 bytes used in df. It was caused by a broken cast from an unsigned int
in gfs2_block_alloc() to a negative s64 in gfs2_statfs_change(). This patch
casts the unsigned int to an s64 before the unary minus is applied.
Signed-off-by: NAndrew Price <andy@andrewprice.me.uk>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

ad99f777

[GFS2] filesystem consistency error from do_strip · 091806ed

由 Bob Peterson 提交于 4月 29, 2008

This patch fixes a GFS2 filesystem consistency error reported from
function do_strip.  The problem was caused by a timing window
that allowed two vfs inodes to be created in memory that point
to the same file.  The problem is fixed by making the vfs's
iget_test, iget_set mechanism check and set a new bit in the
in-core gfs2_inode structure while the vfs inode spin_lock is held.
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

091806ed

30 4月, 2008 1 次提交

fs: replace remaining __FUNCTION__ occurrences · 8e24eea7

由 Harvey Harrison 提交于 4月 30, 2008

__FUNCTION__ is gcc-specific, use __func__
Signed-off-by: NHarvey Harrison <harvey.harrison@gmail.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

8e24eea7

28 4月, 2008 1 次提交

mm: remove nopage · 3c18ddd1

由 Nick Piggin 提交于 4月 28, 2008

Nothing in the tree uses nopage any more.  Remove support for it in the
core mm code and documentation (and a few stray references to it in
comments).
Signed-off-by: NNick Piggin <npiggin@suse.de>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

3c18ddd1

22 4月, 2008 1 次提交

dlm: move plock code from gfs2 · 2402211a

由 David Teigland 提交于 3月 14, 2008

Move the code that handles cluster posix locks from gfs2 into the dlm
so that it can be used by both gfs2 and ocfs2.
Signed-off-by: NDavid Teigland <teigland@redhat.com>

2402211a

18 4月, 2008 1 次提交

[GFS2] fix assertion in log_refund() · 62be1f71

由 Roel Kluin 提交于 4月 17, 2008

since unsigned, unused >= 0 is always true.
Signed-off-by: NRoel Kluin <12o3l@tiscali.nl>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

62be1f71

10 4月, 2008 1 次提交

[GFS2] fix GFP_KERNEL misuses · 16c5f06f

由 Josef Bacik 提交于 4月 09, 2008

There are several places where GFP_KERNEL allocations happen under a glock,
which will result in hangs if we're under memory pressure and go to re-enter the
fs in order to flush stuff out. This patch changes the culprits to GFS_NOFS to
keep this problem from happening. Thank you,
Signed-off-by: NJosef Bacik <jbacik@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

16c5f06f

31 3月, 2008 22 次提交

[GFS2] test for IS_ERR rather than 0 · 773adff8

由 Julia Lawall 提交于 3月 24, 2008

The function gfs2_inode_lookup always returns either a valid pointer or a
value made with ERR_PTR, so its result should be tested with IS_ERR, not
with a test for 0.

The problem was found using the following semantic match.
(http://www.emn.fr/x-info/coccinelle/)

//<smpl>
@a@
expression E, E1;
statement S,S1;
position p;
@@

E = gfs2_inode_lookup(...)
... when != E = E1
if@p (E) S else S1

@n@
position a.p;
expression E,E1;
statement S,S1;
@@

E = NULL
... when != E = E1
if@p (E) S else S1

@depends on !n@
expression E;
statement S,S1;
position a.p;
@@

* if@p (E)
  S else S1
//</smpl>
Signed-off-by: NJulia Lawall <julia@diku.dk>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

773adff8

[GFS2] Invalidate cache at correct point · 58e9fee1

由 Benjamin Marzinski 提交于 3月 14, 2008

GFS2 wasn't invalidating its cache before it called into the lock manager
with a request that could potentially drop a lock. This was leaving a
window where the lock could be actually be held by another node, but the
file's page cache would still appear valid, causing coherency problems.
This patch moves the cache invalidation to before the lock manager call
when dropping a lock. It also adds the option to the lock_dlm lock
manager to not use conversion mode deadlock avoidance, which, on a
conversion from shared to exclusive, could internally drop the lock, and
then reacquire in. GFS2 now asks lock_dlm to not do this. Instead, GFS2
manually drops the lock and reacquires it.
Signed-off-by: NBenjamin Marzinski <bmarzins@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

58e9fee1

[GFS2] fs/gfs2/recovery.c: suppress warnings · f5a8cd02

由 akpm@linux-foundation.org 提交于 3月 12, 2008

fs/gfs2/recovery.c: In function 'get_log_header':
fs/gfs2/recovery.c:152: warning: 'lh.lh_sequence' may be used uninitialized in this function
fs/gfs2/recovery.c:152: warning: 'lh.lh_flags' may be used uninitialized in this function
fs/gfs2/recovery.c:152: warning: 'lh.lh_tail' may be used uninitialized in this function
fs/gfs2/recovery.c:152: warning: 'lh.lh_blkno' may be used uninitialized in this function
fs/gfs2/recovery.c:152: warning: 'lh.lh_hash' may be used uninitialized in this function

Cc: David Teigland <teigland@redhat.com>
Cc: Bob Peterson <rpeterso@redhat.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

f5a8cd02

[GFS2] Faster gfs2_bitfit algorithm · 1f466a47

由 Bob Peterson 提交于 3月 10, 2008

This version of the gfs2_bitfit algorithm includes the latest
suggestions from Steve Whitehouse.  It is typically eight to
ten times faster than the version we're using today.  If there
is a lot of metadata mixed in (lots of small files) the
algorithm is often 15 times faster, and given the right
conditions, I've seen peaks of 20 times faster.
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

1f466a47

[GFS2] Streamline quota lock/check for no-quota case · d82661d9

由 Steven Whitehouse 提交于 3月 10, 2008

This patch streamlines the quota checking in the "no quota" case by
making the check inline in the calling function, thus reducing the
number of function calls. Eventually we might be able to remove the
checks from the gfs2_quota_lock() and gfs2_quota_check() functions, but
currently we can't as there are a very few places in the code which need
to call these functions directly still.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>
Cc: Abhijith Das <adas@redhat.com>

d82661d9

[GFS2] Remove drop of module ref where not needed · 860b25d4

由 Steven Whitehouse 提交于 3月 10, 2008

In an earlier patch "[GFS2] fix file_system_type leak on gfs2meta mount"
we removed the code to grab a ref to the module which was not needed
(since we know that the module cannot be unloaded at that time) so
this patch removes the code to drop that reference.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

860b25d4

[GFS2] gfs2_adjust_quota has broken unstuffing code · 20b95bf2

由 Abhijith Das 提交于 3月 06, 2008

This patch combines the 2 patches in bug 434736 to correct the lock
ordering in the unstuffing of the quota inode in gfs2_adjust_quota and
adjusting the number of revokes in gfs2_write_jdata_pagevec
Signed-off-by: NAbhijith Das <adas@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

20b95bf2

[GFS2] possible null pointer dereference fixup · 182fe5ab

由 Cyrill Gorcunov 提交于 3月 03, 2008

gfs2_alloc_get may fail so we have to check it to prevent
NULL pointer dereference.
Signed-off-by: NCyrill Gorcunov <gorcunov@gamil.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

182fe5ab

[GFS2] Need to ensure that sector_t is 64bits for GFS2 · 10528497

由 Steven Whitehouse 提交于 2月 27, 2008

We need to ensure that sector_t is 64bits for GFS2, so that we need to
depend on LBD as well as LSF.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

10528497

[GFS2] re-support special inode · 43a33c53

由 Denis Cheng 提交于 2月 26, 2008

a previous commit removed call to
init_special_inode from inode lookuping, this cause problems as:

 # mknod /mnt/gfs2/dev/null c 1 3
 # cat /mnt/gfs2/dev/null
 cat: /mnt/gfs2/dev/null: Invalid argument

without special inode, GFS2 cannot support char device file,
block device file, fifo pipe, and socket file, lose many important
features as a common file system.

this one line patch re add special inode support.
Signed-off-by: NDenis Cheng <crquan@gmail.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

43a33c53

[GFS2] remove gfs2_dev_iops · d83225d4

由 Denis Cheng 提交于 2月 26, 2008

struct inode_operations gfs2_dev_iops is always the same as gfs2_file_iops,
since Jan 2006, when GFS2 merged into mainstream kernel.

So one of them could be removed.
Signed-off-by: NDenis Cheng <crquan@gmail.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

d83225d4

[GFS2] fix file_system_type leak on gfs2meta mount · 7dc2cf1c

由 Christoph Hellwig 提交于 2月 25, 2008

get_gfs2_sb does a get_fs_type without doing a put_filesystem and
thus leaking a file_system_type reference everytime it's called.

Just use gfs2_fs_type directly instead of doing the lookup and thus
fix the problem.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

7dc2cf1c

[GFS2] Allow bmap to allocate extents · 9b8c81d1

由 Steven Whitehouse 提交于 2月 22, 2008

We've supported mapping of extents when no block allocation is required
for some time. This patch extends that to mapping of extents when an
allocation has been requested. In that case we try to allocate as many
blocks as are requested, but we might return fewer in case there is
something preventing us from returning the complete amount (e.g. an
already allocated block is in the way).

Currently the only code path which can actually request multiple data
blocks in a single bmap call is the page_mkwrite path and even then it
only happens if there are multiple blocks per page. What this patch does
do however, is merge the allocation requests for metadata (growing the
metadata tree in either height or depth) with the allocation of the data
blocks in the case that both are needed. This results in lower overheads
even in the single block allocation case.

The one thing which we can't handle here at the moment is unstuffing. I
would like to be able to do that, but the problem which arises is that
in order to unstuff one has to get a locked page from the page cache
which results in locking problems in the (usual) case that the caller is
holding the page lock on the page it wishes to map. So that case will
have to be addressed in future patches.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

9b8c81d1

[GFS2] Fix a page lock / glock deadlock · 7afd88d9

由 Steven Whitehouse 提交于 2月 22, 2008

We've previously been using a "try lock" in readpage on the basis that
it would prevent deadlocks due to the inverted lock ordering (our normal
lock ordering is glock first and then page lock). Unfortunately tests
have shown that this isn't enough. If the glock has a demote request
queued such that run_queue() in the glock code tries to do a demote when
its called under readpage then it will try and write out all the dirty
pages which requires locking them. This then deadlocks with the page
locked by readpage.

The solution is to always require two calls into readpage. The first
unlocks the page, gets the glock and returns AOP_TRUNCATED_PAGE, the
second does the actual readpage and unlocks the glock & page as
required.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

7afd88d9

[GFS2] proper extern for gfs2/locking/dlm/mount.c:gdlm_ops · 60b779cf

由 Adrian Bunk 提交于 2月 17, 2008

This patch adds a proper extern declaration for gdlm_ops in
fs/gfs2/locking/dlm/lock_dlm.h
Signed-off-by: NAdrian Bunk <bunk@kernel.org>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

60b779cf

[GFS2] gfs2/ops_file.c should #include "ops_inode.h" · 8af4c72f

由 Adrian Bunk 提交于 2月 17, 2008

Every file should include the headers containing the prototypes for
its global functions (in this case for gfs2_set_inode_flags()).
Signed-off-by: NAdrian Bunk <bunk@kernel.org>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

8af4c72f

[GFS2] be*_add_cpu conversion · bb16b342

由 Marcin Slusarz 提交于 2月 13, 2008

replace all:
big_endian_variable = cpu_to_beX(beX_to_cpu(big_endian_variable) +
					expression_in_cpu_byteorder);
with:
	beX_add_cpu(&big_endian_variable, expression_in_cpu_byteorder);
generated with semantic patch
Signed-off-by: NMarcin Slusarz <marcin.slusarz@gmail.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

bb16b342

[GFS2] Fix bug where we called drop_bh incorrectly · 840ca0ec

由 Steven Whitehouse 提交于 2月 12, 2008

As a result of an earlier patch, drop_bh was being called in cases
when it shouldn't have been. Since we never have a gh in the drop
case and we always have a gh in the promote case, we can use that
extra information to tell which case has been seen.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>
Cc: Bob Peterson <rpeterso@redhat.com>

840ca0ec

[GFS2] Get inode buffer only once per block map call · e23159d2

由 Steven Whitehouse 提交于 2月 12, 2008

In the case that we needed to grow the height of the metadata tree
we were looking up the inode buffer and then brelse()ing it despite
the fact that it is needed later in the block map process.

This patch ensures that we look up the inode's buffer once and only
once during the block map process.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

e23159d2

[GFS2] Eliminate (almost) duplicate field from gfs2_inode · 77658aad

由 Steven Whitehouse 提交于 2月 12, 2008

The blocks counter is almost a duplicate of the i_blocks
field in the VFS inode. The only difference is that i_blocks
can be only 32bits long for 32bit arch without large single file
support. Since GFS2 doesn't handle the non-large single file
case (for 32 bit anyway) this adds a new config dependency on
64BIT || LSF. This has always been the case, however we've never
explicitly said so before.

Even if we do add support for the non-LSF case, we will still
not require this field to be duplicated since we will not be
able to access oversized files anyway.

So the net result of all this is that we shave 8 bytes from a gfs2_inode
and get our config deps correct.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

77658aad

[GFS2] Add a function to interate over an extent · 30cbf189

由 Steven Whitehouse 提交于 2月 08, 2008

This adds a function (currently the only use is during mapping
of already allocated blocks, but watch this space) which iterates
over a number of pointers in a block and returns the extent length.

If the initial pointer is 0 (i.e. unallocated) it will return the
number of unallocated blocks in the extent. If the initial pointer
is allocated, then it returns the number of contiguously allocated
blocks in the extent.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

30cbf189

[GFS2] The case of the missing asterisk · c85a665f

由 Steven Whitehouse 提交于 2月 12, 2008

A dereference was forgotten. This adds it back correctly.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

c85a665f

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功