提交 · 13efdbecc65ef6ec4028551fb223dea5c5e3143c · openanolis / cloud-kernel

30 7月, 2009 2 次提交

GFS2: remove dcache entries for remote deleted inodes · b94a170e

由 Benjamin Marzinski 提交于 7月 23, 2009

When a file is deleted from a gfs2 filesystem on one node, a dcache
entry for it may still exist on other nodes in the cluster. If this
happens, gfs2 will be unable to free this file on disk. Because of this,
it's possible to have a gfs2 filesystem with no files on it and no free
space. With this patch, when a node receives a callback notifying it
that the file is being deleted on another node, it schedules a new
workqueue thread to remove the file's dcache entry.
Signed-off-by: NBenjamin Marzinski <bmarzins@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

b94a170e

GFS2: keep statfs info in sync on grows · 1946f70a

由 Benjamin Marzinski 提交于 6月 25, 2009

GFS2 wasn't syncing its statfs info on grows. This causes a problem
when you grow the filesystem on multiple nodes. GFS2 would calculate
the new space based on the resource groups (which are always current),
and then assume that the filesystem had grown the from the existing
statfs size. If you grew the filesystem on two different nodes in a
short time, the second node wouldn't see the statfs size change from the
first node, and would assume that it was grown by a larger amount than
it was. When all these changes were synced out, the total fileystem
size would be incorrect (the first grow would be counted twice).

This patch syncs makes GFS2 read in the statfs changes from disk before
a grow, and write them out after the grow, while the master statfs inode
is locked.
Signed-off-by: NBenjamin Marzinski <bmarzins@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

1946f70a

12 6月, 2009 3 次提交

GFS2: Remove lock_kernel from gfs2_put_super() · 3ea40058

由 Steven Whitehouse 提交于 6月 12, 2009

It is not required here.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat,com>
Cc: Christoph Hellwig <hch@infradead.org>

3ea40058

push BKL down into ->put_super · 6cfd0148

由 Christoph Hellwig 提交于 5月 05, 2009

Move BKL into ->put_super from the only caller.  A couple of
filesystems had trivial enough ->put_super (only kfree and NULLing of
s_fs_info + stuff in there) to not get any locking: coda, cramfs, efs,
hugetlbfs, omfs, qnx4, shmem, all others got the full treatment.  Most
of them probably don't need it, but I'd rather sort that out individually.
Preferably after all the other BKL pushdowns in that area.

[AV: original used to move lock_super() down as well; these changes are
removed since we don't do lock_super() at all in generic_shutdown_super()
now]
[AV: fuse, btrfs and xfs are known to need no damn BKL, exempt]
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

6cfd0148

gfs2: remove ->write_super and stop maintaining ->s_dirt · b7d245de

由 Christoph Hellwig 提交于 4月 27, 2009

Signed-off-by: NChristoph Hellwig <hch@lst.de>
Acked-by: NSteven Whitehouse <swhiteho@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

b7d245de

22 5月, 2009 1 次提交

GFS2: Merge mount.c and ops_super.c into super.c · 9e6e0a12

由 Steven Whitehouse 提交于 5月 22, 2009

mount.c only contained a single function, so is not really
worth retaining on its own. All of the super related code
is now either in super.c or ops_fstype.c
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

9e6e0a12

24 3月, 2009 2 次提交

GFS2: Fix freeze issue · df3647b2

由 Steven Whitehouse 提交于 3月 23, 2009

This removes some old code that was causing issues during
filesystem freeze.
Reported-by: NAndrew Price <andy@andrewprice.me.uk>
Tested-by: NAndrew Price <andy@andrewprice.me.uk>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

df3647b2

GFS2: Merge lock_dlm module into GFS2 · f057f6cd

由 Steven Whitehouse 提交于 1月 12, 2009

This is the big patch that I've been working on for some time
now. There are many reasons for wanting to make this change
such as:
 o Reducing overhead by eliminating duplicated fields between structures
 o Simplifcation of the code (reduces the code size by a fair bit)
 o The locking interface is now the DLM interface itself as proposed
   some time ago.
 o Fewer lookups of glocks when processing replies from the DLM
 o Fewer memory allocations/deallocations for each glock
 o Scope to do further optimisations in the future (but this patch is
   more than big enough for now!)

Please note that (a) this patch relates to the lock_dlm module and
not the DLM itself, that is still a separate module; and (b) that
we retain the ability to build GFS2 as a standalone single node
filesystem with out requiring the DLM.

This patch needs a lot of testing, hence my keeping it I restarted
my -git tree after the last merge window. That way, this has the maximum
exposure before its merged. This is (modulo a few minor bug fixes) the
same patch that I've been posting on and off the the last three months
and its passed a number of different tests so far.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

f057f6cd

05 1月, 2009 7 次提交

Revert "GFS2: Fix use-after-free bug on umount" · fefc03bf

由 Steven Whitehouse 提交于 12月 19, 2008

This reverts commit 78802499912f1ba31ce83a94c55b5a980f250a43.

The original patch is causing problems in relation to order of
operations at umount in relation to jdata files. I need to fix
this a different way.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

fefc03bf

GFS2: Fix use-after-free bug on umount · 3af165ac

由 Steven Whitehouse 提交于 11月 27, 2008

There was a use-after-free with the GFS2 super block during
umount. This patch moves almost all of the umount code from
->put_super into ->kill_sb, the only bit that cannot be moved
being the glock hash clearing which has to remain as ->put_super
due to umount ordering requirements. As a result its now obvious
that the kfree is the final operation, whereas before it was
hidden in ->put_super.

Also gfs2_jindex_free is then only referenced from a single file
so thats moved and marked static too.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

3af165ac

GFS2: Move four functions from super.c · 2bfb6449

由 Steven Whitehouse 提交于 11月 26, 2008

The functions which are being moved can all be marked
static in their new locations, since they only have
a single caller each. Their new locations are more
logical than before and some of the functions are
small enough that the compiler might well inline them.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

2bfb6449

GFS2: Fix bug in gfs2_lock_fs_check_clean() · b5289681

由 Steven Whitehouse 提交于 11月 26, 2008

gfs2_lock_fs_check_clean() should not be calling gfs2_jindex_hold()
since it doesn't work like rindex hold, despite the comment. That
allows gfs2_jindex_hold() to be moved into ops_fstype.c where it
can be made static.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

b5289681

GFS2: Banish struct gfs2_rgrpd_host · 73f74948

由 Steven Whitehouse 提交于 11月 04, 2008

This patch moves the final field so that we can get rid
of struct gfs2_rgrpd_host, as promised some time ago. Also
by rearranging the fields slightly, we are able to reduce
the size of the gfs2_rgrpd structure at the same time.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

73f74948

GFS2: Move rg_free from gfs2_rgrpd_host to gfs2_rgrpd · cfc8b549

由 Steven Whitehouse 提交于 11月 04, 2008

The second of three fields which need to move, in order
to remove the struct gfs2_rgrpd_host.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

cfc8b549

GFS2: Move i_size from gfs2_dinode_host and rename it to i_disksize · c9e98886

由 Steven Whitehouse 提交于 11月 04, 2008

This patch moved the i_size field from the gfs2_dinode_host and
following the ext3 convention renames it i_disksize.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

c9e98886

13 8月, 2008 1 次提交

GFS2: Fix metafs mounts · 9b8df98f

由 Steven Whitehouse 提交于 8月 08, 2008

This patch is intended to fix the issues reported in bz #457798. Instead
of having the metafs as a separate filesystem, it becomes a second root
of gfs2. As a result it will appear as type gfs2 in /proc/mounts, but it
is still possible (for backwards compatibility purposes) to mount it as
type gfs2meta. A new mount flag "meta" is introduced so that its possible
to tell the two cases apart in /proc/mounts.

As a result it becomes possible to mount type gfs2 with -o meta and
get the same result as mounting type gfs2meta. So it is possible to
mount just the metafs on its own. Currently if you do this, its then
impossible to mount the "normal" root of the gfs2 filesystem without
first unmounting the metafs root. I'm not sure if thats a feature or
a bug :-)

Either way, this is a great improvement on the previous scheme and I've
verified that it works ok with bind mounts on both the "normal" root
and the metafs root in various combinations.

There were also a bunch of functions in super.c which didn't belong there,
so this moves them into ops_fstype.c where they can be static. Hopefully
the mount/umount sequence is now more obvious as a result.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>
Cc: Alexander Viro <aviro@redhat.com>

9b8df98f

27 7月, 2008 1 次提交
- A
  [PATCH] don't pass nameidata to gfs2_lookupi() · a569c711
  由 Al Viro 提交于 7月 23, 2008
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  a569c711
10 7月, 2008 1 次提交

[GFS2] Remove support for unused and pointless flag · c9f6a6bb

由 Steven Whitehouse 提交于 7月 10, 2008

The ability to mark files for direct i/o access when opened
normally is both unused and pointless, so this patch removes
support for that feature.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

c9f6a6bb

27 6月, 2008 1 次提交

[GFS2] Clean up the glock core · 6802e340

由 Steven Whitehouse 提交于 5月 21, 2008

This patch implements a number of cleanups to the core of the
GFS2 glock code. As a result a lot of code is removed. It looks
like a really big change, but actually a large part of this patch
is either removing or moving existing code.

There are some new bits too though, such as the new run_queue()
function which is considerably streamlined. Highlights of this
patch include:

 o Fixes a cluster coherency bug during SH -> EX lock conversions
 o Removes the "glmutex" code in favour of a single bit lock
 o Removes the ->go_xmote_bh() for inodes since it was duplicating
   ->go_lock()
 o We now only use the ->lm_lock() function for both locks and
   unlocks (i.e. unlock is a lock with target mode LM_ST_UNLOCKED)
 o The fast path is considerably shortly, giving performance gains
   especially with lock_nolock
 o The glock_workqueue is now used for all the callbacks from the DLM
   which allows us to simplify the lock_dlm module (see following patch)
 o The way is now open to make further changes such as eliminating the two
   threads (gfs2_glockd and gfs2_scand) in favour of a more efficient
   scheme.

This patch has undergone extensive testing with various test suites
so it should be pretty stable by now.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>
Cc: Bob Peterson <rpeterso@redhat.com>

6802e340

10 4月, 2008 1 次提交

[GFS2] fix GFP_KERNEL misuses · 16c5f06f

由 Josef Bacik 提交于 4月 09, 2008

There are several places where GFP_KERNEL allocations happen under a glock,
which will result in hangs if we're under memory pressure and go to re-enter the
fs in order to flush stuff out. This patch changes the culprits to GFS_NOFS to
keep this problem from happening. Thank you,
Signed-off-by: NJosef Bacik <jbacik@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

16c5f06f

31 3月, 2008 1 次提交

[GFS2] Streamline indirect pointer tree height calculation · ecc30c79

由 Steven Whitehouse 提交于 1月 28, 2008

This patch improves the calculation of the tree height in order to reduce
the number of operations which are carried out on each call to gfs2_block_map.
In the common case, we now make a single comparison, rather than calculating
the required tree height from scratch each time. Also in the case that the
tree does need some extra height, we start from the current height rather from
zero when we work out what the new height ought to be.

In addition the di_height field is moved into the inode proper and reduced
in size to a u8 since the value must be between 0 and GFS2_MAX_META_HEIGHT (10).
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

ecc30c79

25 1月, 2008 7 次提交

[GFS2] Initialize extent_list earlier · 0811a127

由 Bob Peterson 提交于 1月 03, 2008

Here is a patch for the latest upstream GFS2 code:
The journal extent map needs to be initialized sooner than it
currently is.  Otherwise failed mount attempts (e.g. not enough
journals, etc.) may panic trying to access the uninitialized list.
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

0811a127

[GFS2] Eliminate the no longer needed sd_statfs_mutex · c3f60b6e

由 Bob Peterson 提交于 12月 12, 2007

This patch eliminates the unneeded sd_statfs_mutex mutex but preserves
the ordering as discussed.
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

c3f60b6e

[GFS2] Journal extent mapping · da6dd40d

由 Bob Peterson 提交于 12月 11, 2007

This patch saves a little time when gfs2 writes to the journals by
keeping a mapping between logical and physical blocks on disk.
That's better than constantly looking up indirect pointers in
buffers, when the journals are several levels of indirection
(which they typically are).
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

da6dd40d

[GFS2] Don't periodically update the jindex · e35b9211

由 Steven Whitehouse 提交于 11月 09, 2007

We only care about the content of the jindex in two cases,
one is when we mount the fs and the other is when we need
to recover another journal. In both cases we have to update
the jindex anyway, so there is no point in updating it
periodically between times, so this removes it to simplify
gfs2_logd.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

e35b9211

[GFS2] Remove "reclaim limit" · c2932e03

由 Steven Whitehouse 提交于 11月 01, 2007

This call to reclaim glocks is not needed, and in particular we don't want it
in the fast path for locking glocks. The limit was entirely arbitrary anyway
and we can't expect users to adjust things like this, the remaining code will
do the right thing on its own.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

c2932e03

[GFS2] Remove unused variables · 60b0d087

由 Steven Whitehouse 提交于 10月 31, 2007

These haven't been used for some time, remove them.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

60b0d087

[GFS2] Remove useless i_cache from inodes · f91a0d3e

由 Steven Whitehouse 提交于 10月 15, 2007

The i_cache was designed to keep references to the indirect blocks
used during block mapping so that they didn't have to be looked
up continually. The idea failed because there are too many places
where the i_cache needs to be freed, and this has in the past been
the cause of many bugs.

In addition there was no performance benefit being gained since the
disk blocks in question were cached anyway. So this patch removes
it in order to simplify the code to prepare for other changes which
would otherwise have had to add further support for this feature.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

f91a0d3e

12 10月, 2007 1 次提交

Fix up more bio fallout · 782e3b3b

由 Al Viro 提交于 10月 12, 2007

Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

782e3b3b

10 10月, 2007 2 次提交

[GFS2] Reduce number of gfs2_scand processes to one · 8fbbfd21

由 Steven Whitehouse 提交于 8月 01, 2007

We only need a single gfs2_scand process rather than the one
per filesystem which we had previously. As a result the parameter
determining the frequency of gfs2_scand runs becomes a module
parameter rather than a mount parameter as it was before.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

8fbbfd21

Drop 'size' argument from bio_endio and bi_end_io · 6712ecf8

由 NeilBrown 提交于 9月 27, 2007

As bi_end_io is only called once when the reqeust is complete,
the 'size' argument is now redundant.  Remove it.

Now there is no need for bio_endio to subtract the size completed
from bi_size.  So don't do that either.

While we are at it, change bi_end_io to return void.
Signed-off-by: NNeil Brown <neilb@suse.de>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

6712ecf8

09 7月, 2007 2 次提交

[GFS2] Fix sign problem in quota/statfs and cleanup _host structures · bb8d8a6f

由 Steven Whitehouse 提交于 6月 01, 2007

This patch fixes some sign issues which were accidentally introduced
into the quota & statfs code during the endianess annotation process.
Also included is a general clean up which moves all of the _host
structures out of gfs2_ondisk.h (where they should not have been to
start with) and into the places where they are actually used (often only
one place). Also those _host structures which are not required any more
are removed entirely (which is the eventual plan for all of them).

The conversion routines from ondisk.c are also moved into the places
where they are actually used, which for almost every one, was just one
single place, so all those are now static functions. This also cleans up
the end of gfs2_ondisk.h which no longer needs the #ifdef __KERNEL__.

The net result is a reduction of about 100 lines of code, many functions
now marked static plus the bug fixes as mentioned above. For good
measure I ran the code through sparse after making these changes to
check that there are no warnings generated.

This fixes Red Hat bz #239686
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

bb8d8a6f

[GFS2] Clean up inode number handling · dbb7cae2

由 Steven Whitehouse 提交于 5月 15, 2007

This patch cleans up the inode number handling code. The main difference
is that instead of looking up the inodes using a struct gfs2_inum_host
we now use just the no_addr member of this structure. The tests relating
to no_formal_ino can then be done by the calling code. This has
advantages in that we want to do different things in different code
paths if the no_formal_ino doesn't match. In the NFS patch we want to
return -ESTALE, but in the ->lookup() path, its a bug in the fs if the
no_formal_ino doesn't match and thus we can withdraw in this case.

In order to later fix bz #201012, we need to be able to look up an inode
without knowing no_formal_ino, as the only information that is known to
us is the on-disk location of the inode in question.

This patch will also help us to fix bz #236099 at a later date by
cleaning up a lot of the code in that area.

There are no user visible changes as a result of this patch and there
are no changes to the on-disk format either.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

dbb7cae2

08 3月, 2007 1 次提交
- S
  [GFS2] Remove unused variable · 04b159b1
  由 Steven Whitehouse 提交于 3月 01, 2007
```
Remove an unused variable.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>
```
  04b159b1
06 2月, 2007 4 次提交

[GFS2] Remove local exclusive glock mode · 1c0f4872

由 Steven Whitehouse 提交于 1月 22, 2007

Here is a patch for GFS2 to remove the local exclusive flag. In
the places it was used, mutex's are always held earlier in the
call path, so it appears redundant in the LM_ST_SHARED case.

Also, the GFS2 holders were setting local exclusive in any case where
the requested lock was LM_ST_EXCLUSIVE. So the other places in the glock
code where the flag was tested have been replaced with tests for the
lock state being LM_ST_EXCLUSIVE in order to ensure the logic is the
same as before (i.e. LM_ST_EXCLUSIVE is always locally exclusive as well
as globally exclusive).
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

1c0f4872

[GFS2] Remove the "greedy" function from glock.[ch] · e5dab552

由 Steven Whitehouse 提交于 1月 18, 2007

The "greedy" code was an attempt to retain glocks for a minimum length
of time when they relate to mmap()ed files. The current implementation
of this feature is not, however, ideal in that it required allocating
memory in order to do this and its overly complicated.

It also misses the mark by ignoring the other I/O operations which are
just as likely to suffer from the same problem. So the plan is to remove
this now and then add the functionality back as part of the glock state
machine at a later date (and thus take into account all the possible
users of this feature)
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

e5dab552

[GFS2] Remove max_atomic_write tunable · 330005c2

由 Steven Whitehouse 提交于 1月 15, 2007

This removes an unused sysfs tunable parameter.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

330005c2

[GFS2] Clean up/speed up readdir · 3699e3a4

由 Steven Whitehouse 提交于 1月 17, 2007

This removes the extra filldir callback which gfs2 was using to
enclose an attempt at readahead for inodes during readdir. The
code was too complicated and also hurts performance badly in the
case that the getdents64/readdir call isn't being followed by
stat() and it wasn't even getting it right all the time when it
was.

As a result, on my test box an "ls" of a directory containing 250000
files fell from about 7mins (freshly mounted, so nothing cached) to
between about 15 to 25 seconds. When the directory content was cached,
the time taken fell from about 3mins to about 4 or 5 seconds.

Interestingly in the cached case, running "ls -l" once reduced the time
taken for subsequent runs of "ls" to about 6 secs even without this
patch. Now it turns out that there was a special case of glocks being
used for prefetching the metadata, but because of the timeouts for these
locks (set to 10 secs) the metadata was being timed out before it was
being used and this the prefetch code was constantly trying to prefetch
the same data over and over.

Calling "ls -l" meant that the inodes were brought into memory and once
the inodes are cached, the glocks are not disposed of until the inodes
are pushed out of the cache, thus extending the lifetime of the glocks,
and thus bringing down the time for subsequent runs of "ls"
considerably.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

3699e3a4

30 11月, 2006 2 次提交

[GFS2] Add a comment about reading the super block · aac1a3c7

由 Steven Whitehouse 提交于 11月 30, 2006

The comment explains why we use the bio functions to read
the super block.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>
Cc: Andrew Morton <akpm@osdl.org>
Cc: Srinivasa Ds <srinivasa@in.ibm.com>

aac1a3c7

[GFS2] Mount problem with the GFS2 code · 0da3585e

由 Srinivasa Ds 提交于 11月 30, 2006

  While mounting the gfs2 filesystem,our test team had a problem and we
got this error message.
=======================================================

GFS2: fsid=: Trying to join cluster "lock_nolock", "dasde1"
GFS2: fsid=dasde1.0: Joined cluster. Now mounting FS...
GFS2: not a GFS2 filesystem
GFS2: fsid=dasde1.0: can't read superblock: -22

==========================================================================
On debugging further we found that problem is while reading the super
block(gfs2_read_super) and comparing the magic number in it.
When I  replace the submit_bio() call(present in gfs2_read_super) with
the sb_getblk() and ll_rw_block(), mount operation succeded.
On further analysis we found that before calling submit_bio(),
bio->bi_sector was set to "sector" variable. This "sector" variable has
the same value of bh->b_blocknr(block number). Hence there is a need to
multiply this valuwith (blocksize >> 9)(9 because,sector size
2^9,samething happens in ll_rw_block also, before calling submit_bio()).
So I have developed the patch which solves this problem. Please let me
know your comments.
================================================================
Signed-off-by: NSrinivasa DS <srinivasa@in.ibm.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

0da3585e

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功