提交 · 3b1d0b9d0b6f4b76293c8f30cc95aa946bd34150 · openanolis / cloud-kernel

24 9月, 2012 6 次提交

GFS2: Update rgblk_free() to use rbm · 3b1d0b9d

由 Steven Whitehouse 提交于 8月 03, 2012

Replace open coded version with a call to gfs2_rbm_from_block()
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

3b1d0b9d

GFS2: Update gfs2_get_block_type() to use rbm · 3983903a

由 Steven Whitehouse 提交于 8月 03, 2012

Use the new gfs2_rbm_from_block() function to replace an open
coded version of the same code.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

3983903a

GFS2: Replace rgblk_search with gfs2_rbm_find · 5b924ae2

由 Steven Whitehouse 提交于 8月 01, 2012

This is part of a series of patches which are introducing the
gfs2_rbm structure throughout the block allocation code. The
main aim of this part is to create a search function which can
deal directly with struct gfs2_rbm. In this case it specifies
the initial position at which to start the search and also the
point at which the search terminates.

The net result of this is to clean up the search code and make
it rather more readable, and the various possible exceptions which
may occur during the search are partitioned into their own functions.

There are some bug fixes too. We should not be checking the reservations
while allocating extents - the time for that is when we are searching
for where to put the extent, not when we've already made that decision.

Also, rgblk_search had two uses, and in only one of those cases did
it make sense to check for reservations. This is fixed in the new
gfs2_rbm_find function, which has a cleaner interface.

The reservation checking has been improved by always checking for
contiguous reservations, and returning the first free block after
all contiguous reservations. This is done under the spin lock to
ensure consistancy of the tree.

The allocation of extents is now in all cases done by the existing
allocation code, and if there is an active reservation, that is updated
after the fact. Again this is done under the spin lock, since it entails
changing the lookup key for the reservation in question.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

5b924ae2

GFS2: Add structure to contain rgrp, bitmap, offset tuple · 4a993fb1

由 Steven Whitehouse 提交于 7月 31, 2012

This patch introduces a new structure, gfs2_rbm, which is a
tuple of a resource group, a bitmap within the resource group
and an offset within that bitmap. This is designed to make
manipulating these sets of variables easier. There is also a
new helper function which converts this representation back
to a disk block address.

In addition, the rbtree nodes which are used for the reservations
were not being correctly initialised, which is now fixed. Also,
the tracing was not passing through the inode where it should
have been. That is mostly fixed aside from one corner case. This
needs to be revisited since there can also be a NULL rgrp in
some cases which results in the device being incorrect in the
trace.

This is intended to be the first step towards cleaning up some
of the allocation code, and some further bug fixes.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

4a993fb1

GFS2: Remove rs_requested field from reservations · 71f890f7

由 Steven Whitehouse 提交于 7月 30, 2012

The rs_requested field is left over from the original allocation
code, however this should have been a parameter passed to the
various functions from gfs2_inplace_reserve() and not a member of the
reservation structure as the value is not required after the
initial allocation.

This also helps simplify the code since we no longer need to set
the rs_requested to zero. Also the gfs2_inplace_release()
function can also be simplified since the reservation structure
will always be defined when it is called, and the only remaining
task is to unlock the rgrp if required. It can also now be
called unconditionally too, resulting in a further simplification.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

71f890f7

GFS2: Merge two nearly identical xattr functions · 1f981697

由 Steven Whitehouse 提交于 7月 26, 2012

There were two functions in the xattr code which were nearly
identical, the only difference being that one was copy data into
the unstuffed xattrs and the other was copying data out from it.

This patch merges the two functions such that the code which deal
with iteration over the unstuffed xattrs is no longer duplicated.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

1f981697

13 9月, 2012 3 次提交

GFS2: Take account of blockages when using reserved blocks · 62e252ee

由 Steven Whitehouse 提交于 7月 30, 2012

The claim_reserved_blks() function was not taking account of
the possibility of "blockages" while performing allocation.
This can be caused by another node allocating something in
the same extent which has been reserved locally.

This patch tests for this condition and then skips the remainder
of the reservation in this case. This is a relatively rare event,
so that it should not affect the general performance improvement
which the block reservations provide.

The claim_reserved_blks() function also appears not to be able
to deal with reservations which cross bitmap boundaries, but
that can be dealt with in a future patch since we don't generate
boundary crossing reservations currently.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>
Reported-by: NDavid Teigland <teigland@redhat.com>
Cc: Bob Peterson <rpeterso@redhat.com>

62e252ee

GFS2: Fix missing allocation data for set/remove xattr · 645b2ccc

由 Steven Whitehouse 提交于 7月 26, 2012

These entry points were missed in the original patch to allocate
this data structure.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

645b2ccc

GFS2: Make write size hinting code common · da1dfb6a

由 Steven Whitehouse 提交于 7月 26, 2012

This collects up the write size hinting code which is used by the
block reservation subsystem into a single function. At the same
time this also corrects the rounding for this calculation.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

da1dfb6a

04 8月, 2012 1 次提交

gfs2: nuke pdflush from comments · e76e0ec9

由 Artem Bityutskiy 提交于 7月 25, 2012

The pdflush thread is long gone, so this patch removes references to pdflush
from gfs comments.

Cc: Steven Whitehouse <swhiteho@redhat.com>
Signed-off-by: NArtem Bityutskiy <artem.bityutskiy@linux.intel.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

e76e0ec9

31 7月, 2012 2 次提交

gfs2: Convert to new freezing mechanism · 39263d5e

由 Jan Kara 提交于 6月 12, 2012

We update gfs2_page_mkwrite() to use new freeze protection and the transaction
code to use freeze protection while the transaction is running. That is needed
to stop iput() of unlinked file from modifying the filesystem. The rest is
handled by the generic code.

CC: cluster-devel@redhat.com
CC: Steven Whitehouse <swhiteho@redhat.com>
Acked-by: NSteven Whitehouse <swhiteho@redhat.com>
Signed-off-by: NJan Kara <jack@suse.cz>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

39263d5e

gfs2: Push file_update_time() into gfs2_page_mkwrite() · a63e9b2e

由 Jan Kara 提交于 6月 12, 2012

CC: Steven Whitehouse <swhiteho@redhat.com>
CC: cluster-devel@redhat.com
Acked-by: NSteven Whitehouse <swhiteho@redhat.com>
Signed-off-by: NJan Kara <jack@suse.cz>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

a63e9b2e

23 7月, 2012 2 次提交

quota: Move quota syncing to ->sync_fs method · a1177825

由 Jan Kara 提交于 7月 03, 2012

Since the moment writes to quota files are using block device page cache and
space for quota structures is reserved at the moment they are first accessed we
have no reason to sync quota before inode writeback. In fact this order is now
only harmful since quota information can easily change during inode writeback
(either because conversion of delayed-allocated extents or simply because of
allocation of new blocks for simple filesystems not using page_mkwrite).

So move syncing of quota information after writeback of inodes into ->sync_fs
method. This way we do not have to use ->quota_sync callback which is primarily
intended for use by quotactl syscall anyway and we get rid of calling
->sync_fs() twice unnecessarily. We skip quota syncing for OCFS2 since it does
proper quota journalling in all cases (unlike ext3, ext4, and reiserfs which
also support legacy non-journalled quotas) and thus there are no dirty quota
structures.

CC: "Theodore Ts'o" <tytso@mit.edu>
CC: Joel Becker <jlbec@evilplan.org>
CC: reiserfs-devel@vger.kernel.org
Acked-by: NSteven Whitehouse <swhiteho@redhat.com>
Acked-by: NDave Kleikamp <shaggy@kernel.org>
Reviewed-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NJan Kara <jack@suse.cz>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

a1177825

quota: Split dquot_quota_sync() to writeback and cache flushing part · ceed1723

由 Jan Kara 提交于 7月 03, 2012

Split off part of dquot_quota_sync() which writes dquots into a quota file
to a separate function. In the next patch we will use the function from
filesystems and we do not want to abuse ->quota_sync quotactl callback more
than necessary.
Acked-by: NSteven Whitehouse <swhiteho@redhat.com>
Reviewed-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NJan Kara <jack@suse.cz>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

ceed1723

21 7月, 2012 1 次提交

GFS2: Eliminate 64-bit divides · 15e1c960

由 Bob Peterson 提交于 7月 20, 2012

This patch removes the 64-bit divides introduced in the previous patch
in favor of shifting, so that it will compile properly on 32-bit machines.
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

15e1c960

19 7月, 2012 1 次提交

GFS2: Reduce file fragmentation · 8e2e0047

由 Bob Peterson 提交于 7月 19, 2012

This patch reduces GFS2 file fragmentation by pre-reserving blocks. The
resulting improved on disk layout greatly speeds up operations in cases
which would have resulted in interlaced allocation of blocks previously.
A typical example of this is 10 parallel dd processes, each writing to a
file in a common dirctory.

The implementation uses an rbtree of reservations attached to each
resource group (and each inode).
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

8e2e0047

18 7月, 2012 1 次提交

GFS2: kernel panic with small gfs2 filesystems - 1 RG · 294f2ad5

由 Abhijith Das 提交于 7月 18, 2012

In the unlikely setup where there's only one resource group in the gfs2
filesystem, gfs2_rgrpd_get_next() returns a NULL rgd that is not dealt with
properly, causing a kernel NULL ptr dereference. This patch fixes this issue.
Signed-off-by: NAbhi Das <adas@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

294f2ad5

14 7月, 2012 4 次提交

VFS: Pass mount flags to sget() · 9249e17f

由 David Howells 提交于 6月 25, 2012

Pass mount flags to sget() so that it can use them in initialising a new
superblock before the set function is called.  They could also be passed to the
compare function.
Signed-off-by: NDavid Howells <dhowells@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

9249e17f

don't pass nameidata to ->create() · ebfc3b49

由 Al Viro 提交于 6月 10, 2012

boolean "does it have to be exclusive?" flag is passed instead;
Local filesystem should just ignore it - the object is guaranteed
not to be there yet.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

ebfc3b49

stop passing nameidata to ->lookup() · 00cd8dd3

由 Al Viro 提交于 6月 10, 2012

Just the flags; only NFS cares even about that, but there are
legitimate uses for such argument.  And getting rid of that
completely would require splitting ->lookup() into a couple
of methods (at least), so let's leave that alone for now...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

00cd8dd3

stop passing nameidata * to ->d_revalidate() · 0b728e19

由 Al Viro 提交于 6月 10, 2012

Just the lookup flags.  Die, bastard, die...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

0b728e19

28 6月, 2012 1 次提交

GFS2: Fixing double brelse'ing bh allocated in gfs2_meta_read when EIO occurs · 44b8db13

由 Masatake YAMATO 提交于 6月 18, 2012

This patch fixes buffer_head double free in following code path:

gfs2_block_map
=> gfs2_meta_inode_buffer
 => gfs2_meta_indirect_buffer
  => gfs2_meta_read
=> release_metapath

gfs2_block_map calls gfs2_meta_inode_buffer with &mp.mp_bh[0]
as an argument. mp.mp_bh are filled with zero at the beginning
of gfs2_block_map.

If gfs2_meta_inode_buffer returns non-zero value, gfs2_block_map
calls release_metapath to free buffers chained to mp.mp_bh.
release_metapath checks each slot of mp.mp_bh[i] and
free(with brelse) unless the slot is filled with NULL.

&mp.mp_bh[0] passed to gfs2_meta_inode_buffer is filled at
gfs2_meta_read. gfs2_meta_read is filled a buffer allocated with
gfs2_getbuf even if EIO occurs. When EIO occurs, the allocated buffer
is brelse'ed though the pointer(wrong poiner) points the brelse'ed is
passed back to caller via an argument bhp.

gfs2_meta_indirect_buffer, the caller also pass the wrong pointer
to its caller with EIO. Finally gfs2_block_map gets both EIO and
&mp.mp_bh[0] filled with the wrong pointer. release_metapath
calls brelse again on the wrong pointer.
Signed-off-by: NMasatake YAMATO <yamato@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

44b8db13

14 6月, 2012 1 次提交

GFS2: Combine functions get_local_rgrp and gfs2_inplace_reserve · 666d1d8a

由 Bob Peterson 提交于 6月 13, 2012

This function combines rgrp functions get_local_rgrp and
gfs2_inplace_reserve so that the double retry loop is gone.
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

666d1d8a

13 6月, 2012 1 次提交

GFS2: Add kobject release method · 0d515210

由 Bob Peterson 提交于 6月 13, 2012

This patch adds a kobject release function that properly maintains
the kobject use count, so that accesses to the sysfs files do not
cause an access to freed kernel memory after an unmount.
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

0d515210

11 6月, 2012 2 次提交

GFS2: Size seq_file buffer more carefully · 0fe2f1e9

由 Steven Whitehouse 提交于 6月 11, 2012

This places a limit on the buffer size for archs with larger
PAGE_SIZE.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>
Reported-by: NEric Dumazet <eric.dumazet@gmail.com>

0fe2f1e9

GFS2: Use seq_vprintf for glocks debugfs file · 1bb49303

由 Steven Whitehouse 提交于 6月 11, 2012

Make use of the newly added seq_vprintf() function.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>
Reported-by: NEric Dumazet <eric.dumazet@gmail.com>
Acked-by: NAl Viro <viro@ZenIV.linux.org.uk>

1bb49303

08 6月, 2012 2 次提交

GFS2: Use lvbs for storing rgrp information with mount option · 90306c41

由 Benjamin Marzinski 提交于 5月 29, 2012

Instead of reading in the resource groups when gfs2 is checking
for free space to allocate from, gfs2 can store the necessary infromation
in the resource group's lvb. Also, instead of searching for unlinked
inodes in every resource group that's checked for free space, gfs2 can
store the number of unlinked but inodes in the lvb, and only check for
unlinked inodes if it will find some.

The first time a resource group is locked, the lvb must initialized.
Since this involves counting the unlinked inodes in the resource group,
this takes a little extra time. But after that, if the resource group
is locked with GL_SKIP, the buffer head won't be read in unless it's
actually needed.

Enabling the resource groups lvbs is done via the rgrplvb mount option. If
this option isn't set, the lvbs will still be set and updated, but they won't
be verfied or used by the filesystem. To safely turn on this option, all of
the nodes mounting the filesystem must be running code with this patch, and
the filesystem must have been completely unmounted since they were updated.
Signed-off-by: NBenjamin Marzinski <bmarzins@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

90306c41

GFS2: Cache last hash bucket for glock seq_files · ba1ddcb6

由 Steven Whitehouse 提交于 6月 08, 2012

For the glocks and glstats seq_files, which are exposed via debugfs
we should cache the most recent hash bucket, along with the offset
into that bucket. This allows us to restart from that point, rather
than having to begin at the beginning each time.

This is an idea from Eric Dumazet, however I've slightly extended it
so that if the position from which we are due to start is at any
point beyond the last cached point, we start from the last cached
point, plus whatever is the appropriate offset. I don't really expect
people to be lseeking around these files, but if they did so with only
positive offsets, then we'd still get some of the benefit of using a
cached offset.

With my simple test of around 200k entries in the file, I'm seeing
an approx 10x speed up.

Cc: Eric Dumazet <eric.dumazet@gmail.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

ba1ddcb6

07 6月, 2012 1 次提交

GFS2: Increase buffer size for glocks and glstats debugfs files · df5d2f55

由 Steven Whitehouse 提交于 6月 07, 2012

As per Al Viro's suggestion, this increases the buffer size used
for these two files. This provides a speed up of slightly less than
8x (i.e. proportional to the buffer size) for cases when we have
large numbers of glocks.

Cc: Al Viro <viro@ZenIV.linux.org.uk>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

df5d2f55

06 6月, 2012 4 次提交

GFS2: Fix error handling when reading an invalid block from the journal · 1b8ba31a

由 Steven Whitehouse 提交于 5月 29, 2012

When we read an invalid block from the journal, we should not call
withdraw, but simply print a message and return an error. It is
up to the caller to then handle that error. In the case of mount
that means a failed mount, rather than a withdraw (requiring a
reboot). In the case of recovering another nodes journal then
we return an error via the uevent.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

1b8ba31a

GFS2: Add "top dir" flag support · 23d0bb83

由 Steven Whitehouse 提交于 5月 28, 2012

This patch adds support for the "top dir" flag. Currently this is unused
but a subsequent patch is planned which will add support for the
Orlov allocation policy when allocating subdirectories in a parent
with this flag set.

In order to ensure backward compatible behaviour, mkfs.gfs2 does
not currently tag the root directory with this flag, it must always be
set manually.
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

23d0bb83

GFS2: Fold quota data into the reservations struct · 5407e242

由 Bob Peterson 提交于 5月 18, 2012

This patch moves the ancillary quota data structures into the
block reservations structure. This saves GFS2 some time and
effort in allocating and deallocating the qadata structure.
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

5407e242

GFS2: Extend the life of the reservations · 0a305e49

由 Bob Peterson 提交于 6月 06, 2012

This patch lengthens the lifespan of the reservations structure for
inodes. Before, they were allocated and deallocated for every write
operation. With this patch, they are allocated when the first write
occurs, and deallocated when the last process closes the file.
It's more efficient to do it this way because it saves GFS2 a lot of
unnecessary allocates and frees. It also gives us more flexibility
for the future: (1) we can now fold the qadata structure back into
the structure and save those alloc/frees, (2) we can use this for
multi-block reservations.
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

0a305e49

30 5月, 2012 1 次提交

->encode_fh() API change · b0b0382b

由 Al Viro 提交于 4月 02, 2012

pass inode + parent's inode or NULL instead of dentry + bool saying
whether we want the parent or not.

NOTE: that needs ceph fix folded in.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

b0b0382b

16 5月, 2012 1 次提交

GFS2: Fix quota adjustment return code · 500242ac

由 Bob Peterson 提交于 5月 15, 2012

This patch changes function gfs2_adjust_quota so that it properly
returns a good (zero) return code on the normal path through the code.
Without this, mounting GFS2 with -o quota=account periodically gave
this error message: GFS2: fsid=cluster:fs: gfs2_quotad: sync error -5
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

500242ac

11 5月, 2012 3 次提交

GFS2: Add rgrp information to block_alloc trace point · 41db1ab9

由 Bob Peterson 提交于 5月 09, 2012

This is a second attempt at a patch that adds rgrp information to the
block allocation trace point for GFS2. As suggested, the patch was
modified to list the rgrp information _after_ the fields that exist today.

Again, the reason for this patch is to allow us to trace and debug
problems with the block reservations patch, which is still in the works.
We can debug problems with reservations if we can see what block allocations
result from the block reservations. It may also be handy in figuring out
if there are problems in rgrp free space accounting. In other words,
we can use it to track the rgrp and its free space along side the allocations
that are taking place.
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

41db1ab9

GFS2: Eliminate unused "new" parameter to gfs2_meta_indirect_buffer · f2f9c812

由 Bob Peterson 提交于 5月 10, 2012

It turns out that the "new" parameter to function gfs2_meta_indirect_buffer
was always being passed in as zero. Therefore, this patch eliminates it
and simplifies the function.
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

f2f9c812

vfs: make it possible to access the dentry hash/len as one 64-bit entry · 26fe5750

由 Linus Torvalds 提交于 5月 10, 2012

This allows comparing hash and len in one operation on 64-bit
architectures.  Right now only __d_lookup_rcu() takes advantage of this,
since that is the case we care most about.

The use of anonymous struct/unions hides the alternate 64-bit approach
from most users, the exception being a few cases where we initialize a
'struct qstr' with a static initializer.  This makes the problematic
cases use a new QSTR_INIT() helper function for that (but initializing
just the name pointer with a "{ .name = xyzzy }" initializer remains
valid, as does just copying another qstr structure).
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

26fe5750

08 5月, 2012 1 次提交

GFS2: Remove redundant metadata block type check · 6de1e2f3

由 Bob Peterson 提交于 4月 27, 2012

This patch removes a redundant metadata block check. See description below.
Signed-off-by: NBob Peterson <rpeterso@redhat.com>
Signed-off-by: NSteven Whitehouse <swhiteho@redhat.com>

6de1e2f3

06 5月, 2012 1 次提交

vfs: Rename end_writeback() to clear_inode() · dbd5768f

由 Jan Kara 提交于 5月 03, 2012

After we moved inode_sync_wait() from end_writeback() it doesn't make sense
to call the function end_writeback() anymore. Rename it to clear_inode()
which well says what the function really does - set I_CLEAR flag.
Signed-off-by: NJan Kara <jack@suse.cz>
Signed-off-by: NFengguang Wu <fengguang.wu@intel.com>

dbd5768f

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功