1. 25 July 2011 (5 commits)
  2. 21 July 2011 (5 commits)
    • fs: push i_mutex and filemap_write_and_wait down into ->fsync() handlers · 02c24a82
      Committed by Josef Bacik
      Btrfs needs to control how filemap_write_and_wait_range() is called in fsync
      to make it less of a painful operation, so push the taking of i_mutex and the
      call to filemap_write_and_wait() down into the ->fsync() handlers.  Some
      filesystems, such as ext3 and ocfs2, can apparently drop taking i_mutex
      altogether.  For correctness' sake I pushed everything down in all cases so
      that current behaviour stays the same for everybody; each individual fs
      maintainer can then decide what to do from there (see the sketch after this
      entry).
      Thanks,
      Acked-by: Jan Kara <jack@suse.cz>
      Signed-off-by: Josef Bacik <josef@redhat.com>
      Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
      02c24a82
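      A minimal sketch of what an ->fsync() handler looks like once the range
      writeback and i_mutex are taken inside the handler.  "examplefs" is a
      made-up name and the range-based prototype shown here is an assumption;
      the placement of the writeback and locking is the point.

      #include <linux/fs.h>

      /* Sketch only; not the actual patch. */
      static int examplefs_fsync(struct file *file, loff_t start, loff_t end,
                                 int datasync)
      {
              struct inode *inode = file->f_mapping->host;
              int ret;

              /* Writeback of the affected range now happens in the handler... */
              ret = filemap_write_and_wait_range(file->f_mapping, start, end);
              if (ret)
                      return ret;

              /* ...as does taking i_mutex, which the VFS used to take for us. */
              mutex_lock(&inode->i_mutex);
              ret = sync_inode_metadata(inode, 1);  /* a real fs would do its own commit here */
              mutex_unlock(&inode->i_mutex);
              return ret;
      }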
    • fs: move inode_dio_done to the end_io handler · 72c5052d
      Committed by Christoph Hellwig
      For filesystems that delay their end_io processing we should keep our
      i_dio_count until the processing is done.  Enable this by moving the
      inode_dio_done call into the end_io handler if one exists (see the sketch
      after this entry).  Note that the actual move to a workqueue for ext4 and
      XFS is not done in this patch yet, but left to the filesystem maintainers.
      At least for XFS it's not needed yet either, as XFS has an internal
      equivalent to i_dio_count.
      Signed-off-by: Christoph Hellwig <hch@lst.de>
      Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
      72c5052d
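      A hedged sketch of the new responsibility split: when a filesystem
      registers an end_io handler, that handler (not the generic code) drops the
      i_dio_count reference once its own completion work is finished.  The
      "examplefs" name is made up and the callback's argument list is an
      assumption based on the dio_iodone_t of that era.

      #include <linux/fs.h>
      #include <linux/aio.h>

      static void examplefs_end_io(struct kiocb *iocb, loff_t offset,
                                   ssize_t size, void *private, int ret,
                                   bool is_async)
      {
              struct inode *inode = iocb->ki_filp->f_mapping->host;

              /* The filesystem's own completion work (e.g. unwritten extent
               * conversion) would run or be queued here. */

              /* Drop the pending-DIO reference only after that work is done;
               * the generic code no longer calls inode_dio_done() itself when
               * an end_io handler exists. */
              inode_dio_done(inode);
      }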
    • fs: always maintain i_dio_count · df2d6f26
      Committed by Christoph Hellwig
      Maintain i_dio_count for all filesystems, not just those using DIO_LOCKING.
      This allows these filesystems to also protect truncate against direct I/O
      requests using common code (the request-side pairing is sketched after this
      entry).  Right now the only non-DIO_LOCKING filesystem that appears to do so
      is XFS, which uses an open-coded variant of the i_dio_count scheme.
      
      Behaviour doesn't change for filesystems that never call inode_dio_wait.
      For ext4 behaviour changes when using the dioread_nolock option, which
      previously had no protection between truncate and direct I/O reads.
      For ocfs2 the handcrafted i_dio_count manipulations are replaced with
      the common code, now enabled.
      Signed-off-by: Christoph Hellwig <hch@lst.de>
      Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
      df2d6f26
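      A sketch of the request-side pairing that the common code now applies to
      every filesystem, DIO_LOCKING or not.  example_submit_dio() and
      example_issue_bio() are illustrative names; only the i_dio_count handling
      reflects the change.

      #include <linux/fs.h>

      static int example_issue_bio(struct inode *inode, loff_t pos, size_t len)
      {
              return 0;       /* stand-in for the real I/O submission */
      }

      static int example_submit_dio(struct inode *inode, loff_t pos, size_t len)
      {
              int ret;

              atomic_inc(&inode->i_dio_count); /* visible to truncate from here on */
              ret = example_issue_bio(inode, pos, len);
              if (ret < 0)
                      inode_dio_done(inode);   /* error path drops the reference */
              /* on success the completion path calls inode_dio_done() */
              return ret;
      }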
    • fs: move inode_dio_wait calls into ->setattr · 562c72aa
      Committed by Christoph Hellwig
      Let filesystems handle waiting for direct I/O requests themselves instead
      of doing it beforehand.  This means filesystem-specific locks that prevent
      new dio references from appearing can be held while waiting.  This is
      important for generalizing i_dio_count to non-DIO_LOCKING filesystems
      (a sketch of such a ->setattr follows this entry).
      Signed-off-by: Christoph Hellwig <hch@lst.de>
      Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
      562c72aa
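      A sketch of a ->setattr that does the drain itself.  inode_change_ok(),
      inode_dio_wait(), truncate_setsize() and setattr_copy() are real helpers of
      that era; "examplefs" is illustrative, and a real filesystem would hold its
      own DIO-exclusion lock (like XFS's i_iolock) around the wait.

      #include <linux/fs.h>

      static int examplefs_setattr(struct dentry *dentry, struct iattr *attr)
      {
              struct inode *inode = dentry->d_inode;
              int error;

              error = inode_change_ok(inode, attr);
              if (error)
                      return error;

              if (attr->ia_valid & ATTR_SIZE) {
                      /* The filesystem-specific lock that stops new direct I/O
                       * from starting would be taken here, so nothing can sneak
                       * in between the wait and the size change. */
                      inode_dio_wait(inode);
                      truncate_setsize(inode, attr->ia_size);
              }

              setattr_copy(inode, attr);
              mark_inode_dirty(inode);
              return 0;
      }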
    • fs: kill i_alloc_sem · bd5fe6c5
      Committed by Christoph Hellwig
      i_alloc_sem is a rather special rw_semaphore.  It's the last one that may
      be released by a non-owner, and its write side is always mirrored by
      real exclusion.  Its intended use is to wait for all pending direct I/O
      requests to finish before starting a truncate.
      
      Replace it with a hand-grown construct:
      
       - exclusion for truncates is already guaranteed by i_mutex, so it can
         simply fall away
       - the reader side is replaced by an i_dio_count member in struct inode
         that counts the number of pending direct I/O requests.  Truncate can't
         proceed as long as it's non-zero
       - when i_dio_count reaches zero we wake up a pending truncate using
         wake_up_bit on a new bit in i_flags
       - new references to i_dio_count can't appear while we are waiting for
         it to read zero because the direct I/O count always needs i_mutex
         (or an equivalent like XFS's i_iolock) for starting a new operation.
      
      This scheme is much simpler and saves the space of a spinlock_t and a
      struct list_head in struct inode (typically 160 bits on a non-debug 64-bit
      system); a toy model of the scheme follows this entry.
      Signed-off-by: Christoph Hellwig <hch@lst.de>
      Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
      bd5fe6c5
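      A toy model of the scheme, assuming nothing beyond the description above.
      The real code avoids a dedicated waitqueue by using wake_up_bit() on an
      inode flag bit; this sketch trades that space saving for simplicity.

      #include <linux/atomic.h>
      #include <linux/wait.h>

      struct example_inode {
              atomic_t                dio_count;  /* pending direct I/O requests */
              wait_queue_head_t       dio_wq;     /* a waiting truncate sleeps here */
      };

      /* Direct I/O side: take a reference before submitting a request.
       * New requests need i_mutex (or an equivalent), so none can appear
       * while a truncate is already waiting. */
      static void example_dio_begin(struct example_inode *ei)
      {
              atomic_inc(&ei->dio_count);
      }

      /* Completion side: drop the reference; wake a waiting truncate at zero. */
      static void example_dio_done(struct example_inode *ei)
      {
              if (atomic_dec_and_test(&ei->dio_count))
                      wake_up_all(&ei->dio_wq);
      }

      /* Truncate side: wait for all pending requests to drain. */
      static void example_dio_wait(struct example_inode *ei)
      {
              wait_event(ei->dio_wq, atomic_read(&ei->dio_count) == 0);
      }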
  3. 20 July 2011 (6 commits)
  4. 04 June 2011 (1 commit)
    • more conservative S_NOSEC handling · 9e1f1de0
      Committed by Al Viro
      Caching "we have already removed suid/caps" was overenthusiastic as merged.
      On network filesystems we might have had suid/caps set on another client and
      silently picked up by this client on revalidate, all of that *without*
      clearing the S_NOSEC flag.
      
      AFAICS, the only reasonably sane way to deal with that is:
      	* a new superblock flag; unless it is set, S_NOSEC is not going to be set.
      	* local block filesystems set it in their ->mount() (more accurately,
      mount_bdev() does, and so does btrfs's ->mount(); users of mount_bdev() other
      than local block ones clear it).
      	* if any network (or cluster) filesystem wants to use S_NOSEC, it'll
      need to set MS_NOSEC in sb->s_flags *AND* take care to clear S_NOSEC when
      inode attribute changes are picked up from other clients (see the sketch
      after this entry).
      
      It's not an earth-shattering hole (anybody that can set suid on another client
      will almost certainly be able to write to the file before doing that anyway),
      but it's a bug that needs fixing.
      Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
      9e1f1de0
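      A sketch of the opt-in rule, with an illustrative helper name; the real
      check sits in the suid-stripping path, and mount_bdev() is what sets
      MS_NOSEC for local block filesystems.

      #include <linux/fs.h>

      static void example_mark_nosec(struct inode *inode)
      {
              /* Cache "nothing left to strip" only if the filesystem opted in. */
              if (inode->i_sb->s_flags & MS_NOSEC)
                      inode->i_flags |= S_NOSEC;
      }

      As the message above notes, a network or cluster filesystem that does set
      MS_NOSEC must also clear S_NOSEC whenever it picks up attribute changes
      from another client.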
  5. 27 May 2011 (3 commits)
    • Ocfs2/move_extents: Validate moving goal after the adjustment. · ea5e1675
      Committed by Tristan Ye
      Though the goal_to_be_moved will be validated again during the subsequent
      move, it's still a good idea to validate it right after the adjustment at
      the very beginning, rather than validating it before the adjustment.
      Signed-off-by: Tristan Ye <tristan.ye@oracle.com>
      ea5e1675
    • Ocfs2/move_extents: Avoid doing division in extent moving. · 6aea6f50
      Committed by Tristan Ye
      It's not wise to do a 64-bit division anywhere on the kernel side; replace
      it with a proper helper or shifts (see the sketch after this entry).
      Signed-off-by: Tristan Ye <tristan.ye@oracle.com>
      6aea6f50
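      For reference, the two usual alternatives to a raw 64-bit "/" in kernel
      code, sketched with made-up names; div_u64() comes from <linux/math64.h>.

      #include <linux/math64.h>

      /* General case: use the helper instead of "/", which would need libgcc
       * support on 32-bit architectures. */
      static u64 example_groups(u64 clusters, u32 clusters_per_group)
      {
              return div_u64(clusters, clusters_per_group);
      }

      /* Power-of-two divisor: a plain shift, no division at all. */
      static u64 example_groups_pow2(u64 clusters, unsigned int group_shift)
      {
              return clusters >> group_shift;
      }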
    • ocfs2: add cleancache support · 1cfd8bd0
      Committed by Dan Magenheimer
      This eighth patch of eight in the cleancache series "opts in" to
      cleancache for ocfs2.  Clustered filesystems must explicitly enable
      cleancache by calling cleancache_init_shared_fs whenever an instance
      of the filesystem is mounted (see the sketch after this entry).  Ocfs2
      is currently the only user of the clustered-filesystem interface, but
      nevertheless the cleancache hooks in the VFS layer are sufficient for
      ocfs2, including the matching cleancache_flush_fs hook, which must be
      called on unmount.
      
      Details and a FAQ can be found in Documentation/vm/cleancache.txt
      
      [v8: trivial merge conflict update]
      [v5: jeremy@goop.org: simplify init hook and any future fs init changes]
      Signed-off-by: Dan Magenheimer <dan.magenheimer@oracle.com>
      Signed-off-by: Joel Becker <joel.becker@oracle.com>
      Reviewed-by: Jeremy Fitzhardinge <jeremy@goop.org>
      Reviewed-by: Konrad Rzeszutek Wilk <konrad.wilk@oracle.com>
      Cc: Mark Fasheh <mfasheh@suse.com>
      Cc: Andrew Morton <akpm@linux-foundation.org>
      Cc: Al Viro <viro@ZenIV.linux.org.uk>
      Cc: Matthew Wilcox <matthew@wil.cx>
      Cc: Nick Piggin <npiggin@kernel.dk>
      Cc: Mel Gorman <mel@csn.ul.ie>
      Cc: Rik Van Riel <riel@redhat.com>
      Cc: Jan Beulich <JBeulich@novell.com>
      Cc: Chris Mason <chris.mason@oracle.com>
      Cc: Andreas Dilger <adilger@sun.com>
      Cc: Ted Tso <tytso@mit.edu>
      Cc: Nitin Gupta <ngupta@vflare.org>
      1cfd8bd0
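      A sketch of the opt-in, assuming the (uuid, superblock) form of the
      shared-fs init hook in that era's <linux/cleancache.h>; "examplefs" and
      the uuid placeholder are illustrative.  The matching flush on unmount is
      handled by the VFS-level hooks, per the message above.

      #include <linux/cleancache.h>
      #include <linux/fs.h>

      static int examplefs_fill_super(struct super_block *sb, void *data,
                                      int silent)
      {
              static char shared_uuid[16];    /* stand-in for the on-disk volume
                                                 uuid, identical on every node */

              /* ... normal superblock setup would happen here ... */

              /* Clustered filesystems must opt in explicitly on every mount;
               * a purely local filesystem would call cleancache_init_fs(sb). */
              cleancache_init_shared_fs(shared_uuid, sb);
              return 0;
      }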
  6. 26 May 2011 (6 commits)
  7. 25 May 2011 (14 commits)