提交 · d769b3c2ab7184ddd42056595b627cc871caa90e · openeuler / raspberrypi-kernel

23 7月, 2011 4 次提交

由 Jan Kara 提交于 7月 21, 2011

sbi->s_mutex isn't needed for isofs at all so we can just remove it. Generally,
since isofs is always mounted read-only, filesystem structure cannot change
under us. So buffer_head contents stays constant after it's filled in. That
leaves us with possible changes of global data structures. Superblock changes
only during filesystem mount (even remount does not change it), inodes are only
filled in during reading from disk. So there are no changes of these structures
to bother about.

Arguments why sbi->s_mutex can be removed at each place:
isofs_readdir: Accesses sb, inode, filp, local variables => s_mutex not needed
isofs_lookup: Protected by directory's i_mutex. Accesses sb, inode, dentry,
local variables => s_mutex not needed
rock_ridge_symlink_readpage: Protected by page lock. Accesses sb, inode,
local variables => s_mutex not needed.
Signed-off-by: NJan Kara <jack@suse.cz>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

d769b3c2

jffs2: fix IN_DELETE_SELF on overwriting rename() killing a directory · 22ba747f

由 Al Viro 提交于 7月 21, 2011

We don't generate IN_DELETE_SELF on victim of overwriting rename() if
it happens to be a directory. Trivially fixed by doing to ->i_nlink
what we do ->pino_nlink a couple of lines later in jffs2_rename().
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

22ba747f

fix IN_DELETE_SELF on overwriting rename() on ramfs et.al. · 841590ce

由 Al Viro 提交于 7月 21, 2011

On ramfs and other simple_rename() users IN_DELETE_SELF is not generated
for victim of overwriting rename() if it's is a directory. Works on
most of the local filesystems and really trivial to fix...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

841590ce

mm/truncate.c: fix build for CONFIG_BLOCK not enabled · ed70afcd

由 Randy Dunlap 提交于 7月 21, 2011

Fix build error when CONFIG_BLOCK is not enabled by providing a stub
inode_dio_wait() function.

mm/truncate.c:612: error: implicit declaration of function 'inode_dio_wait'
Signed-off-by: NRandy Dunlap <rdunlap@xenotime.net>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

ed70afcd

21 7月, 2011 36 次提交

fs:update the NOTE of the file_operations structure · 295cc522

由 Wanlong Gao 提交于 7月 19, 2011

Big kernel lock had been removed and setlease now use the lock_flocks()
to hold a special spin lock file_lock_lock by Matthew.
So just remove the out-of-date NOTE.
Signed-off-by: NWanlong Gao <gaowanlong@cn.fujitsu.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

295cc522

A
Remove dead code in dget_parent() · 86c98e8c
由 Al Viro 提交于 7月 18, 2011
```
->d_parent is never NULL...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
86c98e8c

AFS: Fix silly characters in a comment · e4b9f005

由 David Howells 提交于 7月 18, 2011

Fix silly characters in a comment in AFS code (some weird characters replaced
the word 'flag' some point way back).

Reported-by: viro@ZenIV.linux.org.uk
Signed-off-by: NDavid Howells <dhowells@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

e4b9f005

A
switch d_add_ci() to d_splice_alias() in "found negative" case as well · 4513d899
由 Al Viro 提交于 7月 17, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
4513d899

simplify gfs2_lookup() · 6c673ab3

由 Al Viro 提交于 7月 17, 2011

d_splice_alias() will DTRT when given NULL or ERR_PTR
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

6c673ab3

A
jfs_lookup(): don't bother with . or .. · 79ac5a46
由 Al Viro 提交于 7月 17, 2011
```
they'll never be passed to ->lookup()
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
79ac5a46
A
get rid of useless dget_parent() in btrfs rename() and link() · 10d9f309
由 Al Viro 提交于 7月 16, 2011
```
->d_parent is locked and stable there...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
10d9f309

get rid of useless dget_parent() in fs/btrfs/ioctl.c · 2fbe8c8a

由 Al Viro 提交于 7月 16, 2011

both callers there have dentry->d_parent stabilized by the fact that
their caller had obtained dentry from lookup_one_len() and had not
dropped ->i_mutex on parent since then.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

2fbe8c8a

fs: push i_mutex and filemap_write_and_wait down into ->fsync() handlers · 02c24a82

由 Josef Bacik 提交于 7月 16, 2011

Btrfs needs to be able to control how filemap_write_and_wait_range() is called
in fsync to make it less of a painful operation, so push down taking i_mutex and
the calling of filemap_write_and_wait() down into the ->fsync() handlers. Some
file systems can drop taking the i_mutex altogether it seems, like ext3 and
ocfs2. For correctness sake I just pushed everything down in all cases to make
sure that we keep the current behavior the same for everybody, and then each
individual fs maintainer can make up their mind about what to do from there.
Thanks,
Acked-by: NJan Kara <jack@suse.cz>
Signed-off-by: NJosef Bacik <josef@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

02c24a82

drivers: fix up various ->llseek() implementations · 22735068

由 Josef Bacik 提交于 7月 18, 2011

Fix up a few ->llseek() implementations that won't deal with SEEK_HOLE/SEEK_DATA
properly.  Make them future proof so that if we ever add new options they will
return -EINVAL.  Thanks,
Signed-off-by: NJosef Bacik <josef@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

22735068

fs: handle SEEK_HOLE/SEEK_DATA properly in all fs's that define their own llseek · 06222e49

由 Josef Bacik 提交于 7月 18, 2011

This converts everybody to handle SEEK_HOLE/SEEK_DATA properly. In some cases
we just return -EINVAL, in others we do the normal generic thing, and in others
we're simply making sure that the properly due-dilligence is done. For example
in NFS/CIFS we need to make sure the file size is update properly for the
SEEK_HOLE and SEEK_DATA case, but since it calls the generic llseek stuff itself
that is all we have to do. Thanks,
Signed-off-by: NJosef Bacik <josef@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

06222e49

Ext4: handle SEEK_HOLE/SEEK_DATA generically · c334b113

由 Josef Bacik 提交于 7月 18, 2011

Since Ext4 has its own lseek we need to make sure it handles
SEEK_HOLE/SEEK_DATA. For now just do the same thing that is done in the generic
case, somebody else can come along and make it do fancy things later. Thanks,
Signed-off-by: NJosef Bacik <josef@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

c334b113

Btrfs: implement our own ->llseek · b2675157

由 Josef Bacik 提交于 7月 18, 2011

In order to handle SEEK_HOLE/SEEK_DATA we need to implement our own llseek.
Basically for the normal SEEK_*'s we will just defer to the generic helper, and
for SEEK_HOLE/SEEK_DATA we will use our fiemap helper to figure out the nearest
hole or data. Currently this helper doesn't check for delalloc bytes for
prealloc space, so for now treat prealloc as data until that is fixed. Thanks,
Signed-off-by: NJosef Bacik <josef@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

b2675157

fs: add SEEK_HOLE and SEEK_DATA flags · 982d8165

由 Josef Bacik 提交于 7月 18, 2011

This just gets us ready to support the SEEK_HOLE and SEEK_DATA flags.  Turns out
using fiemap in things like cp cause more problems than it solves, so lets try
and give userspace an interface that doesn't suck.  We need to match solaris
here, and the definitions are

*o* If /whence/ is SEEK_HOLE, the offset of the start of the
next hole greater than or equal to the supplied offset
is returned. The definition of a hole is provided near
the end of the DESCRIPTION.

*o* If /whence/ is SEEK_DATA, the file pointer is set to the
start of the next non-hole file region greater than or
equal to the supplied offset.

So in the generic case the entire file is data and there is a virtual hole at
the end.  That means we will just return i_size for SEEK_HOLE and will return
the same offset for SEEK_DATA.  This is how Solaris does it so we have to do it
the same way.

Thanks,
Signed-off-by: NJosef Bacik <josef@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

982d8165

reiserfs: make reiserfs default to barrier=flush · b4d5b10f

由 Christoph Hellwig 提交于 7月 16, 2011

Change the default reiserfs mount option to barrier=flush.  Based on a patch
from Jeff Mahoney in the SuSE tree.
Signed-off-by: NJeff Mahoney <jeffm@suse.com>
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

b4d5b10f

ext3: make ext3 mount default to barrier=1 · 00eacd66

由 Christoph Hellwig 提交于 7月 16, 2011

This patch turns on barriers by default for ext3.  mount -o barrier=0
will turn them off.  Based on a patch from Chris Mason in the SuSE tree.
Signed-off-by: NChris Mason <chris.mason@oracle.com>
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Acked-by: NEric Sandeen <sandeen@redhat.com>
Acked-by: NJan Kara <jack@suse.cz>
Acked-by: NJeff Mahoney <jeffm@suse.com>
Acked-by: NTed Ts'o <tytso@mit.edu>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

00eacd66

A
don't open-code parent_ino() in assorted ->readdir() · b85fd6bd
由 Al Viro 提交于 7月 17, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
b85fd6bd
A
minix_getattr(): don't bother with ->d_parent · 2def9e4e
由 Al Viro 提交于 7月 16, 2011
```
we can find superblock easier, TYVM...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
2def9e4e
A
coda_venus_readdir(): use offsetof() · ee60498f
由 Al Viro 提交于 7月 16, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
ee60498f

arm: don't create useless copies to pass into debugfs_create_dir() · c066b65a

由 Al Viro 提交于 7月 16, 2011

its first argument is const char * and it's really not modified...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

c066b65a

A
switch assorted clock drivers to debugfs_remove_recursive() · 12520c43
由 Al Viro 提交于 7月 16, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
12520c43

fs: seq_file - add event counter to simplify poll() support · f1514638

由 Kay Sievers 提交于 7月 12, 2011

Moving the event counter into the dynamically allocated 'struc seq_file'
allows poll() support without the need to allocate its own tracking
structure.

All current users are switched over to use the new counter.

Requested-by: Andrew Morton akpm@linux-foundation.org
Acked-by: NNeilBrown <neilb@suse.de>
Tested-by: Lucas De Marchi lucas.demarchi@profusion.mobi
Signed-off-by: NKay Sievers <kay.sievers@vrfy.org>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

f1514638

fs: move inode_dio_done to the end_io handler · 72c5052d

由 Christoph Hellwig 提交于 6月 24, 2011

For filesystems that delay their end_io processing we should keep our
i_dio_count until the the processing is done.  Enable this by moving
the inode_dio_done call to the end_io handler if one exist.  Note that
the actual move to the workqueue for ext4 and XFS is not done in
this patch yet, but left to the filesystem maintainers.  At least
for XFS it's not needed yet either as XFS has an internal equivalent
to i_dio_count.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

72c5052d

fs: simplify the blockdev_direct_IO prototype · aacfc19c

由 Christoph Hellwig 提交于 6月 24, 2011

Simple filesystems always pass inode->i_sb_bdev as the block device
argument, and never need a end_io handler.  Let's simply things for
them and for my grepping activity by dropping these arguments.  The
only thing not falling into that scheme is ext4, which passes and
end_io handler without needing special flags (yet), but given how
messy the direct I/O code there is use of __blockdev_direct_IO
in one instead of two out of three cases isn't going to make a large
difference anyway.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

aacfc19c

fs: always maintain i_dio_count · df2d6f26

由 Christoph Hellwig 提交于 6月 24, 2011

Maintain i_dio_count for all filesystems, not just those using DIO_LOCKING.
This these filesystems to also protect truncate against direct I/O requests
by using common code.  Right now the only non-DIO_LOCKING filesystem that
appears to do so is XFS, which uses an opencoded variant of the i_dio_count
scheme.

Behaviour doesn't change for filesystems never calling inode_dio_wait.
For ext4 behaviour changes when using the dioread_nonlock option, which
previously was missing any protection between truncate and direct I/O reads.
For ocfs2 that handcrafted i_dio_count manipulations are replaced with
the common code now enable.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

df2d6f26

fs: move inode_dio_wait calls into ->setattr · 562c72aa

由 Christoph Hellwig 提交于 6月 24, 2011

Let filesystems handle waiting for direct I/O requests themselves instead
of doing it beforehand. This means filesystem-specific locks to prevent
new dio referenes from appearing can be held. This is important to allow
generalizing i_dio_count to non-DIO_LOCKING filesystems.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

562c72aa

rw_semaphore: remove up/down_read_non_owner · 11b80f45

由 Christoph Hellwig 提交于 6月 24, 2011

Now that the last users is gone these can be removed.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

11b80f45

fs: kill i_alloc_sem · bd5fe6c5

由 Christoph Hellwig 提交于 6月 24, 2011

i_alloc_sem is a rather special rw_semaphore.  It's the last one that may
be released by a non-owner, and it's write side is always mirrored by
real exclusion.  It's intended use it to wait for all pending direct I/O
requests to finish before starting a truncate.

Replace it with a hand-grown construct:

 - exclusion for truncates is already guaranteed by i_mutex, so it can
   simply fall way
 - the reader side is replaced by an i_dio_count member in struct inode
   that counts the number of pending direct I/O requests.  Truncate can't
   proceed as long as it's non-zero
 - when i_dio_count reaches non-zero we wake up a pending truncate using
   wake_up_bit on a new bit in i_flags
 - new references to i_dio_count can't appear while we are waiting for
   it to read zero because the direct I/O count always needs i_mutex
   (or an equivalent like XFS's i_iolock) for starting a new operation.

This scheme is much simpler, and saves the space of a spinlock_t and a
struct list_head in struct inode (typically 160 bits on a non-debug 64-bit
system).
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

bd5fe6c5

fs: simplify handling of zero sized reads in __blockdev_direct_IO · f9b5570d

由 Christoph Hellwig 提交于 6月 24, 2011

Reject zero sized reads as soon as we know our I/O length, and don't
borther with locks or allocations that might have to be cleaned up
otherwise.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

f9b5570d

ext4: Rewrite ext4_page_mkwrite() to use generic helpers · 9ea7df53

由 Jan Kara 提交于 6月 24, 2011

Rewrite ext4_page_mkwrite() to use __block_page_mkwrite() helper. This
removes the need of using i_alloc_sem to avoid races with truncate which
seems to be the wrong locking order according to lock ordering documented in
mm/rmap.c. Also calling ext4_da_write_begin() as used by the old code seems to
be problematic because we can decide to flush delay-allocated blocks which
will acquire s_umount semaphore - again creating unpleasant lock dependency
if not directly a deadlock.

Also add a check for frozen filesystem so that we don't busyloop in page fault
when the filesystem is frozen.
Signed-off-by: NJan Kara <jack@suse.cz>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

9ea7df53

fat: remove i_alloc_sem abuse · 58268691

由 Christoph Hellwig 提交于 6月 24, 2011

Add a new rw_semaphore to protect bmap against truncate.  Previous
i_alloc_sem was abused for this, but it's going away in this series.

Note that we can't simply use i_mutex, given that the swapon code
calls ->bmap under it.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

58268691

VFS: Fixup kerneldoc for generic_permission() · 8c5dc70a

由 Tobias Klauser 提交于 7月 01, 2011

The flags parameter went away in
d749519b444db985e40b897f73ce1898b11f997e
Signed-off-by: NTobias Klauser <tklauser@distanz.ch>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

8c5dc70a

anonfd: fix missing declaration · e46ebd27

由 Tomasz Stanislawski 提交于 7月 12, 2011

The forward declaration of struct file_operations is
added to avoid compilation warnings.
Signed-off-by: NTomasz Stanislawski <t.stanislaws@samsung.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

e46ebd27

xfs: make use of new shrinker callout for the inode cache · 8daaa831

由 Dave Chinner 提交于 7月 08, 2011

Convert the inode reclaim shrinker to use the new per-sb shrinker
operations. This allows much bigger reclaim batches to be used, and
allows the XFS inode cache to be shrunk in proportion with the VFS
dentry and inode caches. This avoids the problem of the VFS caches
being shrunk significantly before the XFS inode cache is shrunk
resulting in imbalances in the caches during reclaim.
Signed-off-by: NDave Chinner <dchinner@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

8daaa831

vfs: increase shrinker batch size · 8ab47664

由 Dave Chinner 提交于 7月 08, 2011

Now that the per-sb shrinker is responsible for shrinking 2 or more
caches, increase the batch size to keep econmies of scale for
shrinking each cache.  Increase the shrinker batch size to 1024
objects.

To allow for a large increase in batch size, add a conditional
reschedule to prune_icache_sb() so that we don't hold the LRU spin
lock for too long. This mirrors the behaviour of the
__shrink_dcache_sb(), and allows us to increase the batch size
without needing to worry about problems caused by long lock hold
times.

To ensure that filesystems using the per-sb shrinker callouts don't
cause problems, document that the object freeing method must
reschedule appropriately inside loops.
Signed-off-by: NDave Chinner <dchinner@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

8ab47664

superblock: add filesystem shrinker operations · 0e1fdafd

由 Dave Chinner 提交于 7月 08, 2011

Now we have a per-superblock shrinker implementation, we can add a
filesystem specific callout to it to allow filesystem internal
caches to be shrunk by the superblock shrinker.

Rather than perpetuate the multipurpose shrinker callback API (i.e.
nr_to_scan == 0 meaning "tell me how many objects freeable in the
cache), two operations will be added. The first will return the
number of objects that are freeable, the second is the actual
shrinker call.
Signed-off-by: NDave Chinner <dchinner@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

0e1fdafd