提交 · b8c2f3474f1077599ec6e90c2f263f17055cc3d8 · OpenHarmony / kernel_linux

11 6月, 2010 10 次提交

writeback: simplify wakeup_flusher_threads · b8c2f347

由 Christoph Hellwig 提交于 6月 08, 2010

bdi_writeback_all only has one caller, so fold it to simplify the code and
flatten the call stack.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

b8c2f347

writeback: fix writeback_inodes_wb from writeback_inodes_sb · d19de7ed

由 Christoph Hellwig 提交于 6月 08, 2010

When we call writeback_inodes_wb from writeback_inodes_sb we always have
s_umount held, which currently makes the whole operation a no-op.

But if we are called to write out inodes for a specific superblock we always
have s_umount held, so replace the incorrect logic checking for WB_SYNC_ALL
which only worked by coincidence with the proper check for an explicit
superblock argument.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

d19de7ed

writeback: enforce s_umount locking in writeback_inodes_sb · cf37e972

由 Christoph Hellwig 提交于 6月 08, 2010

Make sure that not only sync_filesystem but all callers of writeback_inodes_sb
have the superblock protected against remount.  As-is this disables all
functionality for these callers, but the next patch relies on this locking to
fix writeback_inodes_sb for sync_filesystem.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

cf37e972

writeback: queue work on stack in writeback_inodes_sb · 3c4d7165

由 Christoph Hellwig 提交于 6月 08, 2010

If we want to rely on s_umount in the caller we need to wait for completion
of the I/O submission before returning to the caller.  Refactor
bdi_sync_writeback into a bdi_queue_work_onstack helper and use it for this
case.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

3c4d7165

writeback: fix writeback completion notifications · 7f0e7bed

由 Christoph Hellwig 提交于 6月 08, 2010

The code dealing with bdi_work->state and completion of a bdi_work is a
major mess currently.  This patch makes sure we directly use one set of
flags to deal with it, and use it consistently, which means:

 - always notify about completion from the rcu callback.  We only ever
   wait for it from on-stack callers, so this simplification does not
   even cause a theoretical slowdown currently.  It also makes sure we
   don't miss out on the notification if we ever add other callers to
   wait for it.
 - make earlier completion notification depending on the on-stack
   allocation, not the sync mode.  If we introduce new callers that
   want to do WB_SYNC_NONE writeback from on-stack callers this will
   be nessecary.

Also rename bdi_wait_on_work_clear to bdi_wait_on_work_done and inline
a few small functions into their only caller to make the code
understandable.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

7f0e7bed

pipe: fix check in "set size" fcntl · 6db40cf0

由 Miklos Szeredi 提交于 6月 09, 2010

As it stands this check compares the number of pages to the page size.
This makes no sense and makes the fcntl fail in almost any sane case.

Fix it by checking if nr_pages is not zero (it can become zero only if
arg is too big and round_pipe_size() overflows).
Signed-off-by: NMiklos Szeredi <mszeredi@suse.cz>
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

6db40cf0

pipe: fix pipe buffer resizing · 1d862f41

由 Miklos Szeredi 提交于 6月 08, 2010

pipe_set_size() needs to copy pipe bufs from the old circular buffer
to the new.

The current code gets this wrong in multiple ways, resulting in oops.

Test program is available here:
  http://www.kernel.org/pub/linux/kernel/people/mszeredi/piperesize/Signed-off-by: NMiklos Szeredi <mszeredi@suse.cz>
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

1d862f41

block: remove duplicate BUG_ON() in bd_finish_claiming() · 3e6c0505

由 Jens Axboe 提交于 6月 07, 2010

We do the same BUG_ON() just a line later when calling into
__bd_abort_claiming().
Reported-by: NTejun Heo <tj@kernel.org>
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

3e6c0505

block: bd_start_claiming cleanup · b0018361

由 Nick Piggin 提交于 5月 26, 2010

I don't like the subtle multi-context code in bd_claim (ie.  detects where it
has been called based on bd_claiming). It seems clearer to just require a new
function to finish a 2-part claim.

Also improve commentary in bd_start_claiming as to how it should
be used.
Signed-off-by: NNick Piggin <npiggin@suse.de>
Acked-by: NTejun Heo <tj@kernel.org>
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

b0018361

block: bd_start_claiming fix module refcount · cf342570

由 Nick Piggin 提交于 5月 26, 2010

bd_start_claiming has an unbalanced module_put introduced in 6b4517a7.
Signed-off-by: NNick Piggin <npiggin@suse.de>
Acked-by: NTejun Heo <tj@kernel.org>
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

cf342570

09 6月, 2010 2 次提交

xfs: remove nr_to_write writeback windup. · 254c8c2d

由 Dave Chinner 提交于 6月 09, 2010

Now that the background flush code has been fixed, we shouldn't need to
silently multiply the wbc->nr_to_write to get good writeback. Remove
that code.
Signed-off-by: NDave Chinner <dchinner@redhat.com>
Reviewed-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

254c8c2d

nfsd4: shut down callback queue outside state lock · c3935e30

由 J. Bruce Fields 提交于 6月 04, 2010

This reportedly causes a lockdep warning on nfsd shutdown.  That looks
like a false positive to me, but there's no reason why this needs the
state lock anyway.
Reported-by: NJeff Layton <jlayton@redhat.com>
Signed-off-by: NJ. Bruce Fields <bfields@citi.umich.edu>

c3935e30

06 6月, 2010 1 次提交

jffs2: update ctime when changing the file's permission by setfacl · 1c24d06f

由 Jan Kara 提交于 6月 04, 2010

jffs2 didn't update the ctime of the file when its permission was changed.

Steps to reproduce:
 # touch aaa
 # stat -c %Z aaa
 1275289822
 # setfacl -m  'u::x,g::x,o::x' aaa
 # stat -c %Z aaa
 1275289822                         <- unchanged

But, according to the spec of the ctime, jffs2 must update it.

Port of ext3 patch by Miao Xie <miaox@cn.fujitsu.com>.
Signed-off-by: NJan Kara <jack@suse.cz>
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

1c24d06f

05 6月, 2010 10 次提交

ext4: Fix remaining racy updates of EXT4_I(inode)->i_flags · 84a8dce2

由 Dmitry Monakhov 提交于 6月 05, 2010

A few functions were still modifying i_flags in a racy manner.
Signed-off-by: NDmitry Monakhov <dmonakhov@openvz.org>
Signed-off-by: N"Theodore Ts'o" <tytso@mit.edu>

84a8dce2

flat: fix unmap len in load error path · 1da083c9

由 Mike Frysinger 提交于 6月 04, 2010

The data chunk is mmaped with 'len' which remains unchanged, so use that
when unmapping in the error path rather than trying to recalculate (and
incorrectly so) the value used originally.
Signed-off-by: NMike Frysinger <vapier@gentoo.org>
Acked-by: NDavid McCullough <davidm@snapgear.com>
Acked-by: NGreg Ungerer <gerg@uclinux.org>
Cc: Paul Mundt <lethal@linux-sh.org>
Cc: Michal Simek <monstr@monstr.eu>
Cc: Hirokazu Takata <takata@linux-m32r.org>
Cc: Geert Uytterhoeven <geert@linux-m68k.org>
Acked-by: NDavid Howells <dhowells@redhat.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

1da083c9

fs/binfmt_flat.c: split the stack & data alignments · 2e94de8a

由 Mike Frysinger 提交于 6月 04, 2010

The stack and data have different alignment requirements, so don't force
them to wear the same shoe.  Increase the data alignment to match that
which the elf2flt linker script has always been using: 0x20 bytes.  Not
only does this bring the kernel loader in line with the toolchain, but it
also fixes a swath of gcc tests which try to force larger alignment values
but randomly fail when the FLAT loader fails to deliver.
Signed-off-by: NMike Frysinger <vapier@gentoo.org>
Cc: Herbert Xu <herbert@gondor.apana.org.au>
Cc: David Woodhouse <David.Woodhouse@intel.com>
Cc: Pekka Enberg <penberg@cs.helsinki.fi>
Acked-by: NDavid McCullough <davidm@snapgear.com>
Acked-by: NGreg Ungerer <gerg@uclinux.org>
Cc: Paul Mundt <lethal@linux-sh.org>
Tested-by: NMichal Simek <monstr@monstr.eu>
Cc: Hirokazu Takata <takata@linux-m32r.org>
Cc: Yoshinori Sato <ysato@users.sourceforge.jp>
Cc: Geert Uytterhoeven <geert@linux-m68k.org>
Cc: Jie Zhang <jie@codesourcery.com>
Cc: David Howells <dhowells@redhat.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

2e94de8a

fs/compat_rw_copy_check_uvector: add missing compat_ptr call · 7cbe1770

由 Heiko Carstens 提交于 6月 04, 2010

A call to access_ok is missing a compat_ptr conversion.  Introduced with
b8373363 "compat: factor out
compat_rw_copy_check_uvector from compat_do_readv_writev"

fs/compat.c: In function 'compat_rw_copy_check_uvector':
fs/compat.c:629: warning: passing argument 1 of '__access_ok' makes pointer from integer without a cast
Signed-off-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Reviewed-by: NJeff Moyer <jmoyer@redhat.com>
Cc: <stable@kernel.org>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

7cbe1770

Minix: Clean up left over label · 01afaf61

由 Andrew Hendry 提交于 6月 04, 2010

Remove a left over fail label.
Signed-off-by: NAndrew Hendry <andrew.hendry@gmail.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

01afaf61

fix truncate inode time modification breakage · af5a30d8

由 Nick Piggin 提交于 6月 03, 2010

mtime and ctime should be changed only if the file size has actually
changed. Patches changing ext2 and tmpfs from vmtruncate to new truncate
sequence has caused regressions where they always update timestamps.

There is some strange cases in POSIX where truncate(2) must not update
times unless the size has acutally changed, see 6e656be8.

This area is all still rather buggy in different ways in a lot of
filesystems and needs a cleanup and audit (ideally the vfs will provide
a simple attribute or call to direct all filesystems exactly which
attributes to change). But coming up with the best solution will take a
while and is not appropriate for rc anyway.

So fix recent regression for now.
Signed-off-by: NNick Piggin <npiggin@suse.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

af5a30d8

fix setattr error handling in sysfs, configfs · 8718d36c

由 Nick Piggin 提交于 5月 31, 2010

sysfs and configfs setattr functions have error cases after the generic inode's
attributes have been changed. Fix consistency by changing the generic inode
attributes only when it is guaranteed to succeed.
Signed-off-by: NNick Piggin <npiggin@suse.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

8718d36c

fcntl: return -EFAULT if copy_to_user fails · 5b54470d

由 Dan Carpenter 提交于 6月 03, 2010

copy_to_user() returns the number of bytes remaining, but we want to
return -EFAULT.
	ret = fcntl(fd, F_SETOWN_EX, NULL);
With the original code ret would be 8 here.

V2: Takuya Yoshikawa pointed out a similar issue in f_getown_ex()
Signed-off-by: NDan Carpenter <error27@gmail.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

5b54470d

wrong type for 'magic' argument in simple_fill_super() · 7d683a09

由 Roberto Sassu 提交于 6月 03, 2010

It's used to superblock ->s_magic, which is unsigned long.
Signed-off-by: NRoberto Sassu <roberto.sassu@polito.it>
Reviewed-by: NMimi Zohar <zohar@us.ibm.com>
Signed-off-by: NEric Paris <eparis@redhat.com>
CC: stable@kernel.org
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

7d683a09

fix setattr error handling in sysfs, configfs · 75de46b9

由 Nick Piggin 提交于 5月 31, 2010

sysfs and configfs setattr functions have error cases after the generic inode's
attributes have been changed. Fix consistency by changing the generic inode
attributes only when it is guaranteed to succeed.
Signed-off-by: NNick Piggin <npiggin@suse.de>
Acked-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NGreg Kroah-Hartman <gregkh@suse.de>

75de46b9

03 6月, 2010 9 次提交

pipe: change /proc/sys/fs/pipe-max-pages to byte sized interface · ff9da691

由 Jens Axboe 提交于 6月 03, 2010

This changes the interface to be based on bytes instead. The API
matches that of F_SETPIPE_SZ in that it rounds up the passed in
size so that the resulting page array is a power-of-2 in size.

The proc file is renamed to /proc/sys/fs/pipe-max-size to
reflect this change.
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

ff9da691

pipe: change the privilege required for growing a pipe beyond system max · 419f8367

由 Jens Axboe 提交于 6月 03, 2010

Change it to CAP_SYS_RESOURCE, as that more accurately models what
we want to control.
Suggested-by: NMichael Kerrisk <mtk.manpages@googlemail.com>
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

419f8367

pipe: adjust minimum pipe size to 1 page · 6a6ca57d

由 Jens Axboe 提交于 6月 03, 2010

We don't need to pages to guarantee the POSIX requirement
that upto a page size write must be atomic to an empty
pipe.
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

6a6ca57d

jffs2: Fix NFS race by using insert_inode_locked() · e72e6497

由 David Woodhouse 提交于 6月 03, 2010

New inodes need to be locked as we're creating them, so they don't get used
by other things (like NFSd) before they're ready.

Pointed out by Al Viro.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

e72e6497

D
jffs2: Fix in-core inode leaks on error paths · f324e4cb
由 David Woodhouse 提交于 6月 03, 2010
```
Pointed out by Al Viro.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
```
f324e4cb

xfs: improve xfs_isilocked · f9369729

由 Christoph Hellwig 提交于 6月 03, 2010

Use rwsem_is_locked to make the assertations for shared locks work.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Reviewed-by: NDave Chinner <dchinner@redhat.com>

f9369729

xfs: skip writeback from reclaim context · 070ecdca

由 Christoph Hellwig 提交于 6月 03, 2010

Allowing writeback from reclaim context causes massive problems with stack
overflows as we can call into the writeback code which tends to be a heavy
stack user both in the generic code and XFS from random contexts that
perform memory allocations.

Follow the example of btrfs (and in slightly different form ext4) and refuse
to write out data from reclaim context.  This issue should really be handled
by the VM so that we can tune better for this case, but until we get it
sorted out there we have to hack around this in each filesystem with a
complex writeback path.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Reviewed-by: NDave Chinner <dchinner@redhat.com>

070ecdca

xfs: fix race in inode cluster freeing failing to stale inodes · 5b257b4a

由 Dave Chinner 提交于 6月 03, 2010

When an inode cluster is freed, it needs to mark all inodes in memory as
XFS_ISTALE before marking the buffer as stale. This is eeded because the inodes
have a different life cycle to the buffer, and once the buffer is torn down
during transaction completion, we must ensure none of the inodes get written
back (which is what XFS_ISTALE does).

Unfortunately, xfs_ifree_cluster() has some bugs that lead to inodes not being
marked with XFS_ISTALE. This shows up when xfs_iflush() is called on these
inodes either during inode reclaim or tail pushing on the AIL. The buffer is
read back, but no longer contains inodes and so triggers assert failures and
shutdowns. This was reproducable with at run.dbench10 invocation from xfstests.

There are two main causes of xfs_ifree_cluster() failing. The first is simple -
it checks in-memory inodes it finds in the per-ag icache to see if they are
clean without holding the flush lock. if they are clean it skips them
completely. However, If an inode is flushed delwri, it will
appear clean, but is not guaranteed to be written back until the flush lock has
been dropped. Hence we may have raced on the clean check and the inode may
actually be dirty. Hence always mark inodes found in memory stale before we
check properly if they are clean.

The second is more complex, and makes the first problem easier to hit.
Basically the in-memory inode scan is done with full knowledge it can be racing
with inode flushing and AIl tail pushing, which means that inodes that it can't
get the flush lock on might not be attached to the buffer after then in-memory
inode scan due to IO completion occurring. This is actually documented in the
code as "needs better interlocking". i.e. this is a zero-day bug.

Effectively, the in-memory scan must be done while the inode buffer is locked
and Io cannot be issued on it while we do the in-memory inode scan. This
ensures that inodes we couldn't get the flush lock on are guaranteed to be
attached to the cluster buffer, so we can then catch all in-memory inodes and
mark them stale.

Now that the inode cluster buffer is locked before the in-memory scan is done,
there is no need for the two-phase update of the in-memory inodes, so simplify
the code into two loops and remove the allocation of the temporary buffer used
to hold locked inodes across the phases.
Signed-off-by: NDave Chinner <dchinner@redhat.com>
Reviewed-by: NChristoph Hellwig <hch@lst.de>

5b257b4a

ext4: Make sure the MOVE_EXT ioctl can't overwrite append-only files · 1f5a81e4

由 Theodore Ts'o 提交于 6月 02, 2010

Dan Roseberg has reported a problem with the MOVE_EXT ioctl.  If the
donor file is an append-only file, we should not allow the operation
to proceed, lest we end up overwriting the contents of an append-only
file.
Signed-off-by: N"Theodore Ts'o" <tytso@mit.edu>
Cc: Dan Rosenberg <dan.j.rosenberg@gmail.com>

1f5a81e4

02 6月, 2010 4 次提交

nfsd: nfsd_setattr needs to call commit_metadata · b160fdab

由 Christoph Hellwig 提交于 6月 01, 2010

The conversion of write_inode_now calls to commit_metadata in commit
f501912a missed out the call in nfsd_setattr.

But without this conversion we can't guarantee that a SETATTR request
has actually been commited to disk with XFS, which causes a regression
from 2.6.32 (only for NFSv2, but anyway).
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Cc: stable@kernel.org
Signed-off-by: NJ. Bruce Fields <bfields@citi.umich.edu>

b160fdab

FS-Cache: Remove unneeded null checks · 08a66859

由 Dan Carpenter 提交于 6月 01, 2010

fscache_write_op() makes unnecessary checks of the page variable to see if it
is NULL.  It can't be NULL at those points as the kernel would already have
crashed a little higher up where we examined page->index.

Furthermore, unless radix_tree_gang_lookup_tag() can return 1 but no page, a
NULL pointer crash should not be encountered there as we can only get there if
r_t_g_l_t() returned 1.
Signed-off-by: NDan Carpenter <error27@gmail.com>
Signed-off-by: NDavid Howells <dhowells@redhat.com>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

08a66859

cifs: fix page refcount leak · 06b43672

由 Jeff Layton 提交于 6月 01, 2010

Commit 315e995c is causing OOM kills
when stress-testing a CIFS filesystem. The VFS readpages operation takes
a page reference. The older code just handed this reference off to the
page cache, but the new code takes an extra one. The simplest fix is to
put the new reference after add_to_page_cache_lru.
Signed-off-by: NJeff Layton <jlayton@redhat.com>
Acked-by: NNick Piggin <npiggin@suse.de>
Signed-off-by: NSteve French <sfrench@us.ibm.com>

06b43672

AFS: Fix possible null pointer dereference in afs_alloc_server() · 037776fc

由 Denis Kirjanov 提交于 6月 01, 2010

Fix a possible null pointer dereference in afs_alloc_server(): the server
pointer is NULL if there was an allocation failure, and under such a
condition, we can't dereference it in the _leave() statement.
Signed-off-by: NDenis Kirjanov <dkirjanov@kernel.org>
Signed-off-by: NDavid Howells <dhowells@redhat.com>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

037776fc

01 6月, 2010 3 次提交

binfmt_elf_fdpic: Fix clear_user() error handling · e30c7c3b

由 Takuya Yoshikawa 提交于 6月 01, 2010

clear_user() returns the number of bytes that could not be copied rather than
an error code.  So we should return -EFAULT rather than directly returning the
results.

Without this patch, positive values may be returned to elf_fdpic_map_file()
and the following error handlings do not function as expected.

1.
	ret = elf_fdpic_map_file_constdisp_on_uclinux(params, file, mm);
	if (ret < 0)
		return ret;
2.
	ret = elf_fdpic_map_file_by_direct_mmap(params, file, mm);
	if (ret < 0)
		return ret;
Signed-off-by: NTakuya Yoshikawa <yoshikawa.takuya@oss.ntt.co.jp>
Signed-off-by: NDavid Howells <dhowells@redhat.com>
Acked-by: NMike Frysinger <vapier@gentoo.org>
CC: Alexander Viro <viro@zeniv.linux.org.uk>
CC: Andrew Morton <akpm@linux-foundation.org>
CC: Daisuke HATAYAMA <d.hatayama@jp.fujitsu.com>
CC: Paul Mundt <lethal@linux-sh.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

e30c7c3b

Revert "writeback: fix WB_SYNC_NONE writeback from umount" · 0e3c9a22

由 Jens Axboe 提交于 6月 01, 2010

This reverts commit e913fc82.

We are investigating a hang associated with the WB_SYNC_NONE changes,
so revert them for now.

Conflicts:

	fs/fs-writeback.c
	mm/page-writeback.c
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

0e3c9a22

Revert "writeback: ensure that WB_SYNC_NONE writeback with sb pinned is sync" · f17625b3

由 Jens Axboe 提交于 6月 01, 2010

This reverts commit 7c8a3554.

We are investigating a hang associated with the WB_SYNC_NONE changes,
so revert them for now.
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

f17625b3

31 5月, 2010 1 次提交

nilfs2: remove obsolete declarations of cache constructor and destructor · c29684d6

由 Ryusuke Konishi 提交于 5月 23, 2010

The commit 41c88bd7 ("nilfs2: cleanup multi
kmem_cache_{create,destroy} code") consolidated slab constructors and
destructors used in nilfs, but it left some declarations in header
files.

This gets rid of the obsolete declarations.
Signed-off-by: NRyusuke Konishi <konishi.ryusuke@lab.ntt.co.jp>

c29684d6

OpenHarmony / kernel_linux 上一次同步 大约 4 年

OpenHarmony / kernel_linux
上一次同步大约 4 年