提交 · 596276c3571e2108f4b336be545ece2eacf3da59 · openanolis / cloud-kernel

17 12月, 2010 1 次提交

hfsplus: fix an artifact in ioctl flag checking · 596276c3

由 Anton Salikhmetov 提交于 12月 16, 2010

Fix a flag checking artifact in hfsplus_ioctl_getflags() routine
found while doing clean-up against assignments inside `if's.
Signed-off-by: NAnton Salikhmetov <alexo@tuxera.com>
Signed-off-by: NChristoph Hellwig <hch@tuxera.com>

596276c3

23 11月, 2010 11 次提交

hfsplus: flush disk caches in sync and fsync · 34a2d313

由 Christoph Hellwig 提交于 11月 23, 2010

Flush the disk cache in fsync and sync to make sure data actually is
on disk on completion of these system calls.  There is a nobarrier
mount option to disable this behaviour.  It's slightly misnamed now
that barrier actually are gone, but it matches the name used by all
major filesystems.
Signed-off-by: NChristoph Hellwig <hch@tuxera.com>

34a2d313

hfsplus: optimize fsync · e3494705

由 Christoph Hellwig 提交于 11月 23, 2010

Avoid doing unessecary work in fsync.  Do nothing unless the inode
was marked dirty, and only write the various metadata inodes out if
they contain any dirty state from this inode.  This is archived by
adding three new dirty bits to the hfsplus-specific inode which are
set in the correct places.
Signed-off-by: NChristoph Hellwig <hch@tuxera.com>

e3494705

hfsplus: split up inode flags · b33b7921

由 Christoph Hellwig 提交于 11月 23, 2010

Split the flags field in the hfsplus inode into an extent_state
flag that is locked by the extent_lock, and a new flags field
that uses atomic bitops.  The second will grow more flags in the
next patch.
Signed-off-by: NChristoph Hellwig <hch@tuxera.com>

b33b7921

hfsplus: write up fsync for directories · eb29d66d

由 Christoph Hellwig 提交于 11月 23, 2010

fsync is supposed to not just work on regular files, but also on
directories.  Fortunately enough hfsplus_file_fsync works just fine
for directories, so we can just wire it up.
Signed-off-by: NChristoph Hellwig <hch@tuxera.com>

eb29d66d

hfsplus: simplify fsync · 28146976

由 Christoph Hellwig 提交于 11月 23, 2010

Remove lots of code we don't need from fsync, we just need to call
->write_inode on the inode if it's dirty, for which sync_inode_metadata
is a lot more efficient than write_inode_now, and we need to write
out the various metadata inodes, which we now do explicitly instead
of by calling ->sync_fs.
Signed-off-by: NChristoph Hellwig <hch@tuxera.com>

28146976

hfsplus: avoid useless work in hfsplus_sync_fs · f02e26f8

由 Christoph Hellwig 提交于 11月 23, 2010

There is no reason to write out the metadata inodes or volume headers
during a non-blocking sync, as we are almost guaranteed to dirty them
again during the inode writeouts.
Signed-off-by: NChristoph Hellwig <hch@tuxera.com>

f02e26f8

hfsplus: make sure sync writes out all metadata · 7dc4f001

由 Christoph Hellwig 提交于 11月 23, 2010

hfsplus stores all metadata except for the volume headers in special
inodes.  While these are marked hashed and periodically written out
by the flusher threads, we can't rely on that for sync.  For the case
of a data integrity sync the VM has life-lock avoidance code that
avoids writing inodes again that are redirtied during the sync,
which is something that can happen easily for hfsplus.  So make sure
we explicitly write out the metadata inodes at the beginning of
hfsplus_sync_fs.
Signed-off-by: NChristoph Hellwig <hch@tuxera.com>

7dc4f001

hfsplus: use raw bio access for partition tables · 358f26d5

由 Christoph Hellwig 提交于 11月 23, 2010

Switch the hfsplus partition table reding for cdroms to use our bio
helpers.  Again we don't rely on any caching in the buffer_heads, and
this gets rid of the last buffer_head use in hfsplus.
Signed-off-by: NChristoph Hellwig <hch@tuxera.com>

358f26d5

hfsplus: use raw bio access for the volume headers · 52399b17

由 Christoph Hellwig 提交于 11月 23, 2010

The hfsplus backup volume header is located two blocks from the end of
the device.  In case of device sizes that are not 4k aligned this means
we can't access it using buffer_heads when using the default 4k block
size.

Switch to using raw bios to read/write all buffer headers.  We were not
relying on any caching behaviour of the buffer heads anyway.  Additionally
always read in the backup volume header during mount to verify that we
can actually read it.
Signed-off-by: NChristoph Hellwig <hch@tuxera.com>

52399b17

hfsplus: always use hfsplus_sync_fs to write the volume header · 3b5ce8ae

由 Christoph Hellwig 提交于 11月 23, 2010

Remove opencoded writing of the volume header in hfsplus_fill_super
and hfsplus_put_super and offload it to hfsplus_sync_fs.  In the
put_super case this means we only write the superblock once instead
of twice.
Signed-off-by: NChristoph Hellwig <hch@tuxera.com>

3b5ce8ae

hfsplus: silence a few debug printks · 6d1bbfc4

由 Christoph Hellwig 提交于 11月 23, 2010

Turn a few noisy debug printks that show up during xfstests into
complied out debug print statements.
Signed-off-by: NChristoph Hellwig <hch@tuxera.com>

6d1bbfc4

08 11月, 2010 1 次提交

hfsplus: fix option parsing during remount · 6f80dfe5

由 Christoph Hellwig 提交于 11月 07, 2010

hfsplus only actually uses the force option during remount, but it uses
the full option parser with a fake superblock to do so.  This means remount
will fail if any nls option is set (which happens frequently with older
mount tools), even if it is the same.

Fix this by adding a simpler version of the parser that only parses the force
option for remount.
Signed-off-by: NChristoph Hellwig <hch@tuxera.com>

6f80dfe5

02 11月, 2010 2 次提交

ext4: Remove useless spinlock in ext4_getattr() · eb8abb92

由 Theodore Ts'o 提交于 11月 02, 2010

Linus noted, and complained to me, that doing while lots of "git diff"'s
of kernel sources, these spinlocks were responsible for 27% of the
spinlock cost on his two-processor system as reported by perf.

Git was doing lots of parallel stats, and this was putting a lot of
pressure on ext4_getattr().  A spinlock to protect a single
memory-to-memory copy is pointless, so remove it.
Signed-off-by: N"Theodore Ts'o" <tytso@mit.edu>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

eb8abb92

fs: logfs: Fix up MTD=y build. · e99d11d1

由 Paul Mundt 提交于 11月 02, 2010

Commit 7d945a3a ("logfs get_sb, part 3") broke the logfs build when
CONFIG_MTD is set due to a mangled logfs_get_sb_mtd() definition.
Signed-off-by: NPaul Mundt <lethal@linux-sh.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

e99d11d1

31 10月, 2010 7 次提交

locks: remove fl_copy_lock lock_manager operation · bb8430a2

由 Christoph Hellwig 提交于 10月 31, 2010

This one was only used for a nasty hack in nfsd, which has recently
been removed.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

bb8430a2

locks: let the caller free file_lock on ->setlease failure · 51ee4b84

由 Christoph Hellwig 提交于 10月 31, 2010

The caller allocated it, the caller should free it.

The only issue so far is that we could change the flp pointer even on an
error return if the fl_change callback failed. But we can simply move
the flp assignment after the fl_change invocation, as the callers don't
care about the flp return value if the setlease call failed.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

51ee4b84

nfsd4: initialize delegation pointer to lease · fcf744a9

由 J. Bruce Fields 提交于 10月 30, 2010

The NFSv4 server was initializing the dp->dl_flock pointer by the
somewhat ridiculous method of a locks_copy_lock callback.

Now that setlease uses the passed-in lock instead of doing a copy,
dl_flock no longer gets set, resulting in the lock leaking on delegation
release, and later possible hangs (among other problems).

So, initialize dl_flock and get rid of the callback.
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>
Acked-by: NArnd Bergmann <arnd@arndb.de>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

fcf744a9

locks: fix setlease methods to free passed-in lock · 05fa3135

由 J. Bruce Fields 提交于 10月 30, 2010

We modified setlease to require the caller to allocate the new lease in
the case of creating a new lease, but forgot to fix up the filesystem
methods.

Cc: Steven Whitehouse <swhiteho@redhat.com>
Cc: Steve French <sfrench@samba.org>
Cc: Trond Myklebust <Trond.Myklebust@netapp.com>
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>
Acked-by: NArnd Bergmann <arnd@arndb.de>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

05fa3135

locks: fix leaks on setlease errors · 096657b6

由 J. Bruce Fields 提交于 10月 30, 2010

We're depending on setlease to free the passed-in lease on failure.
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>
Acked-by: NArnd Bergmann <arnd@arndb.de>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

096657b6

locks: prevent ENOMEM on lease unlock · 0ceaf6c7

由 J. Bruce Fields 提交于 10月 30, 2010

Removing a lock shouldn't require any allocations; a failure due to
ENOMEM leaves the caller with a choice between retrying or giving up and
leaking an unused lease.

Next we should split the other lease calls into add and delete cases.
I wanted to start with just the bugfix.
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>
Acked-by: NArnd Bergmann <arnd@arndb.de>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

0ceaf6c7

make fanotify_read() restartable across signals · 1a5cea72

由 Lino Sanfilippo 提交于 10月 29, 2010

    In fanotify_read() return -ERESTARTSYS instead of -EINTR to
    make read() restartable across signals (BSD semantic).
Signed-off-by: NEric Paris <eparis@redhat.com>

1a5cea72

30 10月, 2010 18 次提交

fs-writeback.c: unify some common code · cdf01dd5

由 Linus Torvalds 提交于 10月 30, 2010

The btrfs merge looks like hell, because it changes fs-writeback.c, and
the crazy code has this repeated "estimate number of dirty pages"
counting that involves three different helper functions.  And it's done
in two different places.

Just unify that whole calculation as a "get_nr_dirty_pages()" helper
function, and the merge result will look half-way decent.
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

cdf01dd5

fs/compat.c: fix build on MIPS/s390 · 504b701b

由 wu zhangjin 提交于 10月 30, 2010

The definition of PAGE_CACHE_MASK in <linux/pagemap.h> is needed to use
MAX_RW_COUNT, and on x86-64 that gets done indirectly through the
architecture header includes. But on MIPS and s390 that doesn't happen,
and we need to make sure that fs/compat.c includes pagemap.h explicitly.

Introduced in commit 435f49a5 ("readv/writev: do the same
MAX_RW_COUNT truncation that read/write does").

Reported-by: Sachin Sant <sachinp@in.ibm.com> (S390)
Reported-by: wu zhangjin <wuzhangjin@gmail.com> (MIPS)
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

504b701b

Btrfs: deal with errors from updating the tree log · 6418c961

由 Chris Mason 提交于 10月 30, 2010

During unlink we remove any references to the inode from
the tree log.  It can return -ENOENT and other errors,
and this changes the unlink code to deal with it.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

6418c961

jbd2: Convert jbd2_slab_create_sem to mutex · 51dfacde

由 Thomas Gleixner 提交于 10月 16, 2010

jbd2_slab_create_sem is used as a mutex, so make it one.

[ akpm muttered: We may as well make it local to
jbd2_journal_create_slab() also. ]
Signed-off-by: NThomas Gleixner <tglx@linutronix.de>
Cc: Ted Ts'o <tytso@mit.edu>
Cc: Andrew Morton <akpm@linux-foundation.org>
LKML-Reference: <alpine.LFD.2.00.1010162231480.2496@localhost6.localdomain6>
Signed-off-by: NThomas Gleixner <tglx@linutronix.de>

51dfacde

hpfs: Convert sbi->hpfs_creation_de to mutex · 117bf5fb

由 Thomas Gleixner 提交于 9月 07, 2010

sbi->hpfs_creation_de is used as mutex so make it a mutex.
Signed-off-by: NThomas Gleixner <tglx@linutronix.de>
Acked-by: NChristoph Hellwig <hch@infradead.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Al Viro <viro@zeniv.linux.org.uk>
Cc: linux-fsdevel@vger.kernel.org
LKML-Reference: <20100907125056.228874895@linutronix.de>

117bf5fb

Btrfs: allow subvol deletion by unprivileged user with -o user_subvol_rm_allowed · 4260f7c7

由 Sage Weil 提交于 10月 29, 2010

Add a mount option user_subvol_rm_allowed that allows users to delete a
(potentially non-empty!) subvol when they would otherwise we allowed to do
an rmdir(2).  We duplicate the may_delete() checks from the core VFS code
to implement identical security checks (minus the directory size check).
We additionally require that the user has write+exec permission on the
subvol root inode.
Signed-off-by: NSage Weil <sage@newdream.net>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

4260f7c7

Btrfs: make SNAP_DESTROY async · 531cb13f

由 Sage Weil 提交于 10月 29, 2010

There is no reason to force an immediate commit when deleting a snapshot.
Users have some expectation that space from a deleted snapshot be freed
immediately, but even if we do commit the reclaim is a background process.

If users _do_ want the deletion to be durable, they can call 'sync'.
Signed-off-by: NSage Weil <sage@newdream.net>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

531cb13f

Btrfs: add SNAP_CREATE_ASYNC ioctl · 72fd032e

由 Sage Weil 提交于 10月 29, 2010

Create a snap without waiting for it to commit to disk. The ioctl is
ordered such that subsequent operations will not be contained by the
created snapshot, and the commit is initiated, but the ioctl does not
wait for the snapshot to commit to disk.

We return the specific transid to userspace so that an application can wait
for this specific snapshot creation to commit via the WAIT_SYNC ioctl.
Signed-off-by: NSage Weil <sage@newdream.net>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

72fd032e

Btrfs: add START_SYNC, WAIT_SYNC ioctls · 46204592

由 Sage Weil 提交于 10月 29, 2010

START_SYNC will start a sync/commit, but not wait for it to
complete.  Any modification started after the ioctl returns is
guaranteed not to be included in the commit.  If a non-NULL
pointer is passed, the transaction id will be returned to
userspace.

WAIT_SYNC will wait for any in-progress commit to complete.  If a
transaction id is specified, the ioctl will block and then
return (success) when the specified transaction has committed.
If it has already committed when we call the ioctl, it returns
immediately.  If the specified transaction doesn't exist, it
returns EINVAL.

If no transaction id is specified, WAIT_SYNC will wait for the
currently committing transaction to finish it's commit to disk.
If there is no currently committing transaction, it returns
success.

These ioctls are useful for applications which want to impose an
ordering on when fs modifications reach disk, but do not want to
wait for the full (slow) commit process to do so.

Picky callers can take the transid returned by START_SYNC and
feed it to WAIT_SYNC, and be certain to wait only as long as
necessary for the transaction _they_ started to reach disk.

Sloppy callers can START_SYNC and WAIT_SYNC without a transid,
and provided they didn't wait too long between the calls, they
will get the same result.  However, if a second commit starts
before they call WAIT_SYNC, they may end up waiting longer for
it to commit as well.  Even so, a START_SYNC+WAIT_SYNC still
guarantees that any operation completed before the START_SYNC
reaches disk.
Signed-off-by: NSage Weil <sage@newdream.net>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

46204592

Btrfs: async transaction commit · bb9c12c9

由 Sage Weil 提交于 10月 29, 2010

Add support for an async transaction commit that is ordered such that any
subsequent operations will join the following transaction, but does not
wait until the current commit is fully on disk. This avoids much of the
latency associated with the btrfs_commit_transaction for callers concerned
with serialization and not safety.

The wait_for_unblock flag controls whether we wait for the 'middle' portion
of commit_transaction to complete, which is necessary if the caller expects
some of the modifications contained in the commit to be available (this is
the case for subvol/snapshot creation).
Signed-off-by: NSage Weil <sage@newdream.net>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

bb9c12c9

Btrfs: fix deadlock in btrfs_commit_transaction · 99d16cbc

由 Sage Weil 提交于 10月 29, 2010

We calculate timeout (either 1 or MAX_SCHEDULE_TIMEOUT) based on whether
num_writers > 1 or should_grow at the top of the loop.  Then, much much
later, we wait for that timeout if either num_writers or should_grow is
true.  However, it's possible for a racing process (calling
btrfs_end_transaction()) to decrement num_writers such that we wait
forever instead of for 1.

Fix this by deciding how long to wait when we wait.  Include a smp_mb()
before checking if the waitqueue is active to ensure the num_writers
is visible.
Signed-off-by: NSage Weil <sage@newdream.net>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

99d16cbc

Btrfs: fix lockdep warning on clone ioctl · fccdae43

由 Sage Weil 提交于 10月 29, 2010

I'm no lockdep expert, but this appears to make the lockdep warning go
away for the i_mutex locking in the clone ioctl.
Signed-off-by: NSage Weil <sage@newdream.net>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

fccdae43

Btrfs: fix clone ioctl where range is adjacent to extent · 050006a7

由 Sage Weil 提交于 10月 29, 2010

We had an edge case issue where the requested range was just
following an existing extent. Instead of skipping to the next
extent, we used the previous one which lead to having zero
sized extents.
Signed-off-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

050006a7

Btrfs: fix delalloc checks in clone ioctl · 9a019196

由 Sage Weil 提交于 10月 29, 2010

The lookup_first_ordered_extent() was done on the wrong inode, and the
->delalloc_bytes test was wrong, as the following
btrfs_wait_ordered_range() would only invoke a range write and wouldn't
write the entire file data range. Also, a bad parameter was passed to
btrfs_wait_ordered_range().
Signed-off-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

9a019196

C
Btrfs: drop unused variable in block_alloc_rsv · d8e39c45
由 Chris Mason 提交于 10月 29, 2010
```
The alloc_target variable is not really used.
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
d8e39c45

Btrfs: cleanup warnings from gcc 4.6 (nonbugs) · 559af821

由 Andi Kleen 提交于 10月 29, 2010

These are all the cases where a variable is set, but not read which are
not bugs as far as I can see, but simply leftovers.

Still needs more review.

Found by gcc 4.6's new warnings
Signed-off-by: NAndi Kleen <ak@linux.intel.com>
Cc: Chris Mason <chris.mason@oracle.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

559af821

Btrfs: Fix variables set but not read (bugs found by gcc 4.6) · 411fc6bc

由 Andi Kleen 提交于 10月 29, 2010

These are all the cases where a variable is set, but not
read which are really bugs.

- Couple of incorrect error handling fixed.
- One incorrect use of a allocation policy
- Some other things

Still needs more review.

Found by gcc 4.6's new warnings.

[akpm@linux-foundation.org: fix build.  Might have been bitrot]
Signed-off-by: NAndi Kleen <ak@linux.intel.com>
Cc: Chris Mason <chris.mason@oracle.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

411fc6bc

Btrfs: Use ERR_CAST helpers · d0b678cb

由 Julia Lawall 提交于 10月 29, 2010

Use ERR_CAST(x) rather than ERR_PTR(PTR_ERR(x)).  The former makes more
clear what is the purpose of the operation, which otherwise looks like a
no-op.

The semantic patch that makes this change is as follows:
(http://coccinelle.lip6.fr/)

// <smpl>
@@
type T;
T x;
identifier f;
@@

T f (...) { <+...
- ERR_PTR(PTR_ERR(x))
+ x
 ...+> }

@@
expression x;
@@

- ERR_PTR(PTR_ERR(x))
+ ERR_CAST(x)
// </smpl>
Signed-off-by: NJulia Lawall <julia@diku.dk>
Cc: Chris Mason <chris.mason@oracle.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

d0b678cb

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功