提交 · b1e4adf4ea41bb8b5a7bfc1a7001f137e65495df · openeuler / raspberrypi-kernel

20 3月, 2009 2 次提交

NFS: Fix the notifications when renaming onto an existing file · b1e4adf4

由 Trond Myklebust 提交于 3月 19, 2009

NFS appears to be returning an unnecessary "delete" notification when
we're doing an atomic rename. See

  http://bugzilla.gnome.org/show_bug.cgi?id=575684

The fix is to get rid of the redundant call to d_delete().
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

b1e4adf4

NFS: Fix up a mismerged patch · 47c62564

由 Trond Myklebust 提交于 3月 16, 2009

Move the definition of nfs_need_commit() into the #ifdef CONFIG_NFS_V3
section as originally intended in the patch "NFS: cleanup - remove
struct nfs_inode->ncommit"
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

47c62564

12 3月, 2009 14 次提交

NFS: load the rpc/rdma transport module automatically · a67d18f8

由 Tom Talpey 提交于 3月 11, 2009

When mounting an NFS/RDMA server with the "-o proto=rdma" or
"-o rdma" options, attempt to dynamically load the necessary
"xprtrdma" client transport module. Doing so improves usability,
while avoiding a static module dependency and any unnecesary
resources.
Signed-off-by: NTom Talpey <tmtalpey@gmail.com>
Cc: Chuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

a67d18f8

NFS: Kill the "defined but not used" compile error on nommu machines · e1ebfd33

由 Trond Myklebust 提交于 3月 11, 2009

Bryan Wu reports that when compiling NFS on nommu machines he gets a
"defined but not used" error on nfs_file_mmap().

The easiest fix is simply to get rid of the special casing in NFS, and
just always call generic_file_mmap() to set up the file.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

e1ebfd33

NFS: Throttle page dirtying while we're flushing to disk · 72cb77f4

由 Trond Myklebust 提交于 3月 11, 2009

The following patch is a combination of a patch by myself and Peter
Staubach.

Trond: If we allow other processes to dirty pages while a process is doing
a consistency sync to disk, we can end up never making progress.

Peter: Attached is a patch which addresses a continuing problem with
the NFS client generating out of order WRITE requests.  While
this is compliant with all of the current protocol
specifications, there are servers in the market which can not
handle out of order WRITE requests very well.  Also, this may
lead to sub-optimal block allocations in the underlying file
system on the server.  This may cause the read throughputs to
be reduced when reading the file from the server.

Peter: There has been a lot of work recently done to address out of
order issues on a systemic level.  However, the NFS client is
still susceptible to the problem.  Out of order WRITE
requests can occur when pdflush is in the middle of writing
out pages while the process dirtying the pages calls
generic_file_buffered_write which calls
generic_perform_write which calls
balance_dirty_pages_rate_limited which ends up calling
writeback_inodes which ends up calling back into the NFS
client to writes out dirty pages for the same file that
pdflush happens to be working with.
Signed-off-by: NPeter Staubach <staubach@redhat.com>
[modification by Trond to merge the two similar patches]
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

72cb77f4

T
NFS: cleanup - remove struct nfs_inode->ncommit · fb8a1f11
由 Trond Myklebust 提交于 3月 11, 2009
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
fb8a1f11

NFSv4: Simplify some cache consistency post-op GETATTRs · a65318bf

由 Trond Myklebust 提交于 3月 11, 2009

Certain asynchronous operations such as write() do not expect
(or care) that other metadata such as the file owner, mode, acls, ...
change. All they want to do is update and/or check the change attribute,
ctime, and mtime.
By skipping the file owner and group update, we also avoid having to do a
potential idmapper upcall for these asynchronous RPC calls.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

a65318bf

NFSv4: A referral is assumed to always point to a directory. · 69aaaae1

由 Trond Myklebust 提交于 3月 11, 2009

Fix a bug whereby we would fail to create a mount point for a referral.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

69aaaae1

T
NFSv4: Make decode_getfattr() set fattr->valid to reflect what was decoded · 409924e4
由 Trond Myklebust 提交于 3月 11, 2009
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
409924e4
T
NFSv4: Clean up decode_getfattr() · f26c7a78
由 Trond Myklebust 提交于 3月 11, 2009
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
f26c7a78

NFS: Fix the type of struct nfs_fattr->mode · bca79478

由 Trond Myklebust 提交于 3月 11, 2009

There is no point in using anything other than umode_t, since we copy the
content pretty much directly into inode->i_mode.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

bca79478

NFS: Shrink the struct nfs_fattr · 1ca277d8

由 Trond Myklebust 提交于 3月 11, 2009

We don't need the bitmap[] field anymore, since the 'valid' field tells us
all we need to know about which attributes were filled in...
Also move the pre-op attributes in order to improve the structure packing.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

1ca277d8

NFSv4: Support NFSv4 optional attributes in the struct nfs_fattr · 9e6e70f8

由 Trond Myklebust 提交于 3月 11, 2009

Currently, filling struct nfs_fattr is more or less an all or nothing
operation, since NFSv2 and NFSv3 have only mandatory attributes.
In NFSv4, some attributes are optional, and so we may simply not be able to
fill in those fields. Furthermore, NFSv4 allows you to specify which
attributes you are interested in retrieving, thus permitting you to
optimise away retrieval of attributes that you know will no change...
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

9e6e70f8

NFSv4: Ignore errors on the post-op attributes in SETATTR calls · 78f945f8

由 Trond Myklebust 提交于 3月 11, 2009

There is no need to fail or retry a SETATTR call just because the post-op
GETATTR failed.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

78f945f8

NFS: flush cached directory information slightly more readily. · 37d9d76d

由 NeilBrown 提交于 3月 11, 2009

If cached directory contents becomes incorrect, there is no way to
flush the contents.  This contrasts with files where file locking is
the recommended way to ensure cache consistency between multiple
applications (a read-lock always flushes the cache).

Also while changes to files often change the size of the file (thus
triggering a cache flush), changes to directories often do not change
the apparent size (as the size is often rounded to a block size).

So it is particularly important with directories to avoid the
possibility of an incorrect cache wherever possible.

When the link count on a directory changes it implies a change in the
number of child directories, and so a change in the contents of this
directory.  So use that as a trigger to flush cached contents.

When the ctime changes but the mtime does not, there are two possible
reasons.
 1/ The owner/mode information has been changed.
 2/ utimes has been used to set the mtime backwards.

In the first case, a data-cache flush is not required.
In the second case it is.

So on the basis that correctness trumps performance, flush the
directory contents cache in this case also.
Signed-off-by: NNeilBrown <neilb@suse.de>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

37d9d76d

NFS: Minor __nfs_revalidate_inode cleanup · 2b57dc6c

由 Suresh Jayaraman 提交于 3月 11, 2009

Remove redundant NFS_STALE() check, a leftover due to the commit
691beb13Signed-off-by: NSuresh Jayaraman <sjayaraman@suse.de>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

2b57dc6c

11 3月, 2009 6 次提交

Bug 11061, NFS mounts dropped · d7371c41

由 Ian Dall 提交于 3月 10, 2009

Addresses: http://bugzilla.kernel.org/show_bug.cgi?id=11061

sockaddr structures can't be reliably compared using memcmp() because
there are padding bytes in the structure which can't be guaranteed to
be the same even when the sockaddr structures refer to the same
socket. Instead compare all the relevant fields. In the case of IPv6
sin6_flowinfo is not compared because it only affects QoS and
sin6_scope_id is only compared if the address is "link local" because
"link local" addresses need only be unique to a specific link.
Signed-off-by: NIan Dall <ian@beware.dropbear.id.au>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

d7371c41

NFS: Handle -ESTALE error in access() · a71ee337

由 Suresh Jayaraman 提交于 3月 10, 2009

Hi Trond,

I have been looking at a bugreport where trying to open applications on KDE
on a NFS mounted home fails temporarily. There have been multiple reports on
different kernel versions pointing to this common issue:
http://bugzilla.kernel.org/show_bug.cgi?id=12557
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/269954
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=508866.html

This issue can be reproducible consistently by doing this on a NFS mounted
home (KDE):
1. Open 2 xterm sessions
2. From one of the xterm session, do "ssh -X <remote host>"
3. "stat ~/.Xauthority" on the remote SSH session
4. Close the two xterm sessions
5. On the server do a "stat ~/.Xauthority"
6. Now on the client, try to open xterm
This will fail.

Even if the filehandle had become stale, the NFS client should invalidate
the cache/inode and should repeat LOOKUP. Looking at the packet capture when
the failure occurs shows that there were two subsequent ACCESS() calls with
the same filehandle and both fails with -ESTALE error.

I have tested the fix below. Now the client issue a LOOKUP after the
ACCESS() call fails with -ESTALE. If all this makes sense to you, can you
consider this for inclusion?

Thanks,


If the server returns an -ESTALE error due to stale filehandle in response to
an ACCESS() call, we need to invalidate the cache and inode so that LOOKUP()
can be retried. Without this change, the nfs client retries ACCESS() with the
same filehandle, fails again and could lead to temporary failure of
applications running on nfs mounted home.
Signed-off-by: NSuresh Jayaraman <sjayaraman@suse.de>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

a71ee337

NLM: Fix GRANT callback address comparison when IPv6 is enabled · 57df675c

由 Chuck Lever 提交于 3月 10, 2009

The NFS mount command may pass an AF_INET server address to lockd.  If
lockd happens to be using a PF_INET6 listener, the nlm_cmp_addr() in
nlmclnt_grant() will fail to match requests from that host because they
will all have a mapped IPv4 AF_INET6 address.

Adopt the same solution used in nfs_sockaddr_match_ipaddr() for NFSv4
callbacks: if either address is AF_INET, map it to an AF_INET6 address
before doing the comparison.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

57df675c

NFSv3: Fix posix ACL code · ae46141f

由 Trond Myklebust 提交于 3月 10, 2009

Fix a memory leak due to allocation in the XDR layer. In cases where the
RPC call needs to be retransmitted, we end up allocating new pages without
clearing the old ones. Fix this by moving the allocation into
nfs3_proc_setacls().

Also fix an issue discovered by Kevin Rudd, whereby the amount of memory
reserved for the acls in the xdr_buf->head was miscalculated, and causing
corruption.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

ae46141f

NFS: Fix misparsing of nfsv4 fs_locations attribute (take 2) · ef95d31e

由 Trond Myklebust 提交于 3月 10, 2009

The changeset ea31a443 (nfs: Fix
misparsing of nfsv4 fs_locations attribute) causes the mountpath that is
calculated at the beginning of try_location() to be clobbered when we
later strncpy a non-nul terminated hostname using an incorrect buffer
length.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

ef95d31e

devpts: remove graffiti · 260219cc

由 Alexey Dobriyan 提交于 3月 10, 2009

Very annoying when working with containters.
Signed-off-by: NAlexey Dobriyan <adobriyan@gmail.com>
Cc: Alan Cox <alan@lxorguk.ukuu.org.uk>
Cc: "H. Peter Anvin" <hpa@zytor.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

260219cc

09 3月, 2009 1 次提交

Btrfs: fix spinlock assertions on UP systems · b9447ef8

由 Chris Mason 提交于 3月 09, 2009

btrfs_tree_locked was being used to make sure a given extent_buffer was
properly locked in a few places.  But, it wasn't correct for UP compiled
kernels.

This switches it to using assert_spin_locked instead, and renames it to
btrfs_assert_tree_locked to better reflect how it was really being used.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

b9447ef8

05 3月, 2009 3 次提交

R
Squashfs: frag_size should be signed, as it can hold an error result · f4f8056a
由 Roel Kluin 提交于 3月 05, 2009
```
Signed-off-by: NRoel Kluin <roel.kluin@gmail.com>
Signed-off-by: NPhillip Lougher <phillip@lougher.demon.co.uk>
```
f4f8056a

Squashfs: Fix oops when reading fsfuzzer corrupted filesystems · 118e1ef6

由 Phillip Lougher 提交于 3月 05, 2009

This fixes a code regression caused by the recent mainlining changes.
The recent code changes call zlib_inflate repeatedly, decompressing into
separate 4K buffers, this code didn't check for the possibility that
zlib_inflate might ask for too many buffers when decompressing corrupted
data.
Signed-off-by: NPhillip Lougher <phillip@lougher.demon.co.uk>

118e1ef6

ext4: fix ext4_free_inode() vs. ext4_claim_inode() race · 7ce9d5d1

由 Eric Sandeen 提交于 3月 04, 2009

I was seeing fsck errors on inode bitmaps after a 4 thread
dbench run on a 4 cpu machine:

Inode bitmap differences: -50736 -(50752--50753) etc...

I believe that this is because ext4_free_inode() uses atomic
bitops, and although ext4_new_inode() *used* to also use atomic 
bitops for synchronization, commit 
39341867 changed this to use
the sb_bgl_lock, so that we could also synchronize against
read_inode_bitmap and initialization of uninit inode tables.

However, that change left ext4_free_inode using atomic bitops,
which I think leaves no synchronization between setting & 
unsetting bits in the inode table.

The below patch fixes it for me, although I wonder if we're 
getting at all heavy-handed with this spinlock...
Signed-off-by: NEric Sandeen <sandeen@redhat.com>
Reviewed-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: N"Theodore Ts'o" <tytso@mit.edu>

7ce9d5d1

28 2月, 2009 2 次提交

Fix FREEZE/THAW compat_ioctl regression · 5cf8cf41

由 Christoph Hellwig 提交于 2月 26, 2009

Commit 8e961870 removed the FREEZE/THAW
handling in xfs_compat_ioctl but never added any compat handler back, so
now any freeze/thaw request from a 32-bit binary ond 64-bit userspace
will fail.

As these ioctls are 32/64-bit compatible two simple COMPATIBLE_IOCTL
entries in fs/compat_ioctl.c will do the job.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

5cf8cf41

EXPORT_SYMBOL(d_obtain_alias) rather than EXPORT_SYMBOL_GPL · adc48720

由 Benny Halevy 提交于 2月 27, 2009

Commit 4ea3ada2 declares d_obtain_alias()
as EXPORT_SYMBOL_GPL where it's supposed to replace d_alloc_anon which was
previously declared as EXPORT_SYMBOL and thus available to any loadable
module.

This patch reverts that.
Signed-off-by: NBenny Halevy <bhalevy@panasas.com>
Acked-by: NLinus Torvalds <torvalds@linux-foundation.org>
Cc: Christoph Hellwig <hch@infradead.org>
Cc: "J. Bruce Fields" <bfields@fieldses.org>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Acked-by: NAl Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

adc48720

27 2月, 2009 9 次提交

ocfs2: add IO error check in ocfs2_get_sector() · 28d57d43

由 wengang wang 提交于 2月 13, 2009

Check for IO error in ocfs2_get_sector().
Signed-off-by: NWengang Wang <wen.gang.wang@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

28d57d43

ocfs2: set gap to seperate entry and value when xattr in bucket · 4442f518

由 Tiger Yang 提交于 2月 20, 2009

This patch set a gap (4 bytes) between xattr entry and
name/value when xattr in bucket. This gap use to seperate
entry and name/value when a bucket is full. It had already
been set when xattr in inode/block.
Signed-off-by: NTiger Yang <tiger.yang@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

4442f518

ocfs2: lock the metaecc process for xattr bucket · c8b9cf9a

由 Tao Ma 提交于 2月 24, 2009

For other metadata in ocfs2, metaecc is checked in ocfs2_read_blocks
with io_mutex held. While for xattr bucket, it is calculated by
the whole buckets. So we have to add a spin_lock to prevent multiple
processes calculating metaecc.
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Tested-by: NTristan Ye <tristan.ye@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

c8b9cf9a

ocfs2: Use the right access_* method in ctime update of xattr. · 89a907af

由 Tao Ma 提交于 2月 17, 2009

In ctime updating of xattr, it use the wrong type of access for
inode, so use ocfs2_journal_access_di instead.
Reported-and-Tested-by: NTristan Ye <tristan.ye@oracle.com>
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Acked-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

89a907af

ocfs2/dlm: Make dlm_assert_master_handler() kill itself instead of the asserter · 53ecd25e

由 Sunil Mushran 提交于 2月 03, 2009

In dlm_assert_master_handler(), if we get an incorrect assert master from a node
that, we reply with EINVAL asking the asserter to die. The problem is that an
assert is sent after so many hoops, it is invariably the node that thinks the
asserter is wrong, is actually wrong. So instead of killing the asserter, this
patch kills the assertee.

This patch papers over a race that is still being addressed.
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Acked-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

53ecd25e

ocfs2/dlm: Use ast_lock to protect ast_list · dabc47de

由 Sunil Mushran 提交于 2月 03, 2009

The code was using dlm->spinlock instead of dlm->ast_lock to protect the
ast_list. This patch fixes the issue.
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Acked-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

dabc47de

ocfs2: Cleanup the lockname print in dlmglue.c · c74ff8bb

由 Sunil Mushran 提交于 2月 03, 2009

The dentry lock has a different format than other locks. This patch fixes
ocfs2_log_dlm_error() macro to make it print the dentry lock correctly.
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Acked-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

c74ff8bb

ocfs2/dlm: Retract fix for race between purge and migrate · 7dc102b7

由 Sunil Mushran 提交于 2月 03, 2009

Mainline commit d4f7e650 attempts to delay
the dlm_thread from sending the drop ref message if the lockres is being
migrated. The problem is that we make the dlm_thread wait for the migration
to complete. This causes a deadlock as dlm_thread also participates in the
lockres migration process.

A better fix for the original oss bugzilla#1012 is in testing.
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Acked-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

7dc102b7

ocfs2: Access and dirty the buffer_head in mark_written. · 47be12e4

由 Tao Ma 提交于 1月 09, 2009

In __ocfs2_mark_extent_written, when we meet with the situation
of c_split_covers_rec, the old solution just replace the extent
record and forget to access and dirty the buffer_head. This will
cause a problem when the unwritten extent is in an extent block.
So access and dirty it.
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

47be12e4

26 2月, 2009 2 次提交

block: fix bogus gcc warning for uninitialized var usage · b2bf9683

由 Jens Axboe 提交于 2月 19, 2009

Newer gcc throw this warning:

        fs/bio.c: In function ?bio_alloc_bioset?:
        fs/bio.c:305: warning: ?p? may be used uninitialized in this function

since it cannot figure out that 'p' is only ever used if 'bs' is non-NULL.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

b2bf9683

ext4: don't call jbd2_journal_force_commit_nested without journal · 8f64b32e

由 Eric Sandeen 提交于 2月 26, 2009

Running without a journal, I oopsed when I ran out of space,
because we called jbd2_journal_force_commit_nested() from
ext4_should_retry_alloc() without a journal.

This should take care of it, I think.
Signed-off-by: NEric Sandeen <sandeen@redhat.com>
Signed-off-by: N"Theodore Ts'o" <tytso@mit.edu>

8f64b32e

28 2月, 2009 1 次提交

ext4: Reorder fs/Makefile so that ext2 root fs's are mounted using ext2 · d8ae4601

由 Theodore Ts'o 提交于 2月 28, 2009

In fs/Makefile, ext3 was placed before ext2 so that a root filesystem
that possessed a journal, it would be mounted as ext3 instead of ext2.
This was necessary because a cleanly unmounted ext3 filesystem was
fully backwards compatible with ext2, and could be mounted by ext2 ---
but it was desirable that it be mounted with ext3 so that the
journaling would be enabled.

The ext4 filesystem supports new incompatible features, so there is no
danger of an ext4 filesystem being mistaken for an ext2 filesystem.
At that point, the relative ordering of ext4 with respect to ext2
didn't matter until ext4 gained the ability to mount filesystems
without a journal starting in 2.6.29-rc1.  Now that this is the case,
given that ext4 is before ext2, it means that root filesystems that
were using the plain-jane ext2 format are getting mounted using the
ext4 filesystem driver, which is a change in behavior which could be
surprising to users.

It's doubtful that there are that many ext2-only root filesystem users
that would also have ext4 compiled into the kernel, but to adhere to
the principle of least surprise, the correct ordering in fs/Makefile
is ext3, followed by ext2, and finally ext4.
Signed-off-by: N"Theodore Ts'o" <tytso@mit.edu>

d8ae4601