提交 · 3f7705eab6722ad1a346d748c4aad55755d6c241 · openanolis / cloud-kernel

21 10月, 2006 3 次提交

[PATCH] xdr annotations: NFS readdir entries · 0dbb4c67

由 Al Viro 提交于 10月 19, 2006

on-the-wire data is big-endian

[in large part pulled from Alexey's patch]
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
Acked-by: NTrond Myklebust <trond.myklebust@fys.uio.no>
Acked-by: NNeil Brown <neilb@suse.de>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

0dbb4c67

[PATCH] NFS: fix minor bug in new NFS symlink code · 39cf8a13

由 Chuck Lever 提交于 10月 19, 2006

The original code confused a zero return code from pagevec_add() as success.

Test plan:
None.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

39cf8a13

[PATCH] NFS: Deal with failure of invalidate_inode_pages2() · cd9ae2b6

由 Trond Myklebust 提交于 10月 19, 2006

If invalidate_inode_pages2() fails, then it should in principle just be
because the current process was signalled.  In that case, we just want to
ensure that the inode's page cache remains marked as invalid.

Also add a helper to allow the O_DIRECT code to simply mark the page cache as
invalid once it is finished writing, instead of calling
invalidate_inode_pages2() itself.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

cd9ae2b6

01 10月, 2006 2 次提交

[PATCH] r/o bind mounts: monitor zeroing of i_nlink · ce71ec36

由 Dave Hansen 提交于 9月 30, 2006

Some filesystems, instead of simply decrementing i_nlink, simply zero it
during an unlink operation.  We need to catch these in addition to the
decrement operations.
Signed-off-by: NDave Hansen <haveblue@us.ibm.com>
Acked-by: NChristoph Hellwig <hch@lst.de>
Cc: Al Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

ce71ec36

[PATCH] r/o bind mounts: unlink: monitor i_nlink · 9a53c3a7

由 Dave Hansen 提交于 9月 30, 2006

When a filesystem decrements i_nlink to zero, it means that a write must be
performed in order to drop the inode from the filesystem.

We're shortly going to have keep filesystems from being remounted r/o between
the time that this i_nlink decrement and that write occurs.

So, add a little helper function to do the decrements.  We'll tie into it in a
bit to note when i_nlink hits zero.
Signed-off-by: NDave Hansen <haveblue@us.ibm.com>
Acked-by: NChristoph Hellwig <hch@lst.de>
Cc: Al Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

9a53c3a7

25 9月, 2006 1 次提交

[PATCH] Allow file systems to manually d_move() inside of ->rename() · 349457cc

由 Mark Fasheh 提交于 9月 08, 2006

Some file systems want to manually d_move() the dentries involved in a
rename.  We can do this by making use of the FS_ODD_RENAME flag if we just
have nfs_rename() unconditionally do the d_move().  While there, we rename
the flag to be more descriptive.

OCFS2 uses this to protect that part of the rename operation with a cluster
lock.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Cc: Al Viro <viro@zeniv.linux.org.uk>
Cc: Christoph Hellwig <hch@lst.de>
Signed-off-by: NAndrew Morton <akpm@osdl.org>

349457cc

23 9月, 2006 10 次提交

NFS: nfs_lookup - don't hash dentry when optimising away the lookup · fd684071

由 Trond Myklebust 提交于 9月 05, 2006

If the open intents tell us that a given lookup is going to result in a,
exclusive create, we currently optimize away the lookup call itself. The
reason is that the lookup would not be atomic with the create RPC call, so
why do it in the first place?

A problem occurs, however, if the VFS aborts the exclusive create operation
after the lookup, but before the call to create the file/directory: in this
case we will end up with a hashed negative dentry in the dcache that has
never been looked up.
Fix this by only actually hashing the dentry once the create operation has
been successfully completed.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

fd684071

NFS: Use cached page as buffer for NFS symlink requests · 94a6d753

由 Chuck Lever 提交于 8月 22, 2006

Now that we have a copy of the symlink path in the page cache, we can pass
a struct page down to the XDR routines instead of a string buffer.

Test plan:
Connectathon, all NFS versions.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

94a6d753

NFS: copy symlinks into page cache before sending NFS SYMLINK request · 873101b3

由 Chuck Lever 提交于 8月 22, 2006

Currently the NFS client does not cache symlinks it creates.  They get
cached only when the NFS client reads them back from the server.

Copy the symlink into the page cache before sending it.

Test plan:
Connectathon, all NFS versions.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

873101b3

NFS: Fix double d_drop in nfs_instantiate() error path · 4f390c15

由 Chuck Lever 提交于 8月 22, 2006

If the LOOKUP or GETATTR in nfs_instantiate fail, nfs_instantiate will do a
d_drop before returning.  But some callers already do a d_drop in the case
of an error return.  Make certain we do only one d_drop in all error paths.

This issue was introduced because over time, the symlink proc API diverged
slightly from the create/mkdir/mknod proc API.  To prevent other coding
mistakes of this type, change the symlink proc API to be more like
create/mkdir/mknod and move the nfs_instantiate call into the symlink proc
routines so it is used in exactly the same way for create, mkdir, mknod,
and symlink.

Test plan:
Connectathon, all versions of NFS.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

4f390c15

NFS: remove a no-longer-needed error check in nfs_symlink() · d3db90e2

由 Chuck Lever 提交于 8月 22, 2006

In the early days of NFS, there was no duplicate reply cache on the server.
Thus retransmitted non-idempotent requests often found that the request had
already completed on the server.  To avoid passing an unanticipated return
code to unsuspecting applications, NFS clients would often shunt error
codes that implied the request had been retried but already completed.

Thanks to NFS over TCP, duplicate reply caches on the server, and network
performance and reliability improvements, it is safe to remove such checks.

Test plan:
None.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

d3db90e2

NFS: Share NFS superblocks per-protocol per-server per-FSID · 54ceac45

由 David Howells 提交于 8月 22, 2006

The attached patch makes NFS share superblocks between mounts from the same
server and FSID over the same protocol.

It does this by creating each superblock with a false root and returning the
real root dentry in the vfsmount presented by get_sb(). The root dentry set
starts off as an anonymous dentry if we don't already have the dentry for its
inode, otherwise it simply returns the dentry we already have.

We may thus end up with several trees of dentries in the superblock, and if at
some later point one of anonymous tree roots is discovered by normal filesystem
activity to be located in another tree within the superblock, the anonymous
root is named and materialises attached to the second tree at the appropriate
point.

Why do it this way? Why not pass an extra argument to the mount() syscall to
indicate the subpath and then pathwalk from the server root to the desired
directory? You can't guarantee this will work for two reasons:

 (1) The root and intervening nodes may not be accessible to the client.

     With NFS2 and NFS3, for instance, mountd is called on the server to get
     the filehandle for the tip of a path. mountd won't give us handles for
     anything we don't have permission to access, and so we can't set up NFS
     inodes for such nodes, and so can't easily set up dentries (we'd have to
     have ghost inodes or something).

     With this patch we don't actually create dentries until we get handles
     from the server that we can use to set up their inodes, and we don't
     actually bind them into the tree until we know for sure where they go.

 (2) Inaccessible symbolic links.

     If we're asked to mount two exports from the server, eg:

	mount warthog:/warthog/aaa/xxx /mmm
	mount warthog:/warthog/bbb/yyy /nnn

     We may not be able to access anything nearer the root than xxx and yyy,
     but we may find out later that /mmm/www/yyy, say, is actually the same
     directory as the one mounted on /nnn. What we might then find out, for
     example, is that /warthog/bbb was actually a symbolic link to
     /warthog/aaa/xxx/www, but we can't actually determine that by talking to
     the server until /warthog is made available by NFS.

     This would lead to having constructed an errneous dentry tree which we
     can't easily fix. We can end up with a dentry marked as a directory when
     it should actually be a symlink, or we could end up with an apparently
     hardlinked directory.

     With this patch we need not make assumptions about the type of a dentry
     for which we can't retrieve information, nor need we assume we know its
     place in the grand scheme of things until we actually see that place.

This patch reduces the possibility of aliasing in the inode and page caches for
inodes that may be accessed by more than one NFS export. It also reduces the
number of superblocks required for NFS where there are many NFS exports being
used from a server (home directory server + autofs for example).

This in turn makes it simpler to do local caching of network filesystems, as it
can then be guaranteed that there won't be links from multiple inodes in
separate superblocks to the same cache file.

Obviously, cache aliasing between different levels of NFS protocol could still
be a problem, but at least that gives us another key to use when indexing the
cache.

This patch makes the following changes:

 (1) The server record construction/destruction has been abstracted out into
     its own set of functions to make things easier to get right.  These have
     been moved into fs/nfs/client.c.

     All the code in fs/nfs/client.c has to do with the management of
     connections to servers, and doesn't touch superblocks in any way; the
     remaining code in fs/nfs/super.c has to do with VFS superblock management.

 (2) The sequence of events undertaken by NFS mount is now reordered:

     (a) A volume representation (struct nfs_server) is allocated.

     (b) A server representation (struct nfs_client) is acquired.  This may be
     	 allocated or shared, and is keyed on server address, port and NFS
     	 version.

     (c) If allocated, the client representation is initialised.  The state
     	 member variable of nfs_client is used to prevent a race during
     	 initialisation from two mounts.

     (d) For NFS4 a simple pathwalk is performed, walking from FH to FH to find
     	 the root filehandle for the mount (fs/nfs/getroot.c).  For NFS2/3 we
     	 are given the root FH in advance.

     (e) The volume FSID is probed for on the root FH.

     (f) The volume representation is initialised from the FSINFO record
     	 retrieved on the root FH.

     (g) sget() is called to acquire a superblock.  This may be allocated or
     	 shared, keyed on client pointer and FSID.

     (h) If allocated, the superblock is initialised.

     (i) If the superblock is shared, then the new nfs_server record is
     	 discarded.

     (j) The root dentry for this mount is looked up from the root FH.

     (k) The root dentry for this mount is assigned to the vfsmount.

 (3) nfs_readdir_lookup() creates dentries for each of the entries readdir()
     returns; this function now attaches disconnected trees from alternate
     roots that happen to be discovered attached to a directory being read (in
     the same way nfs_lookup() is made to do for lookup ops).

     The new d_materialise_unique() function is now used to do this, thus
     permitting the whole thing to be done under one set of locks, and thus
     avoiding any race between mount and lookup operations on the same
     directory.

 (4) The client management code uses a new debug facility: NFSDBG_CLIENT which
     is set by echoing 1024 to /proc/net/sunrpc/nfs_debug.

 (5) Clone mounts are now called xdev mounts.

 (6) Use the dentry passed to the statfs() op as the handle for retrieving fs
     statistics rather than the root dentry of the superblock (which is now a
     dummy).
Signed-Off-By: NDavid Howells <dhowells@redhat.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

54ceac45

NFS: Move rpc_ops from nfs_server to nfs_client · 8fa5c000

由 David Howells 提交于 8月 22, 2006

Move the rpc_ops from the nfs_server struct to the nfs_client struct as they're
common to all server records of a particular NFS protocol version.
Signed-Off-By: NDavid Howells <dhowells@redhat.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

8fa5c000

NFS: Add an ACCESS cache memory shrinker · 979df72e

由 Trond Myklebust 提交于 7月 25, 2006

A pinned inode may in theory end up filling memory with cached ACCESS
calls. This patch ensures that the VM may shrink away the cache in these
particular cases.
The shrinker works by iterating through the list of inodes on the global
nfs_access_lru_list, and removing the least recently used access
cache entry until it is done (or until the entire cache is empty).
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

979df72e

NFS: Add a global LRU list for the ACCESS cache · cfcea3e8

由 Trond Myklebust 提交于 7月 25, 2006

...in order to allow the addition of a memory shrinker.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

cfcea3e8

NFS: Add a new ACCESS rpc call cache to the linux nfs client · 1c3c07e9

由 Trond Myklebust 提交于 7月 25, 2006

The current access cache only allows one entry at a time to be cached for each
inode. Add a per-inode red-black tree in order to allow more than one to
be cached at a time.

Should significantly cut down the time spent in path traversal for shared
directories such as ${PATH}, /usr/share, etc.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

1c3c07e9

06 7月, 2006 1 次提交

NFS: Optimise away an excessive GETATTR call when a file is symlinked · 4e0641a7

由 Trond Myklebust 提交于 7月 05, 2006

In the case when compiling via a symlink tree, we want to ensure that the
close-to-open GETATTR call is applied only to the final file, and not to
the symlink.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

4e0641a7

09 6月, 2006 2 次提交

T
NFS: Ensure the client submounts, when it crosses a server mountpoint. · 55a97593
由 Trond Myklebust 提交于 6月 09, 2006
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
55a97593

NFS: Separate metadata and page cache revalidation mechanisms · 44b11874

由 Trond Myklebust 提交于 5月 25, 2006

Separate out the function of revalidating the inode metadata, and
revalidating the mapping. The former may be called by lookup(),
and only really needs to check that permissions, ctime, etc haven't changed
whereas the latter needs only done when we want to read data from the page
cache, and may need to sync and then invalidate the mapping.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

44b11874

20 4月, 2006 1 次提交

NFS: remove needless check in nfs_opendir() · 7451c4f0

由 Carsten Otte 提交于 4月 19, 2006

Local variable res was initialized to 0 - no check needed here.
Signed-off-by: NCarsten Otte <cotte@de.ibm.com>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

7451c4f0

29 3月, 2006 1 次提交

[PATCH] Make most file operations structs in fs/ const · 4b6f5d20

由 Arjan van de Ven 提交于 3月 28, 2006

This is a conversion to make the various file_operations structs in fs/
const.  Basically a regexp job, with a few manual fixups

The goal is both to increase correctness (harder to accidentally write to
shared datastructures) and reducing the false sharing of cachelines with
things that get dirty in .data (while .rodata is nicely read only and thus
cache clean)
Signed-off-by: NArjan van de Ven <arjan@infradead.org>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

4b6f5d20

21 3月, 2006 3 次提交

NFS: Make nfs_fhget() return appropriate error values · 03f28e3a

由 Trond Myklebust 提交于 3月 20, 2006

Currently it returns NULL, which usually gets interpreted as ENOMEM. In
fact it can mean a host of issues.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

03f28e3a

NFS: directory trace messages · 1e7cb3dc

由 Chuck Lever 提交于 3月 20, 2006

Reuse NFSDBG_DIRCACHE and NFSDBG_LOOKUPCACHE to provide additional
diagnostic messages that trace the operation of the NFS client's
directory behavior.  A few new messages are now generated when NFSDBG_VFS
is active, as well, to trace normal VFS activity.  This compromise
provides better trace debugging for those who use pre-built kernels,
without adding a lot of extra noise to the standard debug settings.

Test-plan:
Enable NFS trace debugging with flags 1, 2, or 4.  You should be able to
see different types of trace messages with each flag setting.
Signed-off-by: NChuck Lever <cel@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

1e7cb3dc

NFS: add I/O performance counters · 91d5b470

由 Chuck Lever 提交于 3月 20, 2006

Invoke the byte and event counter macros where we want to count bytes and
events.

Clean-up: fix a possible NULL dereference in nfs_lock, and simplify
nfs_file_open.

Test-plan:
fsx and iozone on UP and SMP systems, with and without pre-emption.  Watch
for memory overwrite bugs, and performance loss (significantly more CPU
required per op).
Signed-off-by: NChuck Lever <cel@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

91d5b470

10 1月, 2006 1 次提交

[PATCH] mutex subsystem, semaphore to mutex: VFS, ->i_sem · 1b1dcc1b

由 Jes Sorensen 提交于 1月 09, 2006

This patch converts the inode semaphore to a mutex. I have tested it on
XFS and compiled as much as one can consider on an ia64. Anyway your
luck with it might be different.
Modified-by: NIngo Molnar <mingo@elte.hu>

(finished the conversion)
Signed-off-by: NJes Sorensen <jes@sgi.com>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

1b1dcc1b

07 1月, 2006 1 次提交
- T
  NFSv4: Ensure that we return the delegation on the target of a rename too. · 24174119
  由 Trond Myklebust 提交于 1月 03, 2006
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
  24174119
04 12月, 2005 1 次提交

NFS: Fix post-op attribute revalidation... · 5ba7cc48

由 Trond Myklebust 提交于 12月 03, 2005

  - Missing nfs_mark_for_revalidate in nfs_proc_link()
  - Missing nfs_mark_for_revalidate in nfs_rename()
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

5ba7cc48

05 11月, 2005 1 次提交

NFSv4: Return any delegations before sillyrenaming the file · 34ea8188

由 Trond Myklebust 提交于 11月 04, 2005

 I missed this one... Any form of rename will result in a delegation
 recall, so it is more efficient to return the one we hold before
 trying the rename.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

34ea8188

28 10月, 2005 3 次提交

T
NFS: Ensure that nfs_link() instantiates the dentry correctly · cf809556
由 Trond Myklebust 提交于 10月 27, 2005
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
cf809556

NFS: nfs_lookup doesn't need to revalidate the parent directory's inode · 0c70b501

由 Chuck Lever 提交于 10月 25, 2005

 nfs_lookup() used to consult a lookup cache before trying an actual wire
 lookup operation.  The lookup cache would be invalid, of course, if the
 parent directory's mtime had changed, so nfs_lookup performed an inode
 revalidation on the parent.

 Since nfs_lookup() doesn't use a cache anymore, the revalidation is no
 longer necessary.  There are cases where it will generate a lot of
 unnecessary GETATTR traffic.

 See http://bugzilla.linux-nfs.org/show_bug.cgi?id=9

 Test-plan:
 Use lndir and "rm -rf" and watch for excess GETATTR traffic or application
 level errors.
Signed-off-by: NChuck Lever <cel@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

0c70b501

T
NFS: Cleanup initialisation of struct nfs_fattr · 0e574af1
由 Trond Myklebust 提交于 10月 27, 2005
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
0e574af1

19 10月, 2005 4 次提交

NFS: Fix rename of directory onto empty directory · 6fe43f9e

由 Trond Myklebust 提交于 10月 18, 2005

 If someone tries to rename a directory onto an empty directory, we
 currently fail and return EBUSY.
 This patch ensures that we try the rename if both source and target
 are directories, and that we fail with a correct error of EISDIR if
 the source is not a directory.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

6fe43f9e

T
NFSv4: Return delegation upon rename or removal of file. · cae7a073
由 Trond Myklebust 提交于 10月 18, 2005
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
cae7a073
T
[NFS]: Check that the server returns a valid regular file to our OPEN request · 6f926b5b
由 Trond Myklebust 提交于 10月 18, 2005
```
 Since it appears that some servers don't...
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
6f926b5b

NFSv4: Eliminate nfsv4 open race... · 02a913a7

由 Trond Myklebust 提交于 10月 18, 2005

 Make NFSv4 return the fully initialized file pointer with the
 stateid that it created in the lookup w/intent.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

02a913a7

24 9月, 2005 2 次提交

NFS: Don't expose internal READDIR errors to userspace · 9aa48b7e

由 Trond Myklebust 提交于 8月 25, 2005

 Fixes a condition whereby the kernel is returning the non-POSIX error
 EBADCOOKIE to userspace.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

9aa48b7e

NFS: Drop inode after rename · 20509f1b

由 Trond Myklebust 提交于 8月 25, 2005

 When doing a rename on top of an existing file that is not in use,
 the inode of the overwritten file will remain in the icache.

 The fix is to decrement i_nlink of the overwritten inode, like we
 do for unlink, rmdir etc already.

 Problem diagnosed by Olaf Kirch. This patch is a slight variation
 on his fix.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

20509f1b

20 8月, 2005 1 次提交

[PATCH] NFSv4: unbalanced BKL in nfs_atomic_lookup() · 01c314a0

由 Steve Dickson 提交于 8月 19, 2005

Added missing unlock_kernel() to NFSv4 atomic lookup.
Signed-off-by: NSteve Dickson <steved@redhat.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

01c314a0

19 8月, 2005 2 次提交

[PATCH] NFS: Introduce the use of inode->i_lock to protect fields in nfsi · dc59250c

由 Chuck Lever 提交于 8月 18, 2005

Down the road we want to eliminate the use of the global kernel lock entirely
from the NFS client.  To do this, we need to protect the fields in the
nfs_inode structure adequately.  Start by serializing updates to the
"cache_validity" field.

Note this change addresses an SMP hang found by njw@osdl.org, where processes
deadlock because nfs_end_data_update and nfs_revalidate_mapping update the
"cache_validity" field without proper serialization.

Test plan:
 Millions of fsx ops on SMP clients.  Run Nick Wilson's breaknfs program on
 large SMP clients.
Signed-off-by: NChuck Lever <cel@netapp.com>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

dc59250c

[PATCH] NFS: use atomic bitops to manipulate flags in nfsi->flags · 412d582e

由 Chuck Lever 提交于 8月 18, 2005

Introduce atomic bitops to manipulate the bits in the nfs_inode structure's
"flags" field.

Using bitops means we can use a generic wait_on_bit call instead of an ad hoc
locking scheme in fs/nfs/inode.c, so we can remove the "nfs_i_wait" field from
nfs_inode at the same time.

The other new flags field will continue to use bitmask and logic AND and OR.
This permits several flags to be set at the same time efficiently.  The
following patch adds a spin lock to protect these flags, and this spin lock
will later cover other fields in the nfs_inode structure, amortizing the cost
of using this type of serialization.

Test plan:
 Millions of fsx ops on SMP clients.
Signed-off-by: NChuck Lever <cel@netapp.com>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

412d582e

openanolis / cloud-kernel 大约 1 年 前同步成功

openanolis / cloud-kernel
大约 1 年前同步成功