- 03 April 2009, 2 commits
-
-
By David Howells

Bind data storage objects in the local cache to NFS inodes.

Signed-off-by: David Howells <dhowells@redhat.com>
Acked-by: Steve Dickson <steved@redhat.com>
Acked-by: Trond Myklebust <Trond.Myklebust@netapp.com>
Acked-by: Al Viro <viro@zeniv.linux.org.uk>
Tested-by: Daire Byrne <Daire.Byrne@framestore.com>
-
By David Howells

Register NFS for caching and retrieve the top-level cache index object cookie.

Signed-off-by: David Howells <dhowells@redhat.com>
Acked-by: Steve Dickson <steved@redhat.com>
Acked-by: Trond Myklebust <Trond.Myklebust@netapp.com>
Acked-by: Al Viro <viro@zeniv.linux.org.uk>
Tested-by: Daire Byrne <Daire.Byrne@framestore.com>
-
- 20 March 2009, 1 commit
-
-
By Trond Myklebust

Close-to-open cache consistency rules really only require us to flush out writes on calls to close(), and require us to revalidate attributes on the very last close of the file. Currently we appear to be doing a lot of extra attribute revalidation and cache flushes.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
- 12 March 2009, 5 commits
-
-
By Trond Myklebust

The following patch is a combination of a patch by myself and Peter Staubach.

Trond: If we allow other processes to dirty pages while a process is doing a consistency sync to disk, we can end up never making progress.

Peter: Attached is a patch which addresses a continuing problem with the NFS client generating out-of-order WRITE requests. While this is compliant with all of the current protocol specifications, there are servers in the market which cannot handle out-of-order WRITE requests very well. Also, this may lead to sub-optimal block allocations in the underlying file system on the server. This may cause read throughput to be reduced when reading the file back from the server.

Peter: There has been a lot of work recently done to address out-of-order issues on a systemic level. However, the NFS client is still susceptible to the problem. Out-of-order WRITE requests can occur when pdflush is in the middle of writing out pages while the process dirtying the pages calls generic_file_buffered_write, which calls generic_perform_write, which calls balance_dirty_pages_ratelimited, which ends up calling writeback_inodes, which ends up calling back into the NFS client to write out dirty pages for the same file that pdflush happens to be working with.

Signed-off-by: Peter Staubach <staubach@redhat.com>
[modification by Trond to merge the two similar patches]
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
By Trond Myklebust

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
By Trond Myklebust

Currently, filling struct nfs_fattr is more or less an all-or-nothing operation, since NFSv2 and NFSv3 have only mandatory attributes. In NFSv4, some attributes are optional, and so we may simply not be able to fill in those fields. Furthermore, NFSv4 allows you to specify which attributes you are interested in retrieving, thus permitting you to optimise away retrieval of attributes that you know will not change...

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
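For illustration only, a minimal sketch of the idea: a per-attribute validity mask lets the client fill in only the fields the server actually returned. The flag names and struct below are placeholders, not the actual patch.

```c
#include <linux/fs.h>
#include <linux/types.h>

/* Illustrative validity bits; the real patch defines its own set. */
#define ATTR_VALID_MODE  0x0001
#define ATTR_VALID_SIZE  0x0002

struct fattr_sketch {
	unsigned int valid;	/* which of the fields below hold server data */
	umode_t mode;
	loff_t size;
};

static void apply_fattr_sketch(struct inode *inode,
			       const struct fattr_sketch *fattr)
{
	/* Only touch attributes the server actually supplied. */
	if (fattr->valid & ATTR_VALID_SIZE)
		i_size_write(inode, fattr->size);
	if (fattr->valid & ATTR_VALID_MODE)
		inode->i_mode = fattr->mode;
}
```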
-
By NeilBrown

If cached directory contents become incorrect, there is no way to flush the contents. This contrasts with files, where file locking is the recommended way to ensure cache consistency between multiple applications (a read-lock always flushes the cache).

Also, while changes to files often change the size of the file (thus triggering a cache flush), changes to directories often do not change the apparent size (as the size is often rounded to a block size). So it is particularly important with directories to avoid the possibility of an incorrect cache wherever possible.

When the link count on a directory changes, it implies a change in the number of child directories, and so a change in the contents of this directory. So use that as a trigger to flush cached contents.

When the ctime changes but the mtime does not, there are two possible reasons:
1/ The owner/mode information has been changed.
2/ utimes has been used to set the mtime backwards.

In the first case, a data-cache flush is not required. In the second case it is. So, on the basis that correctness trumps performance, flush the directory contents cache in this case also.

Signed-off-by: NeilBrown <neilb@suse.de>
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
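A minimal sketch of the invalidation heuristic described above, with era-appropriate timespec fields; the function name and parameters are illustrative, not the actual patch.

```c
#include <linux/fs.h>
#include <linux/time.h>

static bool dir_cache_should_flush(const struct inode *dir,
				   unsigned int new_nlink,
				   const struct timespec *new_ctime,
				   const struct timespec *new_mtime)
{
	/* A link-count change implies child directories came or went. */
	if (new_nlink != dir->i_nlink)
		return true;
	/*
	 * ctime changed but mtime did not: either chmod/chown (cache still
	 * valid) or utimes() moved mtime backwards (cache invalid).
	 * Correctness trumps performance, so flush in this case too.
	 */
	if (!timespec_equal(new_ctime, &dir->i_ctime) &&
	    timespec_equal(new_mtime, &dir->i_mtime))
		return true;
	return false;
}
```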
-
By Suresh Jayaraman

Remove a redundant NFS_STALE() check, a leftover from commit 691beb13.

Signed-off-by: Suresh Jayaraman <sjayaraman@suse.de>
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
- 24 December 2008, 2 commits
-
-
By Peter Staubach

Hi.

I've been looking at a bugzilla which describes a problem where a customer was advised to use either the "noac" or "actimeo=0" mount options to solve a consistency problem that they were seeing in the file attributes. It turned out that this solution did not work reliably for them because sometimes the local attribute cache was believed to be valid and not timed out. (With an attribute cache timeout of 0, the cache should always appear to be timed out.)

In looking at this situation, it appears to me that the problem is that the attribute cache timeout code has an off-by-one error in it. It is assuming that the cache is valid in the region [read_cache_jiffies, read_cache_jiffies + attrtimeo]. The cache should be considered valid only in the region [read_cache_jiffies, read_cache_jiffies + attrtimeo). With this change, the options "noac" and "actimeo=0" work as originally expected.

This problem was previously addressed by special-casing the attrtimeo == 0 case. However, since the problem is only an off-by-one error, the cleaner solution is to address the off-by-one error and thus not require the special case.

Thanx...

ps

Signed-off-by: Peter Staubach <staubach@redhat.com>
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
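For illustration, a sketch of the half-open interval check being described; the helper name and parameters are placeholders, not the actual patch.

```c
#include <linux/jiffies.h>

/*
 * The cache must be valid on the half-open interval
 * [read_cache_jiffies, read_cache_jiffies + attrtimeo), so that
 * attrtimeo == 0 means "always expired".
 */
static bool attr_cache_expired(unsigned long read_cache_jiffies,
			       unsigned long attrtimeo)
{
	/* Inclusive check (the off-by-one): still "valid" when
	 * jiffies == read_cache_jiffies + attrtimeo.
	 *
	 * return !time_in_range(jiffies, read_cache_jiffies,
	 *			  read_cache_jiffies + attrtimeo);
	 */

	/* Half-open check: expired once attrtimeo jiffies have elapsed. */
	return time_after_eq(jiffies, read_cache_jiffies + attrtimeo);
}
```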
-
By Trond Myklebust

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
- 29 October 2008, 1 commit
-
-
By Trond Myklebust

The most important property we need from nfs_attr_generation_counter is monotonicity, which is not guaranteed by the current system of smp memory barriers. We should convert it to an atomic_long_t, and drop the memory barriers.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
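A minimal sketch of such a counter; the names follow the commit message, but this is an illustration rather than the patch itself.

```c
#include <linux/atomic.h>

static atomic_long_t nfs_attr_generation_counter = ATOMIC_LONG_INIT(0);

static unsigned long nfs_read_attr_generation_counter(void)
{
	return (unsigned long)atomic_long_read(&nfs_attr_generation_counter);
}

static unsigned long nfs_inc_attr_generation_counter(void)
{
	/*
	 * atomic_long_inc_return() makes the increment itself atomic and
	 * hands back a monotonically increasing value, so the separate
	 * smp memory barriers can be dropped.
	 */
	return (unsigned long)atomic_long_inc_return(&nfs_attr_generation_counter);
}
```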
-
- 27 October 2008, 1 commit
-
-
By Alan Cox

Signed-off-by: Alan Cox <alan@redhat.com>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
-
- 15 October 2008, 3 commits
-
-
By Trond Myklebust

The cache_change_attribute is used to decide whether or not a directory has changed, in which case we may need to look it up again. Again, the use of 'jiffies' leads to an issue of resolution. Once again, the fix is to change nfs_inode->cache_change_attribute, and just make it a simple counter.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
By Trond Myklebust

It appears that 'jiffies' timestamps do not have high enough resolution for nfs_inode_attrs_need_update(). One problem is that a GETATTR can be launched within < 1 jiffy of the last operation that updated the attribute. Another problem is that RPC calls can take < 1 jiffy to execute.

We can fix this by switching the variables to use a simple global counter that gets incremented every time we start another GETATTR call.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
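As an illustration of the comparison this enables (names are placeholders): the counter is sampled when the GETATTR is launched, and the reply is applied only if its generation is newer than the last update recorded on the inode.

```c
#include <linux/types.h>

/*
 * fattr_gencount: generation sampled when the GETATTR call started.
 * inode_last_updated: generation of the last update applied to the inode.
 * Subtraction keeps the comparison correct across counter wraparound.
 */
static bool attrs_need_update(unsigned long fattr_gencount,
			      unsigned long inode_last_updated)
{
	return (long)(fattr_gencount - inode_last_updated) > 0;
}
```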
-
By Trond Myklebust

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
- 10 October 2008, 1 commit
-
-
By Trond Myklebust

This fixes a regression seen when running the Connectathon testsuite against an ext3 filesystem. The reason was that the inode was constantly being marked as 'just updated' by the jiffy wraparound test. This again meant that newer GETATTR calls were failing to pass the nfs_inode_attrs_need_update() test unless the changes caused a ctime update on the server, since they were perceived as having been started before the latest inode update.

Given that nfs_inode_attrs_need_update() already checks for wraparound of nfsi->last_updated, we can drop the buggy "protection" in nfs_update_inode().

Also make a slight micro-optimisation of nfs_inode_attrs_need_update(): we are more often going to see time_after(fattr->time_start, nfsi->last_updated) be true, rather than seeing an update of ctime/size, so put that test first to ensure that we optimise away the ctime/size tests.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
- 08 October 2008, 7 commits
-
-
By Trond Myklebust

Currently, if two processes are both trying to revalidate metadata for the same inode, they will find themselves being serialised. There is no good justification for this now that we have improved our ability to detect stale attribute data, so we should remove that serialisation.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
By Trond Myklebust

Ensure that it sets the inode metadata under the correct spinlock.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
By Trond Myklebust

If we're merely checking the inode attributes because we suspect that the 'updated' attributes returned by the RPC call are stale, then we shouldn't be doing weak cache consistency updates or clearing the cache_validity flags.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
By Trond Myklebust

In the case where there are parallel RPC calls to the same inode, we may receive stale metadata due to the lack of ordering, hence the sanity checking of metadata in nfs_refresh_inode(). Currently, __nfs_revalidate_inode() is calling nfs_update_inode() directly, without any further sanity checks, and hence may end up setting the inode up with stale metadata. The fix is to use nfs_refresh_inode() instead of nfs_update_inode().

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
By Trond Myklebust

If we believe that the attributes are old (see nfs_refresh_inode()), then we shouldn't force an update. Also ensure that we hold the inode->i_lock across attribute checks and the call to nfs_refresh_inode_locked() to ensure that we don't race with other attribute updates.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
By Trond Myklebust

Currently nfs_refresh_inode() will only update the inode metadata if it sees that the RPC call that returned the nfs_fattr was started after the last update of the inode. This means that if we have parallel RPC calls to the same inode (when sending WRITE calls, for instance), we may often miss updates.

This patch attempts to recover those missed updates by also accepting them if the ctime in the nfs_fattr is more recent than the inode's cached ctime. It also recovers the case where the file size has increased, but the ctime has not been updated due to limited ctime resolution.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
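A sketch of the acceptance test being described, with illustrative names and era-appropriate timespec fields; this is not the actual patch.

```c
#include <linux/fs.h>
#include <linux/time.h>

static bool accept_stale_looking_update(const struct inode *inode,
					const struct timespec *fattr_ctime,
					loff_t fattr_size)
{
	/* Server ctime is more recent than our cached ctime. */
	if (timespec_compare(fattr_ctime, &inode->i_ctime) > 0)
		return true;
	/* File grew, but limited ctime resolution hid the change. */
	if (fattr_size > i_size_read(inode) &&
	    timespec_equal(fattr_ctime, &inode->i_ctime))
		return true;
	return false;
}
```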
-
By Trond Myklebust

Try to avoid taking and dropping the inode->i_lock more than once. Do so by moving the code in nfs_refresh_inode() that needs to be done under the spinlock into a function nfs_refresh_inode_locked(), and then having both nfs_refresh_inode() and nfs_post_op_update_inode() call it directly.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
- 27 July 2008, 1 commit
-
-
By Alexey Dobriyan

The kmem cache passed to a constructor is only needed for constructors that are themselves multiplexers. Nobody uses this "feature", nor does anybody use the passed kmem cache in a non-trivial way, so pass only a pointer to the object.

Non-trivial places are:
arch/powerpc/mm/init_64.c
arch/powerpc/mm/hugetlbpage.c

This is flag day, yes.

Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com>
Acked-by: Pekka Enberg <penberg@cs.helsinki.fi>
Acked-by: Christoph Lameter <cl@linux-foundation.org>
Cc: Jon Tollefson <kniht@linux.vnet.ibm.com>
Cc: Nick Piggin <nickpiggin@yahoo.com.au>
Cc: Matt Mackall <mpm@selenic.com>
[akpm@linux-foundation.org: fix arch/powerpc/mm/hugetlbpage.c]
[akpm@linux-foundation.org: fix mm/slab.c]
[akpm@linux-foundation.org: fix ubifs]
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
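A sketch of the flag-day signature change for slab constructors, using an invented cache and constructor name:

```c
#include <linux/slab.h>
#include <linux/string.h>

/*
 * Before: void ctor(struct kmem_cache *cachep, void *obj);
 * After:  void ctor(void *obj);
 */
static void example_ctor(void *obj)
{
	memset(obj, 0, 64);	/* the cache pointer is no longer passed in */
}

/* kmem_cache_create() then takes the single-argument constructor:
 * cache = kmem_cache_create("example", 64, 0, SLAB_HWCACHE_ALIGN, example_ctor);
 */
```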
-
- 16 July 2008, 2 commits
-
-
By Trond Myklebust

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
By Trond Myklebust

The main problem is dealing with inode->i_size: we need to set the inode->i_lock on all attribute updates, and so vmtruncate won't cut it. Make an NFS-private version of vmtruncate that has the necessary locking semantics.

The result should be that the following inode attribute updates are protected by inode->i_lock:
	nfsi->cache_validity
	nfsi->read_cache_jiffies
	nfsi->attrtimeo
	nfsi->attrtimeo_timestamp
	nfsi->change_attr
	nfsi->last_updated
	nfsi->cache_change_attribute
	nfsi->access_cache
	nfsi->access_cache_entry_lru
	nfsi->access_cache_inode_lru
	nfsi->acl_access
	nfsi->acl_default
	nfsi->nfs_page_tree
	nfsi->ncommit
	nfsi->npages
	nfsi->open_files
	nfsi->silly_list
	nfsi->acl
	nfsi->open_states
	inode->i_size
	inode->i_atime
	inode->i_mtime
	inode->i_ctime
	inode->i_nlink
	inode->i_uid
	inode->i_gid

The following is protected by dir->i_mutex:
	nfsi->cookieverf

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
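A simplified sketch (not the actual patch) of an NFS-private truncate helper that updates i_size under inode->i_lock, unlike the generic vmtruncate(); the real code must also deal with mmap'd regions.

```c
#include <linux/fs.h>
#include <linux/mm.h>

static void nfs_truncate_sketch(struct inode *inode, loff_t newsize)
{
	spin_lock(&inode->i_lock);
	i_size_write(inode, newsize);	/* size change is visible atomically */
	spin_unlock(&inode->i_lock);

	/* Throw away cached pages beyond the new end-of-file. */
	truncate_inode_pages(&inode->i_data, newsize);
}
```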
-
- 10 July 2008, 2 commits
-
-
By Trond Myklebust

...and ensure that we obey the NFS_INO_INVALID_ACL flag when retrieving the acls.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
By Trond Myklebust

ftruncate() access checking is supposed to be performed at open() time, just like reads and writes.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
- 17 May 2008, 3 commits
-
-
By Jan Blunck

Here are some more places where path_{get,put}() can be used instead of a dput()/mntput() pair.

Signed-off-by: Jan Blunck <jblunck@suse.de>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Cc: "J. Bruce Fields" <bfields@fieldses.org>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
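A short sketch of the substitution (the helper names are illustrative):

```c
#include <linux/path.h>
#include <linux/dcache.h>
#include <linux/mount.h>

static void drop_ref_old(struct path *path)
{
	dput(path->dentry);	/* old style: release the dentry ... */
	mntput(path->mnt);	/* ... and the vfsmount separately */
}

static void drop_ref_new(struct path *path)
{
	path_put(path);		/* new style: one call drops both references */
}
```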
-
By Harvey Harrison

__FUNCTION__ is gcc-specific, use __func__

Signed-off-by: Harvey Harrison <harvey.harrison@gmail.com>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Cc: "J. Bruce Fields" <bfields@fieldses.org>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
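A one-line illustration of the change (__func__ is standard C99, __FUNCTION__ is a gcc extension):

```c
#include <linux/kernel.h>

static void example(void)
{
	pr_debug("%s: entered\n", __func__);	/* was: __FUNCTION__ */
}
```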
-
By Trond Myklebust

Both the 'noac' and 'actimeo=0' mount options should ensure that attributes are not cached; however, a bug in nfs_attribute_timeout() means that currently the attributes may in fact get cached for up to one jiffy. This has been seen to cause corruption in some applications.

The reason for the bug is that the time_in_range() test returns 'true' as long as the current time lies between nfsi->read_cache_jiffies and nfsi->read_cache_jiffies + nfsi->attrtimeo. In other words, if jiffies equals nfsi->read_cache_jiffies, then we still cache the attribute data.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
- 09 April 2008, 1 commit
-
-
By Jeff Layton

The nfs_open_context struct had a "flags" field added recently, but the allocator isn't initializing it. It also looks like the allocator isn't initializing the mode or list either, but they seem to be overwritten by the caller, so that's less of an issue.

Signed-off-by: Jeff Layton <jlayton@redhat.com>
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
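A minimal sketch of the general fix pattern (the struct and helper below are invented for illustration, not the nfs_open_context code itself): initialise every field at allocation time rather than relying on callers to overwrite them.

```c
#include <linux/slab.h>
#include <linux/list.h>
#include <linux/types.h>

struct open_context_sketch {
	struct list_head list;
	unsigned long flags;
	fmode_t mode;
};

static struct open_context_sketch *alloc_open_context(fmode_t mode)
{
	struct open_context_sketch *ctx;

	ctx = kzalloc(sizeof(*ctx), GFP_KERNEL);	/* zeroes flags and mode */
	if (ctx) {
		INIT_LIST_HEAD(&ctx->list);
		ctx->mode = mode;
	}
	return ctx;
}
```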
-
- 15 March 2008, 1 commit
-
-
By Trond Myklebust

The NFSv4 protocol allows clients to negotiate security protocols on the fly, in the case where an administrator on the server changes the export settings and/or in the case where we may have a filesystem migration event. Instead of having the NFS client code cache credentials that are tied to a particular AUTH method, it is therefore preferable to have a generic credential that can be converted into whatever AUTH is in use by the RPC client when the read/write/sillyrename/... is put on the wire.

We do this by means of the new "generic" credential, which basically just caches the minimal information that is needed to look up an RPCSEC_GSS, AUTH_SYS, or AUTH_NULL credential.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
- 08 March 2008, 1 commit
-
-
By Trond Myklebust

When we detect that we've crossed a mountpoint on the remote server, we must take care not to use that inode to revalidate the fsid on our current superblock. To do so, we label the inode as a remote mountpoint, and check for that in nfs_update_inode().

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
- 26 February 2008, 2 commits
-
-
By Trond Myklebust

NFS post-rpciod cleanups often involve tasks that cannot be safely performed within the rpciod context (due to deadlock concerns). We therefore add a dedicated NFS workqueue that can perform tasks like cleaning up state after an interrupted NFSv4 open() call, or calling put_nfs_open_context() after an asynchronous read or write call.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
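A sketch of the mechanism described above; the workqueue name and setup function are illustrative, not the patch itself.

```c
#include <linux/workqueue.h>
#include <linux/errno.h>

/* Dedicated queue for cleanup work that is unsafe to run in rpciod context. */
static struct workqueue_struct *nfsiod_workqueue;

static int nfsiod_start(void)
{
	nfsiod_workqueue = create_singlethread_workqueue("nfsiod");
	return nfsiod_workqueue ? 0 : -ENOMEM;
}

/* Callers then defer cleanup with: queue_work(nfsiod_workqueue, &work); */
```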
-
By Trond Myklebust

We can't allow rpc callback functions like task->tk_ops->rpc_call_prepare() and task->tk_ops->rpc_call_done() to call mntput() in any way, since that will cause a deadlock when the call to rpc_shutdown_client() attempts to wait on 'task' to complete.

We can avoid the above deadlock by moving calls to mntput() to the task->tk_ops->rpc_release() callback, since at that time the task will be marked as completed, and so rpc_shutdown_client() won't attempt to wait on it.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
- 30 January 2008, 4 commits
-
-
By Trond Myklebust

Otherwise, there is a potential deadlock if the last dput() from an NFSv4 close() or other asynchronous operation leads to nfs_clear_inode calling the synchronous delegreturn.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
By Benny Halevy

Signed-off-by: Benny Halevy <bhalevy@panasas.com>
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
By Benny Halevy

use NFS_I(inode)->flags instead

Signed-off-by: Benny Halevy <bhalevy@panasas.com>
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-
By Trond Myklebust

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
-