提交 · ccf01ef7aa9c6c293a1c64c27331a2ce227916ec · openanolis / cloud-kernel

25 6月, 2006 7 次提交

T

Merge branch 'odirect' · ccf01ef7
由 Trond Myklebust 提交于 6月 25, 2006

ccf01ef7

NFS: alloc nfs_read/write_data as direct I/O is scheduled · 82b145c5

由 Chuck Lever 提交于 6月 20, 2006

Re-arrange the logic in the NFS direct I/O path so that nfs_read/write_data
structs are allocated just before they are scheduled, rather than
allocating them all at once before we start scheduling requests.
Signed-off-by: NChuck Lever <cel@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

82b145c5

NFS: Eliminate nfs_get_user_pages() · 06cf6f2e

由 Chuck Lever 提交于 6月 20, 2006

Neil Brown observed that the kmalloc() in nfs_get_user_pages() is more
likely to fail if the I/O is large enough to require the allocation of more
than a single page to keep track of all the pinned pages in the user's
buffer.

Instead of tracking one large page array per dreq/iocb, track pages per
nfs_read/write_data, just like the cached I/O path does.  An array for
pages is already allocated for us by nfs_readdata_alloc() (and the write
and commit equivalents).

This is also required for adding support for vectored I/O to the NFS direct
I/O path.

The original reason to pin the user buffer and allocate all the NFS data
structures before trying to schedule I/O was to ensure all needed resources
are allocated on the client before starting to send requests.  This reduces
the chance that resource exhaustion on the client will cause a short read
or write.

On the other hand, for an application making very large application I/O
requests, this means that it will be nearly impossible for the application
to make forward progress on a resource-limited client.

Thus, moving the buffer pinning functionality into the I/O scheduling
loops should be good for scalability.  The next patch will do the same for
NFS data structure allocation.
Signed-off-by: NChuck Lever <cel@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

06cf6f2e

NFS: refactor nfs_direct_free_user_pages · 9c93ab7d

由 Chuck Lever 提交于 6月 20, 2006

Clean-up and fix a minor bug: the logic was dirtying page cache pages on
both read and write operations.
Signed-off-by: NChuck Lever <cel@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

9c93ab7d

NFS: remove user_addr, user_count, and pos from nfs_direct_req · 51a7bc6c

由 Chuck Lever 提交于 6月 20, 2006

Make the user_addr, user_count, and pos parameters explicit to the
scheduler routines, and remove the fields from nfs_direct_req.  The
iovec API will be passing in a series of these, not just one set.
Signed-off-by: NChuck Lever <cel@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

51a7bc6c

NFS: "open code" the NFS direct write rescheduler · fedb595c

由 Chuck Lever 提交于 6月 20, 2006

An NFSv3/v4 client must reschedule on-the-wire writes if the writes are
UNSTABLE, and the server reboots before the client can complete a
subsequent COMMIT request.

To support direct asynchronous scatter-gather writes, the write
rescheduler in fs/nfs/direct.c must not depend on the I/O parameters
in the controlling nfs_direct_req structure.  iovecs can be somewhat
arbitrarily complex, so there could be an unbounded amount of information
to save for a rarely encountered requirement.

Refactor the direct write rescheduler so it uses information from each
nfs_write_data structure to reschedule writes, instead of caching that
information in the controlling nfs_direct_req structure.
Signed-off-by: NChuck Lever <cel@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

fedb595c

NFS: Separate functions for counting outstanding NFS direct I/Os · b1c5921c

由 Chuck Lever 提交于 6月 20, 2006

Factor out the logic that increments and decrements the outstanding I/O
count.  This will be a commonly used bit of code in upcoming patches.
Also make this an atomic_t again, since it will be very often manipulated
outside dreq->spin lock.
Signed-off-by: NChuck Lever <cel@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

b1c5921c

23 6月, 2006 3 次提交

[PATCH] vfs: add lock owner argument to flush operation · 75e1fcc0

由 Miklos Szeredi 提交于 6月 23, 2006

Pass the POSIX lock owner ID to the flush operation.

This is useful for filesystems which don't want to store any locking state
in inode->i_flock but want to handle locking/unlocking POSIX locks
internally.  FUSE is one such filesystem but I think it possible that some
network filesystems would need this also.

Also add a flag to indicate that a POSIX locking request was generated by
close(), so filesystems using the above feature won't send an extra locking
request in this case.
Signed-off-by: NMiklos Szeredi <miklos@szeredi.hu>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Cc: Al Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

75e1fcc0

[PATCH] VFS: Permit filesystem to perform statfs with a known root dentry · 726c3342

由 David Howells 提交于 6月 23, 2006

Give the statfs superblock operation a dentry pointer rather than a superblock
pointer.

This complements the get_sb() patch.  That reduced the significance of
sb->s_root, allowing NFS to place a fake root there.  However, NFS does
require a dentry to use as a target for the statfs operation.  This permits
the root in the vfsmount to be used instead.

linux/mount.h has been added where necessary to make allyesconfig build
successfully.

Interest has also been expressed for use with the FUSE and XFS filesystems.
Signed-off-by: NDavid Howells <dhowells@redhat.com>
Acked-by: NAl Viro <viro@zeniv.linux.org.uk>
Cc: Nathan Scott <nathans@sgi.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

726c3342

[PATCH] VFS: Permit filesystem to override root dentry on mount · 454e2398

由 David Howells 提交于 6月 23, 2006

Extend the get_sb() filesystem operation to take an extra argument that
permits the VFS to pass in the target vfsmount that defines the mountpoint.

The filesystem is then required to manually set the superblock and root dentry
pointers.  For most filesystems, this should be done with simple_set_mnt()
which will set the superblock pointer and then set the root dentry to the
superblock's s_root (as per the old default behaviour).

The get_sb() op now returns an integer as there's now no need to return the
superblock pointer.

This patch permits a superblock to be implicitly shared amongst several mount
points, such as can be done with NFS to avoid potential inode aliasing.  In
such a case, simple_set_mnt() would not be called, and instead the mnt_root
and mnt_sb would be set directly.

The patch also makes the following changes:

 (*) the get_sb_*() convenience functions in the core kernel now take a vfsmount
     pointer argument and return an integer, so most filesystems have to change
     very little.

 (*) If one of the convenience function is not used, then get_sb() should
     normally call simple_set_mnt() to instantiate the vfsmount. This will
     always return 0, and so can be tail-called from get_sb().

 (*) generic_shutdown_super() now calls shrink_dcache_sb() to clean up the
     dcache upon superblock destruction rather than shrink_dcache_anon().

     This is required because the superblock may now have multiple trees that
     aren't actually bound to s_root, but that still need to be cleaned up. The
     currently called functions assume that the whole tree is rooted at s_root,
     and that anonymous dentries are not the roots of trees which results in
     dentries being left unculled.

     However, with the way NFS superblock sharing are currently set to be
     implemented, these assumptions are violated: the root of the filesystem is
     simply a dummy dentry and inode (the real inode for '/' may well be
     inaccessible), and all the vfsmounts are rooted on anonymous[*] dentries
     with child trees.

     [*] Anonymous until discovered from another tree.

 (*) The documentation has been adjusted, including the additional bit of
     changing ext2_* into foo_* in the documentation.

[akpm@osdl.org: convert ipath_fs, do other stuff]
Signed-off-by: NDavid Howells <dhowells@redhat.com>
Acked-by: NAl Viro <viro@zeniv.linux.org.uk>
Cc: Nathan Scott <nathans@sgi.com>
Cc: Roland Dreier <rolandd@cisco.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

454e2398

09 6月, 2006 30 次提交

T
NFS: Display the chosen RPCSEC_GSS security flavour in /proc/mounts · 81039f1f
由 Trond Myklebust 提交于 6月 09, 2006
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
81039f1f

NFS: Split fs/nfs/inode.c · f7b422b1

由 David Howells 提交于 6月 09, 2006

As fs/nfs/inode.c is rather large, heterogenous and unwieldy, the attached
patch splits it up into a number of files:

 (*) fs/nfs/inode.c

     Strictly inode specific functions.

 (*) fs/nfs/super.c

     Superblock management functions for NFS and NFS4, normal access, clones
     and referrals.  The NFS4 superblock functions _could_ move out into a
     separate conditionally compiled file, but it's probably not worth it as
     there're so many common bits.

 (*) fs/nfs/namespace.c

     Some namespace-specific functions have been moved here.

 (*) fs/nfs/nfs4namespace.c

     NFS4-specific namespace functions (this could be merged into the previous
     file).  This file is conditionally compiled.

 (*) fs/nfs/internal.h

     Inter-file declarations, plus a few simple utility functions moved from
     fs/nfs/inode.c.

     Additionally, all the in-.c-file externs have been moved here, and those
     files they were moved from now includes this file.

For the most part, the functions have not been changed, only some multiplexor
functions have changed significantly.

I've also:

 (*) Added some extra banner comments above some functions.

 (*) Rearranged the function order within the files to be more logical and
     better grouped (IMO), though someone may prefer a different order.

 (*) Reduced the number of #ifdefs in .c files.

 (*) Added missing __init and __exit directives.
Signed-Off-By: NDavid Howells <dhowells@redhat.com>

f7b422b1

T
NFS: Fix typo in nfs_do_clone_mount() · 4e5ccf60
由 Trond Myklebust 提交于 6月 09, 2006
```
Doh!
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
4e5ccf60
T
NFS: Fix compile errors introduced by referrals patches · 860de071
由 Trond Myklebust 提交于 6月 09, 2006
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
860de071
T
NFSv4: Ensure that referral mounts bind to a reserved port · 87e4ba1a
由 Trond Myklebust 提交于 6月 09, 2006
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
87e4ba1a
A
NFSv4: A root pathname is sent as a zero component4 · 33a43f28
由 Andy Adamson 提交于 6月 09, 2006
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
33a43f28

NFSv4: Follow a referral · 6b97fd3d

由 Manoj Naik 提交于 6月 09, 2006

Respond to a moved error on NFS lookup by setting up the referral.
Note: We don't actually follow the referral during lookup/getattr, but
later when we detect fsid mismatch in inode revalidation (similar to the
processing done for cloning submounts). Referrals will have fake attributes
until they are actually followed or traversed.
Signed-off-by: NManoj Naik <manoj@almaden.ibm.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

6b97fd3d

NFSv4: Ensure client submounts when following a referral · 9cdb3883

由 Manoj Naik 提交于 6月 09, 2006

Set up mountpoint when hitting a referral on moved error by getting
fs_locations.
Signed-off-by: NManoj Naik <manoj@almaden.ibm.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

9cdb3883

NFS: Expand clone mounts to include other servers · 61f5164c

由 Manoj Naik 提交于 6月 09, 2006

Signed-off-by: NManoj Naik <manoj@almaden.ibm.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

61f5164c

NFSv4: Create NFSv4 transport and client · c818ba43

由 Manoj Naik 提交于 6月 09, 2006

Move existing code into a separate function so that it can be also used by
referral code.
Signed-off-by: NManoj Naik <manoj@almaden.ibm.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

c818ba43

NFSv4: Define an fs_locations bitmap · 830b8e33

由 Manoj Naik 提交于 6月 09, 2006

This is (similar to getattr bitmap) but includes fs_locations and
mounted_on_fileid attributes. Use this bitmap for encoding in fs_locations
requests.
Note: We can probably do better by requesting locations as part of fsinfo
itself.
Signed-off-by: NManoj Naik <manoj@almaden.ibm.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

830b8e33

NFSv4: GETATTR attributes on referral · 361e624f

由 Manoj Naik 提交于 6月 09, 2006

Per referral draft, only fs_locations, fsid, and mounted_on_fileid can be
requested in a GETATTR on referrals.
Signed-off-by: NManoj Naik <manoj@almaden.ibm.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

361e624f

NFSv4: Decode mounted_on_fileid attribute in getattr. · 99baf625

由 Manoj Naik 提交于 6月 09, 2006

It is ignored if fileid is also requested. This will be used on referrals
(fs_locations).
Signed-off-by: NManoj Naik <manoj@almaden.ibm.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

99baf625

NFSv4: convert fs-locations-components to conform to RFC3530 · 7aaa0b3b

由 Manoj Naik 提交于 6月 09, 2006

Use component4-style formats for decoding list of servers and pathnames in
fs_locations.
Signed-off-by: NManoj Naik <manoj@almaden.ibm.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

7aaa0b3b

NFSv4: Implement the fs_locations function call · 683b57b4

由 Trond Myklebust 提交于 6月 09, 2006

NFSv4 allows for the fact that filesystems may be replicated across
several servers or that they may be migrated to a backup server in case of
failure of the primary server.
fs_locations is an NFSv4 operation for retrieving information about the
location of migrated and/or replicated filesystems.

Based on an initial implementation by Jiaying Zhang <jiayingz@citi.umich.edu>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

683b57b4

NFS: Add timeout to submounts · 51d8fa6a

由 Trond Myklebust 提交于 6月 09, 2006

Make automounted partitions expire using the mark_mounts_for_expiry()
function. The timeout is controlled via a sysctl.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

51d8fa6a

T
NFS: Ensure the client submounts, when it crosses a server mountpoint. · 55a97593
由 Trond Myklebust 提交于 6月 09, 2006
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
55a97593

NFS: Store the file system "fsid" value in the NFS super block. · 8b4bdcf8

由 Trond Myklebust 提交于 6月 09, 2006

This should enable us to detect if we are crossing a mountpoint in the
case where the server is exporting "nohide" mounts.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

8b4bdcf8

VFS: Remove dependency of ->umount_begin() call on MNT_FORCE · 8b512d9a

由 Trond Myklebust 提交于 6月 09, 2006

Allow filesystems to decide to perform pre-umount processing whether or not
MNT_FORCE is set.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

8b512d9a

NFS: Remove nfs_delete_inode() · da6d503a

由 Trond Myklebust 提交于 6月 01, 2006

Now that we have a real nfs_invalidate_page() to ensure that
truncate_inode_pages() does the right thing when there are pending dirty
pages, we can get rid of nfs_delete_inode().
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

da6d503a

NFS: Flesh out nfs_invalidate_page() · d2ccddf0

由 Trond Myklebust 提交于 5月 31, 2006

In the case of a call to truncate_inode_pages(), we should really try to
cancel any pending writes on the page.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

d2ccddf0

NFSv4: remove obviously bogus comparison from decode_getacl · c04871e6

由 J. Bruce Fields 提交于 5月 30, 2006

We just set *acl_len to zero, and attrlen is unsigned, so this comparison
is clearly bogus.  I have no idea what I was thinking.

Fixes a bug that caused getacl to fail over krb5p.
Signed-off-by: NJ. Bruce Fields <bfields@citi.umich.edu>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

c04871e6

NFSv4: really return status from decode_recall_args() · 3873bc50

由 Alexey Dobriyan 提交于 5月 27, 2006

Signed-off-by: NAlexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

3873bc50

NFSv3: Client-side nfsacl caching fix · 4814f56d

由 Andreas Gruenbacher 提交于 5月 25, 2006

Fix two errors in the client-side acl cache: First, when nfs3_proc_getacl
requests only the default acl of a file and the access acl is not cached
already, a NULL access acl entry is cached instead of ERR_PTR(-EAGAIN)
("not cached").

Second, update the cached acls in nfs3_proc_setacls: nfs_refresh_inode does
not always invalidate the cached acls, and when it does not, the cached acls
get out of sync.
Signed-off-by: NAndreas Gruenbacher <agruen@suse.de>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

4814f56d

NFS: Fix up inode revalidation accounting · 1842bfb4

由 Trond Myklebust 提交于 5月 25, 2006

Currently, we are accounting for all calls to nfs_revalidate_inode(), but not
to nfs_revalidate_mapping(), or nfs_lookup_verify_inode(), etc...
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

1842bfb4

NFS: Separate metadata and page cache revalidation mechanisms · 44b11874

由 Trond Myklebust 提交于 5月 25, 2006

Separate out the function of revalidating the inode metadata, and
revalidating the mapping. The former may be called by lookup(),
and only really needs to check that permissions, ctime, etc haven't changed
whereas the latter needs only done when we want to read data from the page
cache, and may need to sync and then invalidate the mapping.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

44b11874

NFS: More page cache revalidation fixups · 38478b24

由 Trond Myklebust 提交于 5月 25, 2006

Whenever the directory changes, we want to make sure that we always
invalidate its page cache. Fix up update_changeattr() and
nfs_mark_for_revalidate() so that they do so.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

38478b24

NFS: Fix page cache revalidation · f1bb0b92

由 Trond Myklebust 提交于 5月 25, 2006

Fix up a bug in the handling of NFS_INO_REVAL_PAGECACHE: make sure that
nfs_update_inode() clears it when we're sure we're not racing with other
updates.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

f1bb0b92

NFS: Optimize allocation of nfs_read/write_data structures · 0d0b5cb3

由 Chuck Lever 提交于 5月 25, 2006

Clean up use of page_array, and fix an off-by-one error noticed by Tom
Talpey which causes kmalloc calls in cases where using the page_array
is sufficient.

Test plan:
Normal client functional testing with r/wsize=32768.
Signed-off-by: NChuck Lever <cel@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

0d0b5cb3

T
NFS: Clean up inode metadata updates · 73a3d07c
由 Trond Myklebust 提交于 5月 25, 2006
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
73a3d07c

openanolis / cloud-kernel 大约 1 年 前同步成功

openanolis / cloud-kernel
大约 1 年前同步成功