提交 · d8cb1a7ce36d44602946f06af4267da304fb4011 · openanolis / cloud-kernel

06 12月, 2009 5 次提交

nfs41: check if session exists and if it is persistent · d8cb1a7c

由 Alexandros Batsakis 提交于 12月 05, 2009

Signed-off-by: NAlexandros Batsakis <batsakis@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

d8cb1a7c

nfs41: V2 initial support for CB_RECALL_ANY · 31f09607

由 Alexandros Batsakis 提交于 12月 05, 2009

For now the clients returns _all_ the delegations of the specificed type
it holds
Signed-off-by: NAlexandros Batsakis <batsakis@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

31f09607

nfs4: V2 return/expire delegations depending on their type · c79571a5

由 Alexandros Batsakis 提交于 12月 05, 2009

Signed-off-by: NAlexandros Batsakis <batsakis@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

c79571a5

nfs4: minor delegation cleaning · b4a6f496

由 Alexandros Batsakis 提交于 12月 05, 2009

Signed-off-by: NAlexandros Batsakis <batsakis@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

b4a6f496

nfs41: add support for callback with RPC version number 4 · 07bccc2d

由 Alexandros Batsakis 提交于 12月 05, 2009

The NFSv4.1 spec-29 (18.36.3) says that the server MUST use an ONC RPC
(program) version number equal to 4 in callbacks sent to the client.
For now we allow both versions 1 and 4.
Signed-off-by: NAlexandros Batsakis <batsakis@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

07bccc2d

05 12月, 2009 12 次提交

nfs41: only state manager sets NFS4CLNT_SESSION_SETUP · 0b9e2d41

由 Andy Adamson 提交于 12月 04, 2009

Replace sync and async handlers setting of the NFS4CLNT_SESSION_SETUP bit with
setting NFS4CLNT_CHECK_LEASE, and let the state manager decide to reset the session.
Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

0b9e2d41

nfs41: drain session cleanup · 691daf3b

由 Andy Adamson 提交于 12月 04, 2009

Do not wake up the next slot_tbl_waitq task in nfs4_free_slot because we
may be draining the slot. Either signal the state manager that the session
is drained (the state manager wakes up tasks) OR wake up the next task.

In nfs41_sequence_done, the slot dereference is only needed in the sequence
operation success case.
Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

691daf3b

nfs41: nfs41: fix state manager deadlock in session reset · ea028ac9

由 Andy Adamson 提交于 12月 04, 2009

If the session is reset during state recovery, the state manager thread can
sleep on the slot_tbl_waitq causing a deadlock.

Add a completion framework to the session.  Have the state manager thread set
a new session state (NFS4CLNT_SESSION_DRAINING) and wait for the session slot
table to drain.

Signal the state manager thread in nfs41_sequence_free_slot when the
NFS4CLNT_SESSION_DRAINING bit is set and the session is drained.
Reported-by: NTrond Myklebust <trond@netapp.com>
Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

ea028ac9

nfs41: remove nfs4_recover_session · 05f0d236

由 Andy Adamson 提交于 12月 04, 2009

nfs4_recover_session can put rpciod to sleep. Just use nfs4_schedule_recovery.
Reported-by: NTrond Myklebust <trond.myklebust@netapp.com>
Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

05f0d236

nfs41: don't clear tk_action on success · 2628eddf

由 Andy Adamson 提交于 12月 04, 2009

Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

2628eddf

nfs41: fix switch in nfs4_recovery_handle_error · 8ba9bf8e

由 Andy Adamson 提交于 12月 04, 2009

Do not fall through and set NFS4CLNT_SESSION_RESET bit on NFS4ERR_EXPIRED
Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

8ba9bf8e

nfs41: fix switch in nfs4_handle_exception · b9179237

由 Andy Adamson 提交于 12月 04, 2009

Do not fall through and call nfs4_delay on session error handling.
Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

b9179237

nfs41: free the slot on unhandled read errors · 36bbe342

由 Andy Adamson 提交于 12月 04, 2009

nfs4_read_done returns zero on unhandled errors. nfs_readpage_result will
return on a negative tk_status without freeing the slot.
Call nfs4_sequence_free_slot on unhandled errors in nfs4_read_done.
Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

36bbe342

nfs41: call free slot from nfs4_restart_rpc · e608e79f

由 Andy Adamson 提交于 12月 04, 2009

nfs41_sequence_free_slot can be called multiple times on SEQUENCE operation
errors.
No reason to inline nfs4_restart_rpc
Reported-by: NTrond Myklebust <trond.myklebust@netapp.com>

nfs_writeback_done and nfs_readpage_retry call nfs4_restart_rpc outside the
error handler, and the slot is not freed prior to restarting in the rpc_prepare
state during session reset.

Fix this by moving the call to nfs41_sequence_free_slot from the error
path of nfs41_sequence_done into nfs4_restart_rpc, and by removing the test
for NFS4CLNT_SESSION_SETUP.
Always free slot and goto the rpc prepare state on async errors.
Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

e608e79f

nfs41: nfs4_get_lease_time will never session reset · 1d9ddde9

由 Andy Adamson 提交于 12月 04, 2009

Make this clear by calling rpc_restart-call.
Prepare for nfs4_restart_rpc() to free slots.
Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

1d9ddde9

nfs41: rename cl_state session SETUP bit to RESET · 6df08189

由 Andy Adamson 提交于 12月 04, 2009

The bit is no longer used for session setup, only for session reset.
Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

6df08189

nfs41: add create session into establish_clid · 4d643d1d

由 Andy Adamson 提交于 12月 04, 2009

Reported-by: NTrond Myklebust <trond.myklebust@netapp.com>

Resetting the clientid from the state manager could result in not confirming
the clientid due to create session not being called.

Move the create session call from the NFS4CLNT_SESSION_SETUP state manager
initialize session case into the NFS4CLNT_LEASE_EXPIRED case establish_clid
call.
Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

4d643d1d

04 12月, 2009 23 次提交

T

Merge branch 'devel' into linux-next · 7285f2d2
由 Trond Myklebust 提交于 12月 03, 2009

7285f2d2

NFS4ERR_FILE_OPEN handling in Linux/NFS · 44ed3556

由 NeilBrown 提交于 12月 03, 2009

NFS4ERR_FILE_OPEN is return by the server when an operation cannot be
performed because the file is currently open and local (to the server)
semantics prohibit the operation while the file is open.
A typical case is a RENAME operation on an MS-Windows platform, which
prevents rename while the file is open.

While it is possible that such a condition is transitory, it is also
very possible that the file will be held open for an extended period
of time thus preventing the operation.

The current behaviour of Linux/NFS is to retry the operation
indefinitely.  This is not appropriate - we do not expect a rename to
take an arbitrary amount of time to complete.

Rather, and error should be returned.  The most obvious error code
would be EBUSY, which is a legal at least for 'rename' and 'unlink',
and accurately captures the reason for the error.

This patch allows a few retries until about 2 seconds have elapsed,
then returns EBUSY.
Signed-off-by: NNeilBrown <neilb@suse.de>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

44ed3556

T

Merge branch 'bugfixes' into nfs-for-next · 0b08b075
由 Trond Myklebust 提交于 12月 03, 2009

0b08b075

nfs: clean up sillyrenaming in nfs_rename() · 24e93025

由 Miklos Szeredi 提交于 12月 03, 2009

The d_instantiate(new_dentry, NULL) is superfluous, the dentry is
already negative.  Rehashing this dummy dentry isn't needed either,
d_move() works fine on an unhashed target.

The re-checking for busy after a failed nfs_sillyrename() is bogus
too: new_dentry->d_count < 2 would be a bug here.
Signed-off-by: NMiklos Szeredi <mszeredi@suse.cz>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

24e93025

nfs: dont unhash target if renaming a directory · 27226104

由 Miklos Szeredi 提交于 12月 03, 2009

Move unhashing the target to after the check for existence and being a
non-directory.

If renaming a directory then the VFS already unhashes the target if it
is not busy.  If it's busy then acquiring more references during the
rename makes no difference.
Signed-off-by: NMiklos Szeredi <mszeredi@suse.cz>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

27226104

nfs: fix comments in nfs_rename() · 28f79a1a

由 Miklos Szeredi 提交于 12月 03, 2009

Comments are wrong or out of date.  In particular d_drop() doesn't
free the inode it just unhashes the dentry.  And if target is a
directory then it is not checked for being busy.
Signed-off-by: NMiklos Szeredi <mszeredi@suse.cz>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

28f79a1a

nfs: remove unnecessary check from nfs_rename() · e48de5ec

由 Miklos Szeredi 提交于 12月 03, 2009

VFS already checks if both source and target are directories.
Signed-off-by: NMiklos Szeredi <mszeredi@suse.cz>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

e48de5ec

T
NFSv4.1: Handle NFSv4.1 session errors in the lock recovery code · 9c4c761a
由 Trond Myklebust 提交于 12月 03, 2009
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
9c4c761a

SUNRPC: soft connect semantics for UDP · 3a28becc

由 Chuck Lever 提交于 12月 03, 2009

Introduce soft connect behavior for UDP transports.  In this case, a
major timeout returns ETIMEDOUT instead of EIO.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

3a28becc

SUNRPC: Use soft connect semantics when performing RPC ping · caabea8a

由 Chuck Lever 提交于 12月 03, 2009

Currently, if a remote RPC service is unreachable, an RPC ping will
hang until the underlying transport connect attempt times out.  A more
desirable behavior might be to have the ping fail immediately so upper
layers can recover appropriately.

In the case of an NFS mount, for instance, this would mean the
mount(2) system call could fail immediately if the server isn't
listening, rather than hanging uninterruptibly for more than 3
minutes.

Change rpc_ping() so that it fails immediately for connection-oriented
transports.  rpc_create() will then fail immediately for such
transports if an RPC ping was requested.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

caabea8a

SUNRPC: Use soft connects for autobinding over TCP · 012da158

由 Chuck Lever 提交于 12月 03, 2009

Autobinding is handled by the rpciod process, not in user processes
that are generating regular RPC requests.  Thus autobinding is usually
not affected by signals targetting user processes, such as KILL or
timer expiration events.

In addition, an RPC request generated by a user process that has
RPC_TASK_SOFTCONN set and needs to perform an autobind will hang if
the remote rpcbind service is not available.

For rpcbind queries on connection-oriented transports, let's use the
new soft connect semantic to return control to the user's process
quickly, if the kernel's rpcbind client can't connect to the remote
rpcbind service.

Logic is introduced in call_bind_status() to handle connection errors
that occurred during an asynchronous rpcbind query.  The logic
abandons the rpcbind query if the RPC request has SOFTCONN set, and
retries after a few seconds in the normal case.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

012da158

SUNRPC: Use TCP for local rpcbind upcalls · 2a76b3bf

由 Chuck Lever 提交于 12月 03, 2009

Use TCP with the soft connect semantic for local rpcbind upcalls so
the kernel can detect immediately if the local rpcbind daemon is not
running.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>

2a76b3bf

SUNRPC: Use a cached RPC client and transport for rpcbind upcalls · c526611d

由 Chuck Lever 提交于 12月 03, 2009

The kernel's rpcbind client creates and deletes an rpc_clnt and its
underlying transport socket for every upcall to the local rpcbind
daemon.

When starting a typical NFS server on IPv4 and IPv6, the NFS service
itself does three upcalls (one per version) times two upcalls (one
per transport) times two upcalls (one per address family), making 12,
plus another one for the initial call to unregister previous NFS
services.  Starting the NLM service adds an additional 13 upcalls,
for similar reasons.

(Currently the NFS service doesn't start IPv6 listeners, but it will
soon enough).

Instead, let's create an rpc_clnt for rpcbind upcalls during the
first local rpcbind query, and cache it.  This saves the overhead of
creating and destroying an rpc_clnt and a socket for every upcall.

The new logic also prevents the kernel from attempting an RPCB_SET or
RPCB_UNSET if it knows from the start that the local portmapper does
not support rpcbind protocol version 4.  This will cut down on the
number of rpcbind upcalls in legacy environments.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>

c526611d

SUNRPC: Simplify synopsis of rpcb_local_clnt() · 5a462115

由 Chuck Lever 提交于 12月 03, 2009

Clean up: At one point, rpcb_local_clnt() handled IPv6 loopback
addresses too, but it doesn't any more; only IPv4 loopback is used
now.  Get rid of the @addr and @addrlen arguments to
rpcb_local_clnt().
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

5a462115

SUNRPC: Allow RPCs to fail quickly if the server is unreachable · 09a21c41

由 Chuck Lever 提交于 12月 03, 2009

The kernel sometimes makes RPC calls to services that aren't running.
Because the kernel's RPC client always assumes the hard retry semantic
when reconnecting a connection-oriented RPC transport, the underlying
reconnect logic takes a long while to time out, even though the remote
may have responded immediately with ECONNREFUSED.

In certain cases, like upcalls to our local rpcbind daemon, or for NFS
mount requests, we'd like the kernel to fail immediately if the remote
service isn't reachable.  This allows another transport to be tried
immediately, or the pending request can be abandoned quickly.

Introduce a per-request flag which controls how call_transmit_status()
behaves when request transmission fails because the server cannot be
reached.

We don't want soft connection semantics to apply to other errors.  The
default case of the switch statement in call_transmit_status() no
longer falls through; the fall through code is copied to the default
case, and a "break;" is added.

The transport's connection re-establishment timeout is also ignored for
such requests.  We want the request to fail immediately, so the
reconnect delay is skipped.  Additionally, we don't want a connect
failure here to further increase the reconnect timeout value, since
this request will not be retried.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

09a21c41

SUNRPC: Check explicitly for tk_status == 0 in call_transmit_status() · 206a134b

由 Chuck Lever 提交于 12月 03, 2009

The success case, where task->tk_status == 0, is by far the most
frequent case in call_transmit_status().

The default: arm of the switch statement in call_transmit_status()
handles the 0 case.  default: was moved close to the top of the switch
statement in call_transmit_status() under the theory that the compiler
places object code for the earliest arms of a switch statement first,
making the CPU do less work.

The default: arm of a switch statement, however, is executed only
after all the other cases have been checked.  Even if the compiler
rearranges the object code, the default: arm is the "last resort",
meaning all of the other cases have been explicitly exhausted.  That
makes the current arrangement about as inefficient as it gets for the
common case.

To fix this, add an explicit check for zero before the switch
statement.  That forces the compiler to do the zero check first, no
matter what optimizations it might try to do to the switch statement.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

206a134b

NFS: Revert default r/wsize behavior · dd47f96c

由 Chuck Lever 提交于 12月 03, 2009

When the "rsize=" or "wsize=" mount options are not specified,
text-based mounts have slightly different behavior than legacy binary
mounts.  Text-based mounts use the smaller of the server's maximum
and the client's maximum, but binary mounts use the smaller of the
server's _preferred_ size and the client's maximum.

This difference is actually pretty subtle.  Most servers advertise
the same value as their maximum and their preferred transfer size, so
the end result is the same in most cases.

The reason for this difference is that for text-based mounts, if
r/wsize are not specified, they are set to the largest value supported
by the client.  For legacy mounts, the values are set to zero if these
options are not specified.

nfs_server_set_fsinfo() can negotiate the transfer size defaults
correctly in any case.  There's no need to specify any particular
value as default in the text-based option parsing logic.

Note that nfs4 doesn't use nfs_server_set_fsinfo(), but the mount.nfs4
command does set rsize and wsize to 0 if the user didn't specify these
options.  So, make the same change for text-based NFSv4 mounts.

Thanks to James Pearson <james-p@moving-picture.com> for reporting and
diagnosing the problem.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

dd47f96c

NFS: Display compressed (shorthand) IPv6 in /proc/mounts · d250e190

由 Chuck Lever 提交于 12月 03, 2009

Recent changes to snprintf() introduced the %pI6c formatter, which can
display an IPv6 address with standard shorthanding. Use this new
formatter when displaying IPv6 server addresses in /proc/mounts.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

d250e190

SUNRPC: Display compressed (shorthand) IPv6 presentation addresses · dd1fd90f

由 Chuck Lever 提交于 12月 03, 2009

Recent changes to snprintf() introduced the %pI6c formatter, which can
display an IPv6 address with standard shorthanding.  Using a
shorthanded address can save us a few bytes of memory for each stored
presentation address, or a few bytes on the wire when sending these in
a universal address.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

dd1fd90f

NFS: reorder nfs4_sequence_regs to remove 8 bytes of padding on 64 bits · a01878aa

由 Richard Kennedy 提交于 12月 03, 2009

reorder nfs4_sequence_args to remove 8 bytes of padding on 64 bit
builds.

The size of this structure drops to 24 bytes from 32 and reduces the
text size of nfs.ko.
On my x86_64 size reports

		text       data     bss
2.6.32-rc5 	200996	   8512	    432	 209940	  33414	nfs.ko
+patch 		200884	   8512	    432	 209828	  333a4	nfs.ko
Signed-off-by: NRichard Kennedy <richard@rsk.demon.co.uk>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

a01878aa

NFS: convert proto= option to use netids rather than a protoname · ee671b01

由 Jeff Layton 提交于 12月 03, 2009

Solaris uses netids as values for the proto= option, so that when
someone specifies "tcp6" they get traffic over TCP + IPv6. Until
recently, this has never really been an issue for Linux since it didn't
support NFS over IPv6. The netid and the protocol name were generally
always the same (modulo any strange configuration in /etc/netconfig).

The solaris manpage documents their proto= option as:

    proto= _netid_ | rdma

This patch is intended to bring Linux closer to how the Solaris proto=
option works, by declaring a static netid mapping in the kernel and
converting the proto= and mountproto= options to follow it and display
the proper values in /proc/mounts.

Much of this functionality will need to be provided by a userspace
mount.nfs patch. Chuck Lever has a patch to change mount.nfs in
the same way. In principle, we could do *all* of this in userspace but
that would mean that the options in /proc/mounts may not match the
options used by userspace.

The alternative to the static mapping here is to add a mechanism to
upcall to userspace for netid's. I'm not opposed to that option, but
it'll probably mean more overhead (and quite a bit more code). Rather
than shoot for that at first, I figured it was probably better to
start simply.

Comments welcome.
Signed-off-by: NJeff Layton <jlayton@redhat.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

ee671b01

J
The rpc server does not require that service threads take the BKL. · d4e935bd
由 J. Bruce Fields 提交于 12月 03, 2009
```
Signed-off-by: NJ. Bruce Fields <bfields@citi.umich.edu>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
d4e935bd
T
NFSv4: Ensure nfs4_close_context() is declared as static · 1185a552
由 Trond Myklebust 提交于 12月 03, 2009
```
Fix another 'sparse' warning in fs/nfs/nfs4proc.c
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
1185a552

openanolis / cloud-kernel 接近 2 年 前同步成功

openanolis / cloud-kernel
接近 2 年前同步成功