提交 · 774d5f14ee1ecac55f42a84ff35eb00b896b00b6 · openeuler / raspberrypi-kernel

21 5月, 2013 1 次提交

NFSv4.1 Fix a pNFS session draining deadlock · 774d5f14

由 Andy Adamson 提交于 5月 20, 2013

On a CB_RECALL the callback service thread flushes the inode using
filemap_flush prior to scheduling the state manager thread to return the
delegation. When pNFS is used and I/O has not yet gone to the data server
servicing the inode, a LAYOUTGET can preceed the I/O. Unlike the async
filemap_flush call, the LAYOUTGET must proceed to completion.

If the state manager starts to recover data while the inode flush is sending
the LAYOUTGET, a deadlock occurs as the callback service thread holds the
single callback session slot until the flushing is done which blocks the state
manager thread, and the state manager thread has set the session draining bit
which puts the inode flush LAYOUTGET RPC to sleep on the forechannel slot
table waitq.

Separate the draining of the back channel from the draining of the fore channel
by moving the NFS4_SESSION_DRAINING bit from session scope into the fore
and back slot tables. Drain the back channel first allowing the LAYOUTGET
call to proceed (and fail) so the callback service thread frees the callback
slot. Then proceed with draining the forechannel.
Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

774d5f14

07 5月, 2013 1 次提交

NFSv4.1: Ensure that we free the lock stateid on the server · c8b2d0bf

由 Trond Myklebust 提交于 5月 03, 2013

This ensures that the server doesn't need to keep huge numbers of
lock stateids waiting around for the final CLOSE.
See section 8.2.4 in RFC5661.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

c8b2d0bf

23 4月, 2013 1 次提交

NFS: Retry SETCLIENTID with AUTH_SYS instead of AUTH_NONE · 79d852bf

由 Chuck Lever 提交于 4月 22, 2013

Recently I changed the SETCLIENTID code to use AUTH_GSS(krb5i), and
then retry with AUTH_NONE if that didn't work.  This was to enable
Kerberos NFS mounts to work without forcing Linux NFS clients to
have a keytab on hand.

Rick Macklem reports that the FreeBSD server accepts AUTH_NONE only
for NULL operations (thus certainly not for SETCLIENTID).  Falling
back to AUTH_NONE means our proposed 3.10 NFS client will not
interoperate with FreeBSD servers over NFSv4 unless Kerberos is
fully configured on both ends.

If the Linux client falls back to using AUTH_SYS instead for
SETCLIENTID, all should work fine as long as the NFS server is
configured to allow AUTH_SYS for SETCLIENTID.

This may still prevent access to Kerberos-only FreeBSD servers by
Linux clients with no keytab.  Rick is of the opinion that the
security settings the server applies to its pseudo-fs should also
apply to the SETCLIENTID operation.

Linux and Solaris NFS servers do not place that limitation on
SETCLIENTID.  The security settings for the server's pseudo-fs are
determined automatically as the union of security flavors allowed on
real exports, as recommended by RFC 3530bis; and the flavors allowed
for SETCLIENTID are all flavors supported by the respective server
implementation.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

79d852bf

20 4月, 2013 1 次提交

NFSv4: Use the open stateid if the delegation has the wrong mode · 92b40e93

由 Trond Myklebust 提交于 4月 20, 2013

Fix nfs4_select_rw_stateid() so that it chooses the open stateid
(or an all-zero stateid) if the delegation does not match the selected
read/write mode.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

92b40e93

09 4月, 2013 1 次提交

NFSv4: Handle timeouts correctly when probing for lease validity · bc7a05ca

由 Trond Myklebust 提交于 4月 08, 2013

When we send a RENEW or SEQUENCE operation in order to probe if the
lease is still valid, we want it to be able to time out since the
lease we are probing is likely to time out too. Currently, because
we use soft mount semantics for these RPC calls, the return value
is EIO, which causes the state manager to exit with an "unhandled
error" message.
This patch changes the call semantics, so that the RPC layer returns
ETIMEDOUT instead of EIO. We then have the state manager default to
a simple retry instead of exiting.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

bc7a05ca

06 4月, 2013 3 次提交

NFSv4: Fix a memory leak in nfs4_discover_server_trunking · b193d59a

由 Trond Myklebust 提交于 4月 04, 2013

When we assign a new rpc_client to clp->cl_rpcclient, we need to destroy
the old one.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
Cc: Chuck Lever <chuck.lever@oracle.com>
Cc: stable@vger.kernel.org [>=3.7]

b193d59a

NFSv4: Don't clear the machine cred when client establish returns EACCES · 845cbceb

由 Trond Myklebust 提交于 4月 05, 2013

The expected behaviour is that the client will decide at mount time
whether or not to use a krb5i machine cred, or AUTH_NULL.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
Cc: Chuck Lever <chuck.lever@oracle.com>
Cc: Bryan Schumaker <bjschuma@netapp.com>

845cbceb

NFSv4: Fix issues in nfs4_discover_server_trunking · ea33e6c3

由 Trond Myklebust 提交于 4月 05, 2013

- Ensure that we exit with ENOENT if the call to ops->get_clid_cred()
  fails.
- Handle the case where ops->detect_trunking() exits with an
  unexpected error, and return EIO.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

ea33e6c3

30 3月, 2013 1 次提交

NFS: Use "krb5i" to establish NFSv4 state whenever possible · 4edaa308

由 Chuck Lever 提交于 3月 16, 2013

Currently our client uses AUTH_UNIX for state management on Kerberos
NFS mounts in some cases.  For example, if the first mount of a
server specifies "sec=sys," the SETCLIENTID operation is performed
with AUTH_UNIX.  Subsequent mounts using stronger security flavors
can not change the flavor used for lease establishment.  This might
be less security than an administrator was expecting.

Dave Noveck's migration issues draft recommends the use of an
integrity-protecting security flavor for the SETCLIENTID operation.
Let's ignore the mount's sec= setting and use krb5i as the default
security flavor for SETCLIENTID.

If our client can't establish a GSS context (eg. because it doesn't
have a keytab or the server doesn't support Kerberos) we fall back
to using AUTH_NULL.  For an operation that requires a
machine credential (which never represents a particular user)
AUTH_NULL is as secure as AUTH_UNIX.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

4edaa308

29 3月, 2013 1 次提交

NFSv4: Fix another reboot recovery race · 91876b13

由 Trond Myklebust 提交于 3月 28, 2013

If the open_context for the file is not yet fully initialised,
then open recovery cannot succeed, and since nfs4_state_find_open_context
returns an ENOENT, we end up treating the file as being irrecoverable.

What we really want to do, is just defer the recovery until later.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

91876b13

26 3月, 2013 4 次提交

NFSv4.1: Select the "most recent locking state" for read/write/setattr stateids · 3b66486c

由 Trond Myklebust 提交于 3月 17, 2013

Follow the practice described in section 8.2.2 of RFC5661: When sending a
read/write or setattr stateid, set the seqid field to zero in order to
signal that the NFS server should apply the most recent locking state.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

3b66486c

NFSv4: Resend the READ/WRITE RPC call if a stateid change causes an error · 5521abfd

由 Trond Myklebust 提交于 3月 16, 2013

Adds logic to ensure that if the server returns a BAD_STATEID,
or other state related error, then we check if the stateid has
already changed. If it has, then rather than start state recovery,
we should just resend the failed RPC call with the new stateid.

Allow nfs4_select_rw_stateid to notify that the stateid is unstable by
having it return -EWOULDBLOCK if an RPC is underway that might change the
stateid.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

5521abfd

NFS: Don't accept more reads/writes if the open context recovery failed · c58c8441

由 Trond Myklebust 提交于 3月 18, 2013

If the state recovery failed, we want to ensure that the application
doesn't try to use the same file descriptor for more reads or writes.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

c58c8441

NFSv4: Fail I/O if the state recovery fails irrevocably · 5d422301

由 Trond Myklebust 提交于 3月 14, 2013

If state recovery fails with an ESTALE or a ENOENT, then we shouldn't
keep retrying. Instead, mark the stateid as being invalid and
fail the I/O with an EIO error.
For other operations such as POSIX and BSD file locking, truncate
etc, fail with an EBADF to indicate that this file descriptor is no
longer valid.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

5d422301

12 2月, 2013 2 次提交

NFSv4: Ensure delegation recall and byte range lock removal don't conflict · 65b62a29

由 Trond Myklebust 提交于 2月 07, 2013

Add a mutex to the struct nfs4_state_owner to ensure that delegation
recall doesn't conflict with byte range lock removal.

Note that we nest the new mutex _outside_ the state manager reclaim
protection (nfsi->rwsem) in order to avoid deadlocks.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

65b62a29

NFSv4: Allow the state manager to mark an open_owner as being recovered · c137afab

由 Trond Myklebust 提交于 2月 07, 2013

This patch adds a seqcount_t lock for use by the state manager to
signal that an open owner has been recovered. This mechanism will be
used by the delegation, open and byte range lock code in order to
figure out if they need to replay requests due to collisions with
lock recovery.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

c137afab

31 1月, 2013 1 次提交

NFSv4.1: Handle NFS4ERR_DELAY when resetting the NFSv4.1 session · c489ee29

由 Trond Myklebust 提交于 1月 30, 2013

NFS4ERR_DELAY is a legal reply when we call DESTROY_SESSION. It
usually means that the server is busy handling an unfinished RPC
request. Just sleep for a second and then retry.
We also need to be able to handle the NFS4ERR_BACK_CHAN_BUSY return
value. If the NFS server has outstanding callbacks, we just want to
similarly sleep & retry.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
Cc: stable@vger.kernel.org

c489ee29

28 1月, 2013 1 次提交

NFSv4: Fix NFSv4 trunking discovery · 202c312d

由 Trond Myklebust 提交于 1月 18, 2013

If walking the list in nfs4[01]_walk_client_list fails, then the most
likely explanation is that the server dropped the clientid before we
actually managed to confirm it. As long as our nfs_client is the very
last one in the list to be tested, the caller can be assured that this
is the case when the final return value is NFS4ERR_STALE_CLIENTID.
Reported-by: NBen Greear <greearb@candelatech.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
Cc: Chuck Lever <chuck.lever@oracle.com>
Cc: stable@vger.kernel.org [>=3.7]
Tested-by: NBen Greear <greearb@candelatech.com>

202c312d

13 12月, 2012 2 次提交

nfs: Remove unused list nfs4_clientid_list · 48d7a576

由 Yanchuan Nian 提交于 12月 13, 2012

This list was designed to store struct nfs4_client in the client side.
But nfs4_client was obsolete and has been removed from the source code.
So remove the unused list.
Signed-off-by: NYanchuan Nian <ycnian@gmail.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

48d7a576

SUNRPC handle EKEYEXPIRED in call_refreshresult · eb96d5c9

由 Andy Adamson 提交于 11月 27, 2012

Currently, when an RPCSEC_GSS context has expired or is non-existent
and the users (Kerberos) credentials have also expired or are non-existent,
the client receives the -EKEYEXPIRED error and tries to refresh the context
forever. If an application is performing I/O, or other work against the share,
the application hangs, and the user is not prompted to refresh/establish their
credentials. This can result in a denial of service for other users.

Users are expected to manage their Kerberos credential lifetimes to mitigate
this issue.

Move the -EKEYEXPIRED handling into the RPC layer. Try tk_cred_retry number
of times to refresh the gss_context, and then return -EACCES to the application.
Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

eb96d5c9

06 12月, 2012 11 次提交

NFSv4.1: Ensure smooth handover of slots from one task to the next waiting · b75ad4cd

由 Trond Myklebust 提交于 11月 29, 2012

Currently, we see a lot of bouncing for the value of highest_used_slotid
due to the fact that slots are getting freed, instead of getting instantly
transmitted to the next waiting task.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

b75ad4cd

NFSv4.1: Remove the 'FIFO' behaviour for nfs41_setup_sequence · 275e7e20

由 Trond Myklebust 提交于 11月 01, 2012

It is more important to preserve the task priority behaviour, which ensures
that things like reclaim writes take precedence over background and kupdate
writes.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

275e7e20

NFSv4.1: Ping server when our session table limits are too high · c10e4498

由 Trond Myklebust 提交于 11月 26, 2012

If the server requests a lower target_highest_slotid, then ensure
that we ping it with at least one RPC call containing an
appropriate SEQUENCE op. This ensures that the server won't need to
send a recall callback in order to shrink the slot table.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

c10e4498

T
NFSv4.1: Move slot table and session struct definitions to nfs4session.h · 76e697ba
由 Trond Myklebust 提交于 11月 26, 2012
```
Clean up. Gather NFSv4.1 slot definitions in fs/nfs/nfs4session.h.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
76e697ba

NFSv4: Move nfs4_wait_clnt_recover and nfs4_client_recover_expired_lease · 33021279

由 Trond Myklebust 提交于 11月 26, 2012

nfs4_wait_clnt_recover and nfs4_client_recover_expired_lease are both
generic state related functions. As such, they belong in nfs4state.c,
and not nfs4proc.c
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

33021279

NFSv4.1: Clean up session draining · 5d63360d

由 Trond Myklebust 提交于 11月 23, 2012

Coalesce nfs4_check_drain_bc_complete and nfs4_check_drain_fc_complete
into a single function that can be called when the slot table is known
to be empty, then change nfs4_callback_free_slot() and nfs4_free_slot()
to use it.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

5d63360d

NFSv4.1: CB_RECALL_SLOT must schedule a sequence op after updating targets · ac074835

由 Trond Myklebust 提交于 11月 21, 2012

RFC5661 requires us to make sure that the server knows we've updated
our slot table size by sending at least one SEQUENCE op containing the
new 'highest_slotid' value.
We can do so using the 'CHECK_LEASE' functionality of the state
manager.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

ac074835

NFSv4.1: Remove the state manager code to resize the slot table · afa29610

由 Trond Myklebust 提交于 11月 20, 2012

The state manager no longer needs any special machinery to stop the
session flow and resize the slot table. It is all done on the fly by
the SEQUENCE op code now.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

afa29610

NFSv4.1: Allow SEQUENCE to resize the slot table on the fly · 87dda67e

由 Trond Myklebust 提交于 11月 20, 2012

Instead of an array of slots, use a singly linked list of slots that
can be dynamically appended to or shrunk.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

87dda67e

NFSv4.1: Support dynamic resizing of the session slot table · 97e548a9

由 Trond Myklebust 提交于 11月 20, 2012

Allow the server to control the size of the session slot table
by adjusting the value of sr_target_max_slots in the reply to the
SEQUENCE operation.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

97e548a9

NFSv4.1: Ensure that the client tracks the server target_highest_slotid · 464ee9f9

由 Trond Myklebust 提交于 11月 20, 2012

Dynamic slot allocation in NFSv4.1 depends on the client being able to
track the server's target value for the highest slotid in the
slot table.  See the reference in Section 2.10.6.1 of RFC5661.

To avoid ordering problems in the case where 2 SEQUENCE replies contain
conflicting updates to this target value, we also introduce a generation
counter, to track whether or not an RPC containing a SEQUENCE operation
was launched before or after the last update.

Also rename the nfs4_slot_table target_max_slots field to
'target_highest_slotid' to avoid confusion with a slot
table size or number of slots.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

464ee9f9

27 11月, 2012 1 次提交

NFSv4.1: Shrink struct nfs4_sequence_res by moving the session pointer · e3725ec0

由 Trond Myklebust 提交于 11月 16, 2012

Move the session pointer into the slot table, then have struct nfs4_slot
point to that slot table.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

e3725ec0

21 11月, 2012 2 次提交

T
NFSv4.1: clean up nfs4_recall_slot to use nfs4_alloc_slots · 9216106a
由 Trond Myklebust 提交于 11月 19, 2012
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
9216106a

NFSv4.1: Handle session reset and bind_conn_to_session before lease check · 5df904ae

由 Trond Myklebust 提交于 11月 21, 2012

We can't send a SEQUENCE op unless the session is OK, so it is pointless
to handle the CHECK_LEASE state before we've dealt with SESSION_RESET
and BIND_CONN_TO_SESSION.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

5df904ae

05 11月, 2012 1 次提交
- T
  NFSv4: Get rid of unnecessary BUG_ON()s · 4ea8fed5
  由 Trond Myklebust 提交于 10月 15, 2012
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
  4ea8fed5
03 10月, 2012 1 次提交

NFSv4.1: don't do two EXCHANGE_IDs on mount · fd483570

由 Weston Andros Adamson 提交于 10月 02, 2012

Since the addition of NFSv4 server trunking detection the mount context
calls nfs4_proc_exchange_id then schedules the state manager, which also
calls nfs4_proc_exchange_id. Setting the NFS4CLNT_LEASE_CONFIRM bit
makes the state manager skip the unneeded EXCHANGE_ID and continue on
with session creation.
Reported-by: NJorge Mora <mora@netapp.com>
Signed-off-by: NWeston Andros Adamson <dros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

fd483570

02 10月, 2012 4 次提交

NFSv4.0 reclaim reboot state when re-establishing clientid · 47b803c8

由 Andy Adamson 提交于 10月 01, 2012

We should reclaim reboot state when the clientid is stale.
Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

47b803c8

T
NFSv4: Fix up a merge conflict between migration and container changes · 9f62387d
由 Trond Myklebust 提交于 10月 01, 2012
```
nfs_callback_tcpport is now per-net_namespace.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
9f62387d

NFS: Discover NFSv4 server trunking when mounting · 05f4c350

由 Chuck Lever 提交于 9月 14, 2012

"Server trunking" is a fancy named for a multi-homed NFS server.
Trunking might occur if a client sends NFS requests for a single
workload to multiple network interfaces on the same server.  There
are some implications for NFSv4 state management that make it useful
for a client to know if a single NFSv4 server instance is
multi-homed.  (Note this is only a consideration for NFSv4, not for
legacy versions of NFS, which are stateless).

If a client cares about server trunking, no NFSv4 operations can
proceed until that client determines who it is talking to.  Thus
server IP trunking discovery must be done when the client first
encounters an unfamiliar server IP address.

The nfs_get_client() function walks the nfs_client_list and matches
on server IP address.  The outcome of that walk tells us immediately
if we have an unfamiliar server IP address.  It invokes
nfs_init_client() in this case.  Thus, nfs4_init_client() is a good
spot to perform trunking discovery.

Discovery requires a client to establish a fresh client ID, so our
client will now send SETCLIENTID or EXCHANGE_ID as the first NFS
operation after a successful ping, rather than waiting for an
application to perform an operation that requires NFSv4 state.

The exact process for detecting trunking is different for NFSv4.0 and
NFSv4.1, so a minorversion-specific init_client callout method is
introduced.

CLID_INUSE recovery is important for the trunking discovery process.
CLID_INUSE is a sign the server recognizes the client's nfs_client_id4
id string, but the client is using the wrong principal this time for
the SETCLIENTID operation.  The SETCLIENTID must be retried with a
series of different principals until one works, and then the rest of
trunking discovery can proceed.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

05f4c350

NFS: Slow down state manager after an unhandled error · ffe5a830

由 Chuck Lever 提交于 9月 14, 2012

If the state manager thread is not actually able to fully recover from
some situation, it wakes up waiters, who kick off a new state manager
thread.  Quite often the fresh invocation of the state manager is just
as successful.

This results in a livelock as the client dumps thousands of NFS
requests a second on the network in a vain attempt to recover.  Not
very friendly.

To mitigate this situation, add a delay in the state manager after
an unhandled error, so that the client sends just a few requests
every second in this case.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

ffe5a830