提交 · bc23676caf54c9b6e2521ef065dfddf6c50211de · openeuler / Kernel

25 6月, 2016 1 次提交

NFSv4.1/pnfs: Ensure we handle delegation errors in nfs4_proc_layoutget() · bc23676c

由 Trond Myklebust 提交于 6月 17, 2016

nfs4_handle_exception() relies on the caller setting the 'inode' field
in the struct nfs4_exception argument when the error applies to a
delegation.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Reviewed-by: NJeff Layton <jlayton@poochiereds.net>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

bc23676c

28 5月, 2016 1 次提交

switch xattr_handler->set() to passing dentry and inode separately · 59301226

由 Al Viro 提交于 5月 27, 2016

preparation for similar switch in ->setxattr() (see the next commit for
rationale).
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

59301226

18 5月, 2016 7 次提交

pnfs: rework LAYOUTGET retry handling · 183d9e7b

由 Jeff Layton 提交于 5月 17, 2016

There are several problems in the way a stateid is selected for a
LAYOUTGET operation:

We pick a stateid to use in the RPC prepare op, but that makes
it difficult to serialize LAYOUTGETs that use the open stateid. That
serialization is done in pnfs_update_layout, which occurs well before
the rpc_prepare operation.

Between those two events, the i_lock is dropped and reacquired.
pnfs_update_layout can find that the list has lsegs in it and not do any
serialization, but then later pnfs_choose_layoutget_stateid ends up
choosing the open stateid.

This patch changes the client to select the stateid to use in the
LAYOUTGET earlier, when we're searching for a usable layout segment.
This way we can do it all while holding the i_lock the first time, and
ensure that we serialize any LAYOUTGET call that uses a non-layout
stateid.

This also means a rework of how LAYOUTGET replies are handled, as we
must now get the latest stateid if we want to retransmit in response
to a retryable error.

Most of those errors boil down to the fact that the layout state has
changed in some fashion. Thus, what we really want to do is to re-search
for a layout when it fails with a retryable error, so that we can avoid
reissuing the RPC at all if possible.

While the LAYOUTGET RPC is async, the initiating thread always waits for
it to complete, so it's effectively synchronous anyway. Currently, when
we need to retry a LAYOUTGET because of an error, we drive that retry
via the rpc state machine.

This means that once the call has been submitted, it runs until it
completes. So, we must move the error handling for this RPC out of the
rpc_call_done operation and into the caller.

In order to handle errors like NFS4ERR_DELAY properly, we must also
pass a pointer to the sliding timeout, which is now moved to the stack
in pnfs_update_layout.

The complicating errors are -NFS4ERR_RECALLCONFLICT and
-NFS4ERR_LAYOUTTRYLATER, as those involve a timeout after which we give
up and return NULL back to the caller. So, there is some special
handling for those errors to ensure that the layers driving the retries
can handle that appropriately.
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

183d9e7b

pnfs: only tear down lsegs that precede seqid in LAYOUTRETURN args · 6d597e17

由 Jeff Layton 提交于 5月 17, 2016

LAYOUTRETURN is "special" in that servers and clients are expected to
work with old stateids. When the client sends a LAYOUTRETURN with an old
stateid in it then the server is expected to only tear down layout
segments that were present when that seqid was current. Ensure that the
client handles its accounting accordingly.
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

6d597e17

NFSv4: Use the right stateid for delegations in setattr, read and write · abf4e13c

由 Trond Myklebust 提交于 5月 16, 2016

When we're using a delegation to represent our open state, we should
ensure that we use the stateid that was used to create that delegation.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

abf4e13c

NFSv4: Label stateids with the type · 93b717fd

由 Trond Myklebust 提交于 5月 16, 2016

In order to more easily distinguish what kind of stateid we are dealing
with, introduce a type that can be used to label the stateid structure.

The label will be useful both for debugging, but also when dealing with
operations like SETATTR, READ and WRITE that can take several different
types of stateid as arguments.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

93b717fd

sunrpc: Advertise maximum backchannel payload size · 6b26cc8c

由 Chuck Lever 提交于 5月 02, 2016

RPC-over-RDMA transports have a limit on how large a backward
direction (backchannel) RPC message can be. Ensure that the NFSv4.x
CREATE_SESSION operation advertises this limit to servers.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Tested-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

6b26cc8c

nfs4: client: do not send empty SETATTR after OPEN_CREATE · a1d1c4f1

由 Tigran Mkrtchyan 提交于 5月 12, 2016

OPEN_CREATE with EXCLUSIVE4_1 sends initial file permission.
Ignoring  fact, that server have indicated that file mod is set, client
will send yet another SETATTR request, but, as mode is already set,
new SETATTR will be empty. This is not a problem, nevertheless
an extra roundtrip and slow open on high latency networks.

This change is aims to skip extra setattr after open  if there are
no attributes to be set.
Signed-off-by: NTigran Mkrtchyan <tigran.mkrtchyan@desy.de>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

a1d1c4f1

NFS: Add COPY nfs operation · 2e72448b

由 Anna Schumaker 提交于 5月 21, 2013

This adds the copy_range file_ops function pointer used by the
sys_copy_range() function call. This patch only implements sync copies,
so if an async copy happens we decode the stateid and ignore it.
Signed-off-by: NAnna Schumaker <bjschuma@netapp.com>

2e72448b

09 5月, 2016 2 次提交

nfs: per-name sillyunlink exclusion · 884be175

由 Al Viro 提交于 4月 28, 2016

use d_alloc_parallel() for sillyunlink/lookup exclusion and
explicit rwsem (nfs_rmdir() being a writer and nfs_call_unlink() -
a reader) for rmdir/sillyunlink one.

That ought to make lookup/readdir/!O_CREAT atomic_open really
parallel on NFS.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

884be175

NFS: Fix an LOCK/OPEN race when unlinking an open file · 11476e9d

由 Chuck Lever 提交于 4月 11, 2016

At Connectathon 2016, we found that recent upstream Linux clients
would occasionally send a LOCK operation with a zero stateid. This
appeared to happen in close proximity to another thread returning
a delegation before unlinking the same file while it remained open.

Earlier, the client received a write delegation on this file and
returned the open stateid. Now, as it is getting ready to unlink the
file, it returns the write delegation. But there is still an open
file descriptor on that file, so the client must OPEN the file
again before it returns the delegation.

Since commit 24311f88 ('NFSv4: Recovery of recalled read
delegations is broken'), nfs_open_delegation_recall() clears the
NFS_DELEGATED_STATE flag _before_ it sends the OPEN. This allows a
racing LOCK on the same inode to be put on the wire before the OPEN
operation has returned a valid open stateid.

To eliminate this race, serialize delegation return with the
acquisition of a file lock on the same file. Adopt the same approach
as is used in the unlock path.

This patch also eliminates a similar race seen when sending a LOCK
operation at the same time as returning a delegation on the same file.

Fixes: 24311f88 ('NFSv4: Recovery of recalled read ... ')
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
[Anna: Add sentence about LOCK / delegation race]
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

11476e9d

11 4月, 2016 1 次提交
- A
  xattr_handler: pass dentry and inode as separate arguments of ->get() · b296821a
  由 Al Viro 提交于 4月 10, 2016
```
... and do not assume they are already attached to each other
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  b296821a
14 3月, 2016 1 次提交

replace d_add_unique() with saner primitive · 668d0cd5

由 Al Viro 提交于 3月 08, 2016

new primitive: d_exact_alias(dentry, inode).  If there is an unhashed
dentry with the same name/parent and given inode, rehash, grab and
return it.  Otherwise, return NULL.  The only caller of d_add_unique()
switched to d_exact_alias() + d_splice_alias().
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

668d0cd5

18 2月, 2016 1 次提交

NFSv4: Fix a dentry leak on alias use · d9dfd8d7

由 Benjamin Coddington 提交于 2月 17, 2016

In the case where d_add_unique() finds an appropriate alias to use it will
have already incremented the reference count. An additional dget() to swap
the open context's dentry is unnecessary and will leak a reference.
Signed-off-by: NBenjamin Coddington <bcodding@redhat.com>
Fixes: 275bb307 ("NFSv4: Move dentry instantiation into the NFSv4-...")
Cc: stable@vger.kernel.org # 3.10+
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

d9dfd8d7

06 2月, 2016 2 次提交

NFS add callback_ops to nfs4_proc_bind_conn_to_session_callback · 02a95dee

由 Andy Adamson 提交于 2月 05, 2016

Fix oops when NULL callback_ops pointer accessed in rpc_init_task
Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

02a95dee

NFSv4.1: nfs4_proc_bind_conn_to_session must iterate over all connections · d9ddbf5d

由 Trond Myklebust 提交于 1月 30, 2016

Use the new helper to ensure that nfs4_proc_bind_conn_to_session() is called
for all connections.
However ensure that we only set the backchannel flag for the connection
pointed to by rpc_clnt->cl_xprt.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

d9ddbf5d

25 1月, 2016 1 次提交

NFSv4.x: Remove hard coded slotids in callback channel · f4f58ed1

由 Trond Myklebust 提交于 1月 23, 2016

Instead, use the values encoded in the slot table itself.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

f4f58ed1

29 12月, 2015 4 次提交

NFSv4.1/pNFS: Add a helper to mark the layout as returned · 0654cc72

由 Trond Myklebust 提交于 12月 28, 2015

This ensures that we don't reuse the stateid if a layout return or
implied layout return means that we've returned all layout segments
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

0654cc72

pNFS: Ensure nfs4_layoutget_prepare returns the correct error · ab7d763e

由 Trond Myklebust 提交于 12月 28, 2015

If we're unable to perform the layoutget due to an invalid open stateid
or a bulk recall, ensure that we return the error so that the caller
can decide on an appropriate action.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

ab7d763e

NFS41: map NFS4ERR_LAYOUTUNAVAILABLE to ENODATA · 7c1e6e58

由 Peng Tao 提交于 12月 05, 2015

Instead of mapping it to EIO that is a fatal error and
fails application. We'll go inband after getting
NFS4ERR_LAYOUTUNAVAILABLE.
Signed-off-by: NPeng Tao <tao.peng@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

7c1e6e58

NFSv4.1/pnfs: Fixup an lo->plh_block_lgets imbalance in layoutreturn · 1a093ceb

由 Trond Myklebust 提交于 12月 28, 2015

Since commit 2d8ae84f, nothing is bumping lo->plh_block_lgets in the
layoutreturn path, so it should not be touched in nfs4_layoutreturn_release
either.

Fixes: 2d8ae84f ("NFSv4.1/pnfs: Remove redundant lo->plh_block_lgets...")
Cc: stable@vger.kernel.org # 4.3+
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

1a093ceb

28 12月, 2015 6 次提交

nfs: machine credential support for additional operations · 99ade3c7

由 Andrew Elble 提交于 12月 02, 2015

Allow LAYOUTRETURN and DELEGRETURN to use machine credentials if the
server supports it. Add request for OPEN_DOWNGRADE as the close path
also uses that.
Signed-off-by: NAndrew Elble <aweits@rit.edu>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

99ade3c7

T
NFSv4: Fix unused variable warnings in nfs4_init_*_client_string() · f2dd436e
由 Trond Myklebust 提交于 10月 08, 2015
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
f2dd436e

Adding tracepoint to cached open · 9759b0fb

由 Olga Kornievskaia 提交于 11月 24, 2015

Signed-off-by: NOlga Kornievskaia <kolga@netapp.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

9759b0fb

Adding stateid information to tracepoints · 48c9579a

由 Olga Kornievskaia 提交于 11月 24, 2015

Operations to which stateid information is added:
close, delegreturn, open, read, setattr, layoutget, layoutcommit, test_stateid,
write, lock, locku, lockt

Format is "stateid=<seqid>:<crc32 hash stateid.other>", also "openstateid=",
"layoutstateid=", and "lockstateid=" for open_file, layoutget, set_lock
tracepoints.

New function is added to internal.h, nfs_stateid_hash(), to compute the hash

trace_nfs4_setattr() is moved from nfs4_do_setattr() to _nfs4_do_setattr()
to get access to stateid.

trace_nfs4_setattr and trace_nfs4_delegreturn are changed from INODE_EVENT
to new event type, INODE_STATEID_EVENT which is same as INODE_EVENT but adds
stateid information

for locking tracepoints, moved trace_nfs4_set_lock() into _nfs4_do_setlk()
to get access to stateid information, and removed trace_nfs4_lock_reclaim(),
trace_nfs4_lock_expired() as they call into _nfs4_do_setlk() and both were
previously same LOCK_EVENT type.
Signed-off-by: NOlga Kornievskaia <kolga@netapp.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

48c9579a

NFS: Allow the combination pNFS and labeled NFS · 95864c91

由 Trond Myklebust 提交于 12月 26, 2015

Fix the nfs4_pnfs_open_bitmap so that it also allows for labeled NFS.
Signed-off-by: NTrond Myklebust <trond,myklebust@primarydata.com>

95864c91

nfs: Fix race in __update_open_stateid() · 361cad3c

由 Andrew Elble 提交于 12月 02, 2015

We've seen this in a packet capture - I've intermixed what I
think was going on. The fix here is to grab the so_lock sooner.

1964379 -> #1 open (for write) reply seqid=1
1964393 -> #2 open (for read) reply seqid=2

  __nfs4_close(), state->n_wronly--
  nfs4_state_set_mode_locked(), changes state->state = [R]
  state->flags is [RW]
  state->state is [R], state->n_wronly == 0, state->n_rdonly == 1

1964398 -> #3 open (for write) call -> because close is already running
1964399 -> downgrade (to read) call seqid=2 (close of #1)
1964402 -> #3 open (for write) reply seqid=3

 __update_open_stateid()
   nfs_set_open_stateid_locked(), changes state->flags
   state->flags is [RW]
   state->state is [R], state->n_wronly == 0, state->n_rdonly == 1
   new sequence number is exposed now via nfs4_stateid_copy()

   next step would be update_open_stateflags(), pending so_lock

1964403 -> downgrade reply seqid=2, fails with OLD_STATEID (close of #1)

   nfs4_close_prepare() gets so_lock and recalcs flags -> send close

1964405 -> downgrade (to read) call seqid=3 (close of #1 retry)

   __update_open_stateid() gets so_lock
 * update_open_stateflags() updates state->n_wronly.
   nfs4_state_set_mode_locked() updates state->state

   state->flags is [RW]
   state->state is [RW], state->n_wronly == 1, state->n_rdonly == 1

 * should have suppressed the preceding nfs4_close_prepare() from
   sending open_downgrade

1964406 -> write call
1964408 -> downgrade (to read) reply seqid=4 (close of #1 retry)

   nfs_clear_open_stateid_locked()
   state->flags is [R]
   state->state is [RW], state->n_wronly == 1, state->n_rdonly == 1

1964409 -> write reply (fails, openmode)
Signed-off-by: NAndrew Elble <aweits@rit.edu>
Cc: stable@vger,kernel.org
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

361cad3c

14 12月, 2015 2 次提交

xattr handlers: Simplify list operation · 764a5c6b

由 Andreas Gruenbacher 提交于 12月 02, 2015

Change the list operation to only return whether or not an attribute
should be listed.  Copying the attribute names into the buffer is moved
to the callers.

Since the result only depends on the dentry and not on the attribute
name, we do not pass the attribute name to list operations.
Signed-off-by: NAndreas Gruenbacher <agruenba@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

764a5c6b

nfs: Move call to security_inode_listsecurity into nfs_listxattr · c4803c49

由 Andreas Gruenbacher 提交于 12月 02, 2015

Add a nfs_listxattr operation.  Move the call to security_inode_listsecurity
from list operation of the "security.*" xattr handler to nfs_listxattr.
Signed-off-by: NAndreas Gruenbacher <agruenba@redhat.com>
Cc: Trond Myklebust <trond.myklebust@primarydata.com>
Cc: Anna Schumaker <anna.schumaker@netapp.com>
Cc: linux-nfs@vger.kernel.org
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

c4803c49

07 12月, 2015 1 次提交

vfs: Distinguish between full xattr names and proper prefixes · 98e9cb57

由 Andreas Gruenbacher 提交于 12月 02, 2015

Add an additional "name" field to struct xattr_handler.  When the name
is set, the handler matches attributes with exactly that name.  When the
prefix is set instead, the handler matches attributes with the given
prefix and with a non-empty suffix.

This patch should avoid bugs like the one fixed in commit c361016a in
the future.
Signed-off-by: NAndreas Gruenbacher <agruenba@redhat.com>
Reviewed-by: NJames Morris <james.l.morris@oracle.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

98e9cb57

24 11月, 2015 1 次提交

nfs: use sliding delay when LAYOUTGET gets NFS4ERR_DELAY · 91ab4b4d

由 Jeff Layton 提交于 11月 19, 2015

When LAYOUTGET gets NFS4ERR_DELAY, we currently will wait 15s before
retrying the call. That is a _very_ long time, so add a timeout value to
struct nfs4_layoutget and pass nfs4_async_handle_error a pointer to it.
This allows the RPC engine to use a sliding delay window, instead of a
15s delay.
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

91ab4b4d

14 11月, 2015 1 次提交

xattr handlers: Pass handler to operations instead of flags · d9a82a04

由 Andreas Gruenbacher 提交于 10月 04, 2015

The xattr_handler operations are currently all passed a file system
specific flags value which the operations can use to disambiguate between
different handlers; some file systems use that to distinguish the xattr
namespace, for example. In some oprations, it would be useful to also have
access to the handler prefix. To allow that, pass a pointer to the handler
to operations instead of the flags value alone.
Signed-off-by: NAndreas Gruenbacher <agruenba@redhat.com>
Reviewed-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

d9a82a04

04 11月, 2015 1 次提交

nfs: Remove unused xdr page offsets in getacl/setacl arguments · 8fbcf237

由 Andreas Gruenbacher 提交于 11月 03, 2015

The arguments passed around for getacl and setacl xdr encoding, struct
nfs_setaclargs and struct nfs_getaclargs, both contain an array of
pages, an offset into the first page, and the length of the page data.
The offset is unused as it is always zero; remove it.
Signed-off-by: NAndreas Gruenbacher <agruenba@redhat.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

8fbcf237

23 10月, 2015 1 次提交

Move locks API users to locks_lock_inode_wait() · 4f656367

由 Benjamin Coddington 提交于 10月 22, 2015

Instead of having users check for FL_POSIX or FL_FLOCK to call the correct
locks API function, use the check within locks_lock_inode_wait().  This
allows for some later cleanup.
Signed-off-by: NBenjamin Coddington <bcodding@redhat.com>
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>

4f656367

16 10月, 2015 2 次提交

nfs: get clone_blksize when probing fsinfo · 2a92ee92

由 Peng Tao 提交于 9月 26, 2015

NFSv42 CLONE operation is supposed to respect it.
Signed-off-by: NPeng Tao <tao.peng@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2a92ee92

nfs42: add CLONE proc functions · e5341f3a

由 Peng Tao 提交于 9月 26, 2015

Signed-off-by: NPeng Tao <tao.peng@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

e5341f3a

08 10月, 2015 4 次提交

NFSv4: Unify synchronous and asynchronous error handling · 037fc980

由 Trond Myklebust 提交于 9月 20, 2015

They now only differ in the way we handle waiting, so let's unify.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

037fc980

NFSv4: Don't use synchronous delegation recall in exception handling · 4816fdad

由 Trond Myklebust 提交于 9月 20, 2015

The code needs to be able to work from inside an asynchronous context.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

4816fdad

NFSv4: nfs4_async_handle_error should take a non-const nfs_server · 516285eb

由 Trond Myklebust 提交于 9月 20, 2015

For symmetry with the synchronous handler, and so that we can potentially
handle errors such as NFS4ERR_BADNAME.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

516285eb

T
NFSv4: Update the delay statistics counter for synchronous delays · 2598ed34
由 Trond Myklebust 提交于 9月 20, 2015
```
Currently, we only do so for asynchronous delays.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
2598ed34

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功