提交 · 451d26e151f0792601d10378a608c52304b6a357 · openeuler / Kernel

14 7月, 2017 40 次提交

xprtrdma: Pass only the list of registered MRs to ro_unmap_sync · 451d26e1

由 Chuck Lever 提交于 6月 08, 2017

There are rare cases where an rpcrdma_req can be re-used (via
rpcrdma_buffer_put) while the RPC reply handler is still running.
This is due to a signal firing at just the wrong instant.

Since commit 9d6b0409 ("xprtrdma: Place registered MWs on a
per-req list"), rpcrdma_mws are self-contained; ie., they fully
describe an MR and scatterlist, and no part of that information is
stored in struct rpcrdma_req.

As part of closing the above race window, pass only the req's list
of registered MRs to ro_unmap_sync, rather than the rpcrdma_req
itself.

Some extra transport header sanity checking is removed. Since the
client depends on its own recollection of what memory had been
registered, there doesn't seem to be a way to abuse this change.

And, the check was not terribly effective. If the client had sent
Read chunks, the "list_empty" test is negative in both of the
removed cases, which are actually looking for Write or Reply
chunks.

BugLink: https://bugzilla.linux-nfs.org/show_bug.cgi?id=305
Fixes: 68791649 ('xprtrdma: Invalidate in the RPC reply ... ')
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

451d26e1

xprtrdma: Pre-mark remotely invalidated MRs · 4b196dc6

由 Chuck Lever 提交于 6月 08, 2017

There are rare cases where an rpcrdma_req and its matched
rpcrdma_rep can be re-used, via rpcrdma_buffer_put, while the RPC
reply handler is still using that req. This is typically due to a
signal firing at just the wrong instant.

As part of closing this race window, avoid using the wrong
rpcrdma_rep to detect remotely invalidated MRs. Mark MRs as
invalidated while we are sure the rep is still OK to use.

BugLink: https://bugzilla.linux-nfs.org/show_bug.cgi?id=305
Fixes: 68791649 ('xprtrdma: Invalidate in the RPC reply ... ')
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

4b196dc6

xprtrdma: On invalidation failure, remove MWs from rl_registered · 04d25b7d

由 Chuck Lever 提交于 6月 08, 2017

Callers assume the ro_unmap_sync and ro_unmap_safe methods empty
the list of registered MRs. Ensure that all paths through
fmr_op_unmap_sync() remove MWs from that list.

Fixes: 9d6b0409 ("xprtrdma: Place registered MWs on a ... ")
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

04d25b7d

NFS: check for nfs_refresh_inode() errors in nfs_fhget() · 26fde4df

由 NeilBrown 提交于 7月 03, 2017

If an NFS server returns a filehandle that we have previously
seen, and reports a different type, then nfs_refresh_inode()
will log a warning and return an error.

nfs_fhget() does not check for this error and may return an
inode with a different type than the one that the server
reported.

This is likely to cause confusion, and is one way that
->open_context() could return a directory inode as discussed
in the previous patch.

So if nfs_refresh_inode() returns and error, return that error
from nfs_fhget() to avoid the confusion propagating.
Signed-off-by: NNeilBrown <neilb@suse.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

26fde4df

NFS: guard against confused server in nfs_atomic_open() · eaa2b82c

由 NeilBrown 提交于 7月 03, 2017

A confused server could return a filehandle for an
NFSv4 OPEN request, which it previously returned for a directory.
So the inode returned by  ->open_context() in nfs_atomic_open()
could conceivably be a directory inode.

This has particular implications for the call to
nfs_file_set_open_context() in nfs_finish_open().
If that is called on a directory inode, then the nfs_open_context
that gets stored in the filp->private_data will be linked to
nfs_inode->open_files.

When the directory is closed, nfs_closedir() will (ultimately)
free the ->private_data, but not unlink it from nfs_inode->open_files
(because it doesn't expect an nfs_open_context there).

Subsequently the memory could get used for something else and eventually
if the ->open_files list is walked, the walker will fall off the end and
crash.

So: change nfs_finish_open() to only call nfs_file_set_open_context()
for regular-file inodes.

This failure mode has been seen in a production setting (unknown NFS
server implementation).  The kernel was v3.0 and the specific sequence
seen would not affect more recent kernels, but I think a risk is still
present, and caution is wise.
Signed-off-by: NNeilBrown <neilb@suse.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

eaa2b82c

NFS: only invalidate dentrys that are clearly invalid. · cc89684c

由 NeilBrown 提交于 7月 05, 2017

Since commit bafc9b75 ("vfs: More precise tests in d_invalidate")
in v3.18, a return of '0' from ->d_revalidate() will cause the dentry
to be invalidated even if it has filesystems mounted on or it or on a
descendant.  The mounted filesystem is unmounted.

This means we need to be careful not to return 0 unless the directory
referred to truly is invalid.  So -ESTALE or -ENOENT should invalidate
the directory.  Other errors such a -EPERM or -ERESTARTSYS should be
returned from ->d_revalidate() so they are propagated to the caller.

A particular problem can be demonstrated by:

1/ mount an NFS filesystem using NFSv3 on /mnt
2/ mount any other filesystem on /mnt/foo
3/ ls /mnt/foo
4/ turn off network, or otherwise make the server unable to respond
5/ ls /mnt/foo &
6/ cat /proc/$!/stack # note that nfs_lookup_revalidate is in the call stack
7/ kill -9 $! # this results in -ERESTARTSYS being returned
8/ observe that /mnt/foo has been unmounted.

This patch changes nfs_lookup_revalidate() to only treat
  -ESTALE from nfs_lookup_verify_inode() and
  -ESTALE or -ENOENT from ->lookup()
as indicating an invalid inode.  Other errors are returned.

Also nfs_check_inode_attributes() is changed to return -ESTALE rather
than -EIO.  This is consistent with the error returned in similar
circumstances from nfs_update_inode().

As this bug allows any user to unmount a filesystem mounted on an NFS
filesystem, this fix is suitable for stable kernels.

Fixes: bafc9b75 ("vfs: More precise tests in d_invalidate")
Cc: stable@vger.kernel.org (v3.18+)
Signed-off-by: NNeilBrown <neilb@suse.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

cc89684c

PNFS for stateid errors retry against MDS first · 22368ff1

由 Olga Kornievskaia 提交于 6月 23, 2017

Upon receiving a stateid error such as BAD_STATEID, the client
should retry the operation against the MDS before deciding to
do stateid recovery.

Previously, the code would initiate state recovery and it could
lead to a race in a state manager that could chose an incorrect
recovery method which would lead to the EIO failure for the
application.
Signed-off-by: NOlga Kornievskaia <kolga@netapp.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

22368ff1

PNFS fix EACCESS on commit to DS handling · a0bc01e0

由 Olga Kornievskaia 提交于 6月 23, 2017

Commit fabbbee0 "PNFS fix fallback to MDS if got error on
commit to DS" moved the pnfs_set_lo_fail() to unhandled errors
which was not correct and lead to a kernel oops on umount.

Instead, fix the original EACCESS on commit to DS error by
getting the new layout and re-doing the IO.

Fixes: fabbbee0 ("PNFS fix fallback to MDS if got error on commit to DS")
Signed-off-by: NOlga Kornievskaia <kolga@netapp.com>
Cc: stable@vger.kernel.org # v4.12
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

a0bc01e0

NFS: silence a uninitialized variable warning · 4cd1ec95

由 Dan Carpenter 提交于 6月 23, 2017

Static checkers have gotten clever enough to complain that "id_long" is
uninitialized on the failure path. It's harmless, but simple to fix.
Signed-off-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

4cd1ec95

nfs: Fix fscache stat printing in nfs_show_stats() · ce85bd29

由 Tuo Chen Peng 提交于 6月 06, 2017

nfs_show_stats() was incorrectly reading statistics for bytes when printing that
for fsc. It caused files like /proc/self/mountstats to report incorrect fsc
statistics for NFS mounts.
Signed-off-by: NTuo Chen Peng <tpeng@nvidia.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

ce85bd29

NFS: Fix initialization of nfs_page_array->npages · 2eb3aea7

由 Benjamin Coddington 提交于 6月 09, 2017

Commit 8ef9b0b9 open-coded nfs_pgarray_set(), and left out the
initialization of the nfs_page_array's npages.  This mistake didn't show up
until testing with block layouts, and there shows that all pNFS reads
return -EIO.

Fixes: 8ef9b0b9 ("NFS: move nfs_pgarray_set() to open code")
Signed-off-by: NBenjamin Coddington <bcodding@redhat.com>
Cc: stable@vger.kernel.org # 4.12
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

2eb3aea7

NFS: Fix commit policy for non-blocking calls to nfs_write_inode() · 1a4edf0f

由 Trond Myklebust 提交于 6月 20, 2017

Now that the writes will schedule a commit on their own, we don't
need nfs_write_inode() to schedule one if there are outstanding
writes, and we're being called in non-blocking mode.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

1a4edf0f

NFS: Ensure we commit after writeback is complete · 919e3bd9

由 Trond Myklebust 提交于 6月 20, 2017

If the page cache is being flushed, then we want to ensure that we
do start a commit once the pages are done being flushed.
If we just wait until all I/O is done to that file, we can end up
livelocking until the balance_dirty_pages() mechanism puts its
foot down and forces I/O to stop.
So instead we do more or less the same thing that O_DIRECT does,
and set up a counter to tell us when the flush is done,
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

919e3bd9

NFS: Remove unused fields in the page I/O structures · b5973a8c

由 Trond Myklebust 提交于 6月 20, 2017

Remove the 'layout_private' fields that were only used by the pNFS OSD
layout driver.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

b5973a8c

SUNRPC: Make slot allocation more reliable · 92ea011f

由 Trond Myklebust 提交于 6月 20, 2017

In xprt_alloc_slot(), the spin lock is only needed to provide atomicity
between the atomic_add_unless() failure and the call to xprt_add_backlog().
We do not actually need to hold it across the memory allocation itself.

By dropping the lock, we can use a more resilient GFP_NOFS allocation,
just as we now do in the rest of the RPC client code.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

92ea011f

NFS: nfs_rename() - revalidate directories on -ERESTARTSYS · 818a8dbe

由 Benjamin Coddington 提交于 6月 16, 2017

An interrupted rename will leave the old dentry behind if the rename
succeeds.  Fix this by forcing a lookup the next time through
->d_revalidate.

A previous attempt at solving this problem took the approach to complete
the work of the rename asynchronously, however that approach was wrong
since it would allow the d_move() to occur after the directory's i_mutex
had been dropped by the original process.
Signed-off-by: NBenjamin Coddington <bcodding@redhat.com>
Reviewed-by: NJeff Layton <jlayton@redhat.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

818a8dbe

NFS: convert flags to bool · a7a3b1e9

由 Benjamin Coddington 提交于 6月 20, 2017

NFS uses some int, and unsigned int :1, and bool as flags in structs and
args.  Assert the preference for uniformly replacing these with the bool
type.
Signed-off-by: NBenjamin Coddington <bcodding@redhat.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

a7a3b1e9

NFS: Set FATTR4_WORD0_TYPE for . and .. entries · 18fe6a23

由 Anna Schumaker 提交于 6月 16, 2017

The current code worked okay for getdents(), but getdents64() expects
the d_type field to get filled out properly in the stat structure.
Setting this field fixes xfstests generic/401.
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

18fe6a23

nfsd4: const-ify nfsd4_ops · 800222f8

由 Christoph Hellwig 提交于 5月 08, 2017

nfsd4_ops contains function pointers, and marking it as constant avoids
it being able to be used as an attach vector for code injections.
Signed-off-by: NChristoph Hellwig <hch@lst.de>

800222f8

C
sunrpc: mark all struct svc_version instances as const · aa8217d5
由 Christoph Hellwig 提交于 5月 12, 2017
```
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Acked-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
aa8217d5

sunrpc: mark all struct svc_procinfo instances as const · b9c744c1

由 Christoph Hellwig 提交于 5月 12, 2017

struct svc_procinfo contains function pointers, and marking it as
constant avoids it being able to be used as an attach vector for
code injections.
Signed-off-by: NChristoph Hellwig <hch@lst.de>

b9c744c1

sunrpc: move pc_count out of struct svc_procinfo · 0becc118

由 Christoph Hellwig 提交于 5月 08, 2017

pc_count is the only writeable memeber of struct svc_procinfo, which is
a good candidate to be const-ified as it contains function pointers.

This patch moves it into out out struct svc_procinfo, and into a
separate writable array that is pointed to by struct svc_version.
Signed-off-by: NChristoph Hellwig <hch@lst.de>

0becc118

nfsd4: properly type op_func callbacks · 72edc37a

由 Christoph Hellwig 提交于 5月 08, 2017

Pass union nfsd4_op_u to the op_func callbacks instead of using unsafe
function pointer casts.

It also adds two missing structures to struct nfsd4_op.u to facilitate
this.
Signed-off-by: NChristoph Hellwig <hch@lst.de>

72edc37a

nfsd4: remove nfsd4op_rsize · 62bbf8bb

由 Christoph Hellwig 提交于 5月 08, 2017

Except for a lot of unnecessary casts this typedef only has one user,
so remove the casts and expand it in struct nfsd4_operation.
Signed-off-by: NChristoph Hellwig <hch@lst.de>

62bbf8bb

nfsd4: properly type op_get_currentstateid callbacks · c2a1102a

由 Christoph Hellwig 提交于 5月 08, 2017

Pass union nfsd4_op_u to the op_set_currentstateid callbacks instead of
using unsafe function pointer casts.
Signed-off-by: NChristoph Hellwig <hch@lst.de>

c2a1102a

nfsd4: properly type op_set_currentstateid callbacks · 6c9600a7

由 Christoph Hellwig 提交于 5月 08, 2017

Given the args union in struct nfsd4_op a name, and pass it to the
op_set_currentstateid callbacks instead of using unsafe function
pointer casts.
Signed-off-by: NChristoph Hellwig <hch@lst.de>

6c9600a7

C
sunrpc: remove kxdrproc_t · 408b3d46
由 Christoph Hellwig 提交于 5月 08, 2017
```
Remove the now unused typedef.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
```
408b3d46

sunrpc: properly type pc_encode callbacks · d16d1867

由 Christoph Hellwig 提交于 5月 08, 2017

Drop the resp argument as it can trivially be derived from the rqstp
argument.  With that all functions now have the same prototype, and we
can remove the unsafe casting to kxdrproc_t.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Acked-by: NTrond Myklebust <trond.myklebust@primarydata.com>

d16d1867

sunrpc: properly type pc_decode callbacks · cc6acc20

由 Christoph Hellwig 提交于 5月 08, 2017

Drop the argp argument as it can trivially be derived from the rqstp
argument.  With that all functions now have the same prototype, and we
can remove the unsafe casting to kxdrproc_t.
Signed-off-by: NChristoph Hellwig <hch@lst.de>

cc6acc20

sunrpc: properly type pc_release callbacks · 1150ded8

由 Christoph Hellwig 提交于 5月 08, 2017

Drop the p and resp arguments as they are always NULL or can trivially
be derived from the rqstp argument. With that all functions now have the
same prototype, and we can remove the unsafe casting to kxdrproc_t.
Signed-off-by: NChristoph Hellwig <hch@lst.de>

1150ded8

sunrpc: properly type pc_func callbacks · 1c8a5409

由 Christoph Hellwig 提交于 5月 08, 2017

Drop the argp and resp arguments as they can trivially be derived from
the rqstp argument.  With that all functions now have the same prototype,
and we can remove the unsafe casting to svc_procfunc as well as the
svc_procfunc typedef itself.
Signed-off-by: NChristoph Hellwig <hch@lst.de>

1c8a5409

C
nfsd: remove the unused PROC() macro in nfs3proc.c · 36ba89c2
由 Christoph Hellwig 提交于 5月 08, 2017
```
Signed-off-by: NChristoph Hellwig <hch@lst.de>
```
36ba89c2
C
nfsd: use named initializers in PROC() · ec7e8cae
由 Christoph Hellwig 提交于 5月 08, 2017
```
Signed-off-by: NChristoph Hellwig <hch@lst.de>
```
ec7e8cae
C
nfsd4: const-ify nfs_cb_version4 · 39d43f75
由 Christoph Hellwig 提交于 5月 12, 2017
```
Signed-off-by: NChristoph Hellwig <hch@lst.de>
```
39d43f75

sunrpc: mark all struct rpc_procinfo instances as const · 511e936b

由 Christoph Hellwig 提交于 5月 12, 2017

struct rpc_procinfo contains function pointers, and marking it as
constant avoids it being able to be used as an attach vector for
code injections.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Acked-by: NTrond Myklebust <trond.myklebust@primarydata.com>

511e936b

C
nfs: use ARRAY_SIZE() in the nfsacl_version3 declaration · 9ae7d8ff
由 Christoph Hellwig 提交于 5月 12, 2017
```
Signed-off-by: NChristoph Hellwig <hch@lst.de>
```
9ae7d8ff

sunrpc: move p_count out of struct rpc_procinfo · c551858a

由 Christoph Hellwig 提交于 5月 08, 2017

p_count is the only writeable memeber of struct rpc_procinfo, which is
a good candidate to be const-ified as it contains function pointers.

This patch moves it into out out struct rpc_procinfo, and into a
separate writable array that is pointed to by struct rpc_version and
indexed by p_statidx.
Signed-off-by: NChristoph Hellwig <hch@lst.de>

c551858a

lockd: fix some weird indentation · e91ff8e3

由 Christoph Hellwig 提交于 5月 08, 2017

Remove double indentation of a few struct rpc_version and
struct rpc_program instance.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Acked-by: NTrond Myklebust <trond.myklebust@primarydata.com>

e91ff8e3

nfs: don't cast callback decode/proc/encode routines · 947c6e43

由 Christoph Hellwig 提交于 5月 11, 2017

Instead declare all functions with the proper methods signature.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Reviewed-by: NJeff Layton <jlayton@redhat.com>
Acked-by: NTrond Myklebust <trond.myklebust@primarydata.com>

947c6e43

nfs: fix decoder callback prototypes · fc016483

由 Christoph Hellwig 提交于 5月 08, 2017

Declare the p_decode callbacks with the proper prototype instead of
casting to kxdrdproc_t and losing all type safety.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Reviewed-by: NJeff Layton <jlayton@redhat.com>
Acked-by: NTrond Myklebust <trond.myklebust@primarydata.com>

fc016483

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功