提交 · f1ecbc21eb097ecf0b73793c44b0197584db9e2c · openanolis / cloud-kernel

07 9月, 2017 2 次提交

SUNRPC: remove some dead code. · f1ecbc21

由 NeilBrown 提交于 8月 18, 2017

RPC_TASK_NO_RETRANS_TIMEOUT is set when cl_noretranstimeo
is set, which happens when  RPC_CLNT_CREATE_NO_RETRANS_TIMEOUT is set,
which happens when NFS_CS_NO_RETRANS_TIMEOUT is set.

This flag means "don't resend on a timeout, only resend if the
connection gets broken for some reason".

cl_discrtry is set when RPC_CLNT_CREATE_DISCRTRY is set, which
happens when NFS_CS_DISCRTRY is set.

This flag means "always disconnect before resending".

NFS_CS_NO_RETRANS_TIMEOUT and NFS_CS_DISCRTRY are both only set
in nfs4_init_client(), and it always sets both.

So we will never have a situation where only one of the flags is set.
So this code, which tests if timeout retransmits are allowed, and
disconnection is required, will never run.

So it makes sense to remove this code as it cannot be tested and
could confuse people reading the code (like me).

(alternately we could leave it there with a comment saying
 it is never actually used).
Signed-off-by: NNeilBrown <neilb@suse.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

f1ecbc21

NFS: don't expect errors from mempool_alloc(). · 237f8306

由 NeilBrown 提交于 8月 18, 2017

Commit fbe77c30 ("NFS: move rw_mode to nfs_pageio_header")
reintroduced some pointless code that commit 518662e0 ("NFS: fix
usage of mempools.") had recently removed.

Remove it again.

Cc: Benjamin Coddington <bcodding@redhat.com>
Signed-off-by: NNeilBrown <neilb@suse.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

237f8306

06 9月, 2017 2 次提交

xprtrdma: Use xprt_pin_rqst in rpcrdma_reply_handler · 9590d083

由 Chuck Lever 提交于 8月 23, 2017

Adopt the use of xprt_pin_rqst to eliminate contention between
Call-side users of rb_lock and the use of rb_lock in
rpcrdma_reply_handler.

This replaces the mechanism introduced in 431af645 ("xprtrdma:
Fix client lock-up after application signal fires").

Use recv_lock to quickly find the completing rqst, pin it, then
drop the lock. At that point invalidation and pull-up of the Reply
XDR can be done. Both are often expensive operations.

Finally, take recv_lock again to signal completion to the RPC
layer. It also protects adjustment of "cwnd".

This greatly reduces the amount of time a lock is held by the
reply handler. Comparing lock_stat results shows a marked decrease
in contention on rb_lock and recv_lock.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
[trond.myklebust@primarydata.com: Remove call to rpcrdma_buffer_put() from
   the "out_norqst:" path in rpcrdma_reply_handler.]
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

9590d083

Merge tag 'nfs-rdma-for-4.14-1' of git://git.linux-nfs.org/projects/anna/linux-nfs into linux-next · f9773b22

由 Trond Myklebust 提交于 9月 05, 2017

NFS-over-RDMA client updates for Linux 4.14

Bugfixes and cleanups:
- Constify rpc_xprt_ops
- Harden RPC call encoding and decoding
- Clean up rpc call decoding to use xdr_streams
- Remove unused variables from various structures
- Refactor code to remove imul instructions
- Rearrange rx_stats structure for better cacheline sharing

f9773b22

23 8月, 2017 1 次提交

xprtrdma: Re-arrange struct rx_stats · 67af6f65

由 Chuck Lever 提交于 8月 22, 2017

To reduce false cacheline sharing, separate counters that are likely
to be accessed in the Call path from those accessed in the Reply
path.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

67af6f65

21 8月, 2017 4 次提交

T

Merge branch 'bugfixes' · 7af7a596
由 Trond Myklebust 提交于 8月 20, 2017

7af7a596

NFS: Fix NFSv2 security settings · 53a75f22

由 Chuck Lever 提交于 8月 10, 2017

For a while now any NFSv2 mount where sec= is specified uses
AUTH_NULL. If sec= is not specified, the mount uses AUTH_UNIX.
Commit e68fd7c8 ("mount: use sec= that was specified on the
command line") attempted to address a very similar problem with
NFSv3, and should have fixed this too, but it has a bug.

The MNTv1 MNT procedure does not return a list of security flavors,
so our client makes up a list containing just AUTH_NULL. This should
enable nfs_verify_authflavors() to assign the sec= specified flavor,
but instead, it incorrectly sets it to AUTH_NULL.

I expect this would also be a problem for any NFSv3 server whose
MNTv3 MNT procedure returned a security flavor list containing only
AUTH_NULL.

Fixes: e68fd7c8 ("mount: use sec= that was specified on ... ")
BugLink: https://bugzilla.linux-nfs.org/show_bug.cgi?id=310Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

53a75f22

NFSv4.1: don't use machine credentials for CLOSE when using 'sec=sys' · b79e87e0

由 NeilBrown 提交于 8月 18, 2017

An NFSv4.1 client might close a file after the user who opened it has
logged off.  In this case the user's credentials may no longer be
valid, if they are e.g. kerberos credentials that have expired.

NFSv4.1 has a mechanism to allow the client to use machine credentials
to close a file.  However due to a short-coming in the RFC, a CLOSE
with those credentials may not be possible if the file in question
isn't exported to the same security flavor - the required PUTFH must
be rejected when this is the case.

Specifically if a server and client support kerberos in general and
have used it to form a machine credential, but the file is only
exported to "sec=sys", a PUTFH with the machine credentials will fail,
so CLOSE is not possible.

As RPC_AUTH_UNIX (used by sec=sys) credentials can never expire, there
is no value in using the machine credential in place of them.
So in that case, just use the users credentials for CLOSE etc, as you would
in NFSv4.0
Signed-off-by: NNeil Brown <neilb@suse.com>
Signed-off-by: NNeilBrown <neilb@suse.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

b79e87e0

SUNRPC: ECONNREFUSED should cause a rebind. · fd01b259

由 NeilBrown 提交于 8月 18, 2017

If you
 - mount and NFSv3 filesystem
 - do some file locking which requires the server
   to make a GRANT call back
 - unmount
 - mount again and do the same locking

then the second attempt at locking suffers a 30 second delay.
Unmounting and remounting causes lockd to stop and restart,
which causes it to bind to a new port.
The server still thinks the old port is valid and gets ECONNREFUSED
when trying to contact it.
ECONNREFUSED should be seen as a hard error that is not worth
retrying.  Rebinding is the only reasonable response.

This patch forces a rebind if that makes sense.
Signed-off-by: NNeilBrown <neilb@suse.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

fd01b259

20 8月, 2017 2 次提交

NFS: Remove unused parameter gfp_flags from nfs_pageio_init() · 3bde7afd

由 Trond Myklebust 提交于 8月 20, 2017

Now that the mirror allocation has been moved, the parameter can go.
Also remove the redundant symbol export.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

3bde7afd

NFSv4: Fix up mirror allocation · 14abcb0b

由 Trond Myklebust 提交于 8月 19, 2017

There are a number of callers of nfs_pageio_complete() that want to
continue using the nfs_pageio_descriptor without needing to call
nfs_pageio_init() again. Examples include nfs_pageio_resend() and
nfs_pageio_cond_complete().

The problem is that nfs_pageio_complete() also calls
nfs_pageio_cleanup_mirroring(), which frees up the array of mirrors.
This can lead to writeback errors, in the next call to
nfs_pageio_setup_mirroring().

Fix by simply moving the allocation of the mirrors to
nfs_pageio_setup_mirroring().

Link: https://bugzilla.kernel.org/show_bug.cgi?id=196709Reported-by: NJianhongYin <yin-jianhong@163.com>
Cc: stable@vger.kernel.org # 4.0+
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

14abcb0b

19 8月, 2017 2 次提交

T

Merge branch 'writeback' · b7561e51
由 Trond Myklebust 提交于 8月 18, 2017

b7561e51

SUNRPC: Add a separate spinlock to protect the RPC request receive list · ce7c252a

由 Trond Myklebust 提交于 8月 16, 2017

This further reduces contention with the transport_lock, and allows us
to convert to using a non-bh-safe spinlock, since the list is now never
accessed from a bh context.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

ce7c252a

17 8月, 2017 4 次提交

SUNRPC: Cleanup xs_tcp_read_common() · 040249df

由 Trond Myklebust 提交于 8月 13, 2017

Simplify the code to avoid a full copy of the struct xdr_skb_reader.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

040249df

SUNRPC: Don't loop forever in xs_tcp_data_receive() · 8d6f97d6

由 Trond Myklebust 提交于 8月 12, 2017

Ensure that we don't hog the workqueue thread by requeuing the job
every 64 loops.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

8d6f97d6

SUNRPC: Don't hold the transport lock when receiving backchannel data · c89091c8

由 Trond Myklebust 提交于 8月 16, 2017

The backchannel request has no associated task, so it is going nowhere
until we call xprt_complete_bc_request().
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

c89091c8

SUNRPC: Don't hold the transport lock across socket copy operations · 729749bb

由 Trond Myklebust 提交于 8月 13, 2017

Instead add a mechanism to ensure that the request doesn't disappear
from underneath us while copying from the socket. We do this by
preventing xprt_release() from freeing the XDR buffers until the
flag RPC_TASK_MSG_RECV has been cleared from the request.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Reviewed-by: NChuck Lever <chuck.lever@oracle.com>

729749bb

16 8月, 2017 2 次提交

xprtrdma: Remove imul instructions from chunk list encoders · 6748b0ca

由 Chuck Lever 提交于 8月 14, 2017

Re-arrange the pointer arithmetic in the chunk list encoders to
eliminate several more integer multiplication instructions during
Transport Header encoding.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

6748b0ca

xprtrdma: Remove imul instructions from rpcrdma_convert_iovs() · 28d9d56f

由 Chuck Lever 提交于 8月 14, 2017

Re-arrange the pointer arithmetic in rpcrdma_convert_iovs() to
eliminate several integer multiplication instructions during
Transport Header encoding.

Also, array overflow does not occur outside development
environments, so replace overflow checking with one spot check
at the end. This reduces the number of conditional branches in
the common case.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

28d9d56f

15 8月, 2017 21 次提交

NFS: Wait for requests that are locked on the commit list · 2ce209c4

由 Trond Myklebust 提交于 8月 01, 2017

If a request is on the commit list, but is locked, we will currently skip
it, which can lead to livelocking when the commit count doesn't reduce
to zero.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2ce209c4

NFSv4/pnfs: Replace pnfs_put_lseg_locked() with pnfs_put_lseg() · 8205b9ce

由 Trond Myklebust 提交于 8月 01, 2017

Now that we no longer hold the inode->i_lock when manipulating the
commit lists, it is safe to call pnfs_put_lseg() again.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

8205b9ce

NFS: Switch to using mapping->private_lock for page writeback lookups. · 4b9bb25b

由 Trond Myklebust 提交于 8月 01, 2017

Switch from using the inode->i_lock for this to avoid contention with
other metadata manipulation.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

4b9bb25b

T
NFS: Use an atomic_long_t to count the number of commits · 5cb953d4
由 Trond Myklebust 提交于 8月 01, 2017
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
5cb953d4

NFS: Use an atomic_long_t to count the number of requests · a6b6d5b8

由 Trond Myklebust 提交于 8月 01, 2017

Rather than forcing us to take the inode->i_lock just in order to bump
the number.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

a6b6d5b8

NFSv4: Use a mutex to protect the per-inode commit lists · e824f99a

由 Trond Myklebust 提交于 8月 01, 2017

The commit lists can get very large, so using the inode->i_lock can
end up affecting general metadata performance.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

e824f99a

NFS: Refactor nfs_page_find_head_request() · b30d2f04

由 Trond Myklebust 提交于 8月 01, 2017

Split out the 2 cases so that we can treat the locking differently.
The issue is that the locking in the pageswapcache cache is highly
linked to the commit list locking.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

b30d2f04

NFSv4: Convert nfs_lock_and_join_requests() to use nfs_page_find_head_request() · bd37d6fc

由 Trond Myklebust 提交于 8月 01, 2017

Hide the locking from nfs_lock_and_join_requests() so that we can
separate out the requirements for swapcache pages.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

bd37d6fc

NFS: Fix up nfs_page_group_covers_page() · 7e8a30f8

由 Trond Myklebust 提交于 7月 17, 2017

Fix up the test in nfs_page_group_covers_page(). The simplest implementation
is to check that we have a set of intersecting or contiguous subrequests
that connect page offset 0 to nfs_page_length(req->wb_page).
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

7e8a30f8

NFS: Remove unused parameter from nfs_page_group_lock() · 1344b7ea

由 Trond Myklebust 提交于 7月 17, 2017

nfs_page_group_lock() is now always called with the 'nonblock'
parameter set to 'false'.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

1344b7ea

T
NFS: Remove unuse function nfs_page_group_lock_wait() · dee83046
由 Trond Myklebust 提交于 7月 17, 2017
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
dee83046

NFS: Remove nfs_page_group_clear_bits() · 902a4c00

由 Trond Myklebust 提交于 7月 19, 2017

At this point, we only expect ever to potentially see PG_REMOVE and
PG_TEARDOWN being set on the subrequests.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

902a4c00

NFS: Fix nfs_page_group_destroy() and nfs_lock_and_join_requests() race cases · 5b2b5187

由 Trond Myklebust 提交于 7月 19, 2017

Since nfs_page_group_destroy() does not take any locks on the requests
to be freed, we need to ensure that we don't inadvertently free the
request in nfs_destroy_unlinked_subrequests() while the last reference
is being released elsewhere.

Do this by:

1) Taking a reference to the request unless it is already being freed
2) Checking (under the page group lock) if PG_TEARDOWN is already set before
   freeing an unreferenced request in nfs_destroy_unlinked_subrequests()
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

5b2b5187

NFS: Further optimise nfs_lock_and_join_requests() · 74a6d4b5

由 Trond Myklebust 提交于 7月 19, 2017

When locking the entire group in order to remove subrequests,
the locks are always taken in order, and with the page group
lock being taken after the page head is locked. The intention
is that:

1) The lock on the group head guarantees that requests may not
   be removed from the group (although new entries could be appended
   if we're not holding the group lock).
2) It is safe to drop and retake the page group lock while iterating
   through the list, in particular when waiting for a subrequest lock.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

74a6d4b5

NFS: Reduce inode->i_lock contention in nfs_lock_and_join_requests() · b5bab9bf

由 Trond Myklebust 提交于 7月 17, 2017

We should no longer need the inode->i_lock, now that we've
straightened out the request locking. The locking schema is now:

1) Lock page head request
2) Lock the page group
3) Lock the subrequests one by one

Note that there is a subtle race with nfs_inode_remove_request() due
to the fact that the latter does not lock the page head, when removing
it from the struct page. Only the last subrequest is locked, hence
we need to re-check that the PagePrivate(page) is still set after
we've locked all the subrequests.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

b5bab9bf

NFS: Remove page group limit in nfs_flush_incompatible() · 7e6cca6c

由 Trond Myklebust 提交于 7月 17, 2017

nfs_try_to_update_request() should be able to cope now.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

7e6cca6c

T
NFS: Teach nfs_try_to_update_request() to deal with request page_groups · f6032f21
由 Trond Myklebust 提交于 7月 17, 2017
```
Simplify the code, and avoid some flushes to disk.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
f6032f21

NFS: Fix the inode request accounting when pages have subrequests · b66aaa8d

由 Trond Myklebust 提交于 7月 18, 2017

Both nfs_destroy_unlinked_subrequests() and nfs_lock_and_join_requests()
manipulate the inode flags adjusting the NFS_I(inode)->nrequests.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

b66aaa8d

NFS: Don't unlock writebacks before declaring PG_WB_END · 31a01f09

由 Trond Myklebust 提交于 7月 18, 2017

We don't want nfs_lock_and_join_requests() to start fiddling with
the request before the call to nfs_page_group_sync_on_bit().
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

31a01f09

NFS: Don't check request offset and size without holding a lock · e14bebf6

由 Trond Myklebust 提交于 7月 17, 2017

Request offsets and sizes are not guaranteed to be stable unless you
are holding the request locked.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

e14bebf6

NFS: Fix an ABBA issue in nfs_lock_and_join_requests() · a0e265bc

由 Trond Myklebust 提交于 7月 17, 2017

All other callers of nfs_page_group_lock() appear to already hold the
page lock on the head page, so doing it in the opposite order here
is inefficient, although not deadlock prone since we roll back all
locks on contention.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

a0e265bc

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功