Commits · be189f7e7f03de35887e5a85ddcf39b91b5d7fc1 · openeuler / Kernel

01 Oct, 2018 · 40 commits

NFS: Fix dentry revalidation on NFSv4 lookup · be189f7e

Committed by Trond Myklebust on Sep 27, 2018

We need to ensure that inode and dentry revalidation occurs correctly
on reopen of a file that is already open. Currently, we can end up
not revalidating either in the case of NFSv4.0, due to the 'cached open'
path.
Let's fix that by ensuring that we only do cached open for the special
cases of open recovery and delegation return.
Reported-by: Stan Hu <stanhu@gmail.com>
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

be189f7e

SUNRPC: Replace krb5_seq_lock with a lockless scheme · 571ed1fd

Committed by Trond Myklebust on Sep 29, 2018

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

571ed1fd

SUNRPC: Lockless lookup of RPCSEC_GSS mechanisms · 0c1c19f4

Committed by Trond Myklebust on Sep 29, 2018

Use RCU protected lookups for discovering the supported mechanisms.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

0c1c19f4

SUNRPC: Remove rpc_authflavor_lock in favour of RCU locking · 4e4c3bef

Committed by Trond Myklebust on Sep 27, 2018

Module removal is RCU safe by design, so we really have no need to
lock the auth_flavors[] array. Substitute a lockless scheme to
add/remove entries in the array, and then use RCU.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

4e4c3bef

NFS: Remove private spinlock in struct nfs_pgio_header · 1c6c4b74

Committed by Trond Myklebust on Sep 25, 2018

Now that each struct nfs_pgio_header corresponds to one RPC call, we
only have one writer to the struct nfs_pgio_header.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

1c6c4b74

NFSv4: Save a few bytes in the nfs_pgio_args/res · 28d52235

Committed by Trond Myklebust on Sep 24, 2018

Save a few bytes by allowing the read/write specific fields of the
structures to share storage.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

28d52235
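
Editor's sketch for the commit above: one common way to let read-specific and write-specific fields share storage in C is an anonymous-free `union` of two small structs. The struct and field names below are hypothetical and are not taken from the kernel's `struct nfs_pgio_args`; the exact saving, if any, depends on the ABI.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical layout: every read-only and write-only field has its own slot. */
struct pgio_args_separate {
    uint64_t offset;
    uint32_t count;
    const void *read_ctx;          /* only meaningful for reads (assumed field) */
    uint32_t write_stable;         /* only meaningful for writes (assumed field) */
    const uint32_t *write_bitmask; /* only meaningful for writes (assumed field) */
};

/* Same information, but the read and write fields overlap in a union,
 * because a given request is either a read or a write, never both. */
struct pgio_args_shared {
    uint64_t offset;
    uint32_t count;
    union {
        struct {
            const void *ctx;
        } read;
        struct {
            uint32_t stable;
            const uint32_t *bitmask;
        } write;
    } u;
};

/* The shared layout is never larger than the separate one. */
static_assert(sizeof(struct pgio_args_shared) <= sizeof(struct pgio_args_separate),
              "sharing storage must not grow the structure");

/* Number of bytes saved per structure on the current ABI (may be zero). */
size_t pgio_bytes_saved(void)
{
    return sizeof(struct pgio_args_separate) - sizeof(struct pgio_args_shared);
}
```

The trade-off is that callers must know which member of the union is live for a given request; reading the write fields of a read request is a bug.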

NFSv3: Improve NFSv3 performance when server returns no post-op attributes · 8d8928d8

Committed by Trond Myklebust on Mar 05, 2018

When the server fails to return post-op attributes, the client's
attempt to place read data directly in the page cache fails, and
so we have to do an extra copy in order to realign the data with
page borders.
This patch attempts to detect servers that don't return post-op
attributes on read (e.g. for pNFS) and adjusts the placement
calculation accordingly.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

8d8928d8

NFSv4: Split out NFS v4.2 copy completion functions · 80f42368

Committed by Anna Schumaker on Sep 20, 2018

The convention in the rest of the code is to have a separate function
for anything that might be ifdef-ed out.
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

80f42368

NFS: Reduce indentation of nfs4_recovery_handle_error() · 000d3f95

Committed by Anna Schumaker on Sep 11, 2018

This is to match kernel coding style for switch statements.
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

000d3f95

NFS: Reduce indentation of the switch statement in nfs4_reclaim_open_state() · 35a61606

Committed by Anna Schumaker on Sep 11, 2018

Most places in the kernel tend to line up cases with the switch to
reduce indentation, so move this over to match that style.
Additionally, I handle the (status >= 0) case in the switch so that we
only "goto restart" from a single place after error handling.
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

35a61606

NFS: Split out the body of nfs4_reclaim_open_state() · cb7a8384

Committed by Anna Schumaker on Sep 11, 2018

Moving all of this into a new function removes the need for cramped
indentation, making the code overall easier to look at. I also take
this chance to switch copy recovery over to using
nfs4_stateid_match_other().
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

cb7a8384

nfs4: flex_file: ignore synthetic uid/gid for tightly coupled DSes · 10ec57e4

Committed by Tigran Mkrtchyan on Aug 20, 2018

For tightly coupled DSes, the client must ignore the provided synthetic
uid and gid, as stated in draft-ietf-nfsv4-flex-files-19#section-5.1.
Signed-off-by: Tigran Mkrtchyan <tigran.mkrtchyan@desy.de>
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

10ec57e4

NFSv4.1: Fix the r/wsize checking · 943cff67

Committed by Trond Myklebust on Sep 18, 2018

The intention of nfs4_session_set_rwsize() was to cap the r/wsize to the
buffer sizes negotiated by the CREATE_SESSION. The initial code had a
bug whereby we would not check the values negotiated by nfs_probe_fsinfo()
(the assumption being that CREATE_SESSION will always negotiate buffer values
that are sane w.r.t. the server's preferred r/wsizes) but would only check
values set by the user in the 'mount' command.

The code was changed in 4.11 to _always_ set the r/wsize, meaning that we
now never use the server preferred r/wsizes. This is the regression that
this patch fixes.
Also rename the function to nfs4_session_limit_rwsize() in order to avoid
future confusion.

Fixes: 03385332 ("NFSv4.1 respect server's max size in CREATE_SESSION")
Cc: stable@vger.kernel.org # v4.11+
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

943cff67
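
Editor's sketch for the commit above: the regression it describes comes down to the difference between always overwriting a size with the session limit and only capping it. The two helpers below are hypothetical illustrations of that difference, not the kernel's `nfs4_session_limit_rwsize()`; `session_cap` is assumed to be the negotiated buffer size already reduced by any protocol overhead.

```c
#include <assert.h>
#include <stdint.h>

/* "Set" semantics: the previously chosen size (for example the server's
 * preferred r/wsize) is discarded and replaced by the session limit. */
uint32_t rwsize_always_set(uint32_t current, uint32_t session_cap)
{
    assert(session_cap > 0);
    (void)current; /* ignored, which is the behaviour described as a regression */
    return session_cap;
}

/* "Limit" semantics: the previously chosen size is kept unless it
 * exceeds the session limit, in which case it is reduced to the limit. */
uint32_t rwsize_limit(uint32_t current, uint32_t session_cap)
{
    assert(session_cap > 0);
    return current > session_cap ? session_cap : current;
}
```

With a preferred size of 65536 and a session cap of 1048576, `rwsize_limit()` keeps 65536 while `rwsize_always_set()` returns 1048576; when the preferred size is larger than the cap, both return the cap.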

NFSv4: Convert struct nfs4_state to use refcount_t · ace9fad4

Committed by Trond Myklebust on Sep 02, 2018

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

ace9fad4

NFSv4: Convert open state lookup to use RCU · 9ae075fd

Committed by Trond Myklebust on Sep 02, 2018

Further reduce contention on the inode->i_lock.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

9ae075fd

NFS: Convert lookups of the open context to RCU · 0de43976

Committed by Trond Myklebust on Sep 02, 2018

Reduce contention on the inode->i_lock by ensuring that we use RCU
when looking up the NFS open context.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

0de43976

NFS: Simplify internal check for whether file is open for write · 6ba0c4e5

Committed by Trond Myklebust on Sep 02, 2018

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

6ba0c4e5

NFS: Convert lookups of the lock context to RCU · 1db97eaa

Committed by Trond Myklebust on Sep 02, 2018

Speed up lookups of an existing lock context by avoiding the inode->i_lock,
and using RCU instead.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

1db97eaa

pNFS: Don't allocate more pages than we need to fit a layoutget response · 28ced9a8

Committed by Trond Myklebust on Sep 03, 2018

For the 'files' and 'flexfiles' layout types, we do not expect the reply
to be any larger than 4k. The block and scsi layout types are a little more
greedy, so we keep allocating the maximum response size for now.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

28ced9a8

pNFS: Don't zero out the array in nfs4_alloc_pages() · a2791d3a

Committed by Trond Myklebust on Sep 03, 2018

We don't need a zeroed out array, since it is immediately being filled.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

a2791d3a

SUNRPC: Unexport xdr_partial_copy_from_skb() · ec846469

Committed by Trond Myklebust on Sep 14, 2018

It is no longer used outside of net/sunrpc/socklib.c.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

ec846469

SUNRPC: Clean up xs_udp_data_receive() · 4f546149

Committed by Trond Myklebust on Sep 14, 2018

Simplify the retry logic.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

4f546149

SUNRPC: Allow AF_LOCAL sockets to use the generic stream receive · 550aebfe

Committed by Trond Myklebust on Sep 14, 2018

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

550aebfe

SUNRPC: Clean up - rename xs_tcp_data_receive() to xs_stream_data_receive() · c50b8ee0

Committed by Trond Myklebust on Sep 14, 2018

In preparation for sharing with AF_LOCAL.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

c50b8ee0

SUNRPC: Simplify TCP receive code by switching to using iterators · 277e4ab7

Committed by Trond Myklebust on Sep 14, 2018

Most of this code should also be reusable with other socket types.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

277e4ab7

SUNRPC: Add a bvec array to struct xdr_buf for use with iovec_iter() · 9d96acbc

Committed by Trond Myklebust on Sep 13, 2018

Add a bvec array to struct xdr_buf, and have the client allocate it
when we need to receive data into pages.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

9d96acbc

SUNRPC: Add a label for RPC calls that require allocation on receive · 431f6eb3

Committed by Trond Myklebust on Sep 16, 2018

If the RPC call relies on the receive call allocating pages as buffers,
then let's label it so that we
a) Don't leak memory by allocating pages for requests that do not expect
   this behaviour
b) Can optimise for the common case where calls do not require allocation.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

431f6eb3

SUNRPC: Convert the xprt->sending queue back to an ordinary wait queue · 79c99152

Committed by Trond Myklebust on Sep 09, 2018

We no longer need priority semantics on the xprt->sending queue, because
the order in which tasks are sent is now dictated by their position in
the send queue.
Note that the backlog queue remains a priority queue, meaning that
slot resources are still managed in order of task priority.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

79c99152

SUNRPC: Fix priority queue fairness · f42f7c28

Committed by Trond Myklebust on Sep 08, 2018

Fix up the priority queue to not batch by owner, but by queue, so that
we allow '1 << priority' elements to be dequeued before switching to
the next priority queue.
The owner field is still used to wake up requests in round robin order
by owner to avoid single processes hogging the RPC layer by loading the
queues.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

f42f7c28
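
Editor's sketch for the commit above: the "dequeue up to `1 << priority` elements, then switch to the next priority queue" rule can be modelled with a per-level budget. This is a simplified standalone model with hypothetical names (`prio_sched`, `NR_PRIO`); it tracks only per-level counts, ignores the round-robin-by-owner wakeup order, and is not the kernel's RPC wait queue code.

```c
#include <assert.h>
#include <stddef.h>

#define NR_PRIO 3 /* hypothetical number of priority levels, 0 = lowest */

struct prio_sched {
    unsigned int pending[NR_PRIO]; /* queued items per level */
    int cur;                       /* level currently being served */
    unsigned int budget;           /* dequeues left before switching level */
};

void prio_sched_init(struct prio_sched *s)
{
    assert(s != NULL);
    for (int i = 0; i < NR_PRIO; i++)
        s->pending[i] = 0;
    s->cur = NR_PRIO - 1;      /* start with the highest level */
    s->budget = 1u << s->cur;  /* batch size is 1 << priority */
}

/* Returns the level of the dequeued item, or -1 if every queue is empty. */
int prio_sched_dequeue(struct prio_sched *s)
{
    assert(s != NULL);
    /* NR_PRIO + 1 attempts: enough to walk every level once and come
     * back to the starting level with a refreshed budget. */
    for (int tries = 0; tries <= NR_PRIO; tries++) {
        if (s->budget > 0 && s->pending[s->cur] > 0) {
            s->pending[s->cur]--;
            s->budget--;
            return s->cur;
        }
        /* Budget spent or level empty: move to the next lower level,
         * wrapping around to the highest one. */
        s->cur = (s->cur == 0) ? NR_PRIO - 1 : s->cur - 1;
        s->budget = 1u << s->cur;
    }
    return -1;
}
```

In this model a busy high-priority level gets larger batches than lower ones, but lower levels still get a turn once the batch is used up, so no level is starved.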

SUNRPC: Convert xprt receive queue to use an rbtree · 95f7691d

Committed by Trond Myklebust on Sep 07, 2018

If the server is slow, we can find ourselves with quite a lot of entries
on the receive queue. Converting the search from an O(n) to O(log(n))
can make a significant difference, particularly since we have to hold
a number of locks while searching.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

95f7691d

SUNRPC: Don't take transport->lock unnecessarily when taking XPRT_LOCK · bd79bc57

Committed by Trond Myklebust on Sep 07, 2018

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

bd79bc57

SUNRPC: Cleanup: remove the unused 'task' argument from the request_send() · adfa7144

Committed by Trond Myklebust on Sep 03, 2018

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

adfa7144

SUNRPC: Clean up transport write space handling · c544577d

Committed by Trond Myklebust on Sep 03, 2018

Treat socket write space handling in the same way we now treat transport
congestion: by denying the XPRT_LOCK until the transport signals that it
has free buffer space.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

c544577d

SUNRPC: Turn off throttling of RPC slots for TCP sockets · 36bd7de9

Committed by Trond Myklebust on Sep 03, 2018

The theory was that we would need to grab the socket lock anyway, so we
might as well use it to gate the allocation of RPC slots for a TCP
socket.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

36bd7de9

SUNRPC: Allow soft RPC calls to time out when waiting for the XPRT_LOCK · f05d54ec

Committed by Trond Myklebust on Sep 03, 2018

This no longer causes them to lose their place in the transmission queue.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

f05d54ec

SUNRPC: Allow calls to xprt_transmit() to drain the entire transmit queue · 89f90fe1

Committed by Trond Myklebust on Aug 29, 2018

Rather than forcing each and every RPC task to grab the socket write
lock in order to send itself, we allow whichever task is holding the
write lock to attempt to drain the entire transmit queue.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

89f90fe1

SUNRPC: Enqueue swapper tagged RPCs at the head of the transmit queue · 86aeee0e

Committed by Trond Myklebust on Sep 08, 2018

Avoid memory starvation by giving RPCs that are tagged with the
RPC_TASK_SWAPPER flag the highest priority.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

86aeee0e

SUNRPC: Support for congestion control when queuing is enabled · 75891f50

Committed by Trond Myklebust on Sep 03, 2018

Both RDMA and UDP transports require the request to get a "congestion control"
credit before they can be transmitted. Right now, this is done when
the request locks the socket. We'd like it to happen when a request attempts
to be transmitted for the first time.
In order to support retransmission of requests that already hold such
credits, we also want to ensure that they get queued first, so that we
don't deadlock with requests that have yet to obtain a credit.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

75891f50

SUNRPC: Improve latency for interactive tasks · 918f3c1f

Committed by Trond Myklebust on Sep 09, 2018

One of the intentions with the priority queues was to ensure that no
single process can hog the transport. The field task->tk_owner therefore
identifies the RPC call's origin, and is intended to allow the RPC layer
to organise queues for fairness.
This commit therefore modifies the transmit queue to group requests
by task->tk_owner, and ensures that we round robin among those groups.
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

918f3c1f

SUNRPC: Move RPC retransmission stat counter to xprt_transmit() · dcbbeda8

Committed by Trond Myklebust on Sep 01, 2018

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

dcbbeda8

openeuler / Kernel · synced successfully more than 1 year ago
