提交 · 1f7d1c73c58a3d07a951ce23acfb4ec91a31d1e9 · openeuler / Kernel

26 4月, 2019 7 次提交

SUNRPC: Update comments based on recent changes · 1f7d1c73

由 Chuck Lever 提交于 4月 24, 2019

Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

1f7d1c73

SUNRPC: Start the first major timeout calculation at task creation · da953063

由 Trond Myklebust 提交于 4月 07, 2019

When calculating the major timeout for a new task, when we know that the
connection has been broken, use the task->tk_start to ensure that we also
take into account the time spent waiting for a slot or session slot. This
ensures that we fail over soft requests relatively quickly once the
connection has actually been broken, and the first requests have
started to fail.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

da953063

SUNRPC: Ensure that the transport layer respect major timeouts · 9e910bff

由 Trond Myklebust 提交于 4月 07, 2019

Ensure that when in the transport layer, we don't sleep past
a major timeout.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

9e910bff

SUNRPC: Declare RPC timers as TIMER_DEFERRABLE · 43123581

由 Trond Myklebust 提交于 4月 07, 2019

Don't wake idle CPUs only for the purpose of servicing an RPC
queue timeout.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

43123581

SUNRPC: Add function rpc_sleep_on_timeout() · 6b2e6856

由 Trond Myklebust 提交于 4月 07, 2019

Clean up the RPC task sleep interfaces by replacing the task->tk_timeout
'hidden parameter' to rpc_sleep_on() with a new function that takes an
absolute timeout.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

6b2e6856

SUNRPC: Refactor xprt_request_wait_receive() · 8ba6a92d

由 Trond Myklebust 提交于 4月 07, 2019

Convert the transport callback to actually put the request to sleep
instead of just setting a timeout. This is in preparation for
rpc_sleep_on_timeout().
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

8ba6a92d

SUNRPC: Fix up task signalling · ae67bd38

由 Trond Myklebust 提交于 4月 07, 2019

The RPC_TASK_KILLED flag should really not be set from another context
because it can clobber data in the struct task when task->tk_flags is
changed non-atomically.
Let's therefore swap out RPC_TASK_KILLED with an atomic flag, and add
a function to set that flag and safely wake up the task.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

ae67bd38

16 3月, 2019 1 次提交

SUNRPC: Use the ENOTCONN error on socket disconnect · 27adc785

由 Trond Myklebust 提交于 3月 15, 2019

When the socket is closed, we currently send an EAGAIN error to all
pending requests in order to ask them to retransmit. Use ENOTCONN
instead, to ensure that they try to reconnect before attempting to
transmit.
This also helps SOFTCONN tasks to behave correctly in this
situation.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

27adc785

02 3月, 2019 1 次提交

NFSv4/flexfiles: Abort I/O early if the layout segment was invalidated · a79f194a

由 Trond Myklebust 提交于 2月 27, 2019

If a layout segment gets invalidated while a pNFS I/O operation
is queued for transmission, then we ideally want to abort
immediately. This is particularly the case when there is a large
number of I/O related RPCs queued in the RPC layer, and the layout
segment gets invalidated due to an ENOSPC error, or an EACCES (because
the client was fenced). We may end up forced to spam the MDS with a
lot of otherwise unnecessary LAYOUTERRORs after that I/O fails.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

a79f194a

21 2月, 2019 3 次提交

SUNRPC: Convert socket page send code to use iov_iter() · 0472e476

由 Trond Myklebust 提交于 2月 19, 2019

Simplify the page send code using iov_iter and bvecs.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

0472e476

SUNRPC: Ensure rq_bytes_sent is reset before request transmission · b9779a54

由 Trond Myklebust 提交于 1月 02, 2019

When we resend a request, ensure that the 'rq_bytes_sent' is reset
to zero.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

b9779a54

SUNRPC: Set memalloc_nofs_save() on all rpciod/xprtiod jobs · a1231fda

由 Trond Myklebust 提交于 2月 18, 2019

Set memalloc_nofs_save() on all the rpciod/xprtiod jobs so that we
ensure memory allocations for asynchronous rpc calls don't ever end
up recursing back to the NFS layer for memory reclaim.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

a1231fda

14 2月, 2019 1 次提交

SUNRPC: Introduce trace points in rpc_auth_gss.ko · 0c77668d

由 Chuck Lever 提交于 2月 11, 2019

Add infrastructure for trace points in the RPC_AUTH_GSS kernel
module, and add a few sample trace points. These report exceptional
or unexpected events, and observe the assignment of GSS sequence
numbers.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

0c77668d

16 1月, 2019 2 次提交

SUNRPC: Address Kerberos performance/behavior regression · deaa5c96

由 Chuck Lever 提交于 1月 09, 2019

When using Kerberos with v4.20, I've observed frequent connection
loss on heavy workloads. I traced it down to the client underrunning
the GSS sequence number window -- NFS servers are required to drop
the RPC with the low sequence number, and also drop the connection
to signal that an RPC was dropped.

Bisected to commit 918f3c1f ("SUNRPC: Improve latency for
interactive tasks").

I've got a one-line workaround for this issue, which is easy to
backport to v4.20 while a more permanent solution is being derived.
Essentially, tk_owner-based sorting is disabled for RPCs that carry
a GSS sequence number.

Fixes: 918f3c1f ("SUNRPC: Improve latency for interactive ... ")
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

deaa5c96

SUNRPC: Ensure rq_bytes_sent is reset before request transmission · e66721f0

由 Trond Myklebust 提交于 1月 02, 2019

When we resend a request, ensure that the 'rq_bytes_sent' is reset
to zero.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

e66721f0

19 12月, 2018 2 次提交

SUNRPC: Remove xprt_connect_status() · abc13275

由 Trond Myklebust 提交于 12月 17, 2018

Over the years, xprt_connect_status() has been superseded by
call_connect_status(), which now handles all the errors that
xprt_connect_status() does and more. Since the latter converts
all errors that it doesn't recognise to EIO, then it is time
for it to be retired.
Reported-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Tested-by: NChuck Lever <chuck.lever@oracle.com>

abc13275

SUNRPC: Fix disconnection races · 0445f92c

由 Trond Myklebust 提交于 12月 17, 2018

When the socket is closed, we need to call xprt_disconnect_done() in order
to clean up the XPRT_WRITE_SPACE flag, and wake up the sleeping tasks.

However, we also want to ensure that we don't wake them up before the socket
is closed, since that would cause thundering herd issues with everyone
piling up to retransmit before the TCP shutdown dance has completed.
Only the task that holds XPRT_LOCKED needs to wake up early in order to
allow the close to complete.
Reported-by: NDave Wysochanski <dwysocha@redhat.com>
Reported-by: NScott Mayhew <smayhew@redhat.com>
Cc: Chuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Tested-by: NChuck Lever <chuck.lever@oracle.com>

0445f92c

02 12月, 2018 2 次提交

SUNRPC: Fix a potential race in xprt_connect() · 0a9a4304

由 Trond Myklebust 提交于 12月 01, 2018

If an asynchronous connection attempt completes while another task is
in xprt_connect(), then the call to rpc_sleep_on() could end up
racing with the call to xprt_wake_pending_tasks().
So add a second test of the connection state after we've put the
task to sleep and set the XPRT_CONNECTING flag, when we know that there
can be no asynchronous connection attempts still in progress.

Fixes: 0b9e7943 ("SUNRPC: Move the test for XPRT_CONNECTING into...")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

0a9a4304

SUNRPC: Fix a memory leak in call_encode() · 71700bb9

由 Trond Myklebust 提交于 11月 30, 2018

If we retransmit an RPC request, we currently end up clobbering the
value of req->rq_rcv_buf.bvec that was allocated by the initial call to
xprt_request_prepare(req).
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

71700bb9

03 10月, 2018 1 次提交

sunrpc: Fix connect metrics · 3968a8a5

由 Chuck Lever 提交于 10月 01, 2018

For TCP, the logic in xprt_connect_status is currently never invoked
to record a successful connection. Commit 2a491991 ("SUNRPC:
Return EAGAIN instead of ENOTCONN when waking up xprt->pending")
changed the way TCP xprt's are awoken after a connect succeeds.

Instead, change connection-oriented transports to bump connect_count
and compute connect_time the moment that XPRT_CONNECTED is set.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

3968a8a5

01 10月, 2018 20 次提交

SUNRPC: Add a bvec array to struct xdr_buf for use with iovec_iter() · 9d96acbc

由 Trond Myklebust 提交于 9月 13, 2018

Add a bvec array to struct xdr_buf, and have the client allocate it
when we need to receive data into pages.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

9d96acbc

SUNRPC: Convert the xprt->sending queue back to an ordinary wait queue · 79c99152

由 Trond Myklebust 提交于 9月 09, 2018

We no longer need priority semantics on the xprt->sending queue, because
the order in which tasks are sent is now dictated by their position in
the send queue.
Note that the backlog queue remains a priority queue, meaning that
slot resources are still managed in order of task priority.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

79c99152

SUNRPC: Convert xprt receive queue to use an rbtree · 95f7691d

由 Trond Myklebust 提交于 9月 07, 2018

If the server is slow, we can find ourselves with quite a lot of entries
on the receive queue. Converting the search from an O(n) to O(log(n))
can make a significant difference, particularly since we have to hold
a number of locks while searching.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

95f7691d

T
SUNRPC: Don't take transport->lock unnecessarily when taking XPRT_LOCK · bd79bc57
由 Trond Myklebust 提交于 9月 07, 2018
```
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
```
bd79bc57
T
SUNRPC: Cleanup: remove the unused 'task' argument from the request_send() · adfa7144
由 Trond Myklebust 提交于 9月 03, 2018
```
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
```
adfa7144

SUNRPC: Clean up transport write space handling · c544577d

由 Trond Myklebust 提交于 9月 03, 2018

Treat socket write space handling in the same way we now treat transport
congestion: by denying the XPRT_LOCK until the transport signals that it
has free buffer space.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

c544577d

SUNRPC: Turn off throttling of RPC slots for TCP sockets · 36bd7de9

由 Trond Myklebust 提交于 9月 03, 2018

The theory was that we would need to grab the socket lock anyway, so we
might as well use it to gate the allocation of RPC slots for a TCP
socket.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

36bd7de9

SUNRPC: Allow soft RPC calls to time out when waiting for the XPRT_LOCK · f05d54ec

由 Trond Myklebust 提交于 9月 03, 2018

This no longer causes them to lose their place in the transmission queue.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

f05d54ec

SUNRPC: Allow calls to xprt_transmit() to drain the entire transmit queue · 89f90fe1

由 Trond Myklebust 提交于 8月 29, 2018

Rather than forcing each and every RPC task to grab the socket write
lock in order to send itself, we allow whichever task is holding the
write lock to attempt to drain the entire transmit queue.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

89f90fe1

SUNRPC: Enqueue swapper tagged RPCs at the head of the transmit queue · 86aeee0e

由 Trond Myklebust 提交于 9月 08, 2018

Avoid memory starvation by giving RPCs that are tagged with the
RPC_TASK_SWAPPER flag the highest priority.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

86aeee0e

SUNRPC: Support for congestion control when queuing is enabled · 75891f50

由 Trond Myklebust 提交于 9月 03, 2018

Both RDMA and UDP transports require the request to get a "congestion control"
credit before they can be transmitted. Right now, this is done when
the request locks the socket. We'd like it to happen when a request attempts
to be transmitted for the first time.
In order to support retransmission of requests that already hold such
credits, we also want to ensure that they get queued first, so that we
don't deadlock with requests that have yet to obtain a credit.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

75891f50

SUNRPC: Improve latency for interactive tasks · 918f3c1f

由 Trond Myklebust 提交于 9月 09, 2018

One of the intentions with the priority queues was to ensure that no
single process can hog the transport. The field task->tk_owner therefore
identifies the RPC call's origin, and is intended to allow the RPC layer
to organise queues for fairness.
This commit therefore modifies the transmit queue to group requests
by task->tk_owner, and ensures that we round robin among those groups.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

918f3c1f

T
SUNRPC: Move RPC retransmission stat counter to xprt_transmit() · dcbbeda8
由 Trond Myklebust 提交于 9月 01, 2018
```
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
```
dcbbeda8

SUNRPC: Simplify xprt_prepare_transmit() · 5f2f6bd9

由 Trond Myklebust 提交于 9月 01, 2018

Remove the checks for whether or not we need to transmit, and whether
or not a reply has been received. Those are already handled in
call_transmit() itself.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

5f2f6bd9

SUNRPC: Don't reset the request 'bytes_sent' counter when releasing XPRT_LOCK · 04b3b88f

由 Trond Myklebust 提交于 9月 01, 2018

If the request is still on the queue, this will be incorrect behaviour.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

04b3b88f

SUNRPC: Treat the task and request as separate in the xprt_ops->send_request() · 50f484e2

由 Trond Myklebust 提交于 8月 30, 2018

When we shift to using the transmit queue, then the task that holds the
write lock will not necessarily be the same as the one being transmitted.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

50f484e2

SUNRPC: Fix up the back channel transmit · 902c5887

由 Trond Myklebust 提交于 9月 01, 2018

Fix up the back channel code to recognise that it has already been
transmitted, so does not need to be called again.
Also ensure that we set req->rq_task.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

902c5887

SUNRPC: Refactor RPC call encoding · 762e4e67

由 Trond Myklebust 提交于 8月 24, 2018

Move the call encoding so that it occurs before the transport connection
etc.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

762e4e67

SUNRPC: Add a transmission queue for RPC requests · 944b0429

由 Trond Myklebust 提交于 8月 09, 2018

Add the queue that will enforce the ordering of RPC task transmission.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

944b0429

SUNRPC: Distinguish between the slot allocation list and receive queue · ef3f5434

由 Trond Myklebust 提交于 8月 08, 2018

When storing a struct rpc_rqst on the slot allocation list, we currently
use the same field 'rq_list' as we use to store the request on the
receive queue. Since the structure is never on both lists at the same
time, this is OK.
However, for clarity, let's make that a union with different names for
the different lists so that we can more easily distinguish between
the two states.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

ef3f5434

openeuler / Kernel 大约 1 年 前同步成功

openeuler / Kernel
大约 1 年前同步成功