提交 · c209e49ceac0ff479f79ac5cd2fbf8be80621203 · openeuler / Kernel

26 4月, 2019 28 次提交

xprtrdma: More Send completion batching · c209e49c

由 Chuck Lever 提交于 4月 24, 2019

Instead of using a fixed number, allow the amount of Send completion
batching to vary based on the client's maximum credit limit.

- A larger default gives a small boost to IOPS throughput

- Reducing it based on max_requests gives a safe result when the
  max credit limit is cranked down (eg. when the device has a small
  max_qp_wr).
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

c209e49c

xprtrdma: Clean up sendctx functions · dbcc53a5

由 Chuck Lever 提交于 4月 24, 2019

Minor clean-ups I've stumbled on since sendctx was merged last year.
In particular, making Send completion processing more efficient
appears to have a measurable impact on IOPS throughput.

Note: test_and_clear_bit() returns a value, thus an explicit memory
barrier is not necessary.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

dbcc53a5

xprtrdma: Trace marshaling failures · 17e4c443

由 Chuck Lever 提交于 4月 24, 2019

Record an event when rpcrdma_marshal_req returns a non-zero return
value to help track down why an xprt close might have occurred.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

17e4c443

xprtrdma: Increase maximum number of backchannel requests · 4ba02e8d

由 Chuck Lever 提交于 4月 24, 2019

Reflects the change introduced in commit 067c4696 ("NFSv4.1:
Bump the default callback session slot count to 16").
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

4ba02e8d

xprtrdma: Backchannel can use GFP_KERNEL allocations · 3f9c7e76

由 Chuck Lever 提交于 4月 24, 2019

The Receive handler runs in process context, thus can use on-demand
GFP_KERNEL allocations instead of pre-allocation.

This makes the xprtrdma backchannel independent of the number of
backchannel session slots provisioned by the Upper Layer protocol.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

3f9c7e76

xprtrdma: Clean up regbuf helpers · d2832af3

由 Chuck Lever 提交于 4月 24, 2019

For code legibility, clean up the function names to be consistent
with the pattern: "rpcrdma" _ object-type _ action

Also rpcrdma_regbuf_alloc and rpcrdma_regbuf_free no longer have any
callers outside of verbs.c, and can thus be made static.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

d2832af3

xprtrdma: De-duplicate "allocate new, free old regbuf" · 0f665ceb

由 Chuck Lever 提交于 4月 24, 2019

Clean up by providing an API to do this common task.

At this point, the difference between rpcrdma_get_sendbuf and
rpcrdma_get_recvbuf has become tiny. These can be collapsed into a
single helper.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

0f665ceb

xprtrdma: Allocate req's regbufs at xprt create time · bb93a1ae

由 Chuck Lever 提交于 4月 24, 2019

Allocating an rpcrdma_req's regbufs at xprt create time enables
a pair of micro-optimizations:

First, if these regbufs are always there, we can eliminate two
conditional branches from the hot xprt_rdma_allocate path.

Second, by allocating a 1KB buffer, it places a lower bound on the
size of these buffers, without adding yet another conditional
branch. The lower bound reduces the number of hardway re-
allocations. In fact, for some workloads it completely eliminates
hardway allocations.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

bb93a1ae

xprtrdma: rpcrdma_regbuf alignment · 8cec3dba

由 Chuck Lever 提交于 4月 24, 2019

Allocate the struct rpcrdma_regbuf separately from the I/O buffer
to better guarantee the alignment of the I/O buffer and eliminate
the wasted space between the rpcrdma_regbuf metadata and the buffer
itself.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

8cec3dba

xprtrdma: Clean up rpcrdma_create_rep() and rpcrdma_destroy_rep() · 23146500

由 Chuck Lever 提交于 4月 24, 2019

For code legibility, clean up the function names to be consistent
with the pattern: "rpcrdma" _ object-type _ action
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

23146500

xprtrdma: Clean up rpcrdma_create_req() · 1769e6a8

由 Chuck Lever 提交于 4月 24, 2019

Eventually, I'd like to invoke rpcrdma_create_req() during the
call_reserve step. Memory allocation there probably needs to use
GFP_NOIO. Therefore a set of GFP flags needs to be passed in.

As an additional clean up, just return a pointer or NULL, because
the only error return code here is -ENOMEM.

Lastly, clean up the function names to be consistent with the
pattern: "rpcrdma" _ object-type _ action
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

1769e6a8

xprtrdma: Fix an frwr_map recovery nit · b2ca473b

由 Chuck Lever 提交于 4月 24, 2019

After a DMA map failure in frwr_map, mark the MR so that recycling
won't attempt to DMA unmap it.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Fixes: e2f34e26 ("xprtrdma: Yet another double DMA-unmap")
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

b2ca473b

SUNRPC: Avoid digging into the ATOMIC pool · 52db6f9a

由 Chuck Lever 提交于 4月 24, 2019

Page allocation requests made when the SPARSE_PAGES flag is set are
allowed to fail, and are not critical. No need to spend a rare
resource.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

52db6f9a

SUNRPC: Add the 'softerr' rpc_client flag · ae6ec918

由 Trond Myklebust 提交于 4月 07, 2019

Add the 'softerr' rpc client flag that sets the RPC_TASK_TIMEOUT
flag on all new rpc tasks that are attached to that rpc client.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

ae6ec918

SUNRPC: Ensure to ratelimit the "server not responding" syslog messages · 0729d995

由 Trond Myklebust 提交于 4月 07, 2019

In particular, the timeout messages can be very noisy, so we ought to
ratelimit them in order to avoid spamming the syslog.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

0729d995

SUNRPC: Start the first major timeout calculation at task creation · da953063

由 Trond Myklebust 提交于 4月 07, 2019

When calculating the major timeout for a new task, when we know that the
connection has been broken, use the task->tk_start to ensure that we also
take into account the time spent waiting for a slot or session slot. This
ensures that we fail over soft requests relatively quickly once the
connection has actually been broken, and the first requests have
started to fail.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

da953063

SUNRPC: Make "no retrans timeout" soft tasks behave like softconn for timeouts · e4ec48d3

由 Trond Myklebust 提交于 4月 07, 2019

If a soft NFSv4 request is sent, then we don't need it to time out unless
the connection breaks. The reason is that as long as the connection is
unbroken, the protocol states that the server is not allowed to drop the
request. IOW: as long as the connection remains unbroken, the client may
assume that all transmitted RPC requests are being processed by the server,
and that retransmissions and timeouts of those requests are unwarranted.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

e4ec48d3

SUNRPC: Add tracking of RPC level errors · 5ad64b36

由 Trond Myklebust 提交于 4月 07, 2019

Add variables to track RPC level errors so that we can distinguish
between issue that arose in the RPC transport layer as opposed to
those arising from the reply message.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

5ad64b36

SUNRPC: Ensure that the transport layer respect major timeouts · 9e910bff

由 Trond Myklebust 提交于 4月 07, 2019

Ensure that when in the transport layer, we don't sleep past
a major timeout.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

9e910bff

SUNRPC: Declare RPC timers as TIMER_DEFERRABLE · 43123581

由 Trond Myklebust 提交于 4月 07, 2019

Don't wake idle CPUs only for the purpose of servicing an RPC
queue timeout.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

43123581

SUNRPC: Simplify queue timeouts using timer_reduce() · 24a9d9a2

由 Trond Myklebust 提交于 4月 07, 2019

Simplify the setting of queue timeouts by using the timer_reduce()
function.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

24a9d9a2

SUNRPC: Fix up tracking of timeouts · 5efd1876

由 Trond Myklebust 提交于 4月 07, 2019

Add a helper to ensure that debugfs and friends print out the
correct current task timeout value.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

5efd1876

SUNRPC: Add function rpc_sleep_on_timeout() · 6b2e6856

由 Trond Myklebust 提交于 4月 07, 2019

Clean up the RPC task sleep interfaces by replacing the task->tk_timeout
'hidden parameter' to rpc_sleep_on() with a new function that takes an
absolute timeout.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

6b2e6856

SUNRPC: Remove unused argument 'action' from rpc_sleep_on_priority() · 8357a9b6

由 Trond Myklebust 提交于 4月 07, 2019

None of the callers set the 'action' argument, so let's just remove it.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

8357a9b6

SUNRPC: Refactor rpc_sleep_on() · 87150aae

由 Trond Myklebust 提交于 4月 07, 2019

rpc_sleep_on() does not need to set the task->tk_callback under the
queue lock, so move that out.
Also refactor the check for whether the task is active.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

87150aae

SUNRPC: Refactor xprt_request_wait_receive() · 8ba6a92d

由 Trond Myklebust 提交于 4月 07, 2019

Convert the transport callback to actually put the request to sleep
instead of just setting a timeout. This is in preparation for
rpc_sleep_on_timeout().
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

8ba6a92d

SUNRPC: Refactor rpc_restart_call/rpc_restart_call_prepare · 9e6fa0bb

由 Trond Myklebust 提交于 4月 07, 2019

Clean up.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

9e6fa0bb

SUNRPC: Fix up task signalling · ae67bd38

由 Trond Myklebust 提交于 4月 07, 2019

The RPC_TASK_KILLED flag should really not be set from another context
because it can clobber data in the struct task when task->tk_flags is
changed non-atomically.
Let's therefore swap out RPC_TASK_KILLED with an atomic flag, and add
a function to set that flag and safely wake up the task.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

ae67bd38

18 4月, 2019 1 次提交

SUNRPC: Ignore queue transmission errors on successful transmission · a7b1a483

由 Trond Myklebust 提交于 4月 15, 2019

If a request transmission fails due to write space or slot unavailability
errors, but the queued task then gets transmitted before it has time to
process the error in call_transmit_status() or call_bc_transmit_status(),
we need to suppress the transmission error code to prevent it from leaking
out of the RPC layer.
Reported-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Tested-by: NChuck Lever <chuck.lever@oracle.com>

a7b1a483

12 4月, 2019 2 次提交

Revert "SUNRPC: Micro-optimise when the task is known not to be sleeping" · af6b61d7

由 Trond Myklebust 提交于 4月 11, 2019

This reverts commit 009a82f6.

The ability to optimise here relies on compiler being able to optimise
away tail calls to avoid stack overflows. Unfortunately, we are seeing
reports of problems, so let's just revert.
Reported-by: NDaniel Mack <daniel@zonque.org>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

af6b61d7

xprtrdma: Fix helper that drains the transport · e1ede312

由 Chuck Lever 提交于 4月 09, 2019

We want to drain only the RQ first. Otherwise the transport can
deadlock on ->close if there are outstanding Send completions.

Fixes: 6d2d0ee2 ("xprtrdma: Replace rpcrdma_receive_wq ... ")
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Cc: stable@vger.kernel.org # v5.0+
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

e1ede312

27 3月, 2019 1 次提交

SUNRPC: fix uninitialized variable warning · 01f2f5b8

由 Alakesh Haloi 提交于 3月 26, 2019

Avoid following compiler warning on uninitialized variable

net/sunrpc/xprtsock.c: In function ‘xs_read_stream_request.constprop’:
net/sunrpc/xprtsock.c:525:10: warning: ‘read’ may be used uninitialized in this function [-Wmaybe-uninitialized]
   return read;
          ^~~~
net/sunrpc/xprtsock.c:529:23: warning: ‘ret’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  return ret < 0 ? ret : read;
         ~~~~~~~~~~~~~~^~~~~~
Signed-off-by: NAlakesh Haloi <alakesh.haloi@gmail.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

01f2f5b8

23 3月, 2019 1 次提交

SUNRPC: Don't let RPC_SOFTCONN tasks time out if the transport is connected · d84dd3fb

由 Trond Myklebust 提交于 3月 19, 2019

If the transport is still connected, then we do want to allow
RPC_SOFTCONN tasks to retry. They should time out if and only if
the connection is broken.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

d84dd3fb

16 3月, 2019 6 次提交

SUNRPC: Remove redundant check for the reply length in call_decode() · 5e3863fd

由 Trond Myklebust 提交于 3月 15, 2019

Now that we're using the xdr_stream functions to decode the header,
the test for the minimum reply length is redundant.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

5e3863fd

SUNRPC: Handle the SYSTEM_ERR rpc error · 928d42f7

由 Trond Myklebust 提交于 3月 15, 2019

Handle the SYSTEM_ERR rpc error by retrying the RPC call as if it
were a garbage argument.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

928d42f7

SUNRPC: rpc_decode_header() must always return a non-zero value on error · eb90a16e

由 Trond Myklebust 提交于 3月 15, 2019

Ensure that when the "garbage args" case falls through, we do set
an error of EIO.

Fixes: a0584ee9 ("SUNRPC: Use struct xdr_stream when decoding...")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

eb90a16e

SUNRPC: Use the ENOTCONN error on socket disconnect · 27adc785

由 Trond Myklebust 提交于 3月 15, 2019

When the socket is closed, we currently send an EAGAIN error to all
pending requests in order to ask them to retransmit. Use ENOTCONN
instead, to ensure that they try to reconnect before attempting to
transmit.
This also helps SOFTCONN tasks to behave correctly in this
situation.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

27adc785

SUNRPC: Fix the minimal size for reply buffer allocation · 51314960

由 Trond Myklebust 提交于 3月 15, 2019

We must at minimum allocate enough memory to be able to see any auth
errors in the reply from the server.

Fixes: 2c94b8ec ("SUNRPC: Use au_rslack when computing reply...")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

51314960

SUNRPC: Fix a client regression when handling oversized replies · 9734ad57

由 Trond Myklebust 提交于 3月 15, 2019

If the server sends a reply that is larger than the pre-allocated
buffer, then the current code may fail to register how much of
the stream that it has finished reading. This again can lead to
hangs.

Fixes: e92053a5 ("SUNRPC: Handle zero length fragments correctly")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

9734ad57

11 3月, 2019 1 次提交

SUNRPC: Take the transport send lock before binding+connecting · 4d6c671a

由 Trond Myklebust 提交于 3月 10, 2019

Before trying to bind a port, ensure we grab the send lock to
ensure that we don't change the port while another task is busy
transmitting requests.
The connect code already takes the send lock in xprt_connect(),
but it is harmless to take it before that.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

4d6c671a

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功