提交 · 1da38549dd64c7f5dd22427f12dfa8db3d8a722b · openeuler / Kernel

01 10月, 2021 1 次提交

SUNRPC: fix sign error causing rpcsec_gss drops · 2ba5acfb

由 J. Bruce Fields 提交于 10月 01, 2021

If sd_max is unsigned, then sd_max - GSS_SEQ_WIN is a very large number
whenever sd_max is less than GSS_SEQ_WIN, and the comparison:

	seq_num <= sd->sd_max - GSS_SEQ_WIN

in gss_check_seq_num is pretty much always true, even when that's
clearly not what was intended.

This was causing pynfs to hang when using krb5, because pynfs uses zero
as the initial gss sequence number.  That's perfectly legal, but this
logic error causes knfsd to drop the rpc in that case.  Out-of-order
sequence IDs in the first GSS_SEQ_WIN (128) calls will also cause this.

Fixes: 10b9d99a ("SUNRPC: Augment server-side rpcgss tracepoints")
Cc: stable@vger.kernel.org
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>

2ba5acfb

04 9月, 2021 1 次提交

SUNRPC: improve error response to over-size gss credential · 0c217d50

由 NeilBrown 提交于 9月 02, 2021

When the NFS server receives a large gss (kerberos) credential and tries
to pass it up to rpc.svcgssd (which is deprecated), it triggers an
infinite loop in cache_read().

cache_request() always returns -EAGAIN, and this causes a "goto again".

This patch:
 - changes the error to -E2BIG to avoid the infinite loop, and
 - generates a WARN_ONCE when rsi_request first sees an over-sized
   credential.  The warning suggests switching to gssproxy.

Link: https://bugzilla.kernel.org/show_bug.cgi?id=196583Signed-off-by: NNeilBrown <neilb@suse.de>
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>

0c217d50

01 9月, 2021 1 次提交

SUNRPC: don't pause on incomplete allocation · e38b3f20

由 NeilBrown 提交于 8月 30, 2021

alloc_pages_bulk_array() attempts to allocate at least one page based on
the provided pages, and then opportunistically allocates more if that
can be done without dropping the spinlock.

So if it returns fewer than requested, that could just mean that it
needed to drop the lock.  In that case, try again immediately.

Only pause for a time if no progress could be made.
Reported-and-tested-by: NMike Javorski <mike.javorski@gmail.com>
Reported-and-tested-by: NLothar Paltins <lopa@mailbox.org>
Fixes: f6e70aab ("SUNRPC: refresh rq_pages using a bulk page allocator")
Signed-off-by: NNeilBrown <neilb@suse.de>
Acked-by: NMel Gorman <mgorman@suse.com>
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>

e38b3f20

28 8月, 2021 5 次提交

SUNRPC enforce creation of no more than max_connect xprts · dc48e0ab

由 Olga Kornievskaia 提交于 8月 27, 2021

If we are adding new transports via rpc_clnt_test_and_add_xprt()
then check if we've reached the limit. Currently only pnfs path
adds transports via that function but this is done in
preparation when the client would add new transports when
session trunking is detected. A warning is logged if the
limit is reached.
Signed-off-by: NOlga Kornievskaia <kolga@netapp.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

dc48e0ab

SUNRPC add xps_nunique_destaddr_xprts to xprt_switch_info in sysfs · df205d0a

由 Olga Kornievskaia 提交于 8月 27, 2021

In sysfs's xprt_switch_info attribute also display the value of
number of transports with unique destination addresses for this
xprt_switch.
Signed-off-by: NOlga Kornievskaia <kolga@netapp.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

df205d0a

SUNRPC keep track of number of transports to unique addresses · 3a3f9766

由 Olga Kornievskaia 提交于 8月 27, 2021

Currently, xprt_switch keeps a number of all xprts (xps_nxprts)
that were added to the switch regardless of whethere it's an
nconnect transport or a transport to a trunkable address.
Introduce a new counter to keep track of transports to unique
destination addresses per xprt_switch.
Signed-off-by: NOlga Kornievskaia <kolga@netapp.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

3a3f9766

SUNRPC: Tweak TCP socket shutdown in the RPC client · 7c81e6a9

由 Trond Myklebust 提交于 8月 24, 2021

We only really need to call shutdown() if we're in the ESTABLISHED TCP
state, since that is the only case where the client is initiating a
close of an established connection.

If the socket is in FIN_WAIT1 or FIN_WAIT2, then we've already initiated
socket shutdown and are waiting for the server's reply, so do nothing.

In all other cases where we've already received a FIN from the server,
we should be able to just close the socket.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

7c81e6a9

SUNRPC: Simplify socket shutdown when not reusing TCP ports · 0a6ff58e

由 Trond Myklebust 提交于 8月 24, 2021

If we're not required to reuse the TCP port, then we can just
immediately close the socket, and leave the cleanup details to the TCP
layer.

Fixes: e6237b6f ("NFSv4.1: Don't rebind to the same source port when reconnecting to the server")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

0a6ff58e

26 8月, 2021 1 次提交

SUNRPC: Fix XPT_BUSY flag leakage in svc_handle_xprt()... · 062b829c

由 Trond Myklebust 提交于 8月 25, 2021

If the attempt to reserve a slot fails, we currently leak the XPT_BUSY
flag on the socket. Among other things, this make it impossible to close
the socket.

Fixes: 82011c80 ("SUNRPC: Move svc_xprt_received() call sites")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>

062b829c

21 8月, 2021 3 次提交

SUNRPC: Server-side disconnect injection · 3a126180

由 Chuck Lever 提交于 8月 03, 2021

Disconnect injection stress-tests the ability for both client and
server implementations to behave resiliently in the face of network
instability.

A file called /sys/kernel/debug/fail_sunrpc/ignore-server-disconnect
enables administrators to turn off server-side disconnect injection
while allowing other types of sunrpc errors to be injected. The
default setting is that server-side disconnect injection is enabled
(ignore=false).
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>

3a126180

SUNRPC: Move client-side disconnect injection · a4ae3081

由 Chuck Lever 提交于 8月 05, 2021

Disconnect injection stress-tests the ability for both client and
server implementations to behave resiliently in the face of network
instability.

Convert the existing client-side disconnect injection infrastructure
to use the kernel's generic error injection facility. The generic
facility has a richer set of injection criteria.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>

a4ae3081

SUNRPC: Add a /sys/kernel/debug/fail_sunrpc/ directory · c782af25

由 Chuck Lever 提交于 8月 03, 2021

This directory will contain a set of administrative controls for
enabling error injection for kernel RPC consumers.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>

c782af25

19 8月, 2021 1 次提交

svcrdma: xpt_bc_xprt is already clear in __svc_rdma_free() · 729580dd

由 Chuck Lever 提交于 8月 18, 2021

svc_xprt_free() already "puts" the bc_xprt before calling the
transport's "free" method. No need to do it twice.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>

729580dd

17 8月, 2021 6 次提交

rpc: fix gss_svc_init cleanup on failure · 5a475344

由 J. Bruce Fields 提交于 8月 12, 2021

The failure case here should be rare, but it's obviously wrong.
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>

5a475344

SUNRPC: Fix a NULL pointer deref in trace_svc_stats_latency() · 5c117207

由 Chuck Lever 提交于 8月 05, 2021

Some paths through svc_process() leave rqst->rq_procinfo set to
NULL, which triggers a crash if tracing happens to be enabled.

Fixes: 89ff8749 ("SUNRPC: Display RPC procedure names instead of proc numbers")
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>

5c117207

svcrdma: Convert rdma->sc_rw_ctxts to llist · 07a92d00

由 Chuck Lever 提交于 2月 08, 2021

Relieve contention on sc_rw_ctxt_lock by converting rdma->sc_rw_ctxts
to an llist.

The goal is to reduce the average overhead of Send completions,
because a transport's completion handlers are single-threaded on
one CPU core. This change reduces CPU utilization of each Send
completion by 2-3% on my server.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Reviewed-By: NTom Talpey <tom@talpey.com>

07a92d00

svcrdma: Relieve contention on sc_send_lock. · b6c2bfea

由 Chuck Lever 提交于 2月 09, 2021

/proc/lock_stat indicates the the sc_send_lock is heavily
contended when the server is under load from a single client.

To address this, convert the send_ctxt free list to an llist.
Returning an item to the send_ctxt cache is now waitless, which
reduces the instruction path length in the single-threaded Send
handler (svc_rdma_wc_send).

The goal is to enable the ib_comp_wq worker to handle a higher
RPC/RDMA Send completion rate given the same CPU resources. This
change reduces CPU utilization of Send completion by 2-3% on my
server.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Reviewed-By: NTom Talpey <tom@talpey.com>

b6c2bfea

svcrdma: Fewer calls to wake_up() in Send completion handler · 6c8c84f5

由 Chuck Lever 提交于 7月 07, 2021

Because wake_up() takes an IRQ-safe lock, it can be expensive,
especially to call inside of a single-threaded completion handler.
What's more, the Send wait queue almost never has waiters, so
most of the time, this is an expensive no-op.

As always, the goal is to reduce the average overhead of each
completion, because a transport's completion handlers are single-
threaded on one CPU core. This change reduces CPU utilization of
the Send completion thread by 2-3% on my server.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Reviewed-By: NTom Talpey <tom@talpey.com>

6c8c84f5

SUNRPC: Add svc_rqst_replace_page() API · 2f0f88f4

由 Chuck Lever 提交于 7月 01, 2021

Replacing a page in rq_pages[] requires a get_page(), which is a
bus-locked operation, and a put_page(), which can be even more
costly.

To reduce the cost of replacing a page in rq_pages[], batch the
put_page() operations by collecting "freed" pages in a pagevec,
and then release those pages when the pagevec is full. This
pagevec is also emptied when each RPC completes.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>

2f0f88f4

16 8月, 2021 1 次提交

params: lift param_set_uint_minmax to common code · 2a14c9ae

由 Sagi Grimberg 提交于 6月 16, 2021

It is a useful helper hence move it to common code so others can enjoy
it.
Suggested-by: NChristoph Hellwig <hch@lst.de>
Reviewed-by: NChaitanya Kulkarni <chaitanya.kulkarni@wdc.com>
Reviewed-by: NHannes Reinecke <hare@suse.com>
Signed-off-by: NSagi Grimberg <sagi@grimberg.me>
Signed-off-by: NChristoph Hellwig <hch@lst.de>

2a14c9ae

11 8月, 2021 3 次提交

SUNRPC: Eliminate the RQ_AUTHERR flag · 9082e1d9

由 Chuck Lever 提交于 7月 15, 2021

Now that there is an alternate method for returning an auth_stat
value, replace the RQ_AUTHERR flag with use of that new method.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

9082e1d9

SUNRPC: Set rq_auth_stat in the pg_authenticate() callout · 5c2465df

由 Chuck Lever 提交于 7月 15, 2021

In a few moments, rq_auth_stat will need to be explicitly set to
rpc_auth_ok before execution gets to the dispatcher.

svc_authenticate() already sets it, but it often gets reset to
rpc_autherr_badcred right after that call, even when authentication
is successful. Let's ensure that the pg_authenticate callout and
svc_set_client() set it properly in every case.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

5c2465df

SUNRPC: Add svc_rqst::rq_auth_stat · 438623a0

由 Chuck Lever 提交于 7月 15, 2021

I'd like to take commit 4532608d ("SUNRPC: Clean up generic
dispatcher code") even further by using only private local SVC
dispatchers for all kernel RPC services. This change would enable
the removal of the logic that switches between
svc_generic_dispatch() and a service's private dispatcher, and
simplify the invocation of the service's pc_release method
so that humans can visually verify that it is always invoked
properly.

All that will come later.

First, let's provide a better way to return authentication errors
from SVC dispatcher functions. Instead of overloading the dispatch
method's *statp argument, add a field to struct svc_rqst that can
hold an error value.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

438623a0

10 8月, 2021 14 次提交

SUNRPC: Add dst_port to the sysfs xprt info file · 69f2cd6d

由 Anna Schumaker 提交于 7月 29, 2021

This is most likely going to be 2049 for NFS, but some servers might be
configured to export on a non-standard port. Let's show this information
just in case somebody needs it.
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

69f2cd6d

SUNRPC: Add srcaddr as a file in sysfs · e44773da

由 Anna Schumaker 提交于 7月 29, 2021

I don't support changing it right now, but it could be useful
information for clients with multiple network cards.
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

e44773da

sunrpc: Fix return value of get_srcport() · 5d46dd04

由 Anna Schumaker 提交于 7月 20, 2021

Since bc1c56e9 transport->srcport may by unset, causing
get_srcport() to return 0 when called. Fix this by querying the port
from the underlying socket instead of the transport.

Fixes: bc1c56e9 (SUNRPC: prevent port reuse on transports which don't request it)
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

5d46dd04

SUNRPC/xprtrdma: Fix reconnection locking · f99fa508

由 Trond Myklebust 提交于 7月 26, 2021

The xprtrdma client code currently relies on the task that initiated the
connect to hold the XPRT_LOCK for the duration of the connection
attempt. If the task is woken early, due to some other event, then that
lock could get released early.
Avoid races by using the same mechanism that the socket code uses of
transferring lock ownership to the RDMA connect worker itself. That
frees us to call rpcrdma_xprt_disconnect() directly since we're now
guaranteed exclusion w.r.t. other callers.

Fixes: 4cf44be6 ("xprtrdma: Fix recursion into rpcrdma_xprt_disconnect()")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

f99fa508

SUNRPC: Clean up scheduling of autoclose · e26d9972

由 Trond Myklebust 提交于 7月 26, 2021

Consolidate duplicated code in xprt_force_disconnect() and
xprt_conditional_disconnect().
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

e26d9972

SUNRPC: Fix potential memory corruption · c2dc3e5f

由 Trond Myklebust 提交于 7月 26, 2021

We really should not call rpc_wake_up_queued_task_set_status() with
xprt->snd_task as an argument unless we are certain that is actually an
rpc_task.

Fixes: 0445f92c ("SUNRPC: Fix disconnection races")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

c2dc3e5f

SUNRPC: Convert rpc_client refcount to use refcount_t · 71d3d0eb

由 Trond Myklebust 提交于 7月 26, 2021

There are now tools in the refcount library that allow us to convert the
client shutdown code.
Reported-by: NXiyu Yang <xiyuyang19@fudan.edu.cn>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

71d3d0eb

xprtrdma: Eliminate rpcrdma_post_sends() · 8d863b1f

由 Chuck Lever 提交于 8月 02, 2021

Clean up.

Now that there is only one registration mode, there is only one
target "post_send" method: frwr_send(). rpcrdma_post_sends() no
longer adds much value, especially since all of its call sites
ignore the return code value except to check if it's non-zero.

Just have them call frwr_send() directly instead.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

8d863b1f

xprtrdma: Add an xprtrdma_post_send_err tracepoint · d9ae8134

由 Chuck Lever 提交于 8月 02, 2021

Unlike xprtrdma_post_send(), this one can be left enabled all the
time, and should almost never fire. But we do want to know about
immediate errors when they happen.

Note that there is already a similar post_linv_err tracepoint.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

d9ae8134

xprtrdma: Add xprtrdma_post_recvs_err() tracepoint · 683f31c3

由 Chuck Lever 提交于 8月 02, 2021

In the vast majority of cases, rc=0. Don't record that in the
post_recvs tracepoint. Instead, add a separate tracepoint that can
be left enabled all the time to capture the very rare immediate
errors returned by ib_post_recv().
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

683f31c3

xprtrdma: Put rpcrdma_reps before waking the tear-down completion · 97480cae

由 Chuck Lever 提交于 8月 02, 2021

Ensure the tear-down completion is awoken only /after/ we've stopped
fiddling with rpcrdma_rep objects in rpcrdma_post_recvs().

Fixes: 15788d1d ("xprtrdma: Do not refresh Receive Queue while it is draining")
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

97480cae

xprtrdma: Disconnect after an ib_post_send() immediate error · 1143129e

由 Chuck Lever 提交于 8月 02, 2021

ib_post_send() does not disconnect the QP when it returns an
immediate error. Thus, the code that posts LocalInv has to
explicitly disconnect after an immediate error. This is just
like the frwr_send() callers handle it.

If a disconnect isn't done here, the transport deadlocks.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

1143129e

SUNRPC: Unset RPC_TASK_NO_RETRANS_TIMEOUT for NULL RPCs · 823c73d0

由 Chuck Lever 提交于 7月 19, 2021

In some rare failure modes, the server is actually reading the
transport, but then just dropping the requests on the floor.
TCP_USER_TIMEOUT cannot detect that case.

Prevent such a stuck server from pinning client resources
indefinitely by ensuring that certain idempotent requests
(such as NULL) can time out even if the connection is still
operational.

Otherwise rpc_bind_new_program(), gss_destroy_cred(), or
rpc_clnt_test_and_add_xprt() can wait forever.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

823c73d0

SUNRPC: Refactor rpc_ping() · aede5172

由 Chuck Lever 提交于 7月 19, 2021

Make it use the rpc_null_call_helper() so that it can share the
new rpc_call_ops structure to be introduced in the next patch.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

aede5172

09 7月, 2021 3 次提交

sunrpc: remove an offlined xprt using sysfs · 6f081693

由 Olga Kornievskaia 提交于 6月 23, 2021

Once a transport has been put offline, this transport can be also
removed from the list of transports. Any tasks that have been stuck
on this transport would find the next available active transport
and be re-tried. This transport would be removed from the xprt_switch
list and freed.
Signed-off-by: NOlga Kornievskaia <kolga@netapp.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

6f081693

sunrpc: provide showing transport's state info in the sysfs directory · 681d5699

由 Olga Kornievskaia 提交于 6月 08, 2021

In preparation of being able to change the xprt's state, add a way
to show currect state of the transport.
Signed-off-by: NOlga Kornievskaia <kolga@netapp.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

681d5699

sunrpc: display xprt's queuelen of assigned tasks via sysfs · 6a284059

由 Olga Kornievskaia 提交于 6月 23, 2021

Once a task grabs a trasnport it's reflected in the queuelen of
the rpc_xprt structure. Add display of that value in the xprt's
info file in sysfs.
Signed-off-by: NOlga Kornievskaia <kolga@netapp.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

6a284059

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功