提交 · 331bc71cb1751d78f6807ad8e6162b07c67cdd1b · openeuler / Kernel

24 10月, 2018 4 次提交

T
SUNRPC: Convert the auth cred cache to use refcount_t · 331bc71c
由 Trond Myklebust 提交于 10月 14, 2018
```
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
```
331bc71c
T
SUNRPC: Convert auth creds to use refcount_t · 79b18181
由 Trond Myklebust 提交于 10月 14, 2018
```
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
```
79b18181

由 Trond Myklebust 提交于 10月 12, 2018

We no longer need to worry about whether or not the entry is hashed in
order to figure out if the contents are valid. We only care whether or
not the refcount is non-zero.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

07d02a67

T
SUNRPC: Clean up the AUTH cache code · 95cd6232
由 Trond Myklebust 提交于 10月 11, 2018
```
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
```
95cd6232

19 10月, 2018 2 次提交

sunrpc: safely reallow resvport min/max inversion · 826799e6

由 J. Bruce Fields 提交于 10月 18, 2018

Commits ffb6ca33 and e08ea3a9 prevent setting xprt_min_resvport
greater than xprt_max_resvport, but may also break simple code that sets
one parameter then the other, if the new range does not overlap the old.

Also it looks racy to me, unless there's some serialization I'm not
seeing.  Granted it would probably require malicious privileged processes
(unless there's a chance these might eventually be settable in unprivileged
containers), but still it seems better not to let userspace panic the
kernel.

Simpler seems to be to allow setting the parameters to whatever you want
but interpret xprt_min_resvport > xprt_max_resvport as the empty range.

Fixes: ffb6ca33 "sunrpc: Prevent resvport min/max inversion..."
Fixes: e08ea3a9 "sunrpc: Prevent rexvport min/max inversion..."
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

826799e6

T
SUNRPC: Fix a compile warning for cmpxchg64() · e732f448
由 Trond Myklebust 提交于 10月 18, 2018
```
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
```
e732f448

05 10月, 2018 1 次提交

SUNRPC: use cmpxchg64() in gss_seq_send64_fetch_and_inc() · 21924765

由 Arnd Bergmann 提交于 10月 02, 2018

The newly introduced gss_seq_send64_fetch_and_inc() fails to build on
32-bit architectures:

net/sunrpc/auth_gss/gss_krb5_seal.c:144:14: note: in expansion of macro 'cmpxchg'
   seq_send = cmpxchg(&ctx->seq_send64, old, old + 1);
              ^~~~~~~
arch/x86/include/asm/cmpxchg.h:128:3: error: call to '__cmpxchg_wrong_size' declared with attribute error: Bad argument size for cmpxchg
   __cmpxchg_wrong_size();     \

As the message tells us, cmpxchg() cannot be used on 64-bit arguments,
that's what cmpxchg64() does.

Fixes: 571ed1fd ("SUNRPC: Replace krb5_seq_lock with a lockless scheme")
Signed-off-by: NArnd Bergmann <arnd@arndb.de>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

21924765

04 10月, 2018 4 次提交

xprtrdma: Clean up xprt_rdma_disconnect_inject · ad091180

由 Chuck Lever 提交于 10月 01, 2018

Clean up: Use the appropriate C macro instead of open-coding
container_of() .
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

ad091180

xprtrdma: Add documenting comments · f26c32fa

由 Chuck Lever 提交于 10月 01, 2018

Clean up: fill in or update documenting comments for transport
switch entry points.

For xprt_rdma_allocate:

The first paragraph is no longer true since commit 5a6d1db4
("SUNRPC: Add a transport-specific private field in rpc_rqst").

The second paragraph is no longer true since commit 54cbd6b0
("xprtrdma: Delay DMA mapping Send and Receive buffers").
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

f26c32fa

xprtrdma: Report when there were zero posted Receives · 61c208a5

由 Chuck Lever 提交于 10月 01, 2018

To show that a caller did attempt to allocate and post more Receive
buffers, the trace point in rpcrdma_post_recvs() should report when
rpcrdma_post_recvs() was invoked but no new Receive buffers were
posted.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

61c208a5

xprtrdma: Move rb_flags initialization · 512ccfb6

由 Chuck Lever 提交于 10月 01, 2018

Clean up: rb_flags might be used for other things besides
RPCRDMA_BUF_F_EMPTY_SCQ, so initialize it in a generic spot
instead of in a send-completion-queue-related helper.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

512ccfb6

03 10月, 2018 15 次提交

xprtrdma: Don't disable BH's in backchannel server · f7d46681

由 Chuck Lever 提交于 10月 01, 2018

Clean up: This code was copied from xprtsock.c and
backchannel_rqst.c. For rpcrdma, the backchannel server runs
exclusively in process context, thus disabling bottom-halves is
unnecessary.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

f7d46681

xprtrdma: Remove memory address of "ep" from an error message · 83e301dd

由 Chuck Lever 提交于 10月 01, 2018

Clean up: Replace the hashed memory address of the target rpcrdma_ep
with the server's IP address and port. The server address is more
useful in an administrative error message.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

83e301dd

xprtrdma: Rename rpcrdma_qp_async_error_upcall · f9521d53

由 Chuck Lever 提交于 10月 01, 2018

Clean up: Use a function name that is consistent with the RDMA core
API and with other consumers. Because this is a function that is
invoked from outside the rpcrdma.ko module, add an appropriate
documenting comment.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

f9521d53

xprtrdma: Simplify RPC wake-ups on connect · 31e62d25

由 Chuck Lever 提交于 10月 01, 2018

Currently, when a connection is established, rpcrdma_conn_upcall
invokes rpcrdma_conn_func and then
wake_up_all(&ep->rep_connect_wait). The former wakes waiting RPCs,
but the connect worker is not done yet, and that leads to races,
double wakes, and difficulty understanding how this logic is
supposed to work.

Instead, collect all the "connection established" logic in the
connect worker (xprt_rdma_connect_worker). A disconnect worker is
retained to handle provider upcalls safely.

Fixes: 254f91e2 ("xprtrdma: RPC/RDMA must invoke ... ")
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

31e62d25

xprtrdma: Re-organize the switch() in rpcrdma_conn_upcall · 316a616e

由 Chuck Lever 提交于 10月 01, 2018

Clean up: Eliminate the FALLTHROUGH into the default arm to make the
switch easier to understand.

Also, as long as I'm here, do not display the memory address of the
target rpcrdma_ep. A hashed memory address is of marginal use here.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

316a616e

xprtrdma: Eliminate "connstate" variable from rpcrdma_conn_upcall() · aadc5a94

由 Chuck Lever 提交于 10月 01, 2018

Clean up.

Since commit 173b8f49 ("xprtrdma: Demote "connect" log messages")
there has been no need to initialize connstat to zero. In fact, in
this code path there's now no reason not to set rep_connected
directly.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

aadc5a94

xprtrdma: Conventional variable names in rpcrdma_conn_upcall · ed97f1f7

由 Chuck Lever 提交于 10月 01, 2018

Clean up: The convention throughout other parts of xprtrdma is to
name variables of type struct rpcrdma_xprt "r_xprt", not "xprt".
This convention enables the use of the name "xprt" for a "struct
rpc_xprt" type variable, as in other parts of the RPC client.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

ed97f1f7

xprtrdma: Rename rpcrdma_conn_upcall · ae38288e

由 Chuck Lever 提交于 10月 01, 2018

Clean up: Use a function name that is consistent with the RDMA core
API and with other consumers. Because this is a function that is
invoked from outside the rpcrdma.ko module, add an appropriate
documenting comment.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

ae38288e

sunrpc: Report connect_time in seconds · 8440a886

由 Chuck Lever 提交于 10月 01, 2018

The way connection-oriented transports report connect_time is wrong:
it's supposed to be in seconds, not in jiffies.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

8440a886

sunrpc: Fix connect metrics · 3968a8a5

由 Chuck Lever 提交于 10月 01, 2018

For TCP, the logic in xprt_connect_status is currently never invoked
to record a successful connection. Commit 2a491991 ("SUNRPC:
Return EAGAIN instead of ENOTCONN when waking up xprt->pending")
changed the way TCP xprt's are awoken after a connect succeeds.

Instead, change connection-oriented transports to bump connect_count
and compute connect_time the moment that XPRT_CONNECTED is set.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

3968a8a5

xprtrdma: Name MR trace events consistently · d379eaa8

由 Chuck Lever 提交于 10月 01, 2018

Clean up the names of trace events related to MRs so that it's
easy to enable these with a glob.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

d379eaa8

xprtrdma: Explicitly resetting MRs is no longer necessary · 61da886b

由 Chuck Lever 提交于 10月 01, 2018

When a memory operation fails, the MR's driver state might not match
its hardware state. The only reliable recourse is to dereg the MR.
This is done in ->ro_recover_mr, which then attempts to allocate a
fresh MR to replace the released MR.

Since commit e2ac236c ("xprtrdma: Allocate MRs on demand"),
xprtrdma dynamically allocates MRs. It can add more MRs whenever
they are needed.

That makes it possible to simply release an MR when a memory
operation fails, instead of "recovering" it. It will automatically
be replaced by the on-demand MR allocator.

This commit is a little larger than I wanted, but it replaces
->ro_recover_mr, rb_recovery_lock, rb_recovery_worker, and the
rb_stale_mrs list with a generic work queue.

Since MRs are no longer orphaned, the mrs_orphaned metric is no
longer used.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

61da886b

xprtrdma: Create more MRs at a time · c421ece6

由 Chuck Lever 提交于 10月 01, 2018

Some devices require more than 3 MRs to build a single 1MB I/O.
Ensure that rpcrdma_mrs_create() will add enough MRs to build that
I/O.

In a subsequent patch I'm changing the MR recovery logic to just
toss out the MRs. In that case it's possible for ->send_request to
loop acquiring some MRs, not getting enough, getting called again,
recycling the previous MRs, then not getting enough, lather rinse
repeat. Thus first we need to ensure enough MRs are created to
prevent that loop.

I'm "reusing" ia->ri_max_segs. All of its accessors seem to want the
maximum number of data segments plus two, so I'm going to bake that
into the initial calculation.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

c421ece6

xprtrdma: Reset credit grant properly after a disconnect · ef739b21

由 Chuck Lever 提交于 10月 01, 2018

On a fresh connection, an RPC/RDMA client is supposed to send only
one RPC Call until it gets a credit grant in the first RPC Reply
from the server [RFC 8166, Section 3.3.3].

There is a bug in the Linux client's credit accounting mechanism
introduced by commit e7ce710a ("xprtrdma: Avoid deadlock when
credit window is reset"). On connect, it simply dumps all pending
RPC Calls onto the new connection.

Servers have been tolerant of this bad behavior. Currently no server
implementation ever changes its credit grant over reconnects, and
servers always repost enough Receives before connections are fully
established.

To correct this issue, ensure that the client resets both the credit
grant _and_ the congestion window when handling a reconnect.

Fixes: e7ce710a ("xprtrdma: Avoid deadlock when credit ... ")
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Cc: stable@kernel.org
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

ef739b21

xprtrdma: xprt_release_rqst_cong is called outside of transport_lock · 91ca1866

由 Chuck Lever 提交于 10月 01, 2018

Since commit ce7c252a ("SUNRPC: Add a separate spinlock to
protect the RPC request receive list") the RPC/RDMA reply handler
has been calling xprt_release_rqst_cong without holding
xprt->transport_lock.

I think the only way this call is ever made is if the credit grant
increases and there are RPCs pending. Current server implementations
do not change their credit grant during operation (except at
connect time).

Commit e7ce710a ("xprtrdma: Avoid deadlock when credit window is
reset") added the ->release_rqst call because UDP invokes
xprt_adjust_cwnd(), which calls __xprt_put_cong() after adjusting
xprt->cwnd. Both xprt_release() and ->xprt_release_xprt already wake
another task in this case, so it is safe to remove this call from
the reply handler.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

91ca1866

01 10月, 2018 14 次提交

T
SUNRPC: Replace krb5_seq_lock with a lockless scheme · 571ed1fd
由 Trond Myklebust 提交于 9月 29, 2018
```
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
```
571ed1fd

SUNRPC: Lockless lookup of RPCSEC_GSS mechanisms · 0c1c19f4

由 Trond Myklebust 提交于 9月 29, 2018

Use RCU protected lookups for discovering the supported mechanisms.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

0c1c19f4

SUNRPC: Remove rpc_authflavor_lock in favour of RCU locking · 4e4c3bef

由 Trond Myklebust 提交于 9月 27, 2018

Module removal is RCU safe by design, so we really have no need to
lock the auth_flavors[] array. Substitute a lockless scheme to
add/remove entries in the array, and then use rcu.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

4e4c3bef

SUNRPC: Unexport xdr_partial_copy_from_skb() · ec846469

由 Trond Myklebust 提交于 9月 14, 2018

It is no longer used outside of net/sunrpc/socklib.c
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

ec846469

T
SUNRPC: Clean up xs_udp_data_receive() · 4f546149
由 Trond Myklebust 提交于 9月 14, 2018
```
Simplify the retry logic.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
```
4f546149
T
SUNRPC: Allow AF_LOCAL sockets to use the generic stream receive · 550aebfe
由 Trond Myklebust 提交于 9月 14, 2018
```
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
```
550aebfe
T
SUNRPC: Clean up - rename xs_tcp_data_receive() to xs_stream_data_receive() · c50b8ee0
由 Trond Myklebust 提交于 9月 14, 2018
```
In preparation for sharing with AF_LOCAL.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
```
c50b8ee0

SUNRPC: Simplify TCP receive code by switching to using iterators · 277e4ab7

由 Trond Myklebust 提交于 9月 14, 2018

Most of this code should also be reusable with other socket types.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

277e4ab7

SUNRPC: Add a bvec array to struct xdr_buf for use with iovec_iter() · 9d96acbc

由 Trond Myklebust 提交于 9月 13, 2018

Add a bvec array to struct xdr_buf, and have the client allocate it
when we need to receive data into pages.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

9d96acbc

SUNRPC: Add a label for RPC calls that require allocation on receive · 431f6eb3

由 Trond Myklebust 提交于 9月 16, 2018

If the RPC call relies on the receive call allocating pages as buffers,
then let's label it so that we
a) Don't leak memory by allocating pages for requests that do not expect
   this behaviour
b) Can optimise for the common case where calls do not require allocation.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

431f6eb3

SUNRPC: Convert the xprt->sending queue back to an ordinary wait queue · 79c99152

由 Trond Myklebust 提交于 9月 09, 2018

We no longer need priority semantics on the xprt->sending queue, because
the order in which tasks are sent is now dictated by their position in
the send queue.
Note that the backlog queue remains a priority queue, meaning that
slot resources are still managed in order of task priority.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

79c99152

SUNRPC: Fix priority queue fairness · f42f7c28

由 Trond Myklebust 提交于 9月 08, 2018

Fix up the priority queue to not batch by owner, but by queue, so that
we allow '1 << priority' elements to be dequeued before switching to
the next priority queue.
The owner field is still used to wake up requests in round robin order
by owner to avoid single processes hogging the RPC layer by loading the
queues.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

f42f7c28

SUNRPC: Convert xprt receive queue to use an rbtree · 95f7691d

由 Trond Myklebust 提交于 9月 07, 2018

If the server is slow, we can find ourselves with quite a lot of entries
on the receive queue. Converting the search from an O(n) to O(log(n))
can make a significant difference, particularly since we have to hold
a number of locks while searching.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

95f7691d

T
SUNRPC: Don't take transport->lock unnecessarily when taking XPRT_LOCK · bd79bc57
由 Trond Myklebust 提交于 9月 07, 2018
```
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
```
bd79bc57

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功