Commits · 5fe6eaa1f9a00b9a5927e3b791ecad2f3eaab130 · openeuler / Kernel

20 Sep, 2016 8 commits

SUNRPC: Generalize the RPC buffer allocation API · 5fe6eaa1

Authored by Chuck Lever on Sep 15, 2016

xprtrdma needs to allocate the Call and Reply buffers separately.
TBH, the reliance on using a single buffer for the pair of XDR
buffers is transport implementation-specific.

Transports that want to allocate separate Call and Reply buffers
will ignore the "size" argument anyway.  Don't bother passing it.

The buf_alloc method can't return two pointers. Instead, make the
method's return value an error code, and set the rq_buffer pointer
in the method itself.

This gives call_allocate an opportunity to terminate an RPC instead
of looping forever when a permanent problem occurs. If a request is
just bogus, or the transport is in a state where it can't allocate
resources for any request, there needs to be a way to kill the RPC
right there and not loop.

This immediately fixes a rare problem in the backchannel send path,
which loops if the server happens to send a CB request whose
call+reply size is larger than a page (which it shouldn't do yet).

One more issue: looks like xprt_inject_disconnect was incorrectly
placed in the failure path in call_allocate. It needs to be in the
success path, as it is for other call-sites.
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

5fe6eaa1
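
The calling convention described in commit 5fe6eaa1 can be illustrated with a small userspace sketch. This is not the kernel code: `demo_rqst`, `demo_buf_alloc`, `demo_call_allocate`, `DEMO_MAX_BUF`, and the choice of error codes are all assumptions made for illustration. The point is only that an allocator which returns an error code and stores the buffer pointer itself lets the caller tell a retryable failure from a permanent one.

```c
#include <errno.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical stand-in for struct rpc_rqst; only the fields used here. */
struct demo_rqst {
	size_t callsize;  /* bytes needed for the Call message */
	size_t rcvsize;   /* bytes needed for the Reply message */
	void *buffer;     /* set by the allocator on success */
};

#define DEMO_MAX_BUF 4096  /* assumed per-request limit (one page) */

/*
 * Returns 0 on success, -EINVAL for a request that can never be
 * satisfied, and -ENOMEM for a failure that may succeed on retry.
 * The buffer pointer is stored in the request, not returned.
 */
static int demo_buf_alloc(struct demo_rqst *rqst)
{
	size_t size = rqst->callsize + rqst->rcvsize;

	if (size == 0 || size > DEMO_MAX_BUF)
		return -EINVAL;
	rqst->buffer = malloc(size);
	if (rqst->buffer == NULL)
		return -ENOMEM;
	return 0;
}

enum demo_action { DEMO_PROCEED, DEMO_RETRY, DEMO_TERMINATE };

/* The caller decides between proceeding, retrying, and terminating. */
static enum demo_action demo_call_allocate(struct demo_rqst *rqst)
{
	int status = demo_buf_alloc(rqst);

	if (status == 0)
		return DEMO_PROCEED;
	if (status == -ENOMEM)
		return DEMO_RETRY;      /* transient: try again later */
	return DEMO_TERMINATE;          /* permanent: do not loop */
}
```

With a pointer-returning allocator, both failure cases would look like NULL and the caller could only retry, which is the looping behaviour the commit message describes.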

SUNRPC: Refactor rpc_xdr_buf_init() · b9c5bc03

Authored by Chuck Lever on Sep 15, 2016

Clean up: there is some XDR initialization logic that is common
to the forward channel and backchannel. Move it to an XDR header
so it can be shared.

rpc_rqst::rq_buffer points to a buffer containing big-endian data.
Update its annotation as part of the clean up.
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

b9c5bc03

SUNRPC: rpc_clnt_add_xprt setup function for NFS layer · fda0ab41

Authored by Andy Adamson on Sep 09, 2016

Use a setup function to call into the NFS layer to test an rpc_xprt
for session trunking so as to not leak the rpc_xprt_switch into
the nfs layer.

Search for the address in the rpc_xprt_switch first so as not to
put an unnecessary EXCHANGE_ID on the wire.
Signed-off-by: Andy Adamson <andros@netapp.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

fda0ab41

SUNRPC search xprt switch for sockaddr · 39e5d2df

Authored by Andy Adamson on Sep 09, 2016

Signed-off-by: Andy Adamson <andros@netapp.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

39e5d2df

SUNRPC rpc_clnt_xprt_switch_add_xprt · dd691717

Authored by Andy Adamson on Sep 09, 2016

Give the NFS layer access to the rpc_xprt_switch_add_xprt function.
Signed-off-by: Andy Adamson <andros@netapp.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

dd691717

SUNRPC rpc_clnt_xprt_switch_put · 3b58a8a9

Authored by Andy Adamson on Sep 09, 2016

Give the NFS layer access to the xprt_switch_put function.
Signed-off-by: Andy Adamson <andros@netapp.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

3b58a8a9

SUNRPC remove rpc_task_release_client from rpc_task_set_client · 7705f6ab

Authored by Andy Adamson on Sep 09, 2016

rpc_task_set_client is only called from rpc_run_task after
rpc_new_task and rpc_task_release_client is not needed as the
task is new.

When called from rpc_new_task, rpc_task_set_client also removed the
assigned rpc_xprt which is not desired.
Signed-off-by: Andy Adamson <andros@netapp.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

7705f6ab

sunrpc: Remove unnecessary variable · 2813b626

Authored by Amitoj Kaur Chawla on Aug 08, 2016

The variable `err` is not used anywhere and just returns the
predefined value `0` at the end of the function. Hence, remove the
variable and return 0 explicitly.
Signed-off-by: Amitoj Kaur Chawla <amitoj1606@gmail.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

2813b626

25 Aug, 2016 1 commit

SUNRPC: Silence WARN_ON when NFSv4.1 over RDMA is in use · 16590a22

Authored by Chuck Lever on Aug 22, 2016

Using NFSv4.1 on RDMA should be safe, so broaden the new checks in
rpc_create().

WARN_ON_ONCE is used, matching most other WARN call sites in clnt.c.

Fixes: 39a9beab ("rpc: share one xps between all backchannels")
Fixes: d50039ea ("nfsd4/rpc: move backchannel create logic...")
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
Reviewed-by: J. Bruce Fields <bfields@fieldses.org>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

16590a22

06 Aug, 2016 2 commits

NFSv4: Cap the transport reconnection timer at 1/2 lease period · 8d480326

Authored by Trond Myklebust on Aug 05, 2016

We don't want to miss a lease period renewal due to the TCP connection
failing to reconnect in a timely fashion. To ensure this doesn't happen,
cap the reconnection timer so that we retry the connection attempt
at least every 1/2 lease period.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

8d480326
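
The capping rule in commit 8d480326 amounts to taking the smaller of the current backoff and half the lease period. A minimal sketch follows, assuming all times are in seconds; `demo_cap_reconnect_timeout` and `demo_next_backoff` are hypothetical helpers, not the functions changed by the commit.

```c
/* Hypothetical helper: clamp a reconnect delay to half the lease period. */
static unsigned long demo_cap_reconnect_timeout(unsigned long backoff,
						unsigned long lease_period)
{
	unsigned long cap = lease_period / 2;

	return backoff < cap ? backoff : cap;
}

/* Exponential backoff step that never exceeds half the lease period. */
static unsigned long demo_next_backoff(unsigned long backoff,
				       unsigned long lease_period)
{
	return demo_cap_reconnect_timeout(backoff * 2, lease_period);
}
```

With a 90-second lease, a backoff that has grown to 300 seconds is clamped to 45 seconds, so at least one reconnection attempt happens in each half of the lease period.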

SUNRPC: Limit the reconnect backoff timer to the max RPC message timeout · 3851f1cd

Authored by Trond Myklebust on Aug 04, 2016

...and ensure that we propagate it to new transports on the same
client.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

3851f1cd

25 Jul, 2016 1 commit

SUNRPC: Fix a compiler warning in fs/nfs/clnt.c · ce272302

Authored by Trond Myklebust on Jul 24, 2016

Fix the report:

net/sunrpc/clnt.c:2580:1: warning: ‘static’ is not at beginning of declaration [-Wold-style-declaration]
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

ce272302

15 Jun, 2016 3 commits

rpc: share one xps between all backchannels · 39a9beab

Authored by J. Bruce Fields on May 17, 2016

The spec allows backchannels for multiple clients to share the same tcp
connection.  When that happens, we need to use the same xprt for all of
them.  Similarly, we need the same xps.

This fixes list corruption introduced by the multipath code.

Cc: stable@vger.kernel.org
Signed-off-by: J. Bruce Fields <bfields@redhat.com>
Acked-by: Trond Myklebust <trondmy@primarydata.com>

39a9beab

nfsd4/rpc: move backchannel create logic into rpc code · d50039ea

Authored by J. Bruce Fields on May 16, 2016

Also simplify the logic a bit.

Cc: stable@vger.kernel.org
Signed-off-by: J. Bruce Fields <bfields@redhat.com>
Acked-by: Trond Myklebust <trondmy@primarydata.com>

d50039ea

SUNRPC: fix xprt leak on xps allocation failure · 1208fd56

Authored by J. Bruce Fields on May 20, 2016

Callers of rpc_create_xprt expect it to put the xprt on success and
failure.

Cc: stable@vger.kernel.org
Signed-off-by: J. Bruce Fields <bfields@redhat.com>
Acked-by: Trond Myklebust <trondmy@primarydata.com>

1208fd56

18 May, 2016 1 commit

sunrpc: Advertise maximum backchannel payload size · 6b26cc8c

Authored by Chuck Lever on May 02, 2016

RPC-over-RDMA transports have a limit on how large a backward
direction (backchannel) RPC message can be. Ensure that the NFSv4.x
CREATE_SESSION operation advertises this limit to servers.
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
Tested-by: Steve Wise <swise@opengridcomputing.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

6b26cc8c

06 Feb, 2016 5 commits

SUNRPC: Allow addition of new transports to a struct rpc_clnt · 7f554890

Authored by Trond Myklebust on Jan 30, 2016

Add a function to allow creation and addition of a new transport
to an existing rpc_clnt
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

7f554890

SUNRPC: Make NFS swap work with multipath · 15001e5a

Authored by Trond Myklebust on Jan 30, 2016

Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

15001e5a

SUNRPC: Add a helper to apply a function to all the rpc_clnt's transports · 3227886c

Authored by Trond Myklebust on Jan 30, 2016

Add a helper for tasks that require us to apply a function to all the
transports in an rpc_clnt.
An example of a use case would be BIND_CONN_TO_SESSION, where we want
to send one RPC call down each transport.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

3227886c
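
The helper described in commit 3227886c follows the familiar "apply a callback to every element" pattern. The sketch below is an assumption-laden illustration: `demo_clnt`, `demo_transport`, `demo_for_each_transport`, and the stop-on-first-error rule are invented here and are not taken from the kernel sources.

```c
#include <stddef.h>

/* Hypothetical stand-ins for struct rpc_xprt and struct rpc_clnt. */
struct demo_transport {
	int id;
};

struct demo_clnt {
	struct demo_transport *transports;
	size_t ntransports;
};

typedef int (*demo_transport_fn)(struct demo_clnt *clnt,
				 struct demo_transport *xprt, void *data);

/*
 * Call fn once per transport. In this sketch, iteration stops at the
 * first non-zero return value, which is passed back to the caller.
 */
static int demo_for_each_transport(struct demo_clnt *clnt,
				   demo_transport_fn fn, void *data)
{
	for (size_t i = 0; i < clnt->ntransports; i++) {
		int ret = fn(clnt, &clnt->transports[i], data);

		if (ret != 0)
			return ret;
	}
	return 0;
}

/* Example callback: count the transports that were visited. */
static int demo_count_transport(struct demo_clnt *clnt,
				struct demo_transport *xprt, void *data)
{
	(void)clnt;
	(void)xprt;
	++*(int *)data;
	return 0;
}
```

A caller wanting to send one RPC down each transport would pass a callback that issues the call in place of `demo_count_transport`.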

SUNRPC: Use the multipath iterator to assign a transport to each task · fb43d172

Authored by Trond Myklebust on Jan 30, 2016

Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

fb43d172

SUNRPC: Make rpc_clnt store the multipath iterators · ad01b2c6

Authored by Trond Myklebust on Jan 30, 2016

This is a pre-patch for the RPC multipath code. It sets up the storage in
struct rpc_clnt for the multipath code.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

ad01b2c6

01 Feb, 2016 1 commit

SUNRPC: Remove unused function rpc_task_reset_client · 58f13692

Authored by Trond Myklebust on Jan 30, 2016

Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

58f13692

31 Dec, 2015 1 commit

SUNRPC: Fix a missing break in rpc_anyaddr() · 0b161e63

Authored by Trond Myklebust on Dec 30, 2015

The missing break means that we always return EAFNOSUPPORT when
faced with a request for an IPv6 loopback address.

Reported-by: coverity (CID 401987)
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

0b161e63
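
The class of bug fixed by commit 0b161e63 is a `switch` case that falls through into the error branch. The sketch below shows the corrected shape only; `demo_anyaddr` is a hypothetical function and the case bodies are placeholders, not the kernel's address-filling code.

```c
#include <errno.h>
#include <sys/socket.h>

/* Hypothetical helper: returns 0 for a supported family, else an error. */
static int demo_anyaddr(int family)
{
	switch (family) {
	case AF_INET:
		/* ... fill in the IPv4 address here ... */
		break;
	case AF_INET6:
		/* ... fill in the IPv6 address here ... */
		break;  /* without this break, control reaches the default case */
	default:
		return -EAFNOSUPPORT;
	}
	return 0;
}
```

If the `break` after the `AF_INET6` case were missing, every IPv6 request would fall into `default` and return `-EAFNOSUPPORT`, which matches the symptom in the commit message.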

03 Jul, 2015 1 commit

SUNRPC: Don't reencode message if transmission failed with ENOBUFS · 93aa6c7b

Authored by Trond Myklebust on Jul 03, 2015

If we're running out of buffer memory when transmitting data, then
we want to just delay for a moment, and then continue transmitting
the remainder of the message.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

93aa6c7b

20 Jun, 2015 1 commit

SUNRPC: Handle connection issues correctly on the back channel · 3832591e

Authored by Trond Myklebust on Jun 19, 2015

If the back channel is disconnected, we can and should just fail the
transmission. The expectation is that the NFSv4.1 server will always
retransmit any outstanding callbacks once the connection is
re-established.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

3832591e

11 Jun, 2015 4 commits

SUNRPC: Transport fault injection · 4a068258

Authored by Chuck Lever on May 11, 2015

It has been exceptionally useful to exercise the logic that handles
local immediate errors and RDMA connection loss.  To enable
developers to test this regularly and repeatably, add logic to
simulate connection loss every so often.

Fault injection is disabled by default. It is enabled with

  $ echo xxx | sudo tee /sys/kernel/debug/sunrpc/inject_fault/disconnect

where "xxx" is a large positive number of transport method calls
before a disconnect. A value of several thousand is usually a good
number that allows reasonable forward progress while still causing a
lot of connection drops.

These hooks are disabled when SUNRPC_DEBUG is turned off.
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

4a068258

sunrpc: turn swapper_enable/disable functions into rpc_xprt_ops · d67fa4d8

Authored by Jeff Layton on Jun 03, 2015

RDMA xprts don't have a sock_xprt, but an rdma_xprt, so the
xs_swapper_enable/disable functions will likely oops when fed an RDMA
xprt. Turn these functions into rpc_xprt_ops so that that doesn't
occur. For now the RDMA versions are no-ops that just return -EINVAL
on an attempt to swapon.

Cc: Chuck Lever <chuck.lever@oracle.com>
Signed-off-by: Jeff Layton <jeff.layton@primarydata.com>
Reviewed-by: Chuck Lever <chuck.lever@oracle.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

d67fa4d8

sunrpc: make xprt->swapper an atomic_t · 8e228133

Authored by Jeff Layton on Jun 03, 2015

Split xs_swapper into enable/disable functions and eliminate the
"enable" flag.

Currently, it's racy if you have multiple swapon/swapoff operations
running in parallel over the same xprt. Also fix it so that we only
set it to a memalloc socket on a 0->1 transition and only clear it
on a 1->0 transition.

Cc: Mel Gorman <mgorman@suse.de>
Signed-off-by: Jeff Layton <jeff.layton@primarydata.com>
Reviewed-by: Chuck Lever <chuck.lever@oracle.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

8e228133
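
The 0->1 and 1->0 transition rule from commit 8e228133 can be sketched with C11 atomics in userspace. The names `demo_swap_xprt`, `demo_swap_enable`, and `demo_swap_disable` are hypothetical, and the `memalloc` flag merely stands in for the socket state change; the kernel uses its own `atomic_t` API rather than `<stdatomic.h>`.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical stand-in for the transport's swap bookkeeping. */
struct demo_swap_xprt {
	atomic_int swapper;  /* number of active swap users */
	bool memalloc;       /* stands in for the socket's memalloc state */
};

/*
 * atomic_fetch_add returns the value held before the increment, so a
 * result of 0 identifies the 0->1 transition. This sketch shows only
 * the transition test; the state change itself is not serialized here.
 */
static void demo_swap_enable(struct demo_swap_xprt *xprt)
{
	if (atomic_fetch_add(&xprt->swapper, 1) == 0)
		xprt->memalloc = true;
}

/* A previous value of 1 identifies the 1->0 transition. */
static void demo_swap_disable(struct demo_swap_xprt *xprt)
{
	if (atomic_fetch_sub(&xprt->swapper, 1) == 1)
		xprt->memalloc = false;
}
```

Because only the first enable and the last disable touch the flag, nested swapon/swapoff pairs leave the state untouched in between, which a plain boolean "enable" argument cannot express.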

sunrpc: keep a count of swapfiles associated with the rpc_clnt · 3c87ef6e

Authored by Jeff Layton on Jun 03, 2015

Jerome reported seeing a warning pop when working with a swapfile on
NFS. The nfs_swap_activate can end up calling sk_set_memalloc while
holding the rcu_read_lock and that function can sleep.

To fix that, we need to take a reference to the xprt while holding the
rcu_read_lock, set the socket up for swapping and then drop that
reference. But, xprt_put is not exported and having NFS deal with the
underlying xprt is a bit of layering violation anyway.

Fix this by adding a set of activate/deactivate functions that take a
rpc_clnt pointer instead of an rpc_xprt, and have nfs_swap_activate and
nfs_swap_deactivate call those.

Also, add a per-rpc_clnt atomic counter to keep track of the number of
active swapfiles associated with it. When the counter does a 0->1
transition, we enable swapping on the xprt, when we do a 1->0 transition
we disable swapping on it.

This also allows us to be a bit more selective with the RPC_TASK_SWAPPER
flag. If non-swapper and swapper clnts are sharing a xprt, then we only
need to flag the tasks from the swapper clnt with that flag.
Acked-by: Mel Gorman <mgorman@suse.de>
Reported-by: Jerome Marchand <jmarchan@redhat.com>
Signed-off-by: Jeff Layton <jeff.layton@primarydata.com>
Reviewed-by: Chuck Lever <chuck.lever@oracle.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

3c87ef6e

05 Jun, 2015 1 commit

SUNRPC: Remove unused argument 'tk_ops' in rpc_run_bc_task · 0f419791

Authored by Trond Myklebust on Jun 01, 2015

Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

0f419791

03 Jun, 2015 1 commit

SUNRPC: Backchannel handle socket nospace · 1193d58f

Authored by Trond Myklebust on Jun 02, 2015

If the socket was busy due to a socket nospace error, then we should
retry the send.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

1193d58f

24 Apr, 2015 1 commit

sunrpc: make debugfs file creation failure non-fatal · 3f940098

Authored by Jeff Layton on Mar 31, 2015

v2: gracefully handle the case where some dentry pointers end up NULL
    and be more diligent about zeroing out dentry pointers

We currently have a problem that SELinux policy is being enforced when
creating debugfs files. If a debugfs file is created as a side effect of
doing some syscall, then that creation can fail if the SELinux policy
for that process prevents it.

This seems wrong. We don't do that for files under /proc, for instance,
so Bruce has proposed a patch to fix that.

While discussing that patch however, Greg K.H. stated:

    "No kernel code should care / fail if a debugfs function fails, so
     please fix up the sunrpc code first."

This patch converts all of the sunrpc debugfs setup code to be void
return functions, and the callers to not look for errors from those
functions.

This should allow rpc_clnt and rpc_xprt creation to work, even if the
kernel fails to create debugfs files for some reason.

Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Acked-by: "J. Bruce Fields" <bfields@fieldses.org>
Signed-off-by: Jeff Layton <jeff.layton@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

3f940098

01 Apr, 2015 1 commit

sunrpc: make debugfs file creation failure non-fatal · f9c72d10

Authored by Jeff Layton on Mar 31, 2015

We currently have a problem that SELinux policy is being enforced when
creating debugfs files. If a debugfs file is created as a side effect of
doing some syscall, then that creation can fail if the SELinux policy
for that process prevents it.

This seems wrong. We don't do that for files under /proc, for instance,
so Bruce has proposed a patch to fix that.

While discussing that patch however, Greg K.H. stated:

    "No kernel code should care / fail if a debugfs function fails, so
     please fix up the sunrpc code first."

This patch converts all of the sunrpc debugfs setup code to be void
return functions, and the callers to not look for errors from those
functions.

This should allow rpc_clnt and rpc_xprt creation to work, even if the
kernel fails to create debugfs files for some reason.

Symptoms were failing krb5 mounts on systems using gss-proxy and
selinux.

Fixes: 388f0c77 "sunrpc: add a debugfs rpc_xprt directory..."
Cc: stable@vger.kernel.org
Signed-off-by: Jeff Layton <jeff.layton@primarydata.com>
Acked-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Signed-off-by: J. Bruce Fields <bfields@redhat.com>

f9c72d10

09 Feb, 2015 1 commit

SUNRPC: Handle EADDRINUSE on connect · 3913c78c

Authored by Trond Myklebust on Feb 08, 2015

Now that we're setting SO_REUSEPORT, we still need to handle the
case where a connect() is attempted, but the old socket is still
lingering.
Essentially, all we want to do here is handle the error by waiting
a few seconds and then retrying.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

3913c78c

04 Feb, 2015 1 commit

SUNRPC: NULL utsname dereference on NFS umount during namespace cleanup · 03a9a42a

Authored by Trond Myklebust on Jan 30, 2015

Fix an Oopsable condition when nsm_mon_unmon is called as part of the
namespace cleanup, which now apparently happens after the utsname
has been freed.

Link: http://lkml.kernel.org/r/20150125220604.090121ae@neptune.home
Reported-by: Bruno Prémont <bonbons@linux-vserver.org>
Cc: stable@vger.kernel.org # 3.18
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

03a9a42a

28 Nov, 2014 1 commit

sunrpc: add debugfs file for displaying client rpc_task queue · b4b9d2cc

Authored by Jeff Layton on Nov 26, 2014

It's possible to get a dump of the RPC task queue by writing a value to
/proc/sys/sunrpc/rpc_debug. If you write any value to that file, you get
a dump of the RPC client task list into the log buffer. This is a rather
inconvenient interface however, and makes it hard to get immediate info
about the task queue.

Add a new directory hierarchy under debugfs:

    sunrpc/
        rpc_clnt/
            <clientid>/

Within each clientid directory we create a new "tasks" file that will
dump info similar to what shows up in the log buffer, but with a few
small differences -- we avoid printing raw kernel addresses in favor of
symbolic names and the XID is also displayed.
Signed-off-by: Jeff Layton <jlayton@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

b4b9d2cc

25 Nov, 2014 1 commit

sunrpc: eliminate RPC_DEBUG · f895b252

Authored by Jeff Layton on Nov 17, 2014

It's always set to whatever CONFIG_SUNRPC_DEBUG is, so just use that.
Signed-off-by: Jeff Layton <jlayton@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

f895b252

26 Sep, 2014 1 commit

SUNRPC: Add missing support for RPC_CLNT_CREATE_NO_RETRANS_TIMEOUT · 2aca5b86

Authored by Trond Myklebust on Sep 24, 2014

The flag RPC_CLNT_CREATE_NO_RETRANS_TIMEOUT was introduced in
order to allow NFSv4 clients to disable resend timeouts. Since those
cause the RPC layer to break the connection, they mess up the duplicate
reply caches that remain indexed on the port number in NFSv4.

This patch includes the code that was missing in the original to
set the appropriate flag in struct rpc_clnt, when the caller of
rpc_create() sets RPC_CLNT_CREATE_NO_RETRANS_TIMEOUT.

Fixes: 8a19a0b6 (SUNRPC: Add RPC task and client level options to...)
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

2aca5b86

25 Sep, 2014 1 commit

rpc: Add -EPERM processing for xs_udp_send_request() · 3dedbb5c

Authored by Jason Baron on Sep 24, 2014

If an iptables drop rule is added for an nfs server, the client can end up in
a softlockup. Because of the way that xs_sendpages() is structured, the -EPERM
is ignored since the prior bits of the packet may have been successfully queued
and thus xs_sendpages() returns a non-zero value. Then, xs_udp_send_request()
thinks that because some bits were queued it should return -EAGAIN. We then try
the request again and again, resulting in cpu spinning. Reproducer:

1) open a file on the nfs server '/nfs/foo' (mounted using udp)
2) iptables -A OUTPUT -d <nfs server ip> -j DROP
3) write to /nfs/foo
4) close /nfs/foo
5) iptables -D OUTPUT -d <nfs server ip> -j DROP

The softlockup occurs in step 4 above.

The previous patch allows xs_sendpages() to return both a sent count and
any error values that may have occurred. Thus, if we get an -EPERM, return
that to the higher level code.

With this patch in place we can successfully abort the above sequence and
avoid the softlockup.

I also tried the above test case on an nfs mount on tcp and although the system
does not softlockup, I still ended up with the 'hung_task' firing after 120
seconds, due to the i/o being stuck. The tcp case appears a bit harder to fix,
since -EPERM appears to get ignored much lower down in the stack and does not
propagate up to xs_sendpages(). This case is not quite as insidious as the
softlockup and it is not addressed here.
Reported-by: Yigong Lou <ylou@akamai.com>
Signed-off-by: Jason Baron <jbaron@akamai.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

3dedbb5c
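
The failure mode in commit 3dedbb5c comes from a send routine that can report only one thing, so a byte count hides a later error. A rough userspace sketch of the "return the error and the sent count separately" idea follows. `demo_sendpages` and `demo_send_request` are hypothetical, and the array of per-chunk results simulates the socket layer; none of this is the actual xprtsock code.

```c
#include <errno.h>
#include <stddef.h>

/*
 * Simulated send: chunk_results[i] is the number of bytes queued for
 * chunk i, or a negative errno. The error is the return value and the
 * byte count goes to *sent, so neither can mask the other.
 */
static int demo_sendpages(const int *chunk_results, size_t nchunks, int *sent)
{
	*sent = 0;
	for (size_t i = 0; i < nchunks; i++) {
		if (chunk_results[i] < 0)
			return chunk_results[i];
		*sent += chunk_results[i];
	}
	return 0;
}

/*
 * Caller: -EPERM is handed to the higher level code instead of being
 * turned into -EAGAIN. In this sketch every other short send is
 * treated as retryable.
 */
static int demo_send_request(const int *chunk_results, size_t nchunks,
			     int total)
{
	int sent = 0;
	int status = demo_sendpages(chunk_results, nchunks, &sent);

	if (status == -EPERM)
		return status;
	if (sent < total)
		return -EAGAIN;
	return 0;
}
```

In the iptables reproducer, the first part of the packet is queued and a later part is rejected; with the split return values the caller sees `-EPERM` even though `sent` is non-zero, and can abort instead of spinning on `-EAGAIN`.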

03 Jul, 2014 1 commit

SUNRPC: Handle EPIPE in xprt_connect_status · 2fc193cf

Authored by Trond Myklebust on Jul 03, 2014

The callback handler xs_error_report() can end up propagating an EPIPE
error by means of the call to xprt_wake_pending_tasks(). Ensure that
xprt_connect_status() does not automatically convert this into an
EIO error.
Reported-by: Weston Andros Adamson <dros@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

2fc193cf
