提交 · a52458b48af142bcc2b72fe810c0db20cfae7fdd · openeuler / Kernel

You need to sign in or sign up before continuing.

20 12月, 2018 11 次提交

NFS/NFSD/SUNRPC: replace generic creds with 'struct cred'. · a52458b4

由 NeilBrown 提交于 12月 03, 2018

SUNRPC has two sorts of credentials, both of which appear as
"struct rpc_cred".
There are "generic credentials" which are supplied by clients
such as NFS and passed in 'struct rpc_message' to indicate
which user should be used to authorize the request, and there
are low-level credentials such as AUTH_NULL, AUTH_UNIX, AUTH_GSS
which describe the credential to be sent over the wires.

This patch replaces all the generic credentials by 'struct cred'
pointers - the credential structure used throughout Linux.

For machine credentials, there is a special 'struct cred *' pointer
which is statically allocated and recognized where needed as
having a special meaning.  A look-up of a low-level cred will
map this to a machine credential.
Signed-off-by: NNeilBrown <neilb@suse.com>
Acked-by: NJ. Bruce Fields <bfields@redhat.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

a52458b4

SUNRPC: remove RPCAUTH_AUTH_NO_CRKEY_TIMEOUT · 354698b7

由 NeilBrown 提交于 12月 03, 2018

This is no longer used.
Signed-off-by: NNeilBrown <neilb@suse.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

354698b7

NFS: move credential expiry tracking out of SUNRPC into NFS. · ddf529ee

由 NeilBrown 提交于 12月 03, 2018

NFS needs to know when a credential is about to expire so that
it can modify write-back behaviour to finish the write inside the
expiry time.
It currently uses functions in SUNRPC code which make use of a
fairly complex callback scheme and flags in the generic credientials.

As I am working to discard the generic credentials, this has to change.

This patch moves the logic into NFS, in part by finding and caching
the low-level credential in the open_context.  We then make direct
cred-api calls on that.

This makes the code much simpler and removes a dependency on generic
rpc credentials.
Signed-off-by: NNeilBrown <neilb@suse.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

ddf529ee

SUNRPC: add side channel to use non-generic cred for rpc call. · 1de7eea9

由 NeilBrown 提交于 12月 03, 2018

The credential passed in rpc_message.rpc_cred is always a
generic credential except in one instance.
When gss_destroying_context() calls rpc_call_null(), it passes
a specific credential that it needs to destroy.
In this case the RPC acts *on* the credential rather than
being authorized by it.

This special case deserves explicit support and providing that will
mean that rpc_message.rpc_cred is *always* generic, allowing
some optimizations.

So add "tk_op_cred" to rpc_task and "rpc_op_cred" to the setup data.
Use this to pass the cred down from rpc_call_null(), and have
rpcauth_bindcred() notice it and bind it in place.

Credit to kernel test robot <fengguang.wu@intel.com> for finding
a bug in earlier version of this patch.
Signed-off-by: NNeilBrown <neilb@suse.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

1de7eea9

SUNRPC: introduce RPC_TASK_NULLCREDS to request auth_none · a68a72e1

由 NeilBrown 提交于 12月 03, 2018

In almost all cases the credential stored in rpc_message.rpc_cred
is a "generic" credential.  One of the two expections is when an
AUTH_NULL credential is used such as for RPC ping requests.

To improve consistency, don't pass an explicit credential in
these cases, but instead pass NULL and set a task flag,
similar to RPC_TASK_ROOTCREDS, which requests that NULL credentials
be used by default.
Signed-off-by: NNeilBrown <neilb@suse.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

a68a72e1

NFS/SUNRPC: don't lookup machine credential until rpcauth_bindcred(). · 5e16923b

由 NeilBrown 提交于 12月 03, 2018

When NFS creates a machine credential, it is a "generic" credential,
not tied to any auth protocol, and is really just a container for
the princpal name.
This doesn't get linked to a genuine credential until rpcauth_bindcred()
is called.
The lookup always succeeds, so various places that test if the machine
credential is NULL, are pointless.

As a step towards getting rid of generic credentials, this patch gets
rid of generic machine credentials.  The nfs_client and rpc_client
just hold a pointer to a constant principal name.
When a machine credential is wanted, a special static 'struct rpc_cred'
pointer is used. rpcauth_bindcred() recognizes this, finds the
principal from the client, and binds the correct credential.
Signed-off-by: NNeilBrown <neilb@suse.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

5e16923b

SUNRPC: remove machine_cred field from struct auth_cred · 1a80810f

由 NeilBrown 提交于 12月 03, 2018

The cred is a machine_cred iff ->principal is set, so there is no
need for the extra flag.

There is one case which deserves some
explanation. nfs4_root_machine_cred() calls rpc_lookup_machine_cred()
with a NULL principal name which results in not getting a machine
credential, but getting a root credential instead.
This appears to be what is expected of the caller, and is
clearly the result provided by both auth_unix and auth_gss
which already ignore the flag.
Signed-off-by: NNeilBrown <neilb@suse.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

1a80810f

SUNRPC: remove uid and gid from struct auth_cred · 8276c902

由 NeilBrown 提交于 12月 03, 2018

Use cred->fsuid and cred->fsgid instead.
Signed-off-by: NNeilBrown <neilb@suse.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

8276c902

SUNRPC: remove groupinfo from struct auth_cred. · fc0664fd

由 NeilBrown 提交于 12月 03, 2018

We can use cred->groupinfo (from the 'struct cred') instead.
Signed-off-by: NNeilBrown <neilb@suse.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

fc0664fd

SUNRPC: add 'struct cred *' to auth_cred and rpc_cred · 97f68c6b

由 NeilBrown 提交于 12月 03, 2018

The SUNRPC credential framework was put together before
Linux has 'struct cred'.  Now that we have it, it makes sense to
use it.
This first step just includes a suitable 'struct cred *' pointer
in every 'struct auth_cred' and almost every 'struct rpc_cred'.

The rpc_cred used for auth_null has a NULL 'struct cred *' as nothing
else really makes sense.

For rpc_cred, the pointer is reference counted.
For auth_cred it isn't.  struct auth_cred are either allocated on
the stack, in which case the thread owns a reference to the auth,
or are part of 'struct generic_cred' in which case gc_base owns the
reference, and "acred" shares it.
Signed-off-by: NNeilBrown <neilb@suse.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

97f68c6b

SUNRPC: allow /proc entries without CONFIG_SUNRPC_DEBUG · 8e2e5b7c

由 Ben Dooks 提交于 11月 28, 2018

If we want /proc/sys/sunrpc the current kernel also drags in other debug
features which we don't really want. Instead, we should always show the
following entries:

/proc/sys/sunrpc/udp_slot_table_entries
/proc/sys/sunrpc/tcp_slot_table_entries
/proc/sys/sunrpc/tcp_max_slot_table_entries
/proc/sys/sunrpc/min_resvport
/proc/sys/sunrpc/max_resvport
/proc/sys/sunrpc/tcp_fin_timeout
Signed-off-by: NBen Dooks <ben.dooks@codethink.co.uk>
Signed-off-by: NThomas Preston <thomas.preston@codethink.co.uk>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

8e2e5b7c

19 12月, 2018 3 次提交

SUNRPC: Remove xprt_connect_status() · abc13275

由 Trond Myklebust 提交于 12月 17, 2018

Over the years, xprt_connect_status() has been superseded by
call_connect_status(), which now handles all the errors that
xprt_connect_status() does and more. Since the latter converts
all errors that it doesn't recognise to EIO, then it is time
for it to be retired.
Reported-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Tested-by: NChuck Lever <chuck.lever@oracle.com>

abc13275

SUNRPC: Fix a race with XPRT_CONNECTING · cf76785d

由 Trond Myklebust 提交于 12月 17, 2018

Ensure that we clear XPRT_CONNECTING before releasing the XPRT_LOCK so that
we don't have races between the (asynchronous) socket setup code and
tasks in xprt_connect().
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Tested-by: NChuck Lever <chuck.lever@oracle.com>

cf76785d

SUNRPC: Fix disconnection races · 0445f92c

由 Trond Myklebust 提交于 12月 17, 2018

When the socket is closed, we need to call xprt_disconnect_done() in order
to clean up the XPRT_WRITE_SPACE flag, and wake up the sleeping tasks.

However, we also want to ensure that we don't wake them up before the socket
is closed, since that would cause thundering herd issues with everyone
piling up to retransmit before the TCP shutdown dance has completed.
Only the task that holds XPRT_LOCKED needs to wake up early in order to
allow the close to complete.
Reported-by: NDave Wysochanski <dwysocha@redhat.com>
Reported-by: NScott Mayhew <smayhew@redhat.com>
Cc: Chuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Tested-by: NChuck Lever <chuck.lever@oracle.com>

0445f92c

05 12月, 2018 6 次提交

SUNRPC: Don't force a redundant disconnection in xs_read_stream() · 79462857

由 Trond Myklebust 提交于 12月 03, 2018

If the connection is broken, then xs_tcp_state_change() will take care
of scheduling the socket close as soon as appropriate. xs_read_stream()
just needs to report the error.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

79462857

SUNRPC: Fix up socket polling · dfcf0380

由 Trond Myklebust 提交于 12月 04, 2018

Ensure that we do not exit the socket read callback without clearing
XPRT_SOCK_DATA_READY.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

dfcf0380

SUNRPC: Use the discard iterator rather than MSG_TRUNC · b76a5afd

由 Trond Myklebust 提交于 12月 03, 2018

When discarding message data from the stream, we're better off using
the discard iterator, since that will work with non-TCP streams.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

b76a5afd

T
SUNRPC: Treat EFAULT as a truncated message in xs_read_stream_request() · 26781eab
由 Trond Myklebust 提交于 12月 03, 2018
```
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
```
26781eab

SUNRPC: Fix up handling of the XDRBUF_SPARSE_PAGES flag · 16e5e90f

由 Trond Myklebust 提交于 12月 02, 2018

If the allocator fails before it has reached the target number of pages,
then we need to recheck that we're not seeking past the page buffer.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

16e5e90f

SUNRPC: Fix RPC receive hangs · c4433055

由 Trond Myklebust 提交于 12月 04, 2018

The RPC code is occasionally hanging when the receive code fails to
empty the socket buffer due to a partial read of the data. When we
convert that to an EAGAIN, it appears we occasionally leave data in the
socket. The fix is to just keep reading until the socket returns
EAGAIN/EWOULDBLOCK.
Reported-by: NCatalin Marinas <catalin.marinas@arm.com>
Reported-by: NCristian Marussi <cristian.marussi@arm.com>
Reported-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Tested-by: NCatalin Marinas <catalin.marinas@arm.com>
Tested-by: NCristian Marussi <cristian.marussi@arm.com>

c4433055

02 12月, 2018 4 次提交

SUNRPC: Fix a potential race in xprt_connect() · 0a9a4304

由 Trond Myklebust 提交于 12月 01, 2018

If an asynchronous connection attempt completes while another task is
in xprt_connect(), then the call to rpc_sleep_on() could end up
racing with the call to xprt_wake_pending_tasks().
So add a second test of the connection state after we've put the
task to sleep and set the XPRT_CONNECTING flag, when we know that there
can be no asynchronous connection attempts still in progress.

Fixes: 0b9e7943 ("SUNRPC: Move the test for XPRT_CONNECTING into...")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

0a9a4304

SUNRPC: Fix a memory leak in call_encode() · 71700bb9

由 Trond Myklebust 提交于 11月 30, 2018

If we retransmit an RPC request, we currently end up clobbering the
value of req->rq_rcv_buf.bvec that was allocated by the initial call to
xprt_request_prepare(req).
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

71700bb9

SUNRPC: Fix leak of krb5p encode pages · 8dae5398

由 Chuck Lever 提交于 11月 30, 2018

call_encode can be invoked more than once per RPC call. Ensure that
each call to gss_wrap_req_priv does not overwrite pointers to
previously allocated memory.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Cc: stable@kernel.org
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

8dae5398

SUNRPC: call_connect_status() must handle tasks that got transmitted · 9bd11523

由 Trond Myklebust 提交于 11月 30, 2018

If a task failed to get the write lock in the call to xprt_connect(), then
it will be queued on xprt->sending. In that case, it is possible for it
to get transmitted before the call to call_connect_status(), in which
case it needs to be handled by call_transmit_status() instead.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

9bd11523

13 11月, 2018 2 次提交

T
SUNRPC: Fix a bogus get/put in generic_key_to_expire() · e3d5e573
由 Trond Myklebust 提交于 11月 12, 2018
```
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
```
e3d5e573

SUNRPC: Fix a Oops when destroying the RPCSEC_GSS credential cache · a652a4bc

由 Trond Myklebust 提交于 11月 12, 2018

Commit 07d02a67 causes a use-after free in the RPCSEC_GSS credential
destroy code, because the call to get_rpccred() in gss_destroying_context()
will now always fail to increment the refcount.

While we could just replace the get_rpccred() with a refcount_set(), that
would have the unfortunate consequence of resurrecting a credential in
the credential cache for which we are in the process of destroying the
RPCSEC_GSS context. Rather than do this, we choose to make a copy that
is never added to the cache and use that to destroy the context.

Fixes: 07d02a67 ("SUNRPC: Simplify lookup code")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

a652a4bc

09 11月, 2018 1 次提交

SUNRPC: drop pointless static qualifier in xdr_get_next_encode_buffer() · 025911a5

由 YueHaibing 提交于 11月 08, 2018

There is no need to have the '__be32 *p' variable static since new value
always be assigned before use it.
Signed-off-by: NYueHaibing <yuehaibing@huawei.com>
Cc: stable@vger.kernel.org
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>

025911a5

06 11月, 2018 1 次提交

sunrpc: correct the computation for page_ptr when truncating · 5d7a5bcb

由 Frank Sorenson 提交于 10月 30, 2018

When truncating the encode buffer, the page_ptr is getting
advanced, causing the next page to be skipped while encoding.
The page is still included in the response, so the response
contains a page of bogus data.

We need to adjust the page_ptr backwards to ensure we encode
the next page into the correct place.

We saw this triggered when concurrent directory modifications caused
nfsd4_encode_direct_fattr() to return nfserr_noent, and the resulting
call to xdr_truncate_encode() corrupted the READDIR reply.
Signed-off-by: NFrank Sorenson <sorenson@redhat.com>
Cc: stable@vger.kernel.org
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>

5d7a5bcb

02 11月, 2018 2 次提交

missing bits of "iov_iter: Separate type from direction and use accessor functions" · 0e9b4a82

由 Al Viro 提交于 11月 01, 2018

sunrpc patches from nfs tree conflict with calling conventions change done
in iov_iter work.  Trivial fixup...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

0e9b4a82

SUNRPC: Use atomic(64)_t for seq_send(64) · c3be6577

由 Paul Burton 提交于 11月 01, 2018

The seq_send & seq_send64 fields in struct krb5_ctx are used as
atomically incrementing counters. This is implemented using cmpxchg() &
cmpxchg64() to implement what amount to custom versions of
atomic_fetch_inc() & atomic64_fetch_inc().

Besides the duplication, using cmpxchg64() has another major drawback in
that some 32 bit architectures don't provide it. As such commit
571ed1fd ("SUNRPC: Replace krb5_seq_lock with a lockless scheme")
resulted in build failures for some architectures.

Change seq_send to be an atomic_t and seq_send64 to be an atomic64_t,
then use atomic(64)_* functions to manipulate the values. The atomic64_t
type & associated functions are provided even on architectures which
lack real 64 bit atomic memory access via CONFIG_GENERIC_ATOMIC64 which
uses spinlocks to serialize access. This fixes the build failures for
architectures lacking cmpxchg64().

A potential alternative that was raised would be to provide cmpxchg64()
on the 32 bit architectures that currently lack it, using spinlocks.
However this would provide a version of cmpxchg64() with semantics a
little different to the implementations on architectures with real 64
bit atomics - the spinlock-based implementation would only work if all
access to the memory used with cmpxchg64() is *always* performed using
cmpxchg64(). That is not currently a requirement for users of
cmpxchg64(), and making it one seems questionable. As such avoiding
cmpxchg64() outside of architecture-specific code seems best,
particularly in cases where atomic64_t seems like a better fit anyway.

The CONFIG_GENERIC_ATOMIC64 implementation of atomic64_* functions will
use spinlocks & so faces the same issue, but with the key difference
that the memory backing an atomic64_t ought to always be accessed via
the atomic64_* functions anyway making the issue moot.
Signed-off-by: NPaul Burton <paul.burton@mips.com>
Fixes: 571ed1fd ("SUNRPC: Replace krb5_seq_lock with a lockless scheme")
Cc: Trond Myklebust <trond.myklebust@hammerspace.com>
Cc: Anna Schumaker <anna.schumaker@netapp.com>
Cc: J. Bruce Fields <bfields@fieldses.org>
Cc: Jeff Layton <jlayton@kernel.org>
Cc: David S. Miller <davem@davemloft.net>
Cc: linux-nfs@vger.kernel.org
Cc: netdev@vger.kernel.org
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

c3be6577

30 10月, 2018 10 次提交

nfsd: Fix an Oops in free_session() · bb6ad557

由 Trond Myklebust 提交于 10月 09, 2018

In call_xpt_users(), we delete the entry from the list, but we
do not reinitialise it. This triggers the list poisoning when
we later call unregister_xpt_user() in nfsd4_del_conns().
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Cc: stable@vger.kernel.org
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>

bb6ad557

svcrdma: Remove try_module_get from backchannel · 07880fa4

由 Chuck Lever 提交于 10月 01, 2018

Since commit ffe1f0df ("rpcrdma: Merge svcrdma and xprtrdma
modules into one"), the forward and backchannel components are part
of the same kernel module. A separate try_module_get() call in the
backchannel code is no longer necessary.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Reviewed-by: NSagi Grimberg <sagi@grimberg.me>
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>

07880fa4

svcrdma: Remove ->release_rqst call in bc reply handler · 596f2a19

由 Chuck Lever 提交于 10月 01, 2018

Similar to a change made in the client's forward channel reply
handler: The xprt_release_rqst_cong() call is not necessary.

Also, release xprt->recv_lock when taking xprt->transport_lock
to avoid disabling and enabling BH's while holding another
spin lock.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Reviewed-by: NSagi Grimberg <sagi@grimberg.me>
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>

596f2a19

svcrdma: Reduce max_send_sges · f3c1fd0e

由 Chuck Lever 提交于 10月 01, 2018

There's no need to request a large number of send SGEs because the
inline threshold already constrains the number of SGEs per Send.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Reviewed-by: NSagi Grimberg <sagi@grimberg.me>
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>

f3c1fd0e

SUNRPC: Simplify TCP receive code · 4c8e5537

由 Trond Myklebust 提交于 10月 01, 2018

Use the fact that the iov iterators already have functionality for
skipping a base offset.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>

4c8e5537

SUNRPC: Replace the cache_detail->hash_lock with a regular spinlock · 1863d77f

由 Trond Myklebust 提交于 10月 01, 2018

Now that the reader functions are all RCU protected, use a regular
spinlock rather than a reader/writer lock.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>

1863d77f

SUNRPC: Remove non-RCU protected lookup · d48cf356

由 Trond Myklebust 提交于 10月 01, 2018

Clean up the cache code by removing the non-RCU protected lookup.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>

d48cf356

SUNRPC: Lockless server RPCSEC_GSS context lookup · 6d1616b2

由 Trond Myklebust 提交于 10月 01, 2018

Use RCU protection for looking up the RPCSEC_GSS context.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>

6d1616b2

SUNRPC: Make server side AUTH_UNIX use lockless lookups · fd5d2f78

由 Trond Myklebust 提交于 10月 01, 2018

Convert structs ip_map and unix_gid to use RCU protected lookups.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>

fd5d2f78

SUNRPC: Allow cache lookups to use RCU protection rather than the r/w spinlock · ae74136b

由 Trond Myklebust 提交于 10月 03, 2018

Instead of the reader/writer spinlock, allow cache lookups to use RCU
for looking up entries. This is more efficient since modifications can
occur while other entries are being looked up.

Note that for now, we keep the reader/writer spinlock until all users
have been converted to use RCU-safe freeing of their cache entries.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>

ae74136b

openeuler / Kernel 接近 3 年 前同步成功

openeuler / Kernel
接近 3 年前同步成功