提交 · f45663ce5fb30f76a3414ab3ac69f4dd320e760a · openeuler / raspberrypi-kernel

10 7月, 2008 19 次提交

SUNRPC: Use only rpcbind v2 for AF_INET requests · 40fef8a6

由 Chuck Lever 提交于 6月 25, 2008

Some server vendors support the higher versions of rpcbind only for
AF_INET6.  The kernel doesn't need to use v3 or v4 for AF_INET anyway,
so change the kernel's rpcbind client to query AF_INET servers over
rpcbind v2 only.

This has a few interesting benefits:

1. If the rpcbind request is going over TCP, and the server doesn't
   support rpcbind versions 3 or 4, the client reduces by two the number
   of ephemeral ports left in TIME_WAIT for each rpcbind request.  This
   will help during NFS mount storms.

2. The rpcbind interaction with servers that don't support rpcbind
   versions 3 or 4 will use less network traffic.  Also helpful
   during mount storms.

3. We can eliminate the kernel build option that controls whether the
   kernel's rpcbind client uses rpcbind version 3 and 4 for AF_INET
   servers.  Less complicated kernel configuration...
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

40fef8a6

SUNRPC: Use GETADDR for rpcbind version 4 queries · 8842413a

由 Chuck Lever 提交于 6月 25, 2008

Some rpcbind servers that do support rpcbind version 4 do not support
the GETVERSADDR procedure.  Use GETADDR for querying rpcbind servers
via rpcbind version 4 instead of GETVERSADDR.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

8842413a

SUNRPC: Use rpcbind version 2 GETPORT · 6a774051

由 Chuck Lever 提交于 6月 25, 2008

Clean up: Change the version 2 procedure name to GETPORT.  It's the same
procedure number as GETADDR, but version 2 implementations usually refer
to it as GETPORT.

This also now matches the procedure name used in the version 2 procedure
entry in the rpcb_next_version[] array, making it slightly less confusing.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

6a774051

SUNRPC: Document some naked integers in rpcbind client · fc200e79

由 Chuck Lever 提交于 6月 25, 2008

Clean up: Replace naked integers that represent rpcbind protocol versions.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

fc200e79

SUNRPC: More useful debugging output for rpcb client · 877fcf10

由 Chuck Lever 提交于 6月 25, 2008

Clean up dprintk's in rpcb client's XDR decoder functions.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

877fcf10

SUNRPC: Ensure all transports set rq_xtime consistently · b22602a6

由 Chuck Lever 提交于 6月 06, 2008

The RPC client uses the rq_xtime field in each RPC request to determine the
round-trip time of the request.  Currently, the rq_xtime field is
initialized by each transport just before it starts enqueing a request to
be sent.  However, transports do not handle initializing this value
consistently; sometimes they don't initialize it at all.

To make the measurement of request round-trip time consistent for all
RPC client transport capabilities, pull rq_xtime initialization into the
RPC client's generic transport logic.  Now all transports will get a
standardized RTT measure automatically, from:

  xprt_transmit()

to

  xprt_complete_rqst()

This makes round-trip time calculation more accurate for the TCP transport.
The socket ->sendmsg() method can return "-EAGAIN" if the socket's output
buffer is full, so the TCP transport's ->send_request() method may call
the ->sendmsg() method repeatedly until it gets all of the request's bytes
queued in the socket's buffer.

Currently, the TCP transport sets the rq_xtime field every time through
that loop so the final value is the timestamp just before the *last* call
to the underlying socket's ->sendmsg() method.  After this patch, the
rq_xtime field contains a timestamp that reflects the time just before the
*first* call to ->sendmsg().

This is consequential under heavy workloads because large requests often
take multiple ->sendmsg() calls to get all the bytes of a request queued.
The TCP transport causes the request to sleep until the remote end of the
socket has received enough bytes to clear space in the socket's local
output buffer.  This delay can be quite significant.

The method introduced by this patch is a more accurate measure of RTT
for stream transports, since the server can cause enough back pressure
to delay (ie increase the latency of) requests from the client.

Additionally, this patch corrects the behavior of the RDMA transport, which
entirely neglected to initialize the rq_xtime field.  RPC performance
metrics for RDMA transports now display correct RPC request round trip
times.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Acked-by: NTom Talpey <thomas.talpey@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

b22602a6

rpc: minor cleanup of scheduler callback code · a486aeda

由 \\\"J. Bruce Fields\\\ 提交于 6月 09, 2008

Try to make the comment here a little more clear and concise.

Also, this macro definition seems unnecessary.
Signed-off-by: NJ. Bruce Fields <bfields@citi.umich.edu>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

a486aeda

rpc: remove some unused macros · d25a03cf

由 \\\"J. Bruce Fields\\\ 提交于 6月 09, 2008

There used to be a print_hexl() function that used isprint(), now gone.
I don't know why NFS_NGROUPS and CA_RUN_AS_MACHINE were here.

I also don't know why another #define that's actually used was marked
"unused".
Signed-off-by: NJ. Bruce Fields <bfields@citi.umich.edu>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

d25a03cf

rpc: eliminate unused variable in auth_gss upcall code · 720b8f2d

由 \\\"J. Bruce Fields\\\ 提交于 6月 09, 2008

Also, a minor comment grammar fix in the same file.
Signed-off-by: NJ. Bruce Fields <bfields@citi.umich.edu>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

720b8f2d

rpc: bring back cl_chatty · b6b6152c

由 Olga Kornievskaia 提交于 6月 09, 2008

The cl_chatty flag alows us to control whether a given rpc client leaves

	"server X not responding, timed out"

messages in the syslog.  Such messages make sense for ordinary nfs
clients (where an unresponsive server means applications on the
mountpoint are probably hanging), but not for the callback client (which
can fail more commonly, with the only result just of disabling some
optimizations).

Previously cl_chatty was removed, do to lack of users; reinstate it, and
use it for the nfsd's callback client.
Signed-off-by: NJ. Bruce Fields <bfields@citi.umich.edu>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

b6b6152c

SUNRPC: Remove obsolete messages during transport connect · cd983ef8

由 Chuck Lever 提交于 6月 11, 2008

Recent changes to the RPC client's transport connect logic make connect
status values ECONNREFUSED and ECONNRESET impossible.

Clean up xprt_connect_status() to account for these changes.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

cd983ef8

SUNRPC: Display some debugging information as text rather than numbers · cb3997b5

由 Chuck Lever 提交于 5月 21, 2008

In rpc_show_tasks(), display the program name, version number, procedure
name and tk_action as human-readable variable-length text fields rather
than columnar numbers.

Doing the symbol lookup here helps in cases where we have actual
debugging output from a kernel log, but don't have access to the kernel
image or RPC module that generated the output.

Sample output:

-pid- flgs status -client- --rqstp- -timeout ---ops--
5608 0001 -11 eeb42690 f6d93710 0 f8fa1764 nfsv3 WRITE a:call_transmit_status q:none
5609 0001 -11 eeb42690 f6d937e0 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending
5610 0001 -11 eeb42690 f6d93230 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending
5611 0001 -11 eeb42690 f6d93300 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending
5612 0001 -11 eeb42690 f6d93090 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending
5613 0001 -11 eeb42690 f6d933d0 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending
5614 0001 -11 eeb42690 f6d93cc0 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending
5615 0001 -11 eeb42690 f6d93a50 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending
5616 0001 -11 eeb42690 f6d93640 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending
5617 0001 -11 eeb42690 f6d93b20 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending
5618 0001 -11 eeb42690 f6d93160 0 f8fa1764 nfsv3 WRITE a:call_status q:xprt_sending
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

cb3997b5

SUNRPC: Refactor rpc_show_tasks · 38e886e0

由 Chuck Lever 提交于 5月 21, 2008

Clean up: move the logic that displays each task to its own function.
This removes indentation and makes future changes easier.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

38e886e0

SUNRPC: Don't display the rpc_show_tasks header if there are no tasks · 68a23ee9

由 Chuck Lever 提交于 5月 21, 2008

Clean up: don't display the rpc_show_tasks column header unless there is at
least one task to display. As far as I can tell, it is safe to let the
list_for_each_entry macro decide that each list is empty.

scripts/checkpatch.pl also wants a KERN_FOO at the start of any newly added
printk() calls, so this and subsequent patches will also add KERN_INFO.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

68a23ee9

SUNRPC: Rename "call_" functions that are no longer FSM states · b0e1c57e

由 Chuck Lever 提交于 5月 21, 2008

The RPC client uses a finite state machine to move RPC tasks through each
step of an RPC request. Each state is contained in a function in
net/sunrpc/clnt.c, and named call_foo.

Some of the functions named call_foo have changed over the past few years and
are no longer states in the FSM. These include: call_encode, call_header,
and call_verify. As a clean up, rename the functions that have changed.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

b0e1c57e

SUNRPC: Add a function to display the name of an RPC procedure · 3748f1e4

由 Chuck Lever 提交于 5月 21, 2008

Improve debugging messages in call_start() and call_verify() by having
them show the RPC procedure name instead of the procedure number.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

3748f1e4

SUNRPC: Use GFP_NOFS when allocating credentials · 0f38b873

由 Trond Myklebust 提交于 6月 10, 2008

Since the credentials may be allocated during the call to rpc_new_task(),
which again may be called by a memory allocator...
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

0f38b873

SUNRPC: An ENOMEM error from call_encode is always fatal · b390c2b5

由 Trond Myklebust 提交于 6月 10, 2008

The special 'ENOMEM' case that was previously flagged as non-fatal is
bogus: auth_gss always returns EAGAIN for non-fatal errors, and may in fact
return ENOMEM in the special case where xdr_buf_read_netobj runs out of
preallocated buffer space (invariably a _fatal_ error, since there is no
provision for preallocating larger buffers).
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

b390c2b5

SUNRPC: Ensure we exit early in case of an encode error · 8b39f2b4

由 Trond Myklebust 提交于 5月 14, 2008

All errors from call_encode(), with exception of EAGAIN are fatal, so we
should immediately return instead of proceeding to xprt_transmit().
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

8b39f2b4

09 7月, 2008 2 次提交

SUNRPC: Fix an rpcbind breakage for the case of IPv6 lookups · 803a9067

由 Trond Myklebust 提交于 7月 01, 2008

Now that rpcb_next_version has been split into an IPv4 version and an IPv6
version, we Oops when rpcb_call_async attempts to look up the IPv6-specific
RPC procedure in rpcb_next_version.

Fix the Oops simply by having rpcb_getport_async pass the correct RPC
procedure as an argument.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

803a9067

SUNRPC: Fix a double-free in rpcbind · 0d3a34b4

由 Trond Myklebust 提交于 7月 07, 2008

It is wrong to be freeing up the rpcbind arguments if the call to
rpcb_call_async() fails, since they should already have been freed up by
rpcb_map_release().
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

0d3a34b4

04 7月, 2008 1 次提交

svcrpc: fix handling of garbage args · b620754b

由 J. Bruce Fields 提交于 7月 03, 2008

To return garbage_args, the accept_stat must be 0, and we must have a
verifier.  So we shouldn't be resetting the write pointer as we reject
the call.

Also, we must add the two placeholder words here regardless of success
of the unwrap, to ensure the output buffer is left in a consistent state
for svcauth_gss_release().

This fixes a BUG() in svcauth_gss.c:svcauth_gss_release().

Thanks to Aime Le Rouzic for bug report, debugging help, and testing.
Signed-off-by: NJ. Bruce Fields <bfields@citi.umich.edu>
Tested-by: NAime Le Rouzic <aime.le-rouzic@bull.net>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

b620754b

19 5月, 2008 18 次提交

svcrdma: Verify read-list fits within RPCSVC_MAXPAGES · a6f911c0

由 Tom Tucker 提交于 5月 13, 2008

A RDMA read-list cannot contain more elements than RPCSVC_MAXPAGES or
it will overflow the DTO context. Verify this when processing the
protocol header.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

a6f911c0

svcrdma: Change svc_rdma_send_error return type to void · 008fdbc5

由 Tom Tucker 提交于 5月 07, 2008

The svc_rdma_send_error function is called when an RPCRDMA protocol
error is detected. This function attempts to post an error reply message.
Since an error posting to a transport in error is ignored, change
the return type to void.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

008fdbc5

svcrdma: Copy transport address and arm CQ before calling rdma_accept · af261af4

由 Tom Tucker 提交于 5月 07, 2008

This race was found by inspection. Messages can be received from the peer
immediately following the rdma_accept call, however, the CQ have not yet
been armed and the transport address has not yet been set.

Set the transport address in the connect request handler and arm the CQ
prior to calling rdma_accept.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

af261af4

svcrdma: Set rqstp transport address in rdma_read_complete function · 69500c43

由 Tom Tucker 提交于 5月 07, 2008

The rdma_read_complete function needs to copy the rqstp transport address
from the transport. Failure to do so can result in using the wrong
authentication method for the RPC or bug checking if the rqstp address
is not valid.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

69500c43

svcrdma: Use ib verbs version of dma_unmap · 97a3df38

由 Tom Tucker 提交于 5月 01, 2008

Use the ib_verbs version of the dma_unmap service in the
svc_rdma_put_context function. This should support providers
using software rdma.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

97a3df38

svcrdma: Cleanup queued, but unprocessed I/O in svc_rdma_free · 356d0a15

由 Tom Tucker 提交于 5月 01, 2008

When the transport is closing, the DTO tasklet may queue data
that never gets processed. Clean up resources associated with
this I/O.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

356d0a15

svcrdma: Move the QP and cm_id destruction to svc_rdma_free · 1711386c

由 Tom Tucker 提交于 5月 01, 2008

Move the destruction of the QP and CM_ID to the free path so that the
QP cleanup code doesn't race with the dto_tasklet handling flushed WR.
The QP reference is not needed because we now have a reference for
every WR.

Also add a guard in the SQ and RQ completion handlers to ignore
calls generated by some providers when the QP is destroyed.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

1711386c

svcrdma: Add reference for each SQ/RQ WR · 0905c0f0

由 Tom Tucker 提交于 5月 01, 2008

Add a reference on the transport for every outstanding WR.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

0905c0f0

svcrdma: Move destroy to kernel thread · 8da91ea8

由 Tom Tucker 提交于 4月 30, 2008

Some providers may wait while destroying adapter resources.
Since it is possible that the last reference is put on the
dto_tasklet, the actual destroy must be scheduled as a work item.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

8da91ea8

svcrdma: Shrink scope of spinlock on RQ CQ · 47698e08

由 Tom Tucker 提交于 5月 06, 2008

The rq_cq_reap function is only called from the dto_tasklet. The
only resource shared with other threads is the sc_rq_dto_q. Move the
spin lock to protect only this list.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

47698e08

svcrdma: Use standard Linux lists for context cache · 87407673

由 Tom Tucker 提交于 4月 30, 2008

Replace the one-off linked list implementation used to implement the
context cache with the standard Linux list_head lists. Add a context
counter to catch resource leaks. A WARN_ON will be added later to
ensure that we've freed all contexts.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

87407673

svcrdma: Simplify RDMA_READ deferral buffer management · 02e7452d

由 Tom Tucker 提交于 4月 30, 2008

An NFS_WRITE requires a set of RDMA_READ requests to fetch the write
data from the client. There are two principal pieces of data that
need to be tracked: the list of pages that comprise the completed RPC
and the SGE of dma mapped pages to refer to this list of pages. Previously
this whole bit was managed as a linked list of contexts with the
context containing the page list buried in this list. This patch
simplifies this processing by not keeping a linked list, but rather only
a pionter from the last submitted RDMA_READ's context to the context
that maps the set of pages that describe the RPC. This significantly
simplifies this code path. SGE contexts are cleaned up inline in the DTO
path instead of at read completion time.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

02e7452d

svcrdma: Remove unused READ_DONE context flags bit · 10a38c33

由 Tom Tucker 提交于 4月 30, 2008

The RDMACTXT_F_READ_DONE bit is not longer used. Remove it.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

10a38c33

svcrdma: Return error from rdma_read_xdr so caller knows to free context · d16d4009

由 Tom Tucker 提交于 5月 06, 2008

The rdma_read_xdr function did not discriminate between no read-list and
an error posting the read-list. This results in a leak of a page if there
is an error posting the read-list.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

d16d4009

svcrdma: Fix error handling during listening endpoint creation · 58e8f621

由 Tom Tucker 提交于 5月 06, 2008

A listening endpoint isn't known to the generic transport switch until
the svc_create_xprt function returns without error. Calling
svc_xprt_put within the xpo_create function causes the module reference
count to be erroneously decremented.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

58e8f621

svcrdma: Free context on post_recv error in send_reply · 5ac461a6

由 Tom Tucker 提交于 4月 25, 2008

If an error is encountered trying to post a recv buffer in send_reply,
free the passed in context. Return an error to the caller so it is
aware that the request was not posted.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

5ac461a6

svcrdma: Free context on ib_post_recv error · 05a0826a

由 Tom Tucker 提交于 4月 25, 2008

If there is an error posting the recv WR to the RQ, free the
context associated with the WR. This would leak a context when
asynchronous errors occurred on the transport while conccurent threads
were processing their RPC.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

05a0826a

svcrdma: Add put of connection ESTABLISHED reference in rdma_cma_handler · 120693d1

由 Tom Tucker 提交于 4月 24, 2008

The svcrdma transport takes a reference when it gets the ESTABLISHED
event from the provider. This reference is supposed to be removed when
the DISCONNECT event is received, however, the call to svc_xprt_put
was missing in the switch statement. This results in the memory
associated with the transport never being freed.
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>

120693d1