提交 · 0c78789e3a030615c6650fde89546cadf40ec2cc · openeuler / raspberrypi-kernel

30 8月, 2015 1 次提交

SUNRPC: xs_reset_transport must mark the connection as disconnected · 0c78789e

由 Trond Myklebust 提交于 8月 29, 2015

In case the reconnection attempt fails.

Cc: stable@vger.kernel.org
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

0c78789e

20 8月, 2015 1 次提交

SUNRPC: Allow sockets to do GFP_NOIO allocations · c2126157

由 Trond Myklebust 提交于 8月 19, 2015

Follow up to commit c4a7ca77 ("SUNRPC: Allow waiting on memory
allocation"). Allows the RPC socket code to do non-IO blocking.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

c2126157

18 8月, 2015 1 次提交

SUNRPC: Fix a thinko in xs_connect() · 99b1a4c3

由 Trond Myklebust 提交于 8月 13, 2015

It is rather pointless to test the value of transport->inet after
calling xs_reset_transport(), since it will always be zero, and
so we will never see any exponential back off behaviour.
Also don't force early connections for SOFTCONN tasks. If the server
disconnects us, we should respect the exponential backoff.

Cc: stable@vger.kernel.org # 4.0+
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

99b1a4c3

28 7月, 2015 1 次提交
- T
  SUNRPC: Report TCP errors to the caller · f580dd04
  由 Trond Myklebust 提交于 7月 11, 2015
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  f580dd04
27 7月, 2015 1 次提交

sunrpc: translate -EAGAIN to -ENOBUFS when socket is writable. · 743c69e7

由 NeilBrown 提交于 7月 27, 2015

The networking layer does not reliably report the distinction between
a non-block write failing because:
 1/ the queue is too full already and
 2/ a memory allocation attempt failed.

The distinction is important because in the first case it is
appropriate to retry as soon as the socket reports that it is
writable, and in the second case a small delay is required as the
socket will most likely report as writable but kmalloc could still
fail.

sk_stream_wait_memory() exhibits this distinction nicely, setting
'vm_wait' if a small wait is needed.  However in the non-blocking case
it always returns -EAGAIN no matter the cause of the failure.  This
-EAGAIN call get all the way to sunrpc.

The sunrpc layer expects EAGAIN to indicate the first cause, and
ENOBUFS to indicate the second.  Various documentation suggests that
this is not unreasonable, but does not guarantee the desired error
codes.

The result of getting -EAGAIN when -ENOBUFS is expected is that the
send is tried again in a tight loop and soft lockups are reported.

so: add tests after calls to xs_sendpages() to translate -EAGAIN into
-ENOBUFS if the socket is writable.  This cannot happen inside
xs_sendpages() as the test for "is socket writable" is different
between TCP and UDP.

With this change, the tight loop retrying xs_sendpages() becomes a
loop which only retries every 250ms, and so will not trigger a
soft-lockup warning.

It is possible that the write did fail because the queue was too full
and by the time xs_sendpages() completed, the queue was writable
again.  In this case an extra 250ms delay is inserted that isn't
really needed.  This circumstance suggests a degree of congestion so a
delay is not necessarily a bad thing, and it can only cause a single
250ms delay, not a series of them.
Signed-off-by: NNeilBrown <neilb@suse.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

743c69e7

03 7月, 2015 1 次提交

SUNRPC: Don't confuse ENOBUFS with a write_space issue · b5872f0c

由 Trond Myklebust 提交于 7月 03, 2015

ENOBUFS means that memory allocations are failing due to an actual
low memory situation. It should not be confused with being out of
socket buffer space.

Handle the problem by just punting to the delay in call_status.
Reported-by: NNeil Brown <neilb@suse.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

b5872f0c

21 6月, 2015 1 次提交

SUNRPC: Set the TCP user timeout option on client sockets · 775f06ab

由 Trond Myklebust 提交于 6月 20, 2015

Use the TCP_USER_TIMEOUT socket option to advertise to the server
how long we will keep the connection open if there is unacknowledged
data. See RFC5482.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

775f06ab

20 6月, 2015 1 次提交

SUNRPC: Ensure we release the TCP socket once it has been closed · 4876cc77

由 Trond Myklebust 提交于 6月 19, 2015

This fixes a regression introduced by commit caf4ccd4 ("SUNRPC:
Make xs_tcp_close() do a socket shutdown rather than a sock_release").
Prior to that commit, the autoclose feature would ensure that an
idle connection would result in the socket being both disconnected and
released, whereas now only gets disconnected.

While the current behaviour is harmless, it does leave the port bound
until either RPC traffic resumes or the RPC client is shut down.
Reported-by: NSteven Rostedt <rostedt@goodmis.org>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

4876cc77

11 6月, 2015 6 次提交

SUNRPC: Transport fault injection · 4a068258

由 Chuck Lever 提交于 5月 11, 2015

It has been exceptionally useful to exercise the logic that handles
local immediate errors and RDMA connection loss.  To enable
developers to test this regularly and repeatably, add logic to
simulate connection loss every so often.

Fault injection is disabled by default. It is enabled with

  $ sudo echo xxx > /sys/kernel/debug/sunrpc/inject_fault/disconnect

where "xxx" is a large positive number of transport method calls
before a disconnect. A value of several thousand is usually a good
number that allows reasonable forward progress while still causing a
lot of connection drops.

These hooks are disabled when SUNRPC_DEBUG is turned off.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

4a068258

sunrpc: turn swapper_enable/disable functions into rpc_xprt_ops · d67fa4d8

由 Jeff Layton 提交于 6月 03, 2015

RDMA xprts don't have a sock_xprt, but an rdma_xprt, so the
xs_swapper_enable/disable functions will likely oops when fed an RDMA
xprt. Turn these functions into rpc_xprt_ops so that that doesn't
occur. For now the RDMA versions are no-ops that just return -EINVAL
on an attempt to swapon.

Cc: Chuck Lever <chuck.lever@oracle.com>
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Reviewed-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

d67fa4d8

sunrpc: lock xprt before trying to set memalloc on the sockets · d6e971d8

由 Jeff Layton 提交于 6月 03, 2015

It's possible that we could race with a call to xs_reset_transport, in
which case the xprt->inet pointer could be zeroed out while we're
accessing it. Lock the xprt before we try to set memalloc on it.

Cc: Mel Gorman <mgorman@suse.de>
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Reviewed-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

d6e971d8

sunrpc: if we're closing down a socket, clear memalloc on it first · 264d1df3

由 Jeff Layton 提交于 6月 03, 2015

We currently increment the memalloc_socks counter if we have a xprt that
is associated with a swapfile. That socket can be replaced however
during a reconnect event, and the memalloc_socks counter is never
decremented if that occurs.

When tearing down a xprt socket, check to see if the xprt is set up for
swapping and sk_clear_memalloc before releasing the socket if so.
Acked-by: NMel Gorman <mgorman@suse.de>
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Reviewed-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

264d1df3

sunrpc: make xprt->swapper an atomic_t · 8e228133

由 Jeff Layton 提交于 6月 03, 2015

Split xs_swapper into enable/disable functions and eliminate the
"enable" flag.

Currently, it's racy if you have multiple swapon/swapoff operations
running in parallel over the same xprt. Also fix it so that we only
set it to a memalloc socket on a 0->1 transition and only clear it
on a 1->0 transition.

Cc: Mel Gorman <mgorman@suse.de>
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Reviewed-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

8e228133

sunrpc: keep a count of swapfiles associated with the rpc_clnt · 3c87ef6e

由 Jeff Layton 提交于 6月 03, 2015

Jerome reported seeing a warning pop when working with a swapfile on
NFS. The nfs_swap_activate can end up calling sk_set_memalloc while
holding the rcu_read_lock and that function can sleep.

To fix that, we need to take a reference to the xprt while holding the
rcu_read_lock, set the socket up for swapping and then drop that
reference. But, xprt_put is not exported and having NFS deal with the
underlying xprt is a bit of layering violation anyway.

Fix this by adding a set of activate/deactivate functions that take a
rpc_clnt pointer instead of an rpc_xprt, and have nfs_swap_activate and
nfs_swap_deactivate call those.

Also, add a per-rpc_clnt atomic counter to keep track of the number of
active swapfiles associated with it. When the counter does a 0->1
transition, we enable swapping on the xprt, when we do a 1->0 transition
we disable swapping on it.

This also allows us to be a bit more selective with the RPC_TASK_SWAPPER
flag. If non-swapper and swapper clnts are sharing a xprt, then we only
need to flag the tasks from the swapper clnt with that flag.
Acked-by: NMel Gorman <mgorman@suse.de>
Reported-by: NJerome Marchand <jmarchan@redhat.com>
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Reviewed-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

3c87ef6e

02 6月, 2015 1 次提交

SUNRPC: drop stale doc comments in xprtsock.c · 9300fdba

由 Stefan Hajnoczi 提交于 5月 06, 2015

Several functions have outdated arguments listed in the doc comments.
Drop documentation for arguments that no longer exist.
Signed-off-by: NStefan Hajnoczi <stefanha@redhat.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

9300fdba

28 5月, 2015 1 次提交

kernel/params: constify struct kernel_param_ops uses · 9c27847d

由 Luis R. Rodriguez 提交于 5月 27, 2015

Most code already uses consts for the struct kernel_param_ops,
sweep the kernel for the last offending stragglers. Other than
include/linux/moduleparam.h and kernel/params.c all other changes
were generated with the following Coccinelle SmPL patch. Merge
conflicts between trees can be handled with Coccinelle.

In the future git could get Coccinelle merge support to deal with
patch --> fail --> grammar --> Coccinelle --> new patch conflicts
automatically for us on patches where the grammar is available and
the patch is of high confidence. Consider this a feature request.

Test compiled on x86_64 against:

	* allnoconfig
	* allmodconfig
	* allyesconfig

@ const_found @
identifier ops;
@@

const struct kernel_param_ops ops = {
};

@ const_not_found depends on !const_found @
identifier ops;
@@

-struct kernel_param_ops ops = {
+const struct kernel_param_ops ops = {
};

Generated-by: Coccinelle SmPL
Cc: Rusty Russell <rusty@rustcorp.com.au>
Cc: Junio C Hamano <gitster@pobox.com>
Cc: Andrew Morton <akpm@linux-foundation.org>
Cc: Kees Cook <keescook@chromium.org>
Cc: Tejun Heo <tj@kernel.org>
Cc: Ingo Molnar <mingo@kernel.org>
Cc: cocci@systeme.lip6.fr
Cc: linux-kernel@vger.kernel.org
Signed-off-by: NLuis R. Rodriguez <mcgrof@suse.com>
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>

9c27847d

11 2月, 2015 1 次提交

SUNRPC: Cleanup to remove xs_tcp_close() · c627d31b

由 Trond Myklebust 提交于 2月 10, 2015

xs_tcp_close() is now just a call to xs_tcp_shutdown(), so remove it,
and replace the entry in xs_tcp_ops.
Suggested-by: NAnna Schumaker <anna.schumaker@netapp.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

c627d31b

10 2月, 2015 5 次提交

SUNRPC: Fix stupid typo in xs_sock_set_reuseport · 402e23b4

由 Trond Myklebust 提交于 2月 09, 2015

Yes, kernel_setsockopt() hates you for using a char argument.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

402e23b4

SUNRPC: Define xs_tcp_fin_timeout only if CONFIG_SUNRPC_DEBUG · 54c09874

由 Trond Myklebust 提交于 2月 09, 2015

Now that the linger code is gone, the xs_tcp_fin_timeout variable has
no real function. Keep it for now, since it is part of the /proc
interface, but only define it if that /proc interface is enabled.
Suggested-by: NAnna Schumaker <Anna.Schumaker@netapp.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

54c09874

SUNRPC: Handle connection reset more efficiently. · b70ae915

由 Trond Myklebust 提交于 2月 09, 2015

If the connection reset is due to an active call on our side, then
the state change is sometimes not reported. Catch those instances
using xs_error_report() instead.
Also remove the xs_tcp_shutdown() call in xs_tcp_send_request() as
the change in behaviour makes it redundant.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

b70ae915

T
SUNRPC: Remove the redundant XPRT_CONNECTION_CLOSE flag · 9e2b9f37
由 Trond Myklebust 提交于 2月 08, 2015
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
9e2b9f37

SUNRPC: Make xs_tcp_close() do a socket shutdown rather than a sock_release · caf4ccd4

由 Trond Myklebust 提交于 2月 09, 2015

Use of socket shutdown() means that we monitor the shutdown process
through the xs_tcp_state_change() callback, so it is preferable to
a full close in all cases unless we're destroying the transport.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

caf4ccd4

09 2月, 2015 10 次提交

SUNRPC: Ensure xs_tcp_shutdown() requests a full close of the connection · 0efeac26

由 Trond Myklebust 提交于 2月 09, 2015

The previous behaviour left the connection half-open in order to try
to scrape the last replies from the socket. Now that we have more reliable
reconnection, change the behaviour to close down the socket faster.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

0efeac26

T
SUNRPC: Cleanup to remove remaining uses of XPRT_CONNECTION_ABORT · 505936f5
由 Trond Myklebust 提交于 2月 08, 2015
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
505936f5

SUNRPC: Remove TCP socket linger code · 9cbc94fb

由 Trond Myklebust 提交于 2月 08, 2015

Now that we no longer use the partial shutdown code when closing the
socket, we no longer need to worry about the TCP linger2 state.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

9cbc94fb

SUNRPC: Remove TCP client connection reset hack · 4efdd92c

由 Trond Myklebust 提交于 2月 08, 2015

Instead we rely on SO_REUSEPORT to provide the reconnection semantics
that we need for NFSv2/v3.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

4efdd92c

SUNRPC: TCP/UDP always close the old socket before reconnecting · de84d890

由 Trond Myklebust 提交于 2月 08, 2015

It is not safe to call xs_reset_transport() from inside xs_udp_setup_socket()
or xs_tcp_setup_socket(), since they do not own the correct locks. Instead,
do it in xs_connect().
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

de84d890

SUNRPC: Add helpers to prevent socket create from racing · 718ba5b8

由 Trond Myklebust 提交于 2月 08, 2015

The socket lock is currently held by the task that is requesting the
connection be established. While that is efficient in the case where
the connection happens quickly, it is racy in the case where it doesn't.
What we really want is for the connect helper to be able to block access
to the socket while it is being set up.

This patch does so by arranging to transfer the socket lock from the
task that is requesting the connect attempt, and then releasing that
lock once everything is done.
This scheme also gives us automatic protection against collisions with
the RPC close code, so we can kill the cancel_delayed_work_sync()
call in xs_close().
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

718ba5b8

T
SUNRPC: Ensure xs_reset_transport() resets the close connection flags · 6cc7e908
由 Trond Myklebust 提交于 2月 08, 2015
```
Otherwise, we may end up looping.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
6cc7e908

SUNRPC: Do not clear the source port in xs_reset_transport · 76698b23

由 Trond Myklebust 提交于 2月 08, 2015

Now that we can reuse bound ports after a close, we never really want to
clear the transport's source port after it has been set. Doing so really
messes up the NFSv3 DRC on the server.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

76698b23

SUNRPC: Handle EADDRINUSE on connect · 3913c78c

由 Trond Myklebust 提交于 2月 08, 2015

Now that we're setting SO_REUSEPORT, we still need to handle the
case where a connect() is attempted, but the old socket is still
lingering.
Essentially, all we want to do here is handle the error by waiting
a few seconds and then retrying.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

3913c78c

SUNRPC: Set SO_REUSEPORT socket option for TCP connections · 4dda9c8a

由 Trond Myklebust 提交于 2月 08, 2015

When using TCP, we need the ability to reuse port numbers after
a disconnection, so that the NFSv3 server knows that we're the same
client. Currently we use a hack to work around the TCP socket's
TIME_WAIT: we send an RST instead of closing, which doesn't
always work...
The SO_REUSEPORT option added in Linux 3.9 allows us to bind multiple
TCP connections to the same source address+port combination, and thus
to use ordinary TCP close() instead of the current hack.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

4dda9c8a

25 11月, 2014 3 次提交

sunrpc: eliminate RPC_DEBUG · f895b252

由 Jeff Layton 提交于 11月 17, 2014

It's always set to whatever CONFIG_SUNRPC_DEBUG is, so just use that.
Signed-off-by: NJeff Layton <jlayton@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

f895b252

sunrpc: add tracepoints in xs_tcp_data_recv · 1a867a08

由 Jeff Layton 提交于 10月 28, 2014

Add tracepoints inside the main loop on xs_tcp_data_recv that allow
us to keep an eye on what's happening during each phase of it.
Signed-off-by: NJeff Layton <jlayton@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

1a867a08

sunrpc: add new tracepoints in xprt handling code · 3705ad64

由 Jeff Layton 提交于 10月 28, 2014

...so we can keep track of when calls are sent and replies received.
Signed-off-by: NJeff Layton <jlayton@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

3705ad64

25 9月, 2014 4 次提交

NFS/SUNRPC: Remove other deadlock-avoidance mechanisms in nfs_release_page() · 1aff5256

由 NeilBrown 提交于 9月 24, 2014

Now that nfs_release_page() doesn't block indefinitely, other deadlock
avoidance mechanisms aren't needed.
 - it doesn't hurt for kswapd to block occasionally.  If it doesn't
   want to block it would clear __GFP_WAIT.  The current_is_kswapd()
   was only added to avoid deadlocks and we have a new approach for
   that.
 - memory allocation in the SUNRPC layer can very rarely try to
   ->releasepage() a page it is trying to handle.  The deadlock
   is removed as nfs_release_page() doesn't block indefinitely.

So we don't need to set PF_FSTRANS for sunrpc network operations any
more.
Signed-off-by: NNeilBrown <neilb@suse.de>
Acked-by: NJeff Layton <jlayton@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

1aff5256

rpc: Add -EPERM processing for xs_udp_send_request() · 3dedbb5c

由 Jason Baron 提交于 9月 24, 2014

If an iptables drop rule is added for an nfs server, the client can end up in
a softlockup. Because of the way that xs_sendpages() is structured, the -EPERM
is ignored since the prior bits of the packet may have been successfully queued
and thus xs_sendpages() returns a non-zero value. Then, xs_udp_send_request()
thinks that because some bits were queued it should return -EAGAIN. We then try
the request again and again, resulting in cpu spinning. Reproducer:

1) open a file on the nfs server '/nfs/foo' (mounted using udp)
2) iptables -A OUTPUT -d <nfs server ip> -j DROP
3) write to /nfs/foo
4) close /nfs/foo
5) iptables -D OUTPUT -d <nfs server ip> -j DROP

The softlockup occurs in step 4 above.

The previous patch, allows xs_sendpages() to return both a sent count and
any error values that may have occurred. Thus, if we get an -EPERM, return
that to the higher level code.

With this patch in place we can successfully abort the above sequence and
avoid the softlockup.

I also tried the above test case on an nfs mount on tcp and although the system
does not softlockup, I still ended up with the 'hung_task' firing after 120
seconds, due to the i/o being stuck. The tcp case appears a bit harder to fix,
since -EPERM appears to get ignored much lower down in the stack and does not
propogate up to xs_sendpages(). This case is not quite as insidious as the
softlockup and it is not addressed here.
Reported-by: NYigong Lou <ylou@akamai.com>
Signed-off-by: NJason Baron <jbaron@akamai.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

3dedbb5c

rpc: return sent and err from xs_sendpages() · f279cd00

由 Jason Baron 提交于 9月 24, 2014

If an error is returned after the first bits of a packet have already been
successfully queued, xs_sendpages() will return a positive 'int' value
indicating success. Callers seem to treat this as -EAGAIN.

However, there are cases where its not a question of waiting for the write
queue to drain. For example, when there is an iptables rule dropping packets
to the destination, the lower level code can return -EPERM only after parts
of the packet have been successfully queued. In this case, we can end up
continuously retrying resulting in a kernel softlockup.

This patch is intended to make no changes in behavior but is in preparation for
subsequent patches that can make decisions based on both on the number of bytes
sent by xs_sendpages() and any errors that may have be returned.
Signed-off-by: NJason Baron <jbaron@akamai.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

f279cd00

SUNRPC: Don't wake tasks during connection abort · a743419f

由 Benjamin Coddington 提交于 9月 23, 2014

When aborting a connection to preserve source ports, don't wake the task in
xs_error_report. This allows tasks with RPC_TASK_SOFTCONN to succeed if the
connection needs to be re-established since it preserves the task's status
instead of setting it to the status of the aborting kernel_connect().

This may also avoid a potential conflict on the socket's lock.
Signed-off-by: NBenjamin Coddington <bcodding@redhat.com>
Cc: stable@vger.kernel.org # 3.14+
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

a743419f

11 9月, 2014 1 次提交

rpc: xs_bind - do not bind when requesting a random ephemeral port · 0f7a622c

由 Chris Perl 提交于 9月 05, 2014

When attempting to establish a local ephemeral endpoint for a TCP or UDP
socket, do not explicitly call bind, instead let it happen implicilty when the
socket is first used.

The main motivating factor for this change is when TCP runs out of unique
ephemeral ports (i.e.  cannot find any ephemeral ports which are not a part of
*any* TCP connection).  In this situation if you explicitly call bind, then the
call will fail with EADDRINUSE.  However, if you allow the allocation of an
ephemeral port to happen implicitly as part of connect (or other functions),
then ephemeral ports can be reused, so long as the combination of (local_ip,
local_port, remote_ip, remote_port) is unique for TCP sockets on the system.

This doesn't matter for UDP sockets, but it seemed easiest to treat TCP and UDP
sockets the same.

This can allow mount.nfs(8) to continue to function successfully, even in the
face of misbehaving applications which are creating a large number of TCP
connections.
Signed-off-by: NChris Perl <chris.perl@gmail.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

0f7a622c