Commits · 1c2bc7b948a2adee0d3e070f4ce14645efa0a2d2 · openanolis / cloud-kernel

14 Sep, 2016 13 commits

rxrpc: Use rxrpc_extract_addr_from_skb() rather than doing this manually · 1c2bc7b9

Committed by David Howells on Sep 13, 2016

There are two places that want to transmit a packet in response to one just
received and manually pick the address to reply to out of the sk_buff.
Make them use rxrpc_extract_addr_from_skb() instead so that IPv6 is handled
automatically.
Signed-off-by: David Howells <dhowells@redhat.com>

rxrpc: Don't specify protocol to when creating transport socket · aaa31cbc

Committed by David Howells on Sep 13, 2016

Pass 0 as the protocol argument when creating the transport socket rather
than IPPROTO_UDP.
Signed-off-by: David Howells <dhowells@redhat.com>

rxrpc: Create an address for sendmsg() to bind unbound socket with · cd5892c7

Committed by David Howells on Sep 13, 2016

Create an address for sendmsg() to bind an unbound socket with, rather than
using a completely blank address; otherwise the transport socket creation
will fail because it will try to use address family 0.

We use the address family specified in the protocol argument when the
AF_RXRPC socket was created and SOCK_DGRAM as the default.  For anything
else, bind() must be used.
Signed-off-by: David Howells <dhowells@redhat.com>
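
As a rough userspace illustration of the case this commit covers, a client that never calls bind() names the transport address family in the protocol argument of socket(). The socket() call form follows the kernel's rxrpc documentation; the helper name open_rxrpc_client() and the fallback definition of AF_RXRPC are assumptions made for this sketch.

```c
#include <sys/socket.h>
#include <netinet/in.h>

/* Fallback for libc headers that lack it; 33 is the value Linux uses. */
#ifndef AF_RXRPC
#define AF_RXRPC 33
#endif

/*
 * Hypothetical helper: open an AF_RXRPC socket that carries its traffic
 * over IPv4.  The third argument names the transport address family, so
 * a later sendmsg() on the still-unbound socket has a family from which
 * to build its implicit local address.
 *
 * Returns the file descriptor, or -1 if the socket cannot be created
 * (for example, when the kernel has no AF_RXRPC support).
 */
static int open_rxrpc_client(void)
{
	return socket(AF_RXRPC, SOCK_DGRAM, PF_INET);
}
```

A caller that needs a different transport socket type would, per the commit message, have to call bind() explicitly instead of relying on this default.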

rxrpc: Correctly initialise, limit and transmit call->rx_winsize · 75e42126

Committed by David Howells on Sep 13, 2016

call->rx_winsize should be initialised to the sysctl setting and the sysctl
setting should be limited to the maximum we want to permit. Further, we
need to place this in the ACK info instead of the sysctl setting.

Furthermore, discard the idea of accepting the subpackets of a jumbo packet
that lie beyond the receive window when the first packet of the jumbo is
within the window. Just discard the excess subpackets instead. This
allows the receive window to be opened up right to the buffer size less one
for the dead slot.
Signed-off-by: David Howells <dhowells@redhat.com>
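
The two rules in this commit message can be sketched as a small userspace model. Nothing here is taken from the rxrpc source: the buffer size, the function names and the decision to ignore sequence number wraparound are all assumptions made for illustration.

```c
/* Hypothetical sizes for this sketch; the real values live in the rxrpc code. */
#define MODEL_RX_BUFFER_SLOTS 64u
#define MODEL_RX_MAX_WINSIZE  (MODEL_RX_BUFFER_SLOTS - 1u) /* one dead slot */

/* Clamp a requested window (e.g. a sysctl value) to what the buffer can hold. */
static unsigned int model_clamp_rx_winsize(unsigned int requested)
{
	return requested > MODEL_RX_MAX_WINSIZE ? MODEL_RX_MAX_WINSIZE : requested;
}

/*
 * A jumbo packet carries subpackets with sequence numbers
 * first_seq .. first_seq + nr_subpackets - 1.  Return how many of them
 * to keep when window_top is the highest sequence number inside the
 * receive window.  Subpackets beyond window_top are simply discarded.
 * Sequence number wraparound is ignored in this model.
 */
static unsigned int model_jumbo_subpackets_to_keep(unsigned int first_seq,
						   unsigned int nr_subpackets,
						   unsigned int window_top)
{
	unsigned int room;

	if (first_seq > window_top)
		return 0;
	room = window_top - first_seq + 1;
	return nr_subpackets < room ? nr_subpackets : room;
}
```

Because excess subpackets are never stored, the window can safely be as large as the buffer minus the dead slot, which is the limit the clamp applies.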

rxrpc: Fix prealloc refcounting · 3432a757

Committed by David Howells on Sep 13, 2016

The preallocated call buffer holds a ref on the calls within that buffer.
The ref was being released in the wrong place - it worked okay for incoming
calls to the AFS cache manager service, but doesn't work right for incoming
calls to a userspace service.

Instead of releasing an extra ref on service calls in rxrpc_release_call(),
the ref needs to be released during the acceptance/rejection process.  To
this end:

 (1) The prealloc ref is now normally released during
     rxrpc_new_incoming_call().

 (2) For preallocated kernel API calls, the kernel API's ref needs to be
     released when the call is discarded on socket close.

 (3) We shouldn't take a second ref in rxrpc_accept_call().

 (4) rxrpc_recvmsg_new_call() needs to get a ref of its own when it adds
     the call to the to_be_accepted socket queue.

In doing (4) above, we would prefer not to put the call's refcount down to
0 as that entails doing cleanup in softirq context, but it's unlikely as
there are several refs held elsewhere, at least one of which must be put by
someone in process context calling rxrpc_release_call().  However, it's not
a problem if we do have to do that.
Signed-off-by: David Howells <dhowells@redhat.com>

rxrpc: Adjust the call ref tracepoint to show kernel API refs · cbd00891

Committed by David Howells on Sep 13, 2016

Adjust the call ref tracepoint to show references held on a call by the
kernel API separately as much as possible and add an additional trace at
the allocation point from the preallocation buffer for an incoming call.

Note that this doesn't show the allocation of a client call for the kernel
separately at the moment.
Signed-off-by: David Howells <dhowells@redhat.com>

rxrpc: Allow tx_winsize to grow in response to an ACK · 01fd0742

Committed by David Howells on Sep 13, 2016

Allow tx_winsize to grow when the ACK info packet shows a larger receive
window at the other end rather than only permitting it to shrink.
Signed-off-by: David Howells <dhowells@redhat.com>
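
The behavioural change can be contrasted in a few lines of C. This is a model only: the function names are invented, and the tx_limit parameter is an assumed stand-in for whatever upper bound the sender's own buffer imposes, which the commit message does not describe.

```c
/* Old behaviour (as described): the window could only ever shrink. */
static unsigned int model_tx_winsize_shrink_only(unsigned int current_winsize,
						 unsigned int advertised_rwind)
{
	return advertised_rwind < current_winsize ? advertised_rwind
						  : current_winsize;
}

/*
 * New behaviour (as described): follow the receive window advertised in
 * the ACK info in either direction.  tx_limit is a hypothetical cap on
 * how far the sender is willing to grow.
 */
static unsigned int model_tx_winsize_follow_ack(unsigned int advertised_rwind,
						unsigned int tx_limit)
{
	return advertised_rwind > tx_limit ? tx_limit : advertised_rwind;
}
```

With a current window of 8 and a peer advertising 32, the first function stays at 8 while the second moves to 32.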

rxrpc: Use skb->len not skb->data_len · 89a80ed4

Committed by David Howells on Sep 13, 2016

skb->len should be used rather than skb->data_len when referring to the
amount of data in a packet.  This will only cause a malfunction in the
following cases:

 (1) We receive a jumbo packet (validation and splitting both are wrong).

 (2) We see if there's extra ACK info in an ACK packet (we think it's not
     there and just ignore it).
Signed-off-by: David Howells <dhowells@redhat.com>
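
For readers unfamiliar with the two fields, the following userspace model shows how they relate. struct model_skb and both helpers are illustrative stand-ins, not kernel definitions; the subtraction mirrors what the kernel's skb_headlen() computes.

```c
/* Minimal stand-in for the two sk_buff length fields discussed above. */
struct model_skb {
	unsigned int len;      /* total bytes: linear area plus fragments */
	unsigned int data_len; /* bytes held in fragments only */
};

/* Bytes in the linear area, i.e. the total minus the fragment bytes. */
static unsigned int model_headlen(const struct model_skb *skb)
{
	return skb->len - skb->data_len;
}

/* Amount of data in the packet: always len, never data_len. */
static unsigned int model_packet_bytes(const struct model_skb *skb)
{
	return skb->len;
}
```

For a fully linear packet data_len is 0, so code that reads data_len as the packet size concludes that there is no data at all.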

rxrpc: Add missing unlock in rxrpc_call_accept() · b25de360

Committed by David Howells on Sep 13, 2016

Add a missing unlock in rxrpc_call_accept() in the path taken if there's no
call to wake up.
Signed-off-by: David Howells <dhowells@redhat.com>

rxrpc: Requeue call for recvmsg if more data · 33b603fd

Committed by David Howells on Sep 13, 2016

rxrpc_recvmsg() needs to make sure that the call it has just been
processing gets requeued for further attention if the buffer has been
filled and there's more data to be consumed. The softirq producer only
queues the call and wakes the socket if it fills the first slot in the
window, so userspace might end up sleeping forever otherwise, despite there
being data available.

This is not a problem provided the userspace buffer is big enough or it
empties the buffer completely before more data comes in.
Signed-off-by: David Howells <dhowells@redhat.com>
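
The requeue condition can be modelled in userspace as follows. The structure and function are hypothetical and reduce the call to a byte counter; they only illustrate when a consumer has to put the call back on the queue itself because the producer will not signal again.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical model of a call with some bytes waiting to be read. */
struct model_call {
	size_t bytes_ready;
};

/*
 * Consume up to buf_size bytes (modelled by subtraction) and report
 * whether the call must be requeued: the user buffer filled up while
 * data was still pending, and no new wakeup is guaranteed to arrive.
 */
static bool model_recv_needs_requeue(struct model_call *call, size_t buf_size,
				     size_t *copied)
{
	size_t n = call->bytes_ready < buf_size ? call->bytes_ready : buf_size;

	call->bytes_ready -= n;
	*copied = n;
	return call->bytes_ready > 0;
}
```

When the buffer is large enough to drain the call, the function returns false, which matches the commit message's remark that the bug is invisible in that case.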

rxrpc: The IDLE ACK packet should use rxrpc_idle_ack_delay · 91c2c7b6

Committed by David Howells on Sep 13, 2016

The IDLE ACK packet should use the rxrpc_idle_ack_delay setting when the
timer is set for it.
Signed-off-by: David Howells <dhowells@redhat.com>

rxrpc: Add missing wakeup on Tx window rotation · bc4abfcf

Committed by David Howells on Sep 13, 2016

We need to wake up the sender when Tx window rotation due to an incoming
ACK makes space in the buffer; otherwise the sender is liable to just hang
endlessly.

This problem isn't noticeable if the Tx phase transfers no more than will
fit in a single window or the Tx window rotates fast enough that it doesn't
get full.
Signed-off-by: David Howells <dhowells@redhat.com>
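
A small model of the window bookkeeping shows where the wakeup belongs. The structure, field names and functions are assumptions for this sketch and ignore sequence number wraparound; the real code tracks considerably more state.

```c
#include <stdbool.h>

/* Hypothetical transmit window state. */
struct model_tx_window {
	unsigned int hard_ack; /* highest sequence number acked and rotated out */
	unsigned int top;      /* highest sequence number queued so far */
	unsigned int winsize;  /* maximum number of unacked packets */
};

/* The sender has to sleep when the window holds winsize unacked packets. */
static bool model_tx_window_full(const struct model_tx_window *w)
{
	return w->top - w->hard_ack >= w->winsize;
}

/*
 * Rotate the window up to acked_to in response to an incoming ACK.
 * Returns true when at least one slot was freed, which is the point at
 * which a sleeping sender needs to be woken.
 */
static bool model_rotate_tx_window(struct model_tx_window *w,
				   unsigned int acked_to)
{
	if (acked_to <= w->hard_ack || acked_to > w->top)
		return false; /* stale or out-of-range ACK: nothing freed */
	w->hard_ack = acked_to;
	return true;
}
```

If the caller ignores the return value, a sender that went to sleep on a full window is never told that space has appeared, which is the hang the commit describes.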

rxrpc: Make sure we initialise the peer hash key · 08a39685

Committed by David Howells on Sep 13, 2016

Peer records created for incoming connections weren't getting their hash
key set.  This meant that incoming calls wouldn't see more than one DATA
packet - which is not a problem for AFS CM calls with small request data
blobs.
Signed-off-by: David Howells <dhowells@redhat.com>

13 Sep, 2016 2 commits

tipc: fix possible memory leak in tipc_udp_enable() · c20cb811

Committed by Wei Yongjun on Sep 10, 2016

'ub' is malloced in tipc_udp_enable() and should be freed before
leaving from the error handling cases, otherwise it will cause
memory leak.

Fixes: ba5aa84a ("tipc: split UDP nl address parsing")
Signed-off-by: Wei Yongjun <weiyongjun1@huawei.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

net: bridge: add helper to call /sbin/bridge-stp · 30843315

Committed by Vivien Didelot on Sep 08, 2016

If /sbin/bridge-stp is available on the system, bridge tries to execute
it instead of the kernel implementation when starting/stopping STP.

If anything goes wrong with /sbin/bridge-stp, bridge silently falls back
to kernel STP, making it hard to debug userspace STP.

This patch adds a br_stp_call_user helper to start/stop userspace STP
and debug errors from the program: abnormal exit status is stored in the
lower byte and normal exit status is stored in higher byte.

Below is a simple example on a kernel with dynamic debug enabled:

    # ln -s /bin/false /sbin/bridge-stp
    # brctl stp br0 on
    br0: failed to start userspace STP (256)
    # dmesg
    br0: /sbin/bridge-stp exited with code 1
    br0: failed to start userspace STP (256)
    br0: using kernel STP
Signed-off-by: Vivien Didelot <vivien.didelot@savoirfairelinux.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
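
The 256 printed in the example log is a raw status word with the exit code in its higher byte. A minimal sketch of decoding such a value in userspace with the standard wait-status macros follows; the helper name model_exit_code() is an assumption of this example and is not part of the patch.

```c
#include <sys/wait.h>

/*
 * Decode a wait()-style status word such as the 256 in the log above.
 * Returns the exit code for a normal exit, or -1 if the program did not
 * exit normally (for example, it was killed by a signal).
 */
static int model_exit_code(int status)
{
	if (WIFEXITED(status))
		return WEXITSTATUS(status);
	return -1;
}
```

Passing 256 yields 1, which agrees with the "exited with code 1" line that /bin/false produces in the log.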

11 Sep, 2016 21 commits

net: ipv6: Remove l3mdev_get_saddr6 · 8a966fc0

Committed by David Ahern on Sep 10, 2016

No longer needed
Signed-off-by: David Ahern <dsa@cumulusnetworks.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

net: ipv4: Remove l3mdev_get_saddr · d66f6c0a

Committed by David Ahern on Sep 10, 2016

No longer needed
Signed-off-by: David Ahern <dsa@cumulusnetworks.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

net: l3mdev: remove redundant calls · e0d56fdd

Committed by David Ahern on Sep 10, 2016

A previous patch added l3mdev flow update making these hooks
redundant. Remove them.
Signed-off-by: David Ahern <dsa@cumulusnetworks.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

net: vrf: Flip IPv6 output path from FIB lookup hook to out hook · 4c1feac5

Committed by David Ahern on Sep 10, 2016

Flip the IPv6 output path to use the l3mdev tx out hook. The VRF dst
is not returned on the first FIB lookup. Instead, the dst on the
skb is switched at the beginning of the IPv6 output processing to
send the packet to the VRF driver on xmit.

Link scope addresses (linklocal and multicast) need special handling:
specifically, the oif in the flow struct cannot be changed because we
want the lookup tied to the enslaved interface, i.e., the source address
and the returned route MUST point to the interface scope passed in.
Convert the existing vrf_get_rt6_dst to handle only link scope addresses.
Signed-off-by: David Ahern <dsa@cumulusnetworks.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

net: vrf: Flip IPv4 output path from FIB lookup hook to out hook · ebfc102c

Committed by David Ahern on Sep 10, 2016

Flip the IPv4 output path to use the l3mdev tx out hook. The VRF dst
is not returned on the first FIB lookup. Instead, the dst on the
skb is switched at the beginning of the IPv4 output processing to
send the packet to the VRF driver on xmit.
Signed-off-by: David Ahern <dsa@cumulusnetworks.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

net: l3mdev: Allow the l3mdev to be a loopback · 5f02ce24

Committed by David Ahern on Sep 10, 2016

Allow an L3 master device to act as the loopback for that L3 domain.
For IPv4 the device can also have the address 127.0.0.1.
Signed-off-by: David Ahern <dsa@cumulusnetworks.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

net: l3mdev: Add hook to output path · a8e3e1a9

Committed by David Ahern on Sep 10, 2016

This patch adds the infrastructure to the output path to pass an skb
to an l3mdev device if it has a hook registered. This is the Tx parallel
to l3mdev_ip{6}_rcv in the receive path and is the basis for removing
the existing hook that returns the vrf dst on the fib lookup.
Signed-off-by: David Ahern <dsa@cumulusnetworks.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

net: flow: Add l3mdev flow update · 9ee0034b

Committed by David Ahern on Sep 10, 2016

Add l3mdev hook to set FLOWI_FLAG_SKIP_NH_OIF flag and update oif/iif
in flow struct if its oif or iif points to a device enslaved to an L3
Master device. Only 1 needs to be converted to match the l3mdev FIB
rule. This moves the flow adjustment for l3mdev to a single point
catching all lookups. It is redundant for existing hooks (those are
removed in later patches) but is needed for missed lookups such as
PMTU updates.
Signed-off-by: David Ahern <dsa@cumulusnetworks.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
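
The flow adjustment described in this commit can be sketched as a userspace model. All types, names and the lookup table are hypothetical, the flag value is a stand-in for FLOWI_FLAG_SKIP_NH_OIF, and checking oif before iif is an assumption of this sketch rather than something the message states.

```c
#include <stddef.h>

#define MODEL_FLAG_SKIP_NH_OIF 0x1u /* stand-in for FLOWI_FLAG_SKIP_NH_OIF */

/* Hypothetical reduction of the flow struct to the fields involved. */
struct model_flow {
	int oif;
	int iif;
	unsigned int flags;
};

/* Hypothetical enslavement table: device ifindex -> L3 master ifindex. */
struct model_slave {
	int ifindex;
	int master_ifindex;
};

/* Return the master's ifindex, or 0 if the device is not enslaved. */
static int model_master_of(const struct model_slave *tbl, size_t n, int ifindex)
{
	for (size_t i = 0; i < n; i++)
		if (tbl[i].ifindex == ifindex)
			return tbl[i].master_ifindex;
	return 0;
}

/* Point the flow at the L3 master device and set the skip flag. */
static void model_l3mdev_update_flow(struct model_flow *fl,
				     const struct model_slave *tbl, size_t n)
{
	int master;

	if (fl->oif && (master = model_master_of(tbl, n, fl->oif)) != 0) {
		fl->oif = master;
		fl->flags |= MODEL_FLAG_SKIP_NH_OIF;
		return; /* only one of oif/iif needs converting */
	}
	if (fl->iif && (master = model_master_of(tbl, n, fl->iif)) != 0) {
		fl->iif = master;
		fl->flags |= MODEL_FLAG_SKIP_NH_OIF;
	}
}
```

Doing this in one place means every lookup that builds a flow struct gets the same treatment, including paths such as PMTU updates that the older per-hook approach missed.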

tcp: better use ooo_last_skb in tcp_data_queue_ofo() · 2594a2a9

Committed by Eric Dumazet on Sep 09, 2016

Willem noticed that we could avoid an rbtree lookup if the
attempt to coalesce incoming skb to the last skb failed
for some reason.

Since most ooo additions are at the tail, this is definitely
worth adding a test and fast path.
Suggested-by: Willem de Bruijn <willemb@google.com>
Signed-off-by: Eric Dumazet <edumazet@google.com>
Cc: Yaogong Wang <wygivan@google.com>
Cc: Yuchung Cheng <ycheng@google.com>
Cc: Neal Cardwell <ncardwell@google.com>
Cc: Ilpo Järvinen <ilpo.jarvinen@helsinki.fi>
Signed-off-by: David S. Miller <davem@davemloft.net>

openvswitch: use alias for genetlink family names · ed227099

Committed by Thadeu Lima de Souza Cascardo on Sep 09, 2016

When userspace tries to create datapaths and the module is not loaded,
it will simply fail. With this patch, the module will be automatically
loaded.
Signed-off-by: Thadeu Lima de Souza Cascardo <cascardo@redhat.com>
Acked-by: Pravin B Shelar <pshelar@ovn.org>
Signed-off-by: David S. Miller <davem@davemloft.net>

xfrm: use IS_ENABLED() instead of checking for built-in or module · 65b323e2

Committed by Javier Martinez Canillas on Sep 09, 2016