提交 · 8bc0034cf6951a107e0c75c2d10b17b57d681229 · openeuler / raspberrypi-kernel

08 4月, 2015 9 次提交

由 Sheng Yong 提交于 4月 08, 2015

Signed-off-by: NSheng Yong <shengyong1@huawei.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

8bc0034c

tcp: RFC7413 option support for Fast Open client · 2646c831

由 Daniel Lee 提交于 4月 06, 2015

Fast Open has been using an experimental option with a magic number
(RFC6994). This patch makes the client by default use the RFC7413
option (34) to get and send Fast Open cookies.  This patch makes
the client solicit cookies from a given server first with the
RFC7413 option. If that fails to elicit a cookie, then it tries
the RFC6994 experimental option. If that also fails, it uses the
RFC7413 option on all subsequent connect attempts.  If the server
returns a Fast Open cookie then the client caches the form of the
option that successfully elicited a cookie, and uses that form on
later connects when it presents that cookie.

The idea is to gradually obsolete the use of experimental options as
the servers and clients upgrade, while keeping the interoperability
meanwhile.
Signed-off-by: NDaniel Lee <Longinus00@gmail.com>
Signed-off-by: NYuchung Cheng <ycheng@google.com>
Signed-off-by: NNeal Cardwell <ncardwell@google.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

2646c831

tcp: RFC7413 option support for Fast Open server · 7f9b838b

由 Daniel Lee 提交于 4月 06, 2015

Fast Open has been using the experimental option with a magic number
(RFC6994) to request and grant Fast Open cookies. This patch enables
the server to support the official IANA option 34 in RFC7413 in
addition.

The change has passed all existing Fast Open tests with both
old and new options at Google.
Signed-off-by: NDaniel Lee <Longinus00@gmail.com>
Signed-off-by: NYuchung Cheng <ycheng@google.com>
Signed-off-by: NNeal Cardwell <ncardwell@google.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

7f9b838b

netns: allow to dump netns ids · a143c40c

由 Nicolas Dichtel 提交于 4月 07, 2015

Which this patch, it's possible to dump the list of ids allocated for peer
netns.
Signed-off-by: NNicolas Dichtel <nicolas.dichtel@6wind.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

a143c40c

netns: notify netns id events · 9a963454

由 Nicolas Dichtel 提交于 4月 07, 2015

With this patch, netns ids that are created and deleted are advertised into the
group RTNLGRP_NSID.

Because callers of rtnl_net_notifyid() already know the id of the peer, there is
no need to call __peernet2id() in rtnl_net_fill().
Signed-off-by: NNicolas Dichtel <nicolas.dichtel@6wind.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

9a963454

netns: minor cleanup in rtnl_net_getid() · b111e4e1

由 Nicolas Dichtel 提交于 4月 07, 2015

No need to initialize err, it will be overridden by the value of nlmsg_parse().
Signed-off-by: NNicolas Dichtel <nicolas.dichtel@6wind.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

b111e4e1

udp_tunnel: Pass UDP socket down through udp_tunnel{, 6}_xmit_skb(). · 79b16aad

由 David Miller 提交于 4月 05, 2015

That was we can make sure the output path of ipv4/ipv6 operate on
the UDP socket rather than whatever random thing happens to be in
skb->sk.

Based upon a patch by Jiri Pirko.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
Acked-by: NHannes Frederic Sowa <hannes@stressinduktion.org>

79b16aad

netfilter: Pass socket pointer down through okfn(). · 7026b1dd

由 David Miller 提交于 4月 05, 2015

On the output paths in particular, we have to sometimes deal with two
socket contexts.  First, and usually skb->sk, is the local socket that
generated the frame.

And second, is potentially the socket used to control a tunneling
socket, such as one the encapsulates using UDP.

We do not want to disassociate skb->sk when encapsulating in order
to fix this, because that would break socket memory accounting.

The most extreme case where this can cause huge problems is an
AF_PACKET socket transmitting over a vxlan device.  We hit code
paths doing checks that assume they are dealing with an ipv4
socket, but are actually operating upon the AF_PACKET one.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

7026b1dd

netfilter: Add socket pointer to nf_hook_state. · 1c984f8a

由 David Miller 提交于 4月 05, 2015

It is currently always set to NULL, but nf_queue is adjusted to be
prepared for it being set to a real socket by taking and releasing a
reference to that socket when necessary.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

1c984f8a

07 4月, 2015 4 次提交

net: dsa: fix filling routing table from OF description · 30303813

由 Pavel Nakonechny 提交于 4月 05, 2015

According to description in 'include/net/dsa.h', in cascade switches
configurations where there are more than one interconnected devices,
'rtable' array in 'dsa_chip_data' structure is used to indicate which
port on this switch should be used to send packets to that are destined
for corresponding switch.

However, dsa_of_setup_routing_table() fills 'rtable' with port numbers
of the _target_ switch, but not current one.

This commit removes redundant devicetree parsing and adds needed port
number as a function argument. So dsa_of_setup_routing_table() now just
looks for target switch number by parsing parent of 'link' device node.

To remove possible misunderstandings with the way of determining target
switch number, a corresponding comment was added to the source code and
to the DSA device tree bindings documentation file.

This was tested on a custom board with two Marvell 88E6095 switches with
following corresponding routing tables: { -1, 10 } and { 8, -1 }.
Signed-off-by: NPavel Nakonechny <pavel.nakonechny@skitlab.ru>
Reviewed-by: NAndrew Lunn <andrew@lunn.ch>
Reviewed-by: NFlorian Fainelli <f.fainelli@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

30303813

l2tp: unregister l2tp_net_ops on failure path · 67e04c29

由 WANG Cong 提交于 4月 03, 2015

Signed-off-by: NCong Wang <xiyou.wangcong@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

67e04c29

tc: bpf: add checksum helpers · 91bc4822

由 Alexei Starovoitov 提交于 4月 01, 2015

Commit 608cd71a ("tc: bpf: generalize pedit action") has added the
possibility to mangle packet data to BPF programs in the tc pipeline.
This patch adds two helpers bpf_l3_csum_replace() and bpf_l4_csum_replace()
for fixing up the protocol checksums after the packet mangling.

It also adds 'flags' argument to bpf_skb_store_bytes() helper to avoid
unnecessary checksum recomputations when BPF programs adjusting l3/l4
checksums and documents all three helpers in uapi header.

Moreover, a sample program is added to show how BPF programs can make use
of the mangle and csum helpers.
Signed-off-by: NAlexei Starovoitov <ast@plumgrid.com>
Acked-by: NDaniel Borkmann <daniel@iogearbox.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

91bc4822

ipv6: protect skb->sk accesses from recursive dereference inside the stack · f60e5990

由 hannes@stressinduktion.org 提交于 4月 01, 2015

We should not consult skb->sk for output decisions in xmit recursion
levels > 0 in the stack. Otherwise local socket settings could influence
the result of e.g. tunnel encapsulation process.

ipv6 does not conform with this in three places:

1) ip6_fragment: we do consult ipv6_npinfo for frag_size

2) sk_mc_loop in ipv6 uses skb->sk and checks if we should
   loop the packet back to the local socket

3) ip6_skb_dst_mtu could query the settings from the user socket and
   force a wrong MTU

Furthermore:
In sk_mc_loop we could potentially land in WARN_ON(1) if we use a
PF_PACKET socket ontop of an IPv6-backed vxlan device.

Reuse xmit_recursion as we are currently only interested in protecting
tunnel devices.

Cc: Jiri Pirko <jiri@resnulli.us>
Signed-off-by: NHannes Frederic Sowa <hannes@stressinduktion.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

f60e5990

05 4月, 2015 9 次提交
- D
  netfilter: Pass nf_hook_state through arpt_do_table(). · b85c3dc9
  由 David S. Miller 提交于 4月 03, 2015
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  b85c3dc9
- D
  netfilter: Pass nf_hook_state through nft_set_pktinfo*(). · 073bfd56
  由 David S. Miller 提交于 4月 03, 2015
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  073bfd56
- D
  netfilter: Pass nf_hook_state through ip6t_do_table(). · 8f8a3715
  由 David S. Miller 提交于 4月 03, 2015
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  8f8a3715
- D
  netfilter: Pass nf_hook_state through nf_nat_ipv6_{in,out,fn,local_fn}(). · 8fe22382
  由 David S. Miller 提交于 4月 03, 2015
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  8fe22382
- D
  netfilter: Pass nf_hook_state through ipt_do_table(). · 1c491ba2
  由 David S. Miller 提交于 4月 03, 2015
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  1c491ba2
- D
  netfilter: Pass nf_hook_state through nf_nat_ipv4_{in,out,fn,local_fn}(). · d7cf4081
  由 David S. Miller 提交于 4月 03, 2015
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  d7cf4081
- D
  netfilter: Make nf_hookfn use nf_hook_state. · 238e54c9
  由 David S. Miller 提交于 4月 03, 2015
```
Pass the nf_hook_state all the way down into the hook
functions themselves.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  238e54c9
- D
  netfilter: Use nf_hook_state in nf_queue_entry. · 1d1de89b
  由 David S. Miller 提交于 4月 03, 2015
```
That way we don't have to reinstantiate another nf_hook_state
on the stack of the nf_reinject() path.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  1d1de89b
- D
  netfilter: Create and use nf_hook_state. · cfdfab31
  由 David S. Miller 提交于 4月 03, 2015
```
Instead of passing a large number of arguments down into the nf_hook()
entry points, create a structure which carries this state down through
the hook processing layers.

This makes is so that if we want to change the types or signatures of
any of these pieces of state, there are less places that need to be
changed.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  cfdfab31
04 4月, 2015 12 次提交

Bluetooth: Fix location of TX power field in LE advertising data · 38c8af60

由 Marcel Holtmann 提交于 4月 03, 2015

The TX power field in the LE advertising data should be placed last
since it needs to be possible to enable kernel controlled TX power,
but still allow for userspace provided flags field.
Signed-off-by: NMarcel Holtmann <marcel@holtmann.org>
Signed-off-by: NJohan Hedberg <johan.hedberg@intel.com>

38c8af60

Bluetooth: hidp: Use BIT(x) instead of (1 << x) · fd6413d8

由 Marcel Holtmann 提交于 4月 03, 2015

Signed-off-by: NMarcel Holtmann <marcel@holtmann.org>
Signed-off-by: NJohan Hedberg <johan.hedberg@intel.com>

fd6413d8

Bluetooth: cmtp: Use BIT(x) instead of (1 << x) · b2ddeb11

由 Marcel Holtmann 提交于 4月 03, 2015

Signed-off-by: NMarcel Holtmann <marcel@holtmann.org>
Signed-off-by: NJohan Hedberg <johan.hedberg@intel.com>

b2ddeb11

Bluetooth: bnep: Handle BNEP connection setup request · 836a061b

由 Grzegorz Kolodziejczyk 提交于 4月 03, 2015

With this patch kernel will be able to handle setup request. This is
needed if we would like to handle control mesages with extension
headers. User space will be only resposible for reading setup data and
checking if scenario is conformance to specification (dst and src device
bnep role). In case of new user space, setup data must be leaved(peek
msg) on queue. New bnep session will be responsible for handling this
data.
Signed-off-by: NGrzegorz Kolodziejczyk <grzegorz.kolodziejczyk@tieto.com>
Signed-off-by: NMarcel Holtmann <marcel@holtmann.org>

836a061b

Bluetooth: bnep: Add support to extended headers of control frames · bf8b9a9c

由 Grzegorz Kolodziejczyk 提交于 4月 03, 2015

Handling extended headers of control frames is required BNEP
functionality. This patch refractor bnep rx frame handling function.
Extended header for control frames shouldn't be omitted as it was
previously done. Every control frame should be checked if it contains
extended header and then every extension should be parsed separately.
Signed-off-by: NGrzegorz Kolodziejczyk <grzegorz.kolodziejczyk@tieto.com>
Signed-off-by: NMarcel Holtmann <marcel@holtmann.org>

bf8b9a9c

Bluetooth: bnep: Add support for get bnep features via ioctl · 0477e2e8

由 Grzegorz Kolodziejczyk 提交于 4月 03, 2015

This is needed if user space wants to know supported bnep features
by kernel, e.g. if kernel supports sending response to bnep setup
control message. By now there is no possibility to know supported
features by kernel in case of bnep. Ioctls allows only to add connection,
delete connection, get connection list, get connection info. Adding
connection if it's possible (establishing network device connection) is
equivalent to starting bnep session. Bnep session handles data queue of
transmit, receive messages over bnep channel. It means that if we add
connection the received/transmitted data will be parsed immediately. In
case of get bnep features we want to know before session start, if we
should leave setup data on socket queue and let kernel to handle with it,
or in case of no setup handling support, if we should pull this message
and handle setup response within user space.
Signed-off-by: NGrzegorz Kolodziejczyk <grzegorz.kolodziejczyk@tieto.com>
Signed-off-by: NMarcel Holtmann <marcel@holtmann.org>

0477e2e8

ebpf: add skb->priority to offset map for usage in {cls, act}_bpf · bcad5718

由 Daniel Borkmann 提交于 4月 03, 2015

This adds the ability to read out the skb->priority from an eBPF
program, so that it can be taken into account from a tc filter
or action for the use-case where the priority is not being used
to directly override the filter classification in a qdisc, but
to tag traffic otherwise for the classifier; the priority can be
assigned from various places incl. user space, in future we may
also mangle it from an eBPF program.
Signed-off-by: NDaniel Borkmann <daniel@iogearbox.net>
Cc: Alexei Starovoitov <ast@plumgrid.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

bcad5718

Bluetooth: bnep: Return err value while sending cmd is not understood · e0fdbab1

由 Grzegorz Kolodziejczyk 提交于 4月 03, 2015

Send command not understood response should be verified if it was
successfully sent, like all send responses.
Signed-off-by: NGrzegorz Kolodziejczyk <grzegorz.kolodziejczyk@tieto.com>
Signed-off-by: NMarcel Holtmann <marcel@holtmann.org>

e0fdbab1

netns: don't allocate an id for dead netns · 576b7cd2

由 Nicolas Dichtel 提交于 4月 03, 2015

First, let's explain the problem.
Suppose you have an ipip interface that stands in the netns foo and its link
part in the netns bar (so the netns bar has an nsid into the netns foo).
Now, you remove the netns bar:
 - the bar nsid into the netns foo is removed
 - the netns exit method of ipip is called, thus our ipip iface is removed:
   => a netlink message is built in the netns foo to advertise this deletion
   => this netlink message requests an nsid for bar, thus a new nsid is
      allocated for bar and never removed.

This patch adds a check in peernet2id() so that an id cannot be allocated for
a netns which is currently destroyed.
Signed-off-by: NNicolas Dichtel <nicolas.dichtel@6wind.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

576b7cd2

Revert "netns: don't clear nsid too early on removal" · 6d458f5b

由 Nicolas Dichtel 提交于 4月 03, 2015

This reverts
commit 4217291e ("netns: don't clear nsid too early on removal").

This is not the right fix, it introduces races.
Signed-off-by: NNicolas Dichtel <nicolas.dichtel@6wind.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

6d458f5b

ipv4: coding style: comparison for inequality with NULL · 00db4124

由 Ian Morris 提交于 4月 03, 2015

The ipv4 code uses a mixture of coding styles. In some instances check
for non-NULL pointer is done as x != NULL and sometimes as x. x is
preferred according to checkpatch and this patch makes the code
consistent by adopting the latter form.

No changes detected by objdiff.
Signed-off-by: NIan Morris <ipm@chirality.org.uk>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

00db4124

ipv4: coding style: comparison for equality with NULL · 51456b29

由 Ian Morris 提交于 4月 03, 2015

The ipv4 code uses a mixture of coding styles. In some instances check
for NULL pointer is done as x == NULL and sometimes as !x. !x is
preferred according to checkpatch and this patch makes the code
consistent by adopting the latter form.

No changes detected by objdiff.
Signed-off-by: NIan Morris <ipm@chirality.org.uk>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

51456b29

03 4月, 2015 6 次提交

ip6mr: call del_timer_sync() in ip6mr_free_table() · 7ba0c47c

由 WANG Cong 提交于 3月 31, 2015

We need to wait for the flying timers, since we
are going to free the mrtable right after it.

Cc: Hannes Frederic Sowa <hannes@stressinduktion.org>
Signed-off-by: NCong Wang <xiyou.wangcong@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

7ba0c47c

net: move fib_rules_unregister() under rtnl lock · 419df12f

由 WANG Cong 提交于 3月 31, 2015

We have to hold rtnl lock for fib_rules_unregister()
otherwise the following race could happen:

fib_rules_unregister():	fib_nl_delrule():
...				...
...				ops = lookup_rules_ops();
list_del_rcu(&ops->list);
				list_for_each_entry(ops->rules) {
fib_rules_cleanup_ops(ops);	  ...
  list_del_rcu();		  list_del_rcu();
				}

Note, net->rules_mod_lock is actually not needed at all,
either upper layer netns code or rtnl lock guarantees
we are safe.

Cc: Alexander Duyck <alexander.h.duyck@redhat.com>
Cc: Thomas Graf <tgraf@suug.ch>
Signed-off-by: NCong Wang <xiyou.wangcong@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

419df12f

ipv4: take rtnl_lock and mark mrt table as freed on namespace cleanup · ed785309

由 WANG Cong 提交于 3月 31, 2015

This is the IPv4 part for commit 905a6f96
(ipv6: take rtnl_lock and mark mrt6 table as freed on namespace cleanup).

Cc: Hannes Frederic Sowa <hannes@stressinduktion.org>
Acked-by: NHannes Frederic Sowa <hannes@stressinduktion.org>
Signed-off-by: NCong Wang <xiyou.wangcong@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

ed785309

tcp: fix FRTO undo on cumulative ACK of SACKed range · 666b8051

由 Neal Cardwell 提交于 4月 01, 2015

On processing cumulative ACKs, the FRTO code was not checking the
SACKed bit, meaning that there could be a spurious FRTO undo on a
cumulative ACK of a previously SACKed skb.

The FRTO code should only consider a cumulative ACK to indicate that
an original/unretransmitted skb is newly ACKed if the skb was not yet
SACKed.

The effect of the spurious FRTO undo would typically be to make the
connection think that all previously-sent packets were in flight when
they really weren't, leading to a stall and an RTO.
Signed-off-by: NNeal Cardwell <ncardwell@google.com>
Signed-off-by: NYuchung Cheng <ycheng@google.com>
Fixes: e33099f9 ("tcp: implement RFC5682 F-RTO")
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

666b8051

tipc: simplify link mtu negotiation · ed193ece

由 Jon Paul Maloy 提交于 4月 02, 2015

When a link is being established, the two endpoints advertise their
respective interface MTU in the transmitted RESET and ACTIVATE messages.
If there is any difference, the lower of the two MTUs will be selected
for use by both endpoints.

However, as a remnant of earlier attempts to introduce TIPC level
routing. there also exists an MTU discovery mechanism. If an intermediate
node has a lower MTU than the two endpoints, they will discover this
through a bisectional approach, and finally adopt this MTU for common use.

Since there is no TIPC level routing, and probably never will be,
this mechanism doesn't make any sense, and only serves to make the
link level protocol unecessarily complex.

In this commit, we eliminate the MTU discovery algorithm,and fall back
to the simple MTU advertising approach. This change is fully backwards
compatible.
Reviewed-by: NYing Xue <ying.xue@windriver.com>
Signed-off-by: NJon Maloy <jon.maloy@ericsson.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

ed193ece

tipc: eliminate delayed link deletion at link failover · dff29b1a

由 Jon Paul Maloy 提交于 4月 02, 2015

When a bearer is disabled manually, all its links have to be reset
and deleted. However, if there is a remaining, parallel link ready
to take over a deleted link's traffic, we currently delay the delete
of the removed link until the failover procedure is finished. This
is because the remaining link needs to access state from the reset
link, such as the last received packet number, and any partially
reassembled buffer, in order to perform a successful failover.

In this commit, we do instead move the state data over to the new
link, so that it can fulfill the procedure autonomously, without
accessing any data on the old link. This means that we can now
proceed and delete all pertaining links immediately when a bearer
is disabled. This saves us from some unnecessary complexity in such
situations.

We also choose to change the confusing definitions CHANGEOVER_PROTOCOL,
ORIGINAL_MSG and DUPLICATE_MSG to the more descriptive TUNNEL_PROTOCOL,
FAILOVER_MSG and SYNCH_MSG respectively.
Reviewed-by: NYing Xue <ying.xue@windriver.com>
Signed-off-by: NJon Maloy <jon.maloy@ericsson.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

dff29b1a