提交 · 1e53d5bb8878dcbdbffde334ab89b1f57778b48c · openanolis / cloud-kernel

10 4月, 2015 1 次提交

net: Pass VLAN ID to rtnl_fdb_notify. · 1e53d5bb

由 Hubert Sokolowski 提交于 4月 09, 2015

When an FDB entry is added or deleted the information about VLAN
is not passed to listening applications like 'bridge monitor fdb'.
With this patch VLAN ID is passed if it was set in the original
netlink message.

Also remove an unused bdev variable.
Signed-off-by: NHubert Sokolowski <hubert.sokolowski@intel.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

1e53d5bb

09 4月, 2015 4 次提交

tcp: do not rearm rsk_timer on FastOpen requests · dd929c1b

由 Eric Dumazet 提交于 4月 08, 2015

FastOpen requests are not like other regular request sockets.

They do not yet use rsk_timer : tcp_fastopen_queue_check()
simply manually removes one expired request from fastopenq->rskq_rst
list.

Therefore, tcp_check_req() must not call mod_timer_pending(),
otherwise we crash because rsk_timer was not initialized.

Fixes: fa76ce73 ("inet: get rid of central tcp/dccp listener timer")
Signed-off-by: NEric Dumazet <edumazet@google.com>
Signed-off-by: NYuchung Cheng <ycheng@google.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

dd929c1b

netfilter: Fix switch statement warnings with recent gcc. · c1f86676

由 David Miller 提交于 4月 07, 2015

More recent GCC warns about two kinds of switch statement uses:

1) Switching on an enumeration, but not having an explicit case
   statement for all members of the enumeration.  To show the
   compiler this is intentional, we simply add a default case
   with nothing more than a break statement.

2) Switching on a boolean value.  I think this warning is dumb
   but nevertheless you get it wholesale with -Wswitch.

This patch cures all such warnings in netfilter.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
Acked-by: NPablo Neira Ayuso <pablo@netfilter.org>

c1f86676

ipv6: call iptunnel_xmit with NULL sock pointer if no tunnel sock is available · 1b112871

由 Hannes Frederic Sowa 提交于 4月 08, 2015

Fixes: 79b16aad ("udp_tunnel: Pass UDP socket down through udp_tunnel{, 6}_xmit_skb().")
Reported-by: NDavid S. Miller <davem@davemloft.net>
Signed-off-by: NHannes Frederic Sowa <hannes@stressinduktion.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

1b112871

ipv4: ip_tunnel: use net namespace from rtable not socket · 926a882f

由 Hannes Frederic Sowa 提交于 4月 08, 2015

The socket parameter might legally be NULL, thus sock_net is sometimes
causing a NULL pointer dereference. Using net_device pointer in dst_entry
is more reliable.

Fixes: b6a7719a ("ipv4: hash net ptr into fragmentation bucket selection")
Reported-by: NRick Jones <rick.jones2@hp.com>
Cc: Rick Jones <rick.jones2@hp.com>
Cc: David S. Miller <davem@davemloft.net>
Signed-off-by: NHannes Frederic Sowa <hannes@stressinduktion.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

926a882f

08 4月, 2015 21 次提交

netfilter: nf_tables: support optional userdata for set elements · 68e942e8

由 Patrick McHardy 提交于 4月 05, 2015

Add an userdata set extension and allow the user to attach arbitrary
data to set elements. This is intended to hold TLV encoded data like
comments or DNS annotations that have no meaning to the kernel.
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NPablo Neira Ayuso <pablo@netfilter.org>

68e942e8

netfilter: nf_tables: add support for dynamic set updates · 22fe54d5

由 Patrick McHardy 提交于 4月 05, 2015

Add a new "dynset" expression for dynamic set updates.

A new set op ->update() is added which, for non existant elements,
invokes an initialization callback and inserts the new element.
For both new or existing elements the extenstion pointer is returned
to the caller to optionally perform timer updates or other actions.

Element removal is not supported so far, however that seems to be a
rather exotic need and can be added later on.
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NPablo Neira Ayuso <pablo@netfilter.org>

22fe54d5

netfilter: nf_tables: support different set binding types · 11113e19

由 Patrick McHardy 提交于 4月 05, 2015

Currently a set binding is assumed to be related to a lookup and, in
case of maps, a data load.

In order to use bindings for set updates, the loop detection checks
must be restricted to map operations only. Add a flags member to the
binding struct to hold the set "action" flags such as NFT_SET_MAP,
and perform loop detection based on these.
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NPablo Neira Ayuso <pablo@netfilter.org>

11113e19

netfilter: nf_tables: prepare set element accounting for async updates · 3dd0673a

由 Patrick McHardy 提交于 4月 05, 2015

Use atomic operations for the element count to avoid races with async
updates.

To properly handle the transactional semantics during netlink updates,
deleted but not yet committed elements are accounted for seperately and
are treated as being already removed. This means for the duration of
a netlink transaction, the limit might be exceeded by the amount of
elements deleted. Set implementations must be prepared to handle this.
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NPablo Neira Ayuso <pablo@netfilter.org>

3dd0673a

netfilter: nf_tables: fix set selection when timeouts are requested · 4a8678ef

由 Patrick McHardy 提交于 4月 05, 2015

The NFT_SET_TIMEOUT flag is ignore in nft_select_set_ops, which may
lead to selection of a set implementation that doesn't actually
support timeouts.
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NPablo Neira Ayuso <pablo@netfilter.org>

4a8678ef

netfilter: bridge: make BRNF_PKT_TYPE flag a bool · a1e67951

由 Florian Westphal 提交于 4月 02, 2015

nf_bridge_info->mask is used for several things, for example to
remember if skb->pkt_type was set to OTHER_HOST.

For a bridge, OTHER_HOST is expected case. For ip forward its a non-starter
though -- routing expects PACKET_HOST.

Bridge netfilter thus changes OTHER_HOST to PACKET_HOST before hook
invocation and then un-does it after hook traversal.

This information is irrelevant outside of br_netfilter.

After this change, ->mask now only contains flags that need to be
known outside of br_netfilter in fast-path.

Future patch changes mask into a 2bit state field in sk_buff, so that
we can remove skb->nf_bridge pointer for good and consider all remaining
places that access nf_bridge info content a not-so fastpath.
Signed-off-by: NFlorian Westphal <fw@strlen.de>
Signed-off-by: NPablo Neira Ayuso <pablo@netfilter.org>

a1e67951

netfilter: bridge: start splitting mask into public/private chunks · 3eaf4025

由 Florian Westphal 提交于 4月 02, 2015

->mask is a bit info field that mixes various use cases.

In particular, we have flags that are mutually exlusive, and flags that
are only used within br_netfilter while others need to be exposed to
other parts of the kernel.

Remove BRNF_8021Q/PPPoE flags.  They're mutually exclusive and only
needed within br_netfilter context.
Signed-off-by: NFlorian Westphal <fw@strlen.de>
Signed-off-by: NPablo Neira Ayuso <pablo@netfilter.org>

3eaf4025

netfilter: bridge: add and use nf_bridge_info_get helper · 38330783

由 Florian Westphal 提交于 4月 02, 2015

Don't access skb->nf_bridge directly, this pointer will be removed soon.
Signed-off-by: NFlorian Westphal <fw@strlen.de>
Signed-off-by: NPablo Neira Ayuso <pablo@netfilter.org>

38330783

netfilter: physdev: use helpers · a99074ae

由 Florian Westphal 提交于 4月 02, 2015

Avoid skb->nf_bridge accesses where possible.
Signed-off-by: NFlorian Westphal <fw@strlen.de>
Signed-off-by: NPablo Neira Ayuso <pablo@netfilter.org>

a99074ae

netfilter: bridge: add helpers for fetching physin/outdev · c737b7c4

由 Florian Westphal 提交于 4月 02, 2015

right now we store this in the nf_bridge_info struct, accessible
via skb->nf_bridge.  This patch prepares removal of this pointer from skb:

Instead of using skb->nf_bridge->x, we use helpers to obtain the in/out
device (or ifindexes).

Followup patches to netfilter will then allow nf_bridge_info to be
obtained by a call into the br_netfilter core, rather than keeping a
pointer to it in sk_buff.
Signed-off-by: NFlorian Westphal <fw@strlen.de>
Signed-off-by: NPablo Neira Ayuso <pablo@netfilter.org>

c737b7c4

netfilter: bridge: don't use nf_bridge_info data to store mac header · e70deecb

由 Florian Westphal 提交于 4月 02, 2015

br_netfilter maintains an extra state, nf_bridge_info, which is attached
to skb via skb->nf_bridge pointer.

Amongst other things we use skb->nf_bridge->data to store the original
mac header for every processed skb.

This is required for ip refragmentation when using conntrack
on top of bridge, because ip_fragment doesn't copy it from original skb.

However there is no need anymore to do this unconditionally.

Move this to the one place where its needed -- when br_netfilter calls
ip_fragment().

Also switch to percpu storage for this so we can handle fragmenting
without accessing nf_bridge meta data.

Only user left is neigh resolution when DNAT is detected, to hold
the original source mac address (neigh resolution builds new mac header
using bridge mac), so rename ->data and reduce its size to whats needed.
Signed-off-by: NFlorian Westphal <fw@strlen.de>
Signed-off-by: NPablo Neira Ayuso <pablo@netfilter.org>

e70deecb

netfilter: x_tables: don't extract flow keys on early demuxed sks in socket match · d64d80a2

由 Daniel Borkmann 提交于 4月 02, 2015

Currently in xt_socket, we take advantage of early demuxed sockets
since commit 00028aa3 ("netfilter: xt_socket: use IP early demux")
in order to avoid a second socket lookup in the fast path, but we
only make partial use of this:

We still unnecessarily parse headers, extract proto, {s,d}addr and
{s,d}ports from the skb data, accessing possible conntrack information,
etc even though we were not even calling into the socket lookup via
xt_socket_get_sock_{v4,v6}() due to skb->sk hit, meaning those cycles
can be spared.

After this patch, we only proceed the slower, manual lookup path
when we have a skb->sk miss, thus time to match verdict for early
demuxed sockets will improve further, which might be i.e. interesting
for use cases such as mentioned in 681f130f ("netfilter: xt_socket:
add XT_SOCKET_NOWILDCARD flag").
Signed-off-by: NDaniel Borkmann <daniel@iogearbox.net>
Signed-off-by: NPablo Neira Ayuso <pablo@netfilter.org>

d64d80a2

net: remove extra newlines · 8bc0034c

由 Sheng Yong 提交于 4月 08, 2015

Signed-off-by: NSheng Yong <shengyong1@huawei.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

8bc0034c

tcp: RFC7413 option support for Fast Open client · 2646c831

由 Daniel Lee 提交于 4月 06, 2015

Fast Open has been using an experimental option with a magic number
(RFC6994). This patch makes the client by default use the RFC7413
option (34) to get and send Fast Open cookies.  This patch makes
the client solicit cookies from a given server first with the
RFC7413 option. If that fails to elicit a cookie, then it tries
the RFC6994 experimental option. If that also fails, it uses the
RFC7413 option on all subsequent connect attempts.  If the server
returns a Fast Open cookie then the client caches the form of the
option that successfully elicited a cookie, and uses that form on
later connects when it presents that cookie.

The idea is to gradually obsolete the use of experimental options as
the servers and clients upgrade, while keeping the interoperability
meanwhile.
Signed-off-by: NDaniel Lee <Longinus00@gmail.com>
Signed-off-by: NYuchung Cheng <ycheng@google.com>
Signed-off-by: NNeal Cardwell <ncardwell@google.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

2646c831

tcp: RFC7413 option support for Fast Open server · 7f9b838b

由 Daniel Lee 提交于 4月 06, 2015

Fast Open has been using the experimental option with a magic number
(RFC6994) to request and grant Fast Open cookies. This patch enables
the server to support the official IANA option 34 in RFC7413 in
addition.

The change has passed all existing Fast Open tests with both
old and new options at Google.
Signed-off-by: NDaniel Lee <Longinus00@gmail.com>
Signed-off-by: NYuchung Cheng <ycheng@google.com>
Signed-off-by: NNeal Cardwell <ncardwell@google.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

7f9b838b

netns: allow to dump netns ids · a143c40c

由 Nicolas Dichtel 提交于 4月 07, 2015

Which this patch, it's possible to dump the list of ids allocated for peer
netns.
Signed-off-by: NNicolas Dichtel <nicolas.dichtel@6wind.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

a143c40c

netns: notify netns id events · 9a963454

由 Nicolas Dichtel 提交于 4月 07, 2015

With this patch, netns ids that are created and deleted are advertised into the
group RTNLGRP_NSID.

Because callers of rtnl_net_notifyid() already know the id of the peer, there is
no need to call __peernet2id() in rtnl_net_fill().
Signed-off-by: NNicolas Dichtel <nicolas.dichtel@6wind.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

9a963454

netns: minor cleanup in rtnl_net_getid() · b111e4e1

由 Nicolas Dichtel 提交于 4月 07, 2015

No need to initialize err, it will be overridden by the value of nlmsg_parse().
Signed-off-by: NNicolas Dichtel <nicolas.dichtel@6wind.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

b111e4e1

udp_tunnel: Pass UDP socket down through udp_tunnel{, 6}_xmit_skb(). · 79b16aad

由 David Miller 提交于 4月 05, 2015

That was we can make sure the output path of ipv4/ipv6 operate on
the UDP socket rather than whatever random thing happens to be in
skb->sk.

Based upon a patch by Jiri Pirko.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
Acked-by: NHannes Frederic Sowa <hannes@stressinduktion.org>

79b16aad

netfilter: Pass socket pointer down through okfn(). · 7026b1dd

由 David Miller 提交于 4月 05, 2015

On the output paths in particular, we have to sometimes deal with two
socket contexts.  First, and usually skb->sk, is the local socket that
generated the frame.

And second, is potentially the socket used to control a tunneling
socket, such as one the encapsulates using UDP.

We do not want to disassociate skb->sk when encapsulating in order
to fix this, because that would break socket memory accounting.

The most extreme case where this can cause huge problems is an
AF_PACKET socket transmitting over a vxlan device.  We hit code
paths doing checks that assume they are dealing with an ipv4
socket, but are actually operating upon the AF_PACKET one.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

7026b1dd

netfilter: Add socket pointer to nf_hook_state. · 1c984f8a

由 David Miller 提交于 4月 05, 2015

It is currently always set to NULL, but nf_queue is adjusted to be
prepared for it being set to a real socket by taking and releasing a
reference to that socket when necessary.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

1c984f8a

07 4月, 2015 4 次提交

net: dsa: fix filling routing table from OF description · 30303813

由 Pavel Nakonechny 提交于 4月 05, 2015

According to description in 'include/net/dsa.h', in cascade switches
configurations where there are more than one interconnected devices,
'rtable' array in 'dsa_chip_data' structure is used to indicate which
port on this switch should be used to send packets to that are destined
for corresponding switch.

However, dsa_of_setup_routing_table() fills 'rtable' with port numbers
of the _target_ switch, but not current one.

This commit removes redundant devicetree parsing and adds needed port
number as a function argument. So dsa_of_setup_routing_table() now just
looks for target switch number by parsing parent of 'link' device node.

To remove possible misunderstandings with the way of determining target
switch number, a corresponding comment was added to the source code and
to the DSA device tree bindings documentation file.

This was tested on a custom board with two Marvell 88E6095 switches with
following corresponding routing tables: { -1, 10 } and { 8, -1 }.
Signed-off-by: NPavel Nakonechny <pavel.nakonechny@skitlab.ru>
Reviewed-by: NAndrew Lunn <andrew@lunn.ch>
Reviewed-by: NFlorian Fainelli <f.fainelli@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

30303813

l2tp: unregister l2tp_net_ops on failure path · 67e04c29

由 WANG Cong 提交于 4月 03, 2015

Signed-off-by: NCong Wang <xiyou.wangcong@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

67e04c29

tc: bpf: add checksum helpers · 91bc4822

由 Alexei Starovoitov 提交于 4月 01, 2015

Commit 608cd71a ("tc: bpf: generalize pedit action") has added the
possibility to mangle packet data to BPF programs in the tc pipeline.
This patch adds two helpers bpf_l3_csum_replace() and bpf_l4_csum_replace()
for fixing up the protocol checksums after the packet mangling.

It also adds 'flags' argument to bpf_skb_store_bytes() helper to avoid
unnecessary checksum recomputations when BPF programs adjusting l3/l4
checksums and documents all three helpers in uapi header.

Moreover, a sample program is added to show how BPF programs can make use
of the mangle and csum helpers.
Signed-off-by: NAlexei Starovoitov <ast@plumgrid.com>
Acked-by: NDaniel Borkmann <daniel@iogearbox.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

91bc4822

ipv6: protect skb->sk accesses from recursive dereference inside the stack · f60e5990

由 hannes@stressinduktion.org 提交于 4月 01, 2015

We should not consult skb->sk for output decisions in xmit recursion
levels > 0 in the stack. Otherwise local socket settings could influence
the result of e.g. tunnel encapsulation process.

ipv6 does not conform with this in three places:

1) ip6_fragment: we do consult ipv6_npinfo for frag_size

2) sk_mc_loop in ipv6 uses skb->sk and checks if we should
   loop the packet back to the local socket

3) ip6_skb_dst_mtu could query the settings from the user socket and
   force a wrong MTU

Furthermore:
In sk_mc_loop we could potentially land in WARN_ON(1) if we use a
PF_PACKET socket ontop of an IPv6-backed vxlan device.

Reuse xmit_recursion as we are currently only interested in protecting
tunnel devices.

Cc: Jiri Pirko <jiri@resnulli.us>
Signed-off-by: NHannes Frederic Sowa <hannes@stressinduktion.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

f60e5990

05 4月, 2015 9 次提交
- D
  netfilter: Pass nf_hook_state through arpt_do_table(). · b85c3dc9
  由 David S. Miller 提交于 4月 03, 2015
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  b85c3dc9
- D
  netfilter: Pass nf_hook_state through nft_set_pktinfo*(). · 073bfd56
  由 David S. Miller 提交于 4月 03, 2015
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  073bfd56
- D
  netfilter: Pass nf_hook_state through ip6t_do_table(). · 8f8a3715
  由 David S. Miller 提交于 4月 03, 2015
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  8f8a3715
- D
  netfilter: Pass nf_hook_state through nf_nat_ipv6_{in,out,fn,local_fn}(). · 8fe22382
  由 David S. Miller 提交于 4月 03, 2015
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  8fe22382
- D
  netfilter: Pass nf_hook_state through ipt_do_table(). · 1c491ba2
  由 David S. Miller 提交于 4月 03, 2015
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  1c491ba2
- D
  netfilter: Pass nf_hook_state through nf_nat_ipv4_{in,out,fn,local_fn}(). · d7cf4081
  由 David S. Miller 提交于 4月 03, 2015
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  d7cf4081
- D
  netfilter: Make nf_hookfn use nf_hook_state. · 238e54c9
  由 David S. Miller 提交于 4月 03, 2015
```
Pass the nf_hook_state all the way down into the hook
functions themselves.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  238e54c9
- D
  netfilter: Use nf_hook_state in nf_queue_entry. · 1d1de89b
  由 David S. Miller 提交于 4月 03, 2015
```
That way we don't have to reinstantiate another nf_hook_state
on the stack of the nf_reinject() path.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  1d1de89b
- D
  netfilter: Create and use nf_hook_state. · cfdfab31
  由 David S. Miller 提交于 4月 03, 2015
```
Instead of passing a large number of arguments down into the nf_hook()
entry points, create a structure which carries this state down through
the hook processing layers.

This makes is so that if we want to change the types or signatures of
any of these pieces of state, there are less places that need to be
changed.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  cfdfab31
04 4月, 2015 1 次提交

Bluetooth: Fix location of TX power field in LE advertising data · 38c8af60

由 Marcel Holtmann 提交于 4月 03, 2015

The TX power field in the LE advertising data should be placed last
since it needs to be possible to enable kernel controlled TX power,
but still allow for userspace provided flags field.
Signed-off-by: NMarcel Holtmann <marcel@holtmann.org>
Signed-off-by: NJohan Hedberg <johan.hedberg@intel.com>

38c8af60

openanolis / cloud-kernel 接近 2 年 前同步成功

openanolis / cloud-kernel
接近 2 年前同步成功