提交 · 25a6e6b84fba601eff7c28d30da8ad7cfbef0d43 · openeuler / raspberrypi-kernel

05 9月, 2013 1 次提交

ipv6: Don't depend on per socket memory for neighbour discovery messages · 25a6e6b8

由 Thomas Graf 提交于 9月 03, 2013

Allocating skbs when sending out neighbour discovery messages
currently uses sock_alloc_send_skb() based on a per net namespace
socket and thus share a socket wmem buffer space.

If a netdevice is temporarily unable to transmit due to carrier
loss or for other reasons, the queued up ndisc messages will cosnume
all of the wmem space and will thus prevent from any more skbs to
be allocated even for netdevices that are able to transmit packets.

The number of neighbour discovery messages sent is very limited,
use of alloc_skb() bypasses the socket wmem buffer size enforcement
while the manual call to skb_set_owner_w() maintains the socket
reference needed for the IPv6 output path.

This patch has orginally been posted by Eric Dumazet in a modified
form.
Signed-off-by: NThomas Graf <tgraf@suug.ch>
Cc: Eric Dumazet <eric.dumazet@gmail.com>
Cc: Hannes Frederic Sowa <hannes@stressinduktion.org>
Cc: Stephen Warren <swarren@wwwdotorg.org>
Cc: Fabio Estevam <festevam@gmail.com>
Tested-by: NFabio Estevam <fabio.estevam@freescale.com>
Tested-by: NStephen Warren <swarren@nvidia.com>
Acked-by: NHannes Frederic Sowa <hannes@stressinduktion.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

25a6e6b8

31 8月, 2013 1 次提交

Revert "ipv6: Don't depend on per socket memory for neighbour discovery messages" · 25ad6117

由 David S. Miller 提交于 8月 30, 2013

This reverts commit 1f324e38.

It seems to cause regressions, and in particular the output path
really depends upon there being a socket attached to skb->sk for
checks such as sk_mc_loop(skb->sk) for example.  See ip6_output_finish2().
Reported-by: NStephen Warren <swarren@wwwdotorg.org>
Reported-by: NFabio Estevam <festevam@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

25ad6117

30 8月, 2013 1 次提交

ipv6: Don't depend on per socket memory for neighbour discovery messages · 1f324e38

由 Thomas Graf 提交于 8月 28, 2013

Allocating skbs when sending out neighbour discovery messages
currently uses sock_alloc_send_skb() based on a per net namespace
socket and thus share a socket wmem buffer space.

If a netdevice is temporarily unable to transmit due to carrier
loss or for other reasons, the queued up ndisc messages will cosnume
all of the wmem space and will thus prevent from any more skbs to
be allocated even for netdevices that are able to transmit packets.

The number of neighbour discovery messages sent is very limited,
simply use alloc_skb() and don't depend on any socket wmem space any
longer.

This patch has orginally been posted by Eric Dumazet in a modified
form.
Signed-off-by: NThomas Graf <tgraf@suug.ch>
Cc: Eric Dumazet <eric.dumazet@gmail.com>
Acked-by: NHannes Frederic Sowa <hannes@stressinduktion.org>
Acked-by: NEric Dumazet <edumazet@google.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

1f324e38

23 8月, 2013 1 次提交

ipv6: handle Redirect ICMP Message with no Redirected Header option · c92a59ec

由 Duan Jiong 提交于 8月 22, 2013

rfc 4861 says the Redirected Header option is optional, so
the kernel should not drop the Redirect Message that has no
Redirected Header option. In this patch, the function
ip6_redirect_no_header() is introduced to deal with that
condition.
Signed-off-by: NDuan Jiong <duanj.fnst@cn.fujitsu.com>
Acked-by: NHannes Frederic Sowa <hannes@stressinduktion.org>

c92a59ec

02 8月, 2013 1 次提交

ipv6: prevent fib6_run_gc() contention · 2ac3ac8f

由 Michal Kubeček 提交于 8月 01, 2013

On a high-traffic router with many processors and many IPv6 dst
entries, soft lockup in fib6_run_gc() can occur when number of
entries reaches gc_thresh.

This happens because fib6_run_gc() uses fib6_gc_lock to allow
only one thread to run the garbage collector but ip6_dst_gc()
doesn't update net->ipv6.ip6_rt_last_gc until fib6_run_gc()
returns. On a system with many entries, this can take some time
so that in the meantime, other threads pass the tests in
ip6_dst_gc() (ip6_rt_last_gc is still not updated) and wait for
the lock. They then have to run the garbage collector one after
another which blocks them for quite long.

Resolve this by replacing special value ~0UL of expire parameter
to fib6_run_gc() by explicit "force" parameter to choose between
spin_lock_bh() and spin_trylock_bh() and call fib6_run_gc() with
force=false if gc_thresh is reached but not max_size.
Signed-off-by: NMichal Kubecek <mkubecek@suse.cz>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

2ac3ac8f

17 7月, 2013 1 次提交
- D
  ndisc: bool initializations should use true and false · f2f79cca
  由 Daniel Baluta 提交于 7月 13, 2013
```
Signed-off-by: NDaniel Baluta <dbaluta@ixiacom.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  f2f79cca
18 6月, 2013 1 次提交

ipv6: ndisc: fix ndisc_send_redirect writing to the wrong skb · 33be081a

由 Matthias Schiffer 提交于 5月 31, 2013

Since some refactoring in 5f5a0115, ndisc_send_redirect called
ndisc_fill_redirect_hdr_option on the wrong skb, leading to data corruption or
in the worst case a panic when the skb_put failed.
Signed-off-by: NMatthias Schiffer <mschiffer@universe-factory.net>
Reviewed-by: NCong Wang <xiyou.wangcong@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

33be081a

29 5月, 2013 2 次提交

ipv6: Correct comparisons and calculations using skb->tail and skb-transport_header · 29a3cad5

由 Simon Horman 提交于 5月 28, 2013

This corrects an regression introduced by "net: Use 16bits for *_headers
fields of struct skbuff" when NET_SKBUFF_DATA_USES_OFFSET is not set. In
that case skb->tail will be a pointer whereas skb->transport_header
will be an offset from head. This is corrected by using wrappers that
ensure that comparisons and calculations are always made using pointers.
Signed-off-by: NSimon Horman <horms@verge.net.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

29a3cad5

net: pass info struct via netdevice notifier · 351638e7

由 Jiri Pirko 提交于 5月 28, 2013

So far, only net_device * could be passed along with netdevice notifier
event. This patch provides a possibility to pass custom structure
able to provide info that event listener needs to know.
Signed-off-by: NJiri Pirko <jiri@resnulli.us>

v2->v3: fix typo on simeth
	shortened dev_getter
	shortened notifier_info struct name
v1->v2: fix notifier_call parameter in call_netdevice_notifier()
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

351638e7

09 3月, 2013 1 次提交

ipv6: ndisc: remove redundant check for !dev->addr_len · 80580d4b

由 Thomas Graf 提交于 3月 08, 2013

send_sllao is already initialized with the value of dev->addr_len

Cc: Hideaki YOSHIFUJI <yoshfuji@linux-ipv6.org>
Signed-off-by: NThomas Graf <tgraf@suug.ch>
Acked-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

80580d4b

22 1月, 2013 19 次提交

Y
ndisc: Use compound literals to build redirect message. · 4d5c152e
由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013
```
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
4d5c152e

ndisc: Break down ndisc_build_skb() and build message directly. · 1cb3fe51

由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013

Construct NS/NA/RS message directly using C99 compound literals.
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

1cb3fe51

ndisc: Break down __ndisc_send(). · b44b5f4a

由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013

Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

b44b5f4a

Y
ndisc: Fill in ICMPv6 checksum and IPv6 header in ndisc_send_skb(). · 7b3d9b06
由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013
```
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
7b3d9b06

ndisc: Use ndisc_send_skb() for redirect. · f4de84c6

由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013

Reuse dst if one is attached with skb.
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

f4de84c6

ndisc: Remove icmp6h argument from ndisc_send_skb(). · aa4bdd4b

由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013

skb_transport_header() (thus icmp6_hdr()) is available here,
use it.
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

aa4bdd4b

Y
ndisc: Make ndisc_fill_xxx_option() for sk_buff. · 5f5a0115
由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013
```
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
5f5a0115
Y
ndisc: Calculate message body length and option length separately. · 2ce13576
由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013
```
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
2ce13576
Y
ndisc: Reset skb->trasport_headner inside ndisc_alloc_send_skb(). · 5135e633
由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013
```
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
5135e633

ndisc: Defer building IPv6 header. · 527a150f

由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013

Build ICMPv6 message first and make buffer management easier;
we can use skb->len when filling checksum in ICMPv6 header,
and then build IP header with length field.
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

527a150f

ndisc: Remove dev argument for ndisc_send_skb(). · af9a9976

由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013

Since we have skb->dev, use it.
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

af9a9976

Y
ndisc: Set skb->dev and skb->protocol inside ndisc_alloc_skb(). · f382d03a
由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013
```
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
f382d03a
Y
ndisc: Simplify arguments for ip6_nd_hdr(). · c8d6c380
由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013
```
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
c8d6c380

ipv6: Unshare ip6_nd_hdr() and change return type to void. · 2576f17d

由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013

- move ip6_nd_hdr() to its users' source files.
  In net/ipv6/mcast.c, it will be called ip6_mc_hdr().
- make return type to void since this function never fails.
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

2576f17d

Y
ndisc: Introduce ndisc_alloc_skb() helper. · de09334b
由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013
```
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
de09334b
Y
ndisc: Introduce ndisc_fill_redirect_hdr_option(). · 9c86dafe
由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013
```
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
9c86dafe

ndisc: Use skb_linearize() instead of pskb_may_pull(skb, skb->len). · 6bce6b4e

由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013

Suggested by Eric Dumazet <edumazet@google.com>.
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

6bce6b4e

ndisc: Move ndisc_opt_addr_space() to include/net/ndisc.h. · c558e9fc

由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013

This also makes ndisc_opt_addr_data() and ndisc_fill_addr_option()
use ndisc_opt_addr_space().
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

c558e9fc

ndisc: Reduce number of arguments for ndisc_fill_addr_option(). · 315ff09d

由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 21, 2013

Add pointer to struct net_device (dev) and remove
data_len (= dev->addr_len) and addr_type (= dev->type).
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

315ff09d

21 1月, 2013 2 次提交
- Y
  ndisc: Make several arguments for ndisc_send_na() boolean. · fb568637
  由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 20, 2013
```
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  fb568637
- Y
  ipv6: Introduce ipv6_addr_is_solict_mult() to check Solicited Node Multicast Addresses. · ca97a644
  由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 20, 2013
```
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  ca97a644
19 1月, 2013 2 次提交

ndisc: Check NS message length before access. · 115b0aa6

由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 18, 2013

Check message length before accessing "target" field,
as we do for other types.
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

115b0aa6

ipv6: Remove unused neigh argument for icmp6_dst_alloc() and its callers. · 12fd84f4

由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 18, 2013

Because of rt->n removal, we do not need neigh argument any more.
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

12fd84f4

07 1月, 2013 1 次提交
- Y
  ndisc: Use struct rd_msg for redirect message. · 71bcdba0
  由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 05, 2013
```
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  71bcdba0
05 1月, 2013 1 次提交

ndisc: Remove unused space at tail of skb for ndisc messages. (TAKE 3) · b7dc8c39

由 YOSHIFUJI Hideaki / 吉藤英明提交于 1月 04, 2013

Currently, the size of skb allocated for NDISC is MAX_HEADER +
LL_RESERVED_SPACE(dev) + packet length + dev->needed_tailroom,
but only LL_RESERVED_SPACE(dev) bytes is "reserved" for headers.
As a result, the skb looks like this (after construction of the
message):

head       data                   tail                       end
+--------------------------------------------------------------+
+           |                      |          |                |
+--------------------------------------------------------------+
|<-hlen---->|<---ipv6 packet------>|<--tlen-->|<--MAX_HEADER-->|
    =LL_                               = dev
     RESERVED_                           ->needed_
     SPACE(dev)                            tailroom

As the name implies, "MAX_HEADER" is used for headers, and should
be "reserved" in prior to packet construction.  Or, if some space
is really required at the tail of ther skb, it should be
explicitly documented.

We have several option after construction of NDISC message:

Option 1:

head       data                   tail       end
+---------------------------------------------+
+           |                      |          |
+---------------------------------------------+
|<-hlen---->|<---ipv6 packet------>|<--tlen-->|
   =LL_                                = dev
    RESERVED_                           ->needed_
    SPACE(dev)                            tailroom

Option 2:

head            data                   tail       end
+--------------------------------------------------+
+                |                      |          |
+--------------------------------------------------+
|<--MAX_HEADER-->|<---ipv6 packet------>|<--tlen-->|
                                            = dev
                                             ->needed_
                                               tailroom

Option 3:

head                        data                   tail       end
+--------------------------------------------------------------+
+                |           |                      |          |
+--------------------------------------------------------------+
|<--MAX_HEADER-->|<-hlen---->|<---ipv6 packet------>|<--tlen-->|
                    =LL_                                = dev
                     RESERVED_                          ->needed_
                     SPACE(dev)                           tailroom

Our tunnel drivers try expanding headroom and the space for tunnel
encapsulation was not a mandatory space -- so we are not seeing
bugs here --, but just for optimization for performance critial
situations.

Since NDISC messages are not performance critical unlike TCP,
and as we know outgoing device, LL_RESERVED_SPACE(dev) should be
just enough for the device in most (if not all) cases:
  LL_RESERVED_SPACE(dev) <= LL_MAX_HEADER <= MAX_HEADER
Note that LL_RESERVED_SPACE(dev) is also enough for NDISC over
SIT (e.g., ISATAP).

So, I think Option 1 is just fine here.
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Acked-by: NEric Dumazet <edumazet@google.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

b7dc8c39

15 12月, 2012 1 次提交

ipv6: Change skb->data before using icmpv6_notify() to propagate redirect · 093d04d4

由 Duan Jiong 提交于 12月 14, 2012

In function ndisc_redirect_rcv(), the skb->data points to the transport
header, but function icmpv6_notify() need the skb->data points to the
inner IP packet. So before using icmpv6_notify() to propagate redirect,
change skb->data to point the inner IP packet that triggered the sending
of the Redirect, and introduce struct rd_msg to make it easy.
Signed-off-by: NDuan Jiong <djduanjiong@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

093d04d4

14 12月, 2012 1 次提交

ndisc: Fix padding error in link-layer address option. · 7bdc1b4a

由 YOSHIFUJI Hideaki / 吉藤英明提交于 12月 13, 2012

If a natural number n exists where 2 + data_len <= 8n < 2 + data_len + pad,
post padding is not initialized correctly.

(Un)fortunately, the only type that requires pad is Infiniband,
whose pad is 2 and data_len is 20, and this logical error has not
become obvious, but it is better to fix.

Note that ndisc_opt_addr_space() handles the situation described
above correctly.
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

7bdc1b4a

13 12月, 2012 1 次提交

ndisc: Unexport ndisc_{build,send}_skb(). · fd0ea7db

由 YOSHIFUJI Hideaki 提交于 12月 13, 2012

These symbols were exported for bonding device by commit 305d552a
("bonding: send IPv6 neighbor advertisement on failover").

It bacame obsolete by commit 7c899432 ("bonding, ipv4, ipv6, vlan: Handle
NETDEV_BONDING_FAILOVER like NETDEV_NOTIFY_PEERS") and removed by
commit 4f5762ec ("bonding: Remove obsolete source file 'bond_ipv6.c'").
Signed-off-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

fd0ea7db

02 12月, 2012 1 次提交

ipv6: unify logic evaluating inet6_dev's accept_ra property · aeaf6e9d

由 Shmulik Ladkani 提交于 11月 30, 2012

As of 026359bc [ipv6: Send ICMPv6 RSes only when RAs are accepted], the
logic determining whether to send Router Solicitations is identical
to the logic determining whether kernel accepts Router Advertisements.

However the condition itself is repeated in several code locations.

Unify it by introducing 'ipv6_accept_ra()' accessor.

Also, simplify the condition expression, making it more readable.
No semantic change.
Signed-off-by: NShmulik Ladkani <shmulik.ladkani@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

aeaf6e9d

14 11月, 2012 1 次提交

ipv6: add knob to send unsolicited ND on link-layer address change · 5cb04436

由 Hannes Frederic Sowa 提交于 11月 06, 2012

This patch introduces a new knob ndisc_notify. If enabled, the kernel
will transmit an unsolicited neighbour advertisement on link-layer address
change to update the neighbour tables of the corresponding hosts more quickly.

This is the equivalent to arp_notify in ipv4 world.
Signed-off-by: NHannes Frederic Sowa <hannes@stressinduktion.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

5cb04436