提交 · acd6e00b8e4db542cb6bc9ddfbb4e18bbe29ce4d · openeuler / raspberrypi-kernel

18 8月, 2006 2 次提交

[MCAST]: Fix filter leak on device removal. · acd6e00b

由 David L Stevens 提交于 8月 17, 2006

This fixes source filter leakage when a device is removed and a
process leaves the group thereafter.

This also includes corresponding fixes for IPv6 multicast source
filters on device removal.
Signed-off-by: NDavid L Stevens <dlstevens@us.ibm.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

acd6e00b

[IPV4]: Possible leak of multicast source filter sctructure · bb699cbc

由 Michal Ruzicka 提交于 8月 15, 2006

There is a leak of a socket's multicast source filter list structure
on closing a socket with a multicast source filter set on an interface
that does not exist any more.
Signed-off-by: NMichal Ruzicka <michal.ruzicka@comstar.cz>
Acked-by: NDavid L Stevens <dlstevens@us.ibm.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

bb699cbc

14 8月, 2006 5 次提交

[INET]: Use pskb_trim_unique when trimming paged unique skbs · e9fa4f7b

由 Herbert Xu 提交于 8月 13, 2006

The IPv4/IPv6 datagram output path was using skb_trim to trim paged
packets because they know that the packet has not been cloned yet
(since the packet hasn't been given to anything else in the system).

This broke because skb_trim no longer allows paged packets to be
trimmed.  Paged packets must be given to one of the pskb_trim functions
instead.

This patch adds a new pskb_trim_unique function to cover the IPv4/IPv6
datagram output path scenario and replaces the corresponding skb_trim
calls with it.
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

e9fa4f7b

[NETFILTER]: ulog: fix panic on SMP kernels · dcb7cd97

由 Mark Huang 提交于 8月 13, 2006

Fix kernel panic on various SMP machines. The culprit is a null
ub->skb in ulog_send(). If ulog_timer() has already been scheduled on
one CPU and is spinning on the lock, and ipt_ulog_packet() flushes the
queue on another CPU by calling ulog_send() right before it exits,
there will be no skbuff when ulog_timer() acquires the lock and calls
ulog_send(). Cancelling the timer in ulog_send() doesn't help because
it has already been scheduled and is running on the first CPU.

Similar problem exists in ebt_ulog.c and nfnetlink_log.c.
Signed-off-by: NMark Huang <mlhuang@cs.princeton.edu>
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

dcb7cd97

[NETFILTER]: {arp,ip,ip6}_tables: proper error recovery in init path · 0eff66e6

由 Patrick McHardy 提交于 8月 13, 2006

Neither of {arp,ip,ip6}_tables cleans up behind itself when something goes
wrong during initialization.

Noticed by Rennie deGraaf <degraaf@cpsc.ucalgary.ca>
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

0eff66e6

[NETFILTER]: xt_hashlimit: fix limit off-by-one · 1c7628bd

由 Patrick McHardy 提交于 8月 13, 2006

Hashlimit doesn't account for the first packet, which is inconsistent
with the limit match.

Reported by ryan.castellucci@gmail.com, netfilter bugzilla #500.
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

1c7628bd

[TCP]: Fix botched memory leak fix to tcpprobe_read(). · 18b6fe64

由 David S. Miller 提交于 8月 13, 2006

Somehow I clobbered James's original fix and only my
subsequent compiler warning change went in for that
changeset.

Get the real fix in there.

Noticed by Jesper Juhl.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

18b6fe64

08 8月, 2006 2 次提交

[TCP]: SNMPv2 tcpOutSegs counter error · bd37a088

由 Wei Yongjun 提交于 8月 07, 2006

Do not count retransmitted segments.
Signed-off-by: NWei Yongjun <yjwei@nanjing-fnst.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

bd37a088

[IPV4]: Limit rt cache size properly. · 8d1502de

由 Kirill Korotaev 提交于 8月 07, 2006

From: Kirill Korotaev <dev@sw.ru>

During OpenVZ stress testing we found that UDP traffic with random src
can generate too much excessive rt hash growing leading finally to OOM
and kernel panics.

It was found that for 4GB i686 system (having 1048576 total pages and
  225280 normal zone pages) kernel allocates the following route hash:
syslog: IP route cache hash table entries: 262144 (order: 8, 1048576
bytes) => ip_rt_max_size = 4194304 entries, i.e.  max rt size is
4194304 * 256b = 1Gb of RAM > normal_zone

Attached the patch which removes HASH_HIGHMEM flag from
alloc_large_system_hash() call.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

8d1502de

05 8月, 2006 1 次提交

[TCP]: Fixes IW > 2 cases when TCP is application limited · d254bcdb

由 Ilpo Järvinen 提交于 8月 04, 2006

Whenever a transfer is application limited, we are allowed at least
initial window worth of data per window unless cwnd is previously
less than that.
Signed-off-by: NIlpo Järvinen <ilpo.jarvinen@helsinki.fi>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

d254bcdb

03 8月, 2006 9 次提交

[NET]: Fix more per-cpu typos · 29bbd72d

由 Alexey Dobriyan 提交于 8月 02, 2006

Signed-off-by: NAlexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

29bbd72d

[AF_UNIX]: Kernel memory leak fix for af_unix datagram getpeersec patch · dc49c1f9

由 Catherine Zhang 提交于 8月 02, 2006

From: Catherine Zhang <cxzhang@watson.ibm.com>

This patch implements a cleaner fix for the memory leak problem of the
original unix datagram getpeersec patch.  Instead of creating a
security context each time a unix datagram is sent, we only create the
security context when the receiver requests it.

This new design requires modification of the current
unix_getsecpeer_dgram LSM hook and addition of two new hooks, namely,
secid_to_secctx and release_secctx.  The former retrieves the security
context and the latter releases it.  A hook is required for releasing
the security context because it is up to the security module to decide
how that's done.  In the case of Selinux, it's a simple kfree
operation.
Acked-by: NStephen Smalley <sds@tycho.nsa.gov>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

dc49c1f9

[IPV6]: SNMPv2 "ipv6IfStatsOutFragCreates" counter error · dafee490

由 Wei Dong 提交于 8月 02, 2006

  When I tested linux kernel 2.6.71.7 about statistics
"ipv6IfStatsOutFragCreates", and found that it couldn't increase
correctly. The criteria is RFC 2465:

  ipv6IfStatsOutFragCreates OBJECT-TYPE
      SYNTAX      Counter32
      MAX-ACCESS  read-only
      STATUS      current
      DESCRIPTION
         "The number of output datagram fragments that have
         been generated as a result of fragmentation at
         this output interface."
      ::= { ipv6IfStatsEntry 15 }

I think there are two issues in Linux kernel. 
1st:
RFC2465 specifies the counter is "The number of output datagram
fragments...". I think increasing this counter after output a fragment
successfully is better. And it should not be increased even though a
fragment is created but failed to output.

2nd:
If we send a big ICMP/ICMPv6 echo request to a host, and receive
ICMP/ICMPv6 echo reply consisted of some fragments. As we know that in
Linux kernel first fragmentation occurs in ICMP layer(maybe saying
transport layer is better), but this is not the "real"
fragmentation,just do some "pre-fragment" -- allocate space for date,
and form a frag_list, etc. The "real" fragmentation happens in IP layer
-- set offset and MF flag and so on. So I think in "fast path" for
ip_fragment/ip6_fragment, if we send a fragment which "pre-fragment" by
upper layer we should also increase "ipv6IfStatsOutFragCreates".
Signed-off-by: NWei Dong <weid@nanjing-fnst.com>
Acked-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

dafee490

[NETFILTER]: xt_hashlimit/xt_string: missing string validation · 3ab72088

由 Patrick McHardy 提交于 7月 31, 2006

The hashlimit table name and the textsearch algorithm need to be
terminated, the textsearch pattern length must not exceed the
maximum size.
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

3ab72088

[NETFILTER]: SIP helper: expect RTP streams in both directions · b10866fd

由 Patrick McHardy 提交于 7月 31, 2006

Since we don't know in which direction the first packet will arrive, we
need to create one expectation for each direction, which is currently
prevented by max_expected beeing set to 1.
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

b10866fd

[TCP]: Process linger2 timeout consistently. · 52499afe

由 David S. Miller 提交于 7月 31, 2006

Based upon guidance from Alexey Kuznetsov.

When linger2 is active, we check to see if the fin_wait2
timeout is longer than the timewait.  If it is, we schedule
the keepalive timer for the difference between the timewait
timeout and the fin_wait2 timeout.

When this orphan socket is seen by tcp_keepalive_timer()
it will try to transform this fin_wait2 socket into a
fin_wait2 mini-socket, again if linger2 is active.

Not all paths were setting this initial keepalive timer correctly.
The tcp input path was doing it correctly, but tcp_close() wasn't,
potentially making the socket linger longer than it really needs to.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

52499afe

[NET]: Core net changes to generate netevents · 8d71740c

由 Tom Tucker 提交于 7月 30, 2006

Generate netevents for:
- neighbour changes
- routing redirects
- pmtu changes
Signed-off-by: NTom Tucker <tom@opengridcomputing.com>
Signed-off-by: NSteve Wise <swise@opengridcomputing.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

8d71740c

[TCP]: SNMPv2 tcpAttemptFails counter error · 3687b1dc

由 Wei Yongjun 提交于 7月 30, 2006

Refer to RFC2012, tcpAttemptFails is defined as following:
  tcpAttemptFails OBJECT-TYPE
      SYNTAX      Counter32
      MAX-ACCESS  read-only
      STATUS      current
      DESCRIPTION
              "The number of times TCP connections have made a direct
              transition to the CLOSED state from either the SYN-SENT
              state or the SYN-RCVD state, plus the number of times TCP
              connections have made a direct transition to the LISTEN
              state from the SYN-RCVD state."
      ::= { tcp 7 }

When I lookup into RFC793, I found that the state change should occured
under following condition:
  1. SYN-SENT -> CLOSED
     a) Received ACK,RST segment when SYN-SENT state.

  2. SYN-RCVD -> CLOSED
     b) Received SYN segment when SYN-RCVD state(came from LISTEN).
     c) Received RST segment when SYN-RCVD state(came from SYN-SENT).
     d) Received SYN segment when SYN-RCVD state(came from SYN-SENT).

  3. SYN-RCVD -> LISTEN
     e) Received RST segment when SYN-RCVD state(came from LISTEN).

In my test, those direct state transition can not be counted to
tcpAttemptFails.
Signed-off-by: NWei Yongjun <yjwei@nanjing-fnst.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

3687b1dc

[TCP]: fix memory leak in net/ipv4/tcp_probe.c::tcpprobe_read() · 118075b3

由 James Morris 提交于 7月 30, 2006

Based upon a patch by Jesper Juhl.
Signed-off-by: NJames Morris <jmorris@namei.org>
Acked-by: NStephen Hemminger <shemminger@osdl.org>
Acked-by: NJesper Juhl <jesper.juhl@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

118075b3

26 7月, 2006 2 次提交

[IPV4/IPV6]: Setting 0 for unused port field in RAW IP recvmsg(). · f59fc7f3

由 Tetsuo Handa 提交于 7月 25, 2006

From: Tetsuo Handa from-linux-kernel@i-love.sakura.ne.jp

The recvmsg() for raw socket seems to return random u16 value
from the kernel stack memory since port field is not initialized.
But I'm not sure this patch is correct.
Does raw socket return any information stored in port field?

[ BSD defines RAW IP recvmsg to return a sin_port value of zero.
  This is described in Steven's TCP/IP Illustrated Volume 2 on
  page 1055, which is discussing the BSD rip_input() implementation. ]
Acked-by: NYOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

f59fc7f3

[IPV4] ipmr: ip multicast route bug fix. · 72287490

由 Alexey Kuznetsov 提交于 7月 25, 2006

IP multicast route code was reusing an skb which causes use after free
and double free.

From: Alexey Kuznetsov <kuznet@ms2.inr.ac.ru>

Note, it is real skb_clone(), not alloc_skb(). Equeued skb contains
the whole half-prepared netlink message plus room for the rest.
It could be also skb_copy(), if we want to be puristic about mangling
cloned data, but original copy is really not going to be used.  
Acked-by: NStephen Hemminger <shemminger@osdl.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

72287490

25 7月, 2006 4 次提交

G
[IPV4]: Clear the whole IPCB, this clears also IPCB(skb)->flags. · d569f1d7
由 Guillaume Chazarain 提交于 7月 24, 2006
```
Signed-off-by: NGuillaume Chazarain <guichaz@yahoo.fr>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
d569f1d7

[NETFILTER]: SNMP NAT: fix byteorder confusion · 8cf8fb56

由 Patrick McHardy 提交于 7月 24, 2006

Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

8cf8fb56

[NETFILTER]: conntrack: fix SYSCTL=n compile · 72b55823

由 Adrian Bunk 提交于 7月 24, 2006

Signed-off-by: NAdrian Bunk <bunk@stusta.de>
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

72b55823

[NETFILTER]: H.323 helper: fix possible NULL-ptr dereference · 083edca0

由 Patrick McHardy 提交于 7月 24, 2006

An RCF message containing a timeout results in a NULL-ptr dereference if
no RRQ has been seen before.

Noticed by the "SATURN tool", reported by Thomas Dillig <tdillig@stanford.edu>
and Isil Dillig <isil@stanford.edu>.
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

083edca0

22 7月, 2006 3 次提交

[IPV4]: Fix nexthop realm dumping for multipath routes · 8265abc0

由 Patrick McHardy 提交于 7月 21, 2006

Routing realms exist per nexthop, but are only returned to userspace
for the first nexthop. This is due to the fact that iproute2 only
allows to set the realm for the first nexthop and the kernel refuses
multipath routes where only a single realm is present.

Dump all realms for multipath routes to enable iproute to correctly
display them.
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

8265abc0

P
[NET]: Conversions from kmalloc+memset to k(z|c)alloc. · 0da974f4
由 Panagiotis Issaris 提交于 7月 21, 2006
```
Signed-off-by: NPanagiotis Issaris <takis@issaris.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
0da974f4

[IPV4]: Get rid of redundant IPCB->opts initialisation · 5d9c5a32

由 Herbert Xu 提交于 7月 21, 2006

Now that we always zero the IPCB->opts in ip_rcv, it is no longer
necessary to do so before calling netif_rx for tunneled packets.
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

5d9c5a32

15 7月, 2006 1 次提交

[IPV4]: Clear skb cb on IP input · 53602f92

由 Stephen Hemminger 提交于 7月 14, 2006

when data arrives at IP through loopback (and possibly other devices).
So the field needs to be cleared before it confuses the route code.
This was seen when running netem over loopback, but there are probably
other device cases. Maybe this should go into stable?
Signed-off-by: NStephen Hemminger <shemminger@osdl.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

53602f92

13 7月, 2006 3 次提交

[IPV4]: Fix error handling for fib_insert_node call · b47b2ec1

由 Herbert Xu 提交于 7月 12, 2006

The error handling around fib_insert_node was broken because we always
zeroed the error before checking it.
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

b47b2ec1

[IPCOMP]: Fix truesize after decompression · da952315

由 Herbert Xu 提交于 7月 11, 2006

The truesize check has uncovered the fact that we forgot to update truesize
after pskb_expand_head. Unfortunately pskb_expand_head can't update it for
us because it's used in all sorts of different contexts, some of which would
not allow truesize to be updated by itself.

So the solution for now is to simply update it in IPComp.

This patch also changes skb_put to __skb_put since we've just expanded
tailroom by exactly that amount so we know it's there (but gcc does not).
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

da952315

[TCP] tcp_highspeed: Fix AI updates. · 6150c22e

由 Xiaoliang (David) Wei 提交于 7月 11, 2006

I think there is still a problem with the AIMD parameter update in
HighSpeed TCP code.

Line 125~138 of the code (net/ipv4/tcp_highspeed.c):

	/* Update AIMD parameters */
	if (tp->snd_cwnd > hstcp_aimd_vals[ca->ai].cwnd) {
		while (tp->snd_cwnd > hstcp_aimd_vals[ca->ai].cwnd &&
		       ca->ai < HSTCP_AIMD_MAX - 1)
			ca->ai++;
	} else if (tp->snd_cwnd < hstcp_aimd_vals[ca->ai].cwnd) {
		while (tp->snd_cwnd > hstcp_aimd_vals[ca->ai].cwnd &&
		       ca->ai > 0)
			ca->ai--;

In fact, the second part (decreasing ca->ai) never decreases since the
while loop's inequality is in the reverse direction. This leads to
unfairness with multiple flows (once a flow happens to enjoy a higher
ca->ai, it keeps enjoying that even its cwnd decreases)

Here is a tentative fix (I also added a comment, trying to keep the
change clear):
Acked-by: NStephen Hemminger <shemminger@osdl.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

6150c22e

11 7月, 2006 2 次提交

[TCP]: Remove TCP Compound · c427d274

由 David S. Miller 提交于 7月 10, 2006

This reverts: f890f921

The inclusion of TCP Compound needs to be reverted at this time
because it is not 100% certain that this code conforms to the
requirements of Developer's Certificate of Origin 1.1 paragraph (b).
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

c427d274

[IPV4] inetpeer: Get rid of volatile from peer_total · 7466d90f

由 Herbert Xu 提交于 7月 09, 2006

The variable peer_total is protected by a lock.  The volatile marker
makes no sense.  This shaves off 20 bytes on i386.
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

7466d90f

09 7月, 2006 3 次提交

[NET]: Fix IPv4/DECnet routing rule dumping · 26e0fd1c

由 Patrick McHardy 提交于 7月 08, 2006

When more rules are present than fit in a single skb, the remaining
rules are incorrectly skipped.
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

26e0fd1c

[NET] gso: Fix up GSO packets with broken checksums · a430a43d

由 Herbert Xu 提交于 7月 08, 2006

Certain subsystems in the stack (e.g., netfilter) can break the partial
checksum on GSO packets. Until they're fixed, this patch allows this to
work by recomputing the partial checksums through the GSO mechanism.

Once they've all been converted to update the partial checksum instead of
clearing it, this workaround can be removed.
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

a430a43d

[NET] gso: Add skb_is_gso · 89114afd

由 Herbert Xu 提交于 7月 08, 2006

This patch adds the wrapper function skb_is_gso which can be used instead
of directly testing skb_shinfo(skb)->gso_size. This makes things a little
nicer and allows us to change the primary key for indicating whether an skb
is GSO (if we ever want to do that).
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

89114afd

04 7月, 2006 3 次提交

[NET]: Verify gso_type too in gso_segment · bbcf467d

由 Herbert Xu 提交于 7月 03, 2006

We don't want nasty Xen guests to pass a TCPv6 packet in with gso_type set
to TCPv4 or even UDP (or a packet that's both TCP and UDP).
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

bbcf467d

[PATCH] lockdep: annotate bh_lock_sock() · c6366184

由 Ingo Molnar 提交于 7月 03, 2006

Teach special (recursive) locking code to the lock validator.  Has no effect
on non-lockdep kernels.
Signed-off-by: NIngo Molnar <mingo@elte.hu>
Signed-off-by: NArjan van de Ven <arjan@linux.intel.com>
Cc: "David S. Miller" <davem@davemloft.net>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

c6366184

[PATCH] lockdep: fix RT_HASH_LOCK_SZ · 62051200

由 Ingo Molnar 提交于 7月 03, 2006

On lockdep we have a quite big spinlock_t, so keep the size down.
Signed-off-by: NIngo Molnar <mingo@elte.hu>
Signed-off-by: NArjan van de Ven <arjan@linux.intel.com>
Cc: "David S. Miller" <davem@davemloft.net>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

62051200