提交 · d0410051164bbbc597e15f068b53c06a954ae0d4 · openeuler / raspberrypi-kernel

11 7月, 2007 6 次提交

[TCP]: SACK fastpath did override adjusted fackets_out · d0410051

由 Ilpo Järvinen 提交于 7月 02, 2007

Do same adjustment to SACK fastpath counters provided that
they're valid.
Signed-off-by: NIlpo Järvinen <ilpo.jarvinen@helsinki.fi>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

d0410051

[UDP]: Introduce UDP encapsulation type for L2TP · 342f0234

由 James Chapman 提交于 6月 27, 2007

This patch adds a new UDP_ENCAP_L2TPINUDP encapsulation type for UDP
sockets. When a UDP socket's encap_type is UDP_ENCAP_L2TPINUDP, the
skb is delivered to a function pointed to by the udp_sock's
encap_rcv funcptr. If the skb isn't wanted by L2TP, it returns >0, which
causes it to be passed through to UDP.

Include padding to put the new encap_rcv field on a 4-byte boundary.

Previously, the only user of UDP encap sockets was ESP, so when
CONFIG_XFRM was not defined, some of the encap code was compiled
out. This patch changes that. As a result, udp_encap_rcv() will
now do a little more work when CONFIG_XFRM is not defined.
Signed-off-by: NJames Chapman <jchapman@katalix.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

342f0234

[NET]: IPV6 checksum offloading in network devices · d212f87b

由 Stephen Hemminger 提交于 6月 27, 2007

The existing model for checksum offload does not correctly handle
devices that can offload IPV4 and IPV6 only. The NETIF_F_HW_CSUM flag
implies device can do any arbitrary protocol.

This patch:
 * adds NETIF_F_IPV6_CSUM for those devices
 * fixes bnx2 and tg3 devices that need it
 * add NETIF_F_IPV6_CSUM to ipv6 output (incl GSO)
 * fixes assumptions about NETIF_F_ALL_CSUM in nat
 * adjusts bridge union of checksumming computation
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

d212f87b

[XFRM]: Add module alias for transformation type. · d3d6dd3a

由 Masahide NAKAMURA 提交于 6月 26, 2007

It is clean-up for XFRM type modules and adds aliases with its
protocol:
 ESP, AH, IPCOMP, IPIP and IPv6 for IPsec
 ROUTING and DSTOPTS for MIPv6

It is almost the same thing as XFRM mode alias, but it is added
new defines XFRM_PROTO_XXX for preprocessing since some protocols
are defined as enum.
Signed-off-by: NMasahide NAKAMURA <nakam@linux-ipv6.org>
Acked-by: NIngo Oeser <netdev@axxeo.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

d3d6dd3a

[TCPv4]: Improve BH latency in /proc/net/tcp · a7ab4b50

由 Herbert Xu 提交于 6月 10, 2007

Currently the code for /proc/net/tcp disable BH while iterating
over the entire established hash table.  Even though we call
cond_resched_softirq for each entry, we still won't process
softirq's as regularly as we would otherwise do which results
in poor performance when the system is loaded near capacity.

This anomaly comes from the 2.4 code where this was all in a
single function and the local_bh_disable might have made sense
as a small optimisation.

The cost of each local_bh_disable is so small when compared
against the increased latency in keeping it disabled over a
large but mostly empty TCP established hash table that we
should just move it to the individual read_lock/read_unlock
calls as we do in inet_diag.
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

a7ab4b50

D
[IPV4]: The scheduled removal of multipath cached routing support. · e06e7c61
由 David S. Miller 提交于 6月 10, 2007
```
With help from Chris Wedgwood.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
e06e7c61

24 6月, 2007 1 次提交

[TCP] tcp_read_sock: Allow recv_actor() return return negative error value. · ddb61a57

由 Jens Axboe 提交于 6月 23, 2007

tcp_read_sock() currently assumes that the recv_actor() only returns
number of bytes copied. For network splice receive, we may have to
return an error in some cases. So allow the actor to return a negative
error value.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

ddb61a57

19 6月, 2007 1 次提交

[IPVS]: Fix state variable on failure to start ipvs threads · cc0191ae

由 Neil Horman 提交于 6月 18, 2007

ip_vs currently fails to reset its ip_vs_sync_state variable if the
sync thread fails to start properly.  The result is that the kernel
will report a running daemon when their actuall is none.

If you issue the following commands:

1. ipvsadm --start-daemon master --mcast-interface bla
2. ipvsadm -L --daemon
3. ipvsadm --stop-daemon master

Assuming that bla is not an actual interface, step 2 should return no
data, but instead returns:

$ ipvsadm -L --daemon
master sync daemon (mcast=bla, syncid=0)
Signed-off-by: NNeil Horman <nhorman@tuxdriver.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

cc0191ae

16 6月, 2007 2 次提交

[TCP]: Fix logic breakage due to DSACK separation · 7769f406

由 Ilpo Järvinen 提交于 6月 15, 2007

Commit 6f74651a is found guilty
of breaking DSACK counting, which should be done only for the
SACK block reported by the DSACK instead of every SACK block
that is received along with DSACK information.
Signed-off-by: NIlpo Järvinen <ilpo.jarvinen@helsinki.fi>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

7769f406

[TCP]: Congestion control API RTT sampling fix · b9ce204f

由 Ilpo Järvinen 提交于 6月 15, 2007

Commit 164891aa broke RTT
sampling of congestion control modules. Inaccurate timestamps
could be fed to them without providing any way for them to
identify such cases. Previously RTT sampler was called only if
FLAG_RETRANS_DATA_ACKED was not set filtering inaccurate
timestamps nicely. In addition, the new behavior could give an
invalid timestamp (zero) to RTT sampler if only skbs with
TCPCB_RETRANS were ACKed. This solves both problems.
Signed-off-by: NIlpo Järvinen <ilpo.jarvinen@helsinki.fi>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

b9ce204f

15 6月, 2007 1 次提交

[TCP]: Add missing break to TCP option parsing code · d7ea5b91

由 Ilpo Järvinen 提交于 6月 14, 2007

This flaw does not affect any behavior (currently).
Signed-off-by: NIlpo Järvinen <ilpo.jarvinen@helsinki.fi>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

d7ea5b91

13 6月, 2007 3 次提交

[TCP]: Set initial_ssthresh default to zero in Cubic and BIC. · 66e1e3b2

由 David S. Miller 提交于 6月 13, 2007

Because of the current default of 100, Cubic and BIC perform very
poorly compared to standard Reno.

In the worst case, this change makes Cubic and BIC as aggressive as
Reno.  So this change should be very safe.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

66e1e3b2

[TCP]: Fix left_out setting during FRTO · af15cc7b

由 Ilpo Järvinen 提交于 6月 12, 2007

Without FRTO, the tcp_try_to_open is never called with
lost_out > 0 (see tcp_time_to_recover). However, when FRTO is
enabled, the !tp->lost condition is not used until end of FRTO
because that way TCP avoids premature entry to fast recovery
during FRTO.
Signed-off-by: NIlpo Järvinen <ilpo.jarvinen@helsinki.fi>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

af15cc7b

D
[TCP]: Disable TSO if MD5SIG is enabled. · 3d7dbeac
由 David S. Miller 提交于 6月 12, 2007
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
3d7dbeac

09 6月, 2007 3 次提交

[CIPSO]: Fix several unaligned kernel accesses in the CIPSO engine. · 50e5d35c

由 Paul Moore 提交于 6月 07, 2007

IPv4 options are not very well aligned within the packet and the
format of a CIPSO option is even worse.  The result is that the CIPSO
engine in the kernel does a few unaligned accesses when parsing and
validating incoming packets with CIPSO options attached which generate
error messages on certain alignment sensitive platforms.  This patch
fixes this by marking these unaligned accesses with the
get_unaliagned() macro.
Signed-off-by: NPaul Moore <paul.moore@hp.com>
Acked-by: NJames Morris <jmorris@namei.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

50e5d35c

[NetLabel]: consolidate the struct socket/sock handling to just struct sock · ba6ff9f2

由 Paul Moore 提交于 6月 07, 2007

The current NetLabel code has some redundant APIs which allow both
"struct socket" and "struct sock" types to be used; this may have made
sense at some point but it is wasteful now.  Remove the functions that
operate on sockets and convert the callers.  Not only does this make
the code smaller and more consistent but it pushes the locking burden
up to the caller which can be more intelligent about the locks.  Also,
perform the same conversion (socket to sock) on the SELinux/NetLabel
glue code where it make sense.
Signed-off-by: NPaul Moore <paul.moore@hp.com>
Acked-by: NJames Morris <jmorris@namei.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

ba6ff9f2

[IPV4]: Do not remove idev when addresses are cleared · 6363097c

由 Herbert Xu 提交于 6月 07, 2007

Now that we create idev before addresses are added, it no longer makes
sense to remove them when addresses are all deleted.
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

6363097c

08 6月, 2007 11 次提交

[UDP]: Revert 2-pass hashing changes. · df2bc459

由 David S. Miller 提交于 6月 05, 2007

This reverts changesets:

6aaf47fa
b7b5f487
de34ed91
fc038410

There are still some correctness issues recently
discovered which do not have a known fix that doesn't
involve doing a full hash table scan on port bind.

So revert for now.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

df2bc459

[NETFILTER]: ip_tables: fix compat related crash · 4c1b52bc

由 Dmitry Mishin 提交于 6月 05, 2007

check_compat_entry_size_and_hooks iterates over the matches and calls
compat_check_calc_match, which loads the match and calculates the
compat offsets, but unlike the non-compat version, doesn't call
->checkentry yet. On error however it calls cleanup_matches, which in
turn calls ->destroy, which can result in crashes if the destroy
function (validly) expects to only get called after the checkentry
function.

Add a compat_release_match function that only drops the module reference
on error and rename compat_check_calc_match to compat_find_calc_match to
reflect the fact that it doesn't call the checkentry function.

Reported by Jan Engelhardt <jengelh@linux01.gwdg.de>
Signed-off-by: NDmitry Mishin <dim@openvz.org>
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

4c1b52bc

[NETFILTER]: nf_conntrack: fix helper module unload races · 3c158f7f

由 Patrick McHarrdy 提交于 6月 05, 2007

When a helper module is unloaded all conntracks refering to it have their
helper pointer NULLed out, leading to lots of races. In most places this
can be fixed by proper use of RCU (they do already check for != NULL,
but in a racy way), additionally nf_conntrack_expect_related needs to
bail out when no helper is present.

Also remove two paranoid BUG_ONs in nf_conntrack_proto_gre that are racy
and not worth fixing.
Signed-off-by: NPatrick McHarrdy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

3c158f7f

[NETLINK]: Mark netlink policies const · ef7c79ed

由 Patrick McHardy 提交于 6月 05, 2007

Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

ef7c79ed

[TCP] tcp_probe: Attach printf attribute properly to printl(). · 14a49e1f

由 David S. Miller 提交于 6月 05, 2007

GCC doesn't like the way Stephen initially did it:

net/ipv4/tcp_probe.c:83: warning: empty declaration
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

14a49e1f

[TCP]: Use LIMIT_NETDEBUG in tcp_retransmit_timer(). · 274707cf

由 Eric Dumazet 提交于 6月 05, 2007

LIMIT_NETDEBUG allows the admin to disable some warning messages (echo 0
 >/proc/sys/net/core/warnings).

The "TCP: Treason uncloaked!" message can use this facility.
Signed-off-by: NEric Dumazet <dada1@cosmosbay.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

274707cf

[IPV4]: Restore old behaviour of default config values · 71e27da9

由 Herbert Xu 提交于 6月 04, 2007

Previously inet devices were only constructed when addresses are added
(or rarely in ipmr).  Therefore the default config values they get are
the ones at the time of these operations.

Now that we're creating inet devices earlier, this changes the
behaviour of default config values in an incompatible way (see bug
#8519).

This patch creates a compromise by setting the default values at the
same point as before but only for those that have not been explicitly
set by the user since the inet device's creation.
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

71e27da9

[IPV4]: Add default config support after inetdev_init · 31be3085

由 Herbert Xu 提交于 6月 04, 2007

Previously once inetdev_init has been called on a device any changes
made to ipv4_devconf_dflt would have no effect on that device's
configuration.

This creates a problem since we have moved the point where
inetdev_init is called from when an address is added to where the
device is registered.

This patch is the first half of a set that tries to mimic the old
behaviour while still calling inetdev_init.

It propagates any changes to ipv4_devconf_dflt to those devices that
have not had the corresponding attribute set.

The next patch will forcibly set all values at the point where
inetdev_init was previously called.
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

31be3085

[IPV4]: Convert IPv4 devconf to an array · 42f811b8

由 Herbert Xu 提交于 6月 04, 2007

This patch converts the ipv4_devconf config members (everything except
sysctl) to an array. This allows easier manipulation which will be
needed later on to provide better management of default config values.
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

42f811b8

[IPV4]: Only panic if inetdev_init fails for loopback · 8d76527e

由 Herbert Xu 提交于 6月 04, 2007

When I made the inetdev_init call work on all devices I incorrectly
left in the panic call as well.  It is obviously undesirable to
panic on an allocation failure for a normal network device.  This
patch moves the panic call under the loopback if clause.
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

8d76527e

[TCP]: Honour sk_bound_dev_if in tcp_v4_send_ack · f0e48dbf

由 Patrick McHardy 提交于 6月 04, 2007

A time_wait socket inherits sk_bound_dev_if from the original socket,
but it is not used when sending ACK packets using ip_send_reply.

Fix by passing the oif to ip_send_reply in struct ip_reply_arg and
use it for output routing.
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

f0e48dbf

04 6月, 2007 4 次提交

[ICMP]: Fix icmp_errors_use_inbound_ifaddr sysctl · 6e1d9103

由 Patrick McHardy 提交于 6月 01, 2007

Currently when icmp_errors_use_inbound_ifaddr is set and an ICMP error is
sent after the packet passed through ip_output(), an address from the
outgoing interface is chosen as ICMP source address since skb->dev doesn't
point to the incoming interface anymore.

Fix this by doing an interface lookup on rt->dst.iif and using that device.
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

6e1d9103

W
[IPV4]: Fix "ipOutNoRoutes" counter error for TCP and UDP · 584bdf8c
由 Wei Dong 提交于 5月 31, 2007
```
Signed-off-by: NWei Dong <weidong@cn.fujitsu.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
584bdf8c

[TCP]: Fix GSO ignorance of pkts_acked arg (cong.cntrl modules) · 6418204f

由 Ilpo Järvinen 提交于 5月 31, 2007

The code used to ignore GSO completely, passing either way too
small or zero pkts_acked when GSO skb or part of it got ACKed.
In addition, there is no need to calculate the value in the loop
but simple arithmetics after the loop is sufficient. There is
no need to handle SYN case specially because congestion control
modules are not yet initialized when FLAG_SYN_ACKED is set.
Signed-off-by: NIlpo Järvinen <ilpo.jarvinen@helsinki.fi>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

6418204f

[TCP]: Use default 32768-61000 outgoing port range in all cases. · 3f196eb5

由 Mark Glines 提交于 5月 31, 2007

This diff changes the default port range used for outgoing connections,
from "use 32768-61000 in most cases, but use N-4999 on small boxes
(where N is a multiple of 1024, depending on just *how* small the box
is)" to just "use 32768-61000 in all cases".

I don't believe there are any drawbacks to this change, and it keeps
outgoing connection ports farther away from the mess of
IANA-registered ports.
Signed-off-by: NMark Glines <mark@glines.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

3f196eb5

31 5月, 2007 5 次提交

[TCP] tcp_probe: use GCC printf attribute · 67403754

由 Stephen Hemminger 提交于 5月 29, 2007

The function in tcp_probe is printf like, use GCC to check the args.
Sighed-off-by: NStephen Hemminger <shemminger@linux-foundation.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

67403754

[TCP] tcp_probe: a trivial fix for mismatched number of printl arguments. · 63313494

由 Sangtae Ha 提交于 5月 29, 2007

Just a fix to correct the number of printl arguments. Now, srtt is
logging correctly.
Signed-off-by: NSangtae Ha <sangtae.ha@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

63313494

[TCP]: Consolidate checking for tcp orphan count being too big. · e4fd5da3

由 Pavel Emelianov 提交于 5月 29, 2007

tcp_out_of_resources() and tcp_close() perform the
same checking of number of orphan sockets. Move this
code into common place.
Signed-off-by: NPavel Emelianov <xemul@openvz.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

e4fd5da3

D
[IPV4]: Kill references to bogus non-existent CONFIG_IP_NOSIOCRT · ddc31ce3
由 David S. Miller 提交于 5月 29, 2007
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
ddc31ce3
K
[IPSEC]: Fix panic when using inter address familiy IPsec on loopback. · f282d45c
由 Kazunori MIYAZAWA 提交于 5月 29, 2007
```
Signed-off-by: NKazunori MIYAZAWA <kazunori@miyazawa.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
f282d45c

25 5月, 2007 3 次提交

[XFRM]: Allow packet drops during larval state resolution. · 14e50e57

由 David S. Miller 提交于 5月 24, 2007

The current IPSEC rule resolution behavior we have does not work for a
lot of people, even though technically it's an improvement from the
-EAGAIN buisness we had before.

Right now we'll block until the key manager resolves the route.  That
works for simple cases, but many folks would rather packets get
silently dropped until the key manager resolves the IPSEC rules.

We can't tell these folks to "set the socket non-blocking" because
they don't have control over the non-block setting of things like the
sockets used to resolve DNS deep inside of the resolver libraries in
libc.

With that in mind I coded up the patch below with some help from
Herbert Xu which provides packet-drop behavior during larval state
resolution, controllable via sysctl and off by default.

This lays the framework to either:

1) Make this default at some point or...

2) Move this logic into xfrm{4,6}_policy.c and implement the
   ARP-like resolution queue we've all been dreaming of.
   The idea would be to queue packets to the policy, then
   once the larval state is resolved by the key manager we
   re-resolve the route and push the packets out.  The
   packets would timeout if the rule didn't get resolved
   in a certain amount of time.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

14e50e57

[NETFILTER]: nf_nat_h323: call set_h225_addr instead of set_h225_addr_hook · 1ff75ed2

由 Jing Min Zhao 提交于 5月 24, 2007

They're the same.
Signed-off-by: NJing Min Zhao <zhaojingmin@vivecode.com>
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

1ff75ed2

[NETFILTER]: nf_conntrack_ftp: fix newline sequence number calculation · 25b86e05

由 Patrick McHardy 提交于 5月 24, 2007

When the packet size is changed by the FTP NAT helper, the connection
tracking helper adjusts the sequence number of the newline character
by the size difference. This is wrong because NAT sequence number
adjustment happens after helpers are called, so the unadjusted number
is compared to the already adjusted one.

Based on report by YU, Haitao <yuhaitao@tsinghua.org.cn>
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

25b86e05