提交 · 9ed19f339e12e731986de84134ac293cd15910a7 · openeuler / raspberrypi-kernel

19 6月, 2005 7 次提交

[NETLINK]: Set correct pid for ioctl originating netlink events · 9ed19f33

由 Jamal Hadi Salim 提交于 6月 18, 2005

This patch ensures that netlink events created as a result of programns
using ioctls (such as ifconfig, route etc) contains the correct PID of
those events.
Signed-off-by: NJamal Hadi Salim <hadi@cyberus.ca>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

9ed19f33

[NETLINK]: Correctly set NLM_F_MULTI without checking the pid · b6544c0b

由 Jamal Hadi Salim 提交于 6月 18, 2005

This patch rectifies some rtnetlink message builders that derive the
flags from the pid. It is now explicit like the other cases
which get it right. Also fixes half a dozen dumpers which did not
set NLM_F_MULTI at all.
Signed-off-by: NJamal Hadi Salim <hadi@cyberus.ca>
Signed-off-by: NThomas Graf <tgraf@suug.ch>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

b6544c0b

[NET]: Move sysctl_max_syn_backlog into request_sock.c · e52c1f17

由 David S. Miller 提交于 6月 18, 2005

This fixes the CONFIG_INET=n build failure noticed
by Andrew Morton.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

e52c1f17

[NET] rename struct tcp_listen_opt to struct listen_sock · 2ad69c55

由 Arnaldo Carvalho de Melo 提交于 6月 18, 2005

Signed-off-by: NArnaldo Carvalho de Melo <acme@ghostprotocols.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

2ad69c55

[NET] Generalise tcp_listen_opt · 0e87506f

由 Arnaldo Carvalho de Melo 提交于 6月 18, 2005

This chunks out the accept_queue and tcp_listen_opt code and moves
them to net/core/request_sock.c and include/net/request_sock.h, to
make it useful for other transport protocols, DCCP being the first one
to use it.

Next patches will rename tcp_listen_opt to accept_sock and remove the
inline tcp functions that just call a reqsk_queue_ function.
Signed-off-by: NArnaldo Carvalho de Melo <acme@ghostprotocols.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

0e87506f

[NET] Rename open_request to request_sock · 60236fdd

由 Arnaldo Carvalho de Melo 提交于 6月 18, 2005

Ok, this one just renames some stuff to have a better namespace and to
dissassociate it from TCP:

struct open_request  -> struct request_sock
tcp_openreq_alloc    -> reqsk_alloc
tcp_openreq_free     -> reqsk_free
tcp_openreq_fastfree -> __reqsk_free

With this most of the infrastructure closely resembles a struct
sock methods subset.
Signed-off-by: NArnaldo Carvalho de Melo <acme@ghostprotocols.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

60236fdd

[NET] Generalise TCP's struct open_request minisock infrastructure · 2e6599cb

由 Arnaldo Carvalho de Melo 提交于 6月 18, 2005

Kept this first changeset minimal, without changing existing names to
ease peer review.

Basicaly tcp_openreq_alloc now receives the or_calltable, that in turn
has two new members:

->slab, that replaces tcp_openreq_cachep
->obj_size, to inform the size of the openreq descendant for
  a specific protocol

The protocol specific fields in struct open_request were moved to a
class hierarchy, with the things that are common to all connection
oriented PF_INET protocols in struct inet_request_sock, the TCP ones
in tcp_request_sock, that is an inet_request_sock, that is an
open_request.

I.e. this uses the same approach used for the struct sock class
hierarchy, with sk_prot indicating if the protocol wants to use the
open_request infrastructure by filling in sk_prot->rsk_prot with an
or_calltable.

Results? Performance is improved and TCP v4 now uses only 64 bytes per
open request minisock, down from 96 without this patch :-)

Next changeset will rename some of the structs, fields and functions
mentioned above, struct or_calltable is way unclear, better name it
struct request_sock_ops, s/struct open_request/struct request_sock/g,
etc.
Signed-off-by: NArnaldo Carvalho de Melo <acme@ghostprotocols.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

2e6599cb

16 6月, 2005 1 次提交

[NETFILTER]: ipt_recent: last_pkts is an array of "unsigned long" not "u_int32_t" · bcfff0b4

由 David S. Miller 提交于 6月 15, 2005

This fixes various crashes on 64-bit when using this module.

Based upon a patch by Juergen Kreileder <jk@blackdown.de>.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
ACKed-by: NPatrick McHardy <kaber@trash.net>

bcfff0b4

14 6月, 2005 5 次提交

P
[NETFILTER]: Advance seq-file position in exp_next_seq() · a96aca88
由 Patrick McHardy 提交于 6月 13, 2005
```
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
a96aca88

[IPV4]: Sysctl configurable icmp error source address. · 1c2fb7f9

由 J. Simonetti 提交于 6月 13, 2005

This patch alows you to change the source address of icmp error
messages. It applies cleanly to 2.6.11.11 and retains the default
behaviour.

In the old (default) behaviour icmp error messages are sent with the ip
of the exiting interface.

The new behaviour (when the sysctl variable is toggled on), it will send
the message with the ip of the interface that received the packet that
caused the icmp error. This is the behaviour network administrators will
expect from a router. It makes debugging complicated network layouts
much easier. Also, all 'vendor routers' I know of have the later
behaviour.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

1c2fb7f9

[SCTP] Add support for ip_nonlocal_bind sysctl & IP_FREEBIND socket option · cdac4e07

由 Neil Horman 提交于 6月 13, 2005

Signed-off-by: NNeil Horman <nhorman@redhat.com>
Signed-off-by: NSridhar Samudrala <sri@us.ibm.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

cdac4e07

R
[IPV4]: Multipath modules need a license to prevent kernel tainting. · 6efd8455
由 Randy Dunlap 提交于 6月 13, 2005
```
Signed-off-by: NRandy Dunlap <rdunlap@xenotime.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
6efd8455
A
[TCP]: Adjust TCP mem order check to new alloc_large_system_hash · e7626486
由 Andi Kleen 提交于 6月 13, 2005
```
Signed-off-by: NAndi Kleen <ak@suse.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
e7626486

03 6月, 2005 1 次提交

[IPVS]: remove net/ipv4/ipvs/ip_vs_proto_icmp.c · 64a6c7aa

由 Adrian Bunk 提交于 6月 02, 2005

ip_vs_proto_icmp.c was never finished.
Signed-off-by: NAdrian Bunk <bunk@stusta.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

64a6c7aa

01 6月, 2005 1 次提交
- E
  [IPSEC]: Fix esp_decap_data size verification in esp4. · 36839836
  由 Edgar E Iglesias 提交于 5月 31, 2005
```
Signed-off-by: NEdgar E Iglesias <edgar@axis.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
  36839836
31 5月, 2005 2 次提交

[IPV4]: Fix BUG() in 2.6.x, udp_poll(), fragments + CONFIG_HIGHMEM · 208d8984

由 Herbert Xu 提交于 5月 30, 2005

Steven Hand <Steven.Hand@cl.cam.ac.uk> wrote:
> 
> Reconstructed forward trace: 
> 
>   net/ipv4/udp.c:1334   spin_lock_irq() 
>   net/ipv4/udp.c:1336   udp_checksum_complete() 
> net/core/skbuff.c:1069   skb_shinfo(skb)->nr_frags > 1
> net/core/skbuff.c:1086   kunmap_skb_frag()
> net/core/skbuff.h:1087   local_bh_enable()
> kernel/softirq.c:0140   WARN_ON(irqs_disabled());

The receive queue lock is never taken in IRQs (and should never be) so
we can simply substitute bh for irq.
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

208d8984

[NETFILTER]: Fix deadlock with ip_queue and tcp local input path. · 9bb7bc94

由 Harald Welte 提交于 5月 30, 2005

When we have ip_queue being used from LOCAL_IN, then we end up with a
situation where the verdicts coming back from userspace traverse the TCP
input path from syscall context.  While this seems to work most of the
time, there's an ugly deadlock:

syscall context is interrupted by the timer interrupt.  When the timer
interrupt leaves, the timer softirq get's scheduled and calls
tcp_delack_timer() and alike.  They themselves do bh_lock_sock(sk),
which is already held from somewhere else -> boom.

I've now tested the suggested solution by Patrick McHardy and Herbert Xu to
simply use local_bh_{en,dis}able().
Signed-off-by: NHarald Welte <laforge@netfilter.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

9bb7bc94

30 5月, 2005 2 次提交

[IPV4]: Kill MULTIPATHHOLDROUTE flag. · 37e20a66

由 Pravin B. Shelar 提交于 5月 29, 2005

It cannot work properly, so just ignore it in drr
and rr multipath algorithms just like the random
multipath algorithm does.

Suggested by Herbert Xu.

Signed-off by: Pravin B. Shelar <pravins@calsoftinc.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

37e20a66

[IPV4]: Primary and secondary addresses · 8f937c60

由 Harald Welte 提交于 5月 29, 2005

Add an option to make secondary IP addresses get promoted
when primary IP addresses are removed from the device.
It defaults to off to preserve existing behavior.
Signed-off-by: NHarald Welte <laforge@gnumonks.org>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

8f937c60

24 5月, 2005 1 次提交

[TCP]: Fix stretch ACK performance killer when doing ucopy. · 31432412

由 David S. Miller 提交于 5月 23, 2005

When we are doing ucopy, we try to defer the ACK generation to
cleanup_rbuf().  This works most of the time very well, but if the
ucopy prequeue is large, this ACKing behavior kills performance.

With TSO, it is possible to fill the prequeue so large that by the
time the ACK is sent and gets back to the sender, most of the window
has emptied of data and performance suffers significantly.

This behavior does help in some cases, so we should think about
re-enabling this trick in the future, using some kind of limit in
order to avoid the bug case.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

31432412

20 5月, 2005 2 次提交

[NETFILTER]: Do not be clever about SKB ownership in ip_ct_gather_frags(). · 8be58932

由 David S. Miller 提交于 5月 19, 2005

Just do an skb_orphan() and be done with it.
Based upon discussions with Herbert Xu on netdev.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

8be58932

[IP_VS]: Remove extra __ip_vs_conn_put() for incoming ICMP. · d9fa0f39

由 Julian Anastasov 提交于 5月 19, 2005

Remove extra __ip_vs_conn_put for incoming ICMP in direct routing
mode. Mark de Vries reports that IPVS connections are not leaked anymore.
Signed-off-by: NJulian Anastasov <ja@ssi.bg>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

d9fa0f39

19 5月, 2005 1 次提交

[IPV4/IPV6] Ensure all frag_list members have NULL sk · 2fdba6b0

由 Herbert Xu 提交于 5月 18, 2005

Having frag_list members which holds wmem of an sk leads to nightmares
with partially cloned frag skb's.  The reason is that once you unleash
a skb with a frag_list that has individual sk ownerships into the stack
you can never undo those ownerships safely as they may have been cloned
by things like netfilter.  Since we have to undo them in order to make
skb_linearize happy this approach leads to a dead-end.

So let's go the other way and make this an invariant:

	For any skb on a frag_list, skb->sk must be NULL.

That is, the socket ownership always belongs to the head skb.
It turns out that the implementation is actually pretty simple.

The above invariant is actually violated in the following patch
for a short duration inside ip_fragment.  This is OK because the
offending frag_list member is either destroyed at the end of the
slow path without being sent anywhere, or it is detached from
the frag_list before being sent.
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

2fdba6b0

06 5月, 2005 2 次提交

[PATCH] update Ross Biro bouncing email address · 02c30a84

由 Jesper Juhl 提交于 5月 05, 2005

Ross moved.  Remove the bad email address so people will find the correct
one in ./CREDITS.
Signed-off-by: NJesper Juhl <juhl-lkml@dif.dk>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

02c30a84

[IPV4]: multipath_wrandom.c GPF fixes · 60d53065

由 Patrick McHardy 提交于 5月 05, 2005

multipath_wrandom needs to use GFP_ATOMIC.
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

60d53065

04 5月, 2005 7 次提交

[IPSEC]: Store idev entries · aabc9761

由 Herbert Xu 提交于 5月 03, 2005

I found a bug that stopped IPsec/IPv6 from working.  About
a month ago IPv6 started using rt6i_idev->dev on the cached socket dst
entries.  If the cached socket dst entry is IPsec, then rt6i_idev will
be NULL.

Since we want to look at the rt6i_idev of the original route in this
case, the easiest fix is to store rt6i_idev in the IPsec dst entry just
as we do for a number of other IPv6 route attributes.  Unfortunately
this means that we need some new code to handle the references to
rt6i_idev.  That's why this patch is bigger than it would otherwise be.

I've also done the same thing for IPv4 since it is conceivable that
once these idev attributes start getting used for accounting, we
probably need to dereference them for IPv4 IPsec entries too.
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

aabc9761

P
[NETFILTER]: Drop conntrack reference in ip_dev_loopback_xmit() · bd96535b
由 Patrick McHardy 提交于 5月 03, 2005
```
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
bd96535b

[NETLINK]: Synchronous message processing. · 2a0a6ebe

由 Herbert Xu 提交于 5月 03, 2005

Let's recap the problem.  The current asynchronous netlink kernel
message processing is vulnerable to these attacks:

1) Hit and run: Attacker sends one or more messages and then exits
before they're processed.  This may confuse/disable the next netlink
user that gets the netlink address of the attacker since it may
receive the responses to the attacker's messages.

Proposed solutions:

a) Synchronous processing.
b) Stream mode socket.
c) Restrict/prohibit binding.

2) Starvation: Because various netlink rcv functions were written
to not return until all messages have been processed on a socket,
it is possible for these functions to execute for an arbitrarily
long period of time.  If this is successfully exploited it could
also be used to hold rtnl forever.

Proposed solutions:

a) Synchronous processing.
b) Stream mode socket.

Firstly let's cross off solution c).  It only solves the first
problem and it has user-visible impacts.  In particular, it'll
break user space applications that expect to bind or communicate
with specific netlink addresses (pid's).

So we're left with a choice of synchronous processing versus
SOCK_STREAM for netlink.

For the moment I'm sticking with the synchronous approach as
suggested by Alexey since it's simpler and I'd rather spend
my time working on other things.

However, it does have a number of deficiencies compared to the
stream mode solution:

1) User-space to user-space netlink communication is still vulnerable.

2) Inefficient use of resources.  This is especially true for rtnetlink
since the lock is shared with other users such as networking drivers.
The latter could hold the rtnl while communicating with hardware which
causes the rtnetlink user to wait when it could be doing other things.

3) It is still possible to DoS all netlink users by flooding the kernel
netlink receive queue.  The attacker simply fills the receive socket
with a single netlink message that fills up the entire queue.  The
attacker then continues to call sendmsg with the same message in a loop.

Point 3) can be countered by retransmissions in user-space code, however
it is pretty messy.

In light of these problems (in particular, point 3), we should implement
stream mode netlink at some point.  In the mean time, here is a patch
that implements synchronous processing.  
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

2a0a6ebe

[TCP]: Optimize check in port-allocation code. · 0b2531bd

由 Folkert van Heusden 提交于 5月 03, 2005

Signed-off-by: NFolkert van Heusden <folkert@vanheusden.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

0b2531bd

[RTNETLINK] Cleanup rtnetlink_link tables · db46edc6

由 Thomas Graf 提交于 5月 03, 2005

Converts remaining rtnetlink_link tables to use c99 designated
initializers to make greping a little bit easier.
Signed-off-by: NThomas Graf <tgraf@suug.ch>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

db46edc6

P
[NETFILTER]: Don't checksum CHECKSUM_UNNECESSARY skbs in TCP connection tracking · 31da185d
由 Patrick McHardy 提交于 5月 03, 2005
```
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
31da185d
P
[NETFILTER]: Missing owner-field initialization in iptable_raw · b4330957
由 Patrick McHardy 提交于 5月 03, 2005
```
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
b4330957

29 4月, 2005 2 次提交

[NET]: /proc/net/stat/* header cleanup · 5bec0039

由 Olaf Rempel 提交于 4月 28, 2005

Signed-off-by: NOlaf Rempel <razzor@kopf-tisch.de>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

5bec0039

[IPV4]: Incorrect permissions on route flush sysctl · 7e3e0360

由 Dave Jones 提交于 4月 28, 2005

This has been brought up before.. http://lkml.org/lkml/2000/1/21/116
but didnt seem to get resolved.  This morning I got someone
file a bugzilla about it breaking sysctl(8).
Signed-off-by: NDave Jones <davej@redhat.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

7e3e0360

26 4月, 2005 5 次提交

[NET]: kill gratitious includes of major.h · 5523662c

由 Al Viro 提交于 4月 25, 2005

A lot of places in there are including major.h for no reason
whatsoever. Removed. And yes, it still builds.

The history of that stuff is often amusing. E.g. for net/core/sock.c
the story looks so, as far as I've been able to reconstruct it: we used to
need major.h in net/socket.c circa 1.1.early. In 1.1.13 that need had
disappeared, along with register_chrdev(SOCKET_MAJOR, "socket", &net_fops)
in sock_init(). Include had not. When 1.2 -> 1.3 reorg of net/* had moved
a lot of stuff from net/socket.c to net/core/sock.c, this crap had followed...
Signed-off-by: NAl Viro <viro@parcelfarce.linux.theplanet.co.uk>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

5523662c

[TCP]: Trivial tcp_data_queue() cleanup · 088dd3a4

由 James Morris 提交于 4月 25, 2005

This patch removes a superfluous intialization from tcp_data_queue().
Signed-off-by: NJames Morris <jmorris@redhat.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

088dd3a4

[PATCH] kill gratitious includes of major.h under net/* · b453257f

由 Al Viro 提交于 4月 25, 2005

A lot of places in there are including major.h for no reason whatsoever.
Removed.  And yes, it still builds. 

The history of that stuff is often amusing.  E.g.  for net/core/sock.c
the story looks so, as far as I've been able to reconstruct it: we used
to need major.h in net/socket.c circa 1.1.early.  In 1.1.13 that need
had disappeared, along with register_chrdev(SOCKET_MAJOR, "socket",
&net_fops) in sock_init().  Include had not.  When 1.2 -> 1.3 reorg of
net/* had moved a lot of stuff from net/socket.c to net/core/sock.c,
this crap had followed... 
Signed-off-by: NAl Viro <viro@parcelfarce.linux.theplanet.co.uk>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

b453257f

[NETFILTER]: Drop conntrack reference when packet leaves IP · b31e5b1b

由 Patrick McHardy 提交于 4月 25, 2005

In the event a raw socket is created for sending purposes only, the creator
never bothers to check the socket's receive queue.  But we continue to
add skbs to its queue until it fills up.

Unfortunately, if ip_conntrack is loaded on the box, each skb we add to the
queue potentially holds a reference to a conntrack.  If the user attempts
to unload ip_conntrack, we will spin around forever since the queued skbs
are pinned.
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

b31e5b1b

[NETFILTER]: Fix truncated sequence numbers in FTP helper · f649a3bf

由 Yasuyuki KOZAKAI 提交于 4月 25, 2005

Signed-off-by: NYasuyuki KOZAKAI <yasuyuki.kozkaai@toshiba.co.jp>
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

f649a3bf

25 4月, 2005 1 次提交

[TCP]: skb pcount with MTU discovery · d5ac99a6

由 David S. Miller 提交于 4月 24, 2005

The problem is that when doing MTU discovery, the too-large segments in
the write queue will be calculated as having a pcount of >1.  When
tcp_write_xmit() is trying to send, tcp_snd_test() fails the cwnd test
when pcount > cwnd.

The segments are eventually transmitted one at a time by keepalive, but
this can take a long time.

This patch checks if TSO is enabled when setting pcount.
Signed-off-by: NJohn Heffner <jheffner@psc.edu>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

d5ac99a6