Commits · 0cebe4b4163b6373c9d24c1a192939777bc27e55 · openanolis / cloud-kernel

03 Feb, 2010 5 commits

netfilter: ctnetlink: support selective event delivery · 0cebe4b4

Authored by Patrick McHardy on Feb 03, 2010

Add two masks for conntrack and expectation events to struct nf_conntrack_ecache
and use them to filter events. Their default value is "all events" when the
event sysctl is on and "no events" when it is off. A following patch will add
specific initializations. Expectation events depend on the ecache struct of
their master conntrack.
Signed-off-by: Patrick McHardy <kaber@trash.net>

0cebe4b4
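
The mask-based filtering described in commit 0cebe4b4 above can be pictured as a simple bit test. The following is a minimal user-space sketch, not the kernel's code: the names `sketch_ecache`, `sketch_event`, `sketch_ecache_init` and `sketch_deliver` and the event numbering are assumptions made for illustration.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical event numbers; the kernel's real event enum is not
 * reproduced here. */
enum sketch_event { EV_NEW = 0, EV_DESTROY = 1, EV_ASSURED = 2 };

/* Two masks per connection: one for conntrack events and one for
 * expectation events, as the commit message describes. */
struct sketch_ecache {
    uint16_t ctmask;
    uint16_t expmask;
};

/* Default from the commit message: "all events" when the event sysctl
 * is on, "no events" when it is off. */
static struct sketch_ecache sketch_ecache_init(bool events_sysctl_on)
{
    struct sketch_ecache e;

    e.ctmask = events_sysctl_on ? UINT16_MAX : 0;
    e.expmask = events_sysctl_on ? UINT16_MAX : 0;
    return e;
}

/* An event is delivered only if its bit is set in the mask. */
static bool sketch_deliver(uint16_t mask, enum sketch_event ev)
{
    return (mask & (1u << ev)) != 0;
}
```

With a mask that contains only the `EV_ASSURED` and `EV_DESTROY` bits, `sketch_deliver()` rejects `EV_NEW`, which is the kind of selective delivery the commit enables.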

netfilter: nf_conntrack: split up IPCT_STATUS event · 858b3133

Authored by Patrick McHardy on Feb 03, 2010

Split up the IPCT_STATUS event into an IPCT_REPLY event, which is generated
when the IPS_SEEN_REPLY bit is set, and an IPCT_ASSURED event, which is
generated when the IPS_ASSURED bit is set.

In combination with a following patch to support selective event delivery,
this can be used for "sparse" conntrack replication: start replicating the
conntrack entry after it reached the ASSURED state and that way it's SYN-flood
resistant.
Signed-off-by: Patrick McHardy <kaber@trash.net>

858b3133

netfilter: add struct net * to target parameters · add67461

Authored by Patrick McHardy on Feb 03, 2010

Signed-off-by: Patrick McHardy <kaber@trash.net>

add67461

netfilter: ctnetlink: only assign helpers for matching protocols · 794e6871

Authored by Patrick McHardy on Feb 03, 2010

Make sure not to assign a helper for a different network or transport
layer protocol to a connection.

Additionally change expectation deletion by helper to compare the name
directly - there might be multiple helper registrations using the same
name, currently one of them is chosen in an unpredictable manner and
only those expectations are removed.
Signed-off-by: Patrick McHardy <kaber@trash.net>

794e6871

netfilter: xt_hashlimit: fix race condition and simplify locking · 2eff25c1

Authored by Patrick McHardy on Feb 03, 2010

As noticed by Shin Hong <hongshin@gmail.com>, there is a race between
htable_find_get() and htable_put():

htable_put():				htable_find_get():

					spin_lock_bh(&hashlimit_lock);
					<search entry>
atomic_dec_and_test(&hinfo->use)
					atomic_inc(&hinfo->use)
					spin_unlock_bh(&hashlimit_lock)
					return hinfo;
spin_lock_bh(&hashlimit_lock);
hlist_del(&hinfo->node);
spin_unlock_bh(&hashlimit_lock);
htable_destroy(hinfo);

The entire locking concept is overly complicated, tables are only
created/referenced and released in process context, so a single
mutex works just fine. Remove the hashinfo_spinlock and atomic
reference count and use the mutex to protect table lookups/creation
and reference count changes.
Signed-off-by: Patrick McHardy <kaber@trash.net>

2eff25c1
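
The single-mutex scheme described in commit 2eff25c1 above can be sketched in user space as follows. This is an illustration under stated assumptions, not the xt_hashlimit code: `sketch_table`, `sketch_get` and `sketch_put` are hypothetical names, and a pthread mutex stands in for the kernel mutex. The point is that lookup, creation and every reference count change happen under the same lock, so a lookup can never return a table whose count has already dropped to zero.

```c
#include <pthread.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for a named, reference-counted table. */
struct sketch_table {
    struct sketch_table *next;
    char name[32];
    unsigned int use;   /* plain counter: only touched under the mutex */
};

static pthread_mutex_t sketch_mutex = PTHREAD_MUTEX_INITIALIZER;
static struct sketch_table *sketch_tables;  /* list of live tables */

/* Look up a table by name and take a reference, creating it if missing. */
static struct sketch_table *sketch_get(const char *name)
{
    struct sketch_table *t;

    pthread_mutex_lock(&sketch_mutex);
    for (t = sketch_tables; t != NULL; t = t->next)
        if (strcmp(t->name, name) == 0)
            break;
    if (t == NULL) {
        t = calloc(1, sizeof(*t));
        if (t != NULL) {
            strncpy(t->name, name, sizeof(t->name) - 1);
            t->next = sketch_tables;
            sketch_tables = t;
        }
    }
    if (t != NULL)
        t->use++;
    pthread_mutex_unlock(&sketch_mutex);
    return t;
}

/* Drop a reference; the last put unlinks and frees the table while
 * still holding the mutex that lookups take. */
static void sketch_put(struct sketch_table *t)
{
    struct sketch_table **pp;

    pthread_mutex_lock(&sketch_mutex);
    if (--t->use == 0) {
        for (pp = &sketch_tables; *pp != NULL; pp = &(*pp)->next) {
            if (*pp == t) {
                *pp = t->next;
                break;
            }
        }
        free(t);
    }
    pthread_mutex_unlock(&sketch_mutex);
}
```

A mutex is only acceptable here because, as the commit message notes, tables are created, referenced and released in process context.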

02 Feb, 2010 2 commits

netfilter: xt_TCPMSS: SYN packets are allowed to contain data · 10a19939

Authored by Simon Arlott on Feb 02, 2010

The TCPMSS target is dropping SYN packets where:
  1) There is data, or
  2) The data offset makes the TCP header larger than the packet.

Both of these result in an error level printk. This printk has been
removed.

This change avoids dropping SYN packets containing data. If there
is also no MSS option (as well as data), one will not be added
because of possible complications due to the increased packet size.
Signed-off-by: Simon Arlott <simon@fire.lp0.eu>
Signed-off-by: Patrick McHardy <kaber@trash.net>

10a19939

netfilter: xtables: CONFIG_COMPAT redux · c30f540b

Authored by Alexey Dobriyan on Feb 02, 2010

Ifdef out
	struct nf_sockopt_ops::compat_set
	struct nf_sockopt_ops::compat_get
	struct xt_match::compat_from_user
	struct xt_match::compat_to_user
	struct xt_match::compatsize
to make structures smaller on COMPAT=n kernels.
Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: Patrick McHardy <kaber@trash.net>

c30f540b

23 Jan, 2010 1 commit

netfilter: ipt_CLUSTERIP: simplify seq_file code · 47778147

Authored by Alexey Dobriyan on Jan 22, 2010

Pass "struct clusterip_config" itself to seq_file iterators
and save one dereference. Proc entry itself isn't interesting.
Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: Patrick McHardy <kaber@trash.net>

47778147

20 Jan, 2010 2 commits

IPv6: reassembly: replace magic number with macro definitions · 7c070aa9

Authored by Shan Wei on Jan 20, 2010

Use macro to define high/low thresh value, refer to IPV6_FRAG_TIMEOUT.
Signed-off-by: Shan Wei <shanwei@cn.fujitsu.com>
Acked-by: David S. Miller <davem@davemloft.net>
Signed-off-by: Patrick McHardy <kaber@trash.net>

7c070aa9

netfilter: nf_conntrack_ipv6: delete the redundant macro definitions · b38f6edd

Authored by Shan Wei on Jan 20, 2010

The following three macro definitions are never used, so delete them.
Signed-off-by: Shan Wei <shanwei@cn.fujitsu.com>
Acked-by: David S. Miller <davem@davemloft.net>
Signed-off-by: Patrick McHardy <kaber@trash.net>

b38f6edd

18 Jan, 2010 8 commits

netfilter: nfnetlink_queue: simplify warning message · a5d896ad

Authored by Eric Leblond on Jan 18, 2010

This patch removes the variable part from a debug message so that syslog
can coalesce the repeated messages.
Signed-off-by: Eric Leblond <eric@inl.fr>
Signed-off-by: Patrick McHardy <kaber@trash.net>

a5d896ad

netfilter: xt_hashlimit: netns support · e89fc3f1

Authored by Alexey Dobriyan on Jan 18, 2010

Make hashtable per-netns.
Make proc files per-netns.
Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: Patrick McHardy <kaber@trash.net>

e89fc3f1

netfilter: xt_recent: netns support · 7d07d563

Authored by Alexey Dobriyan on Jan 18, 2010

Make recent table list per-netns.
Make proc files per-netns.
Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: Patrick McHardy <kaber@trash.net>

7d07d563

netfilter: xtables: add struct xt_mtdtor_param::net · f54e9367

Authored by Alexey Dobriyan on Jan 18, 2010

Add ->net to match destructor list like ->net in constructor list.

Make sure it's set in ebtables/iptables/ip6tables; this requires
propagating netns up to *_unregister_table().
Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: Patrick McHardy <kaber@trash.net>

f54e9367

netfilter: xtables: add struct xt_mtchk_param::net · a83d8e8d

Authored by Alexey Dobriyan on Jan 18, 2010

Some complex match modules (like xt_hashlimit/xt_recent) want netns
information at constructor and destructor time. We probably can play
games at match destruction time, because netns can be passed in object,
but I think it's cleaner to explicitly pass netns.

Add ->net, make sure it's set from ebtables/iptables/ip6tables code.
Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: Patrick McHardy <kaber@trash.net>

a83d8e8d

netfilter: xt_hashlimit: simplify seqfile code · a1004d8e

Authored by Alexey Dobriyan on Jan 18, 2010

Simply pass hashtable to seqfile iterators, proc entry itself is not needed.
Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: Patrick McHardy <kaber@trash.net>

a1004d8e

netfilter: netns: #ifdef ->iptable_security, ->ip6table_security · e9d3897c

Authored by Alexey Dobriyan on Jan 18, 2010

'security' tables depend on SECURITY, so ifdef them.
Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: Patrick McHardy <kaber@trash.net>

e9d3897c

netfilter: xt_connlimit: netns support · 83fc8102

Authored by Alexey Dobriyan on Jan 18, 2010

Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: Patrick McHardy <kaber@trash.net>

83fc8102

13 Jan, 2010 2 commits

netfilter: ctnetlink: netns support · 9592a5c0

Authored by Alexey Dobriyan on Jan 13, 2010

Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: Patrick McHardy <kaber@trash.net>

9592a5c0

netfilter: nfnetlink: netns support · cd8c20b6

Authored by Alexey Dobriyan on Jan 13, 2010

Make nfnl socket per-netns.
Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: Patrick McHardy <kaber@trash.net>

cd8c20b6

11 Jan, 2010 3 commits

netfilter: xt_osf: change %pi4 to %pI4 · 7f635d0d

Authored by Joe Perches on Jan 11, 2010

commit 8a27f7c9
changed the output style of %pi4 to use fixed
width leading zero IP addresses "001.002.003.004".

It's useful when printing multiple lines of
addresses, but was a change in output style for
some existing uses.

Using %pI4 restores the previous output style.
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Patrick McHardy <kaber@trash.net>

7f635d0d
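
The difference between the two output styles mentioned in commit 7f635d0d above can be reproduced in user space with ordinary `snprintf` format strings. This is only an illustration of the two styles; `sketch_fmt_pI4` and `sketch_fmt_pi4` are hypothetical helpers, and the kernel's `%pI4`/`%pi4` extensions are not available in a standard C library.

```c
#include <stdint.h>
#include <stdio.h>

/* addr holds the four octets in network order (addr[0] is the first octet). */

/* Plain dotted quad, the style the commit restores with %pI4. */
static void sketch_fmt_pI4(char *buf, size_t len, const uint8_t addr[4])
{
    snprintf(buf, len, "%u.%u.%u.%u", addr[0], addr[1], addr[2], addr[3]);
}

/* Fixed-width style with leading zeros, as described for %pi4. */
static void sketch_fmt_pi4(char *buf, size_t len, const uint8_t addr[4])
{
    snprintf(buf, len, "%03u.%03u.%03u.%03u",
             addr[0], addr[1], addr[2], addr[3]);
}
```

For the octets 1, 2, 3, 4 the first helper writes `1.2.3.4` and the second writes `001.002.003.004`, the example given in the commit message.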

ipvs: use standardized format in sprintf · a79e7ac4

Authored by Joe Perches on Jan 11, 2010

Use the same format string as net/ipv4/netfilter/nf_nat_ftp.c
to encode an ipv4 address and port.

Both uses should be a single common function.
Signed-off-by: Joe Perches <joe@perches.com>
Acked-by: Simon Horman <horms@verge.net.au>
Signed-off-by: Patrick McHardy <kaber@trash.net>

a79e7ac4

netfilter: nf_nat_ftp: remove (*mangle[]) array and functions, use %pI4 · c299bd53

Authored by Joe Perches on Jan 11, 2010

These functions merely exist to format a buffer and call
nf_nat_mangle_tcp_packet.

Format the buffer and perform the call in nf_nat_ftp instead.

Use %pI4 for the IP address.

Saves ~600 bytes of text

old:
$ size net/ipv4/netfilter/nf_nat_ftp.o
   text	   data	    bss	    dec	    hex	filename
   2187	    160	    408	   2755	    ac3	net/ipv4/netfilter/nf_nat_ftp.o
new:
$ size net/ipv4/netfilter/nf_nat_ftp.o
   text    data     bss     dec     hex filename
   1532     112     288    1932     78c net/ipv4/netfilter/nf_nat_ftp.o
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Patrick McHardy <kaber@trash.net>

c299bd53

05 Jan, 2010 1 commit

IPVS: Allow boot time change of hash size · 6f7edb48

Authored by Catalin(ux) M. BOIE on Jan 05, 2010

I was very frustrated about the fact that I have to recompile the kernel
to change the hash size. So, I created this patch.

If IPVS is built-in you can append ip_vs.conn_tab_bits=?? to kernel
command line, or, if you built IPVS as modules, you can add
options ip_vs conn_tab_bits=??.

To keep everything backward compatible, you still can select the size at
compile time, and that will be used as default.

It has been about a year since this patch was originally posted
and subsequently dropped on the basis of insufficient test data.

Mark Bergsma has provided the following test results which seem
to strongly support the need for larger hash table sizes:

We do however run into the same problem with the default setting (2^12 =
4096 entries), as most of our LVS balancers handle around a million
connections/SLAB entries at any point in time (around 100-150 kpps
load). With only 4096 hash table entries this implies that each entry
consists of a linked list of 256 connections *on average*.

To provide some statistics, I did an oprofile run on a 2.6.31 kernel,
with both the default 4096 table size, and the same kernel recompiled
with IP_VS_CONN_TAB_BITS set to 18 (2^18 = 262144 entries). I built a
quick test setup with a part of Wikimedia/Wikipedia's live traffic
mirrored by the switch to the test host.

With the default setting, at ~ 120 kpps packet load we saw a typical %si
CPU usage of around 30-35%, and oprofile reported a hot spot in
ip_vs_conn_in_get:

samples  %        image name               app name      symbol name
1719761  42.3741  ip_vs.ko                 ip_vs.ko      ip_vs_conn_in_get
302577    7.4554  bnx2                     bnx2          /bnx2
181984    4.4840  vmlinux                  vmlinux       __ticket_spin_lock
128636    3.1695  vmlinux                  vmlinux       ip_route_input
74345     1.8318  ip_vs.ko                 ip_vs.ko      ip_vs_conn_out_get
68482     1.6874  vmlinux                  vmlinux       mwait_idle

After loading the recompiled kernel with 2^18 entries, %si CPU usage
dropped in half to around 12-18%, and oprofile looks much healthier,
with only 7% spent in ip_vs_conn_in_get:

samples  %        image name               app name     symbol name
265641   14.4616  bnx2                     bnx2         /bnx2
143251    7.7986  vmlinux                  vmlinux      __ticket_spin_lock
140661    7.6576  ip_vs.ko                 ip_vs.ko     ip_vs_conn_in_get
94364     5.1372  vmlinux                  vmlinux      mwait_idle
86267     4.6964  vmlinux                  vmlinux      ip_route_input

[ horms@verge.net.au: trivial up-port and minor style fixes ]
Signed-off-by: Catalin(ux) M. BOIE <catab@embedromix.ro>
Cc: Mark Bergsma <mark@wikimedia.org>
Signed-off-by: Simon Horman <horms@verge.net.au>
Signed-off-by: Patrick McHardy <kaber@trash.net>

6f7edb48
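
The chain-length argument in commit 6f7edb48 above is plain arithmetic and can be checked with a short sketch. The helper names `sketch_buckets` and `sketch_avg_chain` are hypothetical, and 2^20 connections is used as a stand-in for "around a million"; an even spread over the buckets is assumed.

```c
/* Number of buckets for a table sized by a power-of-two exponent,
 * the way conn_tab_bits is described in the commit message. */
static unsigned long sketch_buckets(unsigned int bits)
{
    return 1UL << bits;
}

/* Average chain length when `conns` entries are spread evenly
 * over a table of 2^bits buckets. */
static double sketch_avg_chain(unsigned long conns, unsigned int bits)
{
    return (double)conns / (double)sketch_buckets(bits);
}
```

With 2^20 connections, `sketch_avg_chain()` gives 256 entries per bucket for 12 bits and 4 entries per bucket for 18 bits, which is consistent with the drop in time spent in ip_vs_conn_in_get reported above.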

04 Jan, 2010 10 commits

netfilter: xtables: obtain random bytes earlier, in checkentry · 294188ae

Authored by Jan Engelhardt on Jan 04, 2010

We can initialize the random hash bytes on checkentry. This is
preferable since it is outside the hot path.

Reference: http://bugzilla.netfilter.org/show_bug.cgi?id=621
Signed-off-by: Jan Engelhardt <jengelh@medozas.de>
Signed-off-by: Patrick McHardy <kaber@trash.net>

294188ae

netfilter: xtables: do not grab random bytes at __init · 5191d501

Authored by Jan Engelhardt on Jan 04, 2010

"It is deliberately not done in the init function, since we might not
have sufficient random while booting."
Signed-off-by: Jan Engelhardt <jengelh@medozas.de>
Signed-off-by: Patrick McHardy <kaber@trash.net>

5191d501

netfilter: xt_recent: save 8 bytes per htable · 89bc7a0f

Authored by Jan Engelhardt on Jan 04, 2010

Moving rnd_inited into the hole after the uint8 lets go of the uint32
rnd_inited was using, plus the padding that would follow the int group.
Signed-off-by: Jan Engelhardt <jengelh@medozas.de>
Signed-off-by: Patrick McHardy <kaber@trash.net>

89bc7a0f
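
The saving described in commit 89bc7a0f above comes from struct padding. The two structs below are a hypothetical layout, not the kernel's actual htable definition; they only illustrate how moving a flag into the hole after a `uint8_t` member can shrink a struct.

```c
#include <stdint.h>

/* Before: the flag is a separate 32-bit field at the end of the int group. */
struct sketch_before {
    void *list;
    unsigned int count;
    uint8_t kind;
    /* the compiler typically pads here to realign the next int */
    unsigned int a, b;
    uint32_t rnd_inited;
};

/* After: the flag lives in the padding hole that follows `kind`. */
struct sketch_after {
    void *list;
    unsigned int count;
    uint8_t kind;
    uint8_t rnd_inited;
    unsigned int a, b;
};
```

On common LP64 ABIs such as x86-64, `sizeof(struct sketch_before)` is 32 and `sizeof(struct sketch_after)` is 24. The exact numbers depend on the ABI, so they should be checked with `sizeof` or a tool such as pahole on the target.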

netfilter: SNMP NAT: correct the size argument to kzalloc · 71c3ebfd

Authored by Julia Lawall on Jan 04, 2010

obj has type struct snmp_object **, not struct snmp_object *.  But indeed
it is not even clear why kmalloc is needed.  The memory is freed by the end
of the function, so the local variable of pointer type should be sufficient.

The semantic patch that makes this change is as follows:
(http://coccinelle.lip6.fr/)

// <smpl>
@disable sizeof_type_expr@
type T;
T **x;
@@

  x =
  <+...sizeof(
- T
+ *x
  )...+>
// </smpl>
Signed-off-by: Julia Lawall <julia@diku.dk>
Signed-off-by: Patrick McHardy <kaber@trash.net>

71c3ebfd
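
The sizing rule behind the semantic patch in commit 71c3ebfd above can be shown in user space. This is a sketch with hypothetical names (`sketch_object`, `sketch_alloc_slot`), and `calloc` stands in for kzalloc: when `x` has type `T **`, the allocation must be sized as `sizeof(*x)`, a pointer, not as `sizeof(T)`.

```c
#include <stdlib.h>

/* Hypothetical object type, deliberately larger than a pointer. */
struct sketch_object {
    int id;
    char payload[60];
};

/* Allocate one zeroed slot that holds a pointer to an object.
 * sizeof(*x) follows the declared type of x, so it stays correct
 * even if the type of x changes later. */
static struct sketch_object **sketch_alloc_slot(void)
{
    struct sketch_object **x;

    x = calloc(1, sizeof(*x));
    return x;
}
```

Writing `sizeof(struct sketch_object)` here would over-allocate, which is the class of mistake the semantic patch rewrites.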

axnet_cs: remove unnecessary spin_unlock_irqrestore · ceba0b29

Authored by Ken Kawasaki on Dec 28, 2009

axnet_cs:
    remove unnecessary spin_unlock_irqrestore,spin_lock_irqsave.
Signed-off-by: Ken Kawasaki <ken_kawasaki@spring.nifty.jp>
Signed-off-by: David S. Miller <davem@davemloft.net>

ceba0b29

tipc: use kconfig to limit numeric ranges · ee983ac7

Authored by Amerigo Wang on Dec 24, 2009

We can rely on kconfig to limit these numbers,
no need to limit them at compile time/run time.

Users who modify these numbers manually should
be responsible for themselves. :)
Signed-off-by: WANG Cong <amwang@redhat.com>
Cc: Per Liden <per.liden@ericsson.com>
Cc: Jon Maloy <jon.maloy@ericsson.com>
Cc: Allan Stephens <allan.stephens@windriver.com>
Cc: David S. Miller <davem@davemloft.net>
Signed-off-by: David S. Miller <davem@davemloft.net>

ee983ac7

can/netlink: add CAN_CTRLMODE_ONE_SHOT · c1c5523d

Authored by Marc Kleine-Budde on Dec 23, 2009

This patch adds the flag CAN_CTRLMODE_ONE_SHOT. It is used as mask
or flag in the "struct can_ctrlmode".

It allows userspace via netlink to set a CAN controller into the special
"one-shot" mode. In this mode, if supported by the CAN controller, it
tries only once to deliver a CAN frame and aborts it if an error
(e.g.: arbitration lost) happens.
Signed-off-by: Marc Kleine-Budde <mkl@pengutronix.de>
Acked-by: Wolfgang Grandegger <wg@grandegger.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

c1c5523d

can: Speed up CAN frame reception by using ml_priv · 20dd3850

Authored by Oliver Hartkopp on Dec 25, 2009

This patch removes the hlist that contains the CAN receiver filter lists.
It uses the 'midlayer private' pointer ml_priv and links the filters directly
to the CAN netdevice, which avoids walking the complete CAN
devices hlist for each received CAN frame.

This patch is tested and does not remove any locking.
Signed-off-by: Oliver Hartkopp <oliver@hartkopp.net>
Signed-off-by: David S. Miller <davem@davemloft.net>

20dd3850

drivers/net/cxgb3: Use kzalloc for allocating only one thing · 75ed0a89

Authored by Julia Lawall on Dec 18, 2009

Use kzalloc rather than kcalloc(1,...)

The semantic patch that makes this change is as follows:
(http://coccinelle.lip6.fr/)

// <smpl>
@@
@@

- kcalloc(1,
+ kzalloc(
          ...)
// </smpl>
Signed-off-by: Julia Lawall <julia@diku.dk>
Acked-by: Divy Le Ray <divy@chelsio.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

75ed0a89

bonding: allow arp_ip_targets on separate vlans to use arp validation · 1f3c8804

Authored by Andy Gospodarek on Dec 14, 2009

This allows a bond device to specify an arp_ip_target as a host that is
not on the same vlan as the base bond device and still use arp
validation.  A configuration like this now works:

BONDING_OPTS="mode=active-backup arp_interval=1000 arp_ip_target=10.0.100.1 arp_validate=3"

1: lo: <LOOPBACK,UP,LOWER_UP> mtu 16436 qdisc noqueue
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
    inet6 ::1/128 scope host
       valid_lft forever preferred_lft forever
2: eth1: <BROADCAST,MULTICAST,SLAVE,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast master bond0 qlen 1000
    link/ether 00:13:21:be:33:e9 brd ff:ff:ff:ff:ff:ff
3: eth0: <BROADCAST,MULTICAST,SLAVE,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast master bond0 qlen 1000
    link/ether 00:13:21:be:33:e9 brd ff:ff:ff:ff:ff:ff
8: bond0: <BROADCAST,MULTICAST,MASTER,UP,LOWER_UP> mtu 1500 qdisc noqueue
    link/ether 00:13:21:be:33:e9 brd ff:ff:ff:ff:ff:ff
    inet6 fe80::213:21ff:febe:33e9/64 scope link
       valid_lft forever preferred_lft forever
9: bond0.100@bond0: <BROADCAST,MULTICAST,MASTER,UP,LOWER_UP> mtu 1500 qdisc noqueue
    link/ether 00:13:21:be:33:e9 brd ff:ff:ff:ff:ff:ff
    inet 10.0.100.2/24 brd 10.0.100.255 scope global bond0.100
    inet6 fe80::213:21ff:febe:33e9/64 scope link
       valid_lft forever preferred_lft forever

Ethernet Channel Bonding Driver: v3.6.0 (September 26, 2009)

Bonding Mode: fault-tolerance (active-backup)
Primary Slave: None
Currently Active Slave: eth1
MII Status: up
MII Polling Interval (ms): 0
Up Delay (ms): 0
Down Delay (ms): 0
ARP Polling Interval (ms): 1000
ARP IP target/s (n.n.n.n form): 10.0.100.1

Slave Interface: eth1
MII Status: up
Link Failure Count: 1
Permanent HW addr: 00:40:05:30:ff:30

Slave Interface: eth0
MII Status: up
Link Failure Count: 0
Permanent HW addr: 00:13:21:be:33:e9
Signed-off-by: Andy Gospodarek <andy@greyhouse.net>
Signed-off-by: Jay Vosburgh <fubar@us.ibm.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

1f3c8804

31 Dec, 2009 2 commits

Merge branch 'master' of git://git.kernel.org/pub/scm/linux/kernel/git/linville/wireless-next-2.6 · 3a999e6e

Authored by David S. Miller on Dec 30, 2009

3a999e6e

Merge git://git.kernel.org/pub/scm/linux/kernel/git/linville/wireless-2.6 · 891dc5e7

Authored by John W. Linville on Dec 30, 2009

Conflicts:
	drivers/net/wireless/libertas/scan.c

891dc5e7

30 Dec, 2009 4 commits

drivers/net/sh_eth.c: use %pM to show MAC address · 6cd9b49d

Authored by H Hartley Sweeten on Dec 29, 2009

Use the %pM kernel extension to display the MAC address.
Signed-off-by: H Hartley Sweeten <hsweeten@visionengravers.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

6cd9b49d
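
For readers unfamiliar with the `%pM` extension used by the commits in this group, the output it produces can be approximated in user space as below. `sketch_fmt_mac` is a hypothetical helper built on standard `snprintf`; it is not the kernel implementation, which takes a pointer to the six address bytes directly in the format argument list.

```c
#include <stdint.h>
#include <stdio.h>

/* Format a 6-byte MAC address as colon-separated lowercase hex,
 * the style this sketch assumes for %pM output. */
static void sketch_fmt_mac(char *buf, size_t len, const uint8_t mac[6])
{
    snprintf(buf, len, "%02x:%02x:%02x:%02x:%02x:%02x",
             mac[0], mac[1], mac[2], mac[3], mac[4], mac[5]);
}
```

The driver patches replace hand-written six-argument format strings of this kind with a single `%pM` conversion.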

drivers/net/r8169.c: use %pM to show MAC address · 30a6ae8d

Authored by H Hartley Sweeten on Dec 29, 2009

Use the %pM kernel extension to display the MAC address.
Signed-off-by: H Hartley Sweeten <hsweeten@visionengravers.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

30a6ae8d

drivers/net/octeon/octeon_mgmt.c: use %pM to show MAC address · e5834820

Authored by H Hartley Sweeten on Dec 29, 2009

Use the %pM kernel extension to display the MAC address.
Signed-off-by: H Hartley Sweeten <hsweeten@visionengravers.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

e5834820

drivers/net/smc911x.c: use %pM to show MAC address · fa876b47

Authored by H Hartley Sweeten on Dec 29, 2009

Use the %pM kernel extension to display the MAC address.
Signed-off-by: H Hartley Sweeten <hsweeten@visionengravers.com>
Signed-off-by: David S. Miller <davem@davemloft.net>

fa876b47

openanolis / cloud-kernel synced successfully 11 months ago
