提交 · e7272403d2f9be3dbb7cc185fcc390e781b1af6b · openeuler / Kernel

08 9月, 2008 1 次提交

pkt_sched: Fix qdisc state in net_tx_action() · e8a83e10

由 Jarek Poplawski 提交于 9月 07, 2008

net_tx_action() can skip __QDISC_STATE_SCHED bit clearing while qdisc
is neither ran nor rescheduled, which may cause endless loop in
dev_deactivate().
Reported-by: NDenys Fedoryshchenko <denys@visp.net.lb>
Tested-by: NDenys Fedoryshchenko <denys@visp.net.lb>
Signed-off-by: NJarek Poplawski <jarkao2@gmail.com>
Acked-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

e8a83e10

19 8月, 2008 1 次提交

pkt_sched: Prevent livelock in TX queue running. · 195648bb

由 David S. Miller 提交于 8月 19, 2008

If dev_deactivate() is trying to quiesce the queue, it
is theoretically possible for another cpu to livelock
trying to process that queue.  This happens because
dev_deactivate() grabs the queue spinlock as it checks
the queue state, whereas net_tx_action() does a trylock
and reschedules the qdisc if it hits the lock.

This breaks the livelock by adding a check on
__QDISC_STATE_DEACTIVATED to net_tx_action() when
the trylock fails.

Based upon feedback from Herbert Xu and Jarek Poplawski.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

195648bb

18 8月, 2008 3 次提交

D
pkt_sched: Fix missed RCU unlock in dev_queue_xmit() · 96d20316
由 David S. Miller 提交于 8月 17, 2008
```
Noticed by Jarek Poplawski.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
96d20316

net: Change handling of the __QDISC_STATE_SCHED flag in net_tx_action(). · def82a1d

由 Jarek Poplawski 提交于 8月 17, 2008

Change handling of the __QDISC_STATE_SCHED flag in net_tx_action() to
enable proper control in dev_deactivate(). Now, if this flag is seen
as unset under root_lock means a qdisc can't be netif_scheduled.
Signed-off-by: NJarek Poplawski <jarkao2@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

def82a1d

pkt_sched: Add 'deactivated' state. · a9312ae8

由 David S. Miller 提交于 8月 17, 2008

This new state lets dev_deactivate() mark a qdisc as having been
deactivated.

dev_queue_xmit() and ing_filter() check for this bit and do not
try to process the qdisc if the bit is set.

dev_deactivate() polls the qdisc after setting the bit, waiting
for both __QDISC_STATE_RUNNING and __QDISC_STATE_SCHED to clear.

This isn't perfect yet, but subsequent changesets will make it so.
This part is just one piece of the puzzle.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

a9312ae8

07 8月, 2008 3 次提交

net/core: Allow receive on active slaves. · f982307f

由 Joe Eykholt 提交于 7月 02, 2008

If a packet_type specifies an active slave to bonding and not just any
interface, allow it to receive frames that came in on that interface.
Signed-off-by: NJoe Eykholt <jre@nuovasystems.com>
Signed-off-by: NJay Vosburgh <fubar@us.ibm.com>
Signed-off-by: NJeff Garzik <jgarzik@redhat.com>

f982307f

net/core: Allow certain receives on inactive slave. · 0d7a3681

由 Joe Eykholt 提交于 7月 02, 2008

Allow a packet_type that specifies the exact device to receive
even on an inactive bonding slave devices.  This is important for some
L2 protocols such as LLDP and FCoE.  This can eventually be used
for the bonding special cases as well.
Signed-off-by: NJoe Eykholt <jre@nuovasystems.com>
Signed-off-by: NJay Vosburgh <fubar@us.ibm.com>
Signed-off-by: NJeff Garzik <jgarzik@redhat.com>

0d7a3681

net/core: Uninline skb_bond(). · cc9bd5ce

由 Joe Eykholt 提交于 7月 02, 2008

Otherwise subsequent changes need multiple return values.
Signed-off-by: NJoe Eykholt <jre@nuovasystems.com>
Signed-off-by: NJay Vosburgh <fubar@us.ibm.com>
Signed-off-by: NJeff Garzik <jgarzik@redhat.com>

cc9bd5ce

05 8月, 2008 1 次提交

net_sched: Add qdisc __NET_XMIT_BYPASS flag · c27f339a

由 Jarek Poplawski 提交于 8月 04, 2008

Patrick McHardy <kaber@trash.net> noticed that it would be nice to
handle NET_XMIT_BYPASS by NET_XMIT_SUCCESS with an internal qdisc flag
__NET_XMIT_BYPASS and to remove the mapping from dev_queue_xmit().

David Miller <davem@davemloft.net> spotted a serious bug in the first
version of this patch.
Signed-off-by: NJarek Poplawski <jarkao2@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

c27f339a

04 8月, 2008 1 次提交

net: eliminate refcounting in backlog queue · 6e583ce5

由 Stephen Hemminger 提交于 8月 03, 2008

Avoid the overhead of atomic increment/decrement on each received packet.
This helps performance of non-NAPI devices (like loopback).
Use cleanup function to walk queue on each cpu and clean out any
left over packets.
Signed-off-by: NStephen Hemminger <shemminger@vyatta.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

6e583ce5

03 8月, 2008 2 次提交

net: use software GSO for SG+CSUM capable netdevices · e5a4a72d

由 Lennert Buytenhek 提交于 8月 03, 2008

If a netdevice does not support hardware GSO, allowing the stack to
use GSO anyway and then splitting the GSO skb into MSS-sized pieces
as it is handed to the netdevice for transmitting is likely still
a win as far as throughput and/or CPU usage are concerned, since it
reduces the number of trips through the output path.

This patch enables the use of GSO on any netdevice that supports SG.
If a GSO skb is then sent to a netdevice that supports SG but does not
support hardware GSO, net/core/dev.c:dev_hard_start_xmit() will take
care of doing the necessary GSO segmentation in software.
Signed-off-by: NLennert Buytenhek <buytenh@marvell.com>
Signed-off-by: NHerbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

e5a4a72d

pkt_sched: Use qdisc_lock() on already sampled root qdisc. · 5fb66229

由 David S. Miller 提交于 8月 02, 2008

Based upon a bug report by Jeff Kirsher.

Don't use qdisc_root_lock() in these cases as the root
qdisc could have been changed, and we'd thus lock the
wrong object.

Tested by Emil S Tantilov who confirms that this seems
to fix the problem.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

5fb66229

01 8月, 2008 1 次提交

netdev: Fix lockdep warnings in multiqueue configurations. · c3f26a26

由 David S. Miller 提交于 7月 31, 2008

When support for multiple TX queues were added, the
netif_tx_lock() routines we converted to iterate over
all TX queues and grab each queue's spinlock.

This causes heartburn for lockdep and it's not a healthy
thing to do with lots of TX queues anyways.

So modify this to use a top-level lock and a "frozen"
state for the individual TX queues.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

c3f26a26

30 7月, 2008 1 次提交

pkt_sched: Fix OOPS on ingress qdisc add. · 8d50b53d

由 David S. Miller 提交于 7月 30, 2008

Bug report from Steven Jan Springl:

	Issuing the following command causes a kernel oops:
		tc qdisc add dev eth0 handle ffff: ingress

The problem mostly stems from all of the special case handling of
ingress qdiscs.

So, to fix this, do the grafting operation the same way we do for TX
qdiscs.  Which means that dev_activate() and dev_deactivate() now do
the "qdisc_sleeping <--> qdisc" transitions on dev->rx_queue too.

Future simplifications are possible now, mainly because it is
impossible for dev_queue->{qdisc,qdisc_sleeping} to be NULL.  There
are NULL checks all over to handle the ingress qdisc special case
that used to exist before this commit.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

8d50b53d

26 7月, 2008 1 次提交

net: convert BUG_TRAP to generic WARN_ON · 547b792c

由 Ilpo Järvinen 提交于 7月 25, 2008

Removes legacy reinvent-the-wheel type thing. The generic
machinery integrates much better to automated debugging aids
such as kerneloops.org (and others), and is unambiguous due to
better naming. Non-intuively BUG_TRAP() is actually equal to
WARN_ON() rather than BUG_ON() though some might actually be
promoted to BUG_ON() but I left that to future.

I could make at least one BUILD_BUG_ON conversion.
Signed-off-by: NIlpo Järvinen <ilpo.jarvinen@helsinki.fi>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

547b792c

24 7月, 2008 1 次提交

netdev: Remove warning from __netif_schedule(). · 5b3ab1db

由 David S. Miller 提交于 7月 23, 2008

It isn't helping anything and we aren't going to be able to change all
the drivers that do queue wakeups in strange situations.

Just letting a noop_qdisc get scheduled will work because when
qdisc_run() executes via net_tx_work() it will simply find no packets
pending when it makes the ->dequeue() call in qdisc_restart.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

5b3ab1db

23 7月, 2008 2 次提交

netdev: Handle ->addr_list_lock just like ->_xmit_lock for lockdep. · cf508b12

由 David S. Miller 提交于 7月 22, 2008

The new address list lock needs to handle the same device layering
issues that the _xmit_lock one does.

This integrates work done by Patrick McHardy.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

cf508b12

net: Fix build failure with 'make mandocs'. · d29f749e

由 Dave Jones 提交于 7月 22, 2008

The function header comments have to go with the functions
they are documenting, or things go horribly wrong when we
try to process them with the docbook tools.

Warning(include/linux/netdevice.h:1006): No description found for parameter 'dev_queue'
Warning(include/linux/netdevice.h:1033): No description found for parameter 'dev_queue'
Warning(include/linux/netdevice.h:1067): No description found for parameter 'dev_queue'
Warning(include/linux/netdevice.h:1093): No description found for parameter 'dev_queue'
Warning(include/linux/netdevice.h:1474): No description found for parameter 'txq'
Error(net/core/dev.c:1674): cannot understand prototype: 'u32 simple_tx_hashrnd; '
Signed-off-by: NDave Jones <davej@redhat.com>
Acked-by: NRandy Dunlap <randy.dunlap@oracle.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

d29f749e

22 7月, 2008 4 次提交

net: Print the module name as part of the watchdog message · 6579e57b

由 Arjan van de Ven 提交于 7月 21, 2008

As suggested by Dave:

This patch adds a function to get the driver name from a struct net_device,
and consequently uses this in the watchdog timeout handler to print as 
part of the message. 
Signed-off-by: NArjan van de Ven <arjan@linux.intel.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

6579e57b

net: use kcalloc in netdev_queue alloc · 7943986c

由 Stephen Hemminger 提交于 7月 21, 2008

Minor nit, use size_t for allocation size and kcalloc to allocate
an array. Probably makes no actual code difference.
Signed-off-by: NStephen Hemminger <shemminger@vyatta.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

7943986c

L
net: In __netif_schedule() use WARN_ON instead of BUG_ON · 867d79fb
由 Linus Torvalds 提交于 7月 21, 2008
```
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
867d79fb

net: Improve simple_tx_hash(). · b6b2fed1

由 David S. Miller 提交于 7月 21, 2008

Based upon feedback from Eric Dumazet and Andi Kleen.

Cure several deficiencies in simple_tx_hash() by using
jhash + reciprocol multiply.

1) Eliminates expensive modulus operation.

2) Makes hash less attackable by using random seed.

3) Eliminates endianness hash distribution issues.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

b6b2fed1

20 7月, 2008 1 次提交

net_sched: Add qdisc_enqueue wrapper · 5f86173b

由 Jussi Kivilinna 提交于 7月 20, 2008

Signed-off-by: NJussi Kivilinna <jussi.kivilinna@mbnet.fi>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

5f86173b

19 7月, 2008 1 次提交

pkt_sched: Manage qdisc list inside of root qdisc. · 30723673

由 David S. Miller 提交于 7月 18, 2008

Idea is from Patrick McHardy.

Instead of managing the list of qdiscs on the device level, manage it
in the root qdisc of a netdev_queue.  This solves all kinds of
visibility issues during qdisc destruction.

The way to iterate over all qdiscs of a netdev_queue is to visit
the netdev_queue->qdisc, and then traverse it's list.

The only special case is to ignore builting qdiscs at the root when
dumping or doing a qdisc_lookup().  That was not needed previously
because builtin qdiscs were not added to the device's qdisc_list.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

30723673

18 7月, 2008 7 次提交

pkt_sched: Kill netdev_queue lock. · 83874000

由 David S. Miller 提交于 7月 17, 2008

We can simply use the qdisc->q.lock for all of the
qdisc tree synchronization.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

83874000

D
netdevice: Move qdisc_list back into net_device proper. · ead81cc5
由 David S. Miller 提交于 7月 17, 2008
```
And give it it's own lock.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
```
ead81cc5

pkt_sched: Schedule qdiscs instead of netdev_queue. · 37437bb2

由 David S. Miller 提交于 7月 16, 2008

When we have shared qdiscs, packets come out of the qdiscs
for multiple transmit queues.

Therefore it doesn't make any sense to schedule the transmit
queue when logically we cannot know ahead of time the TX
queue of the SKB that the qdisc->dequeue() will give us.

Just for sanity I added a BUG check to make sure we never
get into a state where the noop_qdisc is scheduled.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

37437bb2

net: Implement simple sw TX hashing. · 8f0f2223

由 David S. Miller 提交于 7月 15, 2008

It just xor hashes over IPv4/IPv6 addresses and ports of transport.

The only assumption it makes is that skb_network_header() is set
correctly.

With bug fixes from Eric Dumazet.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

8f0f2223

netdev: Add netdev->select_queue() method. · eae792b7

由 David S. Miller 提交于 7月 15, 2008

Devices or device layers can set this to control the queue selection
performed by dev_pick_tx().

This function runs under RCU protection, which allows overriding
functions to have some way of synchronizing with things like dynamic
->real_num_tx_queues adjustments.

This makes the spinlock prefetch in dev_queue_xmit() a little bit
less effective, but that's the price right now for correctness.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

eae792b7

net: Use queue aware tests throughout. · fd2ea0a7

由 David S. Miller 提交于 7月 17, 2008

This effectively "flips the switch" by making the core networking
and multiqueue-aware drivers use the new TX multiqueue structures.

Non-multiqueue drivers need no changes.  The interfaces they use such
as netif_stop_queue() degenerate into an operation on TX queue zero.
So everything "just works" for them.

Code that really wants to do "X" to all TX queues now invokes a
routine that does so, such as netif_tx_wake_all_queues(),
netif_tx_stop_all_queues(), etc.

pktgen and netpoll required a little bit more surgery than the others.

In particular the pktgen changes, whilst functional, could be largely
improved.  The initial check in pktgen_xmit() will sometimes check the
wrong queue, which is mostly harmless.  The thing to do is probably to
invoke fill_packet() earlier.

The bulk of the netpoll changes is to make the code operate solely on
the TX queue indicated by by the SKB queue mapping.

Setting of the SKB queue mapping is entirely confined inside of
net/core/dev.c:dev_pick_tx().  If we end up needing any kind of
special semantics (drops, for example) it will be implemented here.

Finally, we now have a "real_num_tx_queues" which is where the driver
indicates how many TX queues are actually active.

With IGB changes from Jeff Kirsher.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

fd2ea0a7

netdev: Allocate multiple queues for TX. · e8a0464c

由 David S. Miller 提交于 7月 17, 2008

alloc_netdev_mq() now allocates an array of netdev_queue
structures for TX, based upon the queue_count argument.

Furthermore, all accesses to the TX queues are now vectored
through the netdev_get_tx_queue() and netdev_for_each_tx_queue()
interfaces.  This makes it easy to grep the tree for all
things that want to get to a TX queue of a net device.

Problem spots which are not really multiqueue aware yet, and
only work with one queue, can easily be spotted by grepping
for all netdev_get_tx_queue() calls that pass in a zero index.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

e8a0464c

15 7月, 2008 4 次提交

netdev: Do not use TX lock to protect address lists. · b9e40857

由 David S. Miller 提交于 7月 15, 2008

Now that we have a specific lock to protect the network
device unicast and multicast lists, remove extraneous
grabs of the TX lock in cases where the code only needs
address list protection.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

b9e40857

netdev: Add netdev->addr_list_lock protection. · e308a5d8

由 David S. Miller 提交于 7月 15, 2008

Add netif_addr_{lock,unlock}{,_bh}() helpers.

Use them to protect operations that operate on or read
the network device unicast and multicast address lists.

Also use them in cases where the code simply wants to
block calls into the driver's ->set_rx_mode() and
->set_multicast_list() methods.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

e308a5d8

netdev: Add addr_list_lock to struct net_device. · f1f28aa3

由 David S. Miller 提交于 7月 15, 2008

This will be used to protect the per-device unicast and multicast
address lists, as well as the callbacks into the drivers which
configure such state such as ->set_rx_mode() and ->set_multicast_list().
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

f1f28aa3

vlan: deliver packets received with VLAN acceleration to network taps · bc1d0411

由 Patrick McHardy 提交于 7月 14, 2008

When VLAN header stripping is used, packets currently bypass packet
sockets (and other network taps) completely. For locally existing
VLANs, they appear directly on the VLAN device, for unknown VLANs
they are silently dropped.

Add a new function netif_nit_deliver() to deliver incoming packets
to all network interface taps and use it in __vlan_hwaccel_rx() to
make VLAN packets visible on the underlying device.
Signed-off-by: NPatrick McHardy <kaber@trash.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

bc1d0411

09 7月, 2008 5 次提交

netdev: Move _xmit_lock and xmit_lock_owner into netdev_queue. · c773e847

由 David S. Miller 提交于 7月 08, 2008

Accesses are mostly structured such that when there are multiple TX
queues the code transformations will be a little bit simpler.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

c773e847

pkt_sched: Make qdisc_run take a netdev_queue. · eb6aafe3

由 David S. Miller 提交于 7月 08, 2008

This allows us to use this calling convention all the way down into
qdisc_restart().
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

eb6aafe3

netdev: Make netif_schedule() routines work with netdev_queue objects. · 86d804e1

由 David S. Miller 提交于 7月 08, 2008

Only plain netif_schedule() remains taking a net_device, mostly as a
compatability item while we transition the rest of these interfaces.

Everything else calls netif_schedule_queue() or __netif_schedule(),
both of which take a netdev_queue pointer.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

86d804e1

netdev: Move next_sched into struct netdev_queue. · ee609cb3

由 David S. Miller 提交于 7月 08, 2008

We schedule queues, not the device, for output queue processing in BH.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

ee609cb3

netdev: Kill qdisc_ingress, use netdev->rx_queue.qdisc instead. · 816f3258

由 David S. Miller 提交于 7月 08, 2008

Now that our qdisc management is bi-directional, per-queue, and fully
orthogonal, there is no reason to have a special ingress qdisc pointer
in struct net_device.
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

816f3258

openeuler / Kernel 大约 1 年 前同步成功

openeuler / Kernel
大约 1 年前同步成功