提交 · 98262f2762f0067375f83824d81ea929e37e6bfe · openeuler / raspberrypi-kernel

30 11月, 2009 1 次提交

Revert "cfq: Make use of service count to estimate the rb_key offset" · 464191c6

由 Jens Axboe 提交于 11月 30, 2009

This reverts commit 3586e917.

Corrado Zoccolo <czoccolo@gmail.com> correctly points out, that we need
consistency of rb_key offset across groups. This means we cannot properly
use the per-service_tree service count. Revert this change.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

464191c6

26 11月, 2009 6 次提交

cfq-iosched: fix corner cases in idling logic · 8e550632

由 Corrado Zoccolo 提交于 11月 26, 2009

Idling logic was disabled in some corner cases, leading to unfair share
 for noidle queues.
 * the idle timer was not armed if there were other requests in the
   driver. unfortunately, those requests could come from other workloads,
   or queues for which we don't enable idling. So we will check only
   pending requests from the active queue
 * rq_noidle check on no-idle queue could disable the end of tree idle if
   the last completed request was rq_noidle. Now, we will disable that
   idle only if all the queues served in the no-idle tree had rq_noidle
   requests.
Reported-by: NVivek Goyal <vgoyal@redhat.com>
Signed-off-by: NCorrado Zoccolo <czoccolo@gmail.com>
Acked-by: NVivek Goyal <vgoyal@redhat.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

8e550632

cfq-iosched: idling on deep seeky sync queues · 76280aff

由 Corrado Zoccolo 提交于 11月 26, 2009

Seeky sync queues with large depth can gain unfairly big share of disk
time, at the expense of other seeky queues. This patch ensures that
idling will be enabled for queues with I/O depth at least 4, and small
think time. The decision to enable idling is sticky, until an idle
window times out without seeing a new request.

The reasoning behind the decision is that, if an application is using
large I/O depth, it is already optimized to make full utilization of
the hardware, and therefore we reserve a slice of exclusive use for it.
Reported-by: NVivek Goyal <vgoyal@redhat.com>
Signed-off-by: NCorrado Zoccolo <czoccolo@gmail.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

76280aff

cfq-iosched: fix no-idle preemption logic · e4a22919

由 Corrado Zoccolo 提交于 11月 26, 2009

An incoming no-idle queue should preempt the active no-idle queue
 only if the active queue is idling due to service tree empty.
 Previous code was buggy in two ways:
 * it relied on service_tree field to be set on the active queue, while
   it is not set when the code is idling for a new request
 * it didn't check for the service tree empty condition, so could lead to
   LIFO behaviour if multiple queues with depth > 1 were preempting each
   other on an non-NCQ device.
Reported-by: NVivek Goyal <vgoyal@redhat.com>
Signed-off-by: NCorrado Zoccolo <czoccolo@gmail.com>
Acked-by: NVivek Goyal <vgoyal@redhat.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

e4a22919

cfq-iosched: fix ncq detection code · e459dd08

由 Corrado Zoccolo 提交于 11月 26, 2009

CFQ's detection of queueing devices initially assumes a queuing device
and detects if the queue depth reaches a certain threshold.
However, it will reconsider this choice periodically.

Unfortunately, if device is considered not queuing, CFQ will force a
unit queue depth for some workloads, thus defeating the detection logic.
This leads to poor performance on queuing hardware,
since the idle window remains enabled.

Given this premise, switching to hw_tag = 0 after we have proved at
least once that the device is NCQ capable is not a good choice.

The new detection code starts in an indeterminate state, in which CFQ behaves
as if hw_tag = 1, and then, if for a long observation period we never saw
large depth, we switch to hw_tag = 0, otherwise we stick to hw_tag = 1,
without reconsidering it again.
Signed-off-by: NCorrado Zoccolo <czoccolo@gmail.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

e459dd08

cfq-iosched: cleanup unreachable code · c16632ba

由 Corrado Zoccolo 提交于 11月 26, 2009

cfq_should_idle returns false for no-idle queues that are not the last,
so the control flow will never reach the removed code in a state that
satisfies the if condition.
The unreachable code was added to emulate previous cfq behaviour for
non-NCQ rotational devices. My tests show that even without it, the
performances and fairness are comparable with previous cfq, thanks to
the fact that all seeky queues are grouped together, and that we idle at
the end of the tree.
Signed-off-by: NCorrado Zoccolo <czoccolo@gmail.com>
Acked-by: NVivek Goyal <vgoyal@redhat.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

c16632ba

cfq: Make use of service count to estimate the rb_key offset · 3586e917

由 Gui Jianfeng 提交于 11月 26, 2009

For the moment, different workload cfq queues are put into different
service trees. But CFQ still uses "busy_queues" to estimate rb_key
offset when inserting a cfq queue into a service tree. I think this
isn't appropriate, and it should make use of service tree count to do
this estimation. This patch is for for-2.6.33 branch.
Signed-off-by: NGui Jianfeng <guijianfeng@cn.fujitsu.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

3586e917

11 11月, 2009 1 次提交

block: jiffies fixes · ad5ebd2f

由 Randy Dunlap 提交于 11月 11, 2009

Use HZ-independent calculation of milliseconds.
Add jiffies.h where it was missing since functions or macros
from it are used.
Signed-off-by: NRandy Dunlap <randy.dunlap@oracle.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

ad5ebd2f

09 11月, 2009 1 次提交

cfq-iosched: fix next_rq computation · cf7c25cf

由 Corrado Zoccolo 提交于 11月 08, 2009

Cfq has a bug in computation of next_rq, that affects transition
between multiple sequential request streams in a single queue
(e.g.: two sequential buffered writers of the same priority),
causing the alternation between the two streams for a transient period.

8,0 1 18737 0.260400660 5312 D W 141653311 + 256
8,0 1 20839 0.273239461 5400 D W 141653567 + 256
8,0 1 20841 0.276343885 5394 D W 142803919 + 256
8,0 1 20843 0.279490878 5394 D W 141668927 + 256
8,0 1 20845 0.292459993 5400 D W 142804175 + 256
8,0 1 20847 0.295537247 5400 D W 141668671 + 256
8,0 1 20849 0.298656337 5400 D W 142804431 + 256
8,0 1 20851 0.311481148 5394 D W 141668415 + 256
8,0 1 20853 0.314421305 5394 D W 142804687 + 256
8,0 1 20855 0.318960112 5400 D W 142804943 + 256

The fix makes sure that the next_rq is computed from the last
dispatched request, and not affected by merging.

8,0 1 37776 4.305161306 0 D W 141738087 + 256
8,0 1 37778 4.308298091 0 D W 141738343 + 256
8,0 1 37780 4.312885190 0 D W 141738599 + 256
8,0 1 37782 4.315933291 0 D W 141738855 + 256
8,0 1 37784 4.319064459 0 D W 141739111 + 256
8,0 1 37786 4.331918431 5672 D W 142803007 + 256
8,0 1 37788 4.334930332 5672 D W 142803263 + 256
8,0 1 37790 4.337902723 5672 D W 142803519 + 256
8,0 1 37792 4.342359774 5672 D W 142803775 + 256
8,0 1 37794 4.345318286 0 D W 142804031 + 256
Signed-off-by: NCorrado Zoccolo <czoccolo@gmail.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

cf7c25cf

04 11月, 2009 4 次提交

cfq-iosched: get rid of the coop_preempt flag · e00ef799

由 Jens Axboe 提交于 11月 04, 2009

We need to rework this logic post the cooperating cfq_queue merging,
for now just get rid of it and Jeff Moyer will fix the fall out.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

e00ef799

cfq-iosched: fix merge error · 125c4f22

由 Jens Axboe 提交于 11月 03, 2009

We ended up with testing the same condition twice, pretty
pointless. Remove that first if.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

125c4f22

cfq-iosched: limit coop preemption · 4b27e1bb

由 Shaohua Li 提交于 11月 03, 2009

CFQ has an optimization for cooperated applications. if several
io-context have close requests, they will get boost. But the
optimization get abused. Considering thread a, b, which work on one
file. a reads sectors s, s+2, s+4, ...; b reads sectors s+1, s+3, s
+5, ... Both a and b are sequential read, so they can open idle window.
a reads a sector s and goes to idle window and wakeup b. b reads sector
s+1, since in current implementation, cfq_should_preempt() thinks a and
b are cooperators, b will preempt a. b then reads sector s+1 and goes to
idle window and wakeup a. for the same reason, a will preempt b and
reads s+2. a and b will continue the circle. The circle will be very
long, and a and b will occupy whole disk queue. Other applications will
nearly have no chance to run.

Fix this limiting coop preempt until a queue is scheduled normally
again.
Signed-off-by: NShaohua Li <shaohua.li@intel.com>
Acked-by: NJeff Moyer <jmoyer@redhat.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

4b27e1bb

cfq-iosched: fix bad return value cfq_should_preempt() · e6ec4fe2

由 Jens Axboe 提交于 11月 03, 2009

Commit a6151c3a inadvertently reversed
a preempt condition check, potentially causing a performance regression.
Make the meta check correct again.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

e6ec4fe2

02 11月, 2009 1 次提交

cfq-iosched: simplify prio-unboost code · dddb7451

由 Corrado Zoccolo 提交于 11月 02, 2009

Eliminate redundant checks.
Signed-off-by: NCorrado Zoccolo <czoccolo@gmail.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

dddb7451

28 10月, 2009 6 次提交

J
cfq-iosched: fix style issue in cfq_get_avg_queues() · 5869619c
由 Jens Axboe 提交于 10月 28, 2009
```
Line breaks and bad brace placement.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>
```
5869619c

cfq-iosched: fairness for sync no-idle queues · 718eee05

由 Corrado Zoccolo 提交于 10月 26, 2009

Currently no-idle queues in cfq are not serviced fairly:
even if they can only dispatch a small number of requests at a time,
they have to compete with idling queues to be serviced, experiencing
large latencies.

We should notice, instead, that no-idle queues are the ones that would
benefit most from having low latency, in fact they are any of:
* processes with large think times (e.g. interactive ones like file
  managers)
* seeky (e.g. programs faulting in their code at startup)
* or marked as no-idle from upper levels, to improve latencies of those
  requests.

This patch improves the fairness and latency for those queues, by:
* separating sync idle, sync no-idle and async queues in separate
  service_trees, for each priority
* service all no-idle queues together
* and idling when the last no-idle queue has been serviced, to
  anticipate for more no-idle work
* the timeslices allotted for idle and no-idle service_trees are
  computed proportionally to the number of processes in each set.

Servicing all no-idle queues together should have a performance boost
for NCQ-capable drives, without compromising fairness.
Signed-off-by: NCorrado Zoccolo <czoccolo@gmail.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

718eee05

cfq-iosched: enable idling for last queue on priority class · a6d44e98

由 Corrado Zoccolo 提交于 10月 26, 2009

cfq can disable idling for queues in various circumstances.
When workloads of different priorities are competing, if the higher
priority queue has idling disabled, lower priority queues may steal
its disk share. For example, in a scenario with an RT process
performing seeky reads vs a BE process performing sequential reads,
on an NCQ enabled hardware, with low_latency unset,
the RT process will dispatch only the few pending requests every full
slice of service for the BE process.

The patch solves this issue by always performing idle on the last
queue at a given priority class > idle. If the same process, or one
that can pre-empt it (so at the same priority or higher), submits a
new request within the idle window, the lower priority queue won't
dispatch, saving the disk bandwidth for higher priority ones.

Note: this doesn't touch the non_rotational + NCQ case (no hardware
to test if this is a benefit in that case).
Signed-off-by: NCorrado Zoccolo <czoccolo@gmail.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

a6d44e98

cfq-iosched: reimplement priorities using different service trees · c0324a02

由 Corrado Zoccolo 提交于 10月 27, 2009

We use different service trees for different priority classes.
This allows a simplification in the service tree insertion code, that no
longer has to consider priority while walking the tree.
Signed-off-by: NCorrado Zoccolo <czoccolo@gmail.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

c0324a02

cfq-iosched: preparation to handle multiple service trees · aa6f6a3d

由 Corrado Zoccolo 提交于 10月 26, 2009

We embed a pointer to the service tree in each queue, to handle multiple
service trees easily.
Service trees are enriched with a counter.
cfq_add_rq_rb is invoked after putting the rq in the fifo, to ensure
that all fields in rq are properly initialized.
Signed-off-by: NCorrado Zoccolo <czoccolo@gmail.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

aa6f6a3d

cfq-iosched: adapt slice to number of processes doing I/O · 5db5d642

由 Corrado Zoccolo 提交于 10月 26, 2009

When the number of processes performing I/O concurrently increases,
a fixed time slice per process will cause large latencies.

This patch, if low_latency mode is enabled,  will scale the time slice
assigned to each process according to a 300ms target latency.

In order to keep fairness among processes:
* The number of active processes is computed using a special form of
running average, that quickly follows sudden increases (to keep latency low),
and decrease slowly (to have fairness in spite of rapid decreases of this
value).

To safeguard sequential bandwidth, we impose a minimum time slice
(computed using 2*cfq_slice_idle as base, adjusted according to priority
and async-ness).
Signed-off-by: NCorrado Zoccolo <czoccolo@gmail.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

5db5d642

27 10月, 2009 1 次提交

cfq-iosched: improve hw_tag detection · 1a1238a7

由 Shaohua Li 提交于 10月 27, 2009

If active queue hasn't enough requests and idle window opens, cfq will not
dispatch sufficient requests to hardware. In such situation, current code
will zero hw_tag. But this is because cfq doesn't dispatch enough requests
instead of hardware queue doesn't work. Don't zero hw_tag in such case.
Signed-off-by: NShaohua Li <shaohua.li@intel.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

1a1238a7

26 10月, 2009 4 次提交

cfq: break apart merged cfqqs if they stop cooperating · e6c5bc73

由 Jeff Moyer 提交于 10月 23, 2009

cfq_queues are merged if they are issuing requests within the mean seek
distance of one another.  This patch detects when the coopearting stops and
breaks the queues back up.
Signed-off-by: NJeff Moyer <jmoyer@redhat.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

e6c5bc73

cfq: change the meaning of the cfqq_coop flag · b3b6d040

由 Jeff Moyer 提交于 10月 23, 2009

The flag used to indicate that a cfqq was allowed to jump ahead in the
scheduling order due to submitting a request close to the queue that
just executed.  Since closely cooperating queues are now merged, the flag
holds little meaning.  Change it to indicate that multiple queues were
merged.  This will later be used to allow the breaking up of merged queues
when they are no longer cooperating.
Signed-off-by: NJeff Moyer <jmoyer@redhat.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

b3b6d040

cfq: merge cooperating cfq_queues · df5fe3e8

由 Jeff Moyer 提交于 10月 23, 2009

When cooperating cfq_queues are detected currently, they are allowed to
skip ahead in the scheduling order. It is much more efficient to
automatically share the cfq_queue data structure between cooperating processes.
Performance of the read-test2 benchmark (which is written to emulate the
dump(8) utility) went from 12MB/s to 90MB/s on my SATA disk. NFS servers
with multiple nfsd threads also saw performance increases.
Signed-off-by: NJeff Moyer <jmoyer@redhat.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

df5fe3e8

cfq: calculate the seek_mean per cfq_queue not per cfq_io_context · b2c18e1e

由 Jeff Moyer 提交于 10月 23, 2009

async cfq_queue's are already shared between processes within the same
priority, and forthcoming patches will change the mapping of cic to sync
cfq_queue from 1:1 to 1:N.  So, calculate the seekiness of a process
based on the cfq_queue instead of the cfq_io_context.
Signed-off-by: NJeff Moyer <jmoyer@redhat.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

b2c18e1e

08 10月, 2009 3 次提交

cfq-iosched: avoid probable slice overrun when idling · 355b659c

由 Corrado Zoccolo 提交于 10月 08, 2009

If the average think time is larger than the remaining time slice
for any given queue, don't allow it to idle. A succesful idle also
means that we need to dispatch and complete a request, so if we don't
even have time left for the idle process, we would overrun the slice
in any case.
Signed-off-by: NCorrado Zoccolo <czoccolo@gmail.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

355b659c

cfq-iosched: apply bool value where we return 0/1 · a6151c3a

由 Jens Axboe 提交于 10月 07, 2009

Saves 16 bytes of text, woohoo. But the more important point is
that it makes the code more readable when returning bool for 0/1
cases.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

a6151c3a

cfq-iosched: fix think time allowed for seekers · ec60e4f6

由 Corrado Zoccolo 提交于 10月 07, 2009

CFQ enables idle only for processes that think less than the allowed
idle time. Since idle time is lower for seeky queues, we should use the
correct value in the comparison.
Signed-off-by: NCorrado Zoccolo <czoccolo@gmail.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

ec60e4f6

07 10月, 2009 2 次提交

cfq-iosched: fix the slice residual sign · b9c8946b

由 Jens Axboe 提交于 10月 06, 2009

We should subtract the slice residual from the rb tree key, since
a negative residual count indicates that the cfqq overran its slice
the last time. Hence we want to add the overrun time, to position
it a bit further away in the service tree.
Reported-by: NCorrado Zoccolo <czoccolo@gmail.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

b9c8946b

cfq-iosched: abstract out the 'may this cfqq dispatch' logic · 0b182d61

由 Jens Axboe 提交于 10月 06, 2009

Makes the whole thing easier to read, cfq_dispatch_requests() was
a bit messy before.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

0b182d61

05 10月, 2009 4 次提交

block: get rid of kblock_schedule_delayed_work() · 23e018a1

由 Jens Axboe 提交于 10月 05, 2009

It was briefly introduced to allow CFQ to to delayed scheduling,
but we ended up removing that feature again. So lets kill the
function and export, and just switch CFQ back to the normal work
schedule since it is now passing in a '0' delay from all call
sites.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

23e018a1

cfq-iosched: fix possible problem with jiffies wraparound · 48e025e6

由 Corrado Zoccolo 提交于 10月 05, 2009

The RR service tree is indexed by a key that is relative to current jiffies.
This can cause problems on jiffies wraparound.

The patch fixes it using time_before comparison, and changing
the add_front path to use a relative number, too.
Signed-off-by: NCorrado Zoccolo <czoccolo@gmail.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

48e025e6

cfq-iosched: fix issue with rq-rq merging and fifo list ordering · 30996f40

由 Jens Axboe 提交于 10月 05, 2009

cfq uses rq->start_time as the fifo indicator, but that field may
get modified prior to cfq doing it's fifo list adjustment when
a request gets merged with another request. This can cause the
fifo list to become unordered.
Reported-by: NCorrado Zoccolo <czoccolo@gmail.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

30996f40

cfq-iosched: don't delay async queue if it hasn't dispatched at all · e00c54c3

由 Jens Axboe 提交于 10月 04, 2009

We cannot delay for the first dispatch of the async queue if it
hasn't dispatched at all, since that could present a local user
DoS attack vector using an app that just did slow timed sync reads
while filling memory.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

e00c54c3

04 10月, 2009 2 次提交

cfq-iosched: use assigned slice sync value, not default · 61f0c1dc

由 Jens Axboe 提交于 10月 03, 2009

We should use the sysfs modified slice sync value, in case it differs
from the default.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

61f0c1dc

cfq-iosched: rename 'desktop' sysfs entry to 'low_latency' · 963b72fc

由 Jens Axboe 提交于 10月 03, 2009

Don't think that's necessarily a perfect description of what this
option fiddles with, but it's probably better than 'desktop'.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

963b72fc

03 10月, 2009 3 次提交

cfq-iosched: implement slower async initiate and queue ramp up · 8e296755

由 Jens Axboe 提交于 10月 03, 2009

This slowly ramps up the async queue depth based on the time
passed since the sync IO, and doesn't allow async at all until
a sync slice period has passed.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

8e296755

cfq-iosched: delay async IO dispatch, if sync IO was just done · 365722bb

由 Vivek Goyal 提交于 10月 03, 2009

o Do not allow more than max_dispatch requests from an async queue, if some
  sync request has finished recently. This is in the hope that sync activity
  is still going on in the system and we might receive a sync request soon.
  Most likely from a sync queue which finished a request and we did not enable
  idling on it.
Signed-off-by: NVivek Goyal <vgoyal@redhat.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

365722bb

cfq-iosched: add a knob for desktop interactiveness · 1d223515

由 Jens Axboe 提交于 10月 02, 2009

This is basically identical to what Vivek Goyal posted, but combined
into one and labelled 'desktop' instead of 'fairness'. The goal
is to continue to improve on the latency side of things as it relates
to interactiveness, keeping the questionable bits under this sysfs
tunable so it would be easy for throughput-only people to turn off.

Apart from adding the interactive sysfs knob, it also adds the
behavioural change of allowing slice idling even if the hardware
does tagged command queuing.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

1d223515

14 9月, 2009 1 次提交

cfq: choose a new next_req when a request is dispatched · 06d21886

由 Jeff Moyer 提交于 9月 11, 2009

This patch addresses http://bugzilla.kernel.org/show_bug.cgi?id=13401, a
regression introduced in 2.6.30.

From the bug report:
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

06d21886