提交 · e26feff647ef34423b048b940540a0059001ddb0 · openeuler / Kernel

09 10月, 2008 4 次提交

block: as/cfq ssd idle check update · f7d7b7a7

由 Jens Axboe 提交于 9月 25, 2008

We really need to know about the hardware tagging support as well,
since if the SSD does not do tagging then we still want to idle.
Otherwise have the same dependent sync IO vs flooding async IO
problem as on rotational media.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

f7d7b7a7

block: add queue flag for SSD/non-rotational devices · a68bbddb

由 Jens Axboe 提交于 9月 24, 2008

We don't want to idle in AS/CFQ if the device doesn't have a seek
penalty. So add a QUEUE_FLAG_NONROT to indicate a non-rotational
device, low level drivers should set this flag upon discovery of
an SSD or similar device type.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

a68bbddb

cfq-iosched: fix queue depth detection · 45333d5a

由 Aaron Carroll 提交于 8月 26, 2008

CFQ's detection of queueing devices assumes a non-queuing device and detects
if the queue depth reaches a certain threshold. Under some workloads (e.g.
synchronous reads), CFQ effectively forces a unit queue depth, thus defeating
the detection logic. This leads to poor performance on queuing hardware,
since the idle window remains enabled.

This patch inverts the sense of the logic: assume a queuing-capable device,
and detect if the depth does not exceed the threshold.
Signed-off-by: NAaron Carroll <aaronc@gelato.unsw.edu.au>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

45333d5a

J
block: make kblockd_schedule_work() take the queue as parameter · 18887ad9
由 Jens Axboe 提交于 7月 28, 2008
```
Preparatory patch for checking queuing affinity.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>
```
18887ad9

03 7月, 2008 3 次提交

J
cfq-iosched: get rid of enable_idle being unused warning · c265a7f4
由 Jens Axboe 提交于 6月 26, 2008
```
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>
```
c265a7f4

cfq-iosched: add message logging through blktrace · 7b679138

由 Jens Axboe 提交于 5月 30, 2008

Now that blktrace has the ability to carry arbitrary messages in
its stream, use that for some CFQ logging.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

7b679138

cfq-iosched: properly protect ioc_gone and ioc count · 9a11b4ed

由 Jens Axboe 提交于 5月 29, 2008

If we have multiple tasks freeing cfq_io_contexts when cfq-iosched
is being unloaded, we could complete() ioc_gone twice. Fix that by
protecting ioc_gone complete() and clearing with a spinlock for
just that purpose. Doesn't matter from a performance perspective,
since it'll only enter that path when ioc_gone != NULL (when cfq-iosched
is being rmmod'ed).
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

9a11b4ed

28 5月, 2008 2 次提交

cfq-iosched: fix RCU problem in cfq_cic_lookup() · d6de8be7

由 Jens Axboe 提交于 5月 28, 2008

cfq_cic_lookup() needs to properly protect ioc->ioc_data before
dereferencing it and also exclude updaters of ioc->ioc_data as well.

Also add a number of comments documenting why the existing RCU usage
is OK.

Thanks a lot to "Paul E. McKenney" <paulmck@linux.vnet.ibm.com> for
review and comments!
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

d6de8be7

block: reorder cfq_queue to save space on 64bit builds · be754d2c

由 Richard Kennedy 提交于 5月 23, 2008

saves 8 bytes of padding & increases objects/slab from 30 to 32 on my
AMD64 config
Signed-off-by: NRichard Kennedy <richard@rsk.demon.co.uk>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

be754d2c

07 5月, 2008 2 次提交

cfq-iosched: make io priorities inherit CPU scheduling class as well as nice · 6d63c275

由 Jens Axboe 提交于 5月 07, 2008

We currently set all processes to the best-effort scheduling class,
regardless of what CPU scheduling class they belong to. Improve that
so that we correctly track idle and rt scheduling classes as well.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

6d63c275

cfq-iosched: fix RCU race in the cfq io_context destructor handling · 07416d29

由 Jens Axboe 提交于 5月 07, 2008

put_io_context() drops the RCU read lock before calling into cfq_dtor(),
however we need to hold off freeing there before grabbing and
dereferencing the first object on the list.

So extend the rcu_read_lock() scope to cover the calling of cfq_dtor(),
and optimize cfq_free_io_context() to use a new variant for
call_for_each_cic() that assumes the RCU read lock is already held.

Hit in the wild by Alexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

07416d29

10 4月, 2008 1 次提交

cfq-iosched: do not leak ioc_data across iosched switches · 4faa3c81

由 Fabio Checconi 提交于 4月 10, 2008

When switching scheduler from cfq, cfq_exit_queue() does not clear
ioc->ioc_data, leaving a dangling pointer that can deceive the following
lookups when the iosched is switched back to cfq.  The pattern that can
trigger that is the following:

    - elevator switch from cfq to something else;
    - module unloading, with elv_unregister() that calls cfq_free_io_context()
      on ioc freeing the cic (via the .trim op);
    - module gets reloaded and the elevator switches back to cfq;
    - reallocation of a cic at the same address as before (with a valid key).

To fix it just assign NULL to ioc_data in __cfq_exit_single_io_context(),
that is called from the regular exit path and from the elevator switching
code.  The only path that frees a cic and is not covered is the error handling
one, but cic's freed in this way are never cached in ioc_data.
Signed-off-by: NFabio Checconi <fabio@gandalf.sssup.it>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

4faa3c81

02 4月, 2008 1 次提交

cfq-iosched: fix rcu freeing of cfq io contexts · 34e6bbf2

由 Fabio Checconi 提交于 4月 02, 2008

SLAB_DESTROY_BY_RCU is not a direct substitute for normal call_rcu()
freeing, since it'll page freeing but NOT object freeing. So change
cfq to do the freeing on its own.
Signed-off-by: NFabio Checconi <fabio@gandalf.sssup.it>
Acked-by: NPaul E. McKenney <paulmck@linux.vnet.ibm.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

34e6bbf2

19 2月, 2008 1 次提交

cfq-iosched: add hlist for browsing parallel to the radix tree · ffc4e759

由 Jens Axboe 提交于 2月 19, 2008

It's cumbersome to browse a radix tree from start to finish, especially
since we modify keys when a process exits. So add a hlist for the single
purpose of browsing over all known cfq_io_contexts, used for exit,
io prio change, etc.

This fixes http://bugzilla.kernel.org/show_bug.cgi?id=9948Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

ffc4e759

01 2月, 2008 1 次提交
- J
  cfq-iosched: make checkpatch compliant · fe094d98
  由 Jens Axboe 提交于 1月 31, 2008
```
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>
```
  fe094d98
28 1月, 2008 5 次提交

cfq-iosched: kill some big inlines · febffd61

由 Jens Axboe 提交于 1月 28, 2008

Use of inlines were a bit over the top, trim them down a bit.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

febffd61

cfq-iosched: relax IOPRIO_CLASS_IDLE restrictions · 0871714e

由 Jens Axboe 提交于 1月 28, 2008

Currently you must be root to set idle io prio class on a process. This
is due to the fact that the idle class is implemented as a true idle
class, meaning that it will not make progress if someone else is
requesting disk access. Unfortunately this means that it opens DOS
opportunities by locking down file system resources, hence it is root
only at the moment.

This patch relaxes the idle class a little, by removing the truly idle
part (which entals a grace period with associated timer). The
modifications make the idle class as close to zero impact as can be done
while still guarenteeing progress. This means we can relax the root only
criteria as well.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

0871714e

block: cfq: make the io contect sharing lockless · 4ac845a2

由 Jens Axboe 提交于 1月 24, 2008

The io context sharing introduced a per-ioc spinlock, that would protect
the cfq io context lookup. That is a regression from the original, since
we never needed any locking there because the ioc/cic were process private.

The cic lookup is changed from an rbtree construct to a radix tree, which
we can then use RCU to make the reader side lockless. That is the performance
critical path, modifying the radix tree is only done on process creation
(when that process first does IO, actually) and on process exit (if that
process has done IO).

As it so happens, radix trees are also much faster for this type of
lookup where the key is a pointer. It's a very sparse tree.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

4ac845a2

N
io_context sharing - cfq changes · 66dac98e
由 Nikanth Karthikesan 提交于 11月 27, 2007
```
changes in the cfq for io_context sharing
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>
```
66dac98e

ioprio: move io priority from task_struct to io_context · fd0928df

由 Jens Axboe 提交于 1月 24, 2008

This is where it belongs and then it doesn't take up space for a
process that doesn't do IO.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

fd0928df

18 12月, 2007 1 次提交

block: let elv_register() return void · 2fdd82bd

由 Adrian Bunk 提交于 12月 12, 2007

elv_register() always returns 0, and there isn't anything it does where
it should return an error (the only error condition is so grave that
it's handled with a BUG_ON).
Signed-off-by: NAdrian Bunk <bunk@kernel.org>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

2fdd82bd

07 11月, 2007 3 次提交

cfq_idle_class_timer: add paranoid checks for jiffies overflow · 0e7be9ed

由 Oleg Nesterov 提交于 11月 07, 2007

In theory, if the queue was idle long enough, cfq_idle_class_timer may have
a false (and very long) timeout because jiffies can wrap into the past wrt
->last_end_request.
Signed-off-by: NOleg Nesterov <oleg@tv-sign.ru>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

0e7be9ed

cfq: fix IOPRIO_CLASS_IDLE delays · b70c864d

由 Oleg Nesterov 提交于 11月 07, 2007

After the fresh boot:

	ionice -c3 -p $$
	echo cfq >> /sys/block/XXX/queue/scheduler
	dd if=/dev/XXX of=/dev/null bs=512 count=1

Now dd hangs in D state and the queue is completely stalled for approximately
INITIAL_JIFFIES + CFQ_IDLE_GRACE jiffies. This is because cfq_init_queue()
forgets to initialize cfq_data->last_end_request.

(I guess this patch is not complete, overflow is still possible)
Signed-off-by: NOleg Nesterov <oleg@tv-sign.ru>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

b70c864d

cfq: fix IOPRIO_CLASS_IDLE accounting · 2389d1ef

由 Oleg Nesterov 提交于 11月 05, 2007

Spotted by Nick <gentuu@gmail.com>, hopefully can explain the second trace in
http://bugzilla.kernel.org/show_bug.cgi?id=9180.

If ->async_idle_cfqq != NULL cfq_put_async_queues() puts it IOPRIO_BE_NR times
in a loop. Fix this.
Signed-off-by: NOleg Nesterov <oleg@tv-sign.ru>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

2389d1ef

29 10月, 2007 2 次提交

cfq_get_queue: fix possible NULL pointer access · 0a0836a0

由 Oleg Nesterov 提交于 10月 23, 2007

cfq_get_queue()->cfq_find_alloc_queue() can fail, check the returned value.
Signed-off-by: NOleg Nesterov <oleg@tv-sign.ru>

Note that this isn't a bug at the moment, since the regular IO path
does not call this path without __GFP_WAIT set. However, it could be a
future bug, so I've applied it.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

0a0836a0

cfq_exit_queue() should cancel cfq_data->unplug_work · 4310864b

由 Oleg Nesterov 提交于 10月 23, 2007

Spotted by Nick <gentuu@gmail.com>, perhaps explains the first trace in
http://bugzilla.kernel.org/show_bug.cgi?id=9180.

cfq_exit_queue() should cancel cfqd->unplug_work before freeing cfqd.
blk_sync_queue() seems unneeded, removed.

Q: why cfq_exit_queue() calls cfq_shutdown_timer_wq() twice?
Signed-off-by: NOleg Nesterov <oleg@tv-sign.ru>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

4310864b

24 7月, 2007 1 次提交

[BLOCK] Get rid of request_queue_t typedef · 165125e1

由 Jens Axboe 提交于 7月 24, 2007

Some of the code has been gradually transitioned to using the proper
struct request_queue, but there's lots left. So do a full sweet of
the kernel and get rid of this typedef and replace its uses with
the proper type.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

165125e1

20 7月, 2007 2 次提交

cfq: Write-only stuff in CFQ data structures · 8350163a

由 Alexey Dobriyan 提交于 7月 20, 2007

There are some leftover bits from the task cooperator patch, that was
yanked out again. While it will get reintroduced, no point in having
this write-only stuff in the tree. So yank it.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

8350163a

cfq: async queue allocation per priority · c2dea2d1

由 Vasily Tarasov 提交于 7月 20, 2007

If we have two processes with different ioprio_class, but the same
ioprio_data, their async requests will fall into the same queue. I guess
such behavior is not expected, because it's not right to put real-time
requests and best-effort requests in the same queue.

The attached patch fixes the problem by introducing additional *cfqq
fields on cfqd, pointing to per-(class,priority) async queues.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

c2dea2d1

18 7月, 2007 1 次提交

Slab allocators: Replace explicit zeroing with __GFP_ZERO · 94f6030c

由 Christoph Lameter 提交于 7月 17, 2007

kmalloc_node() and kmem_cache_alloc_node() were not available in a zeroing
variant in the past.  But with __GFP_ZERO it is possible now to do zeroing
while allocating.

Use __GFP_ZERO to remove the explicit clearing of memory via memset whereever
we can.
Signed-off-by: NChristoph Lameter <clameter@sgi.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

94f6030c

10 7月, 2007 1 次提交

cfq-iosched: fix async queue behaviour · 15c31be4

由 Jens Axboe 提交于 7月 10, 2007

With the cfq_queue hash removal, we inadvertently got rid of the
async queue sharing. This was not intentional, in fact CFQ purposely
shares the async queue per priority level to get good merging for
async writes.

So put some logic in cfq_get_queue() to track the shared queues.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

15c31be4

08 5月, 2007 1 次提交

KMEM_CACHE(): simplify slab cache creation · 0a31bd5f

由 Christoph Lameter 提交于 5月 06, 2007

This patch provides a new macro

KMEM_CACHE(<struct>, <flags>)

to simplify slab creation. KMEM_CACHE creates a slab with the name of the
struct, with the size of the struct and with the alignment of the struct.
Additional slab flags may be specified if necessary.

Example

struct test_slab {
	int a,b,c;
	struct list_head;
} __cacheline_aligned_in_smp;

test_slab_cache = KMEM_CACHE(test_slab, SLAB_PANIC)

will create a new slab named "test_slab" of the size sizeof(struct
test_slab) and aligned to the alignment of test slab.  If it fails then we
panic.
Signed-off-by: NChristoph Lameter <clameter@sgi.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

0a31bd5f

30 4月, 2007 8 次提交

cfq-iosched: speedup cic rb lookup · 597bc485

由 Jens Axboe 提交于 4月 24, 2007

We often lookup the same queue many times in succession, so cache
the last looked up queue to avoid browsing the rbtree.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

597bc485

cfq-iosched: get rid of cfqq hash · 91fac317

由 Vasily Tarasov 提交于 4月 25, 2007

cfq hash is no more necessary.  We always can get cfqq from io context.
cfq_get_io_context_noalloc() function is introduced, because we don't
want to allocate cic on merging and checking may_queue.  In order to
identify sync queue we've used hash key = CFQ_KEY_ASYNC. Since hash is
eliminated we need to use other criterion: sync flag for queue is added.
In all places where we dig in rb_tree we're in current context, so no
additional locking is required.

Advantages of this patch: no additional memory for hash, no seeking in
hash, code is cleaner. But it is necessary now to seek cic in per-ioc
rbtree, but it is faster:
- most processes work only with few devices
- most systems have only few block devices
- it is a rb-tree
Signed-off-by: NVasily Tarasov <vtaras@openvz.org>

Changes by me:

- Merge into CFQ devel branch
- Get rid of cfq_get_io_context_noalloc()
- Fix various bugs with dereferencing cic->cfqq[] with offset other
  than 0 or 1.
- Fix bug in cfqq setup, is_sync condition was reversed.
- Fix bug where only bio_sync() is used, we need to check for a READ too
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

91fac317

cfq-iosched: tighten queue request overlap condition · cc197479

由 Jens Axboe 提交于 4月 20, 2007

For tagged devices, allow overlap of requests if the idle window
isn't enabled on the current active queue.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

cc197479

J
cfq-iosched: improve sync vs async workloads · 3ed9a296
由 Jens Axboe 提交于 4月 23, 2007
```
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>
```
3ed9a296

cfq-iosched: never allow an async queue idling · 1be92f2f

由 Jens Axboe 提交于 4月 19, 2007

We don't enable it by default, don't let it get enabled during
runtime.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

1be92f2f

cfq-iosched: get rid of ->dispatch_slice · 20e493a8

由 Jens Axboe 提交于 4月 23, 2007

We can track it fairly accurately locally, let the slice handling
take care of the rest.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

20e493a8

J
cfq-iosched: don't pass unused preemption variable around · 6084cdda
由 Jens Axboe 提交于 4月 23, 2007
```
We don't use it anymore in the slice expiry handling.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>
```
6084cdda

cfq-iosched: get rid of ->cur_rr and ->cfq_list · edd75ffd

由 Jens Axboe 提交于 4月 19, 2007

It's only used for preemption now that the IDLE and RT queues also
use the rbtree. If we pass an 'add_front' variable to
cfq_service_tree_add(), we can set ->rb_key to 0 to force insertion
at the front of the tree.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

edd75ffd

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功