Commits · cfd0c552a8272d691691f40073654d775836e23a · openeuler / Kernel

22 Oct, 2015 (7 commits)

blk-mq: mark ctx as pending at batch in flush plug path · cfd0c552

Committed by Ming Lei on Oct 20, 2015

Most of the time, flush plug should be the hottest I/O path,
so mark ctx as pending after all requests in the list are
inserted.

Reviewed-by: Jeff Moyer <jmoyer@redhat.com>
Signed-off-by: Ming Lei <ming.lei@canonical.com>
Signed-off-by: Jens Axboe <axboe@fb.com>
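
The batching idea in this message can be illustrated with a small user-space model. This is only a sketch under stated assumptions: `model_ctx`, `model_hctx`, `model_mark_pending` and `model_insert_requests` are hypothetical names invented for the example, not the kernel's data structures or functions. It shows the pending bit being set once after the whole list is inserted rather than once per request.

```c
#include <assert.h>
#include <stddef.h>

#define MODEL_CTX_CAPACITY 32   /* arbitrary bound for this model */

/* Hypothetical software queue: a list of request ids plus its bit index. */
struct model_ctx {
    int rq_list[MODEL_CTX_CAPACITY];
    size_t nr;
    unsigned int index;         /* bit position in the pending map */
};

/* Hypothetical hardware queue: one pending bit per software queue. */
struct model_hctx {
    unsigned long pending_map;
    unsigned int map_writes;    /* how many times the map was touched */
};

static void model_mark_pending(struct model_hctx *hctx,
                               const struct model_ctx *ctx)
{
    hctx->pending_map |= 1UL << ctx->index;
    hctx->map_writes++;
}

/* Insert the whole batch first, then mark the ctx as pending once. */
static void model_insert_requests(struct model_hctx *hctx,
                                  struct model_ctx *ctx,
                                  const int *rqs, size_t n)
{
    assert(ctx->nr + n <= MODEL_CTX_CAPACITY);
    for (size_t i = 0; i < n; i++)
        ctx->rq_list[ctx->nr++] = rqs[i];
    if (n > 0)
        model_mark_pending(hctx, ctx);
}
```

In this model, inserting four requests touches the pending map a single time; a per-request variant would touch it four times.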

blk-mq: fix for trace_block_plug() · 676d0607

Committed by Ming Lei on Oct 20, 2015

The trace point is for tracing the plug event of each request
queue instead of each task, so we should check the request
count in the plug list from the current queue instead of
the current task.

Signed-off-by: Ming Lei <ming.lei@canonical.com>
Reviewed-by: Jeff Moyer <jmoyer@redhat.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

block: check bio_mergeable() early before merging · 7460d389

Committed by Ming Lei on Oct 20, 2015

Now that bio splitting has been introduced, a bio can be split
and is then marked as NOMERGE because it is too large to be merged,
so check bio_mergeable() earlier to avoid trying to merge it
unnecessarily.

Signed-off-by: Ming Lei <ming.lei@canonical.com>
Reviewed-by: Jeff Moyer <jmoyer@redhat.com>
Signed-off-by: Jens Axboe <axboe@fb.com>
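
The effect of checking mergeability first can be sketched in user space. The types and names below (`model_bio`, `MODEL_REQ_NOMERGE`, `model_bio_mergeable`, `model_try_merge`) are hypothetical stand-ins for illustration, not the kernel definitions, and the size check is a simplified placeholder for the real merge rules. The `examined` counter makes the saved work visible.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MODEL_REQ_NOMERGE (1u << 0)   /* hypothetical flag bit */

struct model_bio {
    unsigned int flags;     /* MODEL_REQ_NOMERGE set once the bio was split */
    unsigned int sectors;
};

static bool model_bio_mergeable(const struct model_bio *bio)
{
    return !(bio->flags & MODEL_REQ_NOMERGE);
}

/*
 * Try to merge bio into one of the candidate requests (given by their
 * current size in sectors). *examined reports how many were scanned.
 */
static bool model_try_merge(const struct model_bio *bio,
                            const unsigned int *candidates, size_t n,
                            unsigned int max_sectors, size_t *examined)
{
    assert(bio != NULL && examined != NULL);
    *examined = 0;
    if (!model_bio_mergeable(bio))
        return false;       /* early exit: no candidate is scanned */
    for (size_t i = 0; i < n; i++) {
        (*examined)++;
        if (candidates[i] + bio->sectors <= max_sectors)
            return true;
    }
    return false;
}
```

For a bio flagged NOMERGE the function returns before looking at any candidate, which is the saving the message describes.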

blk-mq: check bio_mergeable() early before merging · e18378a6

Committed by Ming Lei on Oct 20, 2015

It isn't necessary to try to merge a bio that is marked
as NOMERGE.

Reviewed-by: Jeff Moyer <jmoyer@redhat.com>
Signed-off-by: Ming Lei <ming.lei@canonical.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

block: avoid to merge splitted bio · 6ac45aeb

Committed by Ming Lei on Oct 20, 2015

A split bio is already too large to merge, so mark it
as NOMERGE.

Reviewed-by: Jeff Moyer <jmoyer@redhat.com>
Signed-off-by: Ming Lei <ming.lei@canonical.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

block: setup bi_phys_segments after splitting · bdced438

Committed by Ming Lei on Oct 20, 2015

The value of bio->bi_phys_segments is always obtained
during bio splitting, so it is natural to set it up
right after bio splitting; then we can avoid computing
nr_segment again during merge.

Reviewed-by: Jeff Moyer <jmoyer@redhat.com>
Signed-off-by: Ming Lei <ming.lei@canonical.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

block: fix plug list flushing for nomerge queues · 0809e3ac

Committed by Jeff Moyer on Oct 20, 2015

Request queues with merging disabled will not flush the plug list after
BLK_MAX_REQUEST_COUNT requests have been queued, since the code relies
on blk_attempt_plug_merge to compute the request_count. Fix this by
computing the number of queued requests even for nomerge queues.

Signed-off-by: Jeff Moyer <jmoyer@redhat.com>
Signed-off-by: Jens Axboe <axboe@fb.com>
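
The failure mode can be reproduced in a small user-space model. Everything here is an assumption made for illustration: `model_plug`, `model_plug_queued_count`, `model_plug_add`, the threshold value 16 and the capacity 64 are hypothetical, and the `count_for_nomerge` switch simply selects between the behavior described as broken (the count stays 0 because the merge scan that would have produced it is skipped) and the described fix (count the queued requests anyway).

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MODEL_MAX_REQUEST_COUNT 16   /* threshold chosen for this model */
#define MODEL_PLUG_CAPACITY     64   /* array bound, model only */

struct model_plug {
    int queue_id[MODEL_PLUG_CAPACITY];  /* owning queue of each request */
    size_t len;
    unsigned int flushes;
};

/* Count the plugged requests that belong to one queue. */
static unsigned int model_plug_queued_count(const struct model_plug *plug,
                                            int queue_id)
{
    unsigned int count = 0;

    for (size_t i = 0; i < plug->len; i++)
        if (plug->queue_id[i] == queue_id)
            count++;
    return count;
}

/* Plug one request for a queue that has merging disabled. */
static void model_plug_add(struct model_plug *plug, int queue_id,
                           bool count_for_nomerge)
{
    unsigned int request_count;

    assert(plug != NULL);
    /* Without the fix nothing computes the count, so it stays 0. */
    request_count = count_for_nomerge
        ? model_plug_queued_count(plug, queue_id) : 0;

    /* Flush at the threshold; the capacity check only protects the array. */
    if (request_count >= MODEL_MAX_REQUEST_COUNT ||
        plug->len == MODEL_PLUG_CAPACITY) {
        plug->len = 0;
        plug->flushes++;
    }
    plug->queue_id[plug->len++] = queue_id;
}
```

Adding 40 requests with `count_for_nomerge` set to false never reaches the threshold in this model, while the same sequence with it set to true flushes each time 16 requests have accumulated.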

10 Oct, 2015 (2 commits)

blk-mq: remove unused blk_mq_clone_flush_request prototype · 3380f458

Committed by Christoph Hellwig on Oct 09, 2015

Signed-off-by: Christoph Hellwig <hch@lst.de>
Signed-off-by: Jens Axboe <axboe@fb.com>

blk-mq: fix waitqueue_active without memory barrier in block/blk-mq-tag.c · 8ee1b7b9

Committed by Kosuke Tatsukawa on Oct 09, 2015

blk_mq_tag_update_depth() seems to be missing a memory barrier which
might cause the waker to not notice the waiter and fail to send a
wake_up as in the following figure.

	blk_mq_tag_update_depth			bt_get
------------------------------------------------------------------------
if (waitqueue_active(&bs->wait))
/* The CPU might reorder the test for
   the waitqueue up here, before
   prior writes complete */
					prepare_to_wait(&bs->wait, &wait,
					  TASK_UNINTERRUPTIBLE);
					tag = __bt_get(hctx, bt, last_tag,
					  tags);
					/* Value set in bt_update_count not
					   visible yet */
bt_update_count(&tags->bitmap_tags, tdepth);
/* blk_mq_tag_wakeup_all(tags, false); */
 bt = &tags->bitmap_tags;
 wake_index = atomic_read(&bt->wake_index);
					...
					io_schedule();
------------------------------------------------------------------------

This patch adds the missing memory barrier.

I found this issue while looking through the Linux source code
for places that call waitqueue_active() before wake_up*() without a
preceding memory barrier, after sending a patch to fix a similar
issue in drivers/tty/n_tty.c. (Details about the original issue can be
found here: https://lkml.org/lkml/2015/9/28/849.)

Signed-off-by: Kosuke Tatsukawa <tatsu@ab.jp.nec.com>
Signed-off-by: Jens Axboe <axboe@fb.com>
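
The ordering requirement behind this fix can be modeled in user space with C11 atomics. This is a sketch under stated assumptions, not the kernel patch: `model_waitqueue`, `model_update_depth` and `model_prepare_and_check` are hypothetical names, and `atomic_thread_fence(memory_order_seq_cst)` stands in for the role a full memory barrier plays in the kernel. The waker publishes its write, fences, then tests for waiters; the waiter registers itself, fences, then re-checks the condition. With a full fence on both sides, at least one of them observes the other's write.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

struct model_waitqueue {
    atomic_bool has_waiter;  /* what a waitqueue_active()-style test reads */
    atomic_int  depth;       /* the value the waker updates */
    atomic_int  wakeups;     /* number of wake-ups issued */
};

/* Waker: update the value, then wake a waiter if one is registered. */
static bool model_update_depth(struct model_waitqueue *wq, int new_depth)
{
    assert(new_depth > 0);
    atomic_store_explicit(&wq->depth, new_depth, memory_order_relaxed);
    /* Full fence: the store above may not be reordered after the load below. */
    atomic_thread_fence(memory_order_seq_cst);
    if (atomic_load_explicit(&wq->has_waiter, memory_order_relaxed)) {
        atomic_fetch_add(&wq->wakeups, 1);
        return true;
    }
    return false;
}

/* Waiter: register first, then re-check. Returns true if it must sleep. */
static bool model_prepare_and_check(struct model_waitqueue *wq)
{
    atomic_store_explicit(&wq->has_waiter, true, memory_order_relaxed);
    /* Pairs with the waker's fence. */
    atomic_thread_fence(memory_order_seq_cst);
    return atomic_load_explicit(&wq->depth, memory_order_relaxed) == 0;
}
```

Dropping the fence in `model_update_depth` would allow the waiter test to be satisfied from a stale view, which corresponds to the missed wake-up shown in the figure above.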

01 Oct, 2015 (2 commits)

blk-mq: factor out a helper to iterate all tags for a request_queue · 0bf6cd5b

Committed by Christoph Hellwig on Sep 27, 2015

And replace blk_mq_tag_busy_iter with it: driver use was replaced with
a new helper a while ago, and internally the block layer only needs
the new version.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Signed-off-by: Jens Axboe <axboe@fb.com>

blk-mq: fix racy updates of rq->errors · f4829a9b

Committed by Christoph Hellwig on Sep 27, 2015

blk_mq_complete_request may be a no-op if the request has already
been completed by other means (e.g. a timeout or cancellation), but
currently drivers have to set rq->errors before calling
blk_mq_complete_request, which might leave us with the wrong error value.

Add an error parameter to blk_mq_complete_request so that we can
defer setting rq->errors until we know we won the race to complete the
request.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Sagi Grimberg <sagig@mellanox.com>
Signed-off-by: Jens Axboe <axboe@fb.com>
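
The "set the error only after winning the race" idea can be sketched in user space. The names below (`model_request`, `model_request_init`, `model_complete_request`) are hypothetical and the atomic flag is an assumption standing in for whatever mechanism decides which completer wins; the sketch only shows why passing the error as a parameter avoids overwriting the winner's value.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

struct model_request {
    atomic_flag completed;  /* set by whoever completes the request first */
    int errors;             /* written only by the winner */
};

static void model_request_init(struct model_request *rq)
{
    atomic_flag_clear(&rq->completed);
    rq->errors = 0;
}

/* Returns true if this caller won the race and recorded its error. */
static bool model_complete_request(struct model_request *rq, int error)
{
    assert(rq != NULL);
    if (atomic_flag_test_and_set(&rq->completed))
        return false;       /* already completed elsewhere: keep errors as is */
    rq->errors = error;
    return true;
}
```

If a timeout path completes the request with an error first, a later normal completion with error 0 becomes a no-op and the recorded error is preserved.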

30 Sep, 2015 (6 commits)

blk-mq: fix deadlock when reading cpu_list · 60de074b