Commit 5dc0acc2, author: Ming Lei, committer: Jeffle Xu

blk-mq: balance mapping between present CPUs and queues

fix #27417914

commit 556f36e90dbe7dded81f4fac084d2bc8a2458330 upstream

Spread queues among present CPUs first, then build the mapping for the
remaining non-present CPUs.

This minimizes the number of dead queues, i.e. queues that are mapped only
by non-present CPUs. The poor IO performance caused by an unbalanced mapping
between present CPUs and queues can then be avoided.

A similar policy is already applied to managed IRQ affinity.

Cc: Yi Zhang <yi.zhang@redhat.com>
Reported-by: Yi Zhang <yi.zhang@redhat.com>
Reviewed-by: Bob Liu <bob.liu@oracle.com>
Signed-off-by: Ming Lei <ming.lei@redhat.com>
Signed-off-by: Jens Axboe <axboe@kernel.dk>
[jeffle: remove code supporting multiple queue maps, which is merged since v5.0]
Signed-off-by: Jeffle Xu <jefflexu@linux.alibaba.com>
Reviewed-by: Joseph Qi <joseph.qi@linux.alibaba.com>
Parent 9686c568
@@ -14,9 +14,9 @@
 #include "blk.h"
 #include "blk-mq.h"
 
-static int cpu_to_queue_index(unsigned int nr_queues, const int cpu)
+static int queue_index(unsigned int nr_queues, const int q)
 {
-	return cpu % nr_queues;
+	return q % nr_queues;
 }
 
 static int get_first_sibling(unsigned int cpu)
@@ -34,9 +34,24 @@ int blk_mq_map_queues(struct blk_mq_tag_set *set)
 {
 	unsigned int *map = set->mq_map;
 	unsigned int nr_queues = set->nr_hw_queues;
-	unsigned int cpu, first_sibling;
+	unsigned int cpu, first_sibling, q = 0;
 
+	for_each_possible_cpu(cpu)
+		map[cpu] = -1;
+
+	/*
+	 * Spread queues among present CPUs first for minimizing
+	 * count of dead queues which are mapped by all un-present CPUs
+	 */
+	for_each_present_cpu(cpu) {
+		if (q >= nr_queues)
+			break;
+		map[cpu] = queue_index(nr_queues, q++);
+	}
+
 	for_each_possible_cpu(cpu) {
+		if (map[cpu] != -1)
+			continue;
 		/*
 		 * First do sequential mapping between CPUs and queues.
 		 * In case we still have CPUs to map, and we have some number of
@@ -44,11 +59,11 @@ int blk_mq_map_queues(struct blk_mq_tag_set *set)
 		 * performace optimizations.
 		 */
 		if (cpu < nr_queues) {
-			map[cpu] = cpu_to_queue_index(nr_queues, cpu);
+			map[cpu] = queue_index(nr_queues, q++);
 		} else {
 			first_sibling = get_first_sibling(cpu);
 			if (first_sibling == cpu)
-				map[cpu] = cpu_to_queue_index(nr_queues, cpu);
+				map[cpu] = queue_index(nr_queues, q++);
 			else
 				map[cpu] = map[first_sibling];
 		}
......
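To make the effect of the two-pass mapping easier to see, the following is a minimal userspace sketch, not kernel code. It assumes a hypothetical machine with `NR_CPUS` possible CPUs and no SMT siblings (so every CPU is its own first sibling and both branches of the second loop reduce to `queue_index(nr_queues, q++)`). The `present` array, the `UNMAPPED` constant, and the helpers `map_queues_old()`, `map_queues_new()` and `count_dead_queues()` are illustrative names that do not appear in the patch.

```c
#include <stdbool.h>

#define NR_CPUS  8                    /* hypothetical number of possible CPUs */
#define UNMAPPED ((unsigned int)-1)   /* stands in for map[cpu] = -1 */

/* Same arithmetic as queue_index() in the patch. */
static unsigned int queue_index(unsigned int nr_queues, unsigned int q)
{
	return q % nr_queues;
}

/* Pre-patch policy, assuming no SMT siblings: derive the queue from the CPU number. */
static void map_queues_old(unsigned int *map, unsigned int nr_queues)
{
	for (unsigned int cpu = 0; cpu < NR_CPUS; cpu++)
		map[cpu] = cpu % nr_queues;
}

/* Post-patch policy, assuming no SMT siblings: present CPUs are served first. */
static void map_queues_new(unsigned int *map, const bool *present,
			   unsigned int nr_queues)
{
	unsigned int cpu, q = 0;

	for (cpu = 0; cpu < NR_CPUS; cpu++)
		map[cpu] = UNMAPPED;

	/* Pass 1: hand out each queue once, to present CPUs only. */
	for (cpu = 0; cpu < NR_CPUS; cpu++) {
		if (!present[cpu])
			continue;
		if (q >= nr_queues)
			break;
		map[cpu] = queue_index(nr_queues, q++);
	}

	/* Pass 2: map every CPU that is still unmapped, continuing from q. */
	for (cpu = 0; cpu < NR_CPUS; cpu++) {
		if (map[cpu] != UNMAPPED)
			continue;
		map[cpu] = queue_index(nr_queues, q++);
	}
}

/* A queue is "dead" when no present CPU maps to it. */
static unsigned int count_dead_queues(const unsigned int *map,
				      const bool *present,
				      unsigned int nr_queues)
{
	unsigned int dead = 0;

	for (unsigned int q = 0; q < nr_queues; q++) {
		bool alive = false;

		for (unsigned int cpu = 0; cpu < NR_CPUS; cpu++)
			if (present[cpu] && map[cpu] == q)
				alive = true;
		if (!alive)
			dead++;
	}
	return dead;
}
```

With 4 queues and CPUs 0, 1, 4 and 5 present, `map_queues_old()` sends the present CPUs to queues 0, 1, 0 and 1, so `count_dead_queues()` reports 2 dead queues. `map_queues_new()` sends them to queues 0, 1, 2 and 3, leaving no dead queue; the non-present CPUs 2, 3, 6 and 7 are then mapped to queues 0, 1, 2 and 3 in the second pass.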