提交 b406c8f5 编写于 作者: M Mike Snitzer 提交者: Xie XiuQi

dm: fix redundant IO accounting for bios that need splitting

mainline inclusion
from mainline-5.0-rc4
commit a1e1cb72d96491277ede8d257ce6b48a381dd336
category: bugfix
bugzilla: 18695
CVE: NA

---------------------------

The risk of redundant IO accounting was not taken into consideration
when commit 18a25da8 ("dm: ensure bio submission follows a
depth-first tree walk") introduced IO splitting in terms of recursion
via generic_make_request().

Fix this by subtracting the split bio's payload from the IO stats that
were already accounted for by start_io_acct() upon dm_make_request()
entry.  This repeat oscillation of the IO accounting, up then down,
isn't ideal but refactoring DM core's IO splitting to pre-split bios
_before_ they are accounted turned out to be an excessive amount of
change that will need a full development cycle to refine and verify.

Before this fix:

  /dev/mapper/stripe_dev is a 4-way stripe using a 32k chunksize, so
  bios are split on 32k boundaries.

  # fio --name=16M --filename=/dev/mapper/stripe_dev --rw=write --bs=64k --size=16M \
    	--iodepth=1 --ioengine=libaio --direct=1 --refill_buffers

  with debugging added:
  [103898.310264] device-mapper: core: start_io_acct: dm-2 WRITE bio->bi_iter.bi_sector=0 len=128
  [103898.318704] device-mapper: core: __split_and_process_bio: recursing for following split bio:
  [103898.329136] device-mapper: core: start_io_acct: dm-2 WRITE bio->bi_iter.bi_sector=64 len=64
  ...

  16M written yet 136M (278528 * 512b) accounted:
  # cat /sys/block/dm-2/stat | awk '{ print $7 }'
  278528

After this fix:

  16M written and 16M (32768 * 512b) accounted:
  # cat /sys/block/dm-2/stat | awk '{ print $7 }'
  32768

Conflicts:
  drivers/md/dm.c

Fixes: 18a25da8 ("dm: ensure bio submission follows a depth-first tree walk")
Cc: stable@vger.kernel.org # 4.16+
Reported-by: NBryan Gurney <bgurney@redhat.com>
Reviewed-by: NMing Lei <ming.lei@redhat.com>
Signed-off-by: NMike Snitzer <snitzer@redhat.com>
Signed-off-by: NZhihao Cheng <chengzhihao1@huawei.com>
Reviewed-by: NZhang Xiaoxu <zhangxiaoxu5@huawei.com>
Signed-off-by: NYang Yingliang <yangyingliang@huawei.com>
上级 fa23f552
...@@ -1581,6 +1581,8 @@ static void init_clone_info(struct clone_info *ci, struct mapped_device *md, ...@@ -1581,6 +1581,8 @@ static void init_clone_info(struct clone_info *ci, struct mapped_device *md,
ci->sector = bio->bi_iter.bi_sector; ci->sector = bio->bi_iter.bi_sector;
} }
#define __dm_part_stat_sub(part, field, subnd) \
(part_stat_get(part, field) -= (subnd))
/* /*
* Entry point to split a bio into clones and submit them to the targets. * Entry point to split a bio into clones and submit them to the targets.
*/ */
...@@ -1626,7 +1628,6 @@ static blk_qc_t __split_and_process_bio(struct mapped_device *md, ...@@ -1626,7 +1628,6 @@ static blk_qc_t __split_and_process_bio(struct mapped_device *md,
* the usage of io->orig_bio in dm_remap_zone_report() * the usage of io->orig_bio in dm_remap_zone_report()
* won't be affected by this reassignment. * won't be affected by this reassignment.
*/ */
int cpu;
struct bio *b = bio_split(bio, bio_sectors(bio) - ci.sector_count, struct bio *b = bio_split(bio, bio_sectors(bio) - ci.sector_count,
GFP_NOIO, &md->queue->bio_split); GFP_NOIO, &md->queue->bio_split);
ci.io->orig_bio = b; ci.io->orig_bio = b;
...@@ -1638,9 +1639,9 @@ static blk_qc_t __split_and_process_bio(struct mapped_device *md, ...@@ -1638,9 +1639,9 @@ static blk_qc_t __split_and_process_bio(struct mapped_device *md,
* significant refactoring of DM core's bio splitting * significant refactoring of DM core's bio splitting
* (by eliminating DM's splitting and just using bio_split) * (by eliminating DM's splitting and just using bio_split)
*/ */
cpu = part_stat_lock(); part_stat_lock();
__part_stat_add(cpu, &dm_disk(md)->part0, __dm_part_stat_sub(&dm_disk(md)->part0,
sectors[op_stat_group(bio_op(bio))], -(ci.sector_count)); sectors[op_stat_group(bio_op(bio))], ci.sector_count);
part_stat_unlock(); part_stat_unlock();
bio_chain(b, bio); bio_chain(b, bio);
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册