提交 3e997379 编写于 作者: H Heinz Mauelshagen 提交者: Zheng Zengkai

dm raid: fix inconclusive reshape layout on fast raid4/5/6 table reload sequences

stable inclusion
from stable-5.10.36
commit 0cd2d2577a982863a65d1f7546771bb6547d92c5
bugzilla: 51867
CVE: NA

--------------------------------

commit f99a8e43 upstream.

If fast table reloads occur during an ongoing reshape of raid4/5/6
devices the target may race reading a superblock vs the the MD resync
thread; causing an inconclusive reshape state to be read in its
constructor.

lvm2 test lvconvert-raid-reshape-stripes-load-reload.sh can cause
BUG_ON() to trigger in md_run(), e.g.:
"kernel BUG at drivers/md/raid5.c:7567!".

Scenario triggering the bug:

1. the MD sync thread calls end_reshape() from raid5_sync_request()
   when done reshaping. However end_reshape() _only_ updates the
   reshape position to MaxSector keeping the changed layout
   configuration though (i.e. any delta disks, chunk sector or RAID
   algorithm changes). That inconclusive configuration is stored in
   the superblock.

2. dm-raid constructs a mapping, loading named inconsistent superblock
   as of step 1 before step 3 is able to finish resetting the reshape
   state completely, and calls md_run() which leads to mentioned bug
   in raid5.c.

3. the MD RAID personality's finish_reshape() is called; which resets
   the reshape information on chunk sectors, delta disks, etc. This
   explains why the bug is rarely seen on multi-core machines, as MD's
   finish_reshape() superblock update races with the dm-raid
   constructor's superblock load in step 2.

Fix identifies inconclusive superblock content in the dm-raid
constructor and resets it before calling md_run(), factoring out
identifying checks into rs_is_layout_change() to share in existing
rs_reshape_requested() and new rs_reset_inclonclusive_reshape(). Also
enhance a comment and remove an empty line.

Cc: stable@vger.kernel.org
Signed-off-by: NHeinz Mauelshagen <heinzm@redhat.com>
Signed-off-by: NMike Snitzer <snitzer@redhat.com>
Signed-off-by: NGreg Kroah-Hartman <gregkh@linuxfoundation.org>
Signed-off-by: NChen Jun <chenjun102@huawei.com>
Acked-by: NWeilong Chen <chenweilong@huawei.com>
Signed-off-by: NZheng Zengkai <zhengzengkai@huawei.com>
上级 aeefa199
...@@ -1869,6 +1869,14 @@ static bool rs_takeover_requested(struct raid_set *rs) ...@@ -1869,6 +1869,14 @@ static bool rs_takeover_requested(struct raid_set *rs)
return rs->md.new_level != rs->md.level; return rs->md.new_level != rs->md.level;
} }
/* True if layout is set to reshape. */
static bool rs_is_layout_change(struct raid_set *rs, bool use_mddev)
{
return (use_mddev ? rs->md.delta_disks : rs->delta_disks) ||
rs->md.new_layout != rs->md.layout ||
rs->md.new_chunk_sectors != rs->md.chunk_sectors;
}
/* True if @rs is requested to reshape by ctr */ /* True if @rs is requested to reshape by ctr */
static bool rs_reshape_requested(struct raid_set *rs) static bool rs_reshape_requested(struct raid_set *rs)
{ {
...@@ -1881,9 +1889,7 @@ static bool rs_reshape_requested(struct raid_set *rs) ...@@ -1881,9 +1889,7 @@ static bool rs_reshape_requested(struct raid_set *rs)
if (rs_is_raid0(rs)) if (rs_is_raid0(rs))
return false; return false;
change = mddev->new_layout != mddev->layout || change = rs_is_layout_change(rs, false);
mddev->new_chunk_sectors != mddev->chunk_sectors ||
rs->delta_disks;
/* Historical case to support raid1 reshape without delta disks */ /* Historical case to support raid1 reshape without delta disks */
if (rs_is_raid1(rs)) { if (rs_is_raid1(rs)) {
...@@ -2818,7 +2824,7 @@ static sector_t _get_reshape_sectors(struct raid_set *rs) ...@@ -2818,7 +2824,7 @@ static sector_t _get_reshape_sectors(struct raid_set *rs)
} }
/* /*
* * Reshape:
* - change raid layout * - change raid layout
* - change chunk size * - change chunk size
* - add disks * - add disks
...@@ -2927,6 +2933,20 @@ static int rs_setup_reshape(struct raid_set *rs) ...@@ -2927,6 +2933,20 @@ static int rs_setup_reshape(struct raid_set *rs)
return r; return r;
} }
/*
* If the md resync thread has updated superblock with max reshape position
* at the end of a reshape but not (yet) reset the layout configuration
* changes -> reset the latter.
*/
static void rs_reset_inconclusive_reshape(struct raid_set *rs)
{
if (!rs_is_reshaping(rs) && rs_is_layout_change(rs, true)) {
rs_set_cur(rs);
rs->md.delta_disks = 0;
rs->md.reshape_backwards = 0;
}
}
/* /*
* Enable/disable discard support on RAID set depending on * Enable/disable discard support on RAID set depending on
* RAID level and discard properties of underlying RAID members. * RAID level and discard properties of underlying RAID members.
...@@ -3213,11 +3233,14 @@ static int raid_ctr(struct dm_target *ti, unsigned int argc, char **argv) ...@@ -3213,11 +3233,14 @@ static int raid_ctr(struct dm_target *ti, unsigned int argc, char **argv)
if (r) if (r)
goto bad; goto bad;
/* Catch any inconclusive reshape superblock content. */
rs_reset_inconclusive_reshape(rs);
/* Start raid set read-only and assumed clean to change in raid_resume() */ /* Start raid set read-only and assumed clean to change in raid_resume() */
rs->md.ro = 1; rs->md.ro = 1;
rs->md.in_sync = 1; rs->md.in_sync = 1;
/* Keep array frozen */ /* Keep array frozen until resume. */
set_bit(MD_RECOVERY_FROZEN, &rs->md.recovery); set_bit(MD_RECOVERY_FROZEN, &rs->md.recovery);
/* Has to be held on running the array */ /* Has to be held on running the array */
...@@ -3231,7 +3254,6 @@ static int raid_ctr(struct dm_target *ti, unsigned int argc, char **argv) ...@@ -3231,7 +3254,6 @@ static int raid_ctr(struct dm_target *ti, unsigned int argc, char **argv)
} }
r = md_start(&rs->md); r = md_start(&rs->md);
if (r) { if (r) {
ti->error = "Failed to start raid array"; ti->error = "Failed to start raid array";
mddev_unlock(&rs->md); mddev_unlock(&rs->md);
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册