Commits · 89343da077ad564ed130c46e5ea6a79388410fa5 · openeuler / Kernel

10 Oct, 2008 15 commits

Committed by Mikulas Patocka on Oct 10, 2008

Publish dm_get_mapinfo in include/linux/device-mapper.h because this function
is used by targets.
Signed-off-by: Mikulas Patocka <mpatocka@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

89343da0

dm: export struct dm_dev · 82b1519b

Committed by Mikulas Patocka on Oct 10, 2008

Split struct dm_dev in two and publish the part that other targets need in
include/linux/device-mapper.h.
Signed-off-by: Mikulas Patocka <mpatocka@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

82b1519b

dm crypt: avoid unnecessary wait when splitting bio · 933f01d4

Committed by Milan Broz on Oct 10, 2008

Don't wait between submitting crypt requests for a bio unless
we are short of memory.

There are two situations when we must split an encrypted bio:
  1) there are no free pages;
  2) the new bio would violate underlying device restrictions
(e.g. max hw segments).

In case (2) we do not need to wait.

Add output variable to crypt_alloc_buffer() to distinguish between
these cases.
Signed-off-by: Milan Broz <mbroz@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

933f01d4
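
The idea of an output variable that says why an allocation came up short can be sketched in userspace C. This is only an illustration: `sketch_alloc_buffer`, `sketch_must_wait` and their parameters are hypothetical names, not the dm-crypt functions, and pages are modelled as plain counters.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/*
 * Hypothetical allocator: hands out up to want_pages pages, bounded by the
 * free pool and by a device segment limit. *out_of_pages reports WHY the
 * result was short, which the return value alone cannot express.
 */
static size_t sketch_alloc_buffer(size_t want_pages, size_t free_pages,
                                  size_t max_segments, bool *out_of_pages)
{
    size_t got = 0;

    assert(out_of_pages != NULL);
    *out_of_pages = false;

    while (got < want_pages) {
        if (got == max_segments)
            break;                 /* case (2): device restriction */
        if (got == free_pages) {
            *out_of_pages = true;  /* case (1): no free pages */
            break;
        }
        got++;
    }
    return got;
}

/* The submitter only needs to wait between fragments in case (1). */
static bool sketch_must_wait(size_t want_pages, size_t got, bool out_of_pages)
{
    return got < want_pages && out_of_pages;
}
```

With a segment limit of 4 and plenty of free pages, a request for 8 pages is split without waiting; with only 2 free pages the same request reports memory pressure and the caller waits.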

dm crypt: tidy ctx pending · c8081618

Committed by Milan Broz on Oct 10, 2008

Move the initialisation of ctx->pending into one place, at the
start of crypt_convert().

Introduce crypt_finished to indicate whether or not the encryption
is finished, for use in a later patch.

No functional change.
Signed-off-by: Milan Broz <mbroz@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

c8081618

dm crypt: fix async inc_pending · 4e594098

Committed by Milan Broz on Oct 10, 2008

The pending reference count must be incremented *before* the async work is
queued to another thread, not after. Otherwise there's a race if the
work completes and decrements the reference count before it gets incremented.
Signed-off-by: Milan Broz <mbroz@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

4e594098
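
The ordering rule in the inc_pending fix can be illustrated with a small userspace sketch. All names here (`sketch_io`, `run_async_work`, `submit_inc_first`, `submit_inc_late`) are hypothetical and not dm-crypt's. The "queue" simply runs the work at once, which models a worker that finishes before the submitter continues.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical stand-in for a per-bio I/O context with a reference count. */
struct sketch_io {
    atomic_int pending;   /* outstanding references */
    bool released;        /* set when the last reference is dropped */
};

static void sketch_io_init(struct sketch_io *io)
{
    atomic_init(&io->pending, 1);   /* the submitter's own reference */
    io->released = false;
}

static void sketch_dec_pending(struct sketch_io *io)
{
    /* atomic_fetch_sub returns the old value; 1 means the last ref went away */
    if (atomic_fetch_sub(&io->pending, 1) == 1)
        io->released = true;
}

/* Models async work that completes immediately and drops its reference. */
static void run_async_work(struct sketch_io *io)
{
    sketch_dec_pending(io);
}

/* Correct order: take the reference, then hand the work off. */
static void submit_inc_first(struct sketch_io *io)
{
    assert(atomic_load(&io->pending) > 0);
    atomic_fetch_add(&io->pending, 1);
    run_async_work(io);
}

/* Racy order: the work can drop the count to zero before the increment. */
static void submit_inc_late(struct sketch_io *io)
{
    run_async_work(io);
    atomic_fetch_add(&io->pending, 1);
}
```

In the late-increment variant the context is marked released while the submitter still believes it holds a reference, which is the kind of race the commit message describes.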

dm crypt: move dec_pending on error into write_io_submit · 6c031f41

Committed by Milan Broz on Oct 10, 2008

Make kcryptd_crypt_write_io_submit() responsible for decrementing
the pending count after an error.

Also fixes a bug in the async path that forgot to decrement it.
Signed-off-by: Milan Broz <mbroz@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

6c031f41

dm crypt: remove inc_pending from write_io_submit · 1e37bb8e

Committed by Alasdair G Kergon on Oct 10, 2008

Make the caller responsible for incrementing the pending count before calling
kcryptd_crypt_write_io_submit() in the non-async case to bring it into line
with the async case.
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

1e37bb8e

dm crypt: tidy write loop pending · fc5a5e9a

Committed by Milan Broz on Oct 10, 2008

Move kcryptd_crypt_write_convert_loop inside kcryptd_crypt_write_convert.
This change is needed for a later patch.

No functional change.
Signed-off-by: Milan Broz <mbroz@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

fc5a5e9a

dm crypt: tidy crypt alloc · dc440d1e

Committed by Milan Broz on Oct 10, 2008

Factor out crypt io allocation code.
Later patches will call it from another place.

No functional change.
Signed-off-by: Milan Broz <mbroz@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

dc440d1e

dm crypt: tidy inc pending · 3e1a8bdd

Committed by Milan Broz on Oct 10, 2008

Move io pending to one place.

No functional change, useful to simplify debugging.
Signed-off-by: Milan Broz <mbroz@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

3e1a8bdd

dm exception store: use chunk_t for_areas · fd14acf6

Committed by Mikulas Patocka on Oct 10, 2008

Change uint32_t into chunk_t to remove 32-bit limitation on the
number of chunks on systems with 64-bit sector numbers.
Signed-off-by: Mikulas Patocka <mpatocka@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

fd14acf6
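
The 32-bit limitation mentioned in the chunk_t change can be shown with plain integer arithmetic. In this sketch `sketch_chunk_t` is assumed to be 64 bits wide, and the device and chunk sizes are made-up values chosen only to force a truncation.

```c
#include <assert.h>
#include <stdint.h>

typedef uint64_t sketch_chunk_t;   /* assumed 64-bit, like a 64-bit sector type */

/* Number of whole chunks on a device, computed in 64 bits. */
static sketch_chunk_t sketch_nr_chunks(uint64_t dev_sectors, uint32_t chunk_sectors)
{
    assert(chunk_sectors != 0);
    return dev_sectors / chunk_sectors;
}

/* The same value squeezed through a 32-bit variable. */
static uint32_t sketch_nr_chunks_32(uint64_t dev_sectors, uint32_t chunk_sectors)
{
    return (uint32_t)sketch_nr_chunks(dev_sectors, chunk_sectors);
}
```

For a device of 2^40 sectors with 8-sector chunks the true count is 2^37, which does not fit in 32 bits, so the narrow variant silently wraps while the 64-bit type keeps the full value.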

dm exception store: introduce area_location function · a481db78

Committed by Mikulas Patocka on Oct 10, 2008

Move this logic to a function, because it will be reused later.
Signed-off-by: Mikulas Patocka <mpatocka@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

a481db78

dm raid1: kcopyd should stop on error if errors handled · f7c83e2e

Committed by Jonathan Brassow on Oct 10, 2008

dm-raid1 is setting the 'DM_KCOPYD_IGNORE_ERROR' flag unconditionally
when assigning kcopyd work.  kcopyd is responsible for copying an
assigned section of disk to one or more other disks.  The
'DM_KCOPYD_IGNORE_ERROR' flag affects kcopyd in the following way:

When not set:
kcopyd will immediately stop the copy operation when an error is
encountered.

When set:
kcopyd will try to proceed regardless of errors and try to continue
copying any remaining amount.

Since dm-raid1 tracks regions of the address space that are (or
are not) in sync and it now has the ability to handle these
errors, we can safely enable this optimization.  This optimization
is conditional on whether mirror error handling has been enabled.
Signed-off-by: Jonathan Brassow <jbrassow@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

f7c83e2e
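
The two behaviours described for the ignore-error flag, and the conditional choice of that flag, can be modelled as follows. This is a userspace sketch under stated assumptions: `SKETCH_IGNORE_ERROR`, `sketch_copy_flags` and `sketch_copy` are hypothetical, the flag value is arbitrary, and a "copy" is just a walk over an array of per-block success markers.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Illustrative flag bit for this sketch only. */
#define SKETCH_IGNORE_ERROR (1u << 0)

/* Pick copy flags: ignore errors only when the mirror cannot handle them. */
static unsigned sketch_copy_flags(bool errors_handled)
{
    return errors_handled ? 0u : SKETCH_IGNORE_ERROR;
}

/*
 * Copy n blocks; block_ok[i] says whether block i copies cleanly.
 * Returns how many blocks were attempted.
 */
static size_t sketch_copy(const bool *block_ok, size_t n, unsigned flags)
{
    size_t attempted = 0;

    assert(block_ok != NULL || n == 0);
    for (size_t i = 0; i < n; i++) {
        attempted++;
        if (!block_ok[i] && !(flags & SKETCH_IGNORE_ERROR))
            break;   /* flag not set: stop at the first error */
    }
    return attempted;
}
```

When error handling is enabled the copy stops at the first bad block and leaves the remainder to the caller's region tracking; otherwise it keeps going over the remaining blocks.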

dm mpath: remove is_active from struct dm_path · 6680073d

Committed by Kiyoshi Ueda on Oct 10, 2008

This patch moves 'is_active' from struct dm_path to struct pgpath
as it does not need exporting.
Signed-off-by: Kiyoshi Ueda <k-ueda@ct.jp.nec.com>
Signed-off-by: Jun'ichi Nomura <j-nomura@ce.jp.nec.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

6680073d

dm mpath: use more error codes · 01460f35

Committed by Benjamin Marzinski on Oct 10, 2008

This patch allows path errors from the multipath ctr function to
propagate up to userspace as errno values from the ioctl() call.

This is in response to
  https://www.redhat.com/archives/dm-devel/2008-May/msg00000.html
and
  https://bugzilla.redhat.com/show_bug.cgi?id=444421

The patch only lets through the errors that it needs to in order to
get the path errors from parse_path().
Signed-off-by: Benjamin Marzinski <bmarzins@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

01460f35

01 Oct, 2008 3 commits

dm mpath: add missing path switching locking · 7253a334

Committed by Chandra Seetharaman on Oct 01, 2008

Moving the path activation to workqueue along with scsi_dh patches introduced
a race. It is due to the fact that the current_pgpath (in the multipath data
structure) can be modified if changes happen in any of the paths leading to
the lun. If the changes lead to current_pgpath being set to NULL, then it
leads to the invalid access which results in the panic below.

This patch fixes that by storing the pgpath to activate in the multipath data
structure and properly protecting it.

Note that if activate_path is called twice in succession with different pgpath,
with the second one being called before the first one is done, then activate
path will be called twice for the second pgpath, which is fine.

Unable to handle kernel paging request for data at address 0x00000020
Faulting instruction address: 0xd000000000aa1844
cpu 0x1: Vector: 300 (Data Access) at [c00000006b987a80]
    pc: d000000000aa1844: .activate_path+0x30/0x218 [dm_multipath]
    lr: c000000000087a2c: .run_workqueue+0x114/0x204
    sp: c00000006b987d00
   msr: 8000000000009032
   dar: 20
 dsisr: 40000000
  current = 0xc0000000676bb3f0
  paca    = 0xc0000000006f3680
    pid   = 2528, comm = kmpath_handlerd
enter ? for help
[c00000006b987da0] c000000000087a2c .run_workqueue+0x114/0x204
[c00000006b987e40] c000000000088b58 .worker_thread+0x120/0x144
[c00000006b987f00] c00000000008ca70 .kthread+0x78/0xc4
[c00000006b987f90] c000000000027cc8 .kernel_thread+0x4c/0x68
Signed-off-by: Chandra Seetharaman <sekharan@us.ibm.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

7253a334

dm: cope with access beyond end of device in dm_merge_bvec · b01cd5ac

Committed by Mikulas Patocka on Oct 01, 2008

If for any reason dm_merge_bvec() is given an offset beyond the end of the
device, avoid an oops and always allow one page to be added to an empty bio.
We'll reject the I/O later after the bio is submitted.
Signed-off-by: Mikulas Patocka <mpatocka@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

b01cd5ac

dm: always allow one page in dm_merge_bvec · 5037108a

Committed by Mikulas Patocka on Oct 01, 2008

Some callers assume they can always add at least one page to an empty bio,
so dm_merge_bvec should not return 0 in this case: we'll reject the I/O
later after the bio is submitted.
Signed-off-by: Mikulas Patocka <mpatocka@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

5037108a
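
The "always allow one page into an empty bio" rule from the two dm_merge_bvec commits can be sketched as a small helper. The function name and its parameters are hypothetical; the real merge function works on kernel bio structures, which are reduced here to byte counts.

```c
#include <assert.h>
#include <stddef.h>

/*
 * Hypothetical merge helper. max_size is how many bytes the target would
 * accept, bio_size is what the bio already holds, and page_len is the
 * size of the page being added. Returns the number of bytes allowed.
 */
static size_t sketch_merge_limit(size_t max_size, size_t bio_size, size_t page_len)
{
    assert(page_len > 0);

    /* An empty bio must always be able to take its first page. */
    if (bio_size == 0 && max_size < page_len)
        return page_len;

    return max_size;
}
```

A limit of zero is therefore only ever reported for a bio that already contains data; an oversized or out-of-range first page is accepted here and is expected to be rejected later, once the bio is submitted.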

19 Sep, 2008 1 commit

md: Don't wait UNINTERRUPTIBLE for other resync to finish · 9744197c

Committed by NeilBrown on Sep 19, 2008

When two md arrays share some block device (e.g. each uses different
partitions on the one device), a resync of one array will wait for
the resync on the other to finish.

This can be a long time and as it currently waits TASK_UNINTERRUPTIBLE,
the softlockup code notices and complains.

So use TASK_INTERRUPTIBLE instead and make sure to flush signals
before calling schedule.
Signed-off-by: NeilBrown <neilb@suse.de>

9744197c

01 Sep, 2008 2 commits

Fix problem with waiting while holding rcu read lock in md/bitmap.c · b2d2c4ce

Committed by NeilBrown on Sep 01, 2008

A recent patch to protect the rdev list with rcu locking leaves us
with a problem because we can sleep on memalloc while holding the
rcu lock.

The rcu lock is only needed while walking the linked list as
uninteresting devices (failed or spares) can be removed at any time.

So only take the rcu lock while actually walking the linked list.
Take a refcount on the rdev during the time when we drop the lock
and do the memalloc to start IO.
When we return to the locked code, all the interesting devices
on the list will not have moved, so we can simply use
list_for_each_continue_rcu to pick up where we left off.
Signed-off-by: NeilBrown <neilb@suse.de>

b2d2c4ce

Remove invalidate_partition call from do_md_stop. · 271f5a9b

Committed by NeilBrown on Sep 01, 2008

When stopping an md array, or just switching to read-only, we
currently call invalidate_partition while holding the mddev lock.
The main reason for this is probably to ensure all dirty buffers
are flushed (invalidate_partition calls fsync_bdev).

However if any dirty buffers are found, it will almost certainly cause
a deadlock as starting writeout will require an update to the
superblock, and performing that update requires taking the mddev
lock - which is already held.

This deadlock can be demonstrated by running "reboot -f -n" with
a root filesystem on md/raid, and some dirty buffers in memory.

All other calls to stop an array should already happen after a flush.
The normal sequence is to stop using the array (e.g. umount) which
will cause __blkdev_put to call sync_blockdev.  Then open the
array and issue the STOP_ARRAY ioctl while the buffers are all still
clean.

So this invalidate_partition is normally a no-op, except for one case
where it will cause a deadlock.

So remove it.

This patch possibly addresses the regression recorded in
   http://bugzilla.kernel.org/show_bug.cgi?id=11460
and
   http://bugzilla.kernel.org/show_bug.cgi?id=11452

though it isn't yet clear how it ever worked.
Signed-off-by: NeilBrown <neilb@suse.de>

271f5a9b

08 Aug, 2008 1 commit

md: cancel check/repair requests when recovery is needed · 56ac36d7

Committed by Dan Williams on Aug 07, 2008

If a 'repair' is requested when an array is in a position to 'recover', raid1
will perform the repair while md believes a recovery is happening.  Address
this at both ends, i.e. cancel check/repair requests upon detecting a
recover condition and do not call ->spare_active after completing a
check/repair.
Signed-off-by: Dan Williams <dan.j.williams@intel.com>

56ac36d7

05 Aug, 2008 6 commits

Allow raid10 resync to happen in larger chunks. · 0310fa21

Committed by NeilBrown on Aug 05, 2008

The raid10 resync/recovery code currently limits the amount of
in-flight resync IO to 2Meg.  This was copied from raid1 where
it seems quite adequate.  However for raid10, some layouts require
a bit of seeking to perform a resync, and allowing a larger buffer
size means that the seeking can be significantly reduced.

There is probably no real need to limit the amount of in-flight
IO at all.  Any shortage of memory will naturally reduce the
amount of buffer space available down to a set minimum, and any
concurrent normal IO will quickly cause resync IO to back off.

The only problem would be that normal IO has to wait for all resync IO
to finish, so a very large amount of resync IO could cause unpleasant
latency when normal IO starts up.

So: increase RESYNC_DEPTH to allow 32Meg of buffer (if memory is
available) which seems to be a good amount.  Also reduce the amount
of memory reserved as there is no need to keep 2Meg just for resync if
memory is tight.

Thanks to Keld for the suggestion.

Cc: Keld Jørn Simonsen <keld@dkuug.dk>
Signed-off-by: NeilBrown <neilb@suse.de>

0310fa21

Allow faulty devices to be removed from a readonly array. · c89a8eee

Committed by NeilBrown on Aug 05, 2008

Removing faulty devices from an array is a two stage process.
First the device is moved from being a part of the active array
to being similar to a spare device.  Then it can be removed
by a request from user space.

The first step is currently not performed for read-only arrays,
so the second step can never succeed.

So allow readonly arrays to remove failed devices (which aren't
blocked).
Signed-off-by: NeilBrown <neilb@suse.de>

c89a8eee

Don't let a blocked_rdev interfere with read request in raid5/6 · ac4090d2

Committed by NeilBrown on Aug 05, 2008

When we have externally managed metadata, we need to mark a failed
device as 'Blocked' and not allow any writes until that device
has been marked as faulty in the metadata and the Blocked flag has
been removed.

However it is perfectly OK to allow read requests when there is a
Blocked device, and with a readonly array, there may not be any
metadata-handler watching for blocked devices.

So in raid5/raid6 only allow a Blocked device to interfere with
Write request or resync.  Read requests go through untouched.

raid1 and raid10 already differentiate between read and write
properly.
Signed-off-by: NeilBrown <neilb@suse.de>

ac4090d2

Fail safely when trying to grow an array with a write-intent bitmap. · dba034ee

Committed by NeilBrown on Aug 05, 2008

We cannot currently change the size of a write-intent bitmap.
So if we change the size of an array which has such a bitmap, it
tries to set bits beyond the end of the bitmap.

For now, simply reject any request to change the size of an array
which has a bitmap.  mdadm can remove the bitmap and add a new one
after the array has changed size.
Signed-off-by: NeilBrown <neilb@suse.de>

dba034ee

Restore force switch of md array to readonly at reboot time. · 2b25000b

Committed by NeilBrown on Aug 05, 2008

A recent patch allowed do_md_stop to know whether it was being called
via an ioctl or not, and thus whether to allow for an extra open file
descriptor when checking if it is in use.
This broke the switch to readonly performed by the shutdown notifier,
which needs to work even when the array is still (apparently) active
(as md doesn't get told when the filesystem becomes readonly).

So restore this feature by pretending that there can be lots of
file descriptors open, but we still want do_md_stop to switch to
readonly.
Signed-off-by: NeilBrown <neilb@suse.de>

2b25000b

Make writes to md/safe_mode_delay immediately effective. · 19052c0e

Committed by NeilBrown on Aug 05, 2008

If we reduce the 'safe_mode_delay', it could still wait for the old
delay to completely expire before doing anything about safe_mode.
Thus the effect of the change is delayed.

To make the effect more immediate, run the timeout function
immediately if the delay was reduced.  This may cause it to run
slightly earlier than required, but that is the safer option.
Signed-off-by: NeilBrown <neilb@suse.de>

19052c0e
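
The safe_mode_delay behaviour described above amounts to "store the new value, and fire the timeout now if the value went down". A minimal sketch, with a hypothetical `sketch_safemode` structure and a counter standing in for the real timeout handler:

```c
#include <assert.h>
#include <stddef.h>

struct sketch_safemode {
    unsigned long delay;     /* current delay, arbitrary time units */
    unsigned timeouts_run;   /* how often the timeout handler ran */
};

/* Stand-in for the timeout function; it only counts invocations. */
static void sketch_timeout(struct sketch_safemode *s)
{
    s->timeouts_run++;
}

/* Store a new delay; if it shrank, run the timeout handler right away. */
static void sketch_set_delay(struct sketch_safemode *s, unsigned long new_delay)
{
    unsigned long old_delay;

    assert(s != NULL);
    old_delay = s->delay;
    s->delay = new_delay;
    if (new_delay < old_delay)
        sketch_timeout(s);
}
```

Increasing the delay leaves the pending timer alone, while decreasing it triggers the handler once, possibly a little early.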

02 Aug, 2008 1 commit

md: the bitmap code needs to use blk_plug_device_unlocked() · 93769f58

Committed by Jens Axboe on Aug 01, 2008

It doesn't hold the queue lock, so it is racy on the queue flags
and spews a warning.
Signed-off-by: Jens Axboe <jens.axboe@oracle.com>

93769f58

01 Aug, 2008 2 commits

[PATCH] switch mtd and dm-table to lookup_bdev() · d5686b44

Committed by Al Viro on Aug 01, 2008

No need to open-code it...
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

d5686b44

md: raid10: wake up frozen array · 388667be

Committed by Arthur Jones on Jul 25, 2008

When rescheduling a bio in raid10, we wake up
the md thread, but if the array is frozen, this
will have no effect.  This causes the array to
remain frozen for eternity.  We add a wake_up
to allow the array to de-freeze.  This code is
nearly identical to the raid1 code, which has
this fix already.
Signed-off-by: Arthur Jones <ajones@riverbed.com>
Signed-off-by: NeilBrown <neilb@suse.de>

388667be

29 Jul, 2008 2 commits

md: do not count blocked devices as spares · e5427135

Committed by Dan Williams on Jul 28, 2008

remove_and_add_spares() assumes that failed devices have been hot-removed
from the array.  Removal is skipped in the 'blocked' case so do not count a
device in this state as 'spare'.
Signed-off-by: Dan Williams <dan.j.williams@intel.com>

e5427135

md: do not progress the resync process if the stripe was blocked · df10cfbc

Committed by Dan Williams on Jul 28, 2008

handle_stripe will take no action on a stripe when waiting for userspace
to unblock the array, so do not report completed sectors.
Signed-off-by: Dan Williams <dan.j.williams@intel.com>

df10cfbc

27 Jul, 2008 1 commit

[SCSI] scsi_dh: attach to hardware handler from dm-mpath · ae11b1b3

Committed by Hannes Reinecke on Jul 17, 2008

multipath keeps a separate device table which may be
more current than the built-in one.
So we should make sure to always call ->attach whenever
a multipath map with hardware handler is instantiated.
And we should call ->detach on removal, too.

[sekharan: update as per comments from agk]
Signed-off-by: Hannes Reinecke <hare@suse.de>
Signed-off-by: Chandra Seetharaman <sekharan@us.ibm.com>
Signed-off-by: James Bottomley <James.Bottomley@HansenPartnership.com>

ae11b1b3

24 Jul, 2008 3 commits

md: delay notification of 'active_idle' to the recovery thread · d8e64406

Committed by Dan Williams on Jul 23, 2008

sysfs_notify might sleep, so do not call it from md_safemode_timeout.
Signed-off-by: Dan Williams <dan.j.williams@intel.com>

d8e64406

md: fix merge error · 23397883

Committed by Dan Williams on Jul 23, 2008

The original STRIPE_OP_IO removal patch had the following hunk:

-               for (i = conf->raid_disks; i--; ) {
+               for (i = conf->raid_disks; i--; )
                        set_bit(R5_Wantwrite, &sh->dev[i].flags);
-                       if (!test_and_set_bit(STRIPE_OP_IO, &sh->ops.pending))
-                               sh->ops.count++;
-               }

However it appears the hunk became broken after merging:
-               for (i = conf->raid_disks; i--; ) {
+               for (i = conf->raid_disks; i--; )
                        set_bit(R5_Wantwrite, &sh->dev[i].flags);
                        set_bit(R5_LOCKED, &dev->flags);
                        s.locked++;
-                       if (!test_and_set_bit(STRIPE_OP_IO, &sh->ops.pending))
-                               sh->ops.count++;
-               }
Signed-off-by: Dan Williams <dan.j.williams@intel.com>

23397883
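
One way to read the problem in the merged hunk above: once the braces are removed, a `for` loop governs only the statement that immediately follows it. The sketch below reproduces that C rule with plain counters; the names are hypothetical and nothing here is taken from the raid5 code.

```c
#include <assert.h>

#define SKETCH_DISKS 4

struct sketch_counts {
    int wantwrite;   /* statements that stayed inside the loop */
    int locked;      /* statements that may have fallen out of it */
};

/* Braces kept: both statements run once per disk. */
static struct sketch_counts sketch_with_braces(void)
{
    struct sketch_counts c = { 0, 0 };

    for (int i = SKETCH_DISKS; i--; ) {
        c.wantwrite++;
        c.locked++;
    }
    assert(c.wantwrite == SKETCH_DISKS);
    return c;
}

/* Braces dropped: only the first statement is the loop body. */
static struct sketch_counts sketch_without_braces(void)
{
    struct sketch_counts c = { 0, 0 };

    for (int i = SKETCH_DISKS; i--; )
        c.wantwrite++;
    c.locked++;   /* runs once, after the loop has finished */
    assert(c.wantwrite == SKETCH_DISKS);
    return c;
}
```

With four disks the braced version counts four of each, while the brace-less version counts four and one, even if the source indentation suggests otherwise.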

md: move async_tx_issue_pending_all outside spin_lock_irq · c9f21aaf

Committed by Dan Williams on Jul 23, 2008

Some dma drivers need to call spin_lock_bh in their device_issue_pending
routines.  This change avoids:

WARNING: at kernel/softirq.c:136 local_bh_enable_ip+0x3a/0x85()
Signed-off-by: Dan Williams <dan.j.williams@intel.com>

c9f21aaf

21 Jul, 2008 3 commits

dm crypt: add merge · d41e26b9

Committed by Milan Broz on Jul 21, 2008

This patch implements biovec merge function for crypt target.

If the underlying device has merge function defined, call it.
If not, keep precomputed value.
Signed-off-by: Milan Broz <mbroz@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

d41e26b9

dm table: remove merge_bvec sector restriction · 9980c638

Committed by Milan Broz on Jul 21, 2008

Remove the max_sector restriction, since the merge function replaces it.
Signed-off-by: Milan Broz <mbroz@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

9980c638

dm: linear add merge · 7bc3447b

Committed by Milan Broz on Jul 21, 2008

This patch implements biovec merge function for linear target.

If the underlying device has merge function defined, call it.
If not, keep precomputed value.
Signed-off-by: Milan Broz <mbroz@redhat.com>
Signed-off-by: Alasdair G Kergon <agk@redhat.com>

7bc3447b
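
The "call the underlying merge function if there is one, otherwise keep the precomputed value" rule used by the linear and crypt merge patches can be sketched with a function pointer. The types and names below are hypothetical simplifications, and taking the minimum of the two limits is an assumption of this sketch rather than something the commit messages state.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical signature for an underlying device's merge callback. */
typedef int (*sketch_merge_fn)(int requested);

struct sketch_queue {
    sketch_merge_fn merge_fn;   /* NULL when the device defines none */
};

static int sketch_min(int a, int b)
{
    return a < b ? a : b;
}

/* Defer to the lower device if it has a callback, else keep max_size. */
static int sketch_target_merge(const struct sketch_queue *q, int requested,
                               int max_size)
{
    assert(q != NULL);
    if (!q->merge_fn)
        return max_size;
    return sketch_min(max_size, q->merge_fn(requested));
}

/* Example callback: a device that never accepts more than 2048 bytes. */
static int sketch_small_device(int requested)
{
    return requested < 2048 ? requested : 2048;
}
```

A queue without a callback passes the precomputed limit through unchanged, while a queue with `sketch_small_device` caps the result at 2048 bytes.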

openeuler / Kernel synced successfully more than 1 year ago