提交 · bf91db18ac2852a3ff39fe25ff56c5557c0fff78 · bug2833 / cloud-kernel

03 12月, 2008 2 次提交

block: set disk->node_id before it's being used · bf91db18

由 Cheng Renquan 提交于 11月 20, 2008

disk->node_id will be refered in allocating in disk_expand_part_tbl, so we
should set it before disk->node_id is refered.
Signed-off-by: NCheng Renquan <crquan@gmail.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

bf91db18

When block layer fails to map iov, it calls bio_unmap_user to undo · 53cc0b29

由 Petr Vandrovec 提交于 11月 19, 2008

mapping.  Which is good if pages were mapped - but if they were provided
by someone else and just copied then bad things happen - pages are
released once here, and once by caller, leading to user triggerable BUG
at include/linux/mm.h:246.
Signed-off-by: NPetr Vandrovec <petr@vandrovec.name>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

53cc0b29

18 11月, 2008 3 次提交

block: hold extra reference to bio in blk_rq_map_user_iov() · c26156b2

由 Jens Axboe 提交于 11月 18, 2008

If the size passed in is OK but we end up mapping too many segments,
we call the unmap path directly like from IO completion. But from IO
completion we have an extra reference to the bio, so this error case
goes OOPS when it attempts to free and already free bio.

Fix it by getting an extra reference to the bio before calling the
unmap failure case.
Reported-by: NPetr Vandrovec <vandrove@vc.cvut.cz>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

c26156b2

block: fix boot failure with CONFIG_DEBUG_BLOCK_EXT_DEVT=y and nash · 561ec68e

由 Zhang, Yanmin 提交于 11月 14, 2008

We run into system boot failure with kernel 2.6.28-rc. We found it on a
couple of machines, including T61 notebook, nehalem machine, and another
HPC NX6325 notebook.  All the machines use FedoraCore 8 or FedoraCore 9.
With kernel prior to 2.6.28-rc, system boot doesn't fail.

I debug it and locate the root cause. Pls. see
http://bugzilla.kernel.org/show_bug.cgi?id=11899
https://bugzilla.redhat.com/show_bug.cgi?id=471517

As a matter of fact, there are 2 bugs.

1)root=/dev/sda1, system boot randomly fails. Mostly, boot for 5 times
and fails once. nash has a bug. Some of its functions misuse return
value 0.  Sometimes, 0 means timeout and no uevent available. Sometimes,
0 means nash gets an uevent, but the uevent isn't block-related (for
exmaple, usb). If by coincidence, kernel tells nash that uevents are
available, but kernel also set timeout, nash might stops collecting
other uevents in queue if current uevent isn't block-related.  I work
out a patch for nash to fix it.
http://bugzilla.kernel.org/attachment.cgi?id=18858

2) root=LABEL=/, system always can't boot. initrd init reports
switchroot fails. Here is an executation branch of nash when booting:
    (1) nash read /sys/block/sda/dev; Assume major is 8 (on my desktop)
    (2) nash query /proc/devices with the major number; It found line
	"8 sd";
    (3) nash use 'sd' to search its own probe table to find device (DISK)
	type for the device and add it to its own list;
    (4) Later on, it probes all devices in its list to get filesystem
	labels; scsi register "8 sd" always.

When major is 259, nash fails to find the device(DISK) type. I enables
CONFIG_DEBUG_BLOCK_EXT_DEVT=y when compiling kernel, so 259 is picked up
for device /dev/sda1, which causes nash to fail to find device (DISK)
type.

To fixing issue 2), I create a patch for nash and another patch for
kernel.

http://bugzilla.kernel.org/attachment.cgi?id=18859
http://bugzilla.kernel.org/attachment.cgi?id=18837

Below is the patch for kernel 2.6.28-rc4. It registers blkext, a new
block device in proc/devices.

With 2 patches on nash and 1 patch on kernel, I boot my machines for
dozens of times without failure.

Signed-off-by Zhang Yanmin <yanmin.zhang@linux.intel.com>
Acked-by: NTejun Heo <tj@kernel.org>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

561ec68e

block: make add_partition() return pointer to hd_struct · ba32929a

由 Tejun Heo 提交于 11月 10, 2008

Make add_partition() return pointer to the new hd_struct on success
and ERR_PTR() value on failure.  This change will be used to fix md
autodetection bug.
Signed-off-by: NTejun Heo <tj@kernel.org>
Cc: Neil Brown <neilb@suse.de>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

ba32929a

06 11月, 2008 4 次提交

Block: use round_jiffies_up() · 7838c15b

由 Alan Stern 提交于 11月 06, 2008

This patch (as1159b) changes the timeout routines in the block core to
use round_jiffies_up().  There's no point in rounding the timer
deadline down, since if it expires too early we will have to restart
it.

The patch also removes some unnecessary tests when a request is
removed from the queue's timer list.
Signed-off-by: NAlan Stern <stern@rowland.harvard.edu>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

7838c15b

blk: move blk_delete_timer call in end_that_request_last · e78042e5

由 Mike Anderson 提交于 10月 30, 2008

Move the calling  blk_delete_timer to later in end_that_request_last to
address an issue where blkdev_dequeue_request may have add a timer for the
request.
Signed-off-by: NMike Anderson <andmike@linux.vnet.ibm.com>
Acked-by: NTejun Heo <tj@kernel.org>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

e78042e5

block: add timer on blkdev_dequeue_request() not elv_next_request() · 2920ebbd

由 Tejun Heo 提交于 10月 30, 2008

Block queue supports two usage models - one where block driver peeks
at the front of queue using elv_next_request(), processes it and
finishes it and the other where block driver peeks at the front of
queue, dequeue the request using blkdev_dequeue_request() and finishes
it.  The latter is more flexible as it allows the driver to process
multiple commands concurrently.

These two inconsistent usage models affect the block layer
implementation confusing.  For some, elv_next_request() is considered
the issue point while others consider blkdev_dequeue_request() the
issue point.

Till now the inconsistency mostly affect only accounting, so it didn't
really break anything seriously; however, with block layer timeout,
this inconsistency hits hard.  Block layer considers
elv_next_request() the issue point and adds timer but SCSI layer
thinks it was just peeking and when the request can't process the
command right away, it's just left there without further processing.
This makes the request dangling on the timer list and, when the timer
goes off, the request which the SCSI layer and below think is still on
the block queue ends up in the EH queue, causing various problems - EH
hang (failed count goes over busy count and EH never wakes up),
WARN_ON() and oopses as low level driver trying to handle the unknown
command, etc. depending on the timing.

As SCSI midlayer is the only user of block layer timer at the moment,
moving blk_add_timer() to elv_dequeue_request() fixes the problem;
however, this two usage models definitely need to be cleaned up in the
future.
Signed-off-by: NTejun Heo <tj@kernel.org>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

2920ebbd

block: remove unused ll_new_mergeable() · 43381785

由 FUJITA Tomonori 提交于 10月 20, 2008

Signed-off-by: NFUJITA Tomonori <fujita.tomonori@lab.ntt.co.jp>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

43381785

24 10月, 2008 1 次提交
- L
  compat_blkdev_driver_ioctl: Remove unused variable warning · 5f4f0c4d
  由 Linus Torvalds 提交于 10月 23, 2008
```
Variable 'ret' is no longer used. Don't declare it.
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>
```
  5f4f0c4d
23 10月, 2008 2 次提交
- A
  proc: move /proc/diskstats boilerplate to block/genhd.c · 31d85ab2
  由 Alexey Dobriyan 提交于 10月 06, 2008
```
Signed-off-by: NAlexey Dobriyan <adobriyan@gmail.com>
Acked-by: NJens Axboe <jens.axboe@oracle.com>
```
  31d85ab2
- A
  proc: move rest of /proc/partitions code to block/genhd.c · f500975a
  由 Alexey Dobriyan 提交于 10月 04, 2008
```
Signed-off-by: NAlexey Dobriyan <adobriyan@gmail.com>
Acked-by: NJens Axboe <jens.axboe@oracle.com>
```
  f500975a
21 10月, 2008 12 次提交

A
[PATCH] kill the rest of struct file propagation in block ioctls · 56b26add
由 Al Viro 提交于 9月 19, 2008
```
Now we can switch blkdev_ioctl() block_device/mode
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
56b26add

[PATCH] get rid of struct file use in blkdev_ioctl() BLKBSZSET · 6af3a56e

由 Al Viro 提交于 9月 19, 2008

We need to do bd_claim() only if file hadn't been opened with O_EXCL
and then we have no need to use file itself as owner.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

6af3a56e

[PATCH] get rid of blkdev_locked_ioctl() · 45048d09

由 Al Viro 提交于 9月 18, 2008

Most of that stuff doesn't need BKL at all; expand in the (only) caller,
merge the switch into one there and leave BKL only around the stuff that
might actually need it.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

45048d09

[PATCH] get rid of blkdev_driver_ioctl() · e436fdae

由 Al Viro 提交于 9月 18, 2008

convert remaining callers to __blkdev_driver_ioctl()
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

e436fdae

[PATCH] trim file propagation in block/compat_ioctl.c · 33c2dca4

由 Al Viro 提交于 2月 22, 2008

... and remove the handling of cases when it falls back to native
without changing arguments.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

33c2dca4

A
[PATCH] end of methods switch: remove the old ones · 90b8f282
由 Al Viro 提交于 3月 02, 2008
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
90b8f282

[PATCH] beginning of methods conversion · d4430d62

由 Al Viro 提交于 3月 02, 2008

To keep the size of changesets sane we split the switch by drivers;
to keep the damn thing bisectable we do the following:
	1) rename the affected methods, add ones with correct
prototypes, make (few) callers handle both.  That's this changeset.
	2) for each driver convert to new methods.  *ALL* drivers
are converted in this series.
	3) kill the old (renamed) methods.

Note that it _is_ a flagday; all in-tree drivers are converted and by the
end of this series no trace of old methods remain.  The only reason why
we do that this way is to keep the damn thing bisectable and allow per-driver
debugging if anything goes wrong.

New methods:
	open(bdev, mode)
	release(disk, mode)
	ioctl(bdev, mode, cmd, arg)		/* Called without BKL */
	compat_ioctl(bdev, mode, cmd, arg)
	locked_ioctl(bdev, mode, cmd, arg)	/* Called with BKL, legacy */
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

d4430d62

[PATCH] introduce __blkdev_driver_ioctl() · 633a08b8

由 Al Viro 提交于 8月 29, 2007

Analog of blkdev_driver_ioctl() with sane arguments.  For
now uses fake struct file, by the end of the series it won't
and blkdev_driver_ioctl() will become a wrapper around it.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

633a08b8

A
[PATCH] switch scsi_cmd_ioctl() to passing fmode_t · 74f3c8af
由 Al Viro 提交于 8月 27, 2007
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
74f3c8af
A
[PATCH] switch sg_scsi_ioctl() to passing fmode_t · e915e872
由 Al Viro 提交于 9月 02, 2008
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
e915e872
A
[PATCH] pass mode instead of file to sg_io() · 5842e51f
由 Al Viro 提交于 9月 02, 2008
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
5842e51f
A
[PATCH] introduce fmode_t, do annotations · aeb5d727
由 Al Viro 提交于 9月 02, 2008
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
aeb5d727

17 10月, 2008 8 次提交

block: remove __generic_unplug_device() from exports · f73e2d13

由 Jens Axboe 提交于 10月 17, 2008

The only out-of-core user is IDE, and that should be using
blk_start_queueing() instead.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

f73e2d13

block: move q->unplug_work initialization · 713ada9b

由 Peter Zijlstra 提交于 10月 16, 2008

modprobe loop; rmmod loop effectively creates a blk_queue and destroys it
which results in q->unplug_work being canceled without it ever being
initialized.

Therefore, move the initialization of q->unplug_work from
blk_queue_make_request() to blk_alloc_queue*().
Reported-by: NAlexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

713ada9b

block: fix current kernel-doc warnings · 496aa8a9

由 Randy Dunlap 提交于 10月 16, 2008

Fix block kernel-doc warnings:

Warning(linux-2.6.27-git4//fs/block_dev.c:1272): No description found for parameter 'path'
Warning(linux-2.6.27-git4//block/blk-core.c:1021): No description found for parameter 'cpu'
Warning(linux-2.6.27-git4//block/blk-core.c:1021): No description found for parameter 'part'
Warning(/var/linsrc/linux-2.6.27-git4//block/genhd.c:544): No description found for parameter 'partno'
Signed-off-by: NRandy Dunlap <randy.dunlap@oracle.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

496aa8a9

block: only call ->request_fn when the queue is not stopped · 80a4b58e

由 Jens Axboe 提交于 10月 14, 2008

Callers should use either blk_run_queue/__blk_run_queue, or
blk_start_queueing() to invoke request handling instead of calling
->request_fn() directly as that does not take the queue stopped
flag into account.

Also add appropriate comments on the above functions to detail
their usage.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

80a4b58e

block: simplify string handling in elv_iosched_store() · ee2e992c

由 Li Zefan 提交于 10月 14, 2008

strlcpy() guarantees the dest buffer is NULL teminated.
Signed-off-by: NLi Zefan <lizf@cn.fujitsu.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

ee2e992c

block: fix kernel-doc for blk_alloc_devt() · e6d63840

由 Li Zefan 提交于 10月 14, 2008

No argument 'gfp_mask' for blk_alloc_devt().
Signed-off-by: NLi Zefan <lizf@cn.fujitsu.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

e6d63840

block: fix nr_phys_segments miscalculation bug · 86771427

由 FUJITA Tomonori 提交于 10月 13, 2008

This fixes the bug reported by Nikanth Karthikesan <knikanth@suse.de>:

http://lkml.org/lkml/2008/10/2/203

The root cause of the bug is that blk_phys_contig_segment
miscalculates q->max_segment_size.

blk_phys_contig_segment checks:

req->biotail->bi_size + next_req->bio->bi_size > q->max_segment_size

But blk_recalc_rq_segments might expect that req->biotail and the
previous bio in the req are supposed be merged into one
segment. blk_recalc_rq_segments might also expect that next_req->bio
and the next bio in the next_req are supposed be merged into one
segment. In such case, we merge two requests that can't be merged
here. Later, blk_rq_map_sg gives more segments than it should.

We need to keep track of segment size in blk_recalc_rq_segments and
use it to see if two requests can be merged. This patch implements it
in the similar way that we used to do for hw merging (virtual
merging).
Signed-off-by: NFUJITA Tomonori <fujita.tomonori@lab.ntt.co.jp>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

86771427

device create: block: convert device_create_drvdata to device_create · 1ff9f542

由 Greg Kroah-Hartman 提交于 7月 21, 2008

Now that device_create() has been audited, rename things back to the
original call to be sane.
Signed-off-by: NGreg Kroah-Hartman <gregkh@suse.de>

1ff9f542

13 10月, 2008 1 次提交

[SCSI] block: separate failfast into multiple bits. · 6000a368

由 Mike Christie 提交于 8月 19, 2008

Multipath is best at handling transport errors. If it gets a device
error then there is not much the multipath layer can do. It will just
access the same device but from a different path.

This patch breaks up failfast into device, transport and driver errors.
The multipath layers (md and dm mutlipath) only ask the lower levels to
fast fail transport errors. The user of failfast, read ahead, will ask
to fast fail on all errors.

Note that blk_noretry_request will return true if any failfast bit
is set. This allows drivers that do not support the multipath failfast
bits to continue to fail on any failfast error like before. Drivers
like scsi that are able to fail fast specific errors can check
for the specific fail fast type. In the next patch I will convert
scsi.
Signed-off-by: NMike Christie <michaelc@cs.wisc.edu>
Cc: Jens Axboe <jens.axboe@oracle.com>
Signed-off-by: NJames Bottomley <James.Bottomley@HansenPartnership.com>

6000a368

09 10月, 2008 7 次提交

block: Switch blk_integrity_compare from bdev to gendisk · ad7fce93

由 Martin K. Petersen 提交于 10月 01, 2008

The DM and MD integrity support now depends on being able to use
gendisks instead of block_devices when comparing integrity profiles.
Change function parameters accordingly.

Also update comparison logic so that two NULL profiles are a valid
configuration.
Signed-off-by: NMartin K. Petersen <martin.petersen@oracle.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

ad7fce93

block: Fix double put in blk_integrity_unregister · 0c032ab8

由 Martin K. Petersen 提交于 10月 01, 2008

- kobject_del already puts the parent.

 - Set integrity profile to NULL to prevent stale data.
Signed-off-by: NMartin K. Petersen <martin.petersen@oracle.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

0c032ab8

block: remove end_{queued|dequeued}_request() · d00e29fd

由 Kiyoshi Ueda 提交于 10月 01, 2008

This patch removes end_queued_request() and end_dequeued_request(),
which are no longer used.

As a results, users of __end_request() became only end_request().
So the actual code in __end_request() is moved to end_request()
and __end_request() is removed.
Signed-off-by: NKiyoshi Ueda <k-ueda@ct.jp.nec.com>
Signed-off-by: NJun'ichi Nomura <j-nomura@ce.jp.nec.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

d00e29fd

block: change elevator to use __blk_end_request() · 99cd3386

由 Kiyoshi Ueda 提交于 10月 01, 2008

This patch converts elevator to use __blk_end_request() directly
so that end_{queued|dequeued}_request() can be removed.
Related 'uptodate' arguments is converted to 'error'.
Signed-off-by: NKiyoshi Ueda <k-ueda@ct.jp.nec.com>
Signed-off-by: NJun'ichi Nomura <j-nomura@ce.jp.nec.com>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

99cd3386

blktrace: use BLKTRACE_BDEV_SIZE as the name size for setup structure · 0497b345

由 Jens Axboe 提交于 10月 01, 2008

Define as 32, which is is what BDEVNAME_SIZE is/was as well. This keeps
the user interface the same and gets rid of the difference between
kernel and user api here.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

0497b345

block: add lld busy state exporting interface · ef9e3fac

由 Kiyoshi Ueda 提交于 10月 01, 2008

This patch adds an new interface, blk_lld_busy(), to check lld's
busy state from the block layer.
blk_lld_busy() calls down into low-level drivers for the checking
if the drivers set q->lld_busy_fn() using blk_queue_lld_busy().

This resolves a performance problem on request stacking devices below.

Some drivers like scsi mid layer stop dispatching request when
they detect busy state on its low-level device like host/target/device.
It allows other requests to stay in the I/O scheduler's queue
for a chance of merging.

Request stacking drivers like request-based dm should follow
the same logic.
However, there is no generic interface for the stacked device
to check if the underlying device(s) are busy.
If the request stacking driver dispatches and submits requests to
the busy underlying device, the requests will stay in
the underlying device's queue without a chance of merging.
This causes performance problem on burst I/O load.

With this patch, busy state of the underlying device is exported
via q->lld_busy_fn().  So the request stacking driver can check it
and stop dispatching requests if busy.

The underlying device driver must return the busy state appropriately:
    1: when the device driver can't process requests immediately.
    0: when the device driver can process requests immediately,
       including abnormal situations where the device driver needs
       to kill all requests.
Signed-off-by: NKiyoshi Ueda <k-ueda@ct.jp.nec.com>
Signed-off-by: NJun'ichi Nomura <j-nomura@ce.jp.nec.com>
Cc: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

ef9e3fac

block: Fix blk_start_queueing() to not kick a stopped queue · 336c3d8c

由 Elias Oltmanns 提交于 10月 01, 2008

blk_start_queueing() should act like the generic queue unplugging
and kicking and ignore a stopped queue. Such a queue may not be
run until after a call to blk_start_queue().
Signed-off-by: NElias Oltmanns <eo@nebensachen.de>
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

336c3d8c

bug2833 / cloud-kernel 与 Fork 源项目一致

bug2833 / cloud-kernel
与 Fork 源项目一致