提交 · 9a0d9d0389ef769e4b01abf50fcc11407706270b · openeuler / Kernel

24 5月, 2011 11 次提交

由 Lars Ellenberg 提交于 5月 02, 2011

An administrative detach used to request a state change directly to D_DISKLESS,
first suspending IO to avoid the last put_ldev() occuring from an endio handler,
potentially in irq context.

This is not enough on the receiving side (typically secondary), we may miss
some peer_req on the way to local disk, which then may do the last put_ldev()
from their drbd_peer_request_endio().

This patch makes the detach always go through the intermediate D_FAILED state.
We may consider to rename it D_DETACHING.

Alternative approach would be to create yet an other work item to be scheduled
on the worker, do the destructor work from there, and get the timing right.

manually picked commit 564040f from the drbd 8.4 branch.
Signed-off-by: NPhilipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: NLars Ellenberg <lars.ellenberg@linbit.com>

9a0d9d03

drbd: Take a more conservative approach when deciding max_bio_size · 99432fcc

由 Philipp Reisner 提交于 5月 20, 2011

The old (optimistic) implementation could shrink the bio size
on an primary device.

Shrinking the bio size on a primary device is bad. Since there
we might get BIOs with the old (bigger) size shortly after
we published the new size.

The new implementation is more conservative, and eventually
increases the max_bio_size on a primary device (which is valid).
It does so, when it knows the local limit AND the remote limit.

 We cache the last seen max_bio_size of the peer in the meta
 data, and rely on that, to make the operation of single
 nodes more efficient.
Signed-off-by: NPhilipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: NLars Ellenberg <lars.ellenberg@linbit.com>

99432fcc

P
drbd: Fixed state transitions after async outdate-peer-handler returned · 21423fa7
由 Philipp Reisner 提交于 5月 17, 2011
```
Signed-off-by: NPhilipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: NLars Ellenberg <lars.ellenberg@linbit.com>
```
21423fa7
P
drbd: Disallow the peer_disk_state to be D_OUTDATED while connected · fa7d9396
由 Philipp Reisner 提交于 5月 17, 2011
```
Signed-off-by: NPhilipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: NLars Ellenberg <lars.ellenberg@linbit.com>
```
fa7d9396

drbd: Fix for the connection problems on high latency links · a8e40792

由 Philipp Reisner 提交于 5月 13, 2011

It seems that the real cause of all the issues where that
we did not noticed in drbd_try_connect() when the other
guy closes one socket if the round trip time gets higher
than 100ms. There were that 100ms hard coded!
Signed-off-by: NPhilipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: NLars Ellenberg <lars.ellenberg@linbit.com>

a8e40792

drbd: fix potential activity log refcount imbalance in error path · 76727f68

由 Lars Ellenberg 提交于 5月 16, 2011

It is no longer sufficient to trigger on local WRITE,
we need to check on (rq_state & RQ_IN_ACT_LOG)
before calling drbd_al_complete_io also in the error path.
Signed-off-by: NPhilipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: NLars Ellenberg <lars.ellenberg@linbit.com>

76727f68

drbd: Only downgrade the disk state in case of disk failures · d2e17807

由 Philipp Reisner 提交于 3月 14, 2011

Signed-off-by: NPhilipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: NLars Ellenberg <lars.ellenberg@linbit.com>

d2e17807

drbd: fix disconnect/reconnect loop, if ping-timeout == ping-int · f36af18c

由 Lars Ellenberg 提交于 3月 09, 2011

If there is no replication traffic within the idle timeout
(ping-int seconds), DRBD will send a P_PING,
and adjust the timeout to ping-timeout.

If there is no P_PING_ACK received within this ping-timeout,
DRBD finally drops the connection, and tries to re-establish it.

To decide which timeout was active, we compared the current timeout
with the ping-timeout, and dropped the connection, if that was the case.

By default, ping-int is 10 seconds, ping-timeout is 500 ms.

Unfortunately, if you configure ping-timeout to be the same as ping-int,
expiry of the idle-timeout had been mistaken for a missing ping ack,
and caused an immediate reconnection attempt.

Fix:
Allow both timeouts to be equal, use a local variable
to store which timeout is active.
Signed-off-by: NPhilipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: NLars Ellenberg <lars.ellenberg@linbit.com>

f36af18c

drbd: fix potential distributed deadlock · 53ea4331

由 Lars Ellenberg 提交于 3月 08, 2011

We limit ourselves to a configurable maximum number of pages used as
temporary bio pages.

If the configured "max_buffers" is not big enough to match the bandwidth
of the respective deployment, a distributed deadlock could be triggered
by e.g. fast online verify and heavy application IO.

TCP connections would block on congestion, because both receivers
would wait on pages to become available.

Fortunately the respective senders in this case would be able to give
back some pages already. So do that.
Signed-off-by: NPhilipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: NLars Ellenberg <lars.ellenberg@linbit.com>

53ea4331

lru_cache.h: fix comments referring to ts_ instead of lc_ · 600942e0

由 Lars Ellenberg 提交于 1月 27, 2011

For some time we contemplated calling the "struct lru_cache"
a "struct tracked_set", and some comments kept the ts_ prefix.

Fix those to match the member field names.
Signed-off-by: NPhilipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: NLars Ellenberg <lars.ellenberg@linbit.com>

600942e0

drbd: Fix for application IO with the on-io-error=pass-on policy · 738a84b2

由 Philipp Reisner 提交于 3月 03, 2011

In case a write failes on the local disk, go into D_INCONSISTENT
disk state. That causes future reads of that block to be shipped
to the peer.

Read retry remote was already in place.

Actually the documentation needs to get fixed now. Since the
application is still shielded from the error. (as long as we have
only a single disk failing) The difference to detach is that
we keep the disk. And therefore might keep all the other, still
working sectors up to date.
Signed-off-by: NPhilipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: NLars Ellenberg <lars.ellenberg@linbit.com>

738a84b2

19 5月, 2011 3 次提交

Merge branches 'for-jens/xen-backend-fixes' and 'for-jens/xen-blkback-v3.3' of... · 779d5306

由 Jens Axboe 提交于 5月 19, 2011

Merge branches 'for-jens/xen-backend-fixes' and 'for-jens/xen-blkback-v3.3' of git://git.kernel.org/pub/scm/linux/kernel/git/konrad/xen into for-2.6.40/drivers

779d5306

xen/p2m: Add EXPORT_SYMBOL_GPL to the M2P override functions. · c9ce9e43

由 Konrad Rzeszutek Wilk 提交于 4月 20, 2011

If the backends, which use these two functions, are compiled as
a module we need these two functions to be exported.
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>

c9ce9e43

xen/p2m/m2p/gnttab: Support GNTMAP_host_map in the M2P override. · d5431d52

由 Konrad Rzeszutek Wilk 提交于 2月 28, 2011

We only supported the M2P (and P2M) override only for the
GNTMAP_contains_pte type mappings. Meaning that we grants
operations would "contain the machine address of the PTE to update"
If the flag is unset, then the grant operation is
"contains a host virtual address". The latter case means that
the Hypervisor takes care of updating our page table
(specifically the PTE entry) with the guest's MFN. As such we should
not try to do anything with the PTE. Previous to this patch
we would try to clear the PTE which resulted in Xen hypervisor
being upset with us:

(XEN) mm.c:1066:d0 Attempt to implicitly unmap a granted PTE c0100000ccc59067
(XEN) domain_crash called from mm.c:1067
(XEN) Domain 0 (vcpu#0) crashed on cpu#3:
(XEN) ----[ Xen-4.0-110228  x86_64  debug=y  Not tainted ]----

and crashing us.

This patch allows us to inhibit the PTE clearing in the PV guest
if the GNTMAP_contains_pte is not set.

On the m2p_remove_override path we provide the same parameter.

Sadly in the grant-table driver we do not have a mechanism to
tell m2p_remove_override whether to clear the PTE or not. Since
the grant-table driver is used by user-space, we can safely assume
that it operates only on PTE's. Hence the implementation for
it to work on !GNTMAP_contains_pte returns -EOPNOTSUPP. In the future
we can implement the support for this. It will require some extra
accounting structure to keep track of the page[i], and the flag.

[v1: Added documentation details, made it return -EOPNOTSUPP instead
 of trying to do a half-way implementation]
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>

d5431d52

18 5月, 2011 1 次提交

xen/blkback: don't fail empty barrier requests · 8ab52150

由 Jan Beulich 提交于 5月 17, 2011

The sector number on empty barrier requests may (will?) be -1, which,
given that it's being treated as unsigned 64-bit quantity, will almost
always exceed the actual (virtual) disk's size.

Inspired by Konrad's "When writting barriers set the sector number to
zero...".

While at it also add overflow checking to the math in vbd_translate().
Signed-off-by: NJan Beulich <jbeulich@novell.com>
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>

8ab52150

13 5月, 2011 18 次提交
- L
  xen/blkback: fix xenbus_transaction_start() hang caused by double xenbus_transaction_end() · 496b318e
  由 Laszlo Ersek 提交于 5月 13, 2011
```
vbd_resize() up_read()'s xs_state.suspend_mutex twice in a row via double
xenbus_transaction_end() calls. The next down_read() in
xenbus_transaction_start() (at eg. the next resize attempt) hangs.

Bugzilla: https://bugzilla.redhat.com/show_bug.cgi?id=618317Acked-by: NJan Beulich <jbeulich@novell.com>
Acked-by: NIan Campbell <ian.campbell@citrix.com>
Signed-off-by: NLaszlo Ersek <lersek@redhat.com>
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  496b318e
- K
  xen/blkback: Align the tabs on the structure. · 51854322
  由 Konrad Rzeszutek Wilk 提交于 5月 12, 2011
```
The recent changes caused this field of the structure to be offset a bit.
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  51854322
- K
  xen/blkback: if log_stats is enabled print out the data. · cca537af
  由 Konrad Rzeszutek Wilk 提交于 5月 12, 2011
```
And not depend on the driver being built with -DDEBUG flag.
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  cca537af
- K
  xen/blkback: Add the prefix XEN in the common.h. · 5a577e38
  由 Konrad Rzeszutek Wilk 提交于 5月 12, 2011
```
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  5a577e38
- K
  xen/blkback: Prefix 'vbd' with 'xen' in structs and functions. · 3d814731
  由 Konrad Rzeszutek Wilk 提交于 5月 12, 2011
```
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  3d814731
- K
  xen/blkback: Change structure name blkif_st to xen_blkif. · 30fd1502
  由 Konrad Rzeszutek Wilk 提交于 5月 12, 2011
```
No need for that '_st' and xen_blkif is more apt.
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  30fd1502
- K
  xen/blkback: Remove the unused typedefs. · 325a6486
  由 Konrad Rzeszutek Wilk 提交于 5月 12, 2011
```
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  325a6486
- K
  xen/blkback: Move include/xen/blkif.h into drivers/block/xen-blkback/common.h · 452a6b2b
  由 Konrad Rzeszutek Wilk 提交于 5月 12, 2011
```
Not point of the blkif.h file. It is not used by the frontend.
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  452a6b2b
- K
  xen/blkback: Fixing some more of the cleanpatch.pl warnings. · b0f80127
  由 Konrad Rzeszutek Wilk 提交于 5月 12, 2011
```
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  b0f80127
- K
  xen/blkback: Checkpatch.pl recommend against multiple assigments. · 03e0edf9
  由 Konrad Rzeszutek Wilk 提交于 5月 12, 2011
```
CHECK: multiple assignments should be avoided
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  03e0edf9
- K
  xen/blkback: Fix checkpatch.pl warnings about more than 80 lines. · 41ca4d38
  由 Konrad Rzeszutek Wilk 提交于 5月 12, 2011
```
Break up the macro usage.
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  41ca4d38
- K
  xen/blkback: Flesh out the description in the Kconfig. · a4c34858
  由 Konrad Rzeszutek Wilk 提交于 5月 12, 2011
```
with more details.
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  a4c34858
- K
  xen/blkback: Fix spelling mistakes. · b9fc0296
  由 Konrad Rzeszutek Wilk 提交于 5月 11, 2011
```
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  b9fc0296
- K
  xen/blkback: Move blkif_get_x86_[32|64]_req to common.h in block/xen-blkback dir. · 68c88dd7
  由 Konrad Rzeszutek Wilk 提交于 5月 11, 2011
```
From the blkif.h header, which was exposed to the frontend.
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  68c88dd7
- K
  xen/blkback: Removing the debug_lvl option. · 72468bfc
  由 Konrad Rzeszutek Wilk 提交于 5月 11, 2011
```
It is not really used for anything.
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  72468bfc
- K
  xen/blkback: Use the DRV_PFX in the pr_.. macros. · 22b20f2d
  由 Konrad Rzeszutek Wilk 提交于 5月 12, 2011
```
To make it easier to read.
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  22b20f2d
- K
  xen/blkback: Make the DPRINTK uniform. · 1afbd730
  由 Konrad Rzeszutek Wilk 提交于 5月 11, 2011
```
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  1afbd730
- K
  xen/blkback: Change printk/DPRINTK to pr_.. type variant. · ebe81906
  由 Konrad Rzeszutek Wilk 提交于 5月 12, 2011
```
And also make them uniform and prefix the message with 'xen-blkback'.
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
  ebe81906
12 5月, 2011 4 次提交

xen-blkfront: Introduce BLKIF_OP_FLUSH_DISKCACHE support. · edf6ef59

由 Konrad Rzeszutek Wilk 提交于 5月 03, 2011

If the backend supports the 'feature-flush-cache' mode, use that
instead of the 'feature-barrier' support.

Currently there are three backends that support the 'feature-flush-cache'
mode: NetBSD, Solaris and Linux kernel. The 'flush' option is much
light-weight version than the 'barrier' support so lets try to use as
there are no filesystems in the kernel that use full barriers anymore.
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>

edf6ef59

xen-blkfront: Provide for 'feature-flush-cache' the BLKIF_OP_WRITE_FLUSH_CACHE operation. · 6dcfb751

由 Konrad Rzeszutek Wilk 提交于 5月 05, 2011

The operation BLKIF_OP_WRITE_FLUSH_CACHE has existed in the Xen
tree header file for years but it was never present in the Linux tree
because the frontend (nor the backend) supported this interface.
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>

6dcfb751

xen-blkfront: fix data size for xenbus_gather in blkfront_connect · 4352b47a

由 Marek Marczykowski 提交于 5月 03, 2011

barrier variable is int, not long. This overflow caused another variable
override: "err" (in PV code) and "binfo" (in xenlinux code -
drivers/xen/blkfront/blkfront.c). The later caused incorrect device
flags (RO/removable etc).
Signed-off-by: NMarek Marczykowski <marmarek@mimuw.edu.pl>
Acked-by: NIan Campbell <Ian.Campbell@citrix.com>
[v1: Changed title]
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>

4352b47a

K
xen/blkback: Fixed up comments and converted spaces to tabs. · 01f37f2d
由 Konrad Rzeszutek Wilk 提交于 5月 11, 2011
```
Suggested-by: NIan Campbell <Ian.Campbell@eu.citrix.com>
Signed-off-by: NKonrad Rzeszutek Wilk <konrad.wilk@oracle.com>
```
01f37f2d

06 5月, 2011 3 次提交

cciss: fix compile issue · edc83d47

由 Jens Axboe 提交于 5月 06, 2011

drivers/block/cciss.c: In function ‘cciss_send_reset’:
drivers/block/cciss.c:2515:2: error: implicit declaration of function ‘fill_cmd’
drivers/block/cciss.c: At top level:
drivers/block/cciss.c:2531:12: error: conflicting types for ‘fill_cmd’
drivers/block/cciss.c:2534:1: note: an argument type that has a default promotion can’t match an empty parameter name list declaration
drivers/block/cciss.c:2515:18: note: previous implicit declaration of ‘fill_cmd’ was here
make[1]: *** [drivers/block/cciss.o] Error 1
make: *** [drivers/block/cciss.o] Error 2

Move fill_cmd() to above where it is first used.
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

edc83d47

cciss: add cciss_tape_cmds module paramter · 8a4ec67b

由 Stephen M. Cameron 提交于 5月 03, 2011

This is to allow number of commands reserved for use by SCSI tape drives
and medium changers to be adjusted at driver load time via the kernel
parameter cciss_tape_cmds, with a default value of 6, and a range
of 2 - 16 inclusive.  Previously, the driver limited the number of
commands which could be queued to the SCSI half of the the driver
to only 2.  This is to fix the problem that if you had more than
two tape drives, you couldn't, for example, erase or rewind them all
at the same time.
Signed-off-by: NStephen M. Cameron <scameron@beardog.cce.hp.com>
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

8a4ec67b

cciss: do not use bit 2 doorbell reset · 063d2cf7

由 Stephen M. Cameron 提交于 5月 03, 2011

It causes NMIs which are undesirable at best, unsurvivable at worst.
Prefer the soft reset instead.
Signed-off-by: NStephen M. Cameron <scameron@beardog.cce.hp.com>
Signed-off-by: NJens Axboe <jaxboe@fusionio.com>

063d2cf7

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功