Commits · b6dd1a89767bc33e9c98b3195f8925b46c5c95f3 · openanolis / cloud-kernel

08 Nov, 2012 40 commits

drbd: remove struct drbd_tl_epoch objects (barrier works) · b6dd1a89

Committed by Lars Ellenberg on Nov 28, 2011

cherry-picked and adapted from drbd 9 devel branch

DRBD requests (struct drbd_request) are already on the per resource
transfer log list, and carry their epoch number. We do not need to
additionally link them on other ring lists in other structs.

The drbd sender thread can itself recognize when to send a P_BARRIER,
by tracking the currently processed epoch, and how many writes
have been processed for that epoch.

If the epoch of the request to be processed does not match the currently
processed epoch, and any writes have been processed in it, a P_BARRIER for
this last processed epoch is sent out first.
The new epoch then becomes the currently processed epoch.

To not get stuck in drbd_al_begin_io() waiting for P_BARRIER_ACK,
the sender thread also needs to handle the case when the current
epoch was closed already, but no new requests are queued yet,
and send out P_BARRIER as soon as possible.

This is done by comparing the per resource "current transfer log epoch"
(tconn->current_tle_nr) with the per connection "currently processed
epoch number" (tconn->send.current_epoch_nr), while waiting for
new requests to be processed in wait_for_work().
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
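
The epoch tracking described in this commit message can be sketched in user-space C as follows. This is an illustration under stated assumptions, not the DRBD implementation: `struct sender_state`, `sender_account_write()` and `sender_idle_needs_barrier()` are hypothetical names, and only the idea of comparing a per-connection `current_epoch_nr` against the per-resource `current_tle_nr` is taken from the message.

```c
#include <stdbool.h>

/* Hypothetical per-connection sender state. Only current_epoch_nr is
 * named in the commit message; the other fields are assumptions. */
struct sender_state {
    bool seen_any_write_yet;
    unsigned int current_epoch_nr;
    unsigned int current_epoch_writes;
};

/* Account for a write that belongs to epoch req_epoch.
 * Returns true if a P_BARRIER for the previously processed epoch must
 * be sent first; that epoch number is stored in *barrier_nr. */
static bool sender_account_write(struct sender_state *s,
                                 unsigned int req_epoch,
                                 unsigned int *barrier_nr)
{
    bool need_barrier = false;

    if (!s->seen_any_write_yet) {
        s->seen_any_write_yet = true;
        s->current_epoch_nr = req_epoch;
        s->current_epoch_writes = 0;
    } else if (s->current_epoch_nr != req_epoch) {
        /* Epoch changed: close the old one, but only if it saw writes. */
        if (s->current_epoch_writes > 0) {
            *barrier_nr = s->current_epoch_nr;
            need_barrier = true;
        }
        s->current_epoch_nr = req_epoch;
        s->current_epoch_writes = 0;
    }
    s->current_epoch_writes++;
    return need_barrier;
}

/* Idle check while waiting for work: the resource-wide epoch was closed
 * already, but no new request has been queued on this connection. */
static bool sender_idle_needs_barrier(struct sender_state *s,
                                      unsigned int current_tle_nr,
                                      unsigned int *barrier_nr)
{
    if (s->seen_any_write_yet &&
        s->current_epoch_nr != current_tle_nr &&
        s->current_epoch_writes > 0) {
        *barrier_nr = s->current_epoch_nr;
        s->current_epoch_nr = current_tle_nr;
        s->current_epoch_writes = 0;
        return true;
    }
    return false;
}
```

In this sketch, two writes in epoch 5 followed by a write in epoch 6 yield exactly one barrier, for epoch 5, before the third write is sent.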

drbd: move the drbd_work_queue from drbd_socket to drbd_connection · d5b27b01

Committed by Lars Ellenberg on Nov 14, 2011

cherry-picked and adapted from drbd 9 devel branch

In 8.4, we don't distinguish between "resource work" and "connection
work" yet, we have one worker for both, as we still have only one connection.

We only ever used the "data.work",
no need to keep the "meta.work" around.

Move tconn->data.work to tconn->sender_work.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: transfer log epoch numbers are now per resource · b379c41e

Committed by Lars Ellenberg on Nov 17, 2011

cherry-picked from drbd 9 devel branch.

In preparation of multiple connections, the "barrier number" or
"epoch number" needs to be tracked per-resource, not per connection.
The sequence number space will not be reset anymore.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: rename drbd_restart_write to drbd_restart_request · 9d05e7c4

Committed by Lars Ellenberg on Jul 17, 2012

Meanwhile, this is used to restart failed READ requests as well.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: fix wrong assert in completion/retry path of failed local reads · 629663c9

Committed by Lars Ellenberg on Jun 08, 2012

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: fix local read error hung forever · ab53b90e

Committed by Lars Ellenberg on Jun 08, 2012

The commit
    drbd: simplify retry path of failed READ requests
simplified it too much:
it just did not do anything for local read errors.

Add the missing req_may_be_completed_not_susp() to the
READ_COMPLETED_WITH_ERROR case.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: fix resend/resubmit of frozen IO · 07be15b1

Committed by Lars Ellenberg on May 07, 2012

DRBD can freeze IO, due to fencing policy (fencing resource-and-stonith),
or because we lost access to data (on-no-data-accessible suspend-io).

Resuming from there (re-connect, or re-attach, or explicit admin
intervention) should "just work".

Unfortunately, since the commit

  drbd: Implemented real timeout checking for request processing time

if the re-attach/re-connect did not happen within the timeout (and a
timeout was configured), request_timer_fn() would time out and
detach/disconnect virtually immediately.

This change tracks the most recent attach and connect, and does not
time out within <configured timeout interval> after attach/connect.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
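
The grace period described in this commit message can be illustrated with a small user-space C sketch. All names here (`tick_after()`, `request_timed_out()`, the parameters) are hypothetical; the comparison is written in the spirit of the kernel's wraparound-safe jiffies comparisons, but this is not the code from the commit.

```c
#include <stdbool.h>

/* Wraparound-safe "a is after b" for a free-running tick counter.
 * Assumes the usual two's-complement conversion, as kernel code does. */
static bool tick_after(unsigned long a, unsigned long b)
{
    return (long)(b - a) < 0;
}

/* Decide whether a request that started at req_start has timed out.
 * last_reattach is the tick of the most recent attach or connect;
 * nothing is considered timed out until a full timeout interval has
 * passed since then. A timeout of 0 means "not configured". */
static bool request_timed_out(unsigned long now,
                              unsigned long req_start,
                              unsigned long last_reattach,
                              unsigned long timeout)
{
    if (timeout == 0)
        return false;
    /* Still inside the grace period after attach/connect. */
    if (!tick_after(now, last_reattach + timeout))
        return false;
    return tick_after(now, req_start + timeout);
}
```

With a timeout of 200 ticks, a request started at tick 100 is not reported as timed out at tick 1000 if the last re-attach happened at tick 900, but it is at tick 1200.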

drbd: complete_conflicting_writes() should not care about connections · 648e46b5

Committed by Lars Ellenberg on Mar 26, 2012

complete_conflicting_writes() should not cause -EIO.
It should not time out either, or care for connection states.

Connection timeout is detected elsewhere, and its cleanup path is
supposed to remove any pending requests or peer_requests from the
write_requests tree.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: simplify retry path of failed READ requests · 4439c400

Committed by Lars Ellenberg on Mar 26, 2012

If a local or remote READ request fails, just push it back to the retry
workqueue.  It will re-enter __drbd_make_request, and be re-assigned to
a suitable local or remote path, or failed, if we do not have access to
good data anymore.

This obsoletes w_read_retry_remote(),
and eliminates two goto...retry blocks in __req_mod().
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: move put_ldev from __req_mod() to the endio callback · 2415308e

Committed by Lars Ellenberg on Mar 26, 2012

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: factor out master_bio completion and drbd_request destruction paths · 6870ca6d

Committed by Lars Ellenberg on Mar 26, 2012

In preparation for multiple connections and reference counting,
separate the code paths for completion of the master bio
and destruction of the request object.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: conflicting writes: make wake_up of waiting peer_requests explicit · 8d6cdd78

Committed by Lars Ellenberg on Mar 26, 2012

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: fix WRITE_ACKED_BY_PEER_AND_SIS to not set RQ_NET_DONE · 0afd569a

Committed by Lars Ellenberg on Mar 26, 2012

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: fix READ_RETRY_REMOTE_CANCELED to not complete if device is suspended · ea9d6729

Committed by Lars Ellenberg on Mar 26, 2012

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: make OOS_HANDED_TO_NETWORK its own case · 27a434fe

Committed by Lars Ellenberg on Mar 26, 2012

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: fix potential deadlock during "restart" of conflicting writes · 2312f0b3

Committed by Lars Ellenberg on Nov 24, 2011

w_restart_write(), run from worker context, calls __drbd_make_request()
and further drbd_al_begin_io(, delegate=true), which then
potentially deadlocks.  The previous patch moved a BUG_ON to expose
such call paths, which would now be triggered.

Also, if we call __drbd_make_request() from resource worker context,
like w_restart_write() did, and that should block for whatever reason
(!drbd_state_is_stable(), resource suspended, ...),
we potentially deadlock the whole resource, as the worker
is needed for state changes and other things.

Create a dedicated retry workqueue for this instead.

Also make sure that inc_ap_bio()/dec_ap_bio() are properly paired,
even if do_retry() needs to retry itself,
in case __drbd_make_request() returns != 0.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: Fix a potential race that could case data inconsistency · 81f44862

Committed by Lars Ellenberg on Mar 26, 2012

When we have a write request and a state change C_WF_BITMAP_S -> C_SYNC_SOURCE
at the same time, and it happens that the line

    remote = remote && drbd_should_do_remote(s);

still sees C_WF_BITMAP_S, and

     send_oos = rw == WRITE && drbd_should_send_oos(s);

already sees C_SYNC_SOURCE, both are 0.

This causes the write to not be mirrored, but marked as out-of-sync on the
Sync_Source node.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
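
The race in this commit message comes from evaluating two predicates against two different reads of the connection state. The user-space sketch below shows the effect and one general way to avoid it, namely deciding both flags from a single snapshot. The commit message does not say how the actual fix was made, so this is an assumption; the enum and the two predicates are simplified stand-ins, not the real `drbd_should_do_remote()` and `drbd_should_send_oos()`.

```c
#include <stdbool.h>

/* Simplified stand-in for the DRBD connection state. */
enum conn_state { C_CONNECTED, C_WF_BITMAP_S, C_SYNC_SOURCE };

/* Simplified stand-in predicates (assumptions, not the kernel code):
 * mirror the write when connected or already sync source, otherwise
 * only tell the peer that the block is out of sync. */
static bool should_do_remote(enum conn_state c)
{
    return c == C_CONNECTED || c == C_SYNC_SOURCE;
}

static bool should_send_oos(enum conn_state c)
{
    return c == C_WF_BITMAP_S;
}

struct write_decision {
    bool remote;
    bool send_oos;
};

/* Both flags are derived from the same snapshot of the state, so for
 * the states above exactly one of them is set. */
static struct write_decision decide_write(enum conn_state snapshot)
{
    struct write_decision d = {
        should_do_remote(snapshot),
        should_send_oos(snapshot),
    };
    return d;
}
```

Calling `should_do_remote(C_WF_BITMAP_S)` and then `should_send_oos(C_SYNC_SOURCE)` reproduces the bad outcome, where both are false and the write is neither mirrored nor announced as out of sync.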

drbd: Consider that bio->bi_bdev might be modified below DRBD · 38a05c16

Committed by Philipp Reisner on Mar 07, 2012

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: add missing part_round_stats to _drbd_start_io_acct · 72585d24

Committed by Philipp Reisner on Feb 23, 2012

Without this, iostat frequently sees bogus svctime and >= 100% "utilization".
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: If disk timeout expires fail only the affected volume · 93f5afe9

Committed by Philipp Reisner on Feb 23, 2012

...and not all volumes of the resource
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: restart loop in drbd_make_request() [prepare for Linux-3.2] · 69b6a3b1

Committed by Philipp Reisner on Dec 20, 2011

With Linux-3.2 generic_make_request() will no longer loop over
the request function until it finally returns 0. Move this
loop into our drbd_make_request() function.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
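
The shape of the loop that this commit moves into the driver can be sketched in user-space C. The callback type, `submit_until_done()` and `demo_request()` are hypothetical stand-ins; the only thing taken from the commit message is the contract that the inner request function is called again until it returns 0.

```c
/* Hypothetical stand-in for the inner request function: it returns
 * nonzero when the same bio has to be submitted again. */
typedef int (*make_request_fn)(void *bio);

/* Keep calling the request function until it reports completion.
 * Returns the number of calls that were needed. */
static unsigned int submit_until_done(make_request_fn fn, void *bio)
{
    unsigned int attempts = 0;

    do {
        attempts++;
    } while (fn(bio) != 0);

    return attempts;
}

/* Demo request function: the "bio" is just a counter of how many more
 * times the request wants to be restarted. */
static int demo_request(void *bio)
{
    int *remaining = bio;

    if (*remaining > 0) {
        (*remaining)--;
        return 1; /* ask the caller to submit again */
    }
    return 0; /* done */
}
```

A request that asks to be restarted twice is therefore submitted three times in total.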

drbd: Consider that read requests could be NEG_ACKEDed · e8cdc343

Committed by Philipp Reisner on Dec 13, 2011

ap_in_flight only counts writes. NEG_ACKED is an action
on a request that might be called for reads and writes.

This bug was there forever, but it becomes much more
relevant with the read balancing code.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: Do not call generic_make_request() while holding req_lock · 57bcb6cf

Committed by Philipp Reisner on Dec 03, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: Load balancing method: striping · d60de03a

Committed by Philipp Reisner on Nov 17, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: Load balancing of read requests · 380207d0

Committed by Philipp Reisner on Nov 11, 2011

New config option for the disk section "read-balancing", with
the values: prefer-local, prefer-remote, round-robin, when-congested-remote.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
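
How the four "read-balancing" values could steer a read is sketched below in user-space C. Only the four policy names come from the commit message; the enum constants, `struct rb_state` and `read_from_remote()` are hypothetical, and the real selection logic in DRBD may differ.

```c
#include <stdbool.h>

/* One constant per value named in the commit message. */
enum read_balancing {
    RB_PREFER_LOCAL,
    RB_PREFER_REMOTE,
    RB_ROUND_ROBIN,
    RB_CONGESTED_REMOTE, /* when-congested-remote */
};

/* Hypothetical view of what the device knows when a read arrives. */
struct rb_state {
    bool local_ok;         /* local disk has good data */
    bool remote_ok;        /* peer is reachable and has good data */
    bool local_congested;  /* local backing device reports congestion */
    unsigned int rr_counter;
};

/* Returns true if the read should be sent to the peer, false if it
 * should go to the local disk. If neither side is usable this returns
 * false and the caller has to fail the request. */
static bool read_from_remote(enum read_balancing rb, struct rb_state *st)
{
    if (!st->remote_ok)
        return false;
    if (!st->local_ok)
        return true;

    switch (rb) {
    case RB_PREFER_REMOTE:
        return true;
    case RB_ROUND_ROBIN:
        /* Alternate: even calls go local, odd calls go remote. */
        return (st->rr_counter++ & 1) != 0;
    case RB_CONGESTED_REMOTE:
        return st->local_congested;
    case RB_PREFER_LOCAL:
    default:
        return false;
    }
}
```

The availability checks come first on purpose: a policy only expresses a preference, and it should never route a read to a side that cannot serve it.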

drbd: Move the CREATE_BARRIER flag from connection to device · 6936fcb4

Committed by Philipp Reisner on Nov 10, 2011

That is necessary since the whole transfer log is per connection (tconn)
and not per device (mdev).

This bug caused list corruption on the worker list. When a barrier is queued
for sending in the context of one device, another device did not see the
CREATE_BARRIER bit, and queued the same object again -> list corruption.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: Silenced compiler warnings · 376694a0

Committed by Philipp Reisner on Nov 07, 2011

Since version 4.6.1 gcc warns about variables that get
a value assigned, but which are never read later on.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: Update some outdated comments to match the code · a209b4ae

Committed by Andreas Gruenbacher on Aug 17, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: detach must not try to abort non-local requests · 97ddb687

Committed by Lars Ellenberg on Jul 15, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: Do not mod_timer() with a past time · 3b03ad59

Committed by Philipp Reisner on Jul 15, 2011

In case we cannot find out why the request takes too long
(happens e.g. when IO got suspended on DRBD level), rearm
the timer with a reasonable value.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: detach from frozen backing device · cdfda633

Committed by Philipp Reisner on Jul 05, 2011

* drbd-8.3:
  documentation: Documented detach's --force and disk's --disk-timeout
  drbd: Implemented the disk-timeout option
  drbd: Force flag for the detach operation
  drbd: Allow new IOs while the local disk in in FAILED state
  drbd: Bitmap IO functions can not return prematurely if the disk breaks
  drbd: Added a kref to bm_aio_ctx
  drbd: Hold a reference to ldev while doing meta-data IO
  drbd: Keep a reference to the bio until the completion handler finished
  drbd: Implemented wait_until_done_or_disk_failure()
  drbd: Replaced md_io_mutex by an atomic: md_io_in_use
  drbd: moved md_io into mdev
  drbd: Immediately allow completion of IOs, that wait for IO completions on a failed disk
  drbd: Keep a reference to barrier acked requests
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: rcu_read_lock() and rcu_dereference() for tconn->net_conf · 44ed167d

Committed by Philipp Reisner on Apr 19, 2011

Removing the get_net_conf()/put_net_conf() calls
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: Runtime changeable wire protocol · 303d1448

Committed by Philipp Reisner on Apr 13, 2011

The wire protocol is no longer a property that is negotiated
between the two peers. It is now expressed with two bits
(DP_SEND_WRITE_ACK and DP_SEND_RECEIVE_ACK) in each data
packet. Therefore the primary node is free to change the
wire protocol at any time without disconnect/reconnect.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
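
A per-packet encoding of the wire protocol could look like the user-space C sketch below. The two flag names come from the commit message; their numeric values, the `enum wire_protocol` constants and `dp_flags_for_protocol()` are assumptions made for illustration. The mapping follows the usual meaning of DRBD protocols A, B and C (no peer acknowledgement required, acknowledgement on receive, acknowledgement on write), which the commit message itself does not spell out.

```c
/* Flag names from the commit message; the bit values are illustrative. */
#define DP_SEND_RECEIVE_ACK 0x080u
#define DP_SEND_WRITE_ACK   0x100u

/* Hypothetical constants for the three DRBD replication protocols. */
enum wire_protocol { PROTO_A = 1, PROTO_B = 2, PROTO_C = 3 };

/* Translate the currently configured protocol into the flags carried
 * by a single data packet. Since the flags travel with every packet,
 * the sender can switch protocols between two packets. */
static unsigned int dp_flags_for_protocol(enum wire_protocol p)
{
    switch (p) {
    case PROTO_C:
        return DP_SEND_WRITE_ACK;   /* peer acks after writing to disk */
    case PROTO_B:
        return DP_SEND_RECEIVE_ACK; /* peer acks after receiving */
    case PROTO_A:
    default:
        return 0;                   /* no ack requested */
    }
}
```

The receiver only has to look at the flags of the packet it is handling, which is why no renegotiation or reconnect is needed when the configuration changes.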

drbd: Use tconn in request_timer_fn() · 8b924f1d

Committed by Philipp Reisner on Mar 01, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: Renamed id_susp(union drbd_state s) to drbd_suspended(struct drbd_conf *) · 2aebfabb

Committed by Philipp Reisner on Mar 28, 2011

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: get rid of bio_split, allow bios of "arbitrary" size · 23361cf3

Committed by Lars Ellenberg on Mar 31, 2011

Where "arbitrary" size is currently 1 MiB, which is the BIO_MAX_SIZE
for architectures with 4k PAGE_CACHE_SIZE (most).
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: preparation commit, pass drbd_interval to drbd_al_begin/complete_io · 181286ad

Committed by Lars Ellenberg on Mar 31, 2011

We want to avoid bio_split for bios crossing activity log boundaries.
So we may need to activate two activity log extents "atomically".
drbd_al_begin_io() needs to know more than just the start sector.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
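
Why the start sector alone is not enough can be shown with a little arithmetic in user-space C. The sketch assumes 512-byte sectors and 4 MiB activity log extents; `struct interval` here is a simplified stand-in for `drbd_interval`, and `al_extent_range()` is a hypothetical helper.

```c
#define SECTOR_SHIFT    9   /* 512-byte sectors */
#define AL_EXTENT_SHIFT 22  /* assumed 4 MiB activity log extents */

/* Simplified stand-in for drbd_interval: start sector plus size. */
struct interval {
    unsigned long long sector; /* first sector of the request */
    unsigned int size;         /* length in bytes */
};

/* Compute the first and last activity log extent touched by the
 * interval. If they differ, both extents have to be activated. */
static void al_extent_range(const struct interval *i,
                            unsigned long long *first,
                            unsigned long long *last)
{
    const unsigned int shift = AL_EXTENT_SHIFT - SECTOR_SHIFT;

    *first = i->sector >> shift;
    if (i->size == 0)
        *last = *first;
    else
        *last = (i->sector + (i->size >> SECTOR_SHIFT) - 1) >> shift;
}
```

Under these assumptions one extent covers 8192 sectors, so a 4 KiB request starting at sector 8190 touches extents 0 and 1, while the same request at sector 0 stays within extent 0.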

drbd: Rename various functions from *_oos_* to *_out_of_sync_* for clarity · 8f7bed77

Committed by Andreas Gruenbacher on Dec 19, 2010

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: drbd_may_do_local_read(): Use bool/true/false · 0da34df0

Committed by Andreas Gruenbacher on Dec 19, 2010

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>

drbd: Remove unnecessary assertion · 1097e9a8

Committed by Andreas Gruenbacher on Dec 17, 2010

This is also checked further below in the same function.
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
