Commits · a0fb3c47a1aae5d38a88ea858f14d6d088d05e07 · openeuler / Kernel

01 May, 2014 · 12 commits

drbd: prepare receiving side for REQ_DISCARD · a0fb3c47

Committed by Lars Ellenberg on Apr 28, 2014

If the receiver needs to serve a discard request on a queue that does
not announce itself as discard-capable, it falls back to a synchronous
blkdev_issue_zeroout().

We expect only "reasonably" large (up to one activity log extent?)
discard requests.

We do this so as not to block the receiver for too long in this
fallback code path, and to not set/clear too many bits inside one
spinlock_irq_save() in drbd_set_in_sync/drbd_set_out_of_sync.

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

a0fb3c47

drbd: allow parallel promote/demote actions · 9e276872

Committed by Lars Ellenberg on Apr 28, 2014

We plan to use genl_family->parallel_ops = true in the future,
but need to review all possible interactions first.

For now, only selectively drop genl_lock() in drbd_set_role(),
serializing instead on our own internal resource->conf_update mutex.

We can now be promoted/demoted on many resources in parallel,
which may significantly improve cluster failover times
when fencing is required.

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

9e276872

drbd: prepare for genetlink parallel_ops · a910b123

Committed by Lars Ellenberg on Apr 28, 2014

Because all administrative requests via genetlink have been globally
serialized via genl_lock(), we used to have one static struct
drbd_config_context "admin context".

Move this on-stack to the respective callback functions.

This will allow us to selectively drop the genl_lock()
(or use genl_family->parallel_ops) in the future.

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

a910b123

drbd: Do not BUG() when connection breaks in a special way · 88ea685d

Committed by Philipp Reisner on Apr 28, 2014

If a 'cluster wide' disconnect executes, the result comes back
from the peer, and the connection breaks immediately after that,
_conn_rq_cond() reports back SS_CW_SUCCESS.
Therefore _conn_request_state() calls conn_set_state(), which
has a BUG() in it.
The BUG() is hit because conn_is_valid_transition() does not like
the transaction, which goes back to is_valid_soft_transition()
returning SS_OUTDATE_WO_CONN.

The fix is to consider an error reported by is_valid_soft_transition()
even when the peer agreed to the transaction.

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

88ea685d

drbd: don't let application IO pre-empt resync too often · e8299874

Committed by Lars Ellenberg on Apr 28, 2014

Before, application IO could pre-empt resync activity
for up to a hardcoded 20 seconds per resync request.
A very busy server could throttle the effective resync bandwidth
down to one request per 20 seconds.

Now, we only let application IO pre-empt resync traffic
while the current resync rate estimate is above c-min-rate.

If you disable the c-min-rate throttle feature (set c-min-rate = 0),
application IO will no longer pre-empt resync traffic at all.

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

e8299874
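
The rule in the commit message above can be read as a small predicate. The following is only a sketch under assumptions: the function name, the parameter names, and the use of a single rate unit for both values are hypothetical and are not taken from the DRBD sources.

```c
#include <stdbool.h>

/*
 * Hypothetical helper, not DRBD code.
 * Both rates use the same unit; c_min_rate == 0 means the
 * throttle feature is disabled.
 */
bool resync_yields_to_app_io(unsigned int c_min_rate,
                             unsigned int resync_rate_estimate,
                             bool app_io_pending)
{
    /* Throttle disabled: application IO never pre-empts resync. */
    if (c_min_rate == 0)
        return false;

    /* No application IO pending, so there is nothing to yield to. */
    if (!app_io_pending)
        return false;

    /* Yield only while resync still runs above the configured minimum. */
    return resync_rate_estimate > c_min_rate;
}
```

With this reading, a busy server can slow resync down to c-min-rate but no further, which is the behaviour the message describes.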

drbd: fix potential distributed deadlock during verify or resync · 0e49d7b0

Committed by Lars Ellenberg on Apr 28, 2014

If max-buffers and socket buffer sizes are "too small" for the chosen
resync rate, this could potentially lead to a distributed deadlock,
which may or may not resolve itself via the "ko-count" and request
timeout mechanism, or could be resolved by forced disconnect.

One option to deal with this is proper configuration:
use larger max-buffers and socket buffer settings,
or reduce the resync rate.

But even with bad configuration we should not deadlock,
but "gracefully" recover.

The issue is avoided by using only up to max-buffers/2 for resync
requests, and by using max-buffers not as a hard limit for data buffer
allocations, but as a throttle threshold only.

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

0e49d7b0
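
One way to picture the two rules from the commit message above (resync limited to max-buffers/2, and max-buffers acting as a throttle threshold rather than a hard limit) is the sketch below. The struct, its fields, and both function names are hypothetical, and counting resync pages separately is an assumption; the real accounting in DRBD may differ.

```c
#include <stdbool.h>

/* Hypothetical accounting state; names are illustrative only. */
struct buffer_account {
    unsigned int max_buffers;   /* configured max-buffers */
    unsigned int pages_in_use;  /* all pages currently allocated */
    unsigned int resync_pages;  /* subset of pages used by resync requests */
};

/* Resync requests may use at most half of max-buffers. */
bool may_submit_resync_request(const struct buffer_account *acct,
                               unsigned int pages_needed)
{
    return acct->resync_pages + pages_needed <= acct->max_buffers / 2;
}

/*
 * Data buffer allocation always succeeds in this model. The return
 * value only tells the caller whether it should slow down, because
 * max-buffers is a throttle threshold and not a hard limit.
 */
bool account_alloc(struct buffer_account *acct, unsigned int pages)
{
    acct->pages_in_use += pages;
    return acct->pages_in_use > acct->max_buffers;
}
```

Because the allocation itself never fails on the limit, a receiver that is over the threshold can still make progress, which is what lets a badly configured setup recover instead of deadlocking.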

drbd: resync: fix too large bursts for very slow rates · 6377b923

Committed by Lars Ellenberg on Apr 28, 2014

While merging adjacent dirty blocks into resync requests,
the resync rate throttle was disregarded.
For very low resync rates, the effective rate may have exceeded
the intended rate by a larger margin.

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

6377b923

drbd: fix stalled resync detection in /proc/drbd · 9ae47260

Committed by Lars Ellenberg on Apr 28, 2014

If we don't make resync or verify progress for "too long",
we want to flag it as "stalled".

Since the 2010 change "use rolling marks for resync speed calculation",
this "too long" was wrong by a factor of HZ.
With HZ 250, it would have been flagged as stalled
after 100 minutes.

Hardcode 3 minutes instead.

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

9ae47260
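
The arithmetic behind the "factor of HZ" remark can be checked with a short sketch. 100 minutes is 6000 seconds, and 6000 / 250 = 24, so the example is consistent with an intended limit of 24 seconds; that figure is inferred from the numbers in the message, not stated in it. The function and macro names below are hypothetical.

```c
/*
 * Hypothetical helper: the effective threshold, in seconds, when a limit
 * meant as "seconds" is accidentally multiplied by HZ (ticks per second).
 */
unsigned long effective_stall_threshold_secs(unsigned long intended_secs,
                                             unsigned long hz)
{
    return intended_secs * hz;
}

/* The replacement described in the commit: a fixed 3-minute limit. */
#define STALL_THRESHOLD_SECS (3UL * 60UL)

/*
 * Returns 1 if no progress was made for longer than the fixed limit.
 * Whether the real comparison is strict is an assumption here.
 */
int resync_is_stalled(unsigned long secs_without_progress)
{
    return secs_without_progress > STALL_THRESHOLD_SECS;
}
```

Calling effective_stall_threshold_secs(24, 250) gives 6000 seconds, the 100 minutes quoted above.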

drbd: Allow online layout change of AL while peer is not connected · cdc6af8d

Committed by Philipp Reisner on Apr 28, 2014

If a user forces the operation, they take the blame in case
the peer does not have enough space. No reason to deny this...

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

cdc6af8d

drbd: Remove drbd_wrappers.h · d40e5671

Committed by Philipp Reisner on Apr 28, 2014

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

d40e5671

drbd: Leave IO suspended if the fence handler finds the peer primary · d7fe69c6

Committed by Philipp Reisner on Apr 28, 2014

Actually we are clearing the susp_fen flag if we are not going
to call a fencing handler.

Setting the susp_fen flag needs to be edge-triggered, not
level-triggered.

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

d7fe69c6

drbd: Break a deadlock while concurrent fencing and establishing a connection · 31007745

Committed by Philipp Reisner on Apr 28, 2014

When we need to outdate the peer while being promoted to primary,
and the connection gets established at the same time, we deadlock
in drbd_try_outdate_peer() when trying to clear the susp_fen
bit.

Fix this by setting the STATE_SENT bit while holding the mutex.

Using drbd_change_state(.. , CS_HARD, ..), which does not block
until STATE_SENT is cleared, is only for clarity. It does
not contribute anything to the fix.

Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

31007745

02 Apr, 2014 · 1 commit

drbd: don't open-code kernel_recvmsg() · f730c848

Committed by Al Viro on Feb 08, 2014

Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

f730c848

22 Feb, 2014 · 1 commit

drbd: Fix future possible NULL pointer dereference · f597f6b8

Committed by Andreas Gruenbacher on Feb 19, 2014

Right now every resource has exactly one connection. But we are preparing
for dynamic connections, i.e. in the future there can be resources without
connections.

However smatch points this out as 'variable dereferenced before check',
which is correct.

This issue was introduced in
drbd: get_one_status(): Iterate over resource->devices instead of connection->peer_devices

Reported-by: Dan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Jens Axboe <axboe@fb.com>

f597f6b8

17 Feb, 2014 · 26 commits

drbd: Add drbd_thread->resource and make drbd_thread->connection optional · 2457b6d5

Committed by Andreas Gruenbacher on Jul 21, 2011

In the drbd_thread "infrastructure" functions, only use the resource instead of
the connection. Make the connection field of drbd_thread optional. This will
allow us to introduce threads which are not associated with a connection.

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

2457b6d5

drbd: Use the right peer device · 6780139c

Committed by Andreas Gruenbacher on Sep 13, 2011

in w_e_ (peer request) callbacks and in peer request I/O completion handlers

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

6780139c

drbd: Remove unused parameter of wire_flags_to_bio() · 81f0ffd2

Committed by Andreas Gruenbacher on Aug 30, 2011

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

81f0ffd2

drbd: Get rid of first_peer_device() in handle_write_conflicts() · e33b32de

Committed by Andreas Gruenbacher on Aug 30, 2011

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

e33b32de

drbd: In the worker thread, process drbd_work instead of drbd_device_work items · 6db7e50a

Committed by Andreas Gruenbacher on Aug 26, 2011

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

6db7e50a

drbd: Turn w_make_ov_request and make_resync_request into "normal" functions · d448a2e1

Committed by Andreas Gruenbacher on Aug 25, 2011

These functions are not used as drbd_work callbacks.

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

d448a2e1

drbd: Make w_make_resync_request() static · 4d010392

Committed by Andreas Gruenbacher on Aug 25, 2011

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

4d010392

drbd: struct drbd_peer_request: Use drbd_work instead of drbd_device_work · a8cd15ba

Committed by Andreas Gruenbacher on Aug 25, 2011

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

a8cd15ba

drbd: struct after_conn_state_chg_work: Use drbd_work instead of drbd_device_work · 4c007603

Committed by Andreas Gruenbacher on Aug 25, 2011

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

4c007603

drbd: Turn conn_flush_workqueue() into drbd_flush_workqueue() · b5043c5e

Committed by Andreas Gruenbacher on Jul 28, 2011

The new function can flush any work queue, not just the work queue of the data
socket of a connection.

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

b5043c5e

drbd: Create a dedicated struct drbd_device_work · 84b8c06b

Committed by Andreas Gruenbacher on Jul 28, 2011

drbd_device_work is a work item that has a reference to a device,
while drbd_work is a more generic work item that does not carry
a reference to a device.

All callbacks get a pointer to a drbd_work instance; those callbacks
that expect a drbd_device_work use the container_of macro to get it.

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

84b8c06b
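
The embedding pattern described in the commit message above can be sketched in plain userspace C. This is a simplified illustration, not the kernel code: the struct layouts are reduced to the minimum, the `minor` field and the callback `w_example_device_callback` are hypothetical, and the `container_of` macro here is a bare stand-in for the kernel's version without its type checking.

```c
#include <stddef.h>

/* Simplified userspace stand-in for the kernel's container_of(). */
#define container_of(ptr, type, member) \
    ((type *)((char *)(ptr) - offsetof(type, member)))

struct drbd_device {
    int minor;  /* illustrative field only */
};

/* Generic work item: carries a callback, no device reference. */
struct drbd_work {
    int (*cb)(struct drbd_work *w, int cancel);
};

/* Work item that additionally references a device. */
struct drbd_device_work {
    struct drbd_work w;
    struct drbd_device *device;
};

/*
 * Hypothetical callback: it receives the generic drbd_work pointer and
 * recovers the enclosing drbd_device_work from it. Returning the minor
 * is only done here to make the result observable.
 */
int w_example_device_callback(struct drbd_work *w, int cancel)
{
    struct drbd_device_work *dw =
        container_of(w, struct drbd_device_work, w);

    if (cancel)
        return 0;
    return dw->device->minor;
}
```

The worker only ever handles `struct drbd_work` pointers, so callbacks that need no device pay nothing for the device reference, while those that do need it recover it with a constant pointer offset.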

drbd: Rename w_prev_work_done -> w_complete · 8682eae9

Committed by Andreas Gruenbacher on Jul 25, 2011

Also move it to drbd_receiver.c and make it static.

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

8682eae9

drbd: Move string function prototypes from linux/drbd.h to drbd_string.h · d9f65229

Committed by Andreas Gruenbacher on Sep 01, 2011

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

d9f65229

drbd: Remove useless assertion · 137975c1

Committed by Andreas Gruenbacher on Aug 24, 2011

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

137975c1

drbd: Kill drbd_task_to_thread_name() · c60b0251

Committed by Andreas Gruenbacher on Aug 10, 2011

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

c60b0251

drbd: Pass a peer device to a number of functions · 69a22773

Committed by Andreas Gruenbacher on Aug 09, 2011

These functions actually operate on a peer device, or
need a peer device:

drbd_prepare_command(), drbd_send_command(), drbd_send_sync_param(),
drbd_send_uuids(), drbd_gen_and_send_sync_uuid(), drbd_send_sizes(),
drbd_send_state(), drbd_send_current_state(), drbd_send_state_req(),
drbd_send_sr_reply(), drbd_send_ack(), drbd_send_drequest(),
drbd_send_drequest_csum(), drbd_send_ov_request(), drbd_send_dblock(),
drbd_send_block(), drbd_send_out_of_sync(), recv_dless_read(),
drbd_drain_block(), receive_bitmap_plain(), recv_resync_read(),
read_in_block(), read_for_csum(), drbd_alloc_pages(), drbd_alloc_peer_req(),
need_peer_seq(), update_peer_seq(), wait_for_and_update_peer_seq(),
drbd_sync_handshake(), drbd_asb_recover_{0,1,2}p(), drbd_connected(),
drbd_disconnected(), decode_bitmap_c() and recv_bm_rle_bits()

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

69a22773

drbd: Replace vnr_to_mdev() with conn_peer_device() · 9f4fe9ad

Committed by Andreas Gruenbacher on Aug 09, 2011

The new function returns a peer device, which allows us to eliminate a few
instances of first_peer_device().

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

9f4fe9ad

drbd: drbd_csum_bio(), drbd_csum_ee(): Remove unused device argument · 79a3c8d3

Committed by Andreas Gruenbacher on Aug 09, 2011

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

79a3c8d3

drbd: Function prototype cleanups · 753c6191

Committed by Andreas Gruenbacher on Jul 22, 2011

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

753c6191

drbd: Rename drbdd_init() -> drbd_receiver() · 8fe60551

Committed by Andreas Gruenbacher on Jul 22, 2011

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

8fe60551

drbd: Move cpu_mask from connection to resource · 625a6ba2

Committed by Andreas Gruenbacher on Jul 22, 2011

Also fix drbd_calc_cpu_mask() to spread resources equally over all online cpus
independent of device minor numbers.

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

625a6ba2

drbd: Define the size of res_opts->cpu_mask in a single place · f44d0436

Committed by Andreas Gruenbacher on Jul 22, 2011

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

f44d0436

drbd: Move susp, susp_nod, susp_fen from connection to resource · 6bbf53ca

Committed by Andreas Gruenbacher on Jul 08, 2011

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

6bbf53ca

drbd: Move conf_mutex from connection to resource · 0500813f

Committed by Andreas Gruenbacher on Jul 07, 2011

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

0500813f

drbd: drbd_adm_prepare(): Only set adm_ctx.connection when a connection is requested · 3ab706fe

Committed by Andreas Gruenbacher on Jul 06, 2011

Also change drbd_adm_connect() to expect a resource after it requested one.

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

3ab706fe

drbd: Iterate over all connections · b6f85ef9

Committed by Andreas Gruenbacher on Jul 06, 2011

in drbd_adm_down(), drbd_create_device() and drbd_set_role()

Signed-off-by: Andreas Gruenbacher <agruen@linbit.com>
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>

b6f85ef9

openeuler / Kernel · synced successfully more than 1 year ago
