提交 0a7f6c7e 编写于 作者: J Jeff Mahoney 提交者: Greg Kroah-Hartman

btrfs: keep trim from interfering with transaction commits

commit fee7acc361314df6561208c2d3c0882d663dd537 upstream.

Commit 499f377f (btrfs: iterate over unused chunk space in FITRIM)
fixed free space trimming, but introduced latency when it was running.
This is due to it pinning the transaction using both a incremented
refcount and holding the commit root sem for the duration of a single
trim operation.

This was to ensure safety but it's unnecessary.  We already hold the the
chunk mutex so we know that the chunk we're using can't be allocated
while we're trimming it.

In order to check against chunks allocated already in this transaction,
we need to check the pending chunks list.  To to that safely without
joining the transaction (or attaching than then having to commit it) we
need to ensure that the dev root's commit root doesn't change underneath
us and the pending chunk lists stays around until we're done with it.

We can ensure the former by holding the commit root sem and the latter
by pinning the transaction.  We do this now, but the critical section
covers the trim operation itself and we don't need to do that.

This patch moves the pinning and unpinning logic into helpers and unpins
the transaction after performing the search and check for pending
chunks.

Limiting the critical section of the transaction pinning improves the
latency substantially on slower storage (e.g. image files over NFS).

Fixes: 499f377f ("btrfs: iterate over unused chunk space in FITRIM")
CC: stable@vger.kernel.org # 4.4+
Signed-off-by: NJeff Mahoney <jeffm@suse.com>
Reviewed-by: NDavid Sterba <dsterba@suse.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>
Signed-off-by: NGreg Kroah-Hartman <gregkh@linuxfoundation.org>
上级 4d0dfd8f
...@@ -10772,14 +10772,16 @@ int btrfs_error_unpin_extent_range(struct btrfs_fs_info *fs_info, ...@@ -10772,14 +10772,16 @@ int btrfs_error_unpin_extent_range(struct btrfs_fs_info *fs_info,
* We don't want a transaction for this since the discard may take a * We don't want a transaction for this since the discard may take a
* substantial amount of time. We don't require that a transaction be * substantial amount of time. We don't require that a transaction be
* running, but we do need to take a running transaction into account * running, but we do need to take a running transaction into account
* to ensure that we're not discarding chunks that were released in * to ensure that we're not discarding chunks that were released or
* the current transaction. * allocated in the current transaction.
* *
* Holding the chunks lock will prevent other threads from allocating * Holding the chunks lock will prevent other threads from allocating
* or releasing chunks, but it won't prevent a running transaction * or releasing chunks, but it won't prevent a running transaction
* from committing and releasing the memory that the pending chunks * from committing and releasing the memory that the pending chunks
* list head uses. For that, we need to take a reference to the * list head uses. For that, we need to take a reference to the
* transaction. * transaction and hold the commit root sem. We only need to hold
* it while performing the free space search since we have already
* held back allocations.
*/ */
static int btrfs_trim_free_extents(struct btrfs_device *device, static int btrfs_trim_free_extents(struct btrfs_device *device,
u64 minlen, u64 *trimmed) u64 minlen, u64 *trimmed)
...@@ -10810,9 +10812,13 @@ static int btrfs_trim_free_extents(struct btrfs_device *device, ...@@ -10810,9 +10812,13 @@ static int btrfs_trim_free_extents(struct btrfs_device *device,
ret = mutex_lock_interruptible(&fs_info->chunk_mutex); ret = mutex_lock_interruptible(&fs_info->chunk_mutex);
if (ret) if (ret)
return ret; break;
down_read(&fs_info->commit_root_sem); ret = down_read_killable(&fs_info->commit_root_sem);
if (ret) {
mutex_unlock(&fs_info->chunk_mutex);
break;
}
spin_lock(&fs_info->trans_lock); spin_lock(&fs_info->trans_lock);
trans = fs_info->running_transaction; trans = fs_info->running_transaction;
...@@ -10820,13 +10826,17 @@ static int btrfs_trim_free_extents(struct btrfs_device *device, ...@@ -10820,13 +10826,17 @@ static int btrfs_trim_free_extents(struct btrfs_device *device,
refcount_inc(&trans->use_count); refcount_inc(&trans->use_count);
spin_unlock(&fs_info->trans_lock); spin_unlock(&fs_info->trans_lock);
if (!trans)
up_read(&fs_info->commit_root_sem);
ret = find_free_dev_extent_start(trans, device, minlen, start, ret = find_free_dev_extent_start(trans, device, minlen, start,
&start, &len); &start, &len);
if (trans) if (trans) {
up_read(&fs_info->commit_root_sem);
btrfs_put_transaction(trans); btrfs_put_transaction(trans);
}
if (ret) { if (ret) {
up_read(&fs_info->commit_root_sem);
mutex_unlock(&fs_info->chunk_mutex); mutex_unlock(&fs_info->chunk_mutex);
if (ret == -ENOSPC) if (ret == -ENOSPC)
ret = 0; ret = 0;
...@@ -10834,7 +10844,6 @@ static int btrfs_trim_free_extents(struct btrfs_device *device, ...@@ -10834,7 +10844,6 @@ static int btrfs_trim_free_extents(struct btrfs_device *device,
} }
ret = btrfs_issue_discard(device->bdev, start, len, &bytes); ret = btrfs_issue_discard(device->bdev, start, len, &bytes);
up_read(&fs_info->commit_root_sem);
mutex_unlock(&fs_info->chunk_mutex); mutex_unlock(&fs_info->chunk_mutex);
if (ret) if (ret)
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册