- 06 Aug 2018, 40 commits
-
By Lu Fengqi

The fs_info can be fetched from the transaction handle directly.

Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>
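A minimal sketch of the pattern behind this and the following cleanups; btrfs_example_op() and do_something() are made-up names, but trans->fs_info is the real field that makes the extra parameter redundant:

        /* Before: fs_info passed explicitly next to the transaction handle. */
        static int btrfs_example_op(struct btrfs_trans_handle *trans,
                                    struct btrfs_fs_info *fs_info, u64 bytenr)
        {
                return do_something(fs_info, bytenr);
        }

        /* After: the handle already records which fs_info it was started on. */
        static int btrfs_example_op(struct btrfs_trans_handle *trans, u64 bytenr)
        {
                struct btrfs_fs_info *fs_info = trans->fs_info;

                return do_something(fs_info, bytenr);
        }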
-
By Lu Fengqi

It can be fetched from the transaction handle.

Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Lu Fengqi

It can be fetched from the transaction handle. In addition, remove the WARN_ON(trans == NULL) because it's not possible to hit this condition.

Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Lu Fengqi

It can be fetched from the transaction handle.

Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Lu Fengqi

It can be fetched from the transaction handle.

Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Lu Fengqi

It can be fetched from the transaction handle.

Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Lu Fengqi

It can be fetched from the transaction handle.

Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Lu Fengqi

It can be fetched from the transaction handle.

Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Lu Fengqi

It can be fetched from the transaction handle.

Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Lu Fengqi

They can be fetched from the transaction handle.

Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Lu Fengqi

It can be fetched from the transaction handle.

Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Lu Fengqi

It can be fetched from the transaction handle.

Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Lu Fengqi

It can be fetched from the transaction handle.

Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Lu Fengqi

It can be fetched from the transaction handle.

Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Lu Fengqi

It can be fetched from the transaction handle.

Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Anand Jain

Rename btrfs_parse_early_options() to btrfs_parse_device_options(). btrfs_parse_early_options() parses the -o device options and scans the devices provided, so the new name describes its action. It also brings the function name in line with btrfs_parse_subvol_options(). No functional changes.

Signed-off-by: Anand Jain <anand.jain@oracle.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Lu Fengqi

Since commit 0b246afa ("btrfs: root->fs_info cleanup, add fs_info convenience variables"), the srcroot is no longer used to get fs_info::nodesize. In fact, it could already have been dropped after commit 707e8a07 ("btrfs: use nodesize everywhere, kill leafsize").

Signed-off-by: Lu Fengqi <lufq.fnst@cn.fujitsu.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Qu Wenruo

Introduce a small helper, btrfs_mark_bg_unused(), to acquire the necessary locks and add a block group to the unused_bgs list. No functional changes; only 3 callers are involved.

Signed-off-by: Qu Wenruo <wqu@suse.com>
Reviewed-by: Nikolay Borisov <nborisov@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>
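For orientation, a sketch of what such a helper plausibly looks like, based only on the description above (the real patch may differ in details such as reference counting or tracepoints):

        void btrfs_mark_bg_unused(struct btrfs_block_group_cache *bg)
        {
                struct btrfs_fs_info *fs_info = bg->fs_info;

                spin_lock(&fs_info->unused_bgs_lock);
                /* Queue the block group only once. */
                if (list_empty(&bg->bg_list))
                        list_add_tail(&bg->bg_list, &fs_info->unused_bgs);
                spin_unlock(&fs_info->unused_bgs_lock);
        }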
-
By Nikolay Borisov

do_chunk_alloc implements logic to detect whether there is currently a pending chunk allocation (by means of space_info->chunk_alloc being set) and if so it loops around to the 'again' label. Additionally, based on the state of the space_info (e.g. whether it's full or not) and the return value of should_alloc_chunk() it decides whether this is a "hard" error (ENOSPC) or we can just return 0.

This patch refactors all of this:

1. Put order to the scattered ifs handling the various cases in easy-to-read if {} else if {} branches. This makes clear the various cases we are interested in handling.

2. Call should_alloc_chunk only once and use the result in the if/else if constructs. All of this is done under space_info->lock, so even before this change the multiple calls of should_alloc_chunk were unnecessary.

3. Rewrite the "do {} while ()" loop currently implemented via the label into an explicit loop construct.

4. Move the mutex locking to the case where the caller is the one doing the allocation. For the case where the caller needs to wait for a concurrent allocation, introduce a pair of mutex_lock/mutex_unlock to act as a barrier and reword the comment.

5. Switch local vars to bool type where pertinent.

All in all this shouldn't introduce any functional changes.

Signed-off-by: Nikolay Borisov <nborisov@suse.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>
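Schematically, points 1-3 turn the 'again' label into an explicit loop driven by a single should_alloc_chunk() call. A simplified sketch of the resulting shape, not the actual diff:

        bool wait_for_alloc = false;
        bool should_alloc;
        int ret;

        do {
                spin_lock(&space_info->lock);
                should_alloc = should_alloc_chunk(fs_info, space_info, force);
                if (space_info->full) {
                        /* A hard error only if the caller really needed a chunk. */
                        ret = should_alloc ? -ENOSPC : 0;
                        spin_unlock(&space_info->lock);
                        return ret;
                } else if (!should_alloc) {
                        spin_unlock(&space_info->lock);
                        return 0;
                } else if (space_info->chunk_alloc) {
                        /* Someone else is allocating; wait on the mutex and retry. */
                        wait_for_alloc = true;
                        spin_unlock(&space_info->lock);
                        mutex_lock(&fs_info->chunk_mutex);
                        mutex_unlock(&fs_info->chunk_mutex);
                } else {
                        /* We are the ones doing the allocation. */
                        space_info->chunk_alloc = 1;
                        wait_for_alloc = false;
                        spin_unlock(&space_info->lock);
                }
        } while (wait_for_alloc);

        mutex_lock(&fs_info->chunk_mutex);
        /* ... carry out the actual chunk allocation ... */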
-
By Ethan Lien

In commit b150a4f1 ("Btrfs: use a percpu to keep track of possibly pinned bytes") we use total_bytes_pinned to track how many bytes we are going to free in this transaction. When we are close to ENOSPC, we check it and know whether we can make the allocation by committing the current transaction. For every data/metadata extent we are going to free, we add to total_bytes_pinned in btrfs_free_extent() and btrfs_free_tree_block(), and release it in unpin_extent_range() when we finish the transaction. So this is a variable we frequently update but rarely read - just the suitable use case for a percpu_counter.

But currently we update total_bytes_pinned with the default batch size of 32, making every update essentially a spin-lock-protected update. Since every spin lock/unlock operation involves syncing a globally used variable and some kind of barrier in an SMP system, this is more expensive than using total_bytes_pinned as a simple atomic64_t.

So fix this by using a customized batch size. Since we only read total_bytes_pinned when we are close to ENOSPC and fail to allocate a new chunk, we can use a really large batch size and have nearly no penalty in most cases.

[Test]
We tested the patch on a 4-core x86 machine:
1. fallocate a 16GiB test file
2. take a snapshot (so all following writes will be COW)
3. run a 180 sec, 4 jobs, 4K random write fio on the test file

We also added a temporary lockdep class on the percpu_counter's spin lock used by total_bytes_pinned to track it with lock_stat.

[Results]
unpatched:
lock_stat version 0.4
class name                 con-bounces contentions waittime-min waittime-max waittime-total waittime-avg acq-bounces acquisitions holdtime-min holdtime-max holdtime-total holdtime-avg
total_bytes_pinned_percpu:          82          82         0.21         0.61          29.46         0.36      298340       635973         0.09        11.01      173476.25         0.27

patched:
lock_stat version 0.4
class name                 con-bounces contentions waittime-min waittime-max waittime-total waittime-avg acq-bounces acquisitions holdtime-min holdtime-max holdtime-total holdtime-avg
total_bytes_pinned_percpu:           1           1         0.62         0.62           0.62         0.62       13601        31542         0.14         9.61       11016.90         0.35

[Analysis]
Since the spin lock only protects a single in-memory variable, the contentions (number of lock acquisitions that had to wait) in both the unpatched and patched version are low. But when we look at acquisitions and acq-bounces, we get much lower counts in the patched version. Here the most important metric is acq-bounces. It means how many times the lock gets transferred between different cpus, so the patch can really reduce cacheline bouncing of the spin lock (and the global counter of the percpu_counter) in an SMP system.

Fixes: b150a4f1 ("Btrfs: use a percpu to keep track of possibly pinned bytes")
Signed-off-by: Ethan Lien <ethanlien@synology.com>
Reviewed-by: Nikolay Borisov <nborisov@suse.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>
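The mechanism is the batch argument of the percpu counter API; a simplified illustration where SOME_LARGE_BATCH and track_pinned_bytes() are placeholders, not names or values from the patch:

        #include <linux/percpu_counter.h>

        #define SOME_LARGE_BATCH        (1024 * 1024)   /* placeholder value */

        static void track_pinned_bytes(struct percpu_counter *total_bytes_pinned, s64 bytes)
        {
                /*
                 * With a large batch the update usually stays in the per-cpu
                 * part of the counter and rarely takes the shared spin lock,
                 * unlike an update done with the small default batch.
                 */
                percpu_counter_add_batch(total_bytes_pinned, bytes, SOME_LARGE_BATCH);
        }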
-
By Ethan Lien

We use a customized, nodesize-based batch value to update dirty_metadata_bytes. We should also use the batch version of the compare function, or we will easily take the fast path and get a wrong result from percpu_counter_compare().

Fixes: e2d84521 ("Btrfs: use percpu counter for dirty metadata count")
CC: stable@vger.kernel.org # 4.4+
Signed-off-by: Ethan Lien <ethanlien@synology.com>
Reviewed-by: Nikolay Borisov <nborisov@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>
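In API terms the fix is to pass the same batch to the comparison that is used for the updates; a sketch of the idea, with the threshold check shown only as an illustration built on the existing dirty_metadata_bytes counter and its batch:

        /* Updates elsewhere use the nodesize-based batch: */
        percpu_counter_add_batch(&fs_info->dirty_metadata_bytes, len,
                                 fs_info->dirty_metadata_batch);

        /*
         * The comparison must use the same batch, otherwise the fast path
         * may trust a per-cpu approximation that is off by up to
         * batch * num_online_cpus().
         */
        if (__percpu_counter_compare(&fs_info->dirty_metadata_bytes,
                                     BTRFS_DIRTY_METADATA_THRESH,
                                     fs_info->dirty_metadata_batch) > 0) {
                /* ... throttle / write back dirty metadata ... */
        }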
-
By Gu Jinxiang

Return the device pointer (with IS_ERR semantics) from btrfs_scan_one_device so we don't have to return it through a pointer argument. And since btrfs_fs_devices can be obtained from btrfs_device, return that.

Signed-off-by: Gu Jinxiang <gujx@cn.fujitsu.com>
Reviewed-by: Nikolay Borisov <nborisov@suse.com>
Reviewed-by: David Sterba <dsterba@suse.com>
[ fixed conflicts after recent changes to btrfs_scan_one_device ]
Signed-off-by: David Sterba <dsterba@suse.com>
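The new calling convention follows the usual ERR_PTR pattern; a hedged sketch of a caller after the conversion, with error handling simplified and the argument list abbreviated:

        struct btrfs_device *device;
        struct btrfs_fs_devices *fs_devices;

        device = btrfs_scan_one_device(path, flags, holder);
        if (IS_ERR(device))
                return PTR_ERR(device);

        /* No output parameter needed; the fs_devices hangs off the device. */
        fs_devices = device->fs_devices;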
-
By Gu Jinxiang

fs_devices is always passed to btrfs_scan_one_device, which overwrites it. In the call stack below, fs_devices is passed to btrfs_scan_one_device from btrfs_mount_root, but the resulting fs_devices is not used in btrfs_mount_root:

btrfs_mount_root
  btrfs_parse_early_options
    btrfs_scan_one_device

So it is not necessary to pass fs_devices from btrfs_mount_root; using a local variable in btrfs_parse_early_options is enough.

Signed-off-by: Gu Jinxiang <gujx@cn.fujitsu.com>
Reviewed-by: Anand Jain <Anand.Jain@oracle.com>
Reviewed-by: Nikolay Borisov <nborisov@suse.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By David Sterba

Technically this extends the critical section covered by uuid_mutex to:

- parse early mount options -- here we can call device scan on paths that can be passed as 'device=/dev/...'
- scan the device passed to mount
- open the devices related to the fs_devices -- this increases fs_devices::opened

The race can happen when mount calls one of the scans and there's another one called eg. by mkfs or 'btrfs dev scan':

Mount                              Scan
-----                              ----
scan_one_device (dev1, fsid1)
                                   scan_one_device (dev2, fsid1)
                                   add the device
                                   free stale devices of fsid1
                                     fs_devices::opened == 0
                                     find fsid1:dev1
                                     free fsid1:dev1
                                     if it's the last one, free
                                     fs_devices of fsid1 too

open_devices (dev1, fsid1)
  dev1 not found

When fixed, the uuid_mutex will make sure that mount increases fs_devices::opened and this will not be touched by the racing scan ioctl.

Reported-and-tested-by: syzbot+909a5177749d7990ffa4@syzkaller.appspotmail.com
Reported-and-tested-by: syzbot+ceb2606025ec1cc3479c@syzkaller.appspotmail.com
Reviewed-by: Anand Jain <anand.jain@oracle.com>
Signed-off-by: David Sterba <dsterba@suse.com>
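Schematically, the mount path now keeps uuid_mutex held across all three steps, so a concurrent scan cannot free the just-scanned device in between. A sketch with approximate argument lists, not the actual btrfs_mount_root code (btrfs_parse_device_options is the post-rename name of btrfs_parse_early_options from the commit above):

        mutex_lock(&uuid_mutex);

        /* 1) parse -o device= options, which may register more devices */
        error = btrfs_parse_device_options(data, mode, fs_type);

        /* 2) scan the device being mounted */
        if (!error) {
                device = btrfs_scan_one_device(device_name, mode, fs_type);
                if (IS_ERR(device))
                        error = PTR_ERR(device);
        }

        /* 3) open the devices; this is what bumps fs_devices::opened */
        if (!error)
                error = btrfs_open_devices(device->fs_devices, mode, fs_type);

        mutex_unlock(&uuid_mutex);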
-
By David Sterba

In preparation to take a big lock, move resource initialization before the critical section. It's not obvious from the diff; the desired order is:

- initialize mount security options
- allocate temporary fs_info
- allocate superblock buffers

Reviewed-by: Anand Jain <anand.jain@oracle.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By David Sterba

Preparatory work to fix the race between mount and device scan. btrfs_parse_early_options calls the device scan from mount and we'll need to let mount completely manage the critical section.

Reviewed-by: Anand Jain <anand.jain@oracle.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By David Sterba

Preparatory work to fix the race between mount and device scan. The callers will have to manage the critical section, eg. mount wants to scan and then call btrfs_open_devices without the ioctl scan walking in and modifying the fs devices in the meantime.

Reviewed-by: Anand Jain <anand.jain@oracle.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By David Sterba

Preparatory work to fix the race between mount and device scan. The callers will have to manage the critical section, eg. mount wants to scan and then call btrfs_open_devices without the ioctl scan walking in and modifying the fs devices in the meantime.

Reviewed-by: Anand Jain <anand.jain@oracle.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Anand Jain

btrfs_free_stale_devices() finds a stale (not opened) device matching the given path in the fs_uuids list. We are already under uuid_mutex, so when we check each fs_devices, hold the device_list_mutex too.

Signed-off-by: Anand Jain <anand.jain@oracle.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Anand Jain

Over the years we have named %fs_devices and %device to represent the struct btrfs_fs_devices and the struct btrfs_device, so follow the same scheme here too. No functional changes.

Signed-off-by: Anand Jain <anand.jain@oracle.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Anand Jain

Make sure the device_list_lock is held the whole time:

* when the device is being looked up
* a new device is initialized and put on the list
* the list counters are updated (fs_devices::opened, fs_devices::total_devices)

Signed-off-by: Anand Jain <anand.jain@oracle.com>
[ update changelog ]
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>
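A sketch of the locking scope this describes inside device_list_add; the lookup and device initialization are reduced to comments, the total_devices local is assumed to come from the on-disk superblock, and field names follow struct btrfs_fs_devices:

        mutex_lock(&fs_devices->device_list_mutex);

        /* look up the device by devid/uuid under the same lock ... */
        /* ... if not found, allocate and initialize a new btrfs_device ... */

        list_add_rcu(&device->dev_list, &fs_devices->devices);
        fs_devices->total_devices = total_devices;      /* from the superblock */

        mutex_unlock(&fs_devices->device_list_mutex);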
-
By Anand Jain

btrfs_free_stale_devices() looks for a device path reused for another filesystem and deletes the older fs_devices::device entry. In preparation to handle locking in device_list_add, move btrfs_free_stale_devices outside, as these two functions serve a different purpose.

Signed-off-by: Anand Jain <anand.jain@oracle.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Nikolay Borisov

Since commit 88c14590 ("btrfs: use RCU in btrfs_show_devname for device list traversal") btrfs_show_devname no longer takes device_list_mutex. As such, the deadlock that 0ccd0528 ("btrfs: fix a possible umount deadlock") aimed to fix no longer exists, so we can free the devices immediately and remove the code that does the pending work.

Signed-off-by: Nikolay Borisov <nborisov@suse.com>
Reviewed-by: Anand Jain <anand.jain@oracle.com>
[ update changelog ]
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Qu Wenruo

This function has not been used since the alloc_start parameter was obsoleted in commit 0d0c71b3 ("btrfs: obsolete and remove mount option alloc_start").

Signed-off-by: Qu Wenruo <wqu@suse.com>
Reviewed-by: Nikolay Borisov <nborisov@suse.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Gu Jinxiang

The flags parameter has not been used since commit d7407606 ("btrfs: split parse_early_options() in two"), so remove it.

Signed-off-by: Gu Jinxiang <gujx@cn.fujitsu.com>
Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Anand Jain

In case of deleting the seed device, the %cur_devices (seed) and the %fs_devices (parent) are different. As the parent fs_devices::total_devices also maintains the total number of devices including the seed device, decrement its in-memory value on a successful seed delete. We already update the corresponding on-disk btrfs_super_block::number_devices value.

Signed-off-by: Anand Jain <anand.jain@oracle.com>
Signed-off-by: David Sterba <dsterba@suse.com>
-
By Nikolay Borisov

Commit 5d23515b ("btrfs: Move qgroup rescan on quota enable to btrfs_quota_enable") not only resulted in easier-to-follow code but it also introduced a subtle bug. It changed the timing of when the initial quota rescan happens:

- before that commit: it would happen after the transaction commit had occurred
- after that commit: it might happen before the transaction was committed

This results in failure to correctly rescan the quota, since there could be data which is still not committed on disk. This patch aims to fix this by moving the transaction creation/commit inside btrfs_quota_enable, which allows scheduling the quota rescan after the transaction has been committed.

Fixes: 5d23515b ("btrfs: Move qgroup rescan on quota enable to btrfs_quota_enable")
Reported-by: Misono Tomohiro <misono.tomohiro@jp.fujitsu.com>
Link: https://marc.info/?l=linux-btrfs&m=152999289017582
Signed-off-by: Nikolay Borisov <nborisov@suse.com>
Signed-off-by: David Sterba <dsterba@suse.com>
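A sketch of the resulting ordering inside btrfs_quota_enable; the rescan kick-off is simplified to a single call and the real function does more work between these steps:

        int btrfs_quota_enable(struct btrfs_fs_info *fs_info)
        {
                struct btrfs_trans_handle *trans;
                int ret;

                trans = btrfs_start_transaction(fs_info->tree_root, 2);
                if (IS_ERR(trans))
                        return PTR_ERR(trans);

                /* ... create the quota tree and the initial qgroup items ... */

                ret = btrfs_commit_transaction(trans);
                if (ret)
                        return ret;

                /*
                 * Only now is everything the rescan depends on committed to
                 * disk, so this is the earliest safe point to schedule it.
                 */
                return btrfs_qgroup_rescan(fs_info);
        }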
-
By David Sterba

Add fall-back code to catch failure of full_stripe_write. Proper error handling from inside run_plug would need more code restructuring, as it's called at arbitrary points by the IO scheduler.

Signed-off-by: David Sterba <dsterba@suse.com>
-
By David Sterba

There's only one call site of the unlocked helper, so it can be folded into the caller.

Signed-off-by: David Sterba <dsterba@suse.com>
-
By David Sterba

Signed-off-by: David Sterba <dsterba@suse.com>
-