Commits · 14524a846eb52c18438e9bd5eb8cf1431fd57b44 · openeuler / raspberrypi-kernel

22 Oct, 2015 40 commits

btrfs: fallocate: Add support to accurate qgroup reserve · 14524a84

Committed by Qu Wenruo on Sep 08, 2015

fallocate now does an accurate qgroup reserved space check, unlike the
old method, which always reserved the whole length of the range.

With this patch, fallocate will:
1) Iterate over the desired range and mark it in the data rsv map
   Only ranges that are going to be allocated are recorded in the data
   rsv map and have space reserved.
   Already allocated ranges (normal/prealloc extents) are skipped.
   Also, record the marked ranges in a new list for later use.

2) If 1) succeeded, do the real file extent allocation.
   At file extent allocation time, the corresponding range is
   removed from the data rsv map.

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>
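
The range walk in step 1 can be sketched in user-space C. This is only an illustration under stated assumptions: `struct extent` and `bytes_to_reserve()` are hypothetical names, and the extent list is assumed to be sorted and non-overlapping. It is not the kernel implementation.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for an already allocated (normal/prealloc) extent. */
struct extent {
	uint64_t start;
	uint64_t len;
};

/*
 * Walk [start, start + len) against a sorted, non-overlapping extent list
 * and return only the bytes that fall into holes, i.e. the bytes that
 * would still need a reservation.
 */
uint64_t bytes_to_reserve(const struct extent *ext, size_t n,
			  uint64_t start, uint64_t len)
{
	uint64_t end = start + len;
	uint64_t cur = start;
	uint64_t need = 0;

	assert(ext != NULL || n == 0);
	for (size_t i = 0; i < n && cur < end; i++) {
		uint64_t e_start = ext[i].start;
		uint64_t e_end = e_start + ext[i].len;

		if (e_end <= cur)
			continue; /* extent lies before the cursor */
		if (e_start >= end)
			break; /* extent lies past the range */
		if (e_start > cur)
			need += e_start - cur; /* hole in front of this extent */
		cur = e_end; /* skip the allocated part */
	}
	if (cur < end)
		need += end - cur; /* trailing hole */
	return need;
}
```

With extents at [10, 30) and [50, 80), a request for [0, 100) would reserve only the three holes rather than the full 100 bytes.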

14524a84

btrfs: qgroup: Add new trace point for qgroup data reserve · 81fb6f77

Committed by Qu Wenruo on Sep 28, 2015

Each qgroup data reserve now has its own ftrace event for better
debugging.

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>

81fb6f77

btrfs: Add handler for invalidate page · b9d0b389

Committed by Qu Wenruo on Sep 29, 2015

For btrfs_invalidatepage() and its variant evict_inode_truncate_page(),
there will be pages that never reach disk.
In that case, their reserved space won't be released or freed by either
finish_ordered_io() or the delayed_ref handler.

So we must free their qgroup reserved space, or we will leak reserved
space again.

So this patch calls btrfs_qgroup_free_data() for invalidatepage() and
its variant evict_inode_truncate_page().

Due to the nature of the new btrfs_qgroup_reserve/free_data(), reserved
space is only reserved or freed once, so for pages which are already
flushed to disk, their reserved space is released and freed by the
delayed_ref handler.

Double free won't be a problem.

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>
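
Why a second free is harmless can be shown with a small user-space model. The sketch assumes a fixed-size per-block flag array in place of the real range map; `struct rsv_map`, `rsv_reserve()` and `rsv_free()` are hypothetical names chosen for illustration.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define NR_BLOCKS 64 /* hypothetical: one flag per block of a small file */

struct rsv_map {
	bool marked[NR_BLOCKS]; /* stand-in for the per-inode reserve map */
	uint64_t reserved; /* stand-in for the qgroup reserved counter */
};

/* Mark [first, first + count); charge only blocks that were not marked yet. */
uint64_t rsv_reserve(struct rsv_map *m, size_t first, size_t count)
{
	uint64_t changed = 0;

	assert(first <= NR_BLOCKS && count <= NR_BLOCKS - first);
	for (size_t i = first; i < first + count; i++) {
		if (!m->marked[i]) {
			m->marked[i] = true;
			changed++;
		}
	}
	m->reserved += changed;
	return changed;
}

/* Unmark the range; give back only blocks that were still marked. */
uint64_t rsv_free(struct rsv_map *m, size_t first, size_t count)
{
	uint64_t changed = 0;

	assert(first <= NR_BLOCKS && count <= NR_BLOCKS - first);
	for (size_t i = first; i < first + count; i++) {
		if (m->marked[i]) {
			m->marked[i] = false;
			changed++;
		}
	}
	m->reserved -= changed;
	return changed;
}
```

Because the counter only moves by the number of flags that actually flipped, freeing the same range twice returns zero the second time and leaves the counter untouched.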

b9d0b389

btrfs: qgroup: Add handler for NOCOW and inline · 94ed938a

Committed by Qu Wenruo on Sep 08, 2015

For the NOCOW and inline cases, no delayed_ref is created, so we should
free their reserved data space at the proper time (finish_ordered_io for
NOCOW and cow_file_inline for inline).

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>

94ed938a

btrfs: qgroup: Cleanup old inaccurate facilities · 7cf5b976

Committed by Qu Wenruo on Sep 08, 2015

Clean up the old facilities which use the old btrfs_qgroup_reserve()
function call, replace them with the newer version, and remove the "__"
prefix from them.

Also, make the btrfs_qgroup_reserve/free() functions private, as they
are now only used inside the qgroup code.

Now the whole btrfs qgroup is switched to the new reserve facilities.

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>

7cf5b976

btrfs: extent-tree: Switch to new delalloc space reserve and release · df480633

Committed by Qu Wenruo on Sep 08, 2015

Use the new __btrfs_delalloc_reserve_space() and
__btrfs_delalloc_release_space() to reserve and release space for
delalloc.

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>

df480633

btrfs: extent-tree: Add new version of btrfs_delalloc_reserve/release_space · 1ada3a62

Committed by Qu Wenruo on Sep 08, 2015

Add new versions of the btrfs_delalloc_reserve_space() and
btrfs_delalloc_release_space() functions, which support accurate qgroup
reserve.

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>

1ada3a62

btrfs: extent-tree: Switch to new check_data_free_space and free_reserved_data_space · d9d8b2a5

Committed by Qu Wenruo on Sep 08, 2015

Use the new reserve/free functions for buffered write and inode cache.

For the buffered write case, a nodatacow write won't increase quota
accounting. So unlike the old behavior, which reserved before checking
nocow, we now check nocow first and only reserve data space if we can't
do a nocow write.

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>
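
The new ordering for buffered writes can be sketched as follows. This is a simplified user-space model, not the kernel code path: `struct quota`, `quota_reserve()` and `prepare_buffered_write()` are hypothetical names, and the nocow decision is passed in as a plain flag.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stdint.h>

struct quota {
	uint64_t reserved; /* bytes currently reserved */
	uint64_t limit; /* hypothetical hard limit */
};

/* Reserve bytes against the limit; fail with -EDQUOT when it would overflow. */
int quota_reserve(struct quota *q, uint64_t bytes)
{
	assert(q->reserved <= q->limit);
	if (bytes > q->limit - q->reserved)
		return -EDQUOT;
	q->reserved += bytes;
	return 0;
}

/*
 * Prepare a buffered write: a write that can go in place (nocow) does not
 * grow quota usage, so it is checked first and skips the data reservation.
 */
int prepare_buffered_write(struct quota *q, bool can_nocow, uint64_t bytes)
{
	if (can_nocow)
		return 0;
	return quota_reserve(q, bytes);
}
```

In this model a nocow write succeeds even when the quota is exhausted, which is the behavior the reordering is meant to allow.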

d9d8b2a5

btrfs: extent-tree: Add new version of btrfs_check_data_free_space and... · 4ceff079

Committed by Qu Wenruo on Sep 08, 2015

btrfs: extent-tree: Add new version of btrfs_check_data_free_space and btrfs_free_reserved_data_space.

Add new functions __btrfs_check_data_free_space() and
__btrfs_free_reserved_data_space() to work with the new accurate qgroup
reserved space framework.

The new functions will replace the old btrfs_check_data_free_space() and
btrfs_free_reserved_data_space() respectively, but until all the changes
are done, let's just use the new names.

Also, export the internal function btrfs_alloc_data_chunk_ondemand().
The qgroup reserve now requires precise byte counts, and some operations
(like fallocate) can't get the accurate number in advance.
But the data space info check and data chunk allocation don't need to be
that accurate, and can be called at the beginning.

So export it for later operations.

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>

4ceff079

btrfs: qgroup: Use new metadata reservation. · 7174109c

Committed by Qu Wenruo on Sep 08, 2015

As we have the new metadata reservation functions, use them to replace
the old btrfs_qgroup_reserve() call for metadata.

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>

7174109c

btrfs: qgroup: Introduce new functions to reserve/free metadata · 55eeaf05

Committed by Qu Wenruo on Sep 08, 2015

Introduce new functions btrfs_qgroup_reserve/free_meta() to reserve/free
metadata reserved space.

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>

55eeaf05

btrfs: delayed_ref: release and free qgroup reserved at proper timing · 297d750b

Committed by Qu Wenruo on Sep 08, 2015

Qgroup reserved space needs to be released from the inode dirty map and
freed at different times:

1) Release when the metadata is written into the tree
After the corresponding metadata is written into the tree, any newer
write will be COWed (not including the NOCOW case yet).
So we must release its range from the inode dirty range map, or we will
forget to reserve the needed range, causing the accounting to exceed the
limit.

2) Free reserved bytes when the delayed ref is run
When delayed refs are run, qgroup accounting will follow soon and turn
the reserved bytes into rfer/excl numbers.
As run_delayed_refs and qgroup accounting are both done at
commit_transaction() time, it is safe to free reserved space at
run_delayed_ref() time.

With this timing for releasing/freeing reserved space, we should be able
to resolve the long-standing qgroup reserved space leak problem.

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>

297d750b

btrfs: delayed_ref: Add new function to record reserved space into delayed ref · f64d5ca8

Committed by Qu Wenruo on Sep 08, 2015

Add a new function, btrfs_add_delayed_qgroup_reserve(), to record how
much space is reserved for that extent.

As btrfs only accounts qgroup at run_delayed_refs() time, a newly
allocated extent should keep the reserved space until then.

So add the needed function with related members to do it.

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>

f64d5ca8

btrfs: qgroup: Introduce functions to release/free qgroup reserve data space · f695fdce

Committed by Qu Wenruo on Oct 12, 2015

Introduce functions btrfs_qgroup_release/free_data() to release/free
reserved data range.

Release means just removing the data range from io_tree, without
freeing the reserved space.
This is for the normal buffered write case: when data is written to disk
and its metadata is added into the tree, its reserved space should still
be kept until commit_trans().
So in that case, we only release the dirty range, but keep the reserved
space recorded somewhere else until commit_trans().

Free means not only removing the data range, but also freeing the
reserved space.
This is used for the cleanup and invalidate page cases.

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>
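
The difference between release and free can be reduced to two counters. The following is a minimal sketch with hypothetical names (`struct rsv_state`, `range_release()`, `range_free()`); it tracks byte totals only, whereas the real functions operate on ranges in io_tree.

```c
#include <assert.h>
#include <stdint.h>

struct rsv_state {
	uint64_t dirty; /* bytes still marked in the dirty range map */
	uint64_t reserved; /* bytes still counted as reserved */
};

/* Release: drop the range from the dirty map, keep the reservation. */
void range_release(struct rsv_state *s, uint64_t bytes)
{
	assert(bytes <= s->dirty);
	s->dirty -= bytes;
}

/* Free: drop the range from the dirty map and return the reservation. */
void range_free(struct rsv_state *s, uint64_t bytes)
{
	assert(bytes <= s->dirty && bytes <= s->reserved);
	s->dirty -= bytes;
	s->reserved -= bytes;
}
```

After a release, the reserved counter is still charged, which models space that stays reserved until the transaction commit. A free gives the bytes back immediately.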

f695fdce

btrfs: qgroup: Introduce btrfs_qgroup_reserve_data function · 52472553

Committed by Qu Wenruo on Oct 12, 2015

Introduce a new function, btrfs_qgroup_reserve_data(), which uses
io_tree to make the qgroup reserve accurate and avoid leaking reserved
space.

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>

52472553

btrfs: extent_io: Introduce new function clear_record_extent_bits() · fefdc557

Committed by Qu Wenruo on Oct 12, 2015

Introduce a new function, clear_record_extent_bits(), which clears bits
for a given range and records which ranges were cleared and how many
bytes changed in total.

This provides the basis for the later qgroup reserve code.

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>

fefdc557

btrfs: extent_io: Introduce new function set_record_extent_bits · d38ed27f

Committed by Qu Wenruo on Oct 12, 2015

Introduce a new function, set_record_extent_bits(), which not only sets
the given bits, but also records how many bytes were changed, along with
detailed range info.

This is quite important for the later qgroup reserve framework.
The number of bytes will be used to do the qgroup reserve, and the
detailed range info will be used for cleanup in the EDQUOT case.

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>
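
How the recorded byte count and range list work together can be sketched in user-space C. All names here (`struct change_set`, `set_record()`, `reserve_range()`) are hypothetical and only loosely modelled on the description above; a flat flag array stands in for the extent io tree.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define NR_BLOCKS 64 /* hypothetical map size */
#define MAX_RANGES (NR_BLOCKS / 2) /* worst case: every other block changes */

struct range {
	size_t start;
	size_t len;
};

struct change_set {
	uint64_t blocks_changed;
	size_t nr_ranges;
	struct range ranges[MAX_RANGES];
};

/* Set flags in [first, first + count) and record only what really changed. */
void set_record(bool *map, size_t first, size_t count, struct change_set *cs)
{
	assert(first <= NR_BLOCKS && count <= NR_BLOCKS - first);
	cs->blocks_changed = 0;
	cs->nr_ranges = 0;
	for (size_t i = first; i < first + count; i++) {
		struct range *last;

		if (map[i])
			continue; /* already set: nothing to record */
		map[i] = true;
		cs->blocks_changed++;
		last = cs->nr_ranges ? &cs->ranges[cs->nr_ranges - 1] : NULL;
		if (last && last->start + last->len == i)
			last->len++; /* extend the adjacent range */
		else
			cs->ranges[cs->nr_ranges++] = (struct range){ i, 1 };
	}
}

/* Reserve with a limit; on failure undo exactly the recorded ranges. */
int reserve_range(bool *map, uint64_t *reserved, uint64_t limit,
		  size_t first, size_t count)
{
	struct change_set cs;

	assert(*reserved <= limit);
	set_record(map, first, count, &cs);
	if (cs.blocks_changed > limit - *reserved) {
		for (size_t r = 0; r < cs.nr_ranges; r++)
			for (size_t i = 0; i < cs.ranges[r].len; i++)
				map[cs.ranges[r].start + i] = false;
		return -EDQUOT;
	}
	*reserved += cs.blocks_changed;
	return 0;
}
```

The rollback touches only the ranges this call set, so blocks that were already marked before the failed attempt stay marked.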

d38ed27f

btrfs: extent_io: Introduce needed structure for recording set/clear bits · ac467772

Committed by Qu Wenruo on Oct 12, 2015

Add a new structure, extent_change_set, to record how many bytes are
changed in one set/clear_extent_bits() operation, with detailed info on
the changed ranges.

This provides the needed facilities for the later qgroup reserve
framework.

Signed-off-by: Qu Wenruo <quwenruo@cn.fujitsu.com>
Signed-off-by: Chris Mason <clm@fb.com>

ac467772

Merge branch 'integration-4.4' of... · a408365c

Committed by Chris Mason on Oct 21, 2015

Merge branch 'integration-4.4' of git://git.kernel.org/pub/scm/linux/kernel/git/fdmanana/linux into for-linus-4.4

a408365c

Merge branch 'cleanups/for-4.4' of... · a0d58e48

Committed by Chris Mason on Oct 21, 2015

Merge branch 'cleanups/for-4.4' of git://git.kernel.org/pub/scm/linux/kernel/git/kdave/linux into for-linus-4.4

a0d58e48

btrfs: reada: Fix returned errno code · ddd664f4

Committed by Luis de Bethencourt on Oct 20, 2015

reada is using -1 instead of the -ENOMEM defined macro to specify that
a buffer allocation failed. Since the error number is propagated, the
caller will get a -EPERM, which is the wrong error condition.

Also, update the caller to return the exact value from
reada_add_block.

Smatch tool warning:
reada_add_block() warn: returning -1 instead of -ENOMEM is sloppy

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Luis de Bethencourt <luisbg@osg.samsung.com>
Signed-off-by: David Sterba <dsterba@suse.com>
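
The errno pattern can be shown outside the kernel. In this sketch `add_block()`, `add_extent()` and the injectable `alloc_fn` are hypothetical; the allocator is passed in only so that the failure path can be exercised deterministically. On Linux, EPERM is defined as 1, which is why a bare -1 reads as -EPERM to callers.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <stdlib.h>

typedef void *(*alloc_fn)(size_t);

/* Test allocator that always fails. */
void *failing_alloc(size_t size)
{
	(void)size;
	return NULL;
}

int add_block(alloc_fn alloc, size_t size, void **out)
{
	assert(alloc != NULL && out != NULL);
	*out = alloc(size);
	if (!*out)
		return -ENOMEM; /* not -1, which callers would read as -EPERM */
	return 0;
}

/* The caller hands the exact error value through instead of remapping it. */
int add_extent(alloc_fn alloc, size_t size, void **out)
{
	int ret = add_block(alloc, size, out);

	if (ret)
		return ret;
	return 0;
}
```

Passing `malloc` exercises the success path, and passing `failing_alloc` shows -ENOMEM arriving unchanged at the outer caller.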

ddd664f4

btrfs: check-integrity: Fix returned errno codes · 0b8d8ce0

Committed by Luis de Bethencourt on Oct 20, 2015

check-integrity is using -1 instead of the -ENOMEM defined macro to
specify that a buffer allocation failed. Since the error number is
propagated, the caller will get a -EPERM, which is the wrong error
condition.

Also, the smatch tool complains with the following warnings:
btrfsic_process_superblock() warn: returning -1 instead of -ENOMEM is sloppy
btrfsic_read_block() warn: returning -1 instead of -ENOMEM is sloppy

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Luis de Bethencourt <luisbg@osg.samsung.com>
Signed-off-by: David Sterba <dsterba@suse.com>

0b8d8ce0

btrfs: compress: put variables defined per compress type in struct to make cache friendly · d9187649

Committed by Byongho Lee on Oct 14, 2015

The variables below are defined per compress type.
 - struct list_head comp_idle_workspace[BTRFS_COMPRESS_TYPES]
 - spinlock_t comp_workspace_lock[BTRFS_COMPRESS_TYPES]
 - int comp_num_workspace[BTRFS_COMPRESS_TYPES]
 - atomic_t comp_alloc_workspace[BTRFS_COMPRESS_TYPES]
 - wait_queue_head_t comp_workspace_wait[BTRFS_COMPRESS_TYPES]

When these variables are accessed for one compress type, the adjacent
addresses hold the same variable for the other compress types.
So this patch puts these variables in a struct to make them cache
friendly.

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Byongho Lee <bhlee.kernel@gmail.com>
Signed-off-by: David Sterba <dsterba@suse.com>
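
The layout change can be illustrated with stand-in types. Everything in this sketch is an assumption made for illustration: the field types are plain `int` and `void *` rather than the kernel types, the wait queue is omitted, and `COMPRESS_TYPES`, `struct spread_layout`, `struct per_type` and the helper functions are hypothetical names.

```c
#include <assert.h>
#include <stddef.h>

#define COMPRESS_TYPES 2 /* hypothetical number of compression types */

/* Before: one array per variable; the state of one type is spread out. */
struct spread_layout {
	void *idle_ws[COMPRESS_TYPES];
	int lock[COMPRESS_TYPES];
	int num_ws[COMPRESS_TYPES];
	int alloc_ws[COMPRESS_TYPES];
};

/* After: everything one compression type needs sits in one element. */
struct per_type {
	void *idle_ws; /* stand-in for struct list_head */
	int lock; /* stand-in for spinlock_t */
	int num_ws;
	int alloc_ws; /* stand-in for atomic_t */
};

struct per_type grouped[COMPRESS_TYPES];

/* Look up the grouped state of one compression type. */
struct per_type *type_state(int type)
{
	assert(type >= 0 && type < COMPRESS_TYPES);
	return &grouped[type];
}

/* Bytes spanned by the fields of type 0 in the spread layout. */
size_t span_spread(void)
{
	return offsetof(struct spread_layout, alloc_ws) + sizeof(int);
}

/* Bytes spanned by the fields of type 0 in the grouped layout. */
size_t span_grouped(void)
{
	return sizeof(struct per_type);
}
```

With these small stand-ins the gap between the two spans is modest; it grows with the number of types and with the size of the real kernel types.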

d9187649

btrfs: cleanup iterating over prop_handlers array · 619ed392

Committed by Byongho Lee on Oct 08, 2015

This patch eliminates the last item of the prop_handlers array, which is
used to check for the end of the array, and uses the ARRAY_SIZE macro
instead.
Though this is a very tiny optimization, using the ARRAY_SIZE macro is
good practice for iterating over an array.

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Byongho Lee <bhlee.kernel@gmail.com>
Signed-off-by: David Sterba <dsterba@suse.com>
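
A user-space version of the pattern looks like this. The macro below is a simplified equivalent of the kernel's ARRAY_SIZE (without its type check), and `struct prop_handler`, the handler names and `find_handler()` are placeholders rather than the real btrfs definitions.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define ARRAY_SIZE(a) (sizeof(a) / sizeof((a)[0]))

struct prop_handler {
	const char *name;
};

/* No trailing sentinel entry: the bound comes from the array type itself. */
static const struct prop_handler handlers[] = {
	{ "example.one" },
	{ "example.two" },
};

const struct prop_handler *find_handler(const char *name)
{
	assert(name != NULL);
	for (size_t i = 0; i < ARRAY_SIZE(handlers); i++) {
		if (strcmp(handlers[i].name, name) == 0)
			return &handlers[i];
	}
	return NULL;
}
```

Adding or removing an entry changes the loop bound automatically, and no slot is spent on an end marker.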

619ed392

btrfs: fix a comment typo · 8cd1e731

Committed by Geliang Tang on Oct 04, 2015

Just fix a typo in the code comment.

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Geliang Tang <geliangtang@163.com>
Signed-off-by: David Sterba <dsterba@suse.com>

8cd1e731

btrfs: declare rsv_count as unsigned int instead of int · 6e4d6fa1

Committed by Alexandru Moise on Sep 22, 2015

rsv_count ultimately gets passed to start_transaction(), which
now takes an unsigned int as its num_items parameter.
The value of rsv_count should always be positive, so declare it
as unsigned.

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Alexandru Moise <00moses.alexander00@gmail.com>
Signed-off-by: David Sterba <dsterba@suse.com>

6e4d6fa1

btrfs: change num_items type from u64 to unsigned int · 5aed1dd8

Committed by Alexandru Moise on Sep 22, 2015

The value of num_items that start_transaction() ultimately
takes is always a small one, so a 64-bit integer is overkill.

Also change num_items for btrfs_start_transaction() and
btrfs_start_transaction_lflush().

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Alexandru Moise <00moses.alexander00@gmail.com>
Signed-off-by: David Sterba <dsterba@suse.com>

5aed1dd8

btrfs: cleanup btrfs_balance profile validity checks · bdcd3c97

Committed by Alexandru Moise on Sep 22, 2015

Improve readability by generalizing the profile validity checks.

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Alexandru Moise <00moses.alexander00@gmail.com>
Signed-off-by: David Sterba <dsterba@suse.com>

bdcd3c97

btrfs/file.c: remove an unused variable first_index · bb789152

Committed by Shan Hai on Sep 21, 2015

The commit b37392ea ("Btrfs: cleanup unnecessary parameter
and variant of prepare_pages()") makes it redundant.

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Shan Hai <haishan.bai@hotmail.com>
Signed-off-by: David Sterba <dsterba@suse.com>

bb789152

btrfs: use btrfs_raid_array in btrfs_reduce_alloc_profile · 9c170b26

Committed by Zhao Lei on Sep 15, 2015

btrfs_raid_array[] holds the attributes of all raid types.

Using btrfs_raid_array[].devs_min is the best way to do this in
btrfs_reduce_alloc_profile(), instead of using a complex condition for
each raid type.

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Zhao Lei <zhaolei@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>

9c170b26

btrfs: use btrfs_raid_array for btrfs_get_num_tolerated_disk_barrier_failures() · 8789f4fe

Committed by Zhao Lei on Sep 15, 2015

btrfs_raid_array[] is used to define all raid attributes. Use it
to get tolerated_failures in btrfs_get_num_tolerated_disk_barrier_failures(),
instead of a complex condition in the function.

This makes the code simpler and automatically supports other raid types
that may be added in the future.

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Zhao Lei <zhaolei@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>

8789f4fe

btrfs: Move btrfs_raid_array to public · af902047

Committed by Zhao Lei on Sep 15, 2015

This array is used to record the attributes of each raid type.
Make it public, so that many functions can benefit from it.

For example, in num_tolerated_disk_barrier_failures() we can
avoid complex conditions and get a raid attribute simply by
accessing the array.

It can also simplify the code logic, reduce duplicated code, and
increase maintainability.

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Zhao Lei <zhaolei@cn.fujitsu.com>
Signed-off-by: David Sterba <dsterba@suse.com>
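
The table-lookup idea can be sketched as below. The enum, struct and function names are hypothetical, and the attribute values are illustrative rather than copied from the kernel's btrfs_raid_array[].

```c
#include <assert.h>

enum demo_raid {
	DEMO_SINGLE, DEMO_DUP, DEMO_RAID0, DEMO_RAID1,
	DEMO_RAID10, DEMO_RAID5, DEMO_RAID6, DEMO_NR_TYPES
};

struct demo_raid_attr {
	int devs_min; /* minimum devices for the profile */
	int tolerated_failures; /* devices that may fail without data loss */
};

/* Illustrative values only; one row per profile replaces an if/else chain. */
static const struct demo_raid_attr demo_raid_array[DEMO_NR_TYPES] = {
	[DEMO_SINGLE] = { .devs_min = 1, .tolerated_failures = 0 },
	[DEMO_DUP]    = { .devs_min = 1, .tolerated_failures = 0 },
	[DEMO_RAID0]  = { .devs_min = 2, .tolerated_failures = 0 },
	[DEMO_RAID1]  = { .devs_min = 2, .tolerated_failures = 1 },
	[DEMO_RAID10] = { .devs_min = 4, .tolerated_failures = 1 },
	[DEMO_RAID5]  = { .devs_min = 2, .tolerated_failures = 1 },
	[DEMO_RAID6]  = { .devs_min = 3, .tolerated_failures = 2 },
};

int demo_tolerated_failures(enum demo_raid type)
{
	assert((unsigned int)type < DEMO_NR_TYPES);
	return demo_raid_array[type].tolerated_failures;
}

int demo_devs_min(enum demo_raid type)
{
	assert((unsigned int)type < DEMO_NR_TYPES);
	return demo_raid_array[type].devs_min;
}
```

Supporting a new profile then means adding one row to the table; the lookup functions stay unchanged.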

af902047

btrfs: use a single if() statement for one outcome in get_block_rsv() · e9cf439f

Committed by Alexandru Moise on Sep 09, 2015

Rather than have three separate if() statements for the same outcome,
we should just OR them together in the same if() statement.

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Alexandru Moise <00moses.alexander00@gmail.com>
Signed-off-by: David Sterba <dsterba@suse.com>

e9cf439f

btrfs: memset cur_trans->delayed_refs to zero · a099d0fd

Committed by Alexandru Moise on Sep 07, 2015

Use memset() to zero out the btrfs_delayed_ref_root of
btrfs_transaction instead of setting all the members to 0 by hand.

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Alexandru Moise <00moses.alexander00@gmail.com>
Signed-off-by: David Sterba <dsterba@suse.com>

a099d0fd

btrfs: remove unnecessary list_del · 568b1c9c

Committed by Byongho Lee on Sep 01, 2015

We can safely iterate over all list items without using the list_del
macro.
So remove the list_del call.

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Byongho Lee <bhlee.kernel@gmail.com>
Signed-off-by: David Sterba <dsterba@suse.com>

568b1c9c

btrfs: replace unnecessary list_for_each_entry_safe to list_for_each_entry · d7641a49

Committed by Byongho Lee on Sep 01, 2015

No list element is removed while iterating over the list.
So replace list_for_each_entry_safe with list_for_each_entry.

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Byongho Lee <bhlee.kernel@gmail.com>
Signed-off-by: David Sterba <dsterba@suse.com>

d7641a49

btrfs: trimming some start_transaction() code away · f2f767e7

Committed by Alexandru Moise on Aug 27, 2015

Just call kmem_cache_zalloc() instead of calling kmem_cache_alloc().
We're just initializing most fields to 0, false and NULL later on
_anyway_, so to make the code more readable and potentially gain
a bit of performance (completely untested claim), we should fill our
btrfs_trans_handle with zeros on allocation, then just initialize
those five remaining fields (not counting the list_heads) as normal.

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Alexandru Moise <00moses.alexander00@gmail.com>
Signed-off-by: David Sterba <dsterba@suse.com>
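
The same idea in user space, with `calloc()` playing the role of the zeroing allocator. `struct handle` and `handle_alloc()` are hypothetical and much smaller than the real btrfs_trans_handle; the precondition on `transid` is an assumption of this sketch only.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdlib.h>

/* Hypothetical handle, loosely shaped like a transaction handle. */
struct handle {
	unsigned long transid;
	unsigned int num_items;
	bool aborted;
	void *block_rsv;
	int use_count;
};

/* Zero on allocation, then set only the fields that need a nonzero value. */
struct handle *handle_alloc(unsigned long transid)
{
	struct handle *h;

	assert(transid != 0); /* in this sketch 0 means "no transaction" */
	h = calloc(1, sizeof(*h));
	if (!h)
		return NULL;
	h->transid = transid;
	h->use_count = 1;
	return h;
}
```

The fields that should start out as 0, false or NULL need no explicit assignment, so the initialization code lists only the exceptions.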

f2f767e7

btrfs: Fixed declaration of old_len · 0412e58c

Committed by Alexandru Moise on Aug 24, 2015

old_len is used to store the return value of btrfs_item_size_nr().
The return value of btrfs_item_size_nr() is of type u32.
To improve code correctness and avoid mixing signed and unsigned
integers, I've changed old_len to be of type u32 as well.

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Alexandru Moise <00moses.alexander00@gmail.com>
Signed-off-by: David Sterba <dsterba@suse.com>

0412e58c

btrfs: Fixed dsize and last_off declarations · ce0eac2a

Committed by Alexandru Moise on Aug 23, 2015

The return values of btrfs_item_offset_nr and btrfs_item_size_nr are of
type u32. To avoid mixing signed and unsigned integers, we should also
declare dsize and last_off to be of type u32.

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Alexandru Moise <00moses.alexander00@gmail.com>
Signed-off-by: David Sterba <dsterba@suse.com>
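
Why mixing signed and unsigned values is worth avoiding can be seen in a small comparison. The two helpers below are hypothetical; the explicit cast in the first one spells out the conversion that C applies implicitly when an int is compared with a 32-bit unsigned value.

```c
#include <assert.h>
#include <stdint.h>

static_assert((uint32_t)-1 > 0, "uint32_t is an unsigned type");

/* What a mixed int/u32 comparison does: the int is converted to unsigned. */
int less_mixed(int a, uint32_t b)
{
	return (uint32_t)a < b;
}

/* With both operands declared u32 there is nothing to convert. */
int less_u32(uint32_t a, uint32_t b)
{
	return a < b;
}
```

A negative int turns into a very large unsigned number, so `less_mixed(-1, 1)` reports that -1 is not less than 1. Keeping sizes and offsets in the same unsigned type as the accessors that produce them avoids this class of surprise.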

ce0eac2a

Btrfs: btrfs_submit_bio_hook: Use btrfs_wq_endio_type values instead of integer constants · 0d51e28a

Committed by Chandan Rajendra on Jul 27, 2015

btrfs_submit_bio_hook() uses integer constants instead of values from "enum
btrfs_wq_endio_type". Fix this.

Reviewed-by: David Sterba <dsterba@suse.com>
Signed-off-by: Chandan Rajendra <chandan@linux.vnet.ibm.com>
Signed-off-by: David Sterba <dsterba@suse.com>

0d51e28a