提交 · a491abb2e730955df1620165a193678fd775c2d6 · openeuler / raspberrypi-kernel

14 2月, 2017 17 次提交

btrfs: Make btrfs_del_inode_ref take btrfs_inode · a491abb2

由 Nikolay Borisov 提交于 1月 18, 2017

Signed-off-by: NNikolay Borisov <n.borisov.lkml@gmail.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

a491abb2

N
btrfs: Make btrfs_del_dir_entries_in_log take btrfs_inode · 49f34d1f
由 Nikolay Borisov 提交于 1月 18, 2017
```
Signed-off-by: NNikolay Borisov <n.borisov.lkml@gmail.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>
```
49f34d1f

btrfs: Make btrfs_log_new_name take btrfs_inode · 9ca5fbfb

由 Nikolay Borisov 提交于 1月 18, 2017

Signed-off-by: NNikolay Borisov <n.borisov.lkml@gmail.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

9ca5fbfb

btrfs: Make btrfs_inode_in_log take btrfs_inode · 0f8939b8

由 Nikolay Borisov 提交于 1月 18, 2017

Signed-off-by: NNikolay Borisov <n.borisov.lkml@gmail.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

0f8939b8

N
btrfs: Make btrfs_record_unlink_dir take btrfs_inode · 4176bdbf
由 Nikolay Borisov 提交于 1月 18, 2017
```
Signed-off-by: NNikolay Borisov <n.borisov.lkml@gmail.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>
```
4176bdbf
N
btrfs: Make btrfs_inode_delayed_dir_index_count take btrfs_inode · f5cc7b80
由 Nikolay Borisov 提交于 1月 10, 2017
```
Signed-off-by: NNikolay Borisov <n.borisov.lkml@gmail.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>
```
f5cc7b80
N
btrfs: Make btrfs_commit_inode_delayed_inode take btrfs_inode · aa79021f
由 Nikolay Borisov 提交于 1月 10, 2017
```
Signed-off-by: NNikolay Borisov <n.borisov.lkml@gmail.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>
```
aa79021f
N
btrfs: Make btrfs_remove_delayed_node take btrfs_inode · f48d1cf5
由 Nikolay Borisov 提交于 1月 10, 2017
```
Signed-off-by: NNikolay Borisov <n.borisov.lkml@gmail.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>
```
f48d1cf5
N
btrfs: Make btrfs_kill_delayed_inode_items take btrfs_inode · 4ccb5c72
由 Nikolay Borisov 提交于 1月 10, 2017
```
Signed-off-by: NNikolay Borisov <n.borisov.lkml@gmail.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>
```
4ccb5c72
N
btrfs: Make btrfs_delayed_delete_inode_ref take btrfs_inode · e07222c7
由 Nikolay Borisov 提交于 1月 10, 2017
```
Signed-off-by: NNikolay Borisov <n.borisov.lkml@gmail.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>
```
e07222c7
N
btrfs: Make btrfs_delete_delayed_dir_index take btrfs_inode · e67bbbb9
由 Nikolay Borisov 提交于 1月 10, 2017
```
Signed-off-by: NNikolay Borisov <n.borisov.lkml@gmail.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>
```
e67bbbb9

btrfs: Make btrfs_ino take a struct btrfs_inode · 4a0cc7ca

由 Nikolay Borisov 提交于 1月 10, 2017

Currently btrfs_ino takes a struct inode and this causes a lot of
internal btrfs functions which consume this ino to take a VFS inode,
rather than btrfs' own struct btrfs_inode. In order to fix this "leak"
of VFS structs into the internals of btrfs first it's necessary to
eliminate all uses of struct inode for the purpose of inode. This patch
does that by using BTRFS_I to convert an inode to btrfs_inode. With
this problem eliminated subsequent patches will start eliminating the
passing of struct inode altogether, eventually resulting in a lot cleaner
code.
Signed-off-by: NNikolay Borisov <n.borisov.lkml@gmail.com>
[ fix btrfs_get_extent tracepoint prototype ]
Signed-off-by: NDavid Sterba <dsterba@suse.com>

4a0cc7ca

btrfs: add wrapper for counting BTRFS_MAX_EXTENT_SIZE · 823bb20a

由 David Sterba 提交于 1月 04, 2017

The expression is open-coded in several places, this asks for a wrapper.
As we know the MAX_EXTENT fits to u32, we can use the appropirate
division helper. This cascades to the result type updates.

Compiler is clever enough to use shift instead of integer division, so
there's no change in the generated assembly.
Signed-off-by: NDavid Sterba <dsterba@suse.com>

823bb20a

btrfs: remove unused logic of limiting async delalloc pages · 95995dbb

由 David Sterba 提交于 1月 06, 2017

A proposed patch in https://marc.info/?l=linux-btrfs&m=147859791003837
pointed out bad limit threshold in cow_file_range_async, but it turned
out that the whole logic is not necessary and is done by writeback. We
agreed to remove it.
Signed-off-by: NDavid Sterba <dsterba@suse.com>

95995dbb

btrfs: consolidate auto defrag kick off policies · 26d30f85

由 Anand Jain 提交于 12月 19, 2016

As of now writes smaller than 64k for non compressed extents and 16k
for compressed extents inside eof are considered as candidate
for auto defrag, put them together at a place.
Signed-off-by: NAnand Jain <anand.jain@oracle.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

26d30f85

A
btrfs: use BTRFS_COMPRESS_NONE to specify no compression · f74670f7
由 Anand Jain 提交于 12月 06, 2016
```
Signed-off-by: NAnand Jain <anand.jain@oracle.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>
```
f74670f7

btrfs: fix up misleading GFP_NOFS usage in btrfs_releasepage · 3ba7ab22

由 Michal Hocko 提交于 1月 09, 2017

b335b003 ("Btrfs: Avoid using __GFP_HIGHMEM with slab allocator")
has reduced the allocation mask in btrfs_releasepage to GFP_NOFS just
to prevent from giving an unappropriate gfp mask to the slab allocator
deeper down the callchain (in alloc_extent_state). This is wrong for
two reasons a) GFP_NOFS might be just too restrictive for the calling
context b) it is better to tweak the gfp mask down when it needs that.

So just remove the mask tweaking from btrfs_releasepage and move it
down to alloc_extent_state where it is needed.
Signed-off-by: NMichal Hocko <mhocko@suse.com>
Reviewed-by: NDavid Sterba <dsterba@suse.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

3ba7ab22

27 1月, 2017 3 次提交

Btrfs: remove ->{get, set}_acl() from btrfs_dir_ro_inode_operations · 57b59ed2

由 Omar Sandoval 提交于 1月 25, 2017

Subvolume directory inodes can't have ACLs.

Cc: <stable@vger.kernel.org> # 4.9.x
Signed-off-by: NOmar Sandoval <osandov@fb.com>
Reviewed-by: NDavid Sterba <dsterba@suse.com>
Signed-off-by: NChris Mason <clm@fb.com>

57b59ed2

Btrfs: disable xattr operations on subvolume directories · 1fdf4194

由 Omar Sandoval 提交于 1月 25, 2017

When you snapshot a subvolume containing a subvolume, you get a
placeholder directory where the subvolume would be. These directory
inodes have ->i_ops set to btrfs_dir_ro_inode_operations. Previously,
these i_ops didn't include the xattr operation callbacks. The conversion
to xattr_handlers missed this case, leading to bogus attempts to set
xattrs on these inodes. This manifested itself as failures when running
delayed inodes.

To fix this, clear IOP_XATTR in ->i_opflags on these inodes.

Fixes: 6c6ef9f2 ("xattr: Stop calling {get,set,remove}xattr inode operations")
Cc: Andreas Gruenbacher <agruenba@redhat.com>
Reported-by: NChris Murphy <lists@colorremedies.com>
Tested-by: NChris Murphy <lists@colorremedies.com>
Cc: <stable@vger.kernel.org> # 4.9.x
Signed-off-by: NOmar Sandoval <osandov@fb.com>
Reviewed-by: NDavid Sterba <dsterba@suse.com>
Signed-off-by: NChris Mason <clm@fb.com>

1fdf4194

Btrfs: remove old tree_root case in btrfs_read_locked_inode() · 67ade058

由 Omar Sandoval 提交于 1月 25, 2017

As Jeff explained in c2951f32 ("btrfs: remove old tree_root dirent
processing in btrfs_real_readdir()"), supporting this old format is no
longer necessary since the Btrfs magic number has been updated since we
changed to the current format. There are other places where we still
handle this old format, but since this is part of a fix that is going to
stable, I'm only removing this one for now.

Cc: <stable@vger.kernel.org> # 4.9.x
Signed-off-by: NOmar Sandoval <osandov@fb.com>
Reviewed-by: NDavid Sterba <dsterba@suse.com>
Signed-off-by: NChris Mason <clm@fb.com>

67ade058

20 1月, 2017 3 次提交

Btrfs: fix truncate down when no_holes feature is enabled · 91298eec

由 Liu Bo 提交于 12月 01, 2016

For such a file mapping,

[0-4k][hole][8k-12k]

In NO_HOLES mode, we don't have the [hole] extent any more.
Commit c1aa4575 ("Btrfs: fix shrinking truncate when the no_holes feature is enabled")
 fixed disk isize not being updated in NO_HOLES mode when data is not flushed.

However, even if data has been flushed, we can still have trouble
in updating disk isize since we updated disk isize to 'start' of
the last evicted extent.
Reviewed-by: NChris Mason <clm@fb.com>
Signed-off-by: NLiu Bo <bo.li.liu@oracle.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

91298eec

Btrfs: Fix deadlock between direct IO and fast fsync · 97dcdea0

由 Chandan Rajendra 提交于 12月 23, 2016

The following deadlock is seen when executing generic/113 test,

 ---------------------------------------------------------+----------------------------------------------------
  Direct I/O task                                           Fast fsync task
 ---------------------------------------------------------+----------------------------------------------------
  btrfs_direct_IO
    __blockdev_direct_IO
     do_blockdev_direct_IO
      do_direct_IO
       btrfs_get_blocks_direct
        while (blocks needs to written)
         get_more_blocks (first iteration)
          btrfs_get_blocks_direct
           btrfs_create_dio_extent
             down_read(&BTRFS_I(inode) >dio_sem)
             Create and add extent map and ordered extent
             up_read(&BTRFS_I(inode) >dio_sem)
                                                            btrfs_sync_file
                                                              btrfs_log_dentry_safe
                                                               btrfs_log_inode_parent
                                                                btrfs_log_inode
                                                                 btrfs_log_changed_extents
                                                                  down_write(&BTRFS_I(inode) >dio_sem)
                                                                   Collect new extent maps and ordered extents
                                                                    wait for ordered extent completion
         get_more_blocks (second iteration)
          btrfs_get_blocks_direct
           btrfs_create_dio_extent
             down_read(&BTRFS_I(inode) >dio_sem)
 --------------------------------------------------------------------------------------------------------------

In the above description, Btrfs direct I/O code path has not yet started
submitting bios for file range covered by the initial ordered
extent. Meanwhile, The fast fsync task obtains the write semaphore and
waits for I/O on the ordered extent to get completed. However, the
Direct I/O task is now blocked on obtaining the read semaphore.

To resolve the deadlock, this commit modifies the Direct I/O code path
to obtain the read semaphore before invoking
__blockdev_direct_IO(). The semaphore is then given up after
__blockdev_direct_IO() returns. This allows the Direct I/O code to
complete I/O on all the ordered extents it creates.
Signed-off-by: NChandan Rajendra <chandan@linux.vnet.ibm.com>
Reviewed-by: NFilipe Manana <fdmanana@suse.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

97dcdea0

btrfs: fix false enospc error when truncating heavily reflinked file · 47b5d646

由 Wang Xiaoguang 提交于 9月 07, 2016

Below test script can reveal this bug:
    dd if=/dev/zero of=fs.img bs=$((1024*1024)) count=100
    dev=$(losetup --show -f fs.img)
    mkdir -p /mnt/mntpoint
    mkfs.btrfs  -f $dev
    mount $dev /mnt/mntpoint
    cd /mnt/mntpoint

    echo "workdir is: /mnt/mntpoint"
    blocksize=$((128 * 1024))
    dd if=/dev/zero of=testfile bs=$blocksize count=1
    sync
    count=$((17*1024*1024*1024/blocksize))
    echo "file size is:" $((count*blocksize))
    for ((i = 1; i <= $count; i++)); do
        dst_offset=$((blocksize * i))
        xfs_io -f -c "reflink testfile 0 $dst_offset $blocksize"\
                testfile > /dev/null
    done
    sync
    truncate --size 0 testfile

The last truncate operation will fail for ENOSPC reason, but indeed
it should not fail.

In btrfs_truncate(), we use a temporary block_rsv to do truncate
operation. With every btrfs_truncate_inode_items() call, we migrate space
to this block_rsv, but forget to cleanup previous reservation, which
will make this block_rsv's reserved bytes keep growing, and this reserved
space will only be released in the end of btrfs_truncate(), this metadata
leak will impact other's metadata reservation. In this case, it's
"btrfs_start_transaction(root, 2);" fails for enospc error, which make
this truncate operation fail.

Call btrfs_block_rsv_release() to fix this bug.
Signed-off-by: NWang Xiaoguang <wangxg.fnst@cn.fujitsu.com>
Reviewed-by: NDavid Sterba <dsterba@suse.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

47b5d646

09 1月, 2017 1 次提交

Btrfs: add 'inode' for extent map tracepoint · 92a1bf76

由 Liu Bo 提交于 11月 17, 2016

'inode' is an important field for btrfs_get_extent, lets trace it.
Signed-off-by: NLiu Bo <bo.li.liu@oracle.com>
Reviewed-by: NDavid Sterba <dsterba@suse.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

92a1bf76

04 1月, 2017 1 次提交

Btrfs: adjust outstanding_extents counter properly when dio write is split · c2931667

由 Liu Bo 提交于 12月 22, 2016

Currently how btrfs dio deals with split dio write is not good
enough if dio write is split into several segments due to the
lack of contiguous space, a large dio write like 'dd bs=1G count=1'
can end up with incorrect outstanding_extents counter and endio
would complain loudly with an assertion.

This fixes the problem by compensating the outstanding_extents
counter in inode if a large dio write gets split.
Reported-by: NAnand Jain <anand.jain@oracle.com>
Tested-by: NAnand Jain <anand.jain@oracle.com>
Signed-off-by: NLiu Bo <bo.li.liu@oracle.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

c2931667

12 12月, 2016 1 次提交

Revert "Btrfs: adjust len of writes if following a preallocated extent" · 7c4c71ac

由 Chris Mason 提交于 12月 08, 2016

This is exposing an existing deadlock between fsync and AIO.  Until we
have the deadlock fixed, I'm pulling this one out.

This reverts commit a23eaa87.
Signed-off-by: NChris Mason <clm@fb.com>

7c4c71ac

09 12月, 2016 1 次提交

vfs: remove ".readlink = generic_readlink" assignments · dfeef688

由 Miklos Szeredi 提交于 12月 09, 2016

If .readlink == NULL implies generic_readlink().

Generated by:

to_del="\.readlink.*=.*generic_readlink"
for i in `git grep -l $to_del`; do sed -i "/$to_del"/d $i; done
Signed-off-by: NMiklos Szeredi <mszeredi@redhat.com>

dfeef688

06 12月, 2016 6 次提交

btrfs: remove root parameter from transaction commit/end routines · 3a45bb20

由 Jeff Mahoney 提交于 9月 09, 2016

Now we only use the root parameter to print the root objectid in
a tracepoint.  We can use the root parameter from the transaction
handle for that.  It's also used to join the transaction with
async commits, so we remove the comment that it's just for checking.
Signed-off-by: NJeff Mahoney <jeffm@suse.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

3a45bb20

btrfs: take an fs_info directly when the root is not used otherwise · 2ff7e61e

由 Jeff Mahoney 提交于 6月 22, 2016

There are loads of functions in btrfs that accept a root parameter
but only use it to obtain an fs_info pointer.  Let's convert those to
just accept an fs_info pointer directly.
Signed-off-by: NJeff Mahoney <jeffm@suse.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

2ff7e61e

btrfs: root->fs_info cleanup, add fs_info convenience variables · 0b246afa

由 Jeff Mahoney 提交于 6月 22, 2016

In routines where someptr->fs_info is referenced multiple times, we
introduce a convenience variable.  This makes the code considerably
more readable.
Signed-off-by: NJeff Mahoney <jeffm@suse.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

0b246afa

J
btrfs: root->fs_info cleanup, btrfs_calc_{trans,trunc}_metadata_size · 27965b6c
由 Jeff Mahoney 提交于 6月 16, 2016
```
Signed-off-by: NJeff Mahoney <jeffm@suse.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>
```
27965b6c

btrfs: pull node/sector/stripe sizes out of root and into fs_info · da17066c

由 Jeff Mahoney 提交于 6月 15, 2016

We track the node sizes per-root, but they never vary from the values
in the superblock.  This patch messes with the 80-column style a bit,
but subsequent patches to factor out root->fs_info into a convenience
variable fix it up again.
Signed-off-by: NJeff Mahoney <jeffm@suse.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

da17066c

btrfs: call functions that always use the same root with fs_info instead · 6bccf3ab

由 Jeff Mahoney 提交于 6月 21, 2016

There are many functions that are always called with the same root
argument.  Rather than passing the same root every time, we can
pass an fs_info pointer instead and have the function get the root
pointer itself.
Signed-off-by: NJeff Mahoney <jeffm@suse.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

6bccf3ab

30 11月, 2016 7 次提交

btrfs: don't access the bio directly in the direct I/O code · 6a2de22f

由 Christoph Hellwig 提交于 11月 25, 2016

Just use bio_for_each_segment_all to iterate over all segments.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Reviewed-by: NOmar Sandoval <osandov@fb.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

6a2de22f

btrfs: increment ctx->pos for every emitted or skipped dirent in readdir · d2fbb2b5

由 Jeff Mahoney 提交于 11月 05, 2016

If we process the last item in the leaf and hit an I/O error while
reading the next leaf, we return -EIO without having adjusted the
position.  Since we have emitted dirents, getdents() will return
the byte count to the user instead of the error.  Subsequent callers
will emit the last successful dirent again, and return -EIO again,
with the same result.  Callers loop forever.

Instead, if we always increment ctx->pos after emitting or skipping
the dirent, we'll be sure that we won't hit the same one again.  When
we go to process the next leaf, we won't have emitted any dirents
and the -EIO will be returned to the user properly.  We also don't
need to track if we've emitted a dirent already or if we've changed
the position yet.
Signed-off-by: NJeff Mahoney <jeffm@suse.com>
Reviewed-by: NDavid Sterba <dsterba@suse.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

d2fbb2b5

btrfs: remove old tree_root dirent processing in btrfs_real_readdir() · c2951f32

由 Jeff Mahoney 提交于 11月 21, 2016

Commit 3de4586c (Btrfs: Allow subvolumes and snapshots anywhere
in the directory tree) introduced the current system of placing
snapshots in the directory tree.  It also introduced the behavior of
creating the snapshot and then creating the directory entries for it.

We've kept this code around for compatibility reasons, but it turns
out that no file systems with the old tree_root based snapshots can
be mounted on newer (>= 2009) kernels anyway.  About a month after the
above commit, commit 2a7108ad (Btrfs: rev the disk format for the
inode compat and csum selection changes) landed, changing the superblock
magic number.

As a result, we know that we'll never encounter tree_root-based dirents
or have to deal with skipping our own snapshot dirents.  Since that
also means that we're now only iterating over DIR_INDEX items, which only
contain one directory entry per leaf item, we don't need to loop over
the leaf item contents anymore either.
Signed-off-by: NJeff Mahoney <jeffm@suse.com>
Reviewed-by: NDavid Sterba <dsterba@suse.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

c2951f32

btrfs: change btrfs_csum_final result param type to u8 · 0b5e3daf

由 Domagoj Tršan 提交于 10月 27, 2016

csum member of struct btrfs_super_block has array type of u8. It makes
sense that function btrfs_csum_final should be also declared to accept
u8 *. I changed the declaration of method void btrfs_csum_final(u32 crc,
char *result); to void btrfs_csum_final(u32 crc, u8 *result);
Signed-off-by: NDomagoj Tršan <domagoj.trsan@gmail.com>
[ changed cast to u8 at several call sites ]
Signed-off-by: NDavid Sterba <dsterba@suse.com>

0b5e3daf

Btrfs: adjust len of writes if following a preallocated extent · a23eaa87

由 Liu Bo 提交于 11月 04, 2016

If we have

|0--hole--4095||4096--preallocate--12287|

instead of using preallocated space, a 8K direct write will just
create a new 8K extent and it'll end up with

|0--new extent--8191||8192--preallocate--12287|

It's because we find a hole em and then go to create a new 8K
extent directly without adjusting @len.
Signed-off-by: NLiu Bo <bo.li.liu@oracle.com>
Reviewed-by: NChris Mason <clm@fb.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

a23eaa87

btrfs: remove constant parameter to memset_extent_buffer and rename it · b159fa28

由 David Sterba 提交于 11月 08, 2016

The only memset we do is to 0, so sink the parameter to the function and
simplify all calls. Rename the function to reflect the behaviour.
Signed-off-by: NDavid Sterba <dsterba@suse.com>

b159fa28

D
btrfs: remove unused headers, statfs.h · 926b9233
由 David Sterba 提交于 10月 05, 2016
```
Signed-off-by: NDavid Sterba <dsterba@suse.com>
```
926b9233