Commits · 815a51c74ad14864d0a8fff5eea983819c18feae · openeuler / Kernel

26 May, 2012 1 commit

Btrfs: dummy extent buffers for tree mod log · 815a51c7

Authored by Jan Schmidt on May 16, 2012

The tree modification log needs two ways to create dummy extent buffers:
by allocating a fresh one (to rebuild an old root) and by cloning an
existing one (to make private rewind modifications to it).
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

05 May, 2012 1 commit

Btrfs: fix page leak when allocing extent buffers · 17de39ac

Authored by Josef Bacik on May 04, 2012

If we happen to alloc an extent buffer and then alloc a page and notice that
page is already attached to an extent buffer, we will only unlock it and
free our existing eb. Any pages currently attached to that eb will be
properly freed, but we don't do the page_cache_release() on the page where
we noticed the other extent buffer, which can cause us to leak pages and, I
hope, cause the weird issues we've been seeing in this area. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

19 Apr, 2012 3 commits

Btrfs: always store the mirror we read the eb from · 5cf1ab56

Authored by Josef Bacik on Apr 16, 2012

A user reported a panic where we were trying to fix a bad mirror but the
mirror number we were giving was 0, which is invalid. This is because we
don't do the transid verification until after the read, so as far as the
read code is concerned the read was a success. So instead store the mirror
we read from so that if there is some failure post read we know which mirror
to try next and which mirror needs to be fixed if we find a good copy of the
block. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

Btrfs: avoid possible use-after-free in clear_extent_bit() · cdc6a395

Authored by Li Zefan on Mar 12, 2012

clear_extent_bit()
{
    next_node = rb_next(&state->rb_node);
    ...
    clear_state_bit(state);  <-- this may free next_node
    if (next_node) {
        state = rb_entry(next_node);
        ...
    }
}

clear_state_bit() calls merge_state(), which may free the node following
the extent_state passed in, so clear_extent_bit() may end up
referencing freed memory.
Signed-off-by: Li Zefan <lizf@cn.fujitsu.com>
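Note: the hazard described in this message can be reproduced in a small userspace sketch. Everything below is hypothetical and is not the kernel code: `range_state` stands in for `extent_state`, a sorted linked list stands in for the rbtree, and the `toy_*` helpers are invented for illustration. The point is only that the successor must be looked up after the clearing call, because a pointer cached before the call may refer to a node that the merge has already freed. The actual patch may resolve this differently.

```c
#include <stdlib.h>

/* Hypothetical stand-in for extent_state: a byte range plus flag bits,
 * kept in a sorted singly linked list instead of an rbtree. */
struct range_state {
    unsigned long start, end;
    unsigned int bits;
    struct range_state *next;
};

static struct range_state *new_state(unsigned long start, unsigned long end,
                                     unsigned int bits,
                                     struct range_state *next)
{
    struct range_state *s = malloc(sizeof(*s));

    if (!s)
        return NULL;
    s->start = start;
    s->end = end;
    s->bits = bits;
    s->next = next;
    return s;
}

/* Merge s with its successors while they are adjacent and carry the same
 * bits. Every merged successor is freed, which is what makes a "next"
 * pointer cached by the caller dangerous. */
static void toy_merge_state(struct range_state *s)
{
    while (s->next && s->next->start == s->end + 1 &&
           s->next->bits == s->bits) {
        struct range_state *n = s->next;

        s->end = n->end;
        s->next = n->next;
        free(n);
    }
}

/* Clear bits on one state, then try to merge it with its neighbours. */
static void toy_clear_state_bit(struct range_state *s, unsigned int bits)
{
    s->bits &= ~bits;
    toy_merge_state(s);
}

/* Clear bits on every state from s onward. */
static void clear_range_bits(struct range_state *s, unsigned int bits)
{
    while (s) {
        toy_clear_state_bit(s, bits);
        /* Read the successor only now: the call above may have freed
         * the node that used to follow s. */
        s = s->next;
    }
}
```

With three adjacent states carrying bits 3, 1 and 2, clearing bit 2 makes the first state equal to the second, so the second is merged away and freed during the first iteration; the loop then continues safely with the third.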

Btrfs: retrurn void from clear_state_bit · 8e52acf7

Authored by Li Zefan on Mar 12, 2012

Currently it returns a set of bits that were cleared, but this return
value is not used at all.

Moreover it doesn't seem to be useful, because we may clear the bits
of a few extent_states, but only the cleared bits of the last one are
returned.
Signed-off-by: Li Zefan <lizf@cn.fujitsu.com>

13 Apr, 2012 2 commits

Btrfs: check return value of bio_alloc() properly · e627ee7b

Authored by Tsutomu Itoh on Apr 12, 2012

bio_alloc() has the possibility of returning NULL.
So, it is necessary to check the return value.
Signed-off-by: Tsutomu Itoh <t-itoh@jp.fujitsu.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

Btrfs: fix uninit variable in repair_eb_io_failure · d95603b2

Authored by Chris Mason on Apr 12, 2012

We'd have to be passing bogus extent buffers for this uninit variable to
actually be used, but set it to zero just in case.
Signed-off-by: Chris Mason <chris.mason@oracle.com>

27 Mar, 2012 8 commits

Btrfs: deal with read errors on extent buffers differently · ea466794

Authored by Josef Bacik on Mar 26, 2012

Since we need to read and write extent buffers in their entirety we can't use
the normal bio_readpage_error stuff since it only works on a per page basis. So
instead make it so that if we see an io error in endio we just mark the eb as
having an IO error and then in btree_read_extent_buffer_pages we will manually
try other mirrors and then overwrite the bad mirror if we find a good copy.
This works with larger than page size blocks. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>
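Note: the retry-and-repair flow described in this message can be sketched roughly as follows. This is a toy model under stated assumptions, not the btrfs implementation: mirrors are simulated by an array of good/bad flags, the function name is hypothetical, mirror numbers start at 1 (a neighbouring commit in this log states that mirror 0 is invalid), and only the first failed mirror is rewritten.

```c
#include <stddef.h>

/*
 * Toy model: copy_ok[m - 1] is non-zero when the copy on mirror m is good.
 * Try each mirror in turn; when a good copy is found, "repair" the first
 * mirror that failed by marking its copy good.
 * Returns the 1-based number of the mirror that was read successfully,
 * or 0 if every copy is bad.
 */
static int read_block_any_mirror(int *copy_ok, int num_mirrors)
{
    int failed_mirror = 0;  /* 0 means "no failure recorded yet" */
    int mirror;

    for (mirror = 1; mirror <= num_mirrors; mirror++) {
        if (copy_ok[mirror - 1]) {
            if (failed_mirror)
                copy_ok[failed_mirror - 1] = 1;  /* overwrite the bad copy */
            return mirror;
        }
        if (!failed_mirror)
            failed_mirror = mirror;
    }
    return 0;
}
```

The whole block is treated as one unit here, which mirrors the message's point that per-page error handling does not fit extent buffers that span several pages.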

Btrfs: loop waiting on writeback · a098d8e8

Authored by Chris Mason on Mar 21, 2012

lock_extent_buffer_for_io needs to loop around and make sure the
writeback bits are not set.
Signed-off-by: Chris Mason <chris.mason@oracle.com>

Btrfs: ensure an entire eb is written at once · 0b32f4bb

Authored by Josef Bacik on Mar 13, 2012

This patch simplifies how we track our extent buffers. Previously we could exit
writepages with only having written half of an extent buffer, which meant we had
to track the state of the pages and the state of the extent buffers differently.
Now we only read in entire extent buffers and write out entire extent buffers,
this allows us to simply set bits in our bflags to indicate the state of the eb
and we no longer have to do things like track uptodate with our iotree. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

Btrfs: introduce mark_extent_buffer_accessed · 5df4235e

Authored by Josef Bacik on Mar 15, 2012

Because an eb can have multiple pages we need to make sure that all pages within
the eb are marked as accessed, since releasepage can be called against any page
in the eb. This will keep us from possibly evicting hot eb's when we're doing
larger than pagesize eb's. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

Btrfs: introduce free_extent_buffer_stale · 3083ee2e

Authored by Josef Bacik on Mar 09, 2012

Because btrfs cow's we can end up with extent buffers that are no longer
necessary just sitting around in memory. So instead of evicting these pages, we
could end up evicting things we actually care about. Thus we have
free_extent_buffer_stale for use when we are freeing tree blocks. This will
make it so that the ref for the eb being in the radix tree is dropped as soon as
possible and then is freed when the refcount hits 0 instead of waiting to be
released by releasepage. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

Btrfs: only use the existing eb if it's count isn't 0 · 115391d2

Authored by Josef Bacik on Mar 09, 2012

We can run into a problem where we find an eb for our existing page already on
the radix tree but it has a ref count of 0. It hasn't been removed by RCU
yet, so this can cause issues where we will use the EB after free. So do
atomic_inc_not_zero on the exists->refs and if it is zero just do
synchronize_rcu() and try again. We won't have to worry about new allocators
coming in since they will block on the page lock at this point. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>
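Note: the "take a reference only if the count is not already zero" idea can be shown with C11 atomics. This is a userspace sketch, not the kernel's `atomic_inc_not_zero()`; the helper name `ref_get_unless_zero` is hypothetical. A caller that gets `false` back knows the object is already on its way to being freed and must not use it.

```c
#include <stdatomic.h>
#include <stdbool.h>

/*
 * Increment *refs only if it is currently non-zero.
 * Returns true if a reference was taken, false if the count was zero.
 */
static bool ref_get_unless_zero(atomic_int *refs)
{
    int old = atomic_load(refs);

    while (old != 0) {
        /* On failure the compare-exchange stores the current value in
         * 'old', so the loop condition re-checks for zero. */
        if (atomic_compare_exchange_weak(refs, &old, old + 1))
            return true;
    }
    return false;
}
```

A plain increment would resurrect an object whose count had already dropped to zero; the compare-exchange loop closes that window.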

Btrfs: set page->private to the eb · 4f2de97a

Authored by Josef Bacik on Mar 07, 2012

We spend a lot of time looking up extent buffers from pages when we could just
store the pointer to the eb the page is associated with in page->private. This
patch does just that, and it makes things a little simpler and reduces a bit of
CPU overhead involved with doing metadata IO. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

Btrfs: allow metadata blocks larger than the page size · 727011e0

Authored by Chris Mason on Aug 06, 2010

A few years ago the btrfs code to support blocks larger than
the page size was disabled to fix a few corner cases in the
page cache handling.  This fixes the code to properly support
large metadata blocks again.

Since current kernels will crash early and often with larger
metadata blocks, this adds an incompat bit so that older kernels
can't mount it.

This also does away with different blocksizes for nodes and leaves.
You get a single block size for all tree blocks.
Signed-off-by: Chris Mason <chris.mason@oracle.com>

22 Mar, 2012 8 commits

btrfs: replace many BUG_ONs with proper error handling · 79787eaa

Authored by Jeff Mahoney on Mar 12, 2012

 btrfs currently handles most errors with BUG_ON. This patch is a work-in-
 progress but aims to handle most errors other than internal logic
 errors and ENOMEM more gracefully.

 This iteration prevents most crashes but can run into lockups with
 the page lock on occasion when the timing "works out."
Signed-off-by: Jeff Mahoney <jeffm@suse.com>

btrfs: split extent_state ops · 3fbe5c02

Authored by Jeff Mahoney on Mar 01, 2012

set_extent_bit can do exclusive locking but only when called by lock_extent*.

Drop the exclusive bits argument except when called by lock_extent.
Signed-off-by: Jeff Mahoney <jeffm@suse.com>

btrfs: drop gfp_t from lock_extent · d0082371

Authored by Jeff Mahoney on Mar 01, 2012

 lock_extent and unlock_extent are always called with GFP_NOFS, drop the
 argument and use GFP_NOFS consistently.
Signed-off-by: Jeff Mahoney <jeffm@suse.com>

btrfs: return void in functions without error conditions · 143bede5

Authored by Jeff Mahoney on Mar 01, 2012

Signed-off-by: Jeff Mahoney <jeffm@suse.com>

btrfs: ->submit_bio_hook error push-up · 355808c2

Authored by Jeff Mahoney on Oct 03, 2011

This pushes failures from the submit_bio_hook callbacks,
btrfs_submit_bio_hook and btree_submit_bio_hook into the callers, including
callers of submit_one_bio where it catches the failures with BUG_ON.

It also pushes up through the ->readpage_io_failed_hook to
end_bio_extent_writepage where the error is already caught with BUG_ON.
Signed-off-by: Jeff Mahoney <jeffm@suse.com>

btrfs: Factor out tree->ops->merge_bio_hook call · 3444a972

Authored by Jeff Mahoney on Oct 03, 2011

In submit_extent_page, there's a visually noisy if statement that, in
the midst of other conditions, does the tree dependency for tree->ops
and tree->ops->merge_bio_hook before calling it, and then another
condition afterwards. If an error is returned from merge_bio_hook,
there's no way to catch it. It's considered a routine "1" return
value instead of a failure.

This patch factors out the dependency check into a new local merge_bio
routine and BUG's on an error. The if statement is less noisy as a side-
effect.
Signed-off-by: Jeff Mahoney <jeffm@suse.com>

btrfs: Remove set bits return from clear_extent_bit · 6763af84

Authored by Jeff Mahoney on Mar 01, 2012

There is only one caller of clear_extent_bit that checks the return value
and it only checks if it's negative. Since there are no users of the
returned bits functionality of clear_extent_bit, stop returning it
and avoid complicating error handling.
Signed-off-by: Jeff Mahoney <jeffm@suse.com>

btrfs: Catch locking failures in {set,clear,convert}_extent_bit · c2d904e0

Authored by Jeff Mahoney on Oct 03, 2011

The *_state functions can only return 0 or -EEXIST. This patch addresses
the cases where those functions returning -EEXIST represent a locking
failure. It handles them by panicking with an appropriate error message.
Signed-off-by: Jeff Mahoney <jeffm@suse.com>

23 Feb, 2012 1 commit

Btrfs: clear the extent uptodate bits during parent transid failures · 50653190

Authored by Chris Mason on Feb 22, 2012

If btrfs reads a block and finds a parent transid mismatch, it clears
the uptodate flags on the extent buffer, and the pages inside it.  But
we only clear the uptodate bits in the state tree if the block straddles
more than one page.

This is from an old optimization to reduce contention on the extent
state tree.  But it is buggy because the code that retries a read from
a different copy of the block is going to find the uptodate state bits
set and skip the IO.

The end result of the bug is that we'll never actually read the good
copy (if there is one).

The fix here is to always clear the uptodate state bits, which is safe
because this code is only called when the parent transid fails.
Signed-off-by: Chris Mason <chris.mason@oracle.com>

21 Feb, 2012 1 commit

Btrfs: be less strict on finding next node in clear_extent_bit · 692e5759

Authored by Liu Bo on Feb 16, 2012

In clear_extent_bit, it is enough that next node is adjacent in tree level.
Signed-off-by: Liu Bo <liubo2009@cn.fujitsu.com>

17 Feb, 2012 4 commits

Btrfs: kick out redundant stuff in convert_extent_bit · 9d47c767

Authored by Liu Bo on Feb 16, 2012

clear_state_bit will do merge_state for us, so kick out the redundant one.
Signed-off-by: Liu Bo <liubo2009@cn.fujitsu.com>

Btrfs: skip states when they does not contain bits to clear · 0449314a

Authored by Liu Bo on Feb 16, 2012

Clearing a range's bits is different from setting them, since we don't
need to touch states that do not contain the bits we want to clear.
Signed-off-by: Liu Bo <liubo2009@cn.fujitsu.com>

Btrfs: check return value of lookup_extent_mapping() correctly · 285190d9

Authored by Tsutomu Itoh on Feb 16, 2012

This patch corrects error checking of lookup_extent_mapping().
Signed-off-by: Tsutomu Itoh <t-itoh@jp.fujitsu.com>

Btrfs: fix return value check of extent_io_ops · 013bd4c3

Authored by Tsutomu Itoh on Feb 16, 2012

This patch adds the check on the return value of extent_io_ops.
Signed-off-by: Tsutomu Itoh <t-itoh@jp.fujitsu.com>

15 Feb, 2012 1 commit

btrfs: delalloc for page dirtied out-of-band in fixup worker · 87826df0

Authored by Jeff Mahoney on Feb 15, 2012

 We encountered an issue that was easily observable on s/390 systems but
 could really happen anywhere. The timing just seemed to hit reliably
 on s/390 with limited memory.

 The gist is that when an unexpected set_page_dirty() happened, we'd
 run into the BUG() in btrfs_writepage_fixup_worker since it wasn't
 properly set up for delalloc.

 This patch does the following:
 - Performs the missing delalloc in the fixup worker
 - Allow the start hook to return -EBUSY which informs __extent_writepage
   that it should mark the page skipped and not to redirty it. This is
   required since the fixup worker can fail with -ENOSPC and the page
   will have already been redirtied. That causes an Oops in
   drop_outstanding_extents later. Retrying the fixup worker could
   lead to an infinite loop. Deferring the page redirty also saves us
   some cycles since the page would be stuck in a resubmit-redirty loop
   until the fixup worker completes. It's not harmful, just wasteful.
 - If the fixup worker fails, we mark the page and mapping as errored,
   and end the writeback, similar to what we would do had the page
   actually been submitted to writeback.
Signed-off-by: Jeff Mahoney <jeffm@suse.com>

27 Jan, 2012 1 commit

Btrfs: Check for NULL page in extent_range_uptodate · 8bedd51b

Authored by Mitch Harder on Jan 26, 2012

A user has encountered a NULL pointer kernel oops in btrfs when
encountering media errors.  The problem has been identified
as an unhandled NULL pointer returned from find_get_page().
This modification simply checks for a NULL page, and returns
with an error if found (the extent_range_uptodate() function
returns 1 on errors).

After testing this patch, the user reported that the error with
the NULL pointer oops was solved.  However, there is still a
remaining problem with a thread becoming stuck in
wait_on_page_locked(page) in the read_extent_buffer_pages(...)
function in extent_io.c

       for (i = start_i; i < num_pages; i++) {
               page = extent_buffer_page(eb, i);
               wait_on_page_locked(page);
               if (!PageUptodate(page))
                       ret = -EIO;
       }

This patch leaves the issue with the locked page yet to be resolved.
Signed-off-by: Mitch Harder <mitch.harder@sabayonlinux.org>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

04 Jan, 2012 1 commit

Btrfs: add nested locking mode for paths · 5b25f70f

Authored by Arne Jansen on Sep 13, 2011

This patch adds the possibility to read-lock an extent even if it is already
write-locked from the same thread. btrfs_find_all_roots() needs this
capability.
Signed-off-by: Arne Jansen <sensille@gmx.net>
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

22 Dec, 2011 1 commit

Btrfs: integrate integrity check module into btrfs · 21adbd5c

Authored by Stefan Behrens on Nov 09, 2011

This is the last part of the patch series. It modifies the btrfs
code to use the integrity check module if configured to do so
with the define BTRFS_FS_CHECK_INTEGRITY. If this define is not set,
the only effective change is that code is added that handles the
mount option to activate the integrity check. If the mount option is
set and the define BTRFS_FS_CHECK_INTEGRITY is not set, that code
complains in the log and the mount fails with EINVAL.

Add the mount option to activate the usage of the integrity check
code.
Add invocation of btrfs integrity check code init and cleanup
function on mount and umount, respectively.
Add hook to call btrfs integrity check code version of
submit_bh/submit_bio.
Signed-off-by: Stefan Behrens <sbehrens@giantdisaster.de>

08 Dec, 2011 1 commit

Btrfs: drop spin lock when memory alloc fails · 1cf4ffdb

Authored by Liu Bo on Dec 07, 2011

Drop the spin lock in convert_extent_bit() when memory allocation fails;
otherwise we deadlock.
Signed-off-by: Liu Bo <liubo2009@cn.fujitsu.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

01 Dec, 2011 1 commit

Btrfs: fix meta data raid-repair merge problem · f4a8e656

Authored by Jan Schmidt on Dec 01, 2011

Commit 4a54c8c1 introduced raid-repair, killing the individual
readpage_io_failed_hook entries from inode.c and disk-io.c. Commit
4bb31e92 introduced new readahead code, adding a readpage_io_failed_hook to
disk-io.c.

The raid-repair commit had logic to disable raid-repair, if
readpage_io_failed_hook is set. Thus, the readahead commit effectively
disabled raid-repair for meta data.

This commit changes the logic to always attempt raid-repair when needed and
call the readpage_io_failed_hook in case raid-repair fails. This is much
more straight forward and should have been like that from the beginning.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>
Reported-by: Stefan Behrens <sbehrens@giantdisaster.de>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

20 Nov, 2011 3 commits

Btrfs: sectorsize align offsets in fiemap · 4d479cf0

Authored by Josef Bacik on Nov 17, 2011

We've been hitting BUG()'s in btrfs_cont_expand and btrfs_fallocate and anywhere
else that calls btrfs_get_extent while running xfstests 13 in a loop. This is
because fiemap is calling btrfs_get_extent with non-sectorsize aligned offsets,
which will end up adding mappings that are not sectorsize aligned, which will
cause problems in some cases for subsequent calls to btrfs_get_extent for
similar areas that are sectorsize aligned. With this patch I ran xfstests 13 in
a loop for a couple of hours and didn't hit the problem that I could previously
hit in at most 20 minutes. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>
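Note: aligning a byte range to the sector size, as this message calls for, amounts to rounding the start down and the end up. The sketch below is a generic illustration and not the code from the patch; `struct byte_range` and the helper names are hypothetical, and `sectorsize` is assumed to be a power of two.

```c
#include <stdint.h>

/* Hypothetical container for an aligned range. */
struct byte_range {
    uint64_t start;
    uint64_t len;
};

/* Round x down to a multiple of a (a must be a power of two). */
static uint64_t align_down(uint64_t x, uint64_t a)
{
    return x & ~(a - 1);
}

/* Round x up to a multiple of a (a must be a power of two). */
static uint64_t align_up(uint64_t x, uint64_t a)
{
    return (x + a - 1) & ~(a - 1);
}

/* Widen [start, start + len) so both ends fall on sector boundaries. */
static struct byte_range align_range(uint64_t start, uint64_t len,
                                     uint64_t sectorsize)
{
    struct byte_range r;
    uint64_t end = align_up(start + len, sectorsize);

    r.start = align_down(start, sectorsize);
    r.len = end - r.start;
    return r;
}
```

For example, with a 4096-byte sector a request for 100 bytes at offset 5000 becomes the single sector starting at 4096, while a request that is already aligned is returned unchanged.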

btrfs: mirror_num should be int, not u64 · 32240a91

Authored by Jan Schmidt on Nov 20, 2011

My previous patch introduced some u64 for failed_mirror variables, this one
makes it consistent again.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

btrfs: Fix up 32/64-bit compatibility for new ioctls · 745c4d8e

Authored by Jeff Mahoney on Nov 20, 2011

This patch casts to unsigned long before casting to a pointer and fixes
the following warnings:
fs/btrfs/extent_io.c:2289:20: warning: cast from pointer to integer of different size [-Wpointer-to-int-cast]
fs/btrfs/ioctl.c:2933:37: warning: cast from pointer to integer of different size [-Wpointer-to-int-cast]
fs/btrfs/ioctl.c:2937:21: warning: cast to pointer from integer of different size [-Wint-to-pointer-cast]
fs/btrfs/ioctl.c:3020:21: warning: cast to pointer from integer of different size [-Wint-to-pointer-cast]
fs/btrfs/scrub.c:275:4: warning: cast to pointer from integer of different size [-Wint-to-pointer-cast]
fs/btrfs/backref.c:686:27: warning: cast from pointer to integer of different size [-Wpointer-to-int-cast]
Signed-off-by: Jeff Mahoney <jeffm@suse.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>
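Note: the warnings quoted in this message appear when a pointer is cast directly to or from a 64-bit integer on a platform where pointers are 32 bits wide. Casting through a pointer-sized integer first avoids them. The kernel patch uses `unsigned long` for that intermediate step; the userspace sketch below uses the portable `uintptr_t` instead, and both helper names are hypothetical.

```c
#include <stdint.h>

/* Store a pointer in a fixed-width 64-bit field, e.g. an ioctl argument.
 * The intermediate uintptr_t cast keeps the conversion size-matched. */
static uint64_t ptr_to_u64(const void *p)
{
    return (uint64_t)(uintptr_t)p;
}

/* Recover the pointer from the 64-bit field. */
static void *u64_to_ptr(uint64_t v)
{
    return (void *)(uintptr_t)v;
}
```

The round trip preserves the pointer on both 32-bit and 64-bit builds, because the value is first narrowed or widened as an integer rather than as a pointer.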

06 Nov, 2011 2 commits

Btrfs: ClearPageError during writepage and clean_tree_block · bf0da8c1

Authored by Chris Mason on Nov 04, 2011

Failure testing was tripping up over stale PageError bits in
metadata pages.  If we have an io error on a block, and later on
end up reusing it, nobody ever clears PageError on those pages.

During commit, we'll find PageError and think we had trouble writing
the block, which will lead to aborts and other problems.

This changes clean_tree_block and the btrfs writepage code to
clear the PageError bit.  In both cases we're either completely
done with the page or the page has good stuff and the error bit
is no longer valid.
Signed-off-by: Chris Mason <chris.mason@oracle.com>

Btrfs: make sure to flush queued bios if write_cache_pages waits · 01d658f2

Authored by Chris Mason on Nov 01, 2011

write_cache_pages tries to build up a large bio to stuff down the pipe.
But if it needs to wait for a page lock, it needs to make sure and send
down any pending writes so we don't deadlock with anyone who has the
page lock and is waiting for writeback of things inside the bio.

Dave Sterba triggered this as a deadlock between the autodefrag code and
the extent write_cache_pages.
Signed-off-by: Chris Mason <chris.mason@oracle.com>

openeuler / Kernel · synced successfully 12 months ago
