提交 · 3e04e7f10b68999e0d8321516ea19d9d5b044dee · openeuler / Kernel

21 2月, 2013 26 次提交

Btrfs: handle errors in compression submission path · 3e04e7f1

由 Josef Bacik 提交于 2月 06, 2013

I noticed we would deadlock if we aborted a transaction while doing
compressed io. This is because we don't unlock our pages if something goes
horribly wrong. To fix this we need to make sure that we call
extent_clear_unlock_delalloc in order to unlock all the pages. If we have
to cow in the async submission thread we need to make sure to unlock our
locked_page as the cow error path will not unlock the locked page as it
depends on the caller to unlock that page. With this patch we no longer
deadlock on the page lock when we have an aborted transaction. Thanks,
Signed-off-by: NJosef Bacik <jbacik@fusionio.com>

3e04e7f1

Btrfs: rework the overcommit logic to be based on the total size · 70afa399

由 Josef Bacik 提交于 2月 06, 2013

People have been complaining about random ENOSPC errors that will clear up
after a umount or just a given amount of time. Chris was able to reproduce
this with stress.sh and lots of processes and so was I. Basically the
overcommit stuff would really let us get out of hand, in my tests I saw up
to 30 gigs of outstanding reservations with only 2 gigs total of metadata
space. This usually worked out fine but with so much outstanding
reservation the flushing stuff short circuits to make sure we don't hang
forever flushing when we really need ENOSPC. Plus we allocate chunks in
order to alleviate the pressure, but this doesn't actually help us since we
only use the non-allocated area in our over commit logic.

So instead of basing overcommit on the amount of non-allocated space,
instead just do it based on how much total space we have, and then limit it
to the non-allocated space in case we are short on space to spill over into.
This allows us to have the same performance as well as no longer giving
random ENOSPC. Thanks,
Signed-off-by: NJosef Bacik <jbacik@fusionio.com>

70afa399

Btrfs: account for orphan inodes properly during cleanup · 925396ec

由 Josef Bacik 提交于 2月 01, 2013

Dave sent me a panic where we were doing the orphan cleanup and panic'ed
trying to release our reservation from the orphan block rsv. The reason for
this is because our orphan block rsv had been free'd out from underneath us
because the transaction commit found that there were no orphan inodes
according to its count and decided to free it. This is incorrect so make
sure we inc the orphan inodes count so the accounting is all done properly.
This would also cause the warning in the orphan commit code normally if you
had any orphans to cleanup as they would only decrement the orphan count so
you'd get a negative orphan count which could cause problems during runtime.
Thanks,
Signed-off-by: NJosef Bacik <jbacik@fusionio.com>

925396ec

Btrfs: unreserve space if our ordered extent fails to work · 0bec9ef5

由 Josef Bacik 提交于 1月 31, 2013

When a transaction aborts or there's an EIO on an ordered extent or any
error really we will not free up the space we reserved for this ordered
extent. This results in warnings from the block group cache cleanup in the
case of a transaction abort, or leaking space in the case of EIO on an
ordered extent. Fix this up by free'ing the reserved space if we have an
error at all trying to complete an ordered extent. Thanks,
Signed-off-by: NJosef Bacik <jbacik@fusionio.com>

0bec9ef5

Btrfs: fix how we discard outstanding ordered extents on abort · 779880ef

由 Josef Bacik 提交于 1月 31, 2013

When we abort we've been just free'ing up all the ordered extents and
hoping for the best. This results in lots of warnings from various places,
warnings from btrfs_destroy_inode() because it's ENOSPC accounting isn't
fixed. It will also screw up lots of pages who have been set private but
never get cleared because the ordered extents are never allowed to be
submitted. This patch fixes those warnings. Thanks,
Signed-off-by: NJosef Bacik <jbacik@fusionio.com>

779880ef

Btrfs: fix freeing delayed ref head while still holding its mutex · eb12db69

由 Josef Bacik 提交于 1月 30, 2013

I hit this error when reproducing a bug that would end in a transaction
abort. We take the delayed ref head's mutex to keep anybody from processing
it while we're destroying it, but we fail to drop the mutex before we carry
on and free the damned thing. Fix this by doing the remove logic for the
head ourselves and unlock the mutex, that way we can avoid use after free's
or hung tasks waiting on that mutex to come back so they know the delayed
ref completed. Thanks,
Signed-off-by: NJosef Bacik <jbacik@fusionio.com>

eb12db69

btrfs: ensure we don't overrun devices_info[] in __btrfs_alloc_chunk · 063d006f