Commits · 8ea05e3a4262b9e6871c349fa3486bcfc72ffd1a · OpenHarmony / kernel_linux

25 Jul, 2012 1 commit

Btrfs: add helper for tree enumeration · e6793769

Committed by Arne Jansen on Sep 13, 2011

Often no exact match is wanted but just the next lower or
higher item. There's a lot of duplicated code throughout
btrfs to deal with the corner cases. This patch adds a
helper function that can facilitate searching.
Signed-off-by: Arne Jansen <sensille@gmx.net>
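
As a rough illustration of the "next lower or higher item" idea, here is a minimal sketch over a plain sorted array. It is not the btrfs helper itself: `find_item`, `enum find_mode`, and the mode names are hypothetical and exist only for this example.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical search modes; the names are illustrative only. */
enum find_mode { FIND_EXACT, FIND_LOWER, FIND_HIGHER };

/*
 * Search a sorted array of keys and return the index of the result, or -1.
 * An exact hit satisfies every mode. Without one, FIND_LOWER returns the
 * largest key below the target and FIND_HIGHER the smallest key above it.
 */
static int find_item(const unsigned long *keys, size_t n,
		     unsigned long target, enum find_mode mode)
{
	size_t lo = 0, hi = n;

	assert(keys != NULL || n == 0);

	/* Lower bound: first index whose key is >= target. */
	while (lo < hi) {
		size_t mid = lo + (hi - lo) / 2;

		if (keys[mid] < target)
			lo = mid + 1;
		else
			hi = mid;
	}

	if (lo < n && keys[lo] == target)
		return (int)lo;
	if (mode == FIND_HIGHER)
		return lo < n ? (int)lo : -1;
	if (mode == FIND_LOWER)
		return lo > 0 ? (int)lo - 1 : -1;
	return -1;	/* FIND_EXACT without a match */
}
```

Handling the "ran off either end" cases in one place is what spares callers from repeating the corner-case checks.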

27 Jun, 2012 4 commits

Btrfs: resolve tree mod log locking issue in btrfs_next_leaf · d42244a0

Committed by Jan Schmidt on Jun 22, 2012

With the tree mod log, we may end up with two roots (the current root and a
rewinded version of it) both pointing to two leaves, l1 and l2, of which l2
had already been cow-ed in the current transaction. If we don't rewind any
tree blocks, we cannot have two roots both pointing to an already cowed tree
block.

Now there is btrfs_next_leaf, which has a leaf locked and wants a lock on
the next (right) leaf. And there is push_leaf_left, which has a (cowed!)
leaf locked and wants a lock on the previous (left) leaf.

In order to solve this deadlock situation, we use try_lock in
btrfs_next_leaf (only in case it's called with a tree mod log time_seq
parameter) and if we fail to get a lock on the next leaf, we give up our lock
on the current leaf and retry from the very beginning.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>
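
The try-lock-and-back-off pattern described above can be sketched in userspace C. This is only an illustration using C11 `atomic_flag` as a stand-in lock; `struct toy_leaf`, `toy_trylock`, `toy_unlock`, and `toy_lock_next` are hypothetical names, not btrfs functions.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

/* Toy leaf with a lock bit; not the btrfs structure. */
struct toy_leaf {
	atomic_flag locked;
	struct toy_leaf *next;
};

static bool toy_trylock(struct toy_leaf *l)
{
	/* test_and_set returns the previous value: false means we got it. */
	return !atomic_flag_test_and_set(&l->locked);
}

static void toy_unlock(struct toy_leaf *l)
{
	atomic_flag_clear(&l->locked);
}

/*
 * Called with cur locked. Returns 0 with cur and cur->next both locked,
 * or -1 after dropping cur so the caller can restart its search.
 */
static int toy_lock_next(struct toy_leaf *cur)
{
	assert(cur != NULL && cur->next != NULL);

	if (toy_trylock(cur->next))
		return 0;

	/* Back off instead of waiting while still holding cur. */
	toy_unlock(cur);
	return -1;
}
```

Because the caller never waits for the right-hand leaf while holding the left-hand one, a second party that locks in the opposite order cannot end up in a circular wait with it.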

Btrfs: fix tree mod log rewind of ADD operations · 19956c7e

Committed by Jan Schmidt on Jun 22, 2012

When a MOD_LOG_KEY_ADD operation is rewinded, we remove the key from the
tree block. If it's not the last key, removal involves a move operation.
This move operation was explicitly done before this commit.

However, at insertion time, there's a move operation before the actual
addition to make room for the new key, which is recorded in the tree mod
log as well. This means, we must drop the move operation when rewinding the
add operation, because the next operation we'll be rewinding will be the
corresponding MOD_LOG_MOVE_KEYS operation.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>
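
To make the reasoning concrete, the following toy model logs a move followed by an add on insertion, then rewinds the log in reverse. It is a simplified sketch on a flat key array; `struct toy_node`, `toy_insert`, `toy_rewind`, and the `OP_*` names are hypothetical and do not mirror the kernel data structures.

```c
#include <assert.h>
#include <string.h>

#define MAX_KEYS 8
#define MAX_LOG  16

enum op_type { OP_MOVE_KEYS, OP_KEY_ADD };

/* One logged operation: slot is the source slot, dst the move target. */
struct log_op {
	enum op_type type;
	int slot;
	int dst;
	int nr;
};

struct toy_node {
	unsigned long keys[MAX_KEYS];
	int nritems;
	struct log_op log[MAX_LOG];
	int nlog;
};

/* Insert key at slot, logging the room-making move and then the add. */
static void toy_insert(struct toy_node *n, int slot, unsigned long key)
{
	assert(n->nritems < MAX_KEYS && n->nlog + 2 <= MAX_LOG);
	assert(slot >= 0 && slot <= n->nritems);

	if (slot < n->nritems) {
		int nr = n->nritems - slot;

		memmove(&n->keys[slot + 1], &n->keys[slot],
			(size_t)nr * sizeof(n->keys[0]));
		n->log[n->nlog++] = (struct log_op){ OP_MOVE_KEYS, slot, slot + 1, nr };
	}
	n->keys[slot] = key;
	n->nritems++;
	n->log[n->nlog++] = (struct log_op){ OP_KEY_ADD, slot, 0, 0 };
}

/* Undo all logged operations, newest first. */
static void toy_rewind(struct toy_node *n)
{
	while (n->nlog > 0) {
		struct log_op *op = &n->log[--n->nlog];

		switch (op->type) {
		case OP_KEY_ADD:
			/* No move here: the logged move is undone next. */
			n->nritems--;
			break;
		case OP_MOVE_KEYS:
			memmove(&n->keys[op->slot], &n->keys[op->dst],
				(size_t)op->nr * sizeof(n->keys[0]));
			break;
		}
	}
}
```

If the add rewind also shifted keys, the subsequent move rewind would shift them a second time and corrupt the node, which is the double move the commit removes.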

Btrfs: always put insert_ptr modifications into the tree mod log · c3e06965

Committed by Jan Schmidt on Jun 21, 2012

Several callers of insert_ptr set the tree_mod_log parameter to 0 to avoid
addition to the tree mod log. In fact, we need all of those operations. This
commit simply removes the additional parameter and makes addition to the
tree mod log unconditional.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

Btrfs: fix tree mod log for root replacements at leaf level · 28da9fb4

Committed by Jan Schmidt on Jun 21, 2012

For the tree mod log, we don't log any operations at leaf level. If the root
is at the leaf level (i.e. the tree consists only of the root), then
__tree_mod_log_oldest_root will find a ROOT_REPLACE operation in the log
(because we always log that one no matter which level), but no other
operations.

With this patch __tree_mod_log_oldest_root exits cleanly instead of
BUGging in this situation. get_old_root checks if it's really a root at leaf
level in case we don't have any operations and WARNs if this assumption
breaks.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

16 Jun, 2012 1 commit

Btrfs: init old_generation in get_old_root · 4325edd0

Committed by Chris Mason on Jun 15, 2012

gcc was giving an uninit variable warning here.  Strictly
speaking we don't need to init it, but this will make things
much less error prone.
Signed-off-by: Chris Mason <chris.mason@fusionio.com>

15 Jun, 2012 4 commits

Btrfs: fix race in tree mod log addition · 3310c36e

Committed by Jan Schmidt on Jun 11, 2012

When adding to the tree modification log, we grab two locks at different
stages. We must not drop the outer lock until we're done with the section
protected by the inner lock. This moves the unlock call for the outer lock
to the appropriate position.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

Btrfs: add btrfs_next_old_leaf · 3d7806ec

Committed by Jan Schmidt on Jun 11, 2012

To make sense of the tree mod log, the backref walker not only needs
btrfs_search_old_slot, but it also called btrfs_next_leaf, which in turn was
calling btrfs_search_slot. This obviously didn't give the correct result.

This commit adds btrfs_next_old_leaf, a drop-in replacement for
btrfs_next_leaf with a time_seq parameter. If it is zero, it behaves exactly
like btrfs_next_leaf. If it is non-zero, it will use btrfs_search_old_slot
with this time_seq parameter.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

Btrfs: fix return value for __tree_mod_log_oldest_root · a95236d9

Committed by Jan Schmidt on Jun 05, 2012

In __tree_mod_log_oldest_root() we must return the found operation even if
it's not a ROOT_REPLACE operation. Otherwise, the caller assumes that there
are no operations to be rewinded and returns immediately.

The code in the caller is modified to improve readability.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

Btrfs: use btrfs_read_lock_root_node in get_old_root · 8ba97a15

Committed by Jan Schmidt on Jun 04, 2012

get_old_root could race with root node updates because we weren't locking
the node early enough. Use btrfs_read_lock_root_node to grab the root locked
in the very beginning and release the lock as soon as possible (just like
btrfs_search_slot does).
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

04 Jun, 2012 1 commit

Btrfs: remove call to btrfs_header_nritems with no effect · 4d5a0565

Committed by Jan Schmidt on Apr 30, 2012

This is a leftover from cleanup patch 559af821. Before the cleanup,
btrfs_header_nritems was called inside an if condition. As it has no side
effects we need to preserve here, it should simply be dropped.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

01 Jun, 2012 4 commits

Btrfs: fix tree mod log rewinded level and rewinding of moved keys · c3193108

Committed by Jan Schmidt on May 31, 2012

When we rewind REMOVE_WHILE_FREEING operations, there's code that allocates
a fresh buffer instead of cloning the old one. Setting that buffer's level
correctly was missing in this case.

When rewinding a MOVE_KEYS operation, btrfs_node_key_ptr_offset(slot) was
missing for memmove_extent_buffer()'s arguments.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

Btrfs: fix tree mod log del_ptr · f395694c

Committed by Jan Schmidt on May 31, 2012

Logging for del_ptr when we're not deleting the last pointer was wrong. This
fixes both the duplicate log entries and the log sequence.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

Btrfs: add tree_mod_dont_log helper · e9b7fd4d

Committed by Jan Schmidt on May 31, 2012

Replace duplicate code with a small inline helper function.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

Btrfs: add missing spin_lock for insertion into tree mod log · 926dd8a6

Committed by Jan Schmidt on May 31, 2012

tree_mod_alloc calls __get_tree_mod_seq and must acquire a spinlock before
doing so.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

30 May, 2012 5 commits

Btrfs: return value of btrfs_read_buffer is checked correctly · 018642a1

Committed by Tsutomu Itoh on May 29, 2012

btrfs_read_buffer() can return an error.
Therefore, add code that checks the return value of btrfs_read_buffer().
Signed-off-by: Tsutomu Itoh <t-itoh@jp.fujitsu.com>

Btrfs: add btrfs_search_old_slot · 5d9e75c4

Committed by Jan Schmidt on May 16, 2012

The tree modification log together with the current state of the tree gives
a consistent, old version of the tree. btrfs_search_old_slot is used to
search through this old version and return old (dummy!) extent buffers.
Naturally, this function cannot do any tree modifications.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

Btrfs: add del_ptr and insert_ptr modifications to the tree mod log · f3ea38da

Committed by Jan Schmidt on May 26, 2012

Record all relevant modifications to block pointers in the tree mod log so
that we can rewind them later on for backref walking.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

Btrfs: put all block modifications into the tree mod log · f230475e

Committed by Jan Schmidt on May 26, 2012

When running functions that can make changes to the internal trees
(e.g. btrfs_search_slot), we check if somebody may be interested in the
block we're currently modifying. If so, we record our modification to be
able to rewind it later on.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

Btrfs: add tree modification log functions · bd989ba3

Committed by Jan Schmidt on May 16, 2012

The tree mod log will log modifications made to fs-tree nodes. Most
modifications are done by autobalance of the tree. Such changes are recorded
as long as a block entry exists. When released, the log is cleaned.

With the tree modification log, it's possible to reconstruct a consistent
old state of the tree. This is required to do backref walking on a busy
file system.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

26 May, 2012 1 commit

Btrfs: don't set for_cow parameter for tree block functions · 5581a51a

Committed by Jan Schmidt on May 16, 2012

Three callers of btrfs_free_tree_block or btrfs_alloc_tree_block passed
parameter for_cow = 1. In fact, these two functions should never mark
their tree modification operations as for_cow, because they can change
the number of blocks referenced by a tree.

Hence, we remove the extra for_cow parameter from these functions and
make them pass a zero down.
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

11 May, 2012 1 commit

btrfs/ctree.c: remove the unnecessary 'return -1;' at the end of bin_search · f775738f

Committed by Wang Sheng-Hui on Mar 30, 2012

The code path should not reach there. Remove it.
Signed-off-by: Wang Sheng-Hui <shhuiw@gmail.com>

06 May, 2012 1 commit

Btrfs: avoid sleeping in verify_parent_transid while atomic · b9fab919

Committed by Chris Mason on May 06, 2012

verify_parent_transid needs to lock the extent range to make
sure no IO is underway, and so it can safely clear the
uptodate bits if our checks fail.

But, a few callers are using it with spinlocks held.  Most
of the time, the generation numbers are going to match, and
we don't want to switch to a blocking lock just for the error
case.  This adds an atomic flag to verify_parent_transid,
and changes it to return EAGAIN if it needs to block to
properly verify things.
Signed-off-by: Chris Mason <chris.mason@oracle.com>

05 May, 2012 1 commit

Btrfs: Add properly locking around add_root_to_dirty_list · e5846fc6

Committed by Chris Mason on May 03, 2012

add_root_to_dirty_list happens once at the very beginning of the
transaction, but it is still racy.
Signed-off-by: Chris Mason <chris.mason@oracle.com>

27 Mar, 2012 3 commits

Btrfs: adjust the write_lock_level as we unlock · f7c79f30

Committed by Chris Mason on Mar 19, 2012

btrfs_search_slot sometimes needs write locks on high levels of
the tree.  It remembers the highest level that needs a write lock
and will use that for all future searches through the tree in a given
call.

But, very often we'll just cow the top level or the level below and we
won't really need write locks on the root again after that.  This patch
changes things to adjust the write lock requirement as it unlocks
levels.
Signed-off-by: Chris Mason <chris.mason@oracle.com>

Btrfs: add the ability to cache a pointer into the eb · cfed81a0

Committed by Chris Mason on Mar 03, 2012

This cuts down on the CPU time used by map_private_extent_buffer.
Signed-off-by: Chris Mason <chris.mason@oracle.com>

Btrfs: introduce free_extent_buffer_stale · 3083ee2e

Committed by Josef Bacik on Mar 09, 2012

Because btrfs cow's we can end up with extent buffers that are no longer
necessary just sitting around in memory. So instead of evicting these pages, we
could end up evicting things we actually care about. Thus we have
free_extent_buffer_stale for use when we are freeing tree blocks. This will
make it so that the ref for the eb being in the radix tree is dropped as soon as
possible and then is freed when the refcount hits 0 instead of waiting to be
released by releasepage. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>
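
The idea of dropping the tree's reference early, so that the buffer goes away as soon as the last user is done, can be modelled with a plain reference count. This is a deliberately simplified sketch: `struct toy_eb`, `toy_put`, and `toy_free_stale` are hypothetical, and the real function deals with locking and page state that are omitted here.

```c
#include <assert.h>
#include <stdbool.h>

/* Toy extent buffer: refs counts the tree's reference plus all users. */
struct toy_eb {
	int refs;
	bool in_tree;	/* the lookup tree still holds one reference */
	bool freed;
};

/* Drop one reference; the buffer is freed when the count reaches zero. */
static void toy_put(struct toy_eb *eb)
{
	assert(eb->refs > 0);
	if (--eb->refs == 0)
		eb->freed = true;
}

/*
 * Release a buffer known to be stale: give up the tree's reference right
 * away, then drop the caller's own reference.
 */
static void toy_free_stale(struct toy_eb *eb)
{
	if (eb->in_tree) {
		eb->in_tree = false;
		toy_put(eb);
	}
	toy_put(eb);
}
```

With only the tree and the caller holding references, the buffer is freed immediately; if another user still holds one, it is freed on that user's final put rather than waiting for memory reclaim.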

22 Mar, 2012 6 commits

btrfs: replace many BUG_ONs with proper error handling · 79787eaa

Committed by Jeff Mahoney on Mar 12, 2012

btrfs currently handles most errors with BUG_ON. This patch is a
work-in-progress but aims to handle most errors other than internal logic
errors and ENOMEM more gracefully.

This iteration prevents most crashes but can run into lockups with
the page lock on occasion when the timing "works out."
Signed-off-by: Jeff Mahoney <jeffm@suse.com>

btrfs: Go readonly on tree errors in balance_level · 305a26af

Committed by Mark Fasheh on Sep 01, 2011

balance_level() seems to deal with missing tree nodes by BUG_ON(). Instead,
we can easily just set the file system readonly and bubble -EROFS back up
the stack.
Signed-off-by: Mark Fasheh <mfasheh@suse.com>
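
A minimal sketch of that error-handling style follows: instead of aborting on a missing node, mark the filesystem read-only and return `-EROFS` so callers can propagate it. `struct toy_fs`, `struct toy_node`, and `toy_balance` are hypothetical stand-ins, not the kernel code.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>

struct toy_fs {
	int read_only;
};

struct toy_node {
	int nritems;
};

/*
 * Returns 0 on success. If the child node is missing, flags the file
 * system read-only and returns -EROFS rather than crashing.
 */
static int toy_balance(struct toy_fs *fs, const struct toy_node *child)
{
	assert(fs != NULL);

	if (child == NULL) {
		fs->read_only = 1;	/* stop further writes */
		return -EROFS;		/* let the caller bubble this up */
	}

	/* The actual rebalancing work would go here. */
	return 0;
}
```

Each caller up the stack would check the return value and pass it along unchanged.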

btrfs: Don't BUG_ON errors from update_ref_for_cow() · b68dc2a9

Committed by Mark Fasheh on Aug 29, 2011

__btrfs_cow_block(), the only caller of update_ref_for_cow(), will BUG_ON()
any error return.  Instead, we can make the fs go read-only, as
update_ref_for_cow() manipulates disk data in a way which doesn't look like
it's easily rolled back.
Signed-off-by: Mark Fasheh <mfasheh@suse.de>

btrfs: Go readonly on bad extent refs in update_ref_for_cow() · e5df9573

Committed by Mark Fasheh on Aug 29, 2011

update_ref_for_cow() will BUG_ON() after its call to
btrfs_lookup_extent_info() if no existing references are found.  Since refs
are computed directly from disk, this should be treated as a corruption
instead of a logic error.
Signed-off-by: Mark Fasheh <mfasheh@suse.de>

btrfs: Don't BUG_ON() errors in update_ref_for_cow() · be1a5564

Committed by Mark Fasheh on Aug 08, 2011

The only caller of update_ref_for_cow() is __btrfs_cow_block() which was
originally ignoring any return values. update_ref_for_cow() however doesn't
look like a candidate to become a void function - there are a few places
where errors can occur.

So instead I changed update_ref_for_cow() to bubble all errors up (instead
of BUG_ON). __btrfs_cow_block() was then updated to catch and BUG_ON() any
errors from update_ref_for_cow(). The end effect is that we have no change
in behavior, but about 8 different places where a BUG_ON(ret) was removed.

Obviously a future patch will have to address the BUG_ON() in
__btrfs_cow_block().
Signed-off-by: Mark Fasheh <mfasheh@suse.de>

btrfs: return void in functions without error conditions · 143bede5

Committed by Jeff Mahoney on Mar 01, 2012

Signed-off-by: Jeff Mahoney <jeffm@suse.com>

22 Dec, 2011 1 commit

Btrfs: mark delayed refs as for cow · 66d7e7f0

Committed by Arne Jansen on Sep 12, 2011

Add a for_cow parameter to add_delayed_*_ref and pass the appropriate value
from every call site. The for_cow parameter will later on be used to
determine if a ref will change anything with respect to qgroups.

Delayed refs coming from relocation are always counted as for_cow, as they
don't change subvol quota.

Also pass in the fs_info for later use.

btrfs_find_all_roots() will use this as an optimization, as changes that are
for_cow will not change anything with respect to which root points to a
certain leaf. Thus, we don't need to add the current sequence number to
those delayed refs.
Signed-off-by: Arne Jansen <sensille@gmx.net>
Signed-off-by: Jan Schmidt <list.btrfs@jan-o-sch.net>

15 Nov, 2011 1 commit

Btrfs: fix tree corruption after multi-thread snapshots and inode_cache flush · f1ebcc74

Committed by Liu Bo on Nov 14, 2011

The btrfs snapshotting code requires that once a root has been
snapshotted, we don't change it during a commit.

But there are two cases that lead to tree corruption:

1) multi-thread snapshots can commit several snapshots in a transaction,
   and this may change the src root when processing the following pending
   snapshots, which leads to corruption of the former snapshots;

2) the free inode cache was changing the roots when it root the cache,
   which leads to corruption.

This fixes things by making sure we force COW the block after we create a
snapshot while committing a transaction, so that any changes to the roots
will result in COW, and we get all the fs roots and snapshot roots to be
consistent.
Signed-off-by: Liu Bo <liubo2009@cn.fujitsu.com>
Signed-off-by: Miao Xie <miaox@cn.fujitsu.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

21 Oct, 2011 1 commit

Btrfs: fix array bound checking · a05a9bb1

Committed by Li Zefan on Sep 06, 2011

Otherwise we can exceed the array bound of path->slots[].
Signed-off-by: Li Zefan <lizf@cn.fujitsu.com>

28 Jul, 2011 3 commits

Btrfs: remove lockdep magic from btrfs_next_leaf · 31533fb2

Committed by Chris Mason on Jul 26, 2011

Before the reader/writer locks, btrfs_next_leaf needed to keep
the path blocking to avoid making lockdep upset.

Now that btrfs_next_leaf only takes read locks, this isn't required.
Signed-off-by: Chris Mason <chris.mason@oracle.com>

Btrfs: switch the btrfs tree locks to reader/writer · bd681513

Committed by Chris Mason on Jul 16, 2011

The btrfs metadata btree is the source of significant
lock contention, especially in the root node.   This
commit changes our locking to use a reader/writer
lock.

The lock is built on top of rw spinlocks, and it
extends the lock tracking to remember if we have a
read lock or a write lock when we go to blocking.  Atomics
count the number of blocking readers or writers at any
given time.

It removes all of the adaptive spinning from the old code
and uses only the spinning/blocking hints inside of btrfs
to decide when it should continue spinning.

In read heavy workloads this is dramatically faster.  In write
heavy workloads we're still faster because of less contention
on the root node lock.

We suffer slightly in dbench because we schedule more often
during write locks, but all other benchmarks so far are improved.
Signed-off-by: Chris Mason <chris.mason@oracle.com>

Btrfs: stop using highmem for extent_buffers · a6591715

Committed by Chris Mason on Jul 19, 2011

The extent_buffers have a very complex interface where
we use HIGHMEM for metadata and try to cache a kmap mapping
to access the memory.

The next commit adds reader/writer locks, and concurrent use
of this kmap cache would make it even more complex.

This commit drops the ability to use HIGHMEM with extent buffers,
and rips out all of the related code.
Signed-off-by: Chris Mason <chris.mason@oracle.com>

10 Jun, 2011 1 commit

Btrfs: don't map extent buffer if path->skip_locking is set · ad3e34bb

Committed by Josef Bacik on Jun 08, 2011

Arne's scrub stuff exposed a problem with mapping the extent buffer in
reada_for_search. He searches the commit root with multiple threads and with
skip_locking set, so we can race and overwrite node->map_token since node isn't
locked. So fix this so that we only map the extent buffer if we don't already
have a map_token and skip_locking isn't set. Without this patch scrub would
panic almost immediately, with the patch it doesn't panic anymore. Thanks,
Reported-by: Arne Jansen <sensille@gmx.net>
Signed-off-by: Josef Bacik <josef@redhat.com>

OpenHarmony / kernel_linux · last synced about 4 years ago