提交 · e7a84565bcdb239caad29ccbe559ef978090ac7e · openanolis / cloud-kernel

25 9月, 2008 40 次提交

Btrfs: Add btree locking to the tree defragmentation code · e7a84565

由 Chris Mason 提交于 6月 25, 2008

The online btree defragger is simplified and rewritten to use
standard btree searches instead of a walk up / down mechanism.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

e7a84565

Btrfs: Replace the transaction work queue with kthreads · a74a4b97

由 Chris Mason 提交于 6月 25, 2008

This creates one kthread for commits and one kthread for
deleting old snapshots.  All the work queues are removed.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

a74a4b97

C
Btrfs: Fix snapshot deletion to release the alloc_mutex much more often. · 333db94c
由 Chris Mason 提交于 6月 25, 2008
```
This lowers the impact of snapshot deletion on the rest of the FS.
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
333db94c

Btrfs: Add a skip_locking parameter to struct path, and make various funcs honor it · 5cd57b2c

由 Chris Mason 提交于 6月 25, 2008

Allocations may need to read in block groups from the extent allocation tree,
which will require a tree search and take locks on the extent allocation
tree.  But, those locks might already be held in other places, leading
to deadlocks.

Since the alloc_mutex serializes everything right now, it is safe to
skip the btree locking while caching block groups.  A better fix will be
to either create a recursive lock or find a way to back off existing
locks while caching block groups.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

5cd57b2c

C
Fix btrfs_next_leaf to check for new items after dropping locks · 168fd7d2
由 Chris Mason 提交于 6月 25, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
168fd7d2

Fix btrfs_del_ordered_inode to allow forcing the drop during unlinks · 594a24eb

由 Chris Mason 提交于 6月 25, 2008

This allows us to delete an unlinked inode with dirty pages from the list
instead of forcing commit to write these out before deleting the inode.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

594a24eb

Drop locks in btrfs_search_slot when reading a tree block. · 051e1b9f

由 Chris Mason 提交于 6月 25, 2008

One lock per btree block can make for significant congestion if everyone
has to wait for IO at the high levels of the btree. This drops
locks held by a path when doing reads during a tree search.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

051e1b9f

Btrfs: Replace the big fs_mutex with a collection of other locks · a2135011

由 Chris Mason 提交于 6月 25, 2008

Extent alloctions are still protected by a large alloc_mutex.
Objectid allocations are covered by a objectid mutex
Other btree operations are protected by a lock on individual btree nodes
Signed-off-by: NChris Mason <chris.mason@oracle.com>

a2135011

Btrfs: Start btree concurrency work. · 925baedd

由 Chris Mason 提交于 6月 25, 2008

The allocation trees and the chunk trees are serialized via their own
dedicated mutexes.  This means allocation location is still not very
fine grained.

The main FS btree is protected by locks on each block in the btree.  Locks
are taken top / down, and as processing finishes on a given level of the
tree, the lock is released after locking the lower level.

The end result of a search is now a path where only the lowest level
is locked.  Releasing or freeing the path drops any locks held.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

925baedd

Btrfs: Allocator fix variety pack · 0ef3e66b

由 Chris Mason 提交于 5月 24, 2008

* Force chunk allocation when find_free_extent has to do a full scan
* Record the max key at the start of defrag so it doesn't run forever
* Block groups might not be contiguous, make a forward search for the
  next block group in extent-tree.c
* Get rid of extra checks for total fs size
* Fix relocate_one_reference to avoid relocating the same file data block
  twice when referenced by an older transaction
* Use the open device count when allocating chunks so that we don't
  try to allocate from devices that don't exist
Signed-off-by: NChris Mason <chris.mason@oracle.com>

0ef3e66b

Btrfs: Handle write errors on raid1 and raid10 · 1259ab75

由 Chris Mason 提交于 5月 12, 2008

When duplicate copies exist, writes are allowed to fail to one of those
copies.  This changeset includes a few changes that allow the FS to
continue even when some IOs fail.

It also adds verification of the parent generation number for btree blocks.
This generation is stored in the pointer to a block, and it ensures
that missed writes to are detected.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

1259ab75

C
Btrfs: Pass down the expected generation number when reading tree blocks · ca7a79ad
由 Chris Mason 提交于 5月 12, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
ca7a79ad

Btrfs: Fix balance_level to free the middle block if there is room in the left one · bce4eae9

由 Chris Mason 提交于 4月 24, 2008

balance level starts by trying to empty the middle block, and then
pushes from the right to the middle.  This might empty the right block
and leave a small number of pointers in the middle.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

bce4eae9

C
Btrfs: Don't empty the middle buffer in push_nodes_for_insert · 971a1f66
由 Chris Mason 提交于 4月 24, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
971a1f66
C
Btrfs: Fix split_node to require more empty slots in the node as well · c448acf0
由 Chris Mason 提交于 4月 24, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
c448acf0
C
Btrfs: Make sure nodes have enough room for a double split · 1514794e
由 Chris Mason 提交于 4月 24, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
1514794e

Btrfs: Don't wait on tree block writeback before freeing them anymore · 699122f5

由 Chris Mason 提交于 4月 16, 2008

This isn't required anymore because we don't reallocate blocks that
have already been written in this transaction.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

699122f5

Btrfs: Add chunk uuids and update multi-device back references · e17cade2

由 Chris Mason 提交于 4月 15, 2008

Block headers now store the chunk tree uuid

Chunk items records the device uuid for each stripes

Device extent items record better back refs to the chunk tree

Block groups record better back refs to the chunk tree

The chunk tree format has also changed.  The objectid of BTRFS_CHUNK_ITEM_KEY
used to be the logical offset of the chunk.  Now it is a chunk tree id,
with the logical offset being stored in the offset field of the key.

This allows a single chunk tree to record multiple logical address spaces,
upping the number of bytes indexed by a chunk tree from 2^64 to
2^128.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

e17cade2

C
Btrfs: Disable extra debugging checks on tree blocks · 85d824c4
由 Chris Mason 提交于 4月 10, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
85d824c4
C
Btrfs: Retry metadata reads in the face of checksum failures · f188591e
由 Chris Mason 提交于 4月 09, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
f188591e

Btrfs: Do metadata checksums for reads via a workqueue · ce9adaa5

由 Chris Mason 提交于 4月 09, 2008

Before, metadata checksumming was done by the callers of read_tree_block,
which would set EXTENT_CSUM bits in the extent tree to show that a given
range of pages was already checksummed and didn't need to be verified
again.

But, those bits could go away via try_to_releasepage, and the end
result was bogus checksum failures on pages that never left the cache.

The new code validates checksums when the page is read.  It is a little
tricky because metadata blocks can span pages and a single read may
end up going via multiple bios.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

ce9adaa5

C
Change btrfs_map_block to return a structure with mappings for all stripes · cea9e445
由 Chris Mason 提交于 4月 09, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
cea9e445
C
Btrfs: Properly dirty buffers in the split corner cases · 0ef8b242
由 Chris Mason 提交于 4月 03, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
0ef8b242

Btrfs: Verify checksums on tree blocks found without read_tree_block · 0999df54

由 Chris Mason 提交于 4月 01, 2008

Checksums were only verified by btrfs_read_tree_block, which meant the
functions to probe the page cache for blocks were not validating checksums.
Normally this is fine because the buffers will only be in cache if they
have already been validated.

But, there is a window while the buffer is being read from disk where
it could be up to date in the cache but not yet verified.  This patch
makes sure all buffers go through checksum verification before they
are used.

This is safer, and it prevents modification of buffers before they go
through the csum code.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

0999df54

Reorder the flags field in struct btrfs_header and record a flag on writeout · 63b10fc4

由 Chris Mason 提交于 4月 01, 2008

This allows detection of blocks that have already been written in the
running transaction so they can be recowed instead of modified again.
It is step one in trusting the transid field of the block pointers.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

63b10fc4

C
Btrfs: Add support for multiple devices per filesystem · 0b86a832
由 Chris Mason 提交于 3月 24, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
0b86a832

Call btrfs_cow_block while lowering tree level. · 2f375ab9

由 Yan 提交于 2月 01, 2008

When freeing root block of a tree,  btrfs_free_extent' parameter
'ref_generation' is from root block itseft.  When freeing non-root
block,  'ref_generation' is from its parent. so when converting a
non-root block to root block, we must guarantee its generation is
equal to its parent's generation.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

2f375ab9

C
Btrfs: Copy correct tree when inserting into slot 0 · 5a01a2e3
由 Chris Mason 提交于 1月 30, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
5a01a2e3
C
Btrfs: Add inode item and backref in one insert, reducing cpu usage · 9c58309d
由 Chris Mason 提交于 1月 29, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
9c58309d
C
Btrfs: During deletes and truncate, remove many items at once from the tree · 85e21bac
由 Chris Mason 提交于 1月 29, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
85e21bac

Btrfs: Add data=ordered support · dc17ff8f

由 Chris Mason 提交于 1月 08, 2008

This forces file data extents down the disk along with the metadata that
references them. The current implementation is fairly simple, and just
writes out all of the dirty pages in an inode before the commit.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

dc17ff8f

C
Btrfs: Force inlining off in a few places to save stack usage · 98ed5174
由 Chris Mason 提交于 1月 03, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
98ed5174
C
Btrfs: Add readahead to the online shrinker, and a mount -o alloc_start= for testing · 8f662a76
由 Chris Mason 提交于 1月 02, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
8f662a76
C
Btrfs: Less aggressive readahead on deletes · 01f46658
由 Chris Mason 提交于 12月 21, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
01f46658
C
kmalloc a few large stack objects in the btrfs_ioctl path · 4aec2b52
由 Chris Mason 提交于 12月 18, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
4aec2b52

Btrfs: Add mount option to turn off data cow · be20aa9d

由 Chris Mason 提交于 12月 17, 2007

A number of workloads do not require copy on write data or checksumming.
mount -o nodatasum to disable checksums and -o nodatacow to disable
both copy on write and checksumming.

In nodatacow mode, copy on write is still performed when a given extent
is under snapshot.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

be20aa9d

C
Btrfs: Add back pointers from extents to the btree or file referencing them · 7bb86316
由 Chris Mason 提交于 12月 11, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
7bb86316
C
Btrfs: Implement generation numbers in block pointers · 74493f7a
由 Chris Mason 提交于 12月 11, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
74493f7a

Btrfs: Properly update right_nritems in push_leaf_left · eef1c494

由 Yan 提交于 11月 26, 2007

The codes that fixup the right leaf and the codes that dirty the
extnet buffer use the variable 'right_nritems' ,  both of them expect
'right_nritems' is the number of items in right leaf after the push.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

eef1c494

C
Btrfs: Change push_leaf_{leaf,right} to empty the src leave during item deletion · 34a38218
由 Chris Mason 提交于 11月 07, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
34a38218

openanolis / cloud-kernel 大约 1 年 前同步成功

openanolis / cloud-kernel
大约 1 年前同步成功