提交 · 5036f53868ae943704ae69a192d21225dc914c35 · openeuler / raspberrypi-kernel

25 9月, 2008 40 次提交

Btrfs: Add compatibility for kernels >= 2.6.27-rc1 · 0ee0fda0

由 Sven Wegener 提交于 7月 30, 2008

Add a couple of #if's to follow API changes.
Signed-off-by: NSven Wegener <sven.wegener@stealer.net>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

0ee0fda0

Btrfs: implement memory reclaim for leaf reference cache · bcc63abb

由 Yan 提交于 7月 30, 2008

The memory reclaiming issue happens when snapshot exists. In that
case, some cache entries may not be used during old snapshot dropping,
so they will remain in the cache until umount.

The patch adds a field to struct btrfs_leaf_ref to record create time. Besides,
the patch makes all dead roots of a given snapshot linked together in order of
create time. After a old snapshot was completely dropped, we check the dead
root list and remove all cache entries created before the oldest dead root in
the list.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

bcc63abb

Btrfs: Fix verify_parent_transid · 33958dc6

由 Chris Mason 提交于 7月 30, 2008

It was incorrectly clearing the up to date flag on the buffer even
when the buffer properly verified.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

33958dc6

Btrfs: Search data ordered extents first for checksums on read · 89642229

由 Chris Mason 提交于 7月 24, 2008

Checksum items are not inserted into the tree until all of the io from a
given extent is complete. This means one dirty page from an extent may
be written, freed, and then read again before the entire extent is on disk
and the checksum item is inserted.

The checksums themselves are stored in the ordered extent so they can
be inserted in bulk when IO is complete. On read, if a checksum item isn't
found, the ordered extents were being searched for a checksum record.

This all worked most of the time, but the checksum insertion code tries
to reduce the number of tree operations by pre-inserting checksum items
based on i_size and a few other factors. This means the read code might
find a checksum item that hasn't yet really been filled in.

This commit changes things to check the ordered extents first and only
dive into the btree if nothing was found. This removes the need for
extra locking and is more reliable.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

89642229

Btrfs: Fix some data=ordered related data corruptions · f421950f

由 Chris Mason 提交于 7月 22, 2008

Stress testing was showing data checksum errors, most of which were caused
by a lookup bug in the extent_map tree.  The tree was caching the last
pointer returned, and searches would check the last pointer first.

But, search callers also expect the search to return the very first
matching extent in the range, which wasn't always true with the last
pointer usage.

For now, the code to cache the last return value is just removed.  It is
easy to fix, but I think lookups are rare enough that it isn't required anymore.

This commit also replaces do_sync_mapping_range with a local copy of the
related functions.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

f421950f

Btrfs: Use a mutex in the extent buffer for tree block locking · a61e6f29

由 Chris Mason 提交于 7月 22, 2008

This replaces the use of the page cache lock bit for locking, which wasn't
suitable for block size < page size and couldn't be used recursively.

The mutexes alone don't fix either problem, but they are the first step.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

a61e6f29

Btrfs: Index extent buffers in an rbtree · 6af118ce

由 Chris Mason 提交于 7月 22, 2008

Before, extent buffers were a temporary object, meant to map a number of pages
at once and collect operations on them.

But, a few extra fields have crept in, and they are also the best place to
store a per-tree block lock field as well.  This commit puts the extent
buffers into an rbtree, and ensures a single extent buffer for each
tree block.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

6af118ce

Btrfs: Keep extent mappings in ram until pending ordered extents are done · 7f3c74fb

由 Chris Mason 提交于 7月 18, 2008

It was possible for stale mappings from disk to be used instead of the
new pending ordered extent. This adds a flag to the extent map struct
to keep it pinned until the pending ordered extent is actually on disk.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

7f3c74fb

C
Btrfs: Don't allow releasepage to succeed if EXTENT_ORDERED is set · 211f90e6
由 Chris Mason 提交于 7月 18, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
211f90e6

Btrfs: Use async helpers to deal with pages that have been improperly dirtied · 247e743c

由 Chris Mason 提交于 7月 17, 2008

Higher layers sometimes call set_page_dirty without asking the filesystem
to help. This causes many problems for the data=ordered and cow code.
This commit detects pages that haven't been properly setup for IO and
kicks off an async helper to deal with them.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

247e743c

Btrfs: New data=ordered implementation · e6dcd2dc

由 Chris Mason 提交于 7月 17, 2008

The old data=ordered code would force commit to wait until
all the data extents from the transaction were fully on disk.  This
introduced large latencies into the commit and stalled new writers
in the transaction for a long time.

The new code changes the way data allocations and extents work:

* When delayed allocation is filled, data extents are reserved, and
  the extent bit EXTENT_ORDERED is set on the entire range of the extent.
  A struct btrfs_ordered_extent is allocated an inserted into a per-inode
  rbtree to track the pending extents.

* As each page is written EXTENT_ORDERED is cleared on the bytes corresponding
  to that page.

* When all of the bytes corresponding to a single struct btrfs_ordered_extent
  are written, The previously reserved extent is inserted into the FS
  btree and into the extent allocation trees.  The checksums for the file
  data are also updated.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

e6dcd2dc

Btrfs: Change find_extent_buffer to use TestSetPageLocked · 079899c2

由 Chris Mason 提交于 6月 25, 2008

This makes it possible for callers to check for extent_buffers in cache
without deadlocking against any btree locks held.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

079899c2

Btrfs: Start btree concurrency work. · 925baedd

由 Chris Mason 提交于 6月 25, 2008

The allocation trees and the chunk trees are serialized via their own
dedicated mutexes.  This means allocation location is still not very
fine grained.

The main FS btree is protected by locks on each block in the btree.  Locks
are taken top / down, and as processing finishes on a given level of the
tree, the lock is released after locking the lower level.

The end result of a search is now a path where only the lowest level
is locked.  Releasing or freeing the path drops any locks held.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

925baedd

Fix corners in writepage and btrfs_truncate_page · 211c17f5

由 Chris Mason 提交于 5月 15, 2008

The extent_io writepage calls needed an extra check for discarding
pages that started on th last byte in the file.

btrfs_truncate_page needed checks to make sure the page was still part
of the file after reading it, and most importantly, needed to wait for
all IO to the page to finish before freeing the corresponding extents on
disk.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

211c17f5

Btrfs: Handle write errors on raid1 and raid10 · 1259ab75

由 Chris Mason 提交于 5月 12, 2008

When duplicate copies exist, writes are allowed to fail to one of those
copies.  This changeset includes a few changes that allow the FS to
continue even when some IOs fail.

It also adds verification of the parent generation number for btree blocks.
This generation is stored in the pointer to a block, and it ensures
that missed writes to are detected.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

1259ab75

C
Btrfs: Drop some verbose printks · 4235298e
由 Chris Mason 提交于 4月 28, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
4235298e
C
Btrfs: write_cache_pages came in 2.6.22 · 5e478dc9
由 Chris Mason 提交于 4月 25, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
5e478dc9
C
Btrfs: write_extent_pages came in 2.6.23 · 004fb575
由 Chris Mason 提交于 4月 25, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
004fb575

Fix btrfs_get_extent and get_block corner cases, and disable O_DIRECT reads · e1c4b745

由 Chris Mason 提交于 4月 22, 2008

The generic O_DIRECT code assumes all the bios have the same bdev,
which isn't true for multi-device btrfs.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

e1c4b745

Btrfs: Don't drop extent_map cache during releasepage on the btree inode · 7b13b7b1

由 Chris Mason 提交于 4月 18, 2008

The btree inode should only have a single extent_map in the cache,
it doesn't make sense to ever drop it.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

7b13b7b1

Btrfs: Remove bogus max_sector warnings from the extent_io code · 41471e83

由 Chris Mason 提交于 4月 17, 2008

It was testing the bio before doing logical->physical mapping, so the
test was always wrong.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

41471e83

Btrfs: Use the extent map cache to find the logical disk block during data retries · 3b951516

由 Chris Mason 提交于 4月 17, 2008

The data read retry code needs to find the logical disk block before it
can resubmit new bios. But, finding this block isn't allowed to take
the fs_mutex because that will deadlock with a number of different callers.

This changes the retry code to use the extent map cache instead, but
that requires the extent map cache to have the extent we're looking for.
This is a problem because btrfs_drop_extent_cache just drops the entire
extent instead of the little tiny part it is invalidating.

The bulk of the code in this patch changes btrfs_drop_extent_cache to
invalidate only a portion of the extent cache, and changes btrfs_get_extent
to deal with the results.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

3b951516

Btrfs: define write_cache_pages for linux kernel <= 2.6.20 instead · 594994aa

由 Miguel 提交于 4月 11, 2008

write_cache_pages doesn't exist in linux 2.6.20,  change the #if
condition to match that.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

594994aa

C
Btrfs: Handle checksumming errors while reading data blocks · 7e38326f
由 Chris Mason 提交于 4月 09, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
7e38326f
C
Btrfs: Retry metadata reads in the face of checksum failures · f188591e
由 Chris Mason 提交于 4月 09, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
f188591e

Btrfs: Do metadata checksums for reads via a workqueue · ce9adaa5

由 Chris Mason 提交于 4月 09, 2008

Before, metadata checksumming was done by the callers of read_tree_block,
which would set EXTENT_CSUM bits in the extent tree to show that a given
range of pages was already checksummed and didn't need to be verified
again.

But, those bits could go away via try_to_releasepage, and the end
result was bogus checksum failures on pages that never left the cache.

The new code validates checksums when the page is read.  It is a little
tricky because metadata blocks can span pages and a single read may
end up going via multiple bios.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

ce9adaa5

C
Btrfs: Add additional debugging for metadata checksum failures · 728131d8
由 Chris Mason 提交于 4月 09, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
728131d8

Btrfs: Correct usage of IS_ERR() in extent_io.c · 2b114d1d

由 tthtlc 提交于 4月 01, 2008

Signed-off-by: Peter Teoh <htmldeveloper@gmail.com>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

2b114d1d

Btrfs: Add leak debugging for extent_buffer and extent_state · 2d2ae547

由 Chris Mason 提交于 3月 26, 2008

This also fixes one leak around the super block when failing to mount the
FS.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

2d2ae547

C
Btrfs: Bring back mount -o ssd optimizations · 239b14b3
由 Chris Mason 提交于 3月 24, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
239b14b3
C
Btrfs: Add support for multiple devices per filesystem · 0b86a832
由 Chris Mason 提交于 3月 24, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
0b86a832

Btrfs: checksum file data at bio submission time instead of during writepage · 065631f6

由 Chris Mason 提交于 2月 20, 2008

When we checkum file data during writepage, the checksumming is done one
page at a time, making it difficult to do bulk metadata modifications
to insert checksums for large ranges of the file at once.

This patch changes btrfs to checksum on a per-bio basis instead. The
bios are checksummed before they are handed off to the block layer, so
each bio is contiguous and only has pages from the same inode.

Checksumming on a bio basis allows us to insert and modify the file
checksum items in large groups. It also allows the checksumming to
be done more easily by async worker threads.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

065631f6

Btrfs: Allocator improvements · d7fc640e

由 Chris Mason 提交于 2月 18, 2008

Reduce CPU time searching for free blocks by optimizing find_first_extent_bit

Fix find_free_extent to make better use of the last_alloc hint. Before it
was often finding blocks just before the hint.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

d7fc640e

Btrfs: Fix "no csum found for inode" issue. · 39b5637f

由 Yan 提交于 2月 15, 2008

A few codes were not properly updated for changes of extent map.  This
may be the causes of "no csum found for inode" issue.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

39b5637f

C
Btrfs: Create larger bios for btree blocks · a86c12c7
由 Chris Mason 提交于 2月 07, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
a86c12c7
C
Btrfs: Don't case unsigned long to int in bio submission · 961d0232
由 Chris Mason 提交于 2月 06, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
961d0232
Y
Btrfs: Fix typo in extent_io.c · c2e639f0
由 Yan 提交于 2月 04, 2008
```
---
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
c2e639f0
C
Btrfs: Fix delalloc account on state deletion · ae9d1285
由 Chris Mason 提交于 2月 01, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
ae9d1285
C
Btrfs: Add a lookup cache to the extent state tree · 80ea96b1
由 Chris Mason 提交于 2月 01, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
80ea96b1
C
Btrfs: Enable delalloc accounting · b0c68f8b
由 Chris Mason 提交于 1月 31, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
b0c68f8b