提交 · ef8bbdfe7e12dc9b4e80756f6d606c4639c65851 · openanolis / cloud-kernel

25 9月, 2008 40 次提交

Btrfs: Dir fsync optimizations · 49eb7e46

由 Chris Mason 提交于 9月 11, 2008

Drop i_mutex during the commit

Don't bother doing the fsync at all unless the dir is marked as dirtied
and needing fsync in this transaction.  For directories, this means
that someone has unlinked a file from the dir without fsyncing the
file.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

49eb7e46

C
Btrfs: Fix releasepage to properly keep dirty and writeback pages · 98509cfc
由 Chris Mason 提交于 9月 11, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
98509cfc
C
Btrfs: Update the highest objectid in a root after log replay is done · 8d5bf1cb
由 Chris Mason 提交于 9月 11, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
8d5bf1cb

remove unused function btrfs_ilookup · a237d2a2

由 Christoph Hellwig 提交于 9月 05, 2008

btrfs_ilookup is unused, which is good because a normal filesystem
should never have to use ilookup anyway.  Remove it.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

a237d2a2

Btrfs: Add a write ahead tree log to optimize synchronous operations · e02119d5

由 Chris Mason 提交于 9月 05, 2008

File syncs and directory syncs are optimized by copying their
items into a special (copy-on-write) log tree. There is one log tree per
subvolume and the btrfs super block points to a tree of log tree roots.

After a crash, items are copied out of the log tree and back into the
subvolume. See tree-log.c for all the details.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

e02119d5

Btrfs: optimize btrget/set/removexattr · 95819c05

由 Christoph Hellwig 提交于 8月 28, 2008

btrfs actually stores the whole xattr name, including the prefix ondisk,
so using the generic resolver that strips off the prefix is not very
helpful.  Instead do the real ondisk xattrs manually and only use the
generic resolver for synthetic xattrs like ACLs.

(Sorry Josef for guiding you towards the wrong direction here intially)
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

95819c05

Btrfs: Optimise NFS readdir hack slightly; don't call readdir() again when done · f2322b1c

由 David Woodhouse 提交于 8月 17, 2008

Date: Sun, 17 Aug 2008 17:12:56 +0100
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

f2322b1c

Minor cleanup of btrfs_real_readdir() · 49593bfa

由 David Woodhouse 提交于 8月 17, 2008

Date: Sun, 17 Aug 2008 17:08:36 +0100
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

49593bfa

Btrfs: Remove special cases for "." and ".." · 5ecc7e5d

由 David Woodhouse 提交于 8月 17, 2008

Date: Sun, 17 Aug 2008 15:14:48 +0100
We never get asked by the VFS to lookup either of them, and we can
handle the readdir() case a lot more simply, too.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

5ecc7e5d

Btrfs: Implement our own copy of the nfsd readdir hack, for older kernels · cbdf5a24

由 David Woodhouse 提交于 8月 06, 2008

Date: Wed, 6 Aug 2008 19:42:33 +0100
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

cbdf5a24

Introduce btrfs_iget helper · 1a54ef8c

由 Balaji Rao 提交于 7月 21, 2008

Date: Mon, 21 Jul 2008 02:01:04 +0530
This patch introduces a btrfs_iget helper to be used in NFS support.
Signed-off-by: NBalaji Rao <balajirrao@gmail.com>
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

1a54ef8c

Btrfs: Lookup readpage checksums on bio submission again · 4d1b5fb4

由 Chris Mason 提交于 8月 20, 2008

This optimization had been removed because I thought it was triggering
csum errors.  The real cause of the errors was elsewhere, and so
this optimization is back.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

4d1b5fb4

Btrfs: Fix add_extent_mapping to check for duplicates across the whole range · 7c2fe32a

由 Chris Mason 提交于 8月 20, 2008

add_extent_mapping was allowing the insertion of overlapping extents.
This never used to happen because it only inserted the extents from disk
and those were never overlapping.

But, with the data=ordered code, the disk and memory representations of the
file are not the same.  add_extent_mapping needs to ensure a new extent
does not overlap before it inserts.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

7c2fe32a

Btrfs: Lower contention on the csum mutex · 53863232

由 Chris Mason 提交于 8月 15, 2008

This takes the csum mutex deeper in the call chain and releases it
more often.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

53863232

Btrfs: Init address_space->writeback_index properly · db69e0eb

由 Chris Mason 提交于 8月 15, 2008

The writeback_index field is used by write_cache_pages to pick up where
writeback on a given inode left off.  But, it is never set to a sane
value, so writeback can often start at a random offset in the file.

Kernels 2.6.28 and higher will have this fixed, but for everyone else,
we also fill in the value in btrfs.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

db69e0eb

C
Btrfs: Avoid calling into the FS for the final iput on fake root inodes · 4ca8b41e
由 Chris Mason 提交于 8月 05, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
4ca8b41e
Y
Btrfs: Fix nodatacow for the new data=ordered mode · 7ea394f1
由 Yan Zheng 提交于 8月 05, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
7ea394f1

Get rid of BTRFS_I(inode)->index and use local vars instead · 00e4e6b3

由 Chris Mason 提交于 8月 05, 2008

rename and link don't always have a lock on the source inode, and
our use of a per-inode index variable was racy.  This changes things to
store the index in a local variable instead.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

00e4e6b3

C
btrfs_lookup_bio_sums seems broken, go back to the readpage_io_hook for now · 3de9d6b6
由 Chris Mason 提交于 8月 04, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
3de9d6b6
C
Btrfs: Maintain a list of inodes that are delalloc and a way to wait on them · ea8c2819
由 Chris Mason 提交于 8月 04, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
ea8c2819
C
Btrfs: Hold csum mutex while reading in sums during readpages · 6dab8157
由 Chris Mason 提交于 8月 04, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
6dab8157
C
Btrfs: Drop some debugging around the extent_map pinned flag · 3ce7e67a
由 Chris Mason 提交于 7月 31, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
3ce7e67a

Btrfs: Fix streaming read performance with checksumming on · 61b49440

由 Chris Mason 提交于 7月 31, 2008

Large streaming reads make for large bios, which means each entry on the
list async work queues represents a large amount of data. IO
congestion throttling on the device was kicking in before the async
worker threads decided a single thread was busy and needed some help.

The end result was that a streaming read would result in a single CPU
running at 100% instead of balancing the work off to other CPUs.

This patch also changes the pre-IO checksum lookup done by reads to
work on a per-bio basis instead of a per-page. This results in many
extra btree lookups on large streaming reads. Doing the checksum lookup
right before bio submit allows us to reuse searches while processing
adjacent offsets.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

61b49440

Btrfs: Add compatibility for kernels >= 2.6.27-rc1 · 0ee0fda0

由 Sven Wegener 提交于 7月 30, 2008

Add a couple of #if's to follow API changes.
Signed-off-by: NSven Wegener <sven.wegener@stealer.net>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

0ee0fda0

Btrfs: implement memory reclaim for leaf reference cache · bcc63abb

由 Yan 提交于 7月 30, 2008

The memory reclaiming issue happens when snapshot exists. In that
case, some cache entries may not be used during old snapshot dropping,
so they will remain in the cache until umount.

The patch adds a field to struct btrfs_leaf_ref to record create time. Besides,
the patch makes all dead roots of a given snapshot linked together in order of
create time. After a old snapshot was completely dropped, we check the dead
root list and remove all cache entries created before the oldest dead root in
the list.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

bcc63abb

Btrfs: Update and fix mount -o nodatacow · f321e491

由 Yan Zheng 提交于 7月 30, 2008

To check whether a given file extent is referenced by multiple snapshots, the
checker walks down the fs tree through dead root and checks all tree blocks in
the path.

We can easily detect whether a given tree block is directly referenced by other
snapshot. We can also detect any indirect reference from other snapshot by
checking reference's generation. The checker can always detect multiple
references, but can't reliably detect cases of single reference. So btrfs may
do file data cow even there is only one reference.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

f321e491

Btrfs: Throttle operations if the reference cache gets too large · ab78c84d

由 Chris Mason 提交于 7月 29, 2008

A large reference cache is directly related to a lot of work pending
for the cleaner thread.  This throttles back new operations based on
the size of the reference cache so the cleaner thread will be able to keep
up.

Overall, this actually makes the FS faster because the cleaner thread will
be more likely to find things in cache.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

ab78c84d

Btrfs: Leaf reference cache update · 017e5369

由 Chris Mason 提交于 7月 28, 2008

This changes the reference cache to make a single cache per root
instead of one cache per transaction, and to key by the byte number
of the disk block instead of the keys inside.

This makes it much less likely to have cache misses if a snapshot
or something has an extra reference on a higher node or a leaf while
the first transaction that added the leaf into the cache is dropping.

Some throttling is added to functions that free blocks heavily so they
wait for old transactions to drop.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

017e5369

Btrfs: Fix .. lookup corner case · 445dceb7

由 Yan 提交于 7月 24, 2008

Inode ref item can be in the next leaf when we find "path->slots[0] ==
btrfs_header_nritems(...)".
Signed-off-by: NChris Mason <chris.mason@oracle.com>

445dceb7

Btrfs: Remove unused variable in fixup_tree_root_location · 45467261

由 Balaji Rao 提交于 7月 24, 2008

Remove a unused variable 'path' in fixup_tree_root_location.
Signed-off-by: NBalaji Rao <balajirrao@gmail.com>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

45467261

J
Btrfs: Create orphan inode records to prevent lost files after a crash · 7b128766
由 Josef Bacik 提交于 7月 24, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
7b128766
J
Btrfs: Add ACL support · 33268eaf
由 Josef Bacik 提交于 7月 24, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
33268eaf
J
Btrfs: Implement new dir index format · aec7477b
由 Josef Bacik 提交于 7月 24, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
aec7477b

Btrfs: Search data ordered extents first for checksums on read · 89642229

由 Chris Mason 提交于 7月 24, 2008

Checksum items are not inserted into the tree until all of the io from a
given extent is complete. This means one dirty page from an extent may
be written, freed, and then read again before the entire extent is on disk
and the checksum item is inserted.

The checksums themselves are stored in the ordered extent so they can
be inserted in bulk when IO is complete. On read, if a checksum item isn't
found, the ordered extents were being searched for a checksum record.

This all worked most of the time, but the checksum insertion code tries
to reduce the number of tree operations by pre-inserting checksum items
based on i_size and a few other factors. This means the read code might
find a checksum item that hasn't yet really been filled in.

This commit changes things to check the ordered extents first and only
dive into the btree if nothing was found. This removes the need for
extra locking and is more reliable.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

89642229

C
Btrfs: Take the csum mutex while reading checksums · ed98b56a
由 Chris Mason 提交于 7月 22, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
ed98b56a

Btrfs: Fix some data=ordered related data corruptions · f421950f

由 Chris Mason 提交于 7月 22, 2008

Stress testing was showing data checksum errors, most of which were caused
by a lookup bug in the extent_map tree.  The tree was caching the last
pointer returned, and searches would check the last pointer first.

But, search callers also expect the search to return the very first
matching extent in the range, which wasn't always true with the last
pointer usage.

For now, the code to cache the last return value is just removed.  It is
easy to fix, but I think lookups are rare enough that it isn't required anymore.

This commit also replaces do_sync_mapping_range with a local copy of the
related functions.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

f421950f

Btrfs: Index extent buffers in an rbtree · 6af118ce

由 Chris Mason 提交于 7月 22, 2008

Before, extent buffers were a temporary object, meant to map a number of pages
at once and collect operations on them.

But, a few extra fields have crept in, and they are also the best place to
store a per-tree block lock field as well.  This commit puts the extent
buffers into an rbtree, and ensures a single extent buffer for each
tree block.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

6af118ce

Btrfs: Data ordered fixes · 4a096752

由 Chris Mason 提交于 7月 21, 2008

* In btrfs_delete_inode, wait for ordered extents after calling
truncate_inode_pages.  This is much faster, and more correct

* Properly clear our the PageChecked bit everywhere we redirty the page.

* Change the writepage fixup handler to lock the page range and check to
see if an ordered extent had been inserted since the improperly dirtied
page was discovered

* Wait for ordered extents outside the transaction.  This isn't required
for locking rules but does improve transaction latencies

* Reduce contention on the alloc_mutex by dropping it while incrementing
refs on a node/leaf and while dropping refs on a leaf.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

4a096752

C
Fix btrfs_wait_ordered_extent_range to properly wait · e5a2217e
由 Chris Mason 提交于 7月 18, 2008
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
e5a2217e

Btrfs: Keep extent mappings in ram until pending ordered extents are done · 7f3c74fb

由 Chris Mason 提交于 7月 18, 2008

It was possible for stale mappings from disk to be used instead of the
new pending ordered extent. This adds a flag to the extent map struct
to keep it pinned until the pending ordered extent is actually on disk.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

7f3c74fb

openanolis / cloud-kernel 大约 1 年 前同步成功

openanolis / cloud-kernel
大约 1 年前同步成功