提交 · c8b978188c9a0fd3d535c13debd19d522b726f1f · openanolis / cloud-kernel

30 10月, 2008 1 次提交

Btrfs: Add zlib compression support · c8b97818

由 Chris Mason 提交于 10月 29, 2008

This is a large change for adding compression on reading and writing,
both for inline and regular extents.  It does some fairly large
surgery to the writeback paths.

Compression is off by default and enabled by mount -o compress.  Even
when the -o compress mount option is not used, it is possible to read
compressed extents off the disk.

If compression for a given set of pages fails to make them smaller, the
file is flagged to avoid future compression attempts later.

* While finding delalloc extents, the pages are locked before being sent down
to the delalloc handler.  This allows the delalloc handler to do complex things
such as cleaning the pages, marking them writeback and starting IO on their
behalf.

* Inline extents are inserted at delalloc time now.  This allows us to compress
the data before inserting the inline extent, and it allows us to insert
an inline extent that spans multiple pages.

* All of the in-memory extent representations (extent_map.c, ordered-data.c etc)
are changed to record both an in-memory size and an on disk size, as well
as a flag for compression.

From a disk format point of view, the extent pointers in the file are changed
to record the on disk size of a given extent and some encoding flags.
Space in the disk format is allocated for compression encoding, as well
as encryption and a generic 'other' field.  Neither the encryption or the
'other' field are currently used.

In order to limit the amount of data read for a single random read in the
file, the size of a compressed extent is limited to 128k.  This is a
software only limit, the disk format supports u64 sized compressed extents.

In order to limit the ram consumed while processing extents, the uncompressed
size of a compressed extent is limited to 256k.  This is a software only limit
and will be subject to tuning later.

Checksumming is still done on compressed extents, and it is done on the
uncompressed version of the data.  This way additional encodings can be
layered on without having to figure out which encoding to checksum.

Compression happens at delalloc time, which is basically singled threaded because
it is usually done by a single pdflush thread.  This makes it tricky to
spread the compression load across all the cpus on the box.  We'll have to
look at parallel pdflush walks of dirty inodes at a later time.

Decompression is hooked into readpages and it does spread across CPUs nicely.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

c8b97818

25 9月, 2008 24 次提交

Btrfs: Fix some data=ordered related data corruptions · f421950f

由 Chris Mason 提交于 7月 22, 2008

Stress testing was showing data checksum errors, most of which were caused
by a lookup bug in the extent_map tree.  The tree was caching the last
pointer returned, and searches would check the last pointer first.

But, search callers also expect the search to return the very first
matching extent in the range, which wasn't always true with the last
pointer usage.

For now, the code to cache the last return value is just removed.  It is
easy to fix, but I think lookups are rare enough that it isn't required anymore.

This commit also replaces do_sync_mapping_range with a local copy of the
related functions.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

f421950f

Btrfs: Keep extent mappings in ram until pending ordered extents are done · 7f3c74fb

由 Chris Mason 提交于 7月 18, 2008

It was possible for stale mappings from disk to be used instead of the
new pending ordered extent. This adds a flag to the extent map struct
to keep it pinned until the pending ordered extent is actually on disk.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

7f3c74fb

Btrfs: Split the extent_map code into two parts · d1310b2e

由 Chris Mason 提交于 1月 24, 2008

There is now extent_map for mapping offsets in the file to disk and
extent_io for state tracking, IO submission and extent_bufers.

The new extent_map code shifts from [start,end] pairs to [start,len], and
pushes the locking out into the caller.  This allows a few performance
optimizations and is easier to use.

A number of extent_map usage bugs were fixed, mostly with failing
to remove extent_map entries when changing the file.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

d1310b2e

Btrfs: Implement basic support for -ENOSPC · 1832a6d5

由 Chris Mason 提交于 12月 21, 2007

This is intended to prevent accidentally filling the drive.  A determined
user can still make things oops.

It includes some accounting of the current bytes under delayed allocation,
but this will change as things get optimized
Signed-off-by: NChris Mason <chris.mason@oracle.com>

1832a6d5

Btrfs: section mismatch warnings · 17636e03

由 Christian Hesse 提交于 12月 11, 2007

--Boundary-00=_CcOWHFYK4T+JwSj
Content-Type: text/plain;
  charset="iso-8859-1"
Content-Transfer-Encoding: 7bit
Content-Disposition: inline

Hello everybody,

compiling btrfs into the kernel results in section mismatch warnings. __exit
functions are called where they are not allowed to. The attached patch fixes
this for me. Not sure if it is correct though.
Signed-off-by: NChristian Hesse <mail@earthworm.de>
--
Regards,
Chris

--Boundary-00=_CcOWHFYK4T+JwSj
Content-Type: text/x-diff; charset="iso-8859-1";
	name="btrfs-section_mismatches.patch"
Content-Transfer-Encoding: 7bit
Content-Disposition: attachment;
	filename="btrfs-section_mismatches.patch"
Signed-off-by: NChris Mason <chris.mason@oracle.com>

17636e03

C
Btrfs: Add efficient dirty accounting to the extent_map tree · ca664626
由 Chris Mason 提交于 11月 27, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
ca664626
C
Btrfs: Limit btree writeback to prevent seeks · 793955bc
由 Chris Mason 提交于 11月 26, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
793955bc
W
Btrfs: Return value checking in module init · 2f4cbe64
由 Wyatt Banks 提交于 11月 19, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
2f4cbe64
C
Btrfs: Add readpages support · 3ab2fb5a
由 Chris Mason 提交于 11月 08, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
3ab2fb5a
C
Btrfs: Add writepages support · b293f02e
由 Chris Mason 提交于 11月 01, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
b293f02e

Btrfs: Fix a number of inline extent problems that Yan Zheng reported. · 179e29e4

由 Chris Mason 提交于 11月 01, 2007

The fixes do a number of things:

1) Most btrfs_drop_extent callers will try to leave the inline extents in
place.  It can truncate bytes off the beginning of the inline extent if
required.

2) writepage can now update the inline extent, allowing mmap writes to
go directly into the inline extent.

3) btrfs_truncate_in_transaction truncates inline extents

4) extent_map.c fixed to not merge inline extent mappings and hole
mappings together
Signed-off-by: NChris Mason <chris.mason@oracle.com>

179e29e4

C
Btrfs: Add back metadata checksumming · 19c00ddc
由 Chris Mason 提交于 10月 15, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
19c00ddc
C
Btrfs: extent_map optimizations to cut down on CPU usage · 810191ff
由 Chris Mason 提交于 10月 15, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
810191ff
C
Btrfs: Add an extent buffer LRU to reduce radix tree hits · 4dc11904
由 Chris Mason 提交于 10月 15, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
4dc11904
C
Btrfs: Add back the online defragging code · 6b80053d
由 Chris Mason 提交于 10月 15, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
6b80053d
C
Btrfs: Use an array of pages in the extent buffers to reduce the cost of find_get_page · 09e71a32
由 Chris Mason 提交于 10月 15, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
09e71a32
C
Btrfs: Allow tree blocks larger than the page size · db94535d
由 Chris Mason 提交于 10月 15, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
db94535d
C
Btrfs: Change the remaining radix trees used by extent-tree.c to extent_map trees · 1a5bc167
由 Chris Mason 提交于 10月 15, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
1a5bc167
C
Btrfs: Stop using radix trees for the block group cache · 96b5179d
由 Chris Mason 提交于 10月 15, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
96b5179d
C
Btrfs: Fix extent_buffer and extent_state leaks · f510cfec
由 Chris Mason 提交于 10月 15, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
f510cfec
C
Btrfs: Avoid memcpy where possible in extent_buffers · 6d36dcd4
由 Chris Mason 提交于 10月 15, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
6d36dcd4
C
Btrfs: Optimizations for the extent_buffer code · 479965d6
由 Chris Mason 提交于 10月 15, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
479965d6
C
Btrfs: Create extent_buffer interface for large blocksizes · 5f39d397
由 Chris Mason 提交于 10月 15, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
5f39d397
C
Btrfs: factor page private preparations into a helper · b3cfa35a
由 Christoph Hellwig 提交于 9月 17, 2007
```
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
b3cfa35a

11 9月, 2007 2 次提交

Btrfs: [PATCH] extent_map: add writepage_end_io hook · 0e2752a7

由 Christoph Hellwig 提交于 9月 10, 2007

XFS updates the ondisk inode size only after the data I/O has finished,
so it needs a hook when the writepage end_bio handler has finished.

Might not be worth applying as-is as the per-page callback is very
ineffcient.  What XFS really wants is a callback when writeout of a
whole extent has completed.  This delayed i_size updates scheme might
be worthwile for btrfs aswell, btw.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

0e2752a7

Btrfs: [PATCH] extent_map: provide generic bmap · d396c6f5

由 Christoph Hellwig 提交于 9月 10, 2007

generic_bmap is completely trivial, while the extent to bh mapping in
btrfs is rather complex.  So provide a extent_bmap instead that takes
a get_extent callback and can be used by filesystem using the extent_map
code.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

d396c6f5

30 8月, 2007 1 次提交
- C
  Btrfs: Add file data csums back in via hooks in the extent map code · 07157aac
  由 Chris Mason 提交于 8月 30, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
  07157aac
28 8月, 2007 2 次提交
- C
  Btrfs: Add delayed allocation to the extent based page tree code · b888db2b
  由 Chris Mason 提交于 8月 27, 2007
```
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
  b888db2b
- C
  Btrfs: Extent based page cache code. This uses an rbtree of extents and tests · a52d9a80
  由 Chris Mason 提交于 8月 27, 2007
```
instead of buffer heads.
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
  a52d9a80

openanolis / cloud-kernel 接近 2 年 前同步成功

openanolis / cloud-kernel
接近 2 年前同步成功