Commits · 4a69a41009c4ac691f7d9c289f5f37fabeddce46 · openanolis / cloud-kernel

30 Oct, 2008 1 commit

Btrfs: Add zlib compression support · c8b97818

Committed by Chris Mason on Oct 29, 2008

This is a large change for adding compression on reading and writing,
both for inline and regular extents.  It does some fairly large
surgery to the writeback paths.

Compression is off by default and enabled by mount -o compress.  Even
when the -o compress mount option is not used, it is possible to read
compressed extents off the disk.

If compression for a given set of pages fails to make them smaller, the
file is flagged to avoid future compression attempts.

* While finding delalloc extents, the pages are locked before being sent down
to the delalloc handler.  This allows the delalloc handler to do complex things
such as cleaning the pages, marking them writeback and starting IO on their
behalf.

* Inline extents are inserted at delalloc time now.  This allows us to compress
the data before inserting the inline extent, and it allows us to insert
an inline extent that spans multiple pages.

* All of the in-memory extent representations (extent_map.c, ordered-data.c etc)
are changed to record both an in-memory size and an on disk size, as well
as a flag for compression.

From a disk format point of view, the extent pointers in the file are changed
to record the on disk size of a given extent and some encoding flags.
Space in the disk format is allocated for compression encoding, as well
as encryption and a generic 'other' field.  Neither the encryption nor the
'other' field is currently used.

In order to limit the amount of data read for a single random read in the
file, the size of a compressed extent is limited to 128k.  This is a
software-only limit; the disk format supports u64 sized compressed extents.

In order to limit the RAM consumed while processing extents, the uncompressed
size of a compressed extent is limited to 256k.  This is a software-only limit
and will be subject to tuning later.

Checksumming is still done on compressed extents, and it is done on the
uncompressed version of the data.  This way additional encodings can be
layered on without having to figure out which encoding to checksum.

Compression happens at delalloc time, which is basically single threaded because
it is usually done by a single pdflush thread.  This makes it tricky to
spread the compression load across all the cpus on the box.  We'll have to
look at parallel pdflush walks of dirty inodes at a later time.

Decompression is hooked into readpages and it does spread across CPUs nicely.

Signed-off-by: Chris Mason <chris.mason@oracle.com>

c8b97818
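The size limits quoted in the compression commit above can be illustrated with a small sketch. This is not the kernel code: the function names are hypothetical, and the "keep only if smaller" check is an assumed reading of the message's remark that files are flagged when compression fails to shrink the pages.

```c
#include <stdint.h>

/* Limits quoted in the commit message; both are software-only limits. */
#define MAX_COMPRESSED_BYTES   (128u * 1024u)
#define MAX_UNCOMPRESSED_BYTES (256u * 1024u)

/* Number of compressed extents needed to cover `len` bytes of file data
 * when each extent may hold at most MAX_UNCOMPRESSED_BYTES of input. */
static uint64_t extents_needed(uint64_t len)
{
    return (len + MAX_UNCOMPRESSED_BYTES - 1) / MAX_UNCOMPRESSED_BYTES;
}

/* Hypothetical policy check (an assumption, not the kernel's logic):
 * keep the compressed copy only if it is strictly smaller than the
 * input and fits within the compressed-size limit. */
static int keep_compressed(uint64_t uncompressed_len, uint64_t compressed_len)
{
    return compressed_len < uncompressed_len &&
           compressed_len <= MAX_COMPRESSED_BYTES;
}
```

Under these assumptions a 1 MiB dirty range would be split into four compressed extents, and a chunk that does not shrink would be written uncompressed.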

30 Sep, 2008 1 commit

Btrfs: add and improve comments · d352ac68

Committed by Chris Mason on Sep 29, 2008

This improves the comments at the top of many functions.  It didn't
dive into the guts of functions because I was trying to
avoid merging problems with the new allocator and back reference work.

extent-tree.c and volumes.c were both skipped, and there is definitely
more work to do in cleaning and commenting the code.

Signed-off-by: Chris Mason <chris.mason@oracle.com>

d352ac68

26 Sep, 2008 1 commit

Remove Btrfs compat code for older kernels · 2b1f55b0

Committed by Chris Mason on Sep 24, 2008

Btrfs had compatibility code for kernels back to 2.6.18.  This code has
been removed, and will be maintained in a separate backport
git tree from now on.

Signed-off-by: Chris Mason <chris.mason@oracle.com>

2b1f55b0

25 Sep, 2008 37 commits

Btrfs: Reinstate '-osubvol=.' option to mount entire tree · 76fcef19

Committed by David Woodhouse on Aug 19, 2008

Date: Tue, 19 Aug 2008 16:49:35 +0100
This disappeared when I removed the special case for '.' in btrfs_lookup().

Signed-off-by: David Woodhouse <David.Woodhouse@intel.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

76fcef19

Mask root object ID into f_fsid in btrfs_statfs() · 32d48fa1

Committed by David Woodhouse on Aug 18, 2008

Date: Mon, 18 Aug 2008 13:10:20 +0100
This means that subvolumes get a different fsid, and NFS exporting them
works properly.

Signed-off-by: David Woodhouse <David.Woodhouse@intel.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

32d48fa1
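One way to picture the f_fsid change above is to fold the 128-bit filesystem UUID into the two 32-bit words of f_fsid and then XOR in the subvolume's root object ID. The sketch below is an illustration under that assumption; the struct and function names are hypothetical and byte-order handling is left out.

```c
#include <stdint.h>

/* Stand-in for the two 32-bit words of f_fsid (hypothetical type). */
struct fsid32 {
    uint32_t val[2];
};

/* Fold four 32-bit UUID words into two, then mix in the root object ID
 * so that each subvolume reports a distinct fsid. */
static struct fsid32 make_fsid(const uint32_t uuid_words[4],
                               uint64_t root_objectid)
{
    struct fsid32 out;

    out.val[0] = uuid_words[0] ^ uuid_words[2];
    out.val[1] = uuid_words[1] ^ uuid_words[3];

    /* High half of the object ID goes into word 0, low half into word 1. */
    out.val[0] ^= (uint32_t)(root_objectid >> 32);
    out.val[1] ^= (uint32_t)root_objectid;
    return out;
}
```

Two subvolumes of the same filesystem share the UUID words but differ in root object ID, so the resulting fsid values differ, which is what an NFS export needs to tell them apart.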

Fill f_fsid field in btrfs_statfs() · 9d03632e

Committed by David Woodhouse on Aug 18, 2008

Date: Mon, 18 Aug 2008 12:01:52 +0100

Signed-off-by: David Woodhouse <David.Woodhouse@intel.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

9d03632e

NFS support for btrfs - v3 · be6e8dc0

Committed by Balaji Rao on Jul 21, 2008

Date: Mon, 21 Jul 2008 02:01:56 +0530
Here's an implementation of NFS support for btrfs. It relies on the
fixes which are going into 2.6.28 for the NFS readdir/lookup deadlock.

This uses the btrfs_iget helper introduced previously.

[dwmw2: Tidy up a little, switch to d_obtain_alias() w/compat routine,
	change fh_type,	store parent's root object ID where needed,
	fix some get_parent() and fs_to_dentry() bugs]

Signed-off-by: Balaji Rao <balajirrao@gmail.com>
Signed-off-by: David Woodhouse <David.Woodhouse@intel.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

be6e8dc0

Btrfs: Various small fixes. · b48652c1

Committed by Yan Zheng on Aug 04, 2008

This trivial patch contains two locking fixes and an off-by-one fix.

Signed-off-by: Chris Mason <chris.mason@oracle.com>

b48652c1

Btrfs: Add ACL support · 33268eaf

Committed by Josef Bacik on Jul 24, 2008

Signed-off-by: Chris Mason <chris.mason@oracle.com>

33268eaf

Btrfs: Add version strings on module load · b3c3da71

Committed by Chris Mason on Jul 23, 2008

Signed-off-by: Chris Mason <chris.mason@oracle.com>

b3c3da71

Btrfs: Online btree defragmentation fixes · 3f157a2f

Committed by Chris Mason on Jun 25, 2008

The btree defragger wasn't making forward progress because the new key wasn't
being saved by the btrfs_search_forward function.

This also disables the automatic btree defrag, which wasn't scaling well to
huge filesystems. The auto-defrag needs to be done differently.

Signed-off-by: Chris Mason <chris.mason@oracle.com>

3f157a2f

Btrfs: Replace the transaction work queue with kthreads · a74a4b97

Committed by Chris Mason on Jun 25, 2008

This creates one kthread for commits and one kthread for
deleting old snapshots.  All the work queues are removed.

Signed-off-by: Chris Mason <chris.mason@oracle.com>

a74a4b97

Btrfs: Replace the big fs_mutex with a collection of other locks · a2135011

Committed by Chris Mason on Jun 25, 2008

Extent allocations are still protected by a large alloc_mutex.
Objectid allocations are covered by an objectid mutex.
Other btree operations are protected by a lock on individual btree nodes.

Signed-off-by: Chris Mason <chris.mason@oracle.com>

a2135011

Btrfs: Add a mount option to control worker thread pool size · 4543df7e

Committed by Chris Mason on Jun 11, 2008

mount -o thread_pool_size changes the default, which is
min(num_cpus + 2, 8).  Larger thread pools would make more sense on
very large disk arrays.

This mount option controls the max size of each thread pool.  There
are multiple thread pools, so the total worker count will be larger
than the mount option.

Signed-off-by: Chris Mason <chris.mason@oracle.com>

4543df7e
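The default described in the thread_pool_size commit above, min(num_cpus + 2, 8), works out as in the following sketch. The function names are hypothetical, and the number of pools is left as a parameter because the commit message does not state it.

```c
/* Default pool size from the commit message: min(num_cpus + 2, 8). */
static int default_thread_pool_size(int num_cpus)
{
    int size = num_cpus + 2;

    return size < 8 ? size : 8;
}

/* Hypothetical helper: the mount option caps each pool separately, so
 * the worst-case worker count is the cap times the number of pools.
 * The pool count is an assumption supplied by the caller. */
static int max_total_workers(int pool_size, int num_pools)
{
    return pool_size * num_pools;
}
```

On a 2-CPU machine the default would be 4 workers per pool, and any machine with 6 or more CPUs reaches the cap of 8.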

Btrfs: Fix mount -o max_inline=0 · 15ada040

Committed by Chris Mason on Jun 11, 2008

max_inline=0 used to force the max_inline size to one sector instead.  Now
it properly disables inline data items, while still being able to read
any that happen to exist on disk.

Signed-off-by: Chris Mason <chris.mason@oracle.com>

15ada040
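The fixed max_inline=0 behaviour described above amounts to a write-side check like the one sketched here. The function name is hypothetical and the real kernel applies further conditions; the sketch only shows that a limit of zero turns inlining off rather than being rounded up to one sector.

```c
#include <stdint.h>

/* Decide whether new data of `data_len` bytes may be stored inline,
 * given the max_inline mount option in bytes.  A limit of 0 disables
 * new inline items entirely (reading existing ones is unaffected and
 * is not modelled here). */
static int can_inline(uint64_t data_len, uint64_t max_inline)
{
    if (max_inline == 0)
        return 0;
    return data_len <= max_inline;
}
```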

btrfs: allow scanning multiple devices during mount · 43e570b0

Committed by Christoph Hellwig on Jun 10, 2008

Allows specifying one or more device=/dev/foo options during mount
so that ioctls on the control device can be avoided.  Especially useful
when trying to mount a multi-device setup as root.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

43e570b0

btrfs: sanity mount option parsing and early mount code · edf24abe

Committed by Christoph Hellwig on Jun 10, 2008

Also adds lots of comments to describe what's going on here.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

edf24abe

Btrfs: transaction ioctls · 6bf13c0c

Committed by Sage Weil on Jun 10, 2008

These ioctls let a user application hold a transaction open while it
performs a series of operations.  A final ioctl does a sync on the fs
(closing the current transaction).  This is the main requirement for
Ceph's OSD to be able to keep the data it's storing in a btrfs volume
consistent, and AFAICS it works just fine.  The application would do
something like

	fd = ::open("some/file", O_RDONLY);
	::ioctl(fd, BTRFS_IOC_TRANS_START);
	/* do a bunch of stuff */
	::ioctl(fd, BTRFS_IOC_TRANS_END);
or just
	::close(fd);

And to ensure it commits to disk,

	::ioctl(fd, BTRFS_IOC_SYNC);

When a transaction is held open, the trans_handle is attached to the
struct file (via private_data) so that it will get cleaned up if the
process dies unexpectedly.  A held transaction is also ended on fsync() to
avoid a deadlock.

A misbehaving application could also deliberately hold a transaction open,
effectively locking up the FS, so it may make sense to restrict something
like this to root.

Signed-off-by: Chris Mason <chris.mason@oracle.com>

6bf13c0c

btrfsctl -A error code fixup · f819d837

Committed by Linda Knippers on Jun 09, 2008

Send the error back to userland if the ioctl fails.

Signed-off-by: Chris Mason <chris.mason@oracle.com>

f819d837

btrfs delete ordered inode handling fix · e1b81e67

Committed by Mingming on May 27, 2008

Use btrfs_release_file instead of a put_inode call.

Signed-off-by: Chris Mason <chris.mason@oracle.com>

e1b81e67

Btrfs: Add mount -o degraded to allow mounts to continue with missing devices · dfe25020

Committed by Chris Mason on May 13, 2008

Signed-off-by: Chris Mason <chris.mason@oracle.com>

dfe25020

Btrfs: Add support for online device removal · a061fc8d

Committed by Chris Mason on May 07, 2008

This required a few structural changes to the code that manages bdev pointers:

The VFS super block now gets an anon-bdev instead of a pointer to the
lowest bdev.  This allows us to avoid swapping the super block bdev pointer
around at run time.

The code to read in the super block no longer goes through the extent
buffer interface.  Things got ugly keeping the mapping constant.

Signed-off-by: Chris Mason <chris.mason@oracle.com>

a061fc8d

Btrfs: Add new ioctl to add devices · 788f20eb

Committed by Chris Mason on Apr 28, 2008

Signed-off-by: Chris Mason <chris.mason@oracle.com>

788f20eb

Fix btrfs_fill_super to return -EINVAL when no FS found · e58ca020

Committed by Yan on Apr 01, 2008

Signed-off-by: Chris Mason <chris.mason@oracle.com>

e58ca020

Btrfs: Add support for device scanning and detection ioctls · 8a4b83cc

Committed by Chris Mason on Mar 24, 2008

Signed-off-by: Chris Mason <chris.mason@oracle.com>

8a4b83cc

Add /dev/btrfs-control for device scanning ioctls · a9218f6b

Committed by Chris Mason on Mar 24, 2008

Signed-off-by: Chris Mason <chris.mason@oracle.com>

a9218f6b

Btrfs: Misc 2.6.25 updates · 6885f308

Committed by Chris Mason on Feb 20, 2008

Remove the btrfs read_inode method, and use save_mount_options.

Signed-off-by: Chris Mason <chris.mason@oracle.com>

6885f308

Btrfs: mount -o max_inline=size to control the maximum inline extent size · 6f568d35

Committed by Chris Mason on Jan 29, 2008

Signed-off-by: Chris Mason <chris.mason@oracle.com>

6f568d35

Btrfs: Split the extent_map code into two parts · d1310b2e

Committed by Chris Mason on Jan 24, 2008

There is now extent_map for mapping offsets in the file to disk and
extent_io for state tracking, IO submission and extent_buffers.

The new extent_map code shifts from [start,end] pairs to [start,len], and
pushes the locking out into the caller.  This allows a few performance
optimizations and is easier to use.

A number of extent_map usage bugs were fixed, mostly with failing
to remove extent_map entries when changing the file.

Signed-off-by: Chris Mason <chris.mason@oracle.com>

d1310b2e
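The shift from [start,end] pairs to [start,len] mentioned in the extent_map commit above can be sketched as follows. The type and function names are hypothetical, and treating the old `end` as inclusive is an assumption made for the example.

```c
#include <stdint.h>

/* A byte range in the newer [start, len] form. */
struct byte_range {
    uint64_t start;
    uint64_t len;
};

/* Convert an old-style pair, assumed to have an inclusive end, into
 * the [start, len] form. */
static struct byte_range range_from_inclusive(uint64_t start, uint64_t end)
{
    struct byte_range r = { start, end - start + 1 };

    return r;
}

/* First byte past the range (exclusive end). */
static uint64_t range_end_exclusive(struct byte_range r)
{
    return r.start + r.len;
}

/* Overflow-safe containment test for a file offset. */
static int range_contains(struct byte_range r, uint64_t offset)
{
    return offset >= r.start && offset - r.start < r.len;
}
```

With a length field, adjacent ranges meet exactly at `start + len`, which avoids the repeated plus-one and minus-one adjustments that inclusive ends require.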

Btrfs: Add basic lockfs calls · ed0dab6b

Committed by Yan on Jan 22, 2008

Signed-off-by: Chris Mason <chris.mason@oracle.com>

ed0dab6b

Btrfs: Add mount -o ssd, which includes optimizations for seek free storage · e18e4809

Committed by Chris Mason on Jan 18, 2008

Signed-off-by: Chris Mason <chris.mason@oracle.com>

e18e4809

Btrfs: Run igrab on data=ordered inodes to prevent deadlocks during writeout · 2da98f00

Committed by Chris Mason on Jan 16, 2008

Signed-off-by: Chris Mason <chris.mason@oracle.com>

2da98f00

Btrfs: Add drop inode func to avoid data=ordered deadlock · 61295eb8

Committed by Chris Mason on Jan 14, 2008

Signed-off-by: Chris Mason <chris.mason@oracle.com>

61295eb8

Btrfs: Add flush barriers on commit · 21ad10cf

Committed by Chris Mason on Jan 09, 2008

Signed-off-by: Chris Mason <chris.mason@oracle.com>

21ad10cf

Btrfs: Add readahead to the online shrinker, and a mount -o alloc_start= for testing · 8f662a76

Committed by Chris Mason on Jan 02, 2008

Signed-off-by: Chris Mason <chris.mason@oracle.com>

8f662a76

Btrfs: Support for online FS resize (grow and shrink) · edbd8d4e

Committed by Chris Mason on Dec 21, 2007

Signed-off-by: Chris Mason <chris.mason@oracle.com>

edbd8d4e

Btrfs: Back port to 2.6.18-el kernels · 6da6abae

Committed by Chris Mason on Dec 18, 2007

Signed-off-by: Chris Mason <chris.mason@oracle.com>

6da6abae

Btrfs: Add mount option to enforce a max extent size · c59f8951

Committed by Chris Mason on Dec 17, 2007

Signed-off-by: Chris Mason <chris.mason@oracle.com>

c59f8951

Btrfs: Add mount option to turn off data cow · be20aa9d

Committed by Chris Mason on Dec 17, 2007

A number of workloads do not require copy on write data or checksumming.
Use mount -o nodatasum to disable checksums, and -o nodatacow to disable
both copy on write and checksumming.

In nodatacow mode, copy on write is still performed when a given extent
is under snapshot.

Signed-off-by: Chris Mason <chris.mason@oracle.com>

be20aa9d
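The interaction between nodatasum, nodatacow and snapshots described above can be written down as two small predicates. This is a sketch of the stated rules only; the flag and function names are hypothetical and do not correspond to kernel identifiers.

```c
/* Hypothetical mount option bits for this example. */
enum {
    OPT_NODATASUM = 1 << 0,
    OPT_NODATACOW = 1 << 1
};

/* Data checksums are off if either option is set, since nodatacow
 * disables checksumming as well as copy on write. */
static int data_checksums_enabled(unsigned int opts)
{
    return !(opts & (OPT_NODATASUM | OPT_NODATACOW));
}

/* Copy on write is always used without nodatacow.  With nodatacow it
 * is still used when the extent is under a snapshot. */
static int must_cow(unsigned int opts, int extent_in_snapshot)
{
    if (!(opts & OPT_NODATACOW))
        return 1;
    return extent_in_snapshot != 0;
}
```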

Btrfs: Add mount -o nodatasum to turn off file data checksumming · b6cda9bc

Committed by Chris Mason on Dec 14, 2007

Signed-off-by: Chris Mason <chris.mason@oracle.com>

b6cda9bc

openanolis / cloud-kernel synced successfully about 1 year ago
