提交 · ecb8bea87d05fd2d1fc0718e1e4bbf09c7c6045a · openeuler / Kernel

29 3月, 2012 10 次提交

Btrfs: fix race between direct io and autodefrag · ecb8bea8

由 Liu Bo 提交于 3月 29, 2012

The bug is from running xfstests 209 with autodefrag.

The race is as follows:
       t1                       t2(autodefrag)
   direct IO
     invalidate pagecache
     dio(old data)             add_inode_defrag
     invalidate pagecache
   endio

   direct IO
     invalidate pagecache
                                run_defrag
                                  readpage(old data)
                                  set page dirty (old data)
     dio(new data, rewrite)
     invalidate pagecache (*)
     endio

t2(autodefrag) will get old data into pagecache via readpage and set
pagecache dirty.  Meanwhile, invalidate pagecache(*) will fail due to
dirty flags in pages.  So the old data may be flushed into disk by
flush thread, which will lead to data loss.

And so does the case of user defragment progs.

The patch fixes this race by holding i_mutex when we readpage and set page dirty.
Signed-off-by: NLiu Bo <liubo2009@cn.fujitsu.com>
Signed-off-by: NMiao Xie <miaox@cn.fujitsu.com>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

ecb8bea8

Btrfs: fix deadlock during allocating chunks · 15d1ff81

由 Liu Bo 提交于 3月 29, 2012

This deadlock comes from xfstests 251.

We'll hold the chunk_mutex throughout the whole of a chunk allocation.
But if we find that we've used up system chunk space, we need to allocate a
new system chunk, but this will lead to a recursion of chunk allocation and end
up with a deadlock on chunk_mutex.
So instead we need to allocate the system chunk first if we find we're in ENOSPC.
Signed-off-by: NLiu Bo <liubo2009@cn.fujitsu.com>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

15d1ff81

Btrfs: show useful info in space reservation tracepoint · 2bcc0328

由 Liu Bo 提交于 3月 29, 2012

o For space info, the type of space info is useful for debug.
o For transaction handle, its transid is useful.
Signed-off-by: NLiu Bo <liubo2009@cn.fujitsu.com>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

2bcc0328

Btrfs: don't use crc items bigger than 4KB · 7ca4be45

由 Chris Mason 提交于 1月 31, 2012

With the big metadata blocks, we can have crc items
that are much bigger than a page.  There are a few
places that we try to kmalloc memory to hold the
items during a split.

Items bigger than 4KB don't really have a huge benefit
in efficiency, but they do trigger larger order allocations.
This commits changes the csums to make sure they stay under
4KB.  This is not a format change, just a #define to limit
huge items.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

7ca4be45

Btrfs: flush out and clean up any block device pages during mount · 3c4bb26b

由 Chris Mason 提交于 3月 27, 2012

Btrfs puts the filesystem metadata into its own address space, and
somehow the block device address space isn't getting onto disk properly
before a mount.  The end result is that a loop of mkfs and mounting the
filesystem will sometimes find stale or incorrect data.

This commit should fix it by sprinkling fdatawrites and invalidate_bdev
calls around.  This is a short term measure to make sure it is fixed.
The block devices really should be flushed and cleaned up higher in the
stack.
Signed-off-by: NChris Mason <chris.mason@oracle.com>

3c4bb26b

C
Merge git://git.jan-o-sch.net/btrfs-unstable into for-linus · 98961a7e
由 Chris Mason 提交于 3月 28, 2012
```
Conflicts:
	fs/btrfs/transaction.c
Signed-off-by: NChris Mason <chris.mason@oracle.com>
```
98961a7e
C

Merge branch 'for-chris' of git://github.com/idryomov/btrfs-unstable into for-linus · 1c691b33
由 Chris Mason 提交于 3月 28, 2012

1c691b33

Merge branch 'error-handling' into for-linus · 1d4284bd

由 Chris Mason 提交于 3月 28, 2012

Conflicts:
	fs/btrfs/ctree.c
	fs/btrfs/disk-io.c
	fs/btrfs/extent-tree.c
	fs/btrfs/extent_io.c
	fs/btrfs/extent_io.h
	fs/btrfs/inode.c
	fs/btrfs/scrub.c
Signed-off-by: NChris Mason <chris.mason@oracle.com>

1d4284bd

btrfs: disallow unequal data/metadata blocksize for mixed block groups · 65139ed9

由 David Sterba 提交于 2月 17, 2012

With support for bigger metadata blocks, we must avoid mounting a
filesystem with different block size for mixed block groups, this causes
corruption (found by xfstests/083).
Signed-off-by: NDavid Sterba <dsterba@suse.cz>

65139ed9

Btrfs: enhance superblock sanity checks · fcd1f065

由 David Sterba 提交于 3月 06, 2012

Validate checksum algorithm during mount and prevent BUG_ON later in
btrfs_super_csum_size.
Signed-off-by: NDavid Sterba <dsterba@suse.cz>

fcd1f065

28 3月, 2012 3 次提交

Btrfs: change scrub to support big blocks · b5d67f64

由 Stefan Behrens 提交于 3月 27, 2012

Scrub used to be coded for nodesize == leafsize == sectorsize == PAGE_SIZE.
This is now changed to support sizes for nodesize and leafsize which are
N * PAGE_SIZE.
Signed-off-by: NStefan Behrens <sbehrens@giantdisaster.de>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

b5d67f64

Btrfs: minor cleanup in scrub · 1623edeb

由 Stefan Behrens 提交于 3月 27, 2012

Just a minor cleanup commit in preparation for the big block changes.
Signed-off-by: NStefan Behrens <sbehrens@giantdisaster.de>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

1623edeb

Btrfs: introduce common define for max number of mirrors · 94598ba8

由 Stefan Behrens 提交于 3月 27, 2012

Readahead already has a define for the max number of mirrors. Scrub
needs such a define now, the rest of the code will need something
like this soon. Therefore the define was added to ctree.h and removed
from the readahead code.
Signed-off-by: NStefan Behrens <sbehrens@giantdisaster.de>
Signed-off-by: NChris Mason <chris.mason@oracle.com>

94598ba8

27 3月, 2012 27 次提交

Btrfs: fix infinite loop in btrfs_shrink_device() · 213e64da