Commits · 4a92b1b8d2810db4ea0c34616b94c0b3810fa027 · xiphi1978 / linux

20 Oct, 2011 7 commits

Btrfs: stop passing a trans handle all around the reservation code · 4a92b1b8

Authored by Josef Bacik on Aug 30, 2011

The only thing that we need to have a trans handle for is in
reserve_metadata_bytes, and that's to know how much flushing we can do. So
instead of passing it around, just check current->journal_info for a
trans_handle so we know if we can commit a transaction to try and free up space
or not. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

4a92b1b8

Btrfs: handle enospc accounting for free space inodes · c09544e0

Authored by Josef Bacik on Aug 30, 2011

Since free space inodes now use normal checksumming we need to make sure to
account for their metadata use. So reserve metadata space, and then if we fail
to write out the metadata we can just release it, otherwise it will be freed up
when the io completes. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

c09544e0

Btrfs: put the block group cache after we commit the super · 300e4f8a

Authored by Josef Bacik on Aug 29, 2011

In moving some enospc stuff around I noticed that when we unmount we are often
evicting the free space cache inodes before we do our last commit. This isn't
bad, but it makes us constantly have to re-read the inodes back. So instead
don't evict the cache until after we do our last commit, this will make things a
little less crappy and makes a future enospc change work properly. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

300e4f8a

Btrfs: fix call to btrfs_search_slot in free space cache · a9b5fcdd

Authored by Josef Bacik on Aug 19, 2011

We are setting ins_len to 1 even though we are just modifying an item that should
be there already. This may cause the search stuff to split nodes on the way
down needlessly. Set this to 0 since we aren't inserting anything. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

a9b5fcdd

Btrfs: allow callers to specify if flushing can occur for btrfs_block_rsv_check · 482e6dc5

Authored by Josef Bacik on Aug 19, 2011

If you run xfstests 224 you will get lots of messages about not being able to
delete inodes and that they will be cleaned up next mount. This is because
btrfs_block_rsv_check was not calling reserve_metadata_bytes with the ability to
flush, so if there was not enough space, it simply failed. But in the truncate and
evict cases we could easily flush space to try and get enough space to do our
work, so make btrfs_block_rsv_check take a flush argument to pass down to
reserve_metadata_bytes. Now xfstests 224 runs fine without all those
complaints. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

482e6dc5

Btrfs: ratelimit the generation printk for the free space cache · 6ab60601

Authored by Josef Bacik on Aug 08, 2011

A user reported getting spammed when moving to 3.0 by this message.  Since we
switched to the normal checksumming infrastructure all old free space caches
will be wrong and need to be regenerated so people are likely to see this
message a lot, so ratelimit it so it doesn't fill up their logs and freak them
out.  Thanks,
Reported-by: Andrew Lutomirski <luto@mit.edu>
Signed-off-by: Josef Bacik <josef@redhat.com>

6ab60601

Btrfs: use bytes_may_use for all ENOSPC reservations · fb25e914

Authored by Josef Bacik on Jul 26, 2011

We have been using bytes_reserved for metadata reservations, which is wrong
since we use that to keep track of outstanding reservations from the allocator.
This resulted in us doing a lot of silly things to make sure we don't allocate a
bunch of metadata chunks since we never had a real view of how much space was
actually in use by metadata.

This passes Arne's enospc test and xfstests as well as my own enospc tests.
Hopefully this will get us moving in the right direction. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

fb25e914

11 Sep, 2011 1 commit

Btrfs: reset to appropriate block rsv after orphan operations · 65450aa6

Authored by Liu Bo on Sep 11, 2011

While truncating the free space cache, we forget to change trans->block_rsv
back to the original one, but leave it with the orphan_block_rsv, and
then with the inode_cache option enabled, it leads to countless warnings from
btrfs_alloc_free_block and btrfs_orphan_commit_root:

WARNING: at fs/btrfs/extent-tree.c:5711 btrfs_alloc_free_block+0x180/0x350 [btrfs]()
...
WARNING: at fs/btrfs/inode.c:2193 btrfs_orphan_commit_root+0xb0/0xc0 [btrfs]()
Signed-off-by: Liu Bo <liubo2009@cn.fujitsu.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

65450aa6

17 Aug, 2011 1 commit

Btrfs: fix wrong free space information · bb3ac5a4

Authored by Miao Xie on Aug 05, 2011

Btrfs subtracted the size of the allocated space twice when it allocated
the space from the bitmap in the cluster. This broke the free space information
and eventually led to an oops.

This patch also fixes the bug that ctl->free_space was subtracted
without the lock held.
Reported-by: Liu Bo <liubo2009@cn.fujitsu.com>
Signed-off-by: Miao Xie <miaox@cn.fujitsu.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

bb3ac5a4

28 Jul, 2011 1 commit

Btrfs: use find_or_create_page instead of grab_cache_page · a94733d0

Authored by Josef Bacik on Jul 11, 2011

grab_cache_page will use mapping_gfp_mask(), which for all inodes is set to
GFP_HIGHUSER_MOVABLE. So instead use find_or_create_page in all cases where we
need GFP_NOFS so we don't deadlock. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

a94733d0

11 Jul, 2011 1 commit

Btrfs: use the normal checksumming infrastructure for free space cache · 2f356126

Authored by Josef Bacik on Jun 10, 2011

We used to store the checksums of the space cache directly in the space cache,
however that doesn't work out too well if we have more space than we can fit the
checksums into the first page. So instead use the normal checksumming
infrastructure. There were problems with doing this originally but those
problems don't exist now so this works out fine. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

2f356126

25 Jun, 2011 1 commit

Btrfs: make sure to update total_bitmaps when freeing cache V3 · 9b90f513

Authored by Josef Bacik on Jun 24, 2011

A user reported this bug again where we have more bitmaps than we are supposed
to. This is because we failed to load the free space cache, but don't update
the ctl->total_bitmaps counter when we remove entries from the tree. This patch
fixes this problem and we should be good to go again. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

9b90f513

11 Jun, 2011 1 commit

Btrfs: make sure to recheck for bitmaps in clusters · 38e87880

Authored by Chris Mason on Jun 10, 2011

Josef recently changed the free extent cache to look in
the block group cluster for any bitmaps before trying to
add a new bitmap for the same offset. This avoids BUG_ON()s due
to covering duplicate ranges.

But it didn't go quite far enough. A given free range might span
between one or more bitmaps or free space entries. The code has
looping to cover this, but it doesn't check for clustered bitmaps
every time.

This shuffles our gotos to check for a bitmap in the cluster
for every new bitmap entry we try to add.
Signed-off-by: Chris Mason <chris.mason@oracle.com>

38e87880

09 Jun, 2011 4 commits

Btrfs: fix duplicate checking logic · f6a39829

Authored by Josef Bacik on Jun 06, 2011

When merging my code into the integration test the second check for duplicate
entries got screwed up. This patch fixes it by dropping ret2 and just using ret
for the return value, and checking if we got an error before adding the bitmap
to the local list. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

f6a39829

Btrfs: fix bitmap regression · 2cdc342c

Authored by Josef Bacik on May 27, 2011

In cleaning up the clustering code I accidentally introduced a regression by
adding bitmap entries to the cluster rb tree. The problem is if we've maxed out
the number of bitmaps we can have for the block group we can only add free space
to the bitmaps, but since the bitmap is on the cluster we can't find it and we
try to create another one. This would result in a panic because the total
bitmaps was bigger than the max bitmaps that were allowed. This patch fixes
this by checking to see if we have a cluster, and then looking at the cluster rb
tree to see if it has a bitmap entry and if it does and that space belongs to
that bitmap, go ahead and add it to that bitmap.

I could hit this panic every time with an fs_mark test within a couple of
minutes. With this patch I no longer hit the panic and fs_mark goes to
completion. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

2cdc342c

Btrfs: noinline the cluster searching functions · 3de85bb9

Authored by Josef Bacik on May 25, 2011

When profiling the find cluster code it's hard to tell where we are spending our
time because the bitmap and non-bitmap functions get inlined by the compiler, so
make that not happen.  Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

3de85bb9

Btrfs: cache bitmaps when searching for a cluster · 86d4a77b

Authored by Josef Bacik on May 25, 2011

If we are looking for a cluster in a particularly sparse or fragmented block
group, we will do a lot of looping through the free space tree looking for
various things, and if we need to look at bitmaps we will end up doing the whole
dance twice. So instead add the bitmap entries to a temporary list so if we
have to do the bitmap search we can just look through the list of entries we've
found quickly instead of having to loop through the entire tree again. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

86d4a77b

04 Jun, 2011 3 commits

btrfs: add helper for fs_info->closing · 7841cb28

Authored by David Sterba on May 31, 2011

Wrap checking of the filesystem 'closing' flag and fix a few missing memory
barriers.
Signed-off-by: David Sterba <dsterba@suse.cz>

7841cb28

Btrfs: add mount -o inode_cache · 4b9465cb

Authored by Chris Mason on Jun 03, 2011

This makes the inode map cache default to off until we
fix the overflow problem when the free space crcs don't fit
inside a single page.
Signed-off-by: Chris Mason <chris.mason@oracle.com>

4b9465cb

Btrfs: make sure we don't overflow the free space cache crc page · 211f96c2

Authored by Chris Mason on Jun 03, 2011

The free space cache uses only one page for crcs right now,
which means we can't have a cache file bigger than the
crcs we can fit in the first page.  This adds a check to
enforce that restriction.
Signed-off-by: Chris Mason <chris.mason@oracle.com>

211f96c2
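
The restriction in 211f96c2 comes down to simple arithmetic: every page of the cache file needs a checksum slot, and all of those slots must fit in the first page. The sketch below is a userspace illustration, not the kernel code. It assumes a 4-byte CRC per page, an 8-byte generation stored alongside the CRCs, and room for at least one cache entry in what remains of the first page; the function name, the layout, and the entry size passed in are all assumptions made for the example.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Returns true if the per-page CRCs (assumed 4 bytes each) plus an assumed
 * 8-byte generation leave room for at least one entry of `entry_size` bytes
 * in the first page of the cache file.
 */
static bool crcs_fit_in_first_page(size_t num_pages, size_t page_size,
                                   size_t entry_size)
{
    size_t header;

    assert(page_size > 0);
    /* Guard against overflow in the multiplication below. */
    if (num_pages > (SIZE_MAX - sizeof(uint64_t)) / sizeof(uint32_t))
        return false;

    header = num_pages * sizeof(uint32_t) + sizeof(uint64_t);
    if (header > page_size)
        return false;
    return page_size - header >= entry_size;
}
```

With a 4096-byte page and a 17-byte entry (both illustrative values), the check passes for 1017 pages and fails for 1018, which shows how quickly a single CRC page caps the size of the cache file.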

24 May, 2011 1 commit

Btrfs: check for duplicate entries in the free space cache · 207dde82

Authored by Josef Bacik on May 13, 2011

If there are duplicate entries in the free space cache, discard the entire cache
and load it the old-fashioned way. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

207dde82
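
The policy in 207dde82 (one duplicate invalidates the whole cache) can be sketched in userspace as follows. This is an illustration only: the struct, the function names, and the choice to define "duplicate" as two entries sharing a starting offset are assumptions for the example, and the kernel detects the condition while inserting into its own tree rather than by sorting an array.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical in-memory form of one cached free-space record. */
struct cache_entry {
    uint64_t offset; /* start of the free range */
    uint64_t bytes;  /* length of the free range */
};

/* qsort comparator: ascending by starting offset. */
static int cmp_offset(const void *a, const void *b)
{
    const struct cache_entry *x = a;
    const struct cache_entry *y = b;

    return (x->offset > y->offset) - (x->offset < y->offset);
}

/*
 * Sorts the entries by offset and reports whether the cache is usable.
 * Returns false if two entries share a starting offset, in which case the
 * caller should discard every loaded entry and rebuild the cache.
 */
static bool cache_is_usable(struct cache_entry *entries, size_t n)
{
    assert(entries != NULL || n == 0);
    if (n > 1)
        qsort(entries, n, sizeof(*entries), cmp_offset);
    for (size_t i = 1; i < n; i++) {
        if (entries[i].offset == entries[i - 1].offset)
            return false;
    }
    return true;
}
```

Rejecting the whole cache rather than dropping only the offending entry is the conservative choice: a duplicate means the on-disk cache cannot be trusted, so nothing read from it should be kept.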

06 May, 2011 1 commit

btrfs: remove all unused functions · f2a97a9d

Authored by David Sterba on May 05, 2011

Remove static and global declarations and/or definitions. Reduces size
of btrfs.ko by ~3.4kB.

  text    data     bss     dec     hex filename
402081    7464     200  409745   64091 btrfs.ko.base
398620    7144     200  405964   631cc btrfs.ko.remove-all
Signed-off-by: David Sterba <dsterba@suse.cz>

f2a97a9d

02 May, 2011 3 commits

btrfs: drop unused parameter from btrfs_release_path · b3b4aa74

Authored by David Sterba on Apr 21, 2011

The tree root parameter has not been used since commit
5f39d397 ("Btrfs: Create extent_buffer
interface for large blocksizes")
Signed-off-by: David Sterba <dsterba@suse.cz>

b3b4aa74

btrfs: make functions static when possible · 62a45b60

Authored by David Sterba on Apr 20, 2011

Signed-off-by: David Sterba <dsterba@suse.cz>

62a45b60

btrfs: remove nested duplicate variable declarations · edc95aec

Authored by David Sterba on Apr 19, 2011

Signed-off-by: David Sterba <dsterba@suse.cz>

edc95aec

26 Apr, 2011 2 commits

Btrfs: free bitmaps properly when evicting the cache · a4f0162f

Authored by Josef Bacik on Apr 25, 2011

If our space cache is wrong, we do the right thing and free up everything that
we loaded, however we don't reset the total_bitmaps counter or the thresholds or
anything. So in btrfs_remove_free_space_cache make sure to call free_bitmap()
if it's a bitmap, this will keep us from panicking when we check to make sure we
don't have too many bitmaps. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

a4f0162f

Btrfs: Free free_space item properly in btrfs_trim_block_group() · f789b684

Authored by Li Zefan on Apr 25, 2011

Since commit dc89e982, we've changed
to use a specific slab for allocation of free_space items.
Signed-off-by: Li Zefan <lizf@cn.fujitsu.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

f789b684

25 Apr, 2011 6 commits

Btrfs: Support reading/writing on disk free ino cache · 82d5902d

Authored by Li Zefan on Apr 20, 2011

This is similar to block group caching.

We dedicate a special inode in the fs tree to save the free ino cache.

The very first time we create/delete a file after mount, the free ino
cache will be loaded from disk into memory. When the fs tree is committed,
the cache will be written back to disk.

To keep compatibility, we check the root generation against the generation
of the special inode when loading the cache, so the loading will fail
if the btrfs filesystem was mounted in an older kernel before.
Signed-off-by: Li Zefan <lizf@cn.fujitsu.com>

82d5902d

Btrfs: Make the code for reading/writing free space cache generic · 0414efae

Authored by Li Zefan on Apr 20, 2011

Extract out block group specific code from lookup_free_space_inode(),
create_free_space_inode(), load_free_space_cache() and
btrfs_write_out_cache(), so the code can be used to read/write
free ino cache.
Signed-off-by: Li Zefan <lizf@cn.fujitsu.com>

0414efae

Btrfs: Cache free inode numbers in memory · 581bb050

Authored by Li Zefan on Apr 20, 2011

Currently btrfs stores the highest objectid of the fs tree, and it always
returns (highest+1) inode number when we create a file, so inode numbers
won't be reclaimed when we delete files, so we'll run out of inode numbers
as we keep creating/deleting files on 32-bit machines.

This fixes it, and it works similarly to how we cache free space in block
groups.

We start a kernel thread to read the file tree. By scanning inode items,
we know which chunks of inode numbers are free, and we cache them in
an rb-tree.

Because we are searching the commit root, we have to carefully handle the
cross-transaction case.

The rb-tree is a hybrid extent+bitmap tree, so if we have too many small
chunks of inode numbers, we'll use bitmaps. Initially we allow 16K of RAM
for extents, and a bitmap will be used if we exceed this threshold. The
extents threshold is adjusted at runtime.
Signed-off-by: Li Zefan <lizf@cn.fujitsu.com>

581bb050
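
The scanning step in 581bb050 (walk the inode items, record the gaps between used inode numbers) can be sketched in userspace as below. The struct and function names are hypothetical, the input is assumed to be a sorted array of unique in-use inode numbers, and the output goes to a flat array rather than the rb-tree the commit describes.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical record for one run of free inode numbers. */
struct ino_range {
    uint64_t start; /* first free inode number in the run */
    uint64_t count; /* how many consecutive numbers are free */
};

/*
 * Walks `used` (sorted ascending, unique, all within [first, last]) and
 * writes the free runs in [first, last] to `out`, up to `out_cap` runs.
 * Returns the number of runs written.
 */
static size_t find_free_ino_ranges(const uint64_t *used, size_t n_used,
                                   uint64_t first, uint64_t last,
                                   struct ino_range *out, size_t out_cap)
{
    size_t n_out = 0;
    uint64_t next = first; /* lowest number not yet classified */

    assert(first <= last && last < UINT64_MAX);
    for (size_t i = 0; i < n_used && n_out < out_cap; i++) {
        assert(used[i] >= next && used[i] <= last);
        if (used[i] > next) {
            /* Gap between the previous used number and this one. */
            out[n_out].start = next;
            out[n_out].count = used[i] - next;
            n_out++;
        }
        next = used[i] + 1;
    }
    /* Trailing free run after the last used number. */
    if (next <= last && n_out < out_cap) {
        out[n_out].start = next;
        out[n_out].count = last - next + 1;
        n_out++;
    }
    return n_out;
}
```

For example, with inode numbers 256, 257 and 260 in use and a search window of 256 to 263, the free runs are 258 to 259 and 261 to 263. Deleting a file would add its number back as a new run, which is what lets numbers be reused instead of always handing out highest+1.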

Btrfs: Make free space cache code generic · 34d52cb6

Authored by Li Zefan on Mar 29, 2011

So we can re-use the code to cache free inode numbers.

The change is quite straightforward. Two new structures are introduced.

- struct btrfs_free_space_ctl

  We move those variables that are used for caching free space from
  struct btrfs_block_group_cache to this new struct.

- struct btrfs_free_space_op

  We do block group specific work (e.g. calculation of extents threshold)
  through functions registered in this struct.

And then we can remove references to struct btrfs_block_group_cache.
Signed-off-by: Li Zefan <lizf@cn.fujitsu.com>

34d52cb6

Btrfs: Use bitmap_set/clear() · f38b6e75

Authored by Li Zefan on Mar 14, 2011

No functional change.
Signed-off-by: Li Zefan <lizf@cn.fujitsu.com>

f38b6e75

Btrfs: Remove unused btrfs_block_group_free_space() · 92c42311

Authored by Li Zefan on Mar 02, 2011

We've already recorded the value in block_group->free_space.
Signed-off-by: Li Zefan <lizf@cn.fujitsu.com>

92c42311

18 Apr, 2011 1 commit

Btrfs: fix free space cache leak · f65647c2

Authored by Chris Mason on Apr 18, 2011

The free space caching code was recently reworked to
cache all the pages it needed instead of using find_get_page everywhere.

One loop was missed though, so it ended up leaking pages.  This fixes
it to use our page array instead of find_get_page.
Signed-off-by: Chris Mason <chris.mason@oracle.com>

f65647c2

09 Apr, 2011 1 commit

Btrfs: deal with the case that we run out of space in the cache · be1a12a0

Authored by Josef Bacik on Apr 06, 2011

Currently we don't handle running out of space in the cache, so to fix this we
keep track of how far in the cache we are. Then we only dirty the pages if we
successfully modify all of them, otherwise if we have an error or run out of
space we can just drop them and not worry about the vm writing them out.
Thanks,

Tested-by: Johannes Hirte <johannes.hirte@fem.tu-ilmenau.de>
Signed-off-by: Josef Bacik <josef@redhat.com>

be1a12a0

05 Apr, 2011 2 commits

Btrfs: fix free space cache when there are pinned extents and clusters V2 · 43be2146

Authored by Josef Bacik on Apr 01, 2011

I noticed a huge problem with the free space cache that was presenting
as an early ENOSPC. Turns out when writing the free space cache out I
forgot to take into account pinned extents and more importantly
clusters. This would result in us leaking free space every time we
unmounted the filesystem and remounted it.

I fix this by making sure to check and see if the current block group
has a cluster and writing out any entries that are in the cluster to the
cache, as well as writing any pinned extents we currently have to the
cache since those will be available for us to use the next time the fs
mounts.

This patch also adds a check to the end of load_free_space_cache to make
sure we got the right amount of free space cache, and if not make sure
to clear the cache and re-cache the old-fashioned way.
Signed-off-by: Josef Bacik <josef@redhat.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

43be2146

btrfs: clear __GFP_FS flag in the space cache inode · adae52b9

Authored by Miao Xie on Mar 31, 2011

The object id of the space cache inode's key is allocated from the relative
root, just like a regular file. So we can't identify the space cache inode by
checking the object id of the inode's key, and we have to clear the __GFP_FS flag
at the time we look up the space cache inode.
Signed-off-by: Miao Xie <miaox@cn.fujitsu.com>
Signed-off-by: Liu Bo <liubo2009@cn.fujitsu.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

adae52b9

28 Mar, 2011 1 commit

Btrfs: add btrfs_trim_fs() to handle FITRIM · f7039b1d

Authored by Li Dongyang on Mar 24, 2011

We take a free extent out of the allocator, trim it, then put it back.
Before we trim the block group, we should make sure the block group is
cached, so this also includes a small change to make cache_block_group() run
without a transaction.
Signed-off-by: Li Dongyang <lidongyang@novell.com>
Signed-off-by: Chris Mason <chris.mason@oracle.com>

f7039b1d

26 Mar, 2011 1 commit

Btrfs: cleanup how we setup free space clusters · 4e69b598

Authored by Josef Bacik on Mar 21, 2011

This patch makes the free space cluster refilling code a little easier to
understand, and fixes some things with the bitmap part of it. Currently we
either want to refill a cluster with

1) All normal extent entries (those without bitmaps)
2) A bitmap entry with enough space

The current code has this ugly jump around logic that will first try and fill up
the cluster with extent entries and then if it can't do that it will try and
find a bitmap to use. So instead split this out into two functions, one that
tries to find only normal entries, and one that tries to find bitmaps.

This also fixes a suboptimal thing we would do with bitmaps. If we used a
bitmap we would just tell the cluster that we were pointing at a bitmap and it
would do the tree search in the block group for that entry every time we tried
to make an allocation. Instead of doing that now we just add it to the clusters
group.

I tested this with my ENOSPC tests and xfstests and it survived.
Signed-off-by: Josef Bacik <josef@redhat.com>

4e69b598

21 Mar, 2011 1 commit

Btrfs: don't be as aggressive about using bitmaps · 32cb0840

Authored by Josef Bacik on Mar 18, 2011

We have been creating bitmaps for small extents unconditionally forever. This
was great when testing to make sure the bitmap stuff was working, but is
overkill normally. So instead of always adding small chunks of free space to
bitmaps, only start doing it if we go past half of our extent threshold. This
will keep us from creating a bitmap for just one small free extent at the front
of the block group, and will make the allocator a little faster as a result.
Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

32cb0840