- 19 March 2014, 16 commits
-
-
Committed by Kent Overstreet
This changes the bucket allocation reserves to use real reserves (separate freelists) instead of watermarks, which if nothing else makes the current code saner to reason about, and which is going to be important in the future when we add support for multiple btrees. It also adds btree_check_reserve(), which checks (and locks) the reserves for both bucket allocation and memory allocation for btree nodes. The old code just assumed that, since it had the root locked (e.g. for btree node splits), no other threads could try to make use of the same reserve. That should technically have been fine for memory allocation, since the btree node cache is preallocated and serves as a reserve, but multiple btrees will mean that locking the root won't be sufficient anymore, and for the bucket allocation reserve it was technically possible for the old code to deadlock. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
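As a rough illustration of the idea (separate freelists per allocation purpose rather than a shared pool with watermarks), the sketch below is hypothetical plain C; the names (enum reserve, struct bucket_freelist, bucket_alloc) are illustrative, not the actual bcache definitions.

#include <stddef.h>

/* Hypothetical sketch: one freelist per allocation purpose instead of
 * watermarks on a shared pool. Names are illustrative, not bcache's. */
enum reserve {
        RESERVE_BTREE,          /* buckets reserved for btree node allocation */
        RESERVE_MOVINGGC,       /* buckets reserved for copying garbage collection */
        RESERVE_NONE,           /* ordinary foreground allocations */
        RESERVE_NR,
};

struct bucket_freelist {
        size_t  nr;             /* buckets currently on this freelist */
        size_t  size;           /* capacity of this freelist */
        long    *buckets;       /* bucket indices */
};

struct allocator {
        struct bucket_freelist reserve[RESERVE_NR];
};

/* Pop a bucket from the freelist matching the caller's purpose; a real
 * implementation would refill the reserves from an allocator thread and
 * make the caller wait when its reserve is empty. */
static long bucket_alloc(struct allocator *a, enum reserve r)
{
        struct bucket_freelist *fl = &a->reserve[r];

        return fl->nr ? fl->buckets[--fl->nr] : -1;
}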
-
Committed by Kent Overstreet
With the locking rework in the last patch, this shouldn't be needed anymore: btree_node_write_work() only takes b->write_lock, which is never held for very long. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
Add a new lock, b->write_lock, which is required to actually modify (or write) a btree node; this lock is only held for short durations. This means we can write out a btree node without taking b->lock, which _is_ held for long durations, solving a deadlock when btree_flush_write() (from the journalling code) is called with a btree node locked. Right now this only occurs in bch_btree_set_root(), but with an upcoming journalling rework it is going to happen a lot more. This also turns b->lock into more of a read/intent lock than a read/write lock, though not completely, since it still blocks readers. We may turn it into a real intent lock at some point in the future. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
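The split in lock roles might look roughly like the following hedged sketch; the struct, field, and function names are simplified illustrations, not the actual bcache code.

#include <linux/mutex.h>
#include <linux/rwsem.h>
#include <linux/types.h>

/* Illustrative only: a long-held node lock for traversal plus a short-held
 * write lock that is all writeout needs. */
struct btree_node_example {
        struct rw_semaphore     lock;           /* long-held: traversals, readers */
        struct mutex            write_lock;     /* short-held: modifying/writing the node */
        bool                    dirty;
};

/* Writing the node out only takes write_lock, so it never waits on the
 * long-held lock above; that is what breaks the btree_flush_write() deadlock. */
static void write_node_example(struct btree_node_example *b)
{
        mutex_lock(&b->write_lock);
        if (b->dirty) {
                /* ... issue the btree node write ... */
                b->dirty = false;
        }
        mutex_unlock(&b->write_lock);
}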
-
Committed by Kent Overstreet
This isn't a bulletproof fix; btree_node_free() -> bch_bucket_free() puts the bucket on the unused freelist, where it can be reused right away without any ordering requirements. It would be better to wait for at least a journal write to go down before reusing the bucket. bch_btree_set_root() does this, and inserting into non-leaf nodes is completely synchronous, so we should be ok, but future patches are going to get rid of the unused freelist entirely; it was needed in the past for various reasons but shouldn't be needed anymore. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
This means the garbage collection code can better check for data and metadata pointers to the same buckets. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
This will potentially save us an allocation when we've got inode/dirent bkeys that don't fit in the keylist's inline keys. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
Break down data into clean data, dirty data, and metadata. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
Change the invalidate tracepoint to indicate how much data we're invalidating, and change the alloc tracepoints to indicate what offset they're for. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
This hasn't been used or even enabled in ages. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Nicholas Swenson
Signed-off-by: Nicholas Swenson <nks@daterainc.com>
-
Committed by Kent Overstreet
Avoid a potential null pointer dereference (e.g. from check keys for cache misses). Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Nicholas Swenson
The deadlock happened because a foreground write slept waiting for a bucket to be allocated. Normally the gc would mark buckets available for invalidation, but the moving_gc was stuck waiting for outstanding writes to complete. Those writes used bcache_wq, the same workqueue foreground writes used. This fix gives moving_gc its own workqueue, so it can still finish moving even if foreground writes are stuck waiting for allocation. It also makes the workqueue a parameter to the data_insert path, so moving_gc can use its own workqueue for writes. Signed-off-by: Nicholas Swenson <nks@daterainc.com> Signed-off-by: Kent Overstreet <kmo@daterainc.com>
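A hedged sketch of the approach (a dedicated workqueue for moving GC, passed into the insert path) follows; moving_gc_wq, init_moving_gc_wq, and queue_insert_work are illustrative names, not the actual bcache symbols.

#include <linux/workqueue.h>
#include <linux/errno.h>

/* Illustrative: moving GC gets its own workqueue so its writes can make
 * progress even when bcache_wq is backed up behind foreground writes that
 * are waiting for bucket allocation. */
static struct workqueue_struct *moving_gc_wq;

static int init_moving_gc_wq(void)
{
        moving_gc_wq = alloc_workqueue("bcache_moving_gc", WQ_MEM_RECLAIM, 0);
        return moving_gc_wq ? 0 : -ENOMEM;
}

/* The data insert path takes the workqueue as a parameter, so moving GC can
 * queue its completion work here instead of on the shared bcache_wq. */
static void queue_insert_work(struct workqueue_struct *wq, struct work_struct *w)
{
        queue_work(wq, w);
}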
-
Committed by Kent Overstreet
blk_stack_limits() doesn't like a discard granularity of 0. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
The on-disk bucket gens are allowed to be out of date when we reuse buckets that didn't have any live data in them. To deal with this, the initial gc has to update the bucket gen when we find a pointer gen newer than the bucket's gen. Unfortunately we weren't doing this for pointers in the journal that we're about to replay. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
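The rule being applied is roughly the following; this is a hedged sketch with illustrative types and names (struct bucket_example, fixup_bucket_gen), not the actual bcache code.

#include <linux/types.h>

/* Illustrative stand-in for per-bucket metadata. */
struct bucket_example {
        u8 gen;
};

/* If a pointer's generation is newer than the (possibly stale) on-disk
 * bucket generation, bump the bucket gen; this must also be done for
 * pointers found in journal entries that are about to be replayed. */
static void fixup_bucket_gen(struct bucket_example *g, u8 ptr_gen)
{
        /* 8-bit generations wrap, so compare via signed distance. */
        if ((s8)(ptr_gen - g->gen) > 0)
                g->gen = ptr_gen;
}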
-
Committed by Kent Overstreet
The code to fix up incorrect bucket prios incorrectly did not skip btree node freeing keys. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
On recovery we weren't correctly keeping track of which journal buckets had open journal entries, so it was possible for them to be overwritten until we'd written all new journal entries. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
- 18 March 2014, 2 commits
-
-
Committed by Kent Overstreet
Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
- 26 February 2014, 2 commits
-
-
Committed by Kent Overstreet
Shutdown wasn't cancelling/waiting on journal_write_work(). Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
The code was using sectors to count the number of sectors it was zeroing... but then it passed it to bio_advance()... after it had been set to 0. Amusing... Signed-off-by: Kent Overstreet <kmo@daterainc.com>
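The general bug class might look like the hedged sketch below; zero_and_advance and its chunking are illustrative, not the actual bcache code.

#include <linux/bio.h>

/* Illustrative only: a sector count consumed down to zero in a loop must be
 * saved first if it is still needed afterwards (e.g. for bio_advance()). */
static void zero_and_advance(struct bio *bio, unsigned int sectors)
{
        unsigned int done = sectors;    /* save the original count */

        while (sectors) {
                unsigned int chunk = sectors < 8 ? sectors : 8;

                /* ... zero out `chunk` sectors of the bio here ... */
                sectors -= chunk;
        }

        /* Passing `sectors` here would advance by 0 bytes, since the loop
         * already counted it down; use the saved total instead. */
        bio_advance(bio, done << 9);
}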
-
- 19 February 2014, 1 commit
-
-
Committed by Kent Overstreet
Use a bigger hammer this time. Signed-off-by: Kent Overstreet <kmo@daterainc.com> Cc: linux-stable <stable@vger.kernel.org>
-
- 11 February 2014, 1 commit
-
-
Committed by Geert Uytterhoeven
drivers/md/bcache/extents.c: In function `btree_ptr_bad_expensive': drivers/md/bcache/extents.c:196: warning: format `%li' expects type `long int', but argument 4 has type `size_t' Signed-off-by: Geert Uytterhoeven <geert@linux-m68k.org> Cc: Kent Overstreet <kmo@daterainc.com> Cc: Neil Brown <neilb@suse.de> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
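For context, this warning class and its usual fix look roughly like the hedged sketch below; report_bad_keys is an illustrative name, not the bcache function, and the actual patch may instead have cast the argument.

#include <linux/printk.h>
#include <linux/types.h>

/* Illustrative: a size_t argument needs the %zu conversion specifier;
 * %li expects long int and only happens to match on some architectures. */
static void report_bad_keys(size_t nkeys)
{
        /* pr_err("%li bad keys\n", nkeys);   <- mismatched format/argument */
        pr_err("%zu bad keys\n", nkeys);      /* %zu matches size_t everywhere */
}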
-
- 30 January 2014, 3 commits
-
-
Committed by Nicholas Swenson
Signed-off-by: Nicholas Swenson <nks@daterainc.com>
-
Committed by Kent Overstreet
Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Darrick J. Wong
The BUG_ON at the end of __bch_btree_mark_key can be triggered due to an integer overflow error: BITMASK(GC_SECTORS_USED, struct bucket, gc_mark, 2, 13); ... SET_GC_SECTORS_USED(g, min_t(unsigned, GC_SECTORS_USED(g) + KEY_SIZE(k), (1 << 14) - 1)); BUG_ON(!GC_SECTORS_USED(g)); In bcache.h, the SECTORS_USED bitfield is defined to be 13 bits wide. While the SET_ code tries to ensure that the field doesn't overflow by clamping it to (1 << 14) - 1 == 16383, this is incorrect because 16383 requires 14 bits. Therefore, if GC_SECTORS_USED() + KEY_SIZE() = 8192, the SET_ statement tries to store 8192 into a 13-bit field. In a 13-bit field, 8192 becomes zero, thus triggering the BUG_ON. Therefore, create a field width constant and a max value constant, and use those to create the bitfield and check the inputs to SET_GC_SECTORS_USED. Arguably the BITMASK() template ought to have BUG_ON checks for too-large values, but that's a separate patch. Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
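The shape of the fix, as described, is roughly the hedged sketch below; it mirrors the fragment quoted in the message, with the constant names being illustrative rather than the exact bcache definitions.

/* A width constant and a max derived from it, so the clamp can never exceed
 * what the 13-bit field can actually hold (8191, not 16383). */
#define GC_SECTORS_USED_SIZE    13
#define MAX_GC_SECTORS_USED     ((1U << GC_SECTORS_USED_SIZE) - 1)

/* Declare the bitfield from the same width constant... */
/* BITMASK(GC_SECTORS_USED, struct bucket, gc_mark, 2, GC_SECTORS_USED_SIZE); */

/* ...and clamp updates against the true maximum, so the stored value can
 * never wrap to zero and trip BUG_ON(!GC_SECTORS_USED(g)): */
/* SET_GC_SECTORS_USED(g, min_t(unsigned, GC_SECTORS_USED(g) + KEY_SIZE(k),
 *                            MAX_GC_SECTORS_USED)); */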
-
- 13 January 2014, 1 commit
-
-
Committed by Hugh Dickins
Trivial: remove the few stray references to css_id, which itself was removed in v3.13's 2ff2a7d0 "cgroup: kill css_id". Signed-off-by: Hugh Dickins <hughd@google.com> Signed-off-by: Tejun Heo <tj@kernel.org>
-
- 09 January 2014, 14 commits
-
-
Committed by Kent Overstreet
Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
We need to return -EINTR after a split because we invalidated iterators (and freed the btree node), but if we were finished inserting, we don't want to redo the traversal. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
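As a hedged illustration of the control flow described (not the actual bcache function; insert_result_example and its flags are made up for the example):

#include <linux/errno.h>
#include <linux/types.h>

/* After a split the old node has been freed and iterators into it are
 * invalid, so the caller must re-traverse; but if the insert already
 * finished, there is nothing left to redo. */
static int insert_result_example(bool did_split, bool insert_done)
{
        if (did_split && !insert_done)
                return -EINTR;  /* caller should redo the traversal */

        return 0;
}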
-
Committed by Kent Overstreet
When deciding what order to reuse buckets in, we take into account both the bucket's priority (which indicates LRU order) and the amount of live data in that bucket. The way they were scaled together wasn't as correct as it could be... this patch improves and documents it. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
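One way such a combined heuristic could look is the hedged sketch below; this is purely illustrative (bucket_reuse_score is not the actual bcache formula or symbol).

/* Illustrative scoring only: colder buckets (lower priority relative to the
 * oldest) with less live data should be invalidated first, so fold both
 * factors into one comparable score; lower scores get reused first. */
static unsigned long bucket_reuse_score(unsigned int prio,
                                        unsigned int min_prio,
                                        unsigned int live_sectors)
{
        return (unsigned long)(prio - min_prio) * (live_sectors + 1);
}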
-
Committed by Nicholas Swenson
Checks if two keys have equivalent header fields (good enough for replacement or merging). Used in bch_bkey_try_merge, and when replacing a key in the btree. Signed-off-by: Nicholas Swenson <nks@daterainc.com> Signed-off-by: Kent Overstreet <kmo@daterainc.com>
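A hedged sketch of what such a header comparison might look like follows; the function name is illustrative, and the KEY_* accessors are assumed to be the usual bcache key macros.

/* Illustrative: two keys' headers are "equivalent" if they describe the same
 * inode with the same number of pointers and the same checksum setting, which
 * is enough to allow replacing or merging their values. */
static bool bkey_equal_header_example(const struct bkey *l, const struct bkey *r)
{
        return KEY_INODE(l) == KEY_INODE(r) &&
               KEY_PTRS(l)  == KEY_PTRS(r)  &&
               KEY_CSUM(l)  == KEY_CSUM(r);
}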
-
Committed by Nicholas Swenson
Added generic header checks to bch_bkey_try_merge, which then calls the bkey-specific function. Removed extraneous checks from bch_extent_merge. Signed-off-by: Nicholas Swenson <nks@daterainc.com>
-
Committed by Kent Overstreet
Now handling overlapping extents/keys is a method that's specific to what the btree node contains. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
More work to disentangle various code from struct btree. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
More work to disentangle various code from struct btree. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
More work to disentangle bset.c from struct btree. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
We're in the process of turning bset.c into library code, so none of the code in that file should know about struct cache_set or struct btree; so, move the btree traversal part of the stats code to sysfs.c. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
Helper function to explicitly check how much space is free in a btree node. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
Soon, bset.c won't need to depend on struct btree. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-
Committed by Kent Overstreet
More work to disentangle bset.c from the rest of the code. Signed-off-by: Kent Overstreet <kmo@daterainc.com>
-