Commits · 0a63b66db566cffdf90182eb6e66fdd4d0479e63 · openeuler / Kernel

19 Mar, 2014 5 commits

bcache: Rework btree cache reserve handling · 0a63b66d

Committed by Kent Overstreet on Mar 17, 2014

This changes the bucket allocation reserves to use _real_ reserves - separate
freelists - instead of watermarks, which if nothing else makes the current code
saner to reason about and is going to be important in the future when we add
support for multiple btrees.

It also adds btree_check_reserve(), which checks (and locks) the reserves for
both bucket allocation and memory allocation for btree nodes. The old code
assumed that because it had the root locked (e.g. for btree node splits), no
other thread could try to use the same reserve. That should have been fine for
memory allocation, where a reserve always exists (the btree node cache is
preallocated and used as the reserve), but with multiple btrees locking the root
won't be sufficient anymore, and for the bucket allocation reserve it was
technically possible for the old code to deadlock.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>
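
As a rough illustration of the difference between watermarks and separate freelists, the userspace sketch below keeps one small freelist per reserve. All names here (`enum reserve`, `struct allocator`, `allocator_push`, `allocator_pop`, `FREELIST_CAP`) are hypothetical and are not taken from the bcache source. Locking, which the message says btree_check_reserve() handles, is left out.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical reserve classes; RESERVE_NONE is the general pool. */
enum reserve { RESERVE_BTREE, RESERVE_GC, RESERVE_NONE, RESERVE_NR };

#define FREELIST_CAP 8

/* One fixed-size stack of free bucket numbers per reserve. */
struct freelist {
	long buckets[FREELIST_CAP];
	size_t nr;
};

struct allocator {
	struct freelist free[RESERVE_NR];
};

/*
 * Refill path: give a freed bucket to the first freelist that still has
 * room, so the dedicated reserves fill up before the general pool.
 * Returns 1 on success, 0 if every freelist is full.
 */
static int allocator_push(struct allocator *a, long bucket)
{
	for (int r = 0; r < RESERVE_NR; r++) {
		struct freelist *f = &a->free[r];

		if (f->nr < FREELIST_CAP) {
			f->buckets[f->nr++] = bucket;
			return 1;
		}
	}
	return 0;
}

/*
 * Allocation path: take from the general pool first and fall back only
 * to the caller's own reserve. Returns -1 if both are empty.
 */
static long allocator_pop(struct allocator *a, enum reserve r)
{
	struct freelist *f = &a->free[RESERVE_NONE];

	assert(r < RESERVE_NR);
	if (!f->nr)
		f = &a->free[r];
	if (!f->nr)
		return -1;
	return f->buckets[--f->nr];
}
```

With this layout a caller asking for `RESERVE_NONE` can never drain buckets set aside for btree nodes, whereas a single shared freelist relies on every caller comparing the fill level against the right watermark.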

bcache: Kill btree_io_wq · 56b30770

Committed by Kent Overstreet on Jan 23, 2014

With the locking rework in the last patch, this shouldn't be needed anymore -
btree_node_write_work() only takes b->write_lock which is never held for very
long.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: btree locking rework · 2a285686

Committed by Kent Overstreet on Mar 04, 2014

Add a new lock, b->write_lock, which is required to actually modify - or write -
a btree node; this lock is only held for short durations.

This means we can write out a btree node without taking b->lock, which _is_ held
for long durations - solving a deadlock when btree_flush_write() (from the
journalling code) is called with a btree node locked.

Right now this only occurs in bch_btree_set_root(), but with an upcoming
journalling rework it is going to happen a lot more.

This also turns b->lock into more of a read/intent lock instead of a
read/write lock - but not completely, since it still blocks readers. It may be
turned into a real intent lock at some point in the future.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Fix moving_gc deadlocking with a foreground write · da415a09

Committed by Nicholas Swenson on Jan 09, 2014

Deadlock happened because a foreground write slept, waiting for a bucket
to be allocated. Normally the gc would mark buckets available for invalidation.
But the moving_gc was stuck waiting for outstanding writes to complete.
These writes used the bcache_wq, the same queue foreground writes used.

This fix gives moving_gc its own work queue, so it can still finish moving
even if foreground writes are stuck waiting for allocation. It also makes
the work queue a parameter to the data_insert path, so moving_gc can use its
own workqueue for writes.
Signed-off-by: Nicholas Swenson <nks@daterainc.com>
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Fix discard granularity · 90db6919

Committed by Kent Overstreet on Feb 10, 2014

blk_stack_limits() doesn't like a discard granularity of 0.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

18 Mar, 2014 1 commit

bcache: Fix a lockdep splat in an error path · 4fa03402

Committed by Kent Overstreet on Mar 17, 2014

Signed-off-by: Kent Overstreet <kmo@daterainc.com>

26 Feb, 2014 1 commit

bcache: Fix a shutdown bug · dabb4433

Committed by Kent Overstreet on Feb 19, 2014

Shutdown wasn't cancelling/waiting on journal_write_work()
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

09 Jan, 2014 9 commits

bcache: Convert debug code to btree_keys · dc9d98d6

Committed by Kent Overstreet on Dec 17, 2013

More work to disentangle various code from struct btree
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Abstract out stuff needed for sorting · 65d45231

Committed by Kent Overstreet on Dec 20, 2013

Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Rename/shuffle various code around · ee811287

Committed by Kent Overstreet on Dec 17, 2013

More work to disentangle bset.c from the rest of the code.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Add struct bset_sort_state · 67539e85

Committed by Kent Overstreet on Sep 10, 2013

More disentangling bset.c from the rest of the bcache code - soon, the
sorting routines won't have any dependencies on any outside structs.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Use a mempool for mergesort temporary space · 0a451145

Committed by Kent Overstreet on Dec 18, 2013

It was a single element mempool before; it's slightly cleaner to just use a real
mempool.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Trivial error handling fix · 5c41c8a7

Committed by Kent Overstreet on Jul 08, 2013

Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache/md: Use raid stripe size · c78afc62

Committed by Kent Overstreet on Jul 11, 2013

Now that we've got code for raid5/6 stripe awareness, bcache just needs
to know about the stripes and when writing partial stripes is expensive
- we probably don't want to enable this optimization for raid1 or 10,
even though they have stripes. So add a flag to queue_limits.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Rework allocator reserves · 78365411

Committed by Kent Overstreet on Dec 17, 2013

We need a reserve for allocating buckets for new btree nodes - and now that
we've got multiple btrees, it really needs to be per btree.

This reworks the reserves so we've got separate freelists for each reserve
instead of watermarks, which seems to make things a bit cleaner, and it adds
some code so that btree_split() can make sure the reserve is available before it
starts.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: kill closure locking usage · cb7a583e

Committed by Kent Overstreet on Dec 16, 2013

Signed-off-by: Kent Overstreet <kmo@daterainc.com>

17 Dec, 2013 1 commit

bcache: Fix for can_attach_cache() · 9eb8ebeb

Committed by Nicholas Swenson on Oct 22, 2013

Signed-off-by: Nicholas Swenson <nks@daterainc.com>
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

24 Nov, 2013 2 commits

block: Abstract out bvec iterator · 4f024f37

Committed by Kent Overstreet on Oct 11, 2013

Immutable biovecs are going to require an explicit iterator. To
implement immutable bvecs, a later patch is going to add a bi_bvec_done
member to this struct; for now, this patch effectively just renames
things.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>
Cc: Jens Axboe <axboe@kernel.dk>
Cc: Geert Uytterhoeven <geert@linux-m68k.org>
Cc: Benjamin Herrenschmidt <benh@kernel.crashing.org>
Cc: Paul Mackerras <paulus@samba.org>
Cc: "Ed L. Cashin" <ecashin@coraid.com>
Cc: Nick Piggin <npiggin@kernel.dk>
Cc: Lars Ellenberg <drbd-dev@lists.linbit.com>
Cc: Jiri Kosina <jkosina@suse.cz>
Cc: Matthew Wilcox <willy@linux.intel.com>
Cc: Geoff Levand <geoff@infradead.org>
Cc: Yehuda Sadeh <yehuda@inktank.com>
Cc: Sage Weil <sage@inktank.com>
Cc: Alex Elder <elder@inktank.com>
Cc: ceph-devel@vger.kernel.org
Cc: Joshua Morris <josh.h.morris@us.ibm.com>
Cc: Philip Kelleher <pjk1939@linux.vnet.ibm.com>
Cc: Rusty Russell <rusty@rustcorp.com.au>
Cc: "Michael S. Tsirkin" <mst@redhat.com>
Cc: Konrad Rzeszutek Wilk <konrad.wilk@oracle.com>
Cc: Jeremy Fitzhardinge <jeremy@goop.org>
Cc: Neil Brown <neilb@suse.de>
Cc: Alasdair Kergon <agk@redhat.com>
Cc: Mike Snitzer <snitzer@redhat.com>
Cc: dm-devel@redhat.com
Cc: Martin Schwidefsky <schwidefsky@de.ibm.com>
Cc: Heiko Carstens <heiko.carstens@de.ibm.com>
Cc: linux390@de.ibm.com
Cc: Boaz Harrosh <bharrosh@panasas.com>
Cc: Benny Halevy <bhalevy@tonian.com>
Cc: "James E.J. Bottomley" <JBottomley@parallels.com>
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: "Nicholas A. Bellinger" <nab@linux-iscsi.org>
Cc: Alexander Viro <viro@zeniv.linux.org.uk>
Cc: Chris Mason <chris.mason@fusionio.com>
Cc: "Theodore Ts'o" <tytso@mit.edu>
Cc: Andreas Dilger <adilger.kernel@dilger.ca>
Cc: Jaegeuk Kim <jaegeuk.kim@samsung.com>
Cc: Steven Whitehouse <swhiteho@redhat.com>
Cc: Dave Kleikamp <shaggy@kernel.org>
Cc: Joern Engel <joern@logfs.org>
Cc: Prasad Joshi <prasadjoshi.linux@gmail.com>
Cc: Trond Myklebust <Trond.Myklebust@netapp.com>
Cc: KONISHI Ryusuke <konishi.ryusuke@lab.ntt.co.jp>
Cc: Mark Fasheh <mfasheh@suse.com>
Cc: Joel Becker <jlbec@evilplan.org>
Cc: Ben Myers <bpm@sgi.com>
Cc: xfs@oss.sgi.com
Cc: Steven Rostedt <rostedt@goodmis.org>
Cc: Frederic Weisbecker <fweisbec@gmail.com>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Len Brown <len.brown@intel.com>
Cc: Pavel Machek <pavel@ucw.cz>
Cc: "Rafael J. Wysocki" <rjw@sisk.pl>
Cc: Herton Ronaldo Krzesinski <herton.krzesinski@canonical.com>
Cc: Ben Hutchings <ben@decadent.org.uk>
Cc: Andrew Morton <akpm@linux-foundation.org>
Cc: Guo Chao <yan@linux.vnet.ibm.com>
Cc: Tejun Heo <tj@kernel.org>
Cc: Asai Thambi S P <asamymuthupa@micron.com>
Cc: Selvan Mani <smani@micron.com>
Cc: Sam Bradshaw <sbradshaw@micron.com>
Cc: Wei Yongjun <yongjun_wei@trendmicro.com.cn>
Cc: "Roger Pau Monné" <roger.pau@citrix.com>
Cc: Jan Beulich <jbeulich@suse.com>
Cc: Stefano Stabellini <stefano.stabellini@eu.citrix.com>
Cc: Ian Campbell <Ian.Campbell@citrix.com>
Cc: Sebastian Ott <sebott@linux.vnet.ibm.com>
Cc: Christian Borntraeger <borntraeger@de.ibm.com>
Cc: Minchan Kim <minchan@kernel.org>
Cc: Jiang Liu <jiang.liu@huawei.com>
Cc: Nitin Gupta <ngupta@vflare.org>
Cc: Jerome Marchand <jmarchand@redhat.com>
Cc: Joe Perches <joe@perches.com>
Cc: Peng Tao <tao.peng@emc.com>
Cc: Andy Adamson <andros@netapp.com>
Cc: fanchaoting <fanchaoting@cn.fujitsu.com>
Cc: Jie Liu <jeff.liu@oracle.com>
Cc: Sunil Mushran <sunil.mushran@gmail.com>
Cc: "Martin K. Petersen" <martin.petersen@oracle.com>
Cc: Namjae Jeon <namjae.jeon@samsung.com>
Cc: Pankaj Kumar <pankaj.km@samsung.com>
Cc: Dan Magenheimer <dan.magenheimer@oracle.com>
Cc: Mel Gorman <mgorman@suse.de>

bcache: Kill unaligned bvec hack · ed9c47be

Committed by Kent Overstreet on Nov 22, 2013

Bcache has a hack to avoid cloning the biovec if it's all full pages -
but with immutable biovecs coming this won't be necessary anymore.

For now, we remove the special case and always clone the bvec array so
that the immutable biovec patches are simpler.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

11 Nov, 2013 18 commits

bcache: defensively handle format strings · c8694948

Committed by Kees Cook on Sep 10, 2013

Just to be safe, call the error reporting function with "%s" to avoid
any possible future format string leak.
Signed-off-by: Kees Cook <keescook@chromium.org>
Signed-off-by: Kent Overstreet <kmo@daterainc.com>
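
The pattern this message describes can be sketched in plain userspace C. The reporter below, `report_error()`, and its wrapper `report_message()` are hypothetical stand-ins rather than the bcache functions; they only show why passing the text as an argument to "%s" is safer than passing it as the format itself.

```c
#include <assert.h>
#include <stdarg.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical printf-style error reporter that formats into buf. */
static int report_error(char *buf, size_t len, const char *fmt, ...)
{
	va_list args;
	int n;

	va_start(args, fmt);
	n = vsnprintf(buf, len, fmt, args);
	va_end(args);
	return n;
}

/*
 * Defensive wrapper: msg is passed as data for "%s", so any '%'
 * characters it contains are copied verbatim instead of being
 * interpreted as conversion specifiers.
 */
static int report_message(char *buf, size_t len, const char *msg)
{
	assert(buf && msg);
	return report_error(buf, len, "%s", msg);
}
```

Calling `report_error(buf, len, msg)` directly would instead treat any conversion specifier inside `msg` as a request to read arguments that were never passed.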

bcache: Use ida for bcache block dev minor · 28935ab5

Committed by Kent Overstreet on Jul 31, 2013

Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Fix sysfs splat on shutdown with flash only devs · c4d951dd

Committed by Kent Overstreet on Aug 21, 2013

Whoops.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Better full stripe scanning · 48a915a8

Committed by Kent Overstreet on Oct 31, 2013

The old scanning-by-stripe code burned too much CPU; this should be
better.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Move spinlock into struct time_stats · 65d22e91

Committed by Kent Overstreet on Jul 31, 2013

Minor cleanup.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Kill sequential_merge option · 8aee1220

Committed by Kent Overstreet on Jul 30, 2013

It never really made sense to expose this, so just kill it.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Avoid deadlocking in garbage collection · bc9389ee

Committed by Kent Overstreet on Sep 10, 2013

Not a complete fix - we could still deadlock if btree_insert_node() has
to split...
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: bch_(btree|extent)_ptr_invalid() · d5cc66e9

Committed by Kent Overstreet on Jul 24, 2013

Trying to treat btree pointers and leaf node pointers the same way was a
mistake - going to start being more explicit about the type of
key/pointer we're dealing with. This is the first part of that
refactoring; this patch shouldn't change any actual behaviour.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Don't bother with bucket refcount for btree node allocations · 3a3b6a4e

Committed by Kent Overstreet on Jul 24, 2013

The bucket refcount (dropped with bkey_put()) is only needed to prevent
the newly allocated bucket from being garbage collected until we've
added a pointer to it somewhere. But for btree node allocations, the
fact that we have btree nodes locked is enough to guard against races
with garbage collection.

Eventually the per bucket refcount is going to be replaced with
something specific to bch_alloc_sectors().
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Pull on disk data structures out into a separate header · 81ab4190

Committed by Kent Overstreet on Oct 31, 2013

Now, the on disk data structures are in a header that can be exported to
userspace - and having them all centralized is nice too.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Prune struct btree_op · c18536a7

Committed by Kent Overstreet on Jul 24, 2013

Eventual goal is for struct btree_op to contain only what is necessary
for traversing the btree.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Convert writeback to a kthread · 5e6926da

Committed by Kent Overstreet on Jul 24, 2013

This simplifies the writeback flow control quite a bit - previously, it
was conceptually two coroutines, refill_dirty() and read_dirty(). This
makes the code quite a bit more straightforward.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Convert gc to a kthread · 72a44517

Committed by Kent Overstreet on Oct 24, 2013

We needed a dedicated rescuer workqueue for gc anyways... and gc was
conceptually a dedicated thread, just one that wasn't running all the
time. Switch it to a dedicated thread to make the code a bit more
straightforward.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Convert bucket_wait to wait_queue_head_t · 35fcd848

Committed by Kent Overstreet on Jul 24, 2013

At one point we did do fancy asynchronous waiting stuff with
bucket_wait, but that's all gone (and bucket_wait is used a lot less
than it used to be). So use the standard primitives.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Convert try_wait to wait_queue_head_t · e8e1d468

Committed by Kent Overstreet on Jul 24, 2013

We never waited on c->try_wait asynchronously, so just use the standard
primitives.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Stripe size isn't necessarily a power of two · 2d679fc7

Committed by Kent Overstreet on Aug 17, 2013

Originally I got this right... except that the divides didn't use
do_div(), which broke 32 bit kernels. When I went to fix that, I forgot
that the raid stripe size usually isn't a power of two... doh
Signed-off-by: Kent Overstreet <kmo@daterainc.com>
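
To make the power-of-two pitfall concrete, here is a small userspace sketch. The helper names (`stripe_index`, `stripe_offset`, `stripe_offset_pow2`) are hypothetical and not from bcache. It uses the plain C `/` and `%` operators; kernel code that must build for 32-bit targets would go through a helper such as do_div() for the 64-bit division, as the message notes.

```c
#include <assert.h>
#include <stdint.h>

/* Which stripe a sector falls in; valid for any stripe size > 0. */
static uint64_t stripe_index(uint64_t sector, uint32_t stripe_size)
{
	assert(stripe_size > 0);
	return sector / stripe_size;
}

/* Offset of a sector within its stripe; valid for any stripe size > 0. */
static uint64_t stripe_offset(uint64_t sector, uint32_t stripe_size)
{
	assert(stripe_size > 0);
	return sector % stripe_size;
}

/*
 * Mask-based shortcut: only equals stripe_offset() when stripe_size is
 * a power of two. With other sizes it silently returns a wrong value.
 */
static uint64_t stripe_offset_pow2(uint64_t sector, uint32_t stripe_size)
{
	assert(stripe_size > 0);
	return sector & (uint64_t)(stripe_size - 1);
}
```

For example, with a 384-sector stripe (an assumed three data disks at 128 sectors each), sector 1000 lies at offset 232 in stripe 2, while the mask shortcut yields 360.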

bcache: Add on error panic/unregister setting · 77c320eb

Committed by Kent Overstreet on Jul 11, 2013

Works kind of like the ext4 setting, to panic or remount read only on
errors.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Use blkdev_issue_discard() · 49b1212d

Committed by Kent Overstreet on Jul 24, 2013

The old asynchronous discard code was really a relic from when all the
allocation code was asynchronous - now that allocation runs out of a
dedicated thread there's no point in keeping around all that complicated
machinery.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

12 Jul, 2013 3 commits

bcache: Allocation kthread fixes · 79826c35

Committed by Kent Overstreet on Jul 10, 2013

The alloc kthread should've been using try_to_freeze() - and also there
was the potential for the alloc kthread to get woken up after it had
shut down, which would have been bad.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>

bcache: Shutdown fix · 5caa52af

Committed by Kent Overstreet on Jul 10, 2013

Stopping a cache set is supposed to make it stop attached backing
devices, but somewhere along the way that code got lost. Fixing this
mainly has the effect of fixing our reboot notifier.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>
Cc: linux-stable <stable@vger.kernel.org> # >= v3.10

bcache: Fix a sysfs splat on shutdown · c9502ea4

Committed by Kent Overstreet on Jul 10, 2013

If we stopped a bcache device when we were already detaching (or
something like that), bcache_device_unlink() would try to remove a
symlink from sysfs that was already gone because the bcache dev kobject
had already been removed from sysfs.

So keep track of whether we've removed stuff from sysfs.
Signed-off-by: Kent Overstreet <kmo@daterainc.com>
Cc: linux-stable <stable@vger.kernel.org> # >= v3.10
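
The "keep track of whether we've removed stuff" idea amounts to making the unlink step idempotent. A minimal single-threaded sketch follows, with a hypothetical `struct fake_dev` and a counter standing in for the real sysfs removal. It is not the bcache implementation, and a concurrent version would need an atomic test-and-set rather than a plain bool.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical device state; 'removals' stands in for the sysfs calls. */
struct fake_dev {
	bool unlink_done;	/* set once the links have been removed */
	int removals;		/* how many times removal actually ran */
};

/*
 * Remove the device's links at most once. Returns true if this call
 * did the removal, false if it had already been done earlier.
 */
static bool fake_dev_unlink(struct fake_dev *d)
{
	assert(d);
	if (d->unlink_done)
		return false;

	d->unlink_done = true;
	d->removals++;		/* placeholder for removing the symlink */
	return true;
}
```

A second call, for instance from a stop path that races with a detach already in progress, becomes a no-op instead of touching an entry that is already gone.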

openeuler / Kernel: last synced successfully more than 1 year ago
