提交 · 65d45231b56efb3db51eb441e2c68f8252ecdd12 · openanolis / cloud-kernel

09 1月, 2014 7 次提交

K
bcache: Abstract out stuff needed for sorting · 65d45231
由 Kent Overstreet 提交于 12月 20, 2013
```
Signed-off-by: NKent Overstreet <kmo@daterainc.com>
```
65d45231

bcache: Rename/shuffle various code around · ee811287

由 Kent Overstreet 提交于 12月 17, 2013

More work to disentangle bset.c from the rest of the code:
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

ee811287

bcache: Split out sort_extent_cmp() · 911c9610

由 Kent Overstreet 提交于 7月 28, 2013

Only use extent comparison for comparing extents, so we're not using
START_KEY() on other key types (i.e. btree pointers)
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

911c9610

bcache: Btree verify code improvements · 78b77bf8

由 Kent Overstreet 提交于 12月 17, 2013

Used this fixed code to find and fix the bug fixed by
a4d885097b0ac0cd1337f171f2d4b83e946094d4.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

78b77bf8

bcache: kill index() · 88b9f8c4

由 Kent Overstreet 提交于 12月 17, 2013

That was a terrible name for a macro, add some better helpers to replace it.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

88b9f8c4

bcache: Rework allocator reserves · 78365411

由 Kent Overstreet 提交于 12月 17, 2013

We need a reserve for allocating buckets for new btree nodes - and now that
we've got multiple btrees, it really needs to be per btree.

This reworks the reserves so we've got separate freelists for each reserve
instead of watermarks, which seems to make things a bit cleaner, and it adds
some code so that btree_split() can make sure the reserve is available before it
starts.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

78365411

K
bcache: kill closure locking usage · cb7a583e
由 Kent Overstreet 提交于 12月 16, 2013
```
Signed-off-by: NKent Overstreet <kmo@daterainc.com>
```
cb7a583e

11 11月, 2013 21 次提交

bcache: Avoid deadlocking in garbage collection · bc9389ee

由 Kent Overstreet 提交于 9月 10, 2013

Not a complete fix - we could still deadlock if btree_insert_node() has
to split...
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

bc9389ee

bcache: Incremental gc · a1f0358b

由 Kent Overstreet 提交于 9月 10, 2013

Big garbage collection rewrite; now, garbage collection uses the same
mechanisms as used elsewhere for inserting/updating btree node pointers,
instead of rewriting interior btree nodes in place.

This makes the code significantly cleaner and less fragile, and means we
can now make garbage collection incremental - it doesn't have to hold a
write lock on the root of the btree for the entire duration of garbage
collection.

This means that there's less of a latency hit for doing garbage
collection, which means we can gc more frequently (and do a better job
of reclaiming from the cache), and we can coalesce across more btree
nodes (improving our space efficiency).
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

a1f0358b

bcache: bch_(btree|extent)_ptr_invalid() · d5cc66e9

由 Kent Overstreet 提交于 7月 24, 2013

Trying to treat btree pointers and leaf node pointers the same way was a
mistake - going to start being more explicit about the type of
key/pointer we're dealing with. This is the first part of that
refactoring; this patch shouldn't change any actual behaviour.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

d5cc66e9

bcache: Don't bother with bucket refcount for btree node allocations · 3a3b6a4e

由 Kent Overstreet 提交于 7月 24, 2013

The bucket refcount (dropped with bkey_put()) is only needed to prevent
the newly allocated bucket from being garbage collected until we've
added a pointer to it somewhere. But for btree node allocations, the
fact that we have btree nodes locked is enough to guard against races
with garbage collection.

Eventually the per bucket refcount is going to be replaced with
something specific to bch_alloc_sectors().
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

3a3b6a4e

bcache: Debug code improvements · 280481d0

由 Kent Overstreet 提交于 10月 24, 2013

Couple changes:
 * Consolidate bch_check_keys() and bch_check_key_order(), and move the
   checks that only check_key_order() could do to bch_btree_iter_next().

 * Get rid of CONFIG_BCACHE_EDEBUG - now, all that code is compiled in
   when CONFIG_BCACHE_DEBUG is enabled, and there's now a sysfs file to
   flip on the EDEBUG checks at runtime.

 * Dropped an old not terribly useful check in rw_unlock(), and
   refactored/improved a some of the other debug code.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

280481d0

bcache: Convert bch_btree_insert() to bch_btree_map_leaf_nodes() · cc7b8819

由 Kent Overstreet 提交于 7月 24, 2013

Last of the btree_map() conversions. Main visible effect is
bch_btree_insert() is no longer taking a struct btree_op as an argument
anymore - there's no fancy state machine stuff going on, it's just a
normal function.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

cc7b8819

bcache: Kill op->replace · 1b207d80

由 Kent Overstreet 提交于 9月 10, 2013

This is prep work for converting bch_btree_insert to
bch_btree_map_leaf_nodes() - we have to convert all its arguments to
actual arguments. Bunch of churn, but should be straightforward.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

1b207d80

bcache: Kill op->cl · b54d6934

由 Kent Overstreet 提交于 7月 24, 2013

This isn't used for waiting asynchronously anymore - so this is a fairly
trivial refactoring.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

b54d6934

bcache: Prune struct btree_op · c18536a7

由 Kent Overstreet 提交于 7月 24, 2013

Eventual goal is for struct btree_op to contain only what is necessary
for traversing the btree.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

c18536a7

bcache: Convert bch_btree_read_async() to bch_btree_map_keys() · 2c1953e2

由 Kent Overstreet 提交于 7月 24, 2013

This is a fairly straightforward conversion, mostly reshuffling -
op->lookup_done goes away, replaced by MAP_DONE/MAP_CONTINUE. And the
code for handling cache hits and misses wasn't really btree code, so it
gets moved to request.c.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

2c1953e2

bcache: Move some stuff to btree.c · df8e8970

由 Kent Overstreet 提交于 7月 24, 2013

With the new btree_map() functions, we don't need to export the stuff
needed for traversing the btree anymore.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

df8e8970

bcache: Add btree_map() functions · 48dad8ba

由 Kent Overstreet 提交于 9月 10, 2013

Lots of stuff has been open coding its own btree traversal - which is
generally pretty simple code, but there are a few subtleties.

This adds new new functions, bch_btree_map_nodes() and
bch_btree_map_keys(), which do the traversal for you. Everything that's
open coding btree traversal now (with the exception of garbage
collection) is slowly going to be converted to these two functions;
being able to write other code at a higher level of abstraction  is a
big improvement w.r.t. overall code quality.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

48dad8ba

bcache: Convert gc to a kthread · 72a44517

由 Kent Overstreet 提交于 10月 24, 2013

We needed a dedicated rescuer workqueue for gc anyways... and gc was
conceptually a dedicated thread, just one that wasn't running all the
time. Switch it to a dedicated thread to make the code a bit more
straightforward.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

72a44517

bcache: Convert bucket_wait to wait_queue_head_t · 35fcd848

由 Kent Overstreet 提交于 7月 24, 2013

At one point we did do fancy asynchronous waiting stuff with
bucket_wait, but that's all gone (and bucket_wait is used a lot less
than it used to be). So use the standard primitives.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

35fcd848

bcache: Convert try_wait to wait_queue_head_t · e8e1d468

由 Kent Overstreet 提交于 7月 24, 2013

We never waited on c->try_wait asynchronously, so just use the standard
primitives.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

e8e1d468

bcache: Move keylist out of btree_op · 0b93207a

由 Kent Overstreet 提交于 7月 24, 2013

Slowly working on pruning struct btree_op - the aim is for it to only
contain things that are actually necessary for traversing the btree.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

0b93207a

bcache: Refactor request_write() · 84f0db03

由 Kent Overstreet 提交于 7月 24, 2013

Try to improve some of the naming a bit to be more consistent, and also
improve the flow of control in request_write() a bit.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

84f0db03

bcache: Add explicit keylist arg to btree_insert() · 4f3d4014

由 Kent Overstreet 提交于 9月 10, 2013

Some refactoring - better to explicitly pass stuff around instead of
having it all in the "big bag of state", struct btree_op. Going to prune
struct btree_op quite a bit over time.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

4f3d4014

bcache: Convert btree_insert_check_key() to btree_insert_node() · e7c590eb

由 Kent Overstreet 提交于 9月 10, 2013

This was the main point of all this refactoring - now,
btree_insert_check_key() won't fail just because the leaf node happened
to be full.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

e7c590eb

bcache: Explicitly track btree node's parent · d6fd3b11

由 Kent Overstreet 提交于 7月 24, 2013

This is prep work for the reworked btree insertion code.

The way we set b->parent is ugly and hacky... the problem is, when
btree_split() or garbage collection splits or rewrites a btree node, the
parent changes for all its (potentially already cached) children.

I may change this later and add some code to look through the btree node
cache and find all our cached child nodes and change the parent pointer
then...
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

d6fd3b11

bcache: Remove unnecessary check in should_split() · 8304ad4d

由 Kent Overstreet 提交于 7月 24, 2013

Checking i->seq was redundant, because since ages ago we always
initialize the new bset when advancing b->written
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

8304ad4d

02 7月, 2013 1 次提交

bcache: Delete fuzz tester · f3059a54

由 Kent Overstreet 提交于 5月 15, 2013

This code has rotted and it hasn't been used in ages anyways.
Signed-off-by: NKent Overstreet <kmo@daterainc.com>

f3059a54

27 6月, 2013 3 次提交

bcache: Write out full stripes · 72c27061

由 Kent Overstreet 提交于 6月 05, 2013

Now that we're tracking dirty data per stripe, we can add two
optimizations for raid5/6:

 * If a stripe is already dirty, force writes to that stripe to
   writeback mode - to help build up full stripes of dirty data

 * When flushing dirty data, preferentially write out full stripes first
   if there are any.
Signed-off-by: NKent Overstreet <koverstreet@google.com>

72c27061

bcache: Rip out pkey()/pbtree() · 85b1492e

由 Kent Overstreet 提交于 5月 14, 2013

Old gcc doesnt like the struct hack, and it is kind of ugly. So finish
off the work to convert pr_debug() statements to tracepoints, and delete
pkey()/pbtree().
Signed-off-by: NKent Overstreet <koverstreet@google.com>

85b1492e

bcache: Refactor btree io · 57943511

由 Kent Overstreet 提交于 4月 25, 2013

The most significant change is that btree reads are now done
synchronously, instead of asynchronously and doing the post read stuff
from a workqueue.

This was originally done because we can't block on IO under
generic_make_request(). But - we already have a mechanism to punt cache
lookups to workqueue if needed, so if we just use that we don't have to
deal with the complexity of doing things asynchronously.

The main benefit is this makes the locking situation saner; we can hold
our write lock on the btree node until we're finished reading it, and we
don't need that btree_node_read_done() flag anymore.

Also, for writes, btree_write() was broken out into btree_node_write()
and btree_leaf_dirty() - the old code with the boolean argument was dumb
and confusing.

The prio_blocked mechanism was improved a bit too, now the only counter
is in struct btree_write, we don't mess with transfering a count from
struct btree anymore.

This required changing garbage collection to block prios at the start
and unblock when it finishes, which is cleaner than what it was doing
anyways (the old code had mostly the same effect, but was doing it in a
convoluted way)

And the btree iter btree_node_read_done() uses was converted to a real
mempool.
Signed-off-by: NKent Overstreet <koverstreet@google.com>

57943511

24 3月, 2013 1 次提交

bcache: A block layer cache · cafe5635

由 Kent Overstreet 提交于 3月 23, 2013

Does writethrough and writeback caching, handles unclean shutdown, and
has a bunch of other nifty features motivated by real world usage.

See the wiki at http://bcache.evilpiepirate.org for more.
Signed-off-by: NKent Overstreet <koverstreet@google.com>

cafe5635

openanolis / cloud-kernel 接近 2 年 前同步成功

openanolis / cloud-kernel
接近 2 年前同步成功