提交 · 80941b2aebd3433594886d7774220c71c2d7ceec · openeuler / Kernel

24 5月, 2017 1 次提交

libceph: drop version variable from ceph_monmap_decode() · f3b4e55d

由 Ilya Dryomov 提交于 5月 19, 2017

It's set but not used: CEPH_FEATURE_MONNAMES feature bit isn't
advertised, which guarantees a v1 MonMap.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
Reviewed-by: NAlex Elder <elder@linaro.org>

f3b4e55d

13 12月, 2016 1 次提交

libceph: no need for GFP_NOFS in ceph_monc_init() · 5418d0a2

由 Ilya Dryomov 提交于 12月 02, 2016

It's called during inital setup, when everything should be allocated
with GFP_KERNEL.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
Reviewed-by: NSage Weil <sage@redhat.com>

5418d0a2

25 8月, 2016 1 次提交

libceph: support for blacklisting clients · 6305a3b4

由 Douglas Fuller 提交于 7月 22, 2015

Reuse ceph_mon_generic_request infrastructure for sending monitor
commands.  In particular, add support for 'blacklist add' to prevent
other, non-responsive clients from making further updates.
Signed-off-by: NDouglas Fuller <dfuller@redhat.com>
[idryomov@gmail.com: refactor, misc fixes throughout]
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
Reviewed-by: NMike Christie <mchristi@redhat.com>
Reviewed-by: NAlex Elder <elder@linaro.org>

6305a3b4

09 8月, 2016 1 次提交

libceph: make cancel_generic_request() static · f52ec33c

由 Wei Yongjun 提交于 7月 30, 2016

Fixes the following sparse warning:

net/ceph/mon_client.c:577:6: warning:
 symbol 'cancel_generic_request' was not declared. Should it be static?
Signed-off-by: NWei Yongjun <weiyj.lk@gmail.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

f52ec33c

28 7月, 2016 1 次提交

libceph: fsmap.user subscription support · 0cabbd94

由 Yan, Zheng 提交于 4月 07, 2016

Signed-off-by: NYan, Zheng <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

0cabbd94

26 5月, 2016 6 次提交

I
libceph: support for subscribing to "mdsmap.<id>" maps · 737cc81e
由 Ilya Dryomov 提交于 5月 26, 2016
```
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
```
737cc81e

libceph: replace ceph_monc_request_next_osdmap() · 7cca78c9

由 Ilya Dryomov 提交于 4月 28, 2016

... with a wrapper around maybe_request_map() - no need for two
osdmap-specific functions.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

7cca78c9

libceph: async MON client generic requests · d0b19705

由 Ilya Dryomov 提交于 4月 28, 2016

For map check, we are going to need to send CEPH_MSG_MON_GET_VERSION
messages asynchronously and get a callback on completion.  Refactor MON
client to allow firing off generic requests asynchronously and add an
async variant of ceph_monc_get_version().  ceph_monc_do_statfs() is
switched over and remains sync.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

d0b19705

libceph: handle_one_map() · 42c1b124

由 Ilya Dryomov 提交于 4月 28, 2016

Separate osdmap handling from decoding and iterating over a bag of maps
in a fresh MOSDMap message.  This sets up the scene for the updated OSD
client.

Of particular importance here is the addition of pi->was_full, which
can be used to answer "did this pool go full -> not-full in this map?".
This is the key bit for supporting pool quotas.

We won't be able to downgrade map_sem for much longer, so drop
downgrade_write().
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

42c1b124

libceph: DEFINE_RB_FUNCS macro · fcd00b68

由 Ilya Dryomov 提交于 4月 28, 2016

Given

    struct foo {
        u64 id;
        struct rb_node bar_node;
    };

generate insert_bar(), erase_bar() and lookup_bar() functions with

    DEFINE_RB_FUNCS(bar, struct foo, id, bar_node)

The key is assumed to be an integer (u64, int, etc), compared with
< and >.  nodefld has to be initialized with RB_CLEAR_NODE().

Start using it for MDS, MON and OSD requests and OSD sessions.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

fcd00b68

libceph: nuke unused fields and functions · 0c0a8de1

由 Ilya Dryomov 提交于 4月 28, 2016

Either unused or useless:

    osdmap->mkfs_epoch
    osd->o_marked_for_keepalive
    monc->num_generic_requests
    osdc->map_waiters
    osdc->last_requested_map
    osdc->timeout_tid

    osd_req_op_cls_response_data()

    osdmap_apply_incremental() @msgr arg
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

0c0a8de1

26 3月, 2016 9 次提交

libceph: behave in mon_fault() if cur_mon < 0 · b5d91704

由 Ilya Dryomov 提交于 1月 23, 2016

This can happen if __close_session() in ceph_monc_stop() races with
a connection reset.  We need to ignore such faults, otherwise it's
likely we would take !hunting, call __schedule_delayed() and end up
with delayed_work() executing on invalid memory, among other things.

The (two!) con->private tests are useless, as nothing ever clears
con->private.  Nuke them.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

b5d91704

libceph: reschedule tick in mon_fault() · bee3a37c

由 Ilya Dryomov 提交于 1月 22, 2016

Doing __schedule_delayed() in the hunting branch is pointless, as the
tick will have already been scheduled by then.

What we need to do instead is *reschedule* it in the !hunting branch,
after reopen_session() changes hunt_mult, which affects the delay.
This helps with spacing out connection attempts and avoiding things
like two back-to-back attempts followed by a longer period of waiting
around.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

bee3a37c

libceph: introduce and switch to reopen_session() · 1752b50c

由 Ilya Dryomov 提交于 1月 21, 2016

hunting is now set in __open_session() and cleared in finish_hunting(),
instead of all around.  The "session lost" message is printed not only
on connection resets, but also on keepalive timeouts.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

1752b50c

libceph: monc hunt rate is 3s with backoff up to 30s · 168b9090

由 Ilya Dryomov 提交于 1月 21, 2016

Unless we are in the process of setting up a client (i.e. connecting to
the monitor cluster for the first time), apply a backoff: every time we
want to reopen a session, increase our timeout by a multiple (currently
2); when we complete the connection, reduce that multipler by 50%.

Mirrors ceph.git commit 794c86fd289bd62a35ed14368fa096c46736e9a2.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

168b9090

libceph: monc ping rate is 10s · 58d81b12

由 Ilya Dryomov 提交于 1月 21, 2016

Split ping interval and ping timeout: ping interval is 10s; keepalive
timeout is 30s.

Make monc_ping_timeout a constant while at it - it's not actually
exported as a mount option (and the rest of tick-related settings won't
be either), so it's got no place in ceph_options.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

58d81b12

libceph: pick a different monitor when reconnecting · 0e04dc26

由 Ilya Dryomov 提交于 1月 20, 2016

Don't try to reconnect to the same monitor when we fail to establish
a session within a timeout or it's lost.

For that, pick_new_mon() needs to see the old value of cur_mon, so
don't clear it in __close_session() - all calls to __close_session()
but one are followed by __open_session() anyway. __open_session() is
only called when a new session needs to be established, so the "already
open?" branch, which is now in the way, is simply dropped.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

0e04dc26

libceph: revamp subs code, switch to SUBSCRIBE2 protocol · 82dcabad

由 Ilya Dryomov 提交于 1月 19, 2016

It is currently hard-coded in the mon_client that mdsmap and monmap
subs are continuous, while osdmap sub is always "onetime". To better
handle full clusters/pools in the osd_client, we need to be able to
issue continuous osdmap subs. Revamp subs code to allow us to specify
for each sub whether it should be continuous or not.

Although not strictly required for the above, switch to SUBSCRIBE2
protocol while at it, eliminating the ambiguity between a request for
"every map since X" and a request for "just the latest" when we don't
have a map yet (i.e. have epoch 0). SUBSCRIBE2 feature bit is now
required - it's been supported since pre-argonaut (2010).

Move "got mdsmap" call to the end of ceph_mdsc_handle_map() - calling
in before we validate the epoch and successfully install the new map
can mess up mon_client sub state.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

82dcabad

libceph: decouple hunting and subs management · 0f9af169

由 Ilya Dryomov 提交于 1月 08, 2016

Coupling hunting state with subscribe state is not a good idea.  Clear
hunting when we complete the authentication handshake.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

0f9af169

libceph: move debugfs initialization into __ceph_open_session() · 02ac956c

由 Ilya Dryomov 提交于 1月 06, 2016

Our debugfs dir name is a concatenation of cluster fsid and client
unique ID ("global_id"). It used to be the case that we learned
global_id first, nowadays we always learn fsid first - the monmap is
sent before any auth replies are. ceph_debugfs_client_init() call in
ceph_monc_handle_map() is therefore never executed and can be removed.

Its counterpart in handle_auth_reply() doesn't really belong there
either: having to do monc->client and unlocking early to work around
lockdep is a testament to that. Move it into __ceph_open_session(),
where it can be called unconditionally.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

02ac956c

22 1月, 2016 1 次提交

libceph: remove outdated comment · 7e01726a

由 Ilya Dryomov 提交于 1月 18, 2016

MClientMount{,Ack} are long gone.  The receipt of bare monmap doesn't
actually indicate a mount success as we are yet to authenticate at that
point in time.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

7e01726a

09 9月, 2015 1 次提交
- Y
  libceph: use keepalive2 to verify the mon session is alive · 8b9558aa
  由 Yan, Zheng 提交于 9月 01, 2015
```
Signed-off-by: NYan, Zheng <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
```
  8b9558aa
25 6月, 2015 2 次提交

libceph: a couple tweaks for wait loops · 216639dd

由 Ilya Dryomov 提交于 5月 19, 2015

- return -ETIMEDOUT instead of -EIO in case of timeout
- wait_event_interruptible_timeout() returns time left until timeout
  and since it can be almost LONG_MAX we had better assign it to long
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
Reviewed-by: NAlex Elder <elder@linaro.org>

216639dd

libceph: store timeouts in jiffies, verify user input · a319bf56

由 Ilya Dryomov 提交于 5月 15, 2015

There are currently three libceph-level timeouts that the user can
specify on mount: mount_timeout, osd_idle_ttl and osdkeepalive.  All of
these are in seconds and no checking is done on user input: negative
values are accepted, we multiply them all by HZ which may or may not
overflow, arbitrarily large jiffies then get added together, etc.

There is also a bug in the way mount_timeout=0 is handled.  It's
supposed to mean "infinite timeout", but that's not how wait.h APIs
treat it and so __ceph_open_session() for example will busy loop
without much chance of being interrupted if none of ceph-mons are
there.

Fix all this by verifying user input, storing timeouts capped by
msecs_to_jiffies() in jiffies and using the new ceph_timeout_jiffies()
helper for all user-specified waits to handle infinite timeouts
correctly.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
Reviewed-by: NAlex Elder <elder@linaro.org>

a319bf56

19 2月, 2015 2 次提交

I
libceph: use mon_client.c/put_generic_request() more · f646912d
由 Ilya Dryomov 提交于 12月 22, 2014
```
Signed-off-by: NIlya Dryomov <idryomov@redhat.com>
```
f646912d

libceph: nuke pool op infrastructure · 7a6fdeb2

由 Ilya Dryomov 提交于 12月 22, 2014

On Mon, Dec 22, 2014 at 5:35 PM, Sage Weil <sage@newdream.net> wrote:
> On Mon, 22 Dec 2014, Ilya Dryomov wrote:
>> Actually, pool op stuff has been unused for over two years - looks like
>> it was added for rbd create_snap and that got ripped out in 2012.  It's
>> unlikely we'd ever need to manage pools or snaps from the kernel client
>> so I think it makes sense to nuke it.  Sage?
>
> Yep!
Signed-off-by: NIlya Dryomov <idryomov@redhat.com>

7a6fdeb2

09 1月, 2015 1 次提交

libceph: fix sparse endianness warnings · d7d5a007

由 Ilya Dryomov 提交于 12月 19, 2014

The only real issue is the one in auth_x.c and it came with
3.19-rc1 merge.
Signed-off-by: NIlya Dryomov <idryomov@redhat.com>

d7d5a007

15 10月, 2014 1 次提交

libceph: Convert pr_warning to pr_warn · b9a67899

由 Joe Perches 提交于 9月 09, 2014

Use the more common pr_warn.

Other miscellanea:

o Coalesce formats
o Realign arguments
Signed-off-by: NJoe Perches <joe@perches.com>
Signed-off-by: NIlya Dryomov <ilya.dryomov@inktank.com>

b9a67899

11 9月, 2014 1 次提交

libceph: gracefully handle large reply messages from the mon · 73c3d481

由 Sage Weil 提交于 8月 04, 2014

We preallocate a few of the message types we get back from the mon.  If we
get a larger message than we are expecting, fall back to trying to allocate
a new one instead of blindly using the one we have.

CC: stable@vger.kernel.org
Signed-off-by: NSage Weil <sage@redhat.com>
Reviewed-by: NIlya Dryomov <ilya.dryomov@inktank.com>

73c3d481

06 6月, 2014 2 次提交

libceph: add ceph_monc_wait_osdmap() · 6044cde6

由 Ilya Dryomov 提交于 5月 13, 2014

Add ceph_monc_wait_osdmap(), which will block until the osdmap with the
specified epoch is received or timeout occurs.

Export both of these as they are going to be needed by rbd.
Signed-off-by: NIlya Dryomov <ilya.dryomov@inktank.com>
Reviewed-by: NSage Weil <sage@inktank.com>

6044cde6

libceph: mon_get_version request infrastructure · 513a8243

由 Ilya Dryomov 提交于 5月 13, 2014

Add support for mon_get_version requests to libceph.  This reuses much
of the ceph_mon_generic_request infrastructure, with one exception.
Older OSDs don't set mon_get_version reply hdr->tid even if the
original request had a non-zero tid, which makes it impossible to
lookup ceph_mon_generic_request contexts by tid in get_generic_reply()
for such replies.  As a workaround, we allocate a reply message on the
reply path.  This can probably interfere with revoke, but I don't see
a better way.
Signed-off-by: NIlya Dryomov <ilya.dryomov@inktank.com>
Reviewed-by: NSage Weil <sage@inktank.com>

513a8243

14 1月, 2014 1 次提交

libceph: rename ceph_msg::front_max to front_alloc_len · 3cea4c30

由 Ilya Dryomov 提交于 1月 09, 2014

Rename front_max field of struct ceph_msg to front_alloc_len to make
its purpose more clear.
Signed-off-by: NIlya Dryomov <ilya.dryomov@inktank.com>
Reviewed-by: NSage Weil <sage@inktank.com>

3cea4c30

02 5月, 2013 1 次提交

libceph: wrap auth ops in wrapper functions · 27859f97

由 Sage Weil 提交于 3月 25, 2013

Use wrapper functions that check whether the auth op exists so that callers
do not need a bunch of conditional checks.  Simplifies the external
interface.
Signed-off-by: NSage Weil <sage@inktank.com>
Reviewed-by: NAlex Elder <elder@inktank.com>

27859f97

26 2月, 2013 1 次提交

libceph: eliminate sparse warnings · 15417167

由 Alex Elder 提交于 2月 19, 2013

Eliminate most of the problems in the libceph code that cause sparse
to issue warnings.
    - Convert functions that are never referenced externally to have
      static scope.
    - Pass NULL rather than 0 for a pointer argument in one spot in
      ceph_monc_delete_snapid()

This partially resolves:
    http://tracker.ceph.com/issues/4184Reported-by: NFengguang Wu <fengguang.wu@intel.com>
Signed-off-by: NAlex Elder <elder@inktank.com>
Reviewed-by: NJosh Durgin <josh.durgin@inktank.com>

15417167

02 10月, 2012 2 次提交

libceph: Fix sparse warning · 7698f2f5

由 Iulius Curt 提交于 8月 23, 2012

Make ceph_monc_do_poolop() static to remove the following sparse warning:
 * net/ceph/mon_client.c:616:5: warning: symbol 'ceph_monc_do_poolop' was not
   declared. Should it be static?
Also drops the 'ceph_monc_' prefix, now being a private function.
Signed-off-by: NIulius Curt <icurt@ixiacom.com>
Signed-off-by: NSage Weil <sage@inktank.com>

7698f2f5

S
libceph: remove unused monc->have_fsid · 290e3359
由 Sage Weil 提交于 8月 17, 2012
```
This is unused; use monc->client->have_fsid.
Signed-off-by: NSage Weil <sage@inktank.com>
```
290e3359

21 8月, 2012 1 次提交

libceph: delay debugfs initialization until we learn global_id · d1c338a5

由 Sage Weil 提交于 8月 19, 2012

The debugfs directory includes the cluster fsid and our unique global_id.
We need to delay the initialization of the debug entry until we have
learned both the fsid and our global_id from the monitor or else the
second client can't create its debugfs entry and will fail (and multiple
client instances aren't properly reflected in debugfs).

Reported by: Yan, Zheng <zheng.z.yan@intel.com>
Signed-off-by: NSage Weil <sage@inktank.com>
Reviewed-by: NYehuda Sadeh <yehuda@inktank.com>

d1c338a5

31 7月, 2012 1 次提交

libceph: revoke mon_client messages on session restart · 4f471e4a

由 Sage Weil 提交于 7月 30, 2012

Revoke all mon_client messages when we shut down the old connection.
This is mostly moot since we are re-using the same ceph_connection,
but it is cleaner.
Signed-off-by: NSage Weil <sage@inktank.com>
Reviewed-by: NAlex Elder <elder@inktank.com>

4f471e4a

06 7月, 2012 2 次提交

libceph: initialize mon_client con only once · 735a72ef

由 Sage Weil 提交于 6月 27, 2012

Do not re-initialize the con on every connection attempt. When we
ceph_con_close, there may still be work queued on the socket (e.g., to
close it), and re-initializing will clobber the work_struct state.
Signed-off-by: NSage Weil <sage@inktank.com>

735a72ef

libceph: set peer name on con_open, not init · b7a9e5dd

由 Sage Weil 提交于 6月 27, 2012

The peer name may change on each open attempt, even when the connection is
reused.
Signed-off-by: NSage Weil <sage@inktank.com>

b7a9e5dd

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功