提交 · d24cdcd3e40a6825135498e11c20c7976b9bf545 · OpenHarmony / kernel_linux

20 2月, 2017 1 次提交

libceph: use BUG() instead of BUG_ON(1) · d24cdcd3

由 Arnd Bergmann 提交于 1月 16, 2017

I ran into this compile warning, which is the result of BUG_ON(1)
not always leading to the compiler treating the code path as
unreachable:

    include/linux/ceph/osdmap.h: In function 'ceph_can_shift_osds':
    include/linux/ceph/osdmap.h:62:1: error: control reaches end of non-void function [-Werror=return-type]

Using BUG() here avoids the warning.
Signed-off-by: NArnd Bergmann <arnd@arndb.de>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

d24cdcd3

15 12月, 2016 1 次提交

libceph: always signal completion when done · c297eb42

由 Ilya Dryomov 提交于 12月 02, 2016

r_safe_completion is currently, and has always been, signaled only if
on-disk ack was requested. It's there for fsync and syncfs, which wait
for in-flight writes to flush - all data write requests set ONDISK.

However, the pool perm check code introduced in 4.2 sends a write
request with only ACK set. An unfortunately timed syncfs can then hang
forever: r_safe_completion won't be signaled because only an unsafe
reply was requested.

We could patch ceph_osdc_sync() to skip !ONDISK write requests, but
that is somewhat incomplete and yet another special case. Instead,
rename this completion to r_done_completion and always signal it when
the OSD client is done with the request, whether unsafe, safe, or
error. This is a bit cleaner and helps with the cancellation code.
Reported-by: NYan, Zheng <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

c297eb42

13 12月, 2016 3 次提交

ceph: add flags parameter to send_cap_msg · 1e4ef0c6

由 Jeff Layton 提交于 11月 10, 2016

Add a flags parameter to send_cap_msg, so we can request expedited
service from the MDS when we know we'll be waiting on the result.

Set that flag in the case of try_flush_caps. The callers of that
function generally wait synchronously on the result, so it's beneficial
to ask the server to expedite it.
Signed-off-by: NJeff Layton <jlayton@redhat.com>
Reviewed-by: NYan, Zheng <zyan@redhat.com>

1e4ef0c6

Y
ceph: check availability of mds cluster on mount · e9e427f0
由 Yan, Zheng 提交于 11月 10, 2016
```
Signed-off-by: NYan, Zheng <zyan@redhat.com>
```
e9e427f0

libceph: drop len argument of *verify_authorizer_reply() · 0dde5848

由 Ilya Dryomov 提交于 12月 02, 2016

The length of the reply is protocol-dependent - for cephx it's
ceph_x_authorize_reply.  Nothing sensible can be passed from the
messenger layer anyway.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
Reviewed-by: NSage Weil <sage@redhat.com>

0dde5848

11 11月, 2016 1 次提交

libceph: initialize last_linger_id with a large integer · 264048af

由 Ilya Dryomov 提交于 11月 08, 2016

osdc->last_linger_id is a counter for lreq->linger_id, which is used
for watch cookies.  Starting with a large integer should ease the task
of telling apart kernel and userspace clients.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

264048af

01 11月, 2016 1 次提交

ceph: don't include blk_types.h in messenger.h · 9f082171

由 Christoph Hellwig 提交于 11月 01, 2016

The file only needs the struct bvec_iter delcaration, which is available
from bvec.h.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NJens Axboe <axboe@fb.com>

9f082171

03 10月, 2016 1 次提交
- Y
  ceph: handle CEPH_SESSION_REJECT message · fcff415c
  由 Yan, Zheng 提交于 9月 14, 2016
```
Signed-off-by: NYan, Zheng <zyan@redhat.com>
```
  fcff415c
25 8月, 2016 8 次提交

rbd: add 'client_addr' sysfs rbd device attribute · 005a07bf

由 Ilya Dryomov 提交于 8月 18, 2016

Export client addr/nonce, so userspace can check if a image is being
blacklisted.
Signed-off-by: NMike Christie <mchristi@redhat.com>
[idryomov@gmail.com: ceph_client_addr(), endianess fix]
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

005a07bf

libceph: rename ceph_client_id() -> ceph_client_gid() · 033268a5

由 Ilya Dryomov 提交于 8月 12, 2016

It's gid / global_id in other places.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
Reviewed-by: NMike Christie <mchristi@redhat.com>
Reviewed-by: NAlex Elder <elder@linaro.org>

033268a5

libceph: support for blacklisting clients · 6305a3b4

由 Douglas Fuller 提交于 7月 22, 2015

Reuse ceph_mon_generic_request infrastructure for sending monitor
commands.  In particular, add support for 'blacklist add' to prevent
other, non-responsive clients from making further updates.
Signed-off-by: NDouglas Fuller <dfuller@redhat.com>
[idryomov@gmail.com: refactor, misc fixes throughout]
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
Reviewed-by: NMike Christie <mchristi@redhat.com>
Reviewed-by: NAlex Elder <elder@linaro.org>

6305a3b4

libceph: support for lock.lock_info · d4ed4a53

由 Douglas Fuller 提交于 6月 29, 2015

Add an interface for the Ceph OSD lock.lock_info method and associated
data structures.

Based heavily on code by Mike Christie <michaelc@cs.wisc.edu>.
Signed-off-by: NDouglas Fuller <dfuller@redhat.com>
[idryomov@gmail.com: refactor, misc fixes throughout]
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
Reviewed-by: NMike Christie <mchristi@redhat.com>
Reviewed-by: NAlex Elder <elder@linaro.org>

d4ed4a53

libceph: support for advisory locking on RADOS objects · f66241cb

由 Douglas Fuller 提交于 6月 18, 2015

This patch adds support for rados lock, unlock and break lock.

Based heavily on code by Mike Christie <michaelc@cs.wisc.edu>.
Signed-off-by: NDouglas Fuller <dfuller@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
Reviewed-by: NMike Christie <mchristi@redhat.com>
Reviewed-by: NAlex Elder <elder@linaro.org>

f66241cb

libceph: add ceph_osdc_call() single-page helper · 428a7158

由 Douglas Fuller 提交于 6月 17, 2015

Add a convenience function to osd_client to send Ceph OSD
'class' ops. The interface assumes that the request and
reply data each consist of single pages.
Signed-off-by: NDouglas Fuller <dfuller@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
Reviewed-by: NMike Christie <mchristi@redhat.com>
Reviewed-by: NAlex Elder <elder@linaro.org>

428a7158

libceph: support for CEPH_OSD_OP_LIST_WATCHERS · a4ed38d7

由 Douglas Fuller 提交于 7月 17, 2015

Add support for this Ceph OSD op, needed to support the RBD exclusive
lock feature.
Signed-off-by: NDouglas Fuller <dfuller@redhat.com>
[idryomov@gmail.com: refactor, misc fixes throughout]
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
Reviewed-by: NMike Christie <mchristi@redhat.com>
Reviewed-by: NAlex Elder <elder@linaro.org>

a4ed38d7

libceph: rename ceph_entity_name_encode() -> ceph_auth_entity_name_encode() · f01d5cb2

由 Ilya Dryomov 提交于 6月 02, 2016

Clear up EntityName vs entity_name_t confusion.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
Reviewed-by: NMike Christie <mchristi@redhat.com>
Reviewed-by: NAlex Elder <elder@linaro.org>

f01d5cb2

28 7月, 2016 9 次提交

ceph: fix symbol versioning for ceph_monc_do_statfs · a0f2b652

由 Arnd Bergmann 提交于 6月 13, 2016

The genksyms helper in the kernel cannot parse a type definition
like "typeof(((type *)0)->keyfld)" that is used in the DEFINE_RB_FUNCS
helper, causing the following EXPORT_SYMBOL() statement to be ignored
when computing the crcs, and triggering a warning about this:

WARNING: "ceph_monc_do_statfs" [fs/ceph/ceph.ko] has no CRC

To work around the problem, we can rewrite the type to reference
an undefined 'extern' symbol instead of a NULL pointer. This is
evidently ok for genksyms, and it no longer complains about the
line when calling it with 'genksyms -w'.

I've looked briefly into extending genksyms instead, but it seems
really hard to do. Jan Beulich introduced basic support for 'typeof'
a while ago in dc533240 ("genksyms: fix typeof() handling"),
but that is not sufficient for the expression we have here.
Signed-off-by: NArnd Bergmann <arnd@arndb.de>
Fixes: fcd00b68 ("libceph: DEFINE_RB_FUNCS macro")
Cc: Jan Beulich <jbeulich@suse.com>
Cc: Michal Marek <mmarek@suse.cz>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

a0f2b652

libceph: fsmap.user subscription support · 0cabbd94

由 Yan, Zheng 提交于 4月 07, 2016

Signed-off-by: NYan, Zheng <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

0cabbd94

ceph: reduce i_nr_by_mode array size · 774a6a11

由 Yan, Zheng 提交于 6月 06, 2016

Track usage count for individual fmode bit. This can reduce the
array size by half.
Signed-off-by: NYan, Zheng <zyan@redhat.com>

774a6a11

libceph: rados pool namespace support · 30c156d9

由 Yan, Zheng 提交于 2月 14, 2016

Add pool namesapce pointer to struct ceph_file_layout and struct
ceph_object_locator. Pool namespace is used by when mapping object
to PG, it's also used when composing OSD request.

The namespace pointer in struct ceph_file_layout is RCU protected.
So libceph can read namespace without taking lock.
Signed-off-by: NYan, Zheng <zyan@redhat.com>
[idryomov@gmail.com: ceph_oloc_destroy(), misc minor changes]
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

30c156d9

libceph: introduce reference counted string · 51e92737

由 Yan, Zheng 提交于 2月 05, 2016

The data structure is for storing namesapce string. It allows namespace
string to be shared between cephfs inodes with same layout. This data
structure can also be referenced by OSD request.
Signed-off-by: NYan, Zheng <zyan@redhat.com>

51e92737

libceph: define new ceph_file_layout structure · 7627151e

由 Yan, Zheng 提交于 2月 03, 2016

Define new ceph_file_layout structure and rename old ceph_file_layout
to ceph_file_layout_legacy. This is preparation for adding namespace
to ceph_file_layout structure.
Signed-off-by: NYan, Zheng <zyan@redhat.com>

7627151e

libceph: add start en/decoding block helpers · 22748f9d

由 Ilya Dryomov 提交于 6月 02, 2016

Add ceph_start_encoding() and ceph_start_decoding(), the equivalent of
ENCODE_START and DECODE_START in the userspace ceph code.

This is based on a patch from Mike Christie <michaelc@cs.wisc.edu>.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

22748f9d

libceph: add an ONSTACK initializer for oids · 281dbe5d

由 Ilya Dryomov 提交于 7月 26, 2016

An on-stack oid in ceph_ioctl_get_dataloc() is not initialized,
resulting in a WARN and a NULL pointer dereference later on.  We will
have more of these on-stack in the future, so fix it with a convenience
macro.

Fixes: d30291b9 ("libceph: variable-sized ceph_object_id")
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

281dbe5d

libceph: fix some missing includes · b2aa5d0b

由 Ilya Dryomov 提交于 6月 07, 2016

- decode.h needs slab.h for kmalloc()
- osd_client.h needs msgpool.h for struct ceph_msgpool
- msgpool.h doesn't need messenger.h
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

b2aa5d0b

31 5月, 2016 1 次提交

libceph: change ceph_osdmap_flag() to take osdc · b7ec35b3

由 Ilya Dryomov 提交于 4月 28, 2016

For the benefit of every single caller, take osdc instead of map.
Also, now that osdc->osdmap can't ever be NULL, drop the check.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

b7ec35b3

26 5月, 2016 14 次提交

ceph: make logical calculation functions return bool · 3b33f692

由 Zhang Zhuoyu 提交于 3月 25, 2016

This patch makes serverl logical caculation functions return bool to
improve readability due to these particular functions only using 0/1
as their return value.

No functional change.
Signed-off-by: NZhang Zhuoyu <zhangzhuoyu@cmss.chinamobile.com>

3b33f692

ceph: using hash value to compose dentry offset · f3c4ebe6

由 Yan, Zheng 提交于 4月 29, 2016

If MDS sorts dentries in dirfrag in hash order, we use hash value to
compose dentry offset. dentry offset is:

  (0xff << 52) | ((24 bits hash) << 28) |
  (the nth entry hash hash collision)

This offset is stable across directory fragmentation. This alos means
there is no need to reset readdir offset if directory get fragmented
in the middle of readdir.
Signed-off-by: NYan, Zheng <zyan@redhat.com>

f3c4ebe6

ceph: define 'end/complete' in readdir reply as bit flags · 956d39d6

由 Yan, Zheng 提交于 4月 27, 2016

Set a flag in readdir request, which indicates that client interprets
'end/complete' as bit flags. So that mds can reply additional flags in
readdir reply.
Signed-off-by: NYan, Zheng <zyan@redhat.com>

956d39d6

I
libceph: support for subscribing to "mdsmap.<id>" maps · 737cc81e
由 Ilya Dryomov 提交于 5月 26, 2016
```
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
```
737cc81e

libceph: replace ceph_monc_request_next_osdmap() · 7cca78c9

由 Ilya Dryomov 提交于 4月 28, 2016

... with a wrapper around maybe_request_map() - no need for two
osdmap-specific functions.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

7cca78c9

libceph: pool deletion detection · 4609245e

由 Ilya Dryomov 提交于 4月 28, 2016

This adds the "map check" infrastructure for sending osdmap version
checks on CALC_TARGET_POOL_DNE and completing in-flight requests with
-ENOENT if the target pool doesn't exist or has just been deleted.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

4609245e

libceph: async MON client generic requests · d0b19705

由 Ilya Dryomov 提交于 4月 28, 2016

For map check, we are going to need to send CEPH_MSG_MON_GET_VERSION
messages asynchronously and get a callback on completion.  Refactor MON
client to allow firing off generic requests asynchronously and add an
async variant of ceph_monc_get_version().  ceph_monc_do_statfs() is
switched over and remains sync.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

d0b19705

libceph: support for checking on status of watch · b07d3c4b

由 Ilya Dryomov 提交于 4月 28, 2016

Implement ceph_osdc_watch_check() to be able to check on status of
watch.  Note that the time it takes for a watch/notify event to get
delivered through the notify_wq is taken into account.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

b07d3c4b

libceph: support for sending notifies · 19079203

由 Ilya Dryomov 提交于 4月 28, 2016

Implement ceph_osdc_notify() for sending notifies.

Due to the fact that the current messenger can't do read-in into
pagelists (it can only do write-out from them), I had to go with a page
vector for a NOTIFY_COMPLETE payload, for now.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

19079203

libceph, rbd: ceph_osd_linger_request, watch/notify v2 · 922dab61

由 Ilya Dryomov 提交于 5月 26, 2016

This adds support and switches rbd to a new, more reliable version of
watch/notify protocol.  As with the OSD client update, this is mostly
about getting the right structures linked into the right places so that
reconnects are properly sent when needed.  watch/notify v2 also
requires sending regular pings to the OSDs - send_linger_ping().

A major change from the old watch/notify implementation is the
introduction of ceph_osd_linger_request - linger requests no longer
piggy back on ceph_osd_request.  ceph_osd_event has been merged into
ceph_osd_linger_request.

All the details are now hidden within libceph, the interface consists
of a simple pair of watch/unwatch functions and ceph_osdc_notify_ack().
ceph_osdc_watch() does return ceph_osd_linger_request, but only to keep
the lifetime management simple.

ceph_osdc_notify_ack() accepts an optional data payload, which is
relayed back to the notifier.

Portions of this patch are loosely based on work by Douglas Fuller
<dfuller@redhat.com> and Mike Christie <michaelc@cs.wisc.edu>.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

922dab61

libceph: a major OSD client update · 5aea3dcd

由 Ilya Dryomov 提交于 4月 28, 2016

This is a major sync up, up to ~Jewel.  The highlights are:

- per-session request trees (vs a global per-client tree)
- per-session locking (vs a global per-client rwlock)
- homeless OSD session
- no ad-hoc global per-client lists
- support for pool quotas
- foundation for watch/notify v2 support
- foundation for map check (pool deletion detection) support

The switchover is incomplete: lingering requests can be setup and
teared down but aren't ever reestablished.  This functionality is
restored with the introduction of the new lingering infrastructure
(ceph_osd_linger_request, linger_work, etc) in a later commit.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

5aea3dcd

libceph: protect osdc->osd_lru list with a spinlock · 9dd2845c

由 Ilya Dryomov 提交于 4月 28, 2016

OSD client is getting moved from the big per-client lock to a set of
per-session locks. The big rwlock would only be held for read most of
the time, so a global osdc->osd_lru needs additional protection.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

9dd2845c

libceph: handle_one_map() · 42c1b124

由 Ilya Dryomov 提交于 4月 28, 2016

Separate osdmap handling from decoding and iterating over a bag of maps
in a fresh MOSDMap message.  This sets up the scene for the updated OSD
client.

Of particular importance here is the addition of pi->was_full, which
can be used to answer "did this pool go full -> not-full in this map?".
This is the key bit for supporting pool quotas.

We won't be able to downgrade map_sem for much longer, so drop
downgrade_write().
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

42c1b124

libceph: allocate dummy osdmap in ceph_osdc_init() · e5253a7b

由 Ilya Dryomov 提交于 4月 28, 2016

This leads to a simpler osdmap handling code, particularly when dealing
with pi->was_full, which is introduced in a later commit.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

e5253a7b

OpenHarmony / kernel_linux 上一次同步 4 年多

OpenHarmony / kernel_linux
上一次同步 4 年多