提交 · ba5b56cb3e3d2cab73d4fee9a022bb69462a8cd9 · openanolis / cloud-kernel

27 7月, 2011 1 次提交

libceph: don't time out osd requests that haven't been received · 4cf9d544

由 Sage Weil 提交于 7月 26, 2011

Keep track of when an outgoing message is ACKed (i.e., the server fully
received it and, presumably, queued it for processing). Time out OSD
requests only if it's been too long since they've been received.

This prevents timeouts and connection thrashing when the OSDs are simply
busy and are throttling the requests they read off the network.
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

4cf9d544

14 6月, 2011 1 次提交

libceph: fix page calculation for non-page-aligned io · 9bb0ce2b

由 Sage Weil 提交于 6月 13, 2011

Set the page count correctly for non-page-aligned IO.  We were already
doing this correctly for alignment, but not the page count.  Fixes
DIRECT_IO writes from unaligned pages.
Signed-off-by: NSage Weil <sage@newdream.net>

9bb0ce2b

08 6月, 2011 1 次提交

ceph: fix sync vs canceled write · 25845472

由 Sage Weil 提交于 6月 03, 2011

If we cancel a write, trigger the safe completions to prevent a sync from
blocking indefinitely in ceph_osdc_sync().
Signed-off-by: NSage Weil <sage@newdream.net>

25845472

25 5月, 2011 1 次提交

libceph: subscribe to osdmap when cluster is full · cd634fb6

由 Sage Weil 提交于 5月 12, 2011

When the cluster is marked full, subscribe to subsequent map updates to
ensure we find out promptly when it is no longer full. This will prevent
us from spewing ENOSPC for (much) longer than necessary.
Signed-off-by: NSage Weil <sage@newdream.net>

cd634fb6

20 5月, 2011 2 次提交

ceph: check return value for start_request in writepages · 9d6fcb08

由 Sage Weil 提交于 5月 12, 2011

Since we pass the nofail arg, we should never get an error; BUG if we do.
(And fix the function to not return an error if __map_request fails.)
Signed-off-by: NSage Weil <sage@newdream.net>

9d6fcb08

S
libceph: use snprintf for formatting object name · 2dab036b
由 Sage Weil 提交于 5月 12, 2011
```
Signed-off-by: NSage Weil <sage@newdream.net>
```
2dab036b

04 5月, 2011 1 次提交
- S
  libceph: fix ceph_osdc_alloc_request error checks · 4ad12621
  由 Sage Weil 提交于 5月 03, 2011
```
ceph_osdc_alloc_request returns NULL on failure.
Signed-off-by: NSage Weil <sage@newdream.net>
```
  4ad12621
07 4月, 2011 1 次提交

libceph: fix linger request requeueing · 77f38e0e

由 Sage Weil 提交于 4月 06, 2011

Fix the request transition from linger -> normal request.  The key is to
preserve r_osd and requeue on the same OSD.  Reregister as a normal request,
add the request to the proper queues, then unregister the linger.  Fix the
unregister helper to avoid clearing r_osd (and also simplify the parallel
check in __unregister_request()).
Reported-by: NHenry Chang <henry.cy.chang@gmail.com>
Signed-off-by: NSage Weil <sage@newdream.net>

77f38e0e

31 3月, 2011 1 次提交

Fix common misspellings · 25985edc

由 Lucas De Marchi 提交于 3月 30, 2011

Fixes generated by 'codespell' and manually reviewed.
Signed-off-by: NLucas De Marchi <lucas.demarchi@profusion.mobi>

25985edc

30 3月, 2011 1 次提交

libceph: fix null dereference when unregistering linger requests · fbdb9190

由 Sage Weil 提交于 3月 29, 2011

We should only clear r_osd if we are neither registered as a linger or a
regular request. We may unregister as a linger while still registered as
a regular request (e.g., in reset_osd). Incorrectly clearing r_osd there
leads to a null pointer dereference in __send_request.

Also simplify the parallel check in __unregister_request() where we just
removed r_osd_item and know it's empty.
Signed-off-by: NSage Weil <sage@newdream.net>

fbdb9190

29 3月, 2011 1 次提交

ceph: unlock on error in ceph_osdc_start_request() · 234af26f

由 Dan Carpenter 提交于 3月 29, 2011

There was a missing unlock on the error path if __map_request() failed.
Signed-off-by: NDan Carpenter <error27@gmail.com>
Signed-off-by: NSage Weil <sage@newdream.net>

234af26f

27 3月, 2011 1 次提交

ceph: fix possible NULL pointer dereference · 6b0ae409

由 Mariusz Kozlowski 提交于 3月 26, 2011

This patch fixes 'event_work' dereference before it is checked for NULL.
Signed-off-by: NMariusz Kozlowski <mk@lab.zgora.pl>
Signed-off-by: NSage Weil <sage@newdream.net>

6b0ae409

23 3月, 2011 1 次提交

libceph: add lingering request and watch/notify event framework · a40c4f10

由 Yehuda Sadeh 提交于 3月 21, 2011

Lingering requests are requests that are sent to the OSD normally but
tracked also after we get a successful request.  This keeps the OSD
connection open and resends the original request if the object moves to
another OSD.  The OSD can then send notification messages back to us
if another client initiates a notify.

This framework will be used by RBD so that the client gets notification
when a snapshot is created by another node or tool.
Signed-off-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

a40c4f10

22 3月, 2011 1 次提交

libceph: fix osd request queuing on osdmap updates · 6f6c7006

由 Sage Weil 提交于 1月 17, 2011

If we send a request to osd A, and the request's pg remaps to osd B and
then back to A in quick succession, we need to resend the request to A. The
old code was only calling kick_requests after processing all incremental
maps in a message, so it was very possible to not resend a request that
needed to be resent.  This would make the osd eventually time out (at least
with the current default of osd timeouts enabled).

The correct approach is to scan requests on every map incremental.  This
patch refactors the kick code in a few ways:
 - all requests are either on req_lru (in flight), req_unsent (ready to
   send), or req_notarget (currently map to no up osd)
 - mapping always done by map_request (previous map_osds)
 - if the mapping changes, we requeue.  requests are resent only after all
   map incrementals are processed.
 - some osd reset code is moved out of kick_requests into a separate
   function
 - the "kick this osd" functionality is moved to kick_osd_requests, as it
   is unrelated to scanning for request->pg->osd mapping changes
Signed-off-by: NSage Weil <sage@newdream.net>

6f6c7006

10 11月, 2010 2 次提交

ceph: explicitly specify page alignment in network messages · c5c6b19d

由 Sage Weil 提交于 11月 09, 2010

The alignment used for reading data into or out of pages used to be taken
from the data_off field in the message header. This only worked as long
as the page alignment matched the object offset, breaking direct io to
non-page aligned offsets.

Instead, explicitly specify the page alignment next to the page vector
in the ceph_msg struct, and use that instead of the message header (which
probably shouldn't be trusted). The alloc_msg callback is responsible for
filling in this field properly when it sets up the page vector.
Signed-off-by: NSage Weil <sage@newdream.net>

c5c6b19d

ceph: make page alignment explicit in osd interface · b7495fc2

由 Sage Weil 提交于 11月 09, 2010

We used to infer alignment of IOs within a page based on the file offset,
which assumed they matched. This broke with direct IO that was not aligned
to pages (e.g., 512-byte aligned IO). We were also trusting the alignment
specified in the OSD reply, which could have been adjusted by the server.

Explicitly specify the page alignment when setting up OSD IO requests.
Signed-off-by: NSage Weil <sage@newdream.net>

b7495fc2

21 10月, 2010 4 次提交

ceph: factor out libceph from Ceph file system · 3d14c5d2

由 Yehuda Sadeh 提交于 4月 06, 2010

This factors out protocol and low-level storage parts of ceph into a
separate libceph module living in net/ceph and include/linux/ceph.  This
is mostly a matter of moving files around.  However, a few key pieces
of the interface change as well:

 - ceph_client becomes ceph_fs_client and ceph_client, where the latter
   captures the mon and osd clients, and the fs_client gets the mds client
   and file system specific pieces.
 - Mount option parsing and debugfs setup is correspondingly broken into
   two pieces.
 - The mon client gets a generic handler callback for otherwise unknown
   messages (mds map, in this case).
 - The basic supported/required feature bits can be expanded (and are by
   ceph_fs_client).

No functional change, aside from some subtle error handling cases that got
cleaned up in the refactoring process.
Signed-off-by: NSage Weil <sage@newdream.net>

3d14c5d2

Y
ceph-rbd: osdc support for osd call and rollback operations · ae1533b6
由 Yehuda Sadeh 提交于 5月 18, 2010
```
This will be used for rbd snapshots administration.
Signed-off-by: NYehuda Sadeh <yehuda@hq.newdream.net>
```
ae1533b6

ceph: messenger and osdc changes for rbd · 68b4476b

由 Yehuda Sadeh 提交于 4月 06, 2010

Allow the messenger to send/receive data in a bio.  This is added
so that we wouldn't need to copy the data into pages or some other buffer
when doing IO for an rbd block device.

We can now have trailing variable sized data for osd
ops.  Also osd ops encoding is more modular.
Signed-off-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

68b4476b

ceph: refactor osdc requests creation functions · 3499e8a5

由 Yehuda Sadeh 提交于 4月 06, 2010

The osd requests creation are being decoupled from the
vino parameter, allowing clients using the osd to use
other arbitrary object names that are not necessarily
vino based. Also, calc_raw_layout now takes a snap id.
Signed-off-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

3499e8a5

07 10月, 2010 1 次提交

ceph: avoid null deref in osd request error path · 6bc18876

由 Sage Weil 提交于 9月 27, 2010

If we interrupt an osd request, we call __cancel_request, but it wasn't
verifying that req->r_osd was non-NULL before dereferencing it. This could
cause a crash if osds were flapping and we aborted a request on said osd.
Reported-by: NHenry C Chang <henry_c_chang@tcloudcomputing.com>
Signed-off-by: NSage Weil <sage@newdream.net>

6bc18876

23 8月, 2010 1 次提交

ceph: fix osd request lru adjustment when sending request · 07a27e22

由 Henry C Chang 提交于 8月 22, 2010

Fix argument order.  We want to move the item to the end of the list, not
change the position of the head.
Signed-off-by: NHenry C Chang <henry_c_chang@tcloudcomputing.com>
Signed-off-by: NSage Weil <sage@newdream.net>

07a27e22

04 8月, 2010 1 次提交
- S
  ceph: whitespace cleanup · 213c99ee
  由 Sage Weil 提交于 8月 03, 2010
```
Signed-off-by: NSage Weil <sage@newdream.net>
```
  213c99ee
02 8月, 2010 1 次提交

ceph: only set num_pages in calc_layout · 796d6955

由 Sage Weil 提交于 6月 10, 2010

Setting it elsewhere is unnecessary and more fragile.
Signed-off-by: NSage Weil <sage@newdream.net>

796d6955

28 7月, 2010 1 次提交

ceph: use complete_all and wake_up_all · 03066f23

由 Yehuda Sadeh 提交于 7月 27, 2010

This fixes an issue triggered by running concurrent syncs. One of the syncs
would go through while the other would just hang indefinitely. In any case, we
never actually want to wake a single waiter, so the *_all functions should
be used.
Signed-off-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

03066f23

14 6月, 2010 1 次提交

ceph: fix map handler error path · 4a32f93d

由 Sage Weil 提交于 6月 13, 2010

Don't leak message if we receive an unexpected message type.
Signed-off-by: NSage Weil <sage@newdream.net>

4a32f93d

30 5月, 2010 1 次提交

ceph: fix leak of osd authorizer · 79494d1b

由 Sage Weil 提交于 5月 27, 2010

Release the ceph_authorizer when releasing osd state.
Signed-off-by: NSage Weil <sage@newdream.net>

79494d1b

22 5月, 2010 1 次提交

ceph: Storage class should be before const qualifier · 9e32789f

由 Tobias Klauser 提交于 5月 20, 2010

The C99 specification states in section 6.11.5:

The placement of a storage-class specifier other than at the beginning
of the declaration specifiers in a declaration is an obsolescent
feature.
Signed-off-by: NTobias Klauser <tklauser@distanz.ch>
Signed-off-by: NSage Weil <sage@newdream.net>

9e32789f

18 5月, 2010 8 次提交

ceph: all allocation functions should get gfp_mask · 34d23762

由 Yehuda Sadeh 提交于 4月 06, 2010

This is essential, as for the rados block device we'll need
to run in different contexts that would need flags that
are other than GFP_NOFS.
Signed-off-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

34d23762

S
ceph: name msgpools; useful error messages · 4f48280e
由 Sage Weil 提交于 4月 24, 2010
```
Signed-off-by: NSage Weil <sage@newdream.net>
```
4f48280e

ceph: osdtimeout=0 for now timeout · f26e681d

由 Sage Weil 提交于 4月 21, 2010

Allow the osd reset timeout to be disabled.
Signed-off-by: NSage Weil <sage@newdream.net>

f26e681d

ceph: wake up mount thread when getting osdmap · c473ad92

由 Yehuda Sadeh 提交于 4月 13, 2010

Now that the mount thread waits for the osdmap, it needs
to be awaken.
Signed-off-by: NYehuda Sadeh <yehuda@hq.newdream.net>

c473ad92

ceph: simplify ceph_msg_new · bb257664

由 Sage Weil 提交于 4月 01, 2010

We only need to pass in front_len.  Callers can attach any other payload
pieces (middle, data) as they see fit.
Signed-off-by: NSage Weil <sage@newdream.net>

bb257664

ceph: make ceph_msg_new return NULL on failure; clean up, fix callers · a79832f2

由 Sage Weil 提交于 4月 01, 2010

Returning ERR_PTR(-ENOMEM) is useless extra work. Return NULL on failure
instead, and fix up the callers (about half of which were wrong anyway).
Signed-off-by: NSage Weil <sage@newdream.net>

a79832f2

ceph: fix theoretically possible double-put on connection · 6f46cb29

由 Sage Weil 提交于 3月 24, 2010

This would only trigger if we bailed out before resetting r_con_filling_msg
because the server reply was corrupt (oversized).
Signed-off-by: NSage Weil <sage@newdream.net>

6f46cb29

ceph: simplify page setup for incoming data · 21b667f6

由 Sage Weil 提交于 3月 04, 2010

Drop largely useless helper __prepare_pages(), and simplify sanity checks.
Signed-off-by: NSage Weil <sage@newdream.net>

21b667f6

12 5月, 2010 2 次提交

ceph: resubmit requests on pg mapping change (not just primary change) · d85b7056

由 Sage Weil 提交于 5月 10, 2010

OSD requests need to be resubmitted on any pg mapping change, not just when
the pg primary changes. Resending only when the primary changes results in
occasional 'hung' requests during osd cluster recovery or rebalancing.
Signed-off-by: NSage Weil <sage@newdream.net>

d85b7056

ceph: unregister osd request on failure · 0ceed5db

由 Sage Weil 提交于 5月 11, 2010

The osd request wasn't being unregistered when the osd returned a failure
code, even though the result was returned to the caller. This would cause
it to eventually time out, and then crash the kernel when it tried to
resend the request using a stale page vector.
Signed-off-by: NSage Weil <sage@newdream.net>

0ceed5db

23 3月, 2010 2 次提交

ceph: avoid reopening osd connections when address hasn't changed · 87b315a5

由 Sage Weil 提交于 3月 22, 2010

We get a fault callback on _every_ tcp connection fault.  Normally, we
want to reopen the connection when that happens.  If the address we have
is bad, however, and connection attempts always result in a connection
refused or similar error, explicitly closing and reopening the msgr
connection just prevents the messenger's backoff logic from kicking in.
The result can be a console full of

[ 3974.417106] ceph: osd11 10.3.14.138:6800 connection failed
[ 3974.423295] ceph: osd11 10.3.14.138:6800 connection failed
[ 3974.429709] ceph: osd11 10.3.14.138:6800 connection failed

Instead, if we get a fault, and have outstanding requests, but the osd
address hasn't changed and the connection never successfully connected in
the first place, do nothing to the osd connection.  The messenger layer
will back off and retry periodically, because we never connected and thus
the lossy bit is not set.

Instead, touch each request's r_stamp so that handle_timeout can tell the
request is still alive and kicking.
Signed-off-by: NSage Weil <sage@newdream.net>

87b315a5

ceph: rename r_sent_stamp r_stamp · 3dd72fc0

由 Sage Weil 提交于 3月 22, 2010

Make variable name slightly more generic, since it will (soon)
reflect either the time the request was sent OR the time it was
last determined to be still retrying.
Signed-off-by: NSage Weil <sage@newdream.net>

3dd72fc0

openanolis / cloud-kernel 大约 1 年 前同步成功

openanolis / cloud-kernel
大约 1 年前同步成功