提交 · 774ac21da76f5c3018428725074e27a3fd40b128 · openeuler / raspberrypi-kernel

26 10月, 2011 7 次提交

libceph: force resend of osd requests if we skip an osdmap · 38d6453c

由 Sage Weil 提交于 10月 14, 2011

If we skip over one or more map epochs, we need to resend all osd requests
because it is possible they remapped to other servers and then back.
Signed-off-by: NSage Weil <sage@newdream.net>

38d6453c

ceph: use kernel DNS resolver · ee3b56f2

由 Noah Watkins 提交于 9月 23, 2011

Change ceph_parse_ips to take either names given as
IP addresses or standard hostnames (e.g. localhost).
The DNS lookup is done using the dns_resolver facility
similar to its use in AFS, NFS, and CIFS.

This patch defines CONFIG_CEPH_LIB_USE_DNS_RESOLVER
that controls if this feature is on or off.
Signed-off-by: NNoah Watkins <noahwatkins@gmail.com>
Signed-off-by: NSage Weil <sage@newdream.net>

ee3b56f2

ceph: fix ceph_monc_init memory leak · 49d9224c

由 Noah Watkins 提交于 9月 12, 2011

failure clean up does not consider ceph_auth_init.
Signed-off-by: NNoah Watkins <noahwatkins@gmail.com>
Signed-off-by: NSage Weil <sage@newdream.net>

49d9224c

libceph: warn on msg allocation failures · f0ed1b7c

由 Sage Weil 提交于 8月 09, 2011

Any non-masked msg allocation failure should generate a warning and stack
trace to the console.  All of these need to eventually be replaced by
safe preallocation or msgpools.
Signed-off-by: NSage Weil <sage@newdream.net>

f0ed1b7c

libceph: don't complain on msgpool alloc failures · b61c2763

由 Sage Weil 提交于 8月 09, 2011

The pool allocation failures are masked by the pool; there is no need to
spam the console about them.  (That's the whole point of having the pool
in the first place.)

Mark msg allocations whose failure is safely handled as such.
Signed-off-by: NSage Weil <sage@newdream.net>

b61c2763

libceph: always preallocate mon connection · f6a2f5be

由 Sage Weil 提交于 8月 09, 2011

Allocate the mon connection on init.  We already reuse it across
reconnects.  Remove now unnecessary (and incomplete) NULL checks.
Signed-off-by: NSage Weil <sage@newdream.net>

f6a2f5be

libceph: create messenger with client · 6ab00d46

由 Sage Weil 提交于 8月 09, 2011

This simplifies the init/shutdown paths, and makes client->msgr available
during the rest of the setup process.
Signed-off-by: NSage Weil <sage@newdream.net>

6ab00d46

29 9月, 2011 2 次提交

libceph: fix pg_temp mapping update · 8adc8b3d

由 Sage Weil 提交于 9月 28, 2011

The incremental map updates have a record for each pg_temp mapping that is
to be add/updated (len > 0) or removed (len == 0).  The old code was
written as if the updates were a complete enumeration; that was just wrong.
Update the code to remove 0-length entries and drop the rbtree traversal.

This avoids misdirected (and hung) requests that manifest as server
errors like

[WRN] client4104 10.0.1.219:0/275025290 misdirected client4104.1:129 0.1 to osd0 not [1,0] in e11/11
Signed-off-by: NSage Weil <sage@newdream.net>

8adc8b3d

libceph: fix pg_temp mapping calculation · 782e182e

由 Sage Weil 提交于 9月 28, 2011

We need to apply the modulo pg_num calculation before looking up a pgid in
the pg_temp mapping rbtree.  This fixes pg_temp mappings, and fixes
(some) misdirected requests that result in messages like

[WRN] client4104 10.0.1.219:0/275025290 misdirected client4104.1:129 0.1 to osd0 not [1,0] in e11/11

on the server and stall make the client block without getting a reply (at
least until the pg_temp mapping goes way, but that can take a long long
time).

Reorder calc_pg_raw() a bit to make more sense.
Signed-off-by: NSage Weil <sage@newdream.net>

782e182e

17 9月, 2011 3 次提交

libceph: fix linger request requeuing · 935b639a

由 Sage Weil 提交于 9月 16, 2011

The r_req_lru_item list node moves between several lists, and that cycle
is not directly related (and does not begin) with __register_request().
Initialize it in the request constructor, not __register_request(). This
fixes later badness (below) when OSDs restart underneath an rbd mount.

Crashes we've seen due to this include:

[  213.974288] kernel BUG at net/ceph/messenger.c:2193!

and

[  144.035274] BUG: unable to handle kernel NULL pointer dereference at 0000000000000048
[  144.035278] IP: [<ffffffffa036c053>] con_work+0x1463/0x2ce0 [libceph]
Signed-off-by: NSage Weil <sage@newdream.net>

935b639a

libceph: fix parse options memory leak · 1cad7893

由 Noah Watkins 提交于 9月 12, 2011

ceph_destroy_options does not free opt->mon_addr that
is allocated in ceph_parse_options.
Signed-off-by: NNoah Watkins <noahwatkins@gmail.com>
Signed-off-by: NSage Weil <sage@newdream.net>

1cad7893

libceph: initialize ack_stamp to avoid unnecessary connection reset · c0d5f9db

由 Jim Schutt 提交于 9月 16, 2011

Commit 4cf9d544 recorded when an outgoing ceph message was ACKed,
in order to avoid unnecessary connection resets when an OSD is busy.

However, ack_stamp is uninitialized, so there is a window between
when the message is sent and when it is ACKed in which handle_timeout()
interprets the unitialized value as an expired timeout, and resets
the connection unnecessarily.

Close the window by initializing ack_stamp.
Signed-off-by: NJim Schutt <jaschut@sandia.gov>
Signed-off-by: NSage Weil <sage@newdream.net>

c0d5f9db

01 9月, 2011 1 次提交

libceph: fix leak of osd structs during shutdown · aca420bc

由 Sage Weil 提交于 8月 31, 2011

We want to remove all OSDs, not just those on the idle LRU.
Signed-off-by: NSage Weil <sage@newdream.net>

aca420bc

10 8月, 2011 1 次提交

libceph: fix msgpool · 5185352c

由 Sage Weil 提交于 8月 09, 2011

There were several problems here:

 1- we weren't tagging allocations with the pool, so they were never
    returned to the pool.
 2- msgpool_put didn't add back to the mempool, even it were called.
 3- msgpool_release didn't clear the pool pointer, so it would have looped
    had #1 not been broken.

These may or may not have been responsible for #1136 or #1381 (BUG due to
non-empty mempool on umount).  I can't seem to trigger the crash now using
the method I was using before.
Signed-off-by: NSage Weil <sage@newdream.net>

5185352c

27 7月, 2011 1 次提交

libceph: don't time out osd requests that haven't been received · 4cf9d544

由 Sage Weil 提交于 7月 26, 2011

Keep track of when an outgoing message is ACKed (i.e., the server fully
received it and, presumably, queued it for processing). Time out OSD
requests only if it's been too long since they've been received.

This prevents timeouts and connection thrashing when the OSDs are simply
busy and are throttling the requests they read off the network.
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

4cf9d544

20 7月, 2011 1 次提交

ceph: fix file mode calculation · 38be7a79

由 Sage Weil 提交于 7月 19, 2011

open(2) must always include one of O_RDONLY, O_WRONLY, or O_RDWR.  No need
for any O_APPEND special case.

Passing O_WRONLY|O_RDWR is undefined according to the man page, but the
Linux VFS interprets this as O_RDWR, so we'll do the same.

This fixes open(2) with flags O_RDWR|O_APPEND, which was incorrectly being
translated to readonly.
Reported-by: NFyodor Ustinov <ufm@ufm.su>
Signed-off-by: NSage Weil <sage@newdream.net>

38be7a79

17 6月, 2011 1 次提交

net: Remove casts of void * · ea110733

由 Joe Perches 提交于 6月 13, 2011

Unnecessary casts of void * clutter the code.

These are the remainder casts after several specific
patches to remove netdev_priv and dev_priv.

Done via coccinelle script:

$ cat cast_void_pointer.cocci
@@
type T;
T *pt;
void *pv;
@@

- pt = (T *)pv;
+ pt = pv;
Signed-off-by: NJoe Perches <joe@perches.com>
Acked-by: NPaul Moore <paul.moore@hp.com>
Signed-off-by: NDavid S. Miller <davem@conan.davemloft.net>

ea110733

14 6月, 2011 1 次提交

libceph: fix page calculation for non-page-aligned io · 9bb0ce2b

由 Sage Weil 提交于 6月 13, 2011

Set the page count correctly for non-page-aligned IO.  We were already
doing this correctly for alignment, but not the page count.  Fixes
DIRECT_IO writes from unaligned pages.
Signed-off-by: NSage Weil <sage@newdream.net>

9bb0ce2b

08 6月, 2011 1 次提交

ceph: fix sync vs canceled write · 25845472

由 Sage Weil 提交于 6月 03, 2011

If we cancel a write, trigger the safe completions to prevent a sync from
blocking indefinitely in ceph_osdc_sync().
Signed-off-by: NSage Weil <sage@newdream.net>

25845472

25 5月, 2011 2 次提交

libceph: subscribe to osdmap when cluster is full · cd634fb6

由 Sage Weil 提交于 5月 12, 2011

When the cluster is marked full, subscribe to subsequent map updates to
ensure we find out promptly when it is no longer full. This will prevent
us from spewing ENOSPC for (much) longer than necessary.
Signed-off-by: NSage Weil <sage@newdream.net>

cd634fb6

libceph: handle new osdmap down/state change encoding · 7662d8ff

由 Sage Weil 提交于 5月 03, 2011

Old incrementals encode a 0 value (nearly always) when an osd goes down.
Change that to allow any state bit(s) to be flipped. Special case 0 to
mean flip the CEPH_OSD_UP bit to mimic the old behavior.
Signed-off-by: NSage Weil <sage@newdream.net>

7662d8ff

20 5月, 2011 8 次提交

ceph: check return value for start_request in writepages · 9d6fcb08

由 Sage Weil 提交于 5月 12, 2011

Since we pass the nofail arg, we should never get an error; BUG if we do.
(And fix the function to not return an error if __map_request fails.)
Signed-off-by: NSage Weil <sage@newdream.net>

9d6fcb08

S
libceph: add missing breaks in addr_set_port · a2a79609
由 Sage Weil 提交于 5月 12, 2011
```
Signed-off-by: NSage Weil <sage@newdream.net>
```
a2a79609

libceph: fix TAG_WAIT case · 04177882

由 Sage Weil 提交于 5月 12, 2011

If we get a WAIT as a client something went wrong; error out.  And don't
fall through to an unrelated case.
Signed-off-by: NSage Weil <sage@newdream.net>

04177882

S
libceph: fix osdmap timestamp assignment · 31456665
由 Sage Weil 提交于 5月 12, 2011
```
Signed-off-by: NSage Weil <sage@newdream.net>
```
31456665
S
libceph: use snprintf for unknown addrs · 12a2f643
由 Sage Weil 提交于 5月 12, 2011
```
Signed-off-by: NSage Weil <sage@newdream.net>
```
12a2f643
S
libceph: use snprintf for formatting object name · 2dab036b
由 Sage Weil 提交于 5月 12, 2011
```
Signed-off-by: NSage Weil <sage@newdream.net>
```
2dab036b

libceph: fix uninitialized value when no get_authorizer method is set · e8f54ce1

由 Sage Weil 提交于 5月 12, 2011

If there is no get_authorizer method we set the out_kvec to a bogus
pointer. The length is also zero in that case, so it doesn't much matter,
but it's better not to add the empty item in the first place.
Signed-off-by: NSage Weil <sage@newdream.net>

e8f54ce1

libceph: handle connection reopen race with callbacks · 0da5d703

由 Sage Weil 提交于 5月 19, 2011

If a connection is closed and/or reopened (ceph_con_close, ceph_con_open)
it can race with a callback.  con_work does various state checks for
closed or reopened sockets at the beginning, but drops con->mutex before
making callbacks.  We need to check for state bit changes after retaking
the lock to ensure we restart con_work and execute those CLOSED/OPENING
tests or else we may end up operating under stale assumptions.

In Jim's case, this was causing 'bad tag' errors.

There are four cases where we re-take the con->mutex inside con_work: catch
them all and return EAGAIN from try_{read,write} so that we can restart
con_work.
Reported-by: NJim Schutt <jaschut@sandia.gov>
Tested-by: NJim Schutt <jaschut@sandia.gov>
Signed-off-by: NSage Weil <sage@newdream.net>

0da5d703

04 5月, 2011 2 次提交

S
libceph: fix ceph_osdc_alloc_request error checks · 4ad12621
由 Sage Weil 提交于 5月 03, 2011
```
ceph_osdc_alloc_request returns NULL on failure.
Signed-off-by: NSage Weil <sage@newdream.net>
```
4ad12621

libceph: fix ceph_msg_new error path · ca20892d

由 Henry C Chang 提交于 5月 03, 2011

If memory allocation failed, calling ceph_msg_put() will cause GPF
since some of ceph_msg variables are not initialized first.

Fix Bug #970.
Signed-off-by: NHenry C Chang <henry_c_chang@tcloudcomputing.com>
Signed-off-by: NSage Weil <sage@newdream.net>

ca20892d

07 4月, 2011 1 次提交

libceph: fix linger request requeueing · 77f38e0e

由 Sage Weil 提交于 4月 06, 2011

Fix the request transition from linger -> normal request.  The key is to
preserve r_osd and requeue on the same OSD.  Reregister as a normal request,
add the request to the proper queues, then unregister the linger.  Fix the
unregister helper to avoid clearing r_osd (and also simplify the parallel
check in __unregister_request()).
Reported-by: NHenry Chang <henry.cy.chang@gmail.com>
Signed-off-by: NSage Weil <sage@newdream.net>

77f38e0e

31 3月, 2011 1 次提交

Fix common misspellings · 25985edc

由 Lucas De Marchi 提交于 3月 30, 2011

Fixes generated by 'codespell' and manually reviewed.
Signed-off-by: NLucas De Marchi <lucas.demarchi@profusion.mobi>

25985edc

30 3月, 2011 4 次提交

libceph: Create a new key type "ceph". · 4b2a58ab

由 Tommi Virtanen 提交于 3月 28, 2011

This allows us to use existence of the key type as a feature test,
from userspace.
Signed-off-by: NTommi Virtanen <tommi.virtanen@dreamhost.com>
Signed-off-by: NSage Weil <sage@newdream.net>

4b2a58ab

T
libceph: Get secret from the kernel keys api when mounting with key=NAME. · e2c3d29b
由 Tommi Virtanen 提交于 3月 25, 2011
```
Signed-off-by: NTommi Virtanen <tommi.virtanen@dreamhost.com>
Signed-off-by: NSage Weil <sage@newdream.net>
```
e2c3d29b

ceph: Move secret key parsing earlier. · 8323c3aa

由 Tommi Virtanen 提交于 3月 25, 2011

This makes the base64 logic be contained in mount option parsing,
and prepares us for replacing the homebew key management with the
kernel key retention service.
Signed-off-by: NTommi Virtanen <tommi.virtanen@dreamhost.com>
Signed-off-by: NSage Weil <sage@newdream.net>

8323c3aa

libceph: fix null dereference when unregistering linger requests · fbdb9190

由 Sage Weil 提交于 3月 29, 2011

We should only clear r_osd if we are neither registered as a linger or a
regular request. We may unregister as a linger while still registered as
a regular request (e.g., in reset_osd). Incorrectly clearing r_osd there
leads to a null pointer dereference in __send_request.

Also simplify the parallel check in __unregister_request() where we just
removed r_osd_item and know it's empty.
Signed-off-by: NSage Weil <sage@newdream.net>

fbdb9190

29 3月, 2011 1 次提交

ceph: unlock on error in ceph_osdc_start_request() · 234af26f

由 Dan Carpenter 提交于 3月 29, 2011

There was a missing unlock on the error path if __map_request() failed.
Signed-off-by: NDan Carpenter <error27@gmail.com>
Signed-off-by: NSage Weil <sage@newdream.net>

234af26f

27 3月, 2011 1 次提交

ceph: fix possible NULL pointer dereference · 6b0ae409

由 Mariusz Kozlowski 提交于 3月 26, 2011

This patch fixes 'event_work' dereference before it is checked for NULL.
Signed-off-by: NMariusz Kozlowski <mk@lab.zgora.pl>
Signed-off-by: NSage Weil <sage@newdream.net>

6b0ae409

26 3月, 2011 1 次提交

ceph: flush msgr_wq during mds_client shutdown · ef550f6f

由 Sage Weil 提交于 3月 25, 2011

The release method for mds connections uses a backpointer to the
mds_client, so we need to flush the workqueue of any pending work (and
ceph_connection references) prior to freeing the mds_client.  This fixes
an oops easily triggered under UML by

 while true ; do mount ... ; umount ... ; done

Also fix an outdated comment: the flush in ceph_destroy_client only flushes
OSD connections out.  This bug is basically an artifact of the ceph ->
ceph+libceph conversion.
Signed-off-by: NSage Weil <sage@newdream.net>

ef550f6f