提交 · 3b663780347ce532b08be1c859b1df14f0eea4c8 · openeuler / raspberrypi-kernel

04 5月, 2011 2 次提交

S
libceph: fix ceph_osdc_alloc_request error checks · 4ad12621
由 Sage Weil 提交于 5月 03, 2011
```
ceph_osdc_alloc_request returns NULL on failure.
Signed-off-by: NSage Weil <sage@newdream.net>
```
4ad12621

libceph: fix ceph_msg_new error path · ca20892d

由 Henry C Chang 提交于 5月 03, 2011

If memory allocation failed, calling ceph_msg_put() will cause GPF
since some of ceph_msg variables are not initialized first.

Fix Bug #970.
Signed-off-by: NHenry C Chang <henry_c_chang@tcloudcomputing.com>
Signed-off-by: NSage Weil <sage@newdream.net>

ca20892d

07 4月, 2011 1 次提交

libceph: fix linger request requeueing · 77f38e0e

由 Sage Weil 提交于 4月 06, 2011

Fix the request transition from linger -> normal request.  The key is to
preserve r_osd and requeue on the same OSD.  Reregister as a normal request,
add the request to the proper queues, then unregister the linger.  Fix the
unregister helper to avoid clearing r_osd (and also simplify the parallel
check in __unregister_request()).
Reported-by: NHenry Chang <henry.cy.chang@gmail.com>
Signed-off-by: NSage Weil <sage@newdream.net>

77f38e0e

31 3月, 2011 1 次提交

Fix common misspellings · 25985edc

由 Lucas De Marchi 提交于 3月 30, 2011

Fixes generated by 'codespell' and manually reviewed.
Signed-off-by: NLucas De Marchi <lucas.demarchi@profusion.mobi>

25985edc

30 3月, 2011 4 次提交

libceph: Create a new key type "ceph". · 4b2a58ab

由 Tommi Virtanen 提交于 3月 28, 2011

This allows us to use existence of the key type as a feature test,
from userspace.
Signed-off-by: NTommi Virtanen <tommi.virtanen@dreamhost.com>
Signed-off-by: NSage Weil <sage@newdream.net>

4b2a58ab

T
libceph: Get secret from the kernel keys api when mounting with key=NAME. · e2c3d29b
由 Tommi Virtanen 提交于 3月 25, 2011
```
Signed-off-by: NTommi Virtanen <tommi.virtanen@dreamhost.com>
Signed-off-by: NSage Weil <sage@newdream.net>
```
e2c3d29b

ceph: Move secret key parsing earlier. · 8323c3aa

由 Tommi Virtanen 提交于 3月 25, 2011

This makes the base64 logic be contained in mount option parsing,
and prepares us for replacing the homebew key management with the
kernel key retention service.
Signed-off-by: NTommi Virtanen <tommi.virtanen@dreamhost.com>
Signed-off-by: NSage Weil <sage@newdream.net>

8323c3aa

libceph: fix null dereference when unregistering linger requests · fbdb9190

由 Sage Weil 提交于 3月 29, 2011

We should only clear r_osd if we are neither registered as a linger or a
regular request. We may unregister as a linger while still registered as
a regular request (e.g., in reset_osd). Incorrectly clearing r_osd there
leads to a null pointer dereference in __send_request.

Also simplify the parallel check in __unregister_request() where we just
removed r_osd_item and know it's empty.
Signed-off-by: NSage Weil <sage@newdream.net>

fbdb9190

29 3月, 2011 1 次提交

ceph: unlock on error in ceph_osdc_start_request() · 234af26f

由 Dan Carpenter 提交于 3月 29, 2011

There was a missing unlock on the error path if __map_request() failed.
Signed-off-by: NDan Carpenter <error27@gmail.com>
Signed-off-by: NSage Weil <sage@newdream.net>

234af26f

27 3月, 2011 1 次提交

ceph: fix possible NULL pointer dereference · 6b0ae409

由 Mariusz Kozlowski 提交于 3月 26, 2011

This patch fixes 'event_work' dereference before it is checked for NULL.
Signed-off-by: NMariusz Kozlowski <mk@lab.zgora.pl>
Signed-off-by: NSage Weil <sage@newdream.net>

6b0ae409

26 3月, 2011 1 次提交

ceph: flush msgr_wq during mds_client shutdown · ef550f6f

由 Sage Weil 提交于 3月 25, 2011

The release method for mds connections uses a backpointer to the
mds_client, so we need to flush the workqueue of any pending work (and
ceph_connection references) prior to freeing the mds_client.  This fixes
an oops easily triggered under UML by

 while true ; do mount ... ; umount ... ; done

Also fix an outdated comment: the flush in ceph_destroy_client only flushes
OSD connections out.  This bug is basically an artifact of the ceph ->
ceph+libceph conversion.
Signed-off-by: NSage Weil <sage@newdream.net>

ef550f6f

23 3月, 2011 1 次提交

libceph: add lingering request and watch/notify event framework · a40c4f10

由 Yehuda Sadeh 提交于 3月 21, 2011

Lingering requests are requests that are sent to the OSD normally but
tracked also after we get a successful request.  This keeps the OSD
connection open and resends the original request if the object moves to
another OSD.  The OSD can then send notification messages back to us
if another client initiates a notify.

This framework will be used by RBD so that the client gets notification
when a snapshot is created by another node or tool.
Signed-off-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

a40c4f10

22 3月, 2011 1 次提交

libceph: fix osd request queuing on osdmap updates · 6f6c7006

由 Sage Weil 提交于 1月 17, 2011

If we send a request to osd A, and the request's pg remaps to osd B and
then back to A in quick succession, we need to resend the request to A. The
old code was only calling kick_requests after processing all incremental
maps in a message, so it was very possible to not resend a request that
needed to be resent.  This would make the osd eventually time out (at least
with the current default of osd timeouts enabled).

The correct approach is to scan requests on every map incremental.  This
patch refactors the kick code in a few ways:
 - all requests are either on req_lru (in flight), req_unsent (ready to
   send), or req_notarget (currently map to no up osd)
 - mapping always done by map_request (previous map_osds)
 - if the mapping changes, we requeue.  requests are resent only after all
   map incrementals are processed.
 - some osd reset code is moved out of kick_requests into a separate
   function
 - the "kick this osd" functionality is moved to kick_osd_requests, as it
   is unrelated to scanning for request->pg->osd mapping changes
Signed-off-by: NSage Weil <sage@newdream.net>

6f6c7006

16 3月, 2011 1 次提交

libceph: Fix base64-decoding when input ends in newline. · b09734b1

由 Tommi Virtanen 提交于 2月 02, 2011

It used to return -EINVAL because it thought the end was not aligned
to 4 bytes.

Clean up superfluous src < end test in if, the while itself guarantees
that.
Signed-off-by: NTommi Virtanen <tommi.virtanen@dreamhost.com>
Signed-off-by: NSage Weil <sage@newdream.net>

b09734b1

05 3月, 2011 3 次提交

libceph: fix msgr standby handling · e00de341

由 Sage Weil 提交于 3月 04, 2011

The standby logic used to be pretty dependent on the work requeueing
behavior that changed when we switched to WQ_NON_REENTRANT.  It was also
very fragile.

Restructure things so that:
 - We clear WRITE_PENDING when we set STANDBY.  This ensures we will
   requeue work when we wake up later.
 - con_work backs off if STANDBY is set.  There is nothing to do if we are
   in standby.
 - clear_standby() helper is called by both con_send() and con_keepalive(),
   the two actions that can wake us up again.  Move the connect_seq++
   logic here.
Signed-off-by: NSage Weil <sage@newdream.net>

e00de341

libceph: fix msgr keepalive flag · e76661d0

由 Sage Weil 提交于 3月 03, 2011

There was some broken keepalive code using a dead variable.  Shift to using
the proper bit flag.
Signed-off-by: NSage Weil <sage@newdream.net>

e76661d0

libceph: fix msgr backoff · 60bf8bf8

由 Sage Weil 提交于 3月 04, 2011

With commit f363e45f we replaced a bunch of hacky workqueue mutual
exclusion logic with the WQ_NON_REENTRANT flag.  One pieces of fallout is
that the exponential backoff breaks in certain cases:

 * con_work attempts to connect.
 * we get an immediate failure, and the socket state change handler queues
   immediate work.
 * con_work calls con_fault, we decide to back off, but can't queue delayed
   work.

In this case, we add a BACKOFF bit to make con_work reschedule delayed work
next time it runs (which should be immediately).
Signed-off-by: NSage Weil <sage@newdream.net>

60bf8bf8

04 3月, 2011 2 次提交

libceph: retry after authorization failure · 692d20f5

由 Sage Weil 提交于 3月 03, 2011

If we mark the connection CLOSED we will give up trying to reconnect to
this server instance. That is appropriate for things like a protocol
version mismatch that won't change until the server is restarted, at which
point we'll get a new addr and reconnect. An authorization failure like
this is probably due to the server not properly rotating it's secret keys,
however, and should be treated as transient so that the normal backoff and
retry behavior kicks in.
Signed-off-by: NSage Weil <sage@newdream.net>

692d20f5

libceph: fix handling of short returns from get_user_pages · 38815b78

由 Sage Weil 提交于 3月 02, 2011

get_user_pages() can return fewer pages than we ask for. We were returning
a bogus pointer/error code in that case. Instead, loop until we get all
the pages we want or get an error we can return to the caller.
Signed-off-by: NSage Weil <sage@newdream.net>

38815b78

26 1月, 2011 2 次提交

libceph: fix socket write error handling · 42961d23

由 Sage Weil 提交于 1月 25, 2011

Pass errors from writing to the socket up the stack.  If we get -EAGAIN,
return 0 from the helper to simplify the callers' checks.
Signed-off-by: NSage Weil <sage@newdream.net>

42961d23

libceph: fix socket read error handling · 98bdb0aa

由 Sage Weil 提交于 1月 25, 2011

If we get EAGAIN when trying to read from the socket, it is not an error.
Return 0 from the helper in this case to simplify the error handling cases
in the caller (indirectly, try_read).

Fix try_read to pass any error to it's caller (con_work) instead of almost
always returning 0.  This let's us respond to things like socket
disconnects.
Signed-off-by: NSage Weil <sage@newdream.net>

98bdb0aa

13 1月, 2011 3 次提交

net/ceph: make ceph_msgr_wq non-reentrant · f363e45f

由 Tejun Heo 提交于 1月 03, 2011

ceph messenger code does a rather complex dancing around multithread
workqueue to make sure the same work item isn't executed concurrently
on different CPUs.  This restriction can be provided by workqueue with
WQ_NON_REENTRANT.

Make ceph_msgr_wq non-reentrant workqueue with the default concurrency
level and remove the QUEUED/BUSY logic.

* This removes backoff handling in con_work() but it couldn't reliably
  block execution of con_work() to begin with - queue_con() can be
  called after the work started but before BUSY is set.  It seems that
  it was an optimization for a rather cold path and can be safely
  removed.

* The number of concurrent work items is bound by the number of
  connections and connetions are independent from each other.  With
  the default concurrency level, different connections will be
  executed independently.
Signed-off-by: NTejun Heo <tj@kernel.org>
Cc: Sage Weil <sage@newdream.net>
Cc: ceph-devel@vger.kernel.org
Signed-off-by: NSage Weil <sage@newdream.net>

f363e45f

ceph: Always free allocated memory in osdmap_decode() · b0aee351

由 Jesper Juhl 提交于 12月 24, 2010

Always free memory allocated to 'pi' in
net/ceph/osdmap.c::osdmap_decode().
Signed-off-by: NJesper Juhl <jj@chaosbits.net>
Signed-off-by: NSage Weil <sage@newdream.net>

b0aee351

ceph: add dir_layout to inode · 6c0f3af7

由 Sage Weil 提交于 11月 16, 2010

Add a ceph_dir_layout to the inode, and calculate dentry hash values based
on the parent directory's specified dir_hash function. This is needed
because the old default Linux dcache hash function is extremely week and
leads to a poor distribution of files among dir fragments.
Signed-off-by: NSage Weil <sage@newdream.net>

6c0f3af7

18 12月, 2010 2 次提交

ceph: handle partial result from get_user_pages · 361cf405

由 Henry C Chang 提交于 12月 17, 2010

The get_user_pages() helper can return fewer than the requested pages.
Error out in that case, and clean up the partial result.
Signed-off-by: NHenry C Chang <henry_c_chang@tcloudcomputing.com>
Signed-off-by: NSage Weil <sage@newdream.net>

361cf405

ceph: mark user pages dirty on direct-io reads · b6aa5901

由 Henry C Chang 提交于 12月 15, 2010

For read operation, we have to set the argument _write_ of get_user_pages
to 1 since we will write data to pages. Also, we need to SetPageDirty before
releasing these pages.
Signed-off-by: NHenry C Chang <henry_c_chang@tcloudcomputing.com>
Signed-off-by: NSage Weil <sage@newdream.net>

b6aa5901

14 12月, 2010 1 次提交

ceph: fix msgr_init error path · d96c9043

由 Sage Weil 提交于 12月 13, 2010

create_workqueue() returns NULL on failure.
Signed-off-by: NSage Weil <sage@newdream.net>

d96c9043

28 11月, 2010 1 次提交

Net: ceph: Makefile: Remove unnessary code · 4cb6a614

由 Tracey Dent 提交于 11月 21, 2010

Remove the if and else conditional because the code is in mainline and there
is no need in it being there.
Signed-off-by: NTracey Dent <tdent48227@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

4cb6a614

23 11月, 2010 1 次提交

Net: ceph: Makefile: remove deprecated kbuild goal definitions · fa13bc3d

由 Tracey Dent 提交于 11月 21, 2010

Changed Makefile to use <modules>-y instead of <modules>-objs
because -objs is deprecated and not mentioned in
Documentation/kbuild/makefiles.txt.
Signed-off-by: NTracey Dent <tdent48227@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

fa13bc3d

22 11月, 2010 1 次提交

net: allow GFP_HIGHMEM in __vmalloc() · 7a1c8e5a

由 Eric Dumazet 提交于 11月 20, 2010

We forgot to use __GFP_HIGHMEM in several __vmalloc() calls.

In ceph, add the missing flag.

In fib_trie.c, xfrm_hash.c and request_sock.c, using vzalloc() is
cleaner and allows using HIGHMEM pages as well.
Signed-off-by: NEric Dumazet <eric.dumazet@gmail.com>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>

7a1c8e5a

10 11月, 2010 3 次提交

ceph: explicitly specify page alignment in network messages · c5c6b19d

由 Sage Weil 提交于 11月 09, 2010

The alignment used for reading data into or out of pages used to be taken
from the data_off field in the message header. This only worked as long
as the page alignment matched the object offset, breaking direct io to
non-page aligned offsets.

Instead, explicitly specify the page alignment next to the page vector
in the ceph_msg struct, and use that instead of the message header (which
probably shouldn't be trusted). The alloc_msg callback is responsible for
filling in this field properly when it sets up the page vector.
Signed-off-by: NSage Weil <sage@newdream.net>

c5c6b19d

ceph: make page alignment explicit in osd interface · b7495fc2

由 Sage Weil 提交于 11月 09, 2010

We used to infer alignment of IOs within a page based on the file offset,
which assumed they matched. This broke with direct IO that was not aligned
to pages (e.g., 512-byte aligned IO). We were also trusting the alignment
specified in the OSD reply, which could have been adjusted by the server.

Explicitly specify the page alignment when setting up OSD IO requests.
Signed-off-by: NSage Weil <sage@newdream.net>

b7495fc2

S
ceph: fix comment, remove extraneous args · e98b6fed
由 Sage Weil 提交于 11月 09, 2010
```
The offset/length arguments aren't used.
Signed-off-by: NSage Weil <sage@newdream.net>
```
e98b6fed

02 11月, 2010 1 次提交

ceph: fix small seq message skipping · df9f86fa

由 Sage Weil 提交于 11月 01, 2010

If the client gets out of sync with the server message sequence number, we
normally skip low seq messages (ones we already received).  The skip code
was also incrementing the expected seq, such that all subsequent messages
also appeared old and got skipped, and an eventual timeout on the osd
connection.  This resulted in some lagging requests and console messages
like

[233480.882885] ceph: skipping osd22 10.138.138.13:6804 seq 2016, expected 2017
[233480.882919] ceph: skipping osd22 10.138.138.13:6804 seq 2017, expected 2018
[233480.882963] ceph: skipping osd22 10.138.138.13:6804 seq 2018, expected 2019
[233480.883488] ceph: skipping osd22 10.138.138.13:6804 seq 2019, expected 2020
[233485.219558] ceph: skipping osd22 10.138.138.13:6804 seq 2020, expected 2021
[233485.906595] ceph: skipping osd22 10.138.138.13:6804 seq 2021, expected 2022
[233490.379536] ceph: skipping osd22 10.138.138.13:6804 seq 2022, expected 2023
[233495.523260] ceph: skipping osd22 10.138.138.13:6804 seq 2023, expected 2024
[233495.923194] ceph: skipping osd22 10.138.138.13:6804 seq 2024, expected 2025
[233500.534614] ceph:  tid 6023602 timed out on osd22, will reset osd
Reported-by: NTheodore Ts'o <tytso@mit.edu>
Signed-off-by: NSage Weil <sage@newdream.net>

df9f86fa

21 10月, 2010 5 次提交

ceph: fix num_pages_free accounting in pagelist · 240634e9

由 Sage Weil 提交于 10月 05, 2010

Decrement the free page counter when removing a page from the free_list.
Signed-off-by: NSage Weil <sage@newdream.net>

240634e9

ceph: don't crash when passed bad mount options · 010e3b48

由 Yehuda Sadeh 提交于 9月 30, 2010

This only happened when parse_extra_token was not passed
to ceph_parse_option() (hence, only happened in rbd).
Signed-off-by: NYehuda Sadeh <yehuda@hq.newdream.net>

010e3b48

ceph: add pagelist_reserve, pagelist_truncate, pagelist_set_cursor · ac0b74d8

由 Greg Farnum 提交于 9月 17, 2010

These facilitate preallocation of pages so that we can encode into the pagelist
in an atomic context.
Signed-off-by: NGreg Farnum <gregf@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

ac0b74d8

rbd: introduce rados block device (rbd), based on libceph · 602adf40

由 Yehuda Sadeh 提交于 8月 12, 2010

The rados block device (rbd), based on osdblk, creates a block device
that is backed by objects stored in the Ceph distributed object storage
cluster.  Each device consists of a single metadata object and data
striped over many data objects.

The rbd driver supports read-only snapshots.
Signed-off-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

602adf40

ceph: factor out libceph from Ceph file system · 3d14c5d2

由 Yehuda Sadeh 提交于 4月 06, 2010

This factors out protocol and low-level storage parts of ceph into a
separate libceph module living in net/ceph and include/linux/ceph.  This
is mostly a matter of moving files around.  However, a few key pieces
of the interface change as well:

 - ceph_client becomes ceph_fs_client and ceph_client, where the latter
   captures the mon and osd clients, and the fs_client gets the mds client
   and file system specific pieces.
 - Mount option parsing and debugfs setup is correspondingly broken into
   two pieces.
 - The mon client gets a generic handler callback for otherwise unknown
   messages (mds map, in this case).
 - The basic supported/required feature bits can be expanded (and are by
   ceph_fs_client).

No functional change, aside from some subtle error handling cases that got
cleaned up in the refactoring process.
Signed-off-by: NSage Weil <sage@newdream.net>

3d14c5d2