Commits · 6c4a19158b96ea1fb8acbe0c1d5493d9dcd2f147 · openanolis / cloud-kernel

17 May, 2012 · 1 commit

ceph: define ceph_auth_handshake type · 6c4a1915

Committed by Alex Elder on May 16, 2012

The definitions for the ceph_mds_session and ceph_osd both contain
five fields related only to "authorizers."  Encapsulate those fields
into their own struct type, allowing for better isolation in some
upcoming patches.

Fix the #includes in "linux/ceph/osd_client.h" to lay out their more
complete canonical path.
Signed-off-by: Alex Elder <elder@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>

6c4a1915

22 March, 2012 · 1 commit

ceph: don't reset s_cap_ttl to zero · 1ce208a6

Committed by Alex Elder on January 12, 2012

Avoid the need to check for a special zero s_cap_ttl value by just
using (jiffies - 1) as the value assigned to indicate "sometime in
the past."
Signed-off-by: Alex Elder <elder@dreamhost.com>
Reviewed-by: Sage Weil <sage@newdream.net>

1ce208a6
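
The "(jiffies - 1)" idea in the commit above can be illustrated with a small userspace sketch. It is not the kernel code: `tick_before()` only imitates the wraparound-safe comparison that the kernel's `time_before()` provides, and `struct session_sketch`, `expire_caps()` and `caps_valid()` are hypothetical names introduced for illustration.

```c
#include <stdbool.h>

/* Wraparound-safe "a is before b" test for a free-running tick counter.
 * Illustrative stand-in for the kernel's time_before(); it relies on the
 * usual two's-complement conversion from unsigned long to long. */
static bool tick_before(unsigned long a, unsigned long b)
{
    return (long)(a - b) < 0;
}

/* Hypothetical session state holding only the field discussed above. */
struct session_sketch {
    unsigned long cap_ttl;   /* tick after which the caps are stale */
};

/* Mark the caps as already expired. A value one tick behind "now"
 * compares as being in the past, so no special zero value is needed. */
static void expire_caps(struct session_sketch *s, unsigned long now)
{
    s->cap_ttl = now - 1;
}

/* Caps are valid only while "now" is still before the TTL. */
static bool caps_valid(const struct session_sketch *s, unsigned long now)
{
    return tick_before(now, s->cap_ttl);
}
```

Because the comparison is done on the signed difference, the same check keeps working when the counter wraps, which is what makes a dedicated "zero means expired" test unnecessary.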

03 February, 2012 · 2 commits

ceph: create a new session lock to avoid lock inversion · d8fb02ab

Committed by Alex Elder on January 12, 2012

Lockdep was reporting a possible circular lock dependency in
dentry_lease_is_valid().  That function needs to sample the
session's s_cap_gen and s_cap_ttl fields coherently, but needs
to do so while holding a dentry lock.  The s_cap_lock field was
being used to protect the two fields, but that can't be taken while
holding a lock on a dentry within the session.

In most cases, the s_cap_gen and s_cap_ttl fields only get operated
on separately.  But in three cases they need to be updated together.
Implement a new lock to protect the spots where updating both fields
atomically is required.
Signed-off-by: Alex Elder <elder@dreamhost.com>
Reviewed-by: Sage Weil <sage@newdream.net>

d8fb02ab

ceph: fix length validation in parse_reply_info() · 32852a81

Committed by Xi Wang on January 14, 2012

"len" is read from the network and thus needs validation.  Otherwise, given
a bogus "len" value, p+len could be an out-of-bounds pointer, which is
used in further parsing.
Signed-off-by: Xi Wang <xi.wang@gmail.com>
Signed-off-by: Sage Weil <sage@newdream.net>

32852a81
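
The kind of check described in the commit above can be sketched as follows. This is a generic illustration of validating a length read from untrusted input, not the actual `parse_reply_info()` code: the function name `take_blob()` and the 32-bit host-order length prefix are assumptions made for the example.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Read a 32-bit length prefix at *p and return a pointer to the payload
 * it describes, or NULL if the prefix or the payload would run past
 * `end`. *p is advanced past the payload only on success.
 * Hypothetical helper; the prefix is read in host byte order for brevity. */
static const uint8_t *take_blob(const uint8_t **p, const uint8_t *end,
                                uint32_t *len_out)
{
    const uint8_t *cur = *p;
    uint32_t len;

    if ((size_t)(end - cur) < sizeof(len))
        return NULL;                 /* not even room for the prefix */
    memcpy(&len, cur, sizeof(len));
    cur += sizeof(len);

    /* Compare against the bytes that remain; never form cur + len
     * before knowing that it stays inside the buffer. */
    if (len > (size_t)(end - cur))
        return NULL;

    *len_out = len;
    *p = cur + len;
    return cur;
}
```

Comparing `len` with the remaining byte count, rather than comparing `cur + len` with `end`, avoids computing an out-of-bounds pointer in the first place.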

11 January, 2012 · 1 commit

ceph: remove unnecessary d_fsdata conditional checks · 3d8eb7a9

Committed by Sage Weil on November 11, 2011

We now set d_fsdata unconditionally on all dentries prior to setting up
the d_ops, so all of these checks are unnecessary.
Signed-off-by: Sage Weil <sage@newdream.net>

3d8eb7a9

14 December, 2011 · 1 commit

ceph: add missing spin_unlock at ceph_mdsc_build_path() · 9d5a09e6

Committed by Yehuda Sadeh on December 13, 2011

One of the paths was missing spin_unlock.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>

9d5a09e6

08 December, 2011 · 1 commit

ceph: use i_ceph_lock instead of i_lock · be655596

Committed by Sage Weil on November 30, 2011

We have been using i_lock to protect all kinds of data structures in the
ceph_inode_info struct, including lists of inodes that we need to iterate
over while avoiding races with inode destruction.  That requires grabbing
a reference to the inode with the list lock protected, but igrab() now
takes i_lock to check the inode flags.

Changing the list lock ordering would be a painful process.

However, using a ceph-specific i_ceph_lock in the ceph inode instead of
i_lock is a simple mechanical change and avoids the ordering constraints
imposed by igrab().
Reported-by: Amon Ott <a.ott@m-privacy.de>
Signed-off-by: Sage Weil <sage@newdream.net>

be655596

06 November, 2011 · 2 commits

ceph/mds_client.c: quiet sparse noise · 7fd7d101

Committed by H Hartley Sweeten on September 23, 2011

Quiet the following sparse noise:

warning: symbol 'get_nonsnap_parent' was not declared. Should it be static?
warning: symbol 'done_closing_sessions' was not declared. Should it be static?

Local functions don't need external visibility. Make them static.
Signed-off-by: H Hartley Sweeten <hsweeten@visionengravers.com>
Cc: Sage Weil <sage@newdream.net>
Signed-off-by: Sage Weil <sage@newdream.net>

7fd7d101

ceph: use new D_COMPLETE dentry flag · c6ffe100

Committed by Sage Weil on November 03, 2011

We used to use a flag on the directory inode to track whether the dcache
contents for a directory were a complete cached copy. Switch to a dentry
flag CEPH_D_COMPLETE that is safely updated by ->d_prune().
Signed-off-by: Sage Weil <sage@newdream.net>

c6ffe100

26 October, 2011 · 1 commit

libceph: don't complain on msgpool alloc failures · b61c2763

Committed by Sage Weil on August 09, 2011

The pool allocation failures are masked by the pool; there is no need to
spam the console about them.  (That's the whole point of having the pool
in the first place.)

Mark msg allocations whose failure is safely handled as such.
Signed-off-by: Sage Weil <sage@newdream.net>

b61c2763

16 August, 2011 · 1 commit

ceph: fix encoding of ino only (not relative) paths · 795858db

Committed by Sage Weil on August 15, 2011

A 'path' consists of a starting ino and a relative component.  Encode it
even when there is no relative component.  This is primarily needed by the
NFS reexport code.
Signed-off-by: Sage Weil <sage@newdream.net>

795858db
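
As a rough illustration of the commit above, the sketch below encodes a path as a starting ino plus a relative component and still emits a record when the relative component is absent. The byte layout (8-byte ino, 4-byte length, then the path bytes, all in host byte order) and the name `encode_path()` are assumptions for the example, not the Ceph wire format.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Encode (ino, relative path) into buf using a hypothetical layout:
 * 8-byte ino, 4-byte length, then `length` path bytes.
 * A NULL or empty `rel` still produces a record, with length 0.
 * Returns the number of bytes written, or 0 if buf is too small. */
static size_t encode_path(uint8_t *buf, size_t cap, uint64_t ino,
                          const char *rel)
{
    uint32_t len = rel ? (uint32_t)strlen(rel) : 0;
    size_t need = sizeof(ino) + sizeof(len) + len;

    if (need > cap)
        return 0;

    memcpy(buf, &ino, sizeof(ino));
    memcpy(buf + sizeof(ino), &len, sizeof(len));
    if (len)
        memcpy(buf + sizeof(ino) + sizeof(len), rel, len);
    return need;
}
```

The point of the sketch is the `len == 0` case: the ino-only path is encoded like any other path instead of being skipped.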

27 July, 2011 · 4 commits

ceph: document unlocked d_parent accesses · d79698da

Committed by Sage Weil on July 26, 2011

For the most part we don't care about racing with rename when directing
MDS requests; either the old or new parent is fine.  Document that, and
do some minor cleanup.
Reviewed-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: Sage Weil <sage@newdream.net>

d79698da

ceph: explicitly reference rename old_dentry parent dir in request · 41b02e1f

Committed by Sage Weil on July 26, 2011

We carry a pin on the parent directory for the rename source and dest
dentries.  For the source it's r_locked_dir; we need to explicitly
reference the old_dentry parent as well, since the dentry's d_parent may
change between when the request was created and pinned and when it is
freed.
Reviewed-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: Sage Weil <sage@newdream.net>

41b02e1f

ceph: avoid d_parent in ceph_dentry_hash; fix ceph_encode_fh() hashing bug · e5f86dc3

Committed by Sage Weil on July 26, 2011

Have the caller pass in a safely-obtained reference to the parent directory
for calculating a dentry's hash value.

While we're here, simplify the flow through ceph_encode_fh() so that there
is a single exit point and cleanup.

Also fix a bug with the dentry hash calculation: calculate the hash for the
dentry we were given, not its parent.
Reviewed-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: Sage Weil <sage@newdream.net>

e5f86dc3

ceph: ignore lease mask · 2f90b852

Committed by Sage Weil on July 26, 2011

The lease mask is no longer used (and it changed a while back).  Instead,
use a non-zero duration to indicate that there is a lease being issued.
Reviewed-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: Sage Weil <sage@newdream.net>

2f90b852

17 July, 2011 · 1 commit

ceph analog of cifs build_path_from_dentry() race fix · 1b71fe2e

Committed by Al Viro on July 16, 2011

... unfortunately, cifs bug got copied.  Fix is essentially the same.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

1b71fe2e

25 May, 2011 · 1 commit

ceph: fix cap flush race reentrancy · db354052

Committed by Sage Weil on May 24, 2011

In e9964c10 we changed cap flushing to do a delicate dance because some
inodes on the cap_dirty list could be in a migrating state (got EXPORT but
not IMPORT) in which we couldn't actually flush and move from
dirty->flushing, breaking the while (!empty) { process first } loop
structure.  It worked for a single sync thread, but was not reentrant and
triggered infinite loops when multiple syncers came along.

Instead, move inodes with dirty caps to a separate cap_dirty_migrating list
when in the limbo export-but-no-import state, allowing us to go back to
the simple loop structure (which was reentrant).  This is cleaner and more
robust.

Audited the cap_dirty users and this looks fine:
list_empty(&ci->i_dirty_item) is still a reliable indicator of whether we
have dirty caps (which list we're on is irrelevant) and list_del_init()
calls still do the right thing.
Signed-off-by: Sage Weil <sage@newdream.net>

db354052

20 May, 2011 · 2 commits

libceph: remove unused variable · 1b366985

Committed by Sage Weil on May 12, 2011

Signed-off-by: Sage Weil <sage@newdream.net>

1b366985

ceph: take reference on mds request r_unsafe_dir · 3b663780

Committed by Sage Weil on May 18, 2011

We put ourselves on an inode list for the parent directory of metadata
operations so that an fsync on the directory will wait for metadata updates
to commit to disk.  We weren't holding a reference to that directory,
however, and under certain workloads (fsstress in this case) the directory
can go away.
Signed-off-by: Sage Weil <sage@newdream.net>

3b663780

12 May, 2011 · 1 commit

ceph: print debug message before put mds session · 7d8e18a6

Committed by Henry C Chang on May 11, 2011

The mds session, s, could be freed during ceph_put_mds_session.
Move the dout before ceph_put_mds_session.
Signed-off-by: Henry C Chang <henry.cy.chang@gmail.com>
Signed-off-by: Sage Weil <sage@newdream.net>

7d8e18a6

26 March, 2011 · 1 commit

ceph: flush msgr_wq during mds_client shutdown · ef550f6f

Committed by Sage Weil on March 25, 2011

The release method for mds connections uses a backpointer to the
mds_client, so we need to flush the workqueue of any pending work (and
ceph_connection references) prior to freeing the mds_client.  This fixes
an oops easily triggered under UML by

 while true ; do mount ... ; umount ... ; done

Also fix an outdated comment: the flush in ceph_destroy_client only flushes
OSD connections out.  This bug is basically an artifact of the ceph ->
ceph+libceph conversion.
Signed-off-by: Sage Weil <sage@newdream.net>

ef550f6f

26 January, 2011 · 1 commit

ceph: avoid picking MDS that is not active · d66bbd44

Committed by Sage Weil on January 21, 2011

Ignore replication or auth frag data if it indicates an MDS that is not
active.  This can happen if the MDS shuts down and the client has stale
data about the namespace distribution across the MDS cluster.  If that's
the case, fall back to directing the request based on the auth cap (which
should always be accurate).
Signed-off-by: Sage Weil <sage@newdream.net>

d66bbd44
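
The fallback described in the commit above can be sketched in a few lines. This is a simplified model, not the client's actual MDS selection code: `struct mdsmap_sketch`, `MAX_MDS_SKETCH`, `mds_is_active()` and `choose_mds()` are hypothetical names, and the real logic weighs more inputs than a single hint.

```c
#include <stdbool.h>

#define MAX_MDS_SKETCH 8   /* hypothetical cluster size limit */

/* Hypothetical, minimal view of the MDS map: which ranks are active. */
struct mdsmap_sketch {
    bool active[MAX_MDS_SKETCH];
};

static bool mds_is_active(const struct mdsmap_sketch *m, int mds)
{
    return mds >= 0 && mds < MAX_MDS_SKETCH && m->active[mds];
}

/* Prefer the MDS suggested by (possibly stale) replication or auth frag
 * data, but only if it is active; otherwise fall back to the MDS that
 * holds the auth cap. */
static int choose_mds(const struct mdsmap_sketch *m, int frag_hint,
                      int auth_cap_mds)
{
    if (mds_is_active(m, frag_hint))
        return frag_hint;
    return auth_cap_mds;
}
```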

13 January, 2011 · 3 commits

ceph: associate requests with opening sessions · dc69e2e9

Committed by Sage Weil on November 02, 2010

Associate requests with sessions that aren't yet open.  This makes the
debugfs mdsc request list more informative.
Signed-off-by: Sage Weil <sage@newdream.net>

dc69e2e9

ceph: drop redundant r_mds field · 4af25fdd

Committed by Sage Weil on November 02, 2010

The r_mds field is redundant, since we can find the same information at
r_session->s_mds, and when r_session is NULL then r_mds is meaningless.
Signed-off-by: Sage Weil <sage@newdream.net>

4af25fdd

ceph: implement DIRLAYOUTHASH feature to get dir layout from MDS · 14303d20

Committed by Sage Weil on December 14, 2010

This implements the DIRLAYOUTHASH protocol feature, which passes the dir
layout over the wire from the MDS. This gives the client knowledge
of the correct hash function to use for mapping dentries among dir
fragments.

Note that if this feature is _not_ present on the client but is on the
MDS, the client may misdirect requests. This will result in a forward
and degrade performance. It may also result in inaccurate NFS filehandle
generation, which will prevent fh resolution when the inode is not present
in the client cache and the parent directories have been fragmented.
Signed-off-by: Sage Weil <sage@newdream.net>

14303d20

07 January, 2011 · 1 commit

fs: dcache scale dentry refcount · b7ab39f6

Committed by Nick Piggin on January 07, 2011

Make d_count non-atomic and protect it with d_lock. This allows us to ensure a
0 refcount dentry remains 0 without dcache_lock. It is also fairly natural when
we start protecting many other dentry members with d_lock.
Signed-off-by: Nick Piggin <npiggin@kernel.dk>

b7ab39f6

02 December, 2010 · 1 commit

ceph: Handle file locks in replies from the MDS. · 25933abd

Committed by Herb Shiu on December 01, 2010

Previously the kernel client incorrectly assumed everything was a directory.
Signed-off-by: Herb Shiu <herb_shiu@tcloudcomputing.com>
Acked-by: Greg Farnum <gregf@hq.newdream.net>
Signed-off-by: Sage Weil <sage@newdream.net>

25933abd

18 November, 2010 · 1 commit

BKL: remove extraneous #include <smp_lock.h> · 451a3c24

Committed by Arnd Bergmann on November 17, 2010

The big kernel lock has been removed from all these files at some point,
leaving only the #include.

Remove this too as a cleanup.
Signed-off-by: Arnd Bergmann <arnd@arndb.de>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

451a3c24

08 November, 2010 · 1 commit

ceph: fix uid/gid on resent mds requests · cb4276cc

Committed by Sage Weil on November 08, 2010

MDS requests can be rebuilt and resent in non-process context, but were
filling in uid/gid from current_fsuid/gid.  Put that information in the
request struct on request setup.

This fixes incorrect (and root) uid/gid getting set for requests that
are forwarded between MDSs, usually due to metadata migrations.
Signed-off-by: Sage Weil <sage@newdream.net>

cb4276cc

21 October, 2010 · 3 commits

ceph: switch from BKL to lock_flocks() · 496e5955

Committed by Sage Weil on September 22, 2010

Switch from using the BKL explicitly to the new lock_flocks() interface.
Eventually this will turn into a spinlock.
Signed-off-by: Sage Weil <sage@newdream.net>

496e5955

ceph: preallocate flock state without locks held · fca4451a

Committed by Greg Farnum on September 17, 2010

When the lock_kernel() turns into lock_flocks() and a spinlock, we won't
be able to do allocations with the lock held. Preallocate space without
the lock, and retry if the lock state changes out from underneath us.
Signed-off-by: Greg Farnum <gregf@hq.newdream.net>
Signed-off-by: Sage Weil <sage@newdream.net>

fca4451a

ceph: factor out libceph from Ceph file system · 3d14c5d2

Committed by Yehuda Sadeh on April 06, 2010

This factors out protocol and low-level storage parts of ceph into a
separate libceph module living in net/ceph and include/linux/ceph.  This
is mostly a matter of moving files around.  However, a few key pieces
of the interface change as well:

 - ceph_client becomes ceph_fs_client and ceph_client, where the latter
   captures the mon and osd clients, and the fs_client gets the mds client
   and file system specific pieces.
 - Mount option parsing and debugfs setup is correspondingly broken into
   two pieces.
 - The mon client gets a generic handler callback for otherwise unknown
   messages (mds map, in this case).
 - The basic supported/required feature bits can be expanded (and are by
   ceph_fs_client).

No functional change, aside from some subtle error handling cases that got
cleaned up in the refactoring process.
Signed-off-by: Sage Weil <sage@newdream.net>

3d14c5d2

12 September, 2010 · 1 commit

ceph: fix reconnect encoding for old servers · 3612abbd

Committed by Sage Weil on September 07, 2010

Fix the reconnect encoding to encode the cap record when the MDS does not
have the FLOCK capability (i.e., pre v0.22).
Signed-off-by: Sage Weil <sage@newdream.net>

3612abbd

27 August, 2010 · 1 commit

ceph: don't BUG on ENOMEM during mds reconnect · e072f8aa

Committed by Sage Weil on August 26, 2010

We are in a position to return an error; do that instead.
Signed-off-by: Sage Weil <sage@newdream.net>

e072f8aa

23 August, 2010 · 2 commits

ceph: direct requests in snapped namespace based on nonsnap parent · eb6bb1c5

Committed by Sage Weil on August 16, 2010

When making a request in the virtual snapdir or a snapped portion of the
namespace, we should choose the MDS based on the first nonsnap parent (and
its caps). If that is not the best place, we will get forward hints to
find the right MDS in the cluster. This fixes ESTALE errors when using
the .snap directory and namespace with multiple MDSs.
Signed-off-by: Sage Weil <sage@newdream.net>

eb6bb1c5

ceph: fix multiple mds session shutdown · f3c60c59

Committed by Sage Weil on August 11, 2010

The use of a completion when waiting for session shutdown during umount is
inappropriate, given the complexity of the condition.  For multiple MDSs,
this resulted in the umount thread spinning, often preventing the session
close message from being processed.

Switch to a waitqueue and define a condition helper.  This cleans things
up nicely.
Signed-off-by: Sage Weil <sage@newdream.net>

f3c60c59

04 August, 2010 · 1 commit

ceph: whitespace cleanup · 213c99ee

Committed by Sage Weil on August 03, 2010

Signed-off-by: Sage Weil <sage@newdream.net>

213c99ee

03 August, 2010 · 2 commits

ceph: add flock/fcntl lock support · 40819f6f

Committed by Greg Farnum on August 02, 2010

Implement flock inode operation to support advisory file locking.  All
lock/unlock operations are synchronous with the MDS.  Lock state is
sent when reconnecting to a recovering MDS to restore the shared lock
state.
Signed-off-by: Greg Farnum <gregf@hq.newdream.net>
Signed-off-by: Sage Weil <sage@newdream.net>

40819f6f

ceph: support v2 reconnect encoding · 20cb34ae

Committed by Sage Weil on May 12, 2010

Encode either the old or the v2 encoding of the client_reconnect message,
depending on whether the peer has the FLOCK feature bit.
Signed-off-by: Sage Weil <sage@newdream.net>

20cb34ae
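
Choosing an encoding from a negotiated feature bit, as the commit above describes, might look roughly like this. The bit position `FEATURE_FLOCK_SKETCH`, the enum, and `pick_reconnect_version()` are placeholders invented for the example; they are not the constants or functions used by the Ceph client.

```c
#include <stdint.h>

/* Hypothetical bit position; the real feature bit value is not shown here. */
#define FEATURE_FLOCK_SKETCH (1ULL << 0)

enum reconnect_version {
    RECONNECT_V1 = 1,   /* old encoding, for peers without the feature */
    RECONNECT_V2 = 2    /* v2 encoding, for peers that advertise it */
};

/* Pick the reconnect encoding from the peer's advertised feature mask. */
static enum reconnect_version pick_reconnect_version(uint64_t peer_features)
{
    return (peer_features & FEATURE_FLOCK_SKETCH) ? RECONNECT_V2
                                                  : RECONNECT_V1;
}
```

Keying the choice on what the peer advertises lets one client keep working against both older and newer servers.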

02 August, 2010 · 1 commit

ceph: handle ESTALE properly; on receipt send to authority if it wasn't · e55b71f8

Committed by Greg Farnum on June 22, 2010

Signed-off-by: Greg Farnum <gregf@hq.newdream.net>
Signed-off-by: Sage Weil <sage@newdream.net>

e55b71f8

openanolis / cloud-kernel · synced successfully more than 1 year ago