Commits · 9bae113a085b790de384bf86f09e15b42a65a985 · openeuler / raspberrypi-kernel

27 Jul, 2011 9 commits

ceph: only link open operations to directory unsafe list if O_CREAT|O_TRUNC · 9bae113a

Committed by Sage Weil on Jul 26, 2011

We only need to put these on the directory unsafe list if they have
side effects that fsync(2) should flush out.
Reviewed-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: Sage Weil <sage@newdream.net>
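The condition named in the subject line can be sketched in userspace C. This is a minimal illustration, not the kernel change itself: `open_has_dir_side_effects` is a hypothetical helper name, and the only assumption taken from the commit is that an open carrying `O_CREAT` or `O_TRUNC` is the kind a directory fsync(2) needs to flush.

```c
#include <fcntl.h>
#include <stdbool.h>

/* Hypothetical helper, not the kernel code: returns true when an open(2)
 * with these flags can create or truncate a file, i.e. when it has side
 * effects that an fsync(2) on the parent directory should flush. */
static bool open_has_dir_side_effects(int flags)
{
    return (flags & (O_CREAT | O_TRUNC)) != 0;
}
```

A plain read-only open would return false here and so would not need to be linked to the directory's unsafe list.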

ceph: fix bad parent_inode calc in ceph_lookup_open · acda7657

Committed by Sage Weil on Jul 26, 2011

We were always getting NULL here because the intent file f_dentry is always
NULL at this point, which means we were always passing NULL to
ceph_mdsc_do_request.  In reality, this was fine, since this isn't
currently ever a write operation that needs to get strung on the dir's
unsafe list.

Use the dir explicitly, and only pass it if this open has side-effects that
a dir fsync should flush.
Reviewed-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: Sage Weil <sage@newdream.net>

ceph: avoid carrying Fw cap during write into page cache · d8de9ab6

Committed by Sage Weil on Jul 26, 2011

The generic_file_aio_write call may block on balance_dirty_pages while we
flush data to the OSDs.  If we hold a reference to the FILE_WR cap during
that interval, revocation by the MDS (e.g., to do a stat(2)) may be very
slow.
Reviewed-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: Sage Weil <sage@newdream.net>

ceph: report f_bfree based on kb_avail rather than diffing. · 8f04d422

Committed by Greg Farnum on Jul 26, 2011

Reviewed-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>

ceph: only queue capsnap if caps are dirty · e77dc3e9

Committed by Sage Weil on Jul 26, 2011

We used to go into this branch if i_wrbuffer_ref_head was non-zero.  This
was an ancient check from before we were careful about dealing with all
kinds of caps (and not just dirty pages).  It is cleaner to only queue a
capsnap if there is an actual dirty cap.  If we are racing with...
something...we will end up here with ci->i_wrbuffer_refs but no dirty
caps.
Reviewed-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: Sage Weil <sage@newdream.net>

ceph: fix snap writeback when racing with writes · af0ed569

Committed by Sage Weil on Jul 26, 2011

There are two problems that come up when we try to queue a capsnap while a
write is in progress:

 - The FILE_WR cap is held, but not yet dirty, so we may queue a capsnap
   with dirty == 0.  That will crash later in __ceph_flush_snaps().  Or
   on the FILE_WR cap if a write is in progress.
 - We may not have i_head_snapc set, which causes problems pretty quickly.
   Look to the snaprealm in this case.
Reviewed-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: Sage Weil <sage@newdream.net>

ceph: use flag bit for at_end readdir flag · 9cfa1098

Committed by Sage Weil on Jul 26, 2011

This saves us a word of memory per file.
Reviewed-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: Sage Weil <sage@newdream.net>
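The space saving comes from folding a boolean into a shared flags word. A minimal sketch of that pattern follows; the struct, the bit name `FILE_INFO_AT_END`, and the helpers are hypothetical and do not reproduce the kernel's definitions.

```c
#include <stdbool.h>

#define FILE_INFO_AT_END 0x1u  /* hypothetical bit name */

/* Hypothetical per-file state: one flags word shared by all booleans,
 * instead of a separate integer field for each of them. */
struct file_info_sketch {
    unsigned int flags;
};

/* Set or clear the at_end bit without touching the other bits. */
static void set_at_end(struct file_info_sketch *fi, bool at_end)
{
    if (at_end)
        fi->flags |= FILE_INFO_AT_END;
    else
        fi->flags &= ~FILE_INFO_AT_END;
}

/* Test the at_end bit. */
static bool is_at_end(const struct file_info_sketch *fi)
{
    return (fi->flags & FILE_INFO_AT_END) != 0;
}
```

Further booleans can then claim additional bits of the same word at no extra memory cost.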

ceph: add F_SYNC file flag to force sync (non-O_DIRECT) io · 4918b6d1

Committed by Sage Weil on Jul 26, 2011

This allows us to force IO through the sync path which you normally only
get when multiple clients are reading/writing to the same file or by
mounting with -o sync.  Among other things, this lets test programs verify
correctness with a single mount.
Reviewed-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: Sage Weil <sage@newdream.net>

ceph: add flags field to file_info · 252c6728

Committed by Sage Weil on Jul 26, 2011

Reviewed-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: Sage Weil <sage@newdream.net>

17 Jul, 2011 1 commit

ceph analog of cifs build_path_from_dentry() race fix · 1b71fe2e

Committed by Al Viro on Jul 16, 2011

... unfortunately, cifs bug got copied.  Fix is essentially the same.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

14 Jun, 2011 2 commits

ceph: fix sync and dio writes across stripe boundaries · d7f124f1

Committed by Sage Weil on Jun 13, 2011

We were iterating across stripe boundaries properly, but not moving the
write buffer pointer forward. This caused us to rewrite the same data
after the break. Fix by adjusting the data pointer forward, and
recalculating the io and buffer alignment after the break.
Signed-off-by: Sage Weil <sage@newdream.net>
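The bug class described above can be illustrated with a small userspace loop. This is a sketch under stated assumptions, not the kernel code: `STRIPE_SIZE`, `striped_write`, and the flat `dst` buffer standing in for the object store are all hypothetical.

```c
#include <stddef.h>
#include <string.h>

#define STRIPE_SIZE 8  /* hypothetical, tiny so the effect is easy to see */

/* Copy len bytes from data into dst starting at offset pos, one
 * stripe-bounded chunk at a time. dst stands in for the object store. */
static void striped_write(char *dst, size_t pos, const char *data, size_t len)
{
    while (len > 0) {
        /* Bytes left before the next stripe boundary. */
        size_t chunk = STRIPE_SIZE - (pos % STRIPE_SIZE);
        if (chunk > len)
            chunk = len;

        memcpy(dst + pos, data, chunk);

        pos += chunk;
        data += chunk;  /* without this, the next chunk rewrites old data */
        len -= chunk;
    }
}
```

Dropping the `data += chunk` line reproduces the symptom in the commit message: the file offset advances across the boundary, but the same source bytes are written again.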

ceph: fix page alignment corrections · 773e9b44

Committed by Sage Weil on Jun 07, 2011

 dd if=/dev/urandom of=/mnt/fs_depot/dd10 bs=500 seek=8388 count=1
 dd if=/mnt/fs_depot/dd10 of=/root/dd10out bs=500 skip=8388 count=1
Reported-by: Henry C Chang <henry.cy.chang@gmail.com>
Signed-off-by: Sage Weil <sage@newdream.net>

08 Jun, 2011 4 commits

ceph: unwind canceled flock state · 0c1f91f2

Committed by Sage Weil on May 25, 2011

If we request a lock and then abort (e.g., ^C), we need to send a matching
unlock request to the MDS to unwind our lock attempt to avoid indefinitely
blocking other clients.
Reported-by: Brian Chrisman <brchrisman@gmail.com>
Signed-off-by: Sage Weil <sage@newdream.net>

ceph: fix ENOENT logic in striped_read · 0e98728f

Committed by Sage Weil on Jun 07, 2011

Getting ENOENT is equivalent to reading 0 bytes.  Make that correction
before setting up the hit_stripe and was_short flags.

Fixes the following case:
 dd if=/dev/zero of=/mnt/fs_depot/dd3 bs=1 seek=1048576 count=0
 dd if=/mnt/fs_depot/dd3 of=/root/ddout1 skip=8 bs=500 count=2 iflag=direct
Reported-by: Henry C Chang <henry.cy.chang@gmail.com>
Signed-off-by: Sage Weil <sage@newdream.net>
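The first sentence of the message, that ENOENT is equivalent to reading 0 bytes, can be expressed as a small normalization step. This is a sketch only: `normalize_read_result` is a hypothetical name, and it assumes the kernel convention of returning a byte count on success or a negative errno on failure.

```c
#include <errno.h>

/* Treat "object does not exist" as a successful read of zero bytes and
 * pass every other result through unchanged. ret is a byte count on
 * success or a negative errno value on failure. */
static long normalize_read_result(long ret)
{
    return ret == -ENOENT ? 0 : ret;
}
```

Applying this before any short-read bookkeeping means later checks only ever see a byte count or a genuine error.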

ceph: fix short sync reads from the OSD · c3cd6283

Committed by Sage Weil on Jun 01, 2011

If we get a short read from the OSD because the object is small, we need to
zero the remainder of the buffer.  For O_DIRECT reads, the attempted range
is not trimmed to i_size by the VFS, so we were actually looping
indefinitely.

Fix by trimming by i_size, and then unconditionally zeroing the trailing
range.
Reported-by: Jeff Wu <cpwu@tnsoft.com.cn>
Signed-off-by: Sage Weil <sage@newdream.net>
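The two steps in the fix (trim the request to i_size, then zero whatever the backend did not return) can be sketched as follows. This is a simplified userspace model, not the kernel implementation; `finish_short_read` and its parameters are hypothetical.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical model of finishing a sync read: want bytes were requested
 * at offset pos, the backend returned got bytes, and the file is i_size
 * bytes long. Returns the number of valid bytes now in buf. */
static size_t finish_short_read(char *buf, size_t pos, size_t want,
                                size_t got, size_t i_size)
{
    /* Trim the request so it never extends past end of file. */
    if (pos >= i_size)
        return 0;
    if (want > i_size - pos)
        want = i_size - pos;
    if (got > want)
        got = want;

    /* A short read inside the file is a hole: zero the trailing range. */
    memset(buf + got, 0, want - got);
    return want;
}
```

Because the trimmed request is always satisfied in one pass, a caller looping until `want` bytes arrive cannot spin forever on a range that lies beyond end of file.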

ceph: use ihold when we already have an inode ref · 70b666c3

Committed by Sage Weil on May 27, 2011

We should use ihold whenever we already have a stable inode ref, even
when we aren't holding i_lock.  This avoids adding new and unnecessary
locking dependencies.
Signed-off-by: Sage Weil <sage@newdream.net>

26 May, 2011 3 commits

ceph: remove unnecessary dentry_unhash calls · 051e8f0e

Committed by Sage Weil on May 24, 2011

Ceph does not need these, and they screw up our use of the dcache as a
consistent cache.
Signed-off-by: Sage Weil <sage@newdream.net>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

vfs: push dentry_unhash on rename_dir into file systems · e4eaac06

Committed by Sage Weil on May 24, 2011

Only a few file systems need this.  Start by pushing it down into each
rename method (except gfs2 and xfs) so that it can be dealt with on a
per-fs basis.
Acked-by: Christoph Hellwig <hch@lst.de>
Signed-off-by: Sage Weil <sage@newdream.net>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

vfs: push dentry_unhash on rmdir into file systems · 79bf7c73

Committed by Sage Weil on May 24, 2011

Only a few file systems need this.  Start by pushing it down into each
fs rmdir method (except gfs2 and xfs) so it can be dealt with on a per-fs
basis.

This does not change behavior for any in-tree file systems.
Acked-by: Christoph Hellwig <hch@lst.de>
Signed-off-by: Sage Weil <sage@newdream.net>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

25 May, 2011 3 commits

ceph: fix cap flush race reentrancy · db354052

Committed by Sage Weil on May 24, 2011

In e9964c10 we change cap flushing to do a delicate dance because some
inodes on the cap_dirty list could be in a migrating state (got EXPORT but
not IMPORT) in which we couldn't actually flush and move from
dirty->flushing, breaking the while (!empty) { process first } loop
structure.  It worked for a single sync thread, but was not reentrant and
triggered infinite loops when multiple syncers came along.

Instead, move inodes with dirty to a separate cap_dirty_migrating list
when in the limbo export-but-no-import state, allowing us to go back to
the simple loop structure (which was reentrant).  This is cleaner and more
robust.

Audited the cap_dirty users and this looks fine:
list_empty(&ci->i_dirty_item) is still a reliable indicator of whether we
have dirty caps (which list we're on is irrelevant) and list_del_init()
calls still do the right thing.
Signed-off-by: Sage Weil <sage@newdream.net>

ceph: avoid inode lookup on nfs fh reconnect · 45e3d3ee

Committed by Sage Weil on Apr 06, 2011

If we get the inode from the MDS, we have a reference in req; don't do a
fresh lookup.
Signed-off-by: Sage Weil <sage@newdream.net>

ceph: use LOOKUPINO to make unconnected nfs fh more reliable · 3c454cf2

Committed by Sage Weil on Apr 06, 2011

If we are unable to locate an inode by ino, ask the MDS using the new
LOOKUPINO command.
Signed-off-by: Sage Weil <sage@newdream.net>

20 May, 2011 7 commits

ceph: check return value for start_request in writepages · 9d6fcb08

Committed by Sage Weil on May 12, 2011

Since we pass the nofail arg, we should never get an error; BUG if we do.
(And fix the function to not return an error if __map_request fails.)
Signed-off-by: Sage Weil <sage@newdream.net>

ceph: remove useless check · 6b4a3b51

Committed by Sage Weil on May 12, 2011

rc is only ever 0 or negative in this method.
Signed-off-by: Sage Weil <sage@newdream.net>

ceph: fix broken comparison in readdir loop · da39822c

Committed by Sage Weil on May 12, 2011

Both off and fi->offset are unsigned, so the difference is always >= 0.
Compare them directly instead of the sign of the difference.
Signed-off-by: Sage Weil <sage@newdream.net>
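The pitfall is a general property of unsigned arithmetic in C and is easy to reproduce. The sketch below contrasts the two forms of the test; the function and parameter names are hypothetical, and the actual kernel condition may point in the other direction.

```c
#include <stdbool.h>

/* Broken pattern: with unsigned operands the subtraction wraps around
 * instead of going negative, so this test is always true. */
static bool reached_broken(unsigned int off, unsigned int fi_offset)
{
    return off - fi_offset >= 0;
}

/* Fixed pattern: compare the two offsets directly. */
static bool reached_fixed(unsigned int off, unsigned int fi_offset)
{
    return off >= fi_offset;
}
```

Many compilers flag the broken form with a "comparison is always true" warning when extra warnings are enabled, which is one way such checks get noticed.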

ceph: fix rare potential cap leak · 3540303f

Committed by Sage Weil on May 12, 2011

If we grab new_cap, retake the lock, and find we already have a cap now
for the given mds, release new_cap.
Signed-off-by: Sage Weil <sage@newdream.net>

ceph: use snprintf for dirstat content · ae598083

Committed by Sage Weil on May 12, 2011

We allocate a buffer for rstats if the dirstat option is enabled.  Use
snprintf.
Signed-off-by: Sage Weil <sage@newdream.net>
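The reason to prefer snprintf over sprintf for a fixed-size buffer can be shown with a short example. The format string and field names below are hypothetical and do not reproduce the real dirstat content.

```c
#include <stdio.h>

/* Format hypothetical directory stats into buf without ever writing more
 * than size bytes, terminator included. Returns the snprintf result: the
 * length the full string would have had, so a return value >= size means
 * the output was truncated. */
static int format_dirstat(char *buf, size_t size,
                          unsigned long long files,
                          unsigned long long subdirs)
{
    return snprintf(buf, size, "files: %llu\nsubdirs: %llu\n",
                    files, subdirs);
}
```

With sprintf, an unexpectedly large value would write past the end of the allocation; with snprintf, the output is truncated and the return value reports that it happened.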

libceph: remove unused variable · 1b366985

Committed by Sage Weil on May 12, 2011

Signed-off-by: Sage Weil <sage@newdream.net>

ceph: take reference on mds request r_unsafe_dir · 3b663780

Committed by Sage Weil on May 18, 2011

We put ourselves on an inode list for the parent directory of metadata
operations so that an fsync on the directory will wait for metadata updates
to commit to disk.  We weren't holding a reference to that directory,
however, and under certain workloads (fsstress in this case) the directory
can go away.
Signed-off-by: Sage Weil <sage@newdream.net>

12 May, 2011 3 commits

ceph: do not use i_wrbuffer_ref as refcount for Fb cap · d3d0720d

Committed by Henry C Chang on May 11, 2011

We increment i_wrbuffer_ref when taking the Fb cap. This breaks
the dirty page accounting and causes looping in
__ceph_do_pending_vmtruncate, and the ceph client hangs.

This bug can be reproduced occasionally by running blogbench.

Add a new field i_wb_ref to inode and dedicate it to Fb reference
counting.
Signed-off-by: Henry C Chang <henry.cy.chang@gmail.com>
Signed-off-by: Sage Weil <sage@newdream.net>

ceph: fix list_add in ceph_put_snap_realm · a26a185d

Committed by Henry C Chang on May 11, 2011

Signed-off-by: Henry C Chang <henry.cy.chang@gmail.com>
Signed-off-by: Sage Weil <sage@newdream.net>

ceph: print debug message before put mds session · 7d8e18a6

Committed by Henry C Chang on May 11, 2011

The mds session, s, could be freed during ceph_put_mds_session.
Move dout before ceph_put_mds_session.
Signed-off-by: Henry C Chang <henry.cy.chang@gmail.com>
Signed-off-by: Sage Weil <sage@newdream.net>
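The ordering rule behind this fix applies to any refcounted object: read what you need before dropping your reference. A minimal userspace sketch follows; the `session` struct and both functions are hypothetical stand-ins, not the kernel's ceph_mds_session API.

```c
#include <stdio.h>
#include <stdlib.h>

/* Hypothetical refcounted session; not the kernel's structure. */
struct session {
    int refs;
    int id;
};

/* Drop one reference and free the session when the count reaches zero.
 * Returns 1 if the session was freed, 0 otherwise. */
static int session_put(struct session *s)
{
    if (--s->refs == 0) {
        free(s);
        return 1;
    }
    return 0;
}

/* Log first, then put: once session_put() returns, s may already be
 * freed and must not be dereferenced. */
static int release_session(struct session *s)
{
    fprintf(stderr, "releasing session %d\n", s->id);
    return session_put(s);
}
```

Swapping the two statements in `release_session` would read `s->id` from memory that may already have been freed, which is the use-after-free the commit avoids.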

05 May, 2011 1 commit

ceph: do not call __mark_dirty_inode under i_lock · fca65b4a

Committed by Sage Weil on May 04, 2011

The __mark_dirty_inode helper now takes i_lock as of 250df6ed. Fix the
one ceph caller that held i_lock (__ceph_mark_dirty_caps) to return the
flags value so that the callers can do it outside of i_lock.
Signed-off-by: Sage Weil <sage@newdream.net>

04 May, 2011 2 commits

ceph: handle ceph_osdc_new_request failure in ceph_writepages_start · 8c71897b

Committed by Henry C Chang on May 03, 2011

We should unlock the page and return -ENOMEM if ceph_osdc_new_request
failed.
Signed-off-by: Henry C Chang <henry_c_chang@tcloudcomputing.com>
Signed-off-by: Sage Weil <sage@newdream.net>

ceph: use ihold() when i_lock is held · 3772d26d

Committed by Sage Weil on May 03, 2011

See 0444d76a.
Signed-off-by: Sage Weil <sage@newdream.net>

31 Mar, 2011 1 commit

Fix common misspellings · 25985edc

Committed by Lucas De Marchi on Mar 30, 2011

Fixes generated by 'codespell' and manually reviewed.
Signed-off-by: Lucas De Marchi <lucas.demarchi@profusion.mobi>

30 Mar, 2011 1 commit

ceph: Move secret key parsing earlier. · 8323c3aa

Committed by Tommi Virtanen on Mar 25, 2011

This makes the base64 logic be contained in mount option parsing,
and prepares us for replacing the homebrew key management with the
kernel key retention service.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
Signed-off-by: Sage Weil <sage@newdream.net>

29 Mar, 2011 1 commit

fs: don't use igrab() while holding i_lock · 0444d76a

Committed by Dave Chinner on Mar 29, 2011

Fix the incorrect use of igrab() inside the i_lock in NFS and Ceph.

If we are already holding the i_lock, we have a reference to the
inode so we can safely use ihold() to gain an extra reference. This
avoids hangs due to lock recursion on the i_lock now that the
inode_lock is gone and igrab() uses the i_lock itself.
Signed-off-by: Dave Chinner <dchinner@redhat.com>
Cc: Al Viro <viro@zeniv.linux.org.uk>
Cc: linux-fsdevel@vger.kernel.org
Cc: Ryan Mallon <ryan@bluewatersys.com>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

26 Mar, 2011 1 commit

ceph: flush msgr_wq during mds_client shutdown · ef550f6f

Committed by Sage Weil on Mar 25, 2011

The release method for mds connections uses a backpointer to the
mds_client, so we need to flush the workqueue of any pending work (and
ceph_connection references) prior to freeing the mds_client.  This fixes
an oops easily triggered under UML by

 while true ; do mount ... ; umount ... ; done

Also fix an outdated comment: the flush in ceph_destroy_client only flushes
OSD connections out.  This bug is basically an artifact of the ceph ->
ceph+libceph conversion.
Signed-off-by: Sage Weil <sage@newdream.net>

22 Mar, 2011 1 commit

ceph: rename dentry_release -> d_release, fix comment · 147851d2

Committed by Sage Weil on Mar 15, 2011

Just for consistency's sake.  Fix obsolete comment too.
Signed-off-by: Sage Weil <sage@newdream.net>