提交 · 6a8ea4706adb4b4d8f77a8da5f9778b65fbf6f48 · openeuler / raspberrypi-kernel

26 10月, 2011 4 次提交

由 Sage Weil 提交于 8月 04, 2011

...after some prodding by Christoph.
Signed-off-by: NSage Weil <sage@newdream.net>

6a8ea470

ceph: implement (optional) max read size · 0d66a487

由 Sage Weil 提交于 8月 04, 2011

The 'rsize' mount option limits the maximum size of an individual
read(ahead) operation that is sent off to an OSD.  This is distinct from
'rasize', which controls the size of the readahead window.
Signed-off-by: NSage Weil <sage@newdream.net>

0d66a487

S
ceph: rename rsize -> rasize · 83817e35
由 Sage Weil 提交于 8月 04, 2011
```
It controls readahead.
Signed-off-by: NSage Weil <sage@newdream.net>
```
83817e35

ceph: make readpages fully async · 7c272194

由 Sage Weil 提交于 8月 03, 2011

When we get a ->readpages() aop, submit async reads for all page ranges
in the provided page list.  Lock the pages immediately, so that VFS/MM
will block until the reads complete.
Signed-off-by: NSage Weil <sage@newdream.net>

7c272194

23 8月, 2011 1 次提交

ceph: fix memory leak · 259a187a

由 Noah Watkins 提交于 8月 22, 2011

kfree does not clean up indirect allocations in
ceph_fs_client and ceph_options (e.g. snapdir_name).
Signed-off-by: NNoah Watkins <noahwatkins@gmail.com>
Signed-off-by: NSage Weil <sage@newdream.net>

259a187a

16 8月, 2011 1 次提交

ceph: fix encoding of ino only (not relative) paths · 795858db

由 Sage Weil 提交于 8月 15, 2011

A 'path' consists of a starting ino and relative component.  Encode even
when there is no relative component.  This is primarily needed by the
NFS reexport code.
Signed-off-by: NSage Weil <sage@newdream.net>

795858db

27 7月, 2011 20 次提交

ceph: document unlocked d_parent accesses · d79698da

由 Sage Weil 提交于 7月 26, 2011

For the most part we don't care about racing with rename when directing
MDS requests; either the old or new parent is fine.  Document that, and
do some minor cleanup.
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

d79698da

ceph: explicitly reference rename old_dentry parent dir in request · 41b02e1f

由 Sage Weil 提交于 7月 26, 2011

We carry a pin on the parent directory for the rename source and dest
dentries.  For the source it's r_locked_dir; we need to explicitly
reference the old_dentry parent as well, since the dentry's d_parent may
change between when the request was created and pinned and when it is
freed.
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

41b02e1f

S
ceph: document locking for ceph_set_dentry_offset · 4f177264
由 Sage Weil 提交于 7月 26, 2011
```
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>
```
4f177264

ceph: avoid d_parent in ceph_dentry_hash; fix ceph_encode_fh() hashing bug · e5f86dc3

由 Sage Weil 提交于 7月 26, 2011

Have caller pass in a safely-obtained reference to the parent directory
for calculating a dentry's hash valud.

While we're here, simpify the flow through ceph_encode_fh() so that there
is a single exit point and cleanup.

Also fix a bug with the dentry hash calculation: calculate the hash for the
dentry we were given, not its parent.
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

e5f86dc3

ceph: protect d_parent access in ceph_d_revalidate · bf1c6aca

由 Sage Weil 提交于 7月 26, 2011

Protect d_parent with d_lock.  Carry a reference.  Simplify the flow so
that there is a single exit point and cleanup.
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

bf1c6aca

ceph: protect access to d_parent · 5f21c96d

由 Sage Weil 提交于 7月 26, 2011

d_parent is protected by d_lock: use it when looking up a dentry's parent
directory inode.  Also take a reference and drop it in the caller to avoid
a use-after-free.
Reported-by: NAl Viro <viro@ZenIV.linux.org.uk>
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

5f21c96d

ceph: handle racing calls to ceph_init_dentry · 48d0cbd1

由 Sage Weil 提交于 7月 26, 2011

The ->lookup() and prepopulate_readdir() callers are working with unhashed
dentries, so we don't have to worry.  The export.c callers, though, need
to initialize something they got back from d_obtain_alias() and are
potentially racing with other callers.  Make sure we don't return unless
the dentry is properly initialized (by us or someone else).
Reported-by: NAl Viro <viro@ZenIV.linux.org.uk>
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

48d0cbd1

ceph: set dir complete frag after adding capability · dfabbed6

由 Sage Weil 提交于 7月 26, 2011

Curretly ceph_add_cap clears the complete bit if we are newly issued the
FILE_SHARED cap, which is normally the case for a newly issue cap on a new
directory.  That means we clear the just-set bit.  Move the check that sets
the flag to after the cap is added/updated.
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

dfabbed6

ceph: set up readahead size when rsize is not passed · e9852227

由 Yehuda Sadeh 提交于 7月 22, 2011

This should improve the default read performance, as without it
readahead is practically disabled.
Signed-off-by: NYehuda Sadeh <yehuda@hq.newdream.net>

e9852227

ceph: ignore lease mask · 2f90b852

由 Sage Weil 提交于 7月 26, 2011

The lease mask is no longer used (and it changed a while back).  Instead,
use a non-zero duration to indicate that there is a lease being issued.
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

2f90b852

ceph: fix ceph_lookup_open intent usage · 468640e3

由 Sage Weil 提交于 7月 26, 2011

We weren't properly calling lookup_instantiate_filp when setting up the
lookup intent, which could lead to file leakage on errors.  So:

 - use separate helper for the hidden snapdir translation, immediately
   following the mds request
 - use ceph_finish_lookup for the final dentry/return value dance in the
   exit path
 - lookup_instantiate_filp on success
Reported-by: NAl Viro <viro@ZenIV.linux.org.uk>
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

468640e3

ceph: only link open operations to directory unsafe list if O_CREAT|O_TRUNC · 9bae113a

由 Sage Weil 提交于 7月 26, 2011

We only need to put these on the directory unsafe list if they have
side effects that fsync(2) should flush out.
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

9bae113a

ceph: fix bad parent_inode calc in ceph_lookup_open · acda7657

由 Sage Weil 提交于 7月 26, 2011

We were always getting NULL here because the intent file f_dentry is always
NULL at this point, which means we were always passing NULL to
ceph_mdsc_do_request.  In reality, this was fine, since this isn't
currently ever a write operation that needs to get strung on the dir's
unsafe list.

Use the dir explicitly, and only pass it if this open has side-effects that
a dir fsync should flush.
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

acda7657

ceph: avoid carrying Fw cap during write into page cache · d8de9ab6

由 Sage Weil 提交于 7月 26, 2011

The generic_file_aio_write call may block on balance_dirty_pages while we
flush data to the OSDs.  If we hold a reference to the FILE_WR cap during
that interval revocation by the MDS (e.g., to do a stat(2)) may be very
slow.
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

d8de9ab6

ceph: report f_bfree based on kb_avail rather than diffing. · 8f04d422

由 Greg Farnum 提交于 7月 26, 2011

Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NGreg Farnum <gregory.farnum@dreamhost.com>

8f04d422

ceph: only queue capsnap if caps are dirty · e77dc3e9

由 Sage Weil 提交于 7月 26, 2011

We used to go into this branch if i_wrbuffer_ref_head was non-zero.  This
was an ancient check from before we were careful about dealing with all
kinds of caps (and not just dirty pages).  It is cleaner to only queue a
capsnap if there is an actual dirty cap.  If we are racing with...
something...we will end up here with ci->i_wrbuffer_refs but no dirty
caps.
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

e77dc3e9

ceph: fix snap writeback when racing with writes · af0ed569

由 Sage Weil 提交于 7月 26, 2011

There are two problems that come up when we try to queue a capsnap while a
write is in progress:

 - The FILE_WR cap is held, but not yet dirty, so we may queue a capsnap
   with dirty == 0.  That will crash later in __ceph_flush_snaps().  Or
   on the FILE_WR cap if a write is in progress.
 - We may not have i_head_snapc set, which causes problems pretty quickly.
   Look to the snaprealm in this case.
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

af0ed569

ceph: use flag bit for at_end readdir flag · 9cfa1098

由 Sage Weil 提交于 7月 26, 2011

This saves us a word of memory per file.
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

9cfa1098

ceph: add F_SYNC file flag to force sync (non-O_DIRECT) io · 4918b6d1

由 Sage Weil 提交于 7月 26, 2011

This allows us to force IO through the sync path which you normally only
get when multiple clients are reading/writing to the same file or by
mounting with -o sync.  Among other things, this lets test programs verify
correctness with a single mount.
Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

4918b6d1

ceph: add flags field to file_info · 252c6728

由 Sage Weil 提交于 7月 26, 2011

Reviewed-by: NYehuda Sadeh <yehuda@hq.newdream.net>
Signed-off-by: NSage Weil <sage@newdream.net>

252c6728

21 7月, 2011 3 次提交

fs: push i_mutex and filemap_write_and_wait down into ->fsync() handlers · 02c24a82

由 Josef Bacik 提交于 7月 16, 2011

Btrfs needs to be able to control how filemap_write_and_wait_range() is called
in fsync to make it less of a painful operation, so push down taking i_mutex and
the calling of filemap_write_and_wait() down into the ->fsync() handlers. Some
file systems can drop taking the i_mutex altogether it seems, like ext3 and
ocfs2. For correctness sake I just pushed everything down in all cases to make
sure that we keep the current behavior the same for everybody, and then each
individual fs maintainer can make up their mind about what to do from there.
Thanks,
Acked-by: NJan Kara <jack@suse.cz>
Signed-off-by: NJosef Bacik <josef@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

02c24a82

fs: handle SEEK_HOLE/SEEK_DATA properly in all fs's that define their own llseek · 06222e49

由 Josef Bacik 提交于 7月 18, 2011

This converts everybody to handle SEEK_HOLE/SEEK_DATA properly. In some cases
we just return -EINVAL, in others we do the normal generic thing, and in others
we're simply making sure that the properly due-dilligence is done. For example
in NFS/CIFS we need to make sure the file size is update properly for the
SEEK_HOLE and SEEK_DATA case, but since it calls the generic llseek stuff itself
that is all we have to do. Thanks,
Signed-off-by: NJosef Bacik <josef@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

06222e49

A
don't open-code parent_ino() in assorted ->readdir() · b85fd6bd
由 Al Viro 提交于 7月 17, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
b85fd6bd

20 7月, 2011 5 次提交

A
ceph: LOOKUP_OPEN is set only when it's the last component · a127e0af
由 Al Viro 提交于 6月 25, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
a127e0af
A
don't transliterate lower bits of ->intent.open.flags to FMODE_... · 8a5e929d
由 Al Viro 提交于 6月 25, 2011
```
->create() instances are much happier that way...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
8a5e929d
A
->permission() sanitizing: don't pass flags to ->permission() · 10556cb2
由 Al Viro 提交于 6月 20, 2011
```
not used by the instances anymore.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
10556cb2

->permission() sanitizing: don't pass flags to generic_permission() · 2830ba7f

由 Al Viro 提交于 6月 20, 2011

redundant; all callers get it duplicated in mask & MAY_NOT_BLOCK and none of
them removes that bit.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

2830ba7f

kill check_acl callback of generic_permission() · 178ea735

由 Al Viro 提交于 6月 20, 2011

its value depends only on inode and does not change; we might as
well store it in ->i_op->check_acl and be done with that.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

178ea735

17 7月, 2011 1 次提交

ceph analog of cifs build_path_from_dentry() race fix · 1b71fe2e

由 Al Viro 提交于 7月 16, 2011

... unfortunately, cifs bug got copied.  Fix is essentially the same.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

1b71fe2e

14 6月, 2011 2 次提交

ceph: fix sync and dio writes across stripe boundaries · d7f124f1

由 Sage Weil 提交于 6月 13, 2011

We were iterating across stripe boundaries properly, but not moving the
write buffer pointer forward. This caused us to rewrite the same data
after the break. Fix by adjusting the data pointer forward, and
recalculating the io and buffer alignment after the break.
Signed-off-by: NSage Weil <sage@newdream.net>

d7f124f1

ceph: fix page alignment corrections · 773e9b44

由 Sage Weil 提交于 6月 07, 2011

 dd if=/dev/urandom of=/mnt/fs_depot/dd10 bs=500 seek=8388 count=1
 dd if=/mnt/fs_depot/dd10 of=/root/dd10out bs=500 skip=8388 count=1
Reported-by: NHenry C Chang <henry.cy.chang@gmail.com>
Signed-off-by: NSage Weil <sage@newdream.net>

773e9b44

08 6月, 2011 3 次提交

ceph: unwind canceled flock state · 0c1f91f2

由 Sage Weil 提交于 5月 25, 2011

If we request a lock and then abort (e.g., ^C), we need to send a matching
unlock request to the MDS to unwind our lock attempt to avoid indefinitely
blocking other clients.
Reported-by: NBrian Chrisman <brchrisman@gmail.com>
Signed-off-by: NSage Weil <sage@newdream.net>

0c1f91f2

ceph: fix ENOENT logic in striped_read · 0e98728f

由 Sage Weil 提交于 6月 07, 2011

Getting ENOENT is equivalent to reading 0 bytes.  Make that correction
before setting up the hit_stripe and was_short flags.

Fixes the following case:
 dd if=/dev/zero of=/mnt/fs_depot/dd3 bs=1 seek=1048576 count=0
 dd if=/mnt/fs_depot/dd3 of=/root/ddout1 skip=8 bs=500 count=2 iflag=direct
Reported-by: NHenry C Chang <henry.cy.chang@gmail.com>
Signed-off-by: NSage Weil <sage@newdream.net>

0e98728f

ceph: fix short sync reads from the OSD · c3cd6283

由 Sage Weil 提交于 6月 01, 2011

If we get a short read from the OSD because the object is small, we need to
zero the remainder of the buffer.  For O_DIRECT reads, the attempted range
is not trimmed to i_size by the VFS, so we were actually looping
indefinitely.

Fix by trimming by i_size, and the unconditionally zeroing the trailing
range.
Reported-by: NJeff Wu <cpwu@tnsoft.com.cn>
Signed-off-by: NSage Weil <sage@newdream.net>

c3cd6283