Commits · 37c20f16e7a73e5fe34815e785ca6c5a46e4d260 · openanolis / cloud-kernel

07 May, 2014 11 commits

fuse_file_aio_read(): convert to ->read_iter() · 37c20f16
Committed by Al Viro on Apr 02, 2014
```
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```

Committed by Al Viro on Mar 22, 2014

Now It Can Be Done(tm) - we don't need to do iov_shorten() in
generic_file_direct_write() anymore, now that all ->direct_IO()
instances are converted to proper iov_iter methods and honour
iter->count and iter->iov_offset properly.

Get rid of count/ocount arguments of generic_file_direct_write(),
while we are at it.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

0c949334

new helper: iov_iter_npages() · f67da30c

Committed by Al Viro on Mar 19, 2014

counts the pages covered by iov_iter, up to given limit.
do_block_direct_io() and fuse_iter_npages() switched to
it.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
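
The page counting described in this commit message can be sketched in userspace C. This is only an illustration, not the kernel helper: `sketch_iov_npages()` and `SKETCH_PAGE_SIZE` are hypothetical names, the 4096-byte page size is an assumption, and the sketch ignores the iterator's current offset and remaining count.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <sys/uio.h>

#define SKETCH_PAGE_SIZE 4096u  /* assumed page size, for illustration only */

/*
 * Hypothetical userspace analogue of iov_iter_npages(): count the pages
 * touched by the segments in iov[0..nr_segs), stopping at maxpages.
 * Each segment is counted separately, so a page shared by two segments
 * is counted once per segment.
 */
static int sketch_iov_npages(const struct iovec *iov, size_t nr_segs,
                             int maxpages)
{
    int npages = 0;

    assert(maxpages >= 0);
    for (size_t i = 0; i < nr_segs && npages < maxpages; i++) {
        uintptr_t start = (uintptr_t)iov[i].iov_base;
        size_t len = iov[i].iov_len;

        if (len == 0)
            continue;
        /* Bytes from the start of the first page to the end of the data. */
        size_t span = (size_t)(start % SKETCH_PAGE_SIZE) + len;
        size_t n = (span + SKETCH_PAGE_SIZE - 1) / SKETCH_PAGE_SIZE;

        /* Stop as soon as the limit would be reached or exceeded. */
        if (n >= (size_t)(maxpages - npages))
            return maxpages;
        npages += (int)n;
    }
    return npages;
}
```

A page-aligned 4096-byte segment counts as one page here, while a 4096-byte segment starting 100 bytes into a page counts as two.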

fuse: switch to iov_iter_get_pages() · c9c37e2e
Committed by Al Viro on Mar 16, 2014
```
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```

fuse: pull iov_iter initializations up · d22a943f

Committed by Al Viro on Mar 16, 2014

... to fuse_direct_{read,write}().  ->direct_IO() path uses the
iov_iter passed by the caller instead.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

start adding the tag to iov_iter · 71d8e532

Committed by Al Viro on Mar 05, 2014

For now, just use the same thing we pass to ->direct_IO() - it's all
iovec-based at the moment.  Pass it explicitly to iov_iter_init() and
account for kvec vs. iovec in there, by the same kludge NFS ->direct_IO()
uses.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

fuse_file_aio_write(): merge initializations of iov_iter · 23faa7b8
Committed by Al Viro on Mar 05, 2014
```
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```

get rid of pointless iov_length() in ->direct_IO() · a6cbcd4a

Committed by Al Viro on Mar 04, 2014

all callers have iov_length(iter->iov, iter->nr_segs) == iov_iter_count(iter)
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

pass iov_iter to ->direct_IO() · d8d3d94b
Committed by Al Viro on Mar 04, 2014
```
unmodified, for now
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```

kill generic_segment_checks() · cb66a7a1

Committed by Al Viro on Mar 04, 2014

all callers of ->aio_read() and ->aio_write() have iov/nr_segs already
checked - generic_segment_checks() done after that is just an odd way
to spell iov_length().
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
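
The remark that generic_segment_checks() becomes "an odd way to spell iov_length()" is easier to see with the length computation written out. The following is a minimal userspace sketch, assuming segments that have already been validated; `sketch_iov_length()` is a hypothetical name, not the kernel function.

```c
#include <assert.h>
#include <stddef.h>
#include <sys/uio.h>

/*
 * Hypothetical userspace analogue of iov_length(): the total number of
 * bytes described by iov[0..nr_segs). No access checks are done here;
 * the caller is assumed to have validated the segments already.
 */
static size_t sketch_iov_length(const struct iovec *iov, size_t nr_segs)
{
    size_t total = 0;

    assert(iov != NULL || nr_segs == 0);
    for (size_t i = 0; i < nr_segs; i++)
        total += iov[i].iov_len;
    return total;
}
```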

generic_file_direct_write(): switch to iov_iter · f8579f86
Committed by Al Viro on Mar 03, 2014
```
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```

28 Apr, 2014 7 commits

fuse: allow ctime flushing to userspace · ab9e13f7

Committed by Maxim Patlasov on Apr 28, 2014

The patch extends fuse_setattr_in, and extends the flush procedure
(fuse_flush_times()) called on ->write_inode() to send the ctime as well as
mtime.
Signed-off-by: Maxim Patlasov <MPatlasov@parallels.com>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

fuse: add .write_inode · 1e18bda8

Committed by Miklos Szeredi on Apr 28, 2014

...and flush mtime from this.  This allows us to use the kernel
infrastructure for writing out dirty metadata (mtime at this point, but
ctime in the next patches and also maybe atime).
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

fuse: clean up fsync · 22401e7b

Committed by Miklos Szeredi on Apr 28, 2014

Don't need to start I/O twice (once without i_mutex and one within).

Also make sure that even if the userspace filesystem doesn't support FSYNC
we do all the steps other than sending the message.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

fuse: fuse: fallocate: use file_update_time() · 93d2269d

Committed by Miklos Szeredi on Apr 28, 2014

in preparation for getting rid of FUSE_I_MTIME_DIRTY.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

fuse: update mtime on open(O_TRUNC) in atomic_o_trunc mode · 75caeecd

Committed by Maxim Patlasov on Apr 28, 2014

In case of fc->atomic_o_trunc is set, fuse does nothing in
fuse_do_setattr() while handling open(O_TRUNC). Hence, i_mtime must be
updated explicitly in fuse_finish_open(). The patch also adds extra locking
encompassing open(O_TRUNC) operation to avoid races between the truncation
and updating i_mtime.
Signed-off-by: Maxim Patlasov <MPatlasov@parallels.com>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

fuse: fix mtime update error in fsync · aeb4eb6b
Committed by Miklos Szeredi on Apr 28, 2014
```
Bad case of shadowing.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
```

fuse: check fallocate mode · 4adb8302

Committed by Miklos Szeredi on Apr 28, 2014

Don't allow new fallocate modes until we figure out what (if anything) that
takes.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

08 Apr, 2014 1 commit

mm: implement ->map_pages for page cache · f1820361

Committed by Kirill A. Shutemov on Apr 07, 2014

filemap_map_pages() is a generic implementation of ->map_pages() for
filesystems that use the page cache.

It should be safe to use filemap_map_pages() for ->map_pages() if the
filesystem uses filemap_fault() for ->fault().
Signed-off-by: Kirill A. Shutemov <kirill.shutemov@linux.intel.com>
Acked-by: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Mel Gorman <mgorman@suse.de>
Cc: Rik van Riel <riel@redhat.com>
Cc: Andi Kleen <ak@linux.intel.com>
Cc: Matthew Wilcox <matthew.r.wilcox@intel.com>
Cc: Dave Hansen <dave.hansen@linux.intel.com>
Cc: Alexander Viro <viro@zeniv.linux.org.uk>
Cc: Dave Chinner <david@fromorbit.com>
Cc: Ning Qu <quning@gmail.com>
Cc: Hugh Dickins <hughd@google.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

02 Apr, 2014 13 commits

fuse: fix "uninitialized variable" warning · f3846266

Committed by Rajat Jain on Feb 05, 2014

Fix the following warning:

In file included from include/linux/fs.h:16:0,
                 from fs/fuse/fuse_i.h:13,
                 from fs/fuse/file.c:9:
fs/fuse/file.c: In function 'fuse_file_poll':
include/linux/rbtree.h:82:28: warning: 'parent' may be used
uninitialized in this function [-Wmaybe-uninitialized]
fs/fuse/file.c:2592:27: note: 'parent' was declared here
Signed-off-by: Rajat Jain <rajatxjain@gmail.com>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

fuse: Turn writeback cache on · 4d99ff8f

Committed by Pavel Emelyanov on Oct 10, 2013

Introduce a bit that the kernel and userspace exchange with each other at
the init stage, and turn writeback on if userspace wants it and the
mount option 'allow_wbcache' is present (controlled by fusermount).

Also add each writable file to the per-inode write list and call
generic_file_aio_write() to make use of the Linux page cache engine.
Signed-off-by: Maxim Patlasov <MPatlasov@parallels.com>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

fuse: Fix O_DIRECT operations vs cached writeback misorder · ea8cd333

Committed by Pavel Emelyanov on Oct 10, 2013

The problem is:

1. write cached data to a file
2. read directly from the same file (via another fd)

The 2nd operation may read stale data, i.e. the one that was in a file
before the 1st op. Problem is in how fuse manages writeback.

When direct op occurs the core kernel code calls filemap_write_and_wait
to flush all the cached ops in flight. But fuse acks the writeback right
after the ->writepages callback exits w/o waiting for the real write to
happen. Thus the subsequent direct op proceeds while the real writeback
is still in flight. This is a problem for backends that reorder operation.

Fix this by making the fuse direct IO callback explicitly wait on the
in-flight writeback to finish.
Signed-off-by: Maxim Patlasov <MPatlasov@parallels.com>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

fuse: fuse_flush() should wait on writeback · fe38d7df

Committed by Maxim Patlasov on Oct 10, 2013

The aim of .flush fop is to hint file-system that flushing its state or caches
or any other important data to reliable storage would be desirable now.
fuse_flush() passes this hint by sending FUSE_FLUSH request to userspace.
However, dirty pages and pages under writeback may be not visible to userspace
yet if we won't ensure it explicitly.
Signed-off-by: Maxim Patlasov <MPatlasov@parallels.com>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

fuse: Implement write_begin/write_end callbacks · 6b12c1b3

Committed by Pavel Emelyanov on Oct 10, 2013

The .write_begin and .write_end callbacks are required to use the generic
routines (generic_file_aio_write --> ... --> generic_perform_write) for
buffered writes.
Signed-off-by: Maxim Patlasov <MPatlasov@parallels.com>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

fuse: restructure fuse_readpage() · 482fce55

Committed by Maxim Patlasov on Oct 10, 2013

Move the code filling and sending read request to a separate function. Future
patches will use it for .write_begin -- partial modification of a page
requires reading the page from the storage very similarly to what fuse_readpage
does.
Signed-off-by: Maxim Patlasov <MPatlasov@parallels.com>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

fuse: Flush files on wb close · e7cc133c

Committed by Pavel Emelyanov on Oct 10, 2013

Any write request requires a file handle to report to the userspace. Thus
when we close a file (and free the fuse_file with this info) we have to
flush all the outstanding dirty pages.

filemap_write_and_wait() is enough because every page under fuse writeback
is accounted in ff->count. This delays actual close until all fuse wb is
completed.

In case of "write cache" turned off, the flush is ensured by fuse_vma_close().
Signed-off-by: Maxim Patlasov <MPatlasov@parallels.com>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

fuse: Trust kernel i_mtime only · b0aa7606

Committed by Maxim Patlasov on Dec 26, 2013

Let the kernel maintain i_mtime locally:
 - clear S_NOCMTIME
 - implement i_op->update_time()
 - flush mtime on fsync and last close
 - update i_mtime explicitly on truncate and fallocate

Fuse inode flag FUSE_I_MTIME_DIRTY serves as indication that local i_mtime
should be flushed to the server eventually.
Signed-off-by: Maxim Patlasov <MPatlasov@parallels.com>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

fuse: Trust kernel i_size only · 8373200b

Committed by Pavel Emelyanov on Oct 10, 2013

Make fuse think that when writeback is on the inode's i_size is always
up-to-date and not update it with the value received from the userspace.
This is done because the page cache code may update i_size without letting
the FS know.

This assumption implies fixing the previously introduced short-read helper --
when a short read occurs the 'hole' is filled with zeroes.

fuse_file_fallocate() is also fixed because now we should keep i_size up to
date, so it must be updated if FUSE_FALLOCATE request succeeded.
Signed-off-by: Maxim V. Patlasov <MPatlasov@parallels.com>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
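
The zero-filling of the 'hole' after a short read, as described in this commit message, can be illustrated on a flat buffer. This is a sketch under stated assumptions and not the FUSE helper itself: `sketch_short_read_fill()` is a hypothetical name, and the real code works on page cache pages rather than a single byte array.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/*
 * Hypothetical helper: a read asked for `requested` bytes into `buf` but
 * only `received` bytes arrived. Zero the remainder so the caller sees a
 * hole rather than stale buffer contents. Returns the bytes zeroed.
 */
static size_t sketch_short_read_fill(unsigned char *buf, size_t requested,
                                     size_t received)
{
    assert(received <= requested);
    size_t hole = requested - received;

    if (hole > 0)
        memset(buf + received, 0, hole);
    return hole;
}
```

The bytes that were actually received are left untouched; only the tail of the buffer is cleared.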

fuse: Prepare to handle short reads · a92adc82

Committed by Pavel Emelyanov on Oct 10, 2013

A helper which gets called when a read reports fewer bytes than were requested.
See patch "trust kernel i_size only" for details.
Signed-off-by: Maxim Patlasov <MPatlasov@parallels.com>
Signed-off-by: Pavel Emelyanov <xemul@openvz.org>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

fuse: Linking file to inode helper · 650b22b9

Committed by Pavel Emelyanov on Oct 10, 2013

When writeback is ON every writeable file should be in per-inode write list,
not only mmap-ed ones. Thus introduce a helper for this linkage.
Signed-off-by: Maxim Patlasov <MPatlasov@parallels.com>
Signed-off-by: Pavel Emelyanov <xemul@openvz.org>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

generic_file_direct_write(): get rid of ppos argument · 5cb6c6c7
Committed by Al Viro on Feb 11, 2014
```
always equal to &iocb->ki_pos.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```

callers of iov_copy_from_user_atomic() don't need pagefault_disable() · 9e8c2af9
Committed by Al Viro on Feb 02, 2014
```
... it does that itself (via kmap_atomic())
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
```

26 Jan, 2014 1 commit

Fix race when checking i_size on direct i/o read · 9fe55eea

Committed by Steven Whitehouse on Jan 24, 2014

So far I've had one ACK for this, and no other comments. So I think it
is probably time to send this via some suitable tree. I'm guessing that
the vfs tree would be the most appropriate route, but not sure that
there is one at the moment (don't see anything recent at kernel.org)
so in that case I think -mm is the "back up plan". Al, please let me
know if you will take this?

Steve.

---------------------

Following on from the "Re: [PATCH v3] vfs: fix a bug when we do some dio
reads with append dio writes" thread on linux-fsdevel, this patch is my
current version of the fix proposed as option (b) in that thread.

Removing the i_size test from the direct i/o read path at vfs level
means that filesystems now have to deal with requests which are beyond
i_size themselves. These I've divided into three sets:

 a) Those with "no op" ->direct_IO (9p, cifs, ceph)
These are obviously not going to be an issue

 b) Those with "home brew" ->direct_IO (nfs, fuse)
I've been told that NFS should not have any problem with the larger
i_size, however I've added an extra test to FUSE to duplicate the
original behaviour just to be on the safe side.

 c) Those using __blockdev_direct_IO()
These call through to ->get_block() which should deal with the EOF
condition correctly. I've verified that with GFS2 and I believe that
Zheng has verified it for ext4. I've also run the test on XFS and it
passes both before and after this change.

The part of the patch in filemap.c looks a lot larger than it really is
- there are only two lines of real change. The rest is just indentation
of the contained code.

There remains a test of i_size though, which was added for btrfs. It
doesn't cause the other filesystems a problem as the test is performed
after ->direct_IO has been called. It is possible that there is a race
that does matter to btrfs, however this patch doesn't change that, so
it's still an overall improvement.
Signed-off-by: Steven Whitehouse <swhiteho@redhat.com>
Reported-by: Zheng Liu <gnehzuil.liu@gmail.com>
Cc: Jan Kara <jack@suse.cz>
Cc: Dave Chinner <david@fromorbit.com>
Acked-by: Miklos Szeredi <miklos@szeredi.hu>
Cc: Chris Mason <clm@fb.com>
Cc: Josef Bacik <jbacik@fb.com>
Cc: Christoph Hellwig <hch@infradead.org>
Cc: Alexander Viro <viro@zeniv.linux.org.uk>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

23 Jan, 2014 2 commits

fuse: support clients that don't implement 'open' · 7678ac50

Committed by Andrew Gallagher on Nov 05, 2013

open/release operations require userspace transitions to keep track
of the open count and to perform any FS-specific setup.  However,
for some purely read-only FSs which don't need to perform any setup
at open/release time, we can avoid the performance overhead of
calling into userspace for open/release calls.

This patch adds the necessary support to the fuse kernel modules to prevent
open/release operations from hitting in userspace. When the client returns
ENOSYS, we avoid sending the subsequent release to userspace, and also
remember this so that future opens also don't trigger a userspace
operation.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
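
The "remember ENOSYS and skip later opens" behaviour described in this commit message can be sketched as a small state machine. Everything below is illustrative: `struct sketch_conn`, its `no_open` flag, `sketch_open()` and the sample server callback are hypothetical names, and the real kernel code also has to handle release and per-file state.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical connection state; `no_open` is an illustrative flag name. */
struct sketch_conn {
    bool no_open;     /* set once the server has answered open with ENOSYS */
    int open_calls;   /* how many opens were actually sent to the server */
};

/* Hypothetical server callback: returns 0 or a negative errno value. */
typedef int (*sketch_open_fn)(void);

/* Sample server that does not implement open at all. */
static int sketch_server_without_open(void)
{
    return -ENOSYS;
}

/*
 * Send an open to the server unless it has already told us it does not
 * implement open. ENOSYS is treated as success and remembered.
 */
static int sketch_open(struct sketch_conn *conn, sketch_open_fn server_open)
{
    assert(conn != NULL && server_open != NULL);
    if (conn->no_open)
        return 0;               /* skip the userspace round trip */

    conn->open_calls++;
    int err = server_open();
    if (err == -ENOSYS) {
        conn->no_open = true;   /* remember for future opens */
        return 0;
    }
    return err;
}
```

With `sketch_server_without_open`, only the first call reaches the server; later calls return immediately.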

fuse: don't invalidate attrs when not using atime · 451418fc

Committed by Andrew Gallagher on Nov 05, 2013

Various read operations (e.g. readlink, readdir) invalidate the cached
attrs for atime changes.  This patch adds a new function
'fuse_invalidate_atime', which checks for a read-only super block and
avoids the attr invalidation in that case.
Signed-off-by: Andrew Gallagher <andrewjcg@fb.com>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

05 Nov, 2013 4 commits

fuse: writepages: protect secondary requests from fuse file release · ce128de6

Committed by Maxim Patlasov on Oct 02, 2013

All async fuse requests must be supplied with extra reference to a fuse
file.  This is necessary to ensure that the fuse file is not released until
all in-flight requests are completed.  Fuse secondary writeback requests
must obey this rule as well.
Signed-off-by: Maxim Patlasov <MPatlasov@parallels.com>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

fuse: writepages: update bdi writeout when deleting secondary request · 41b6e41f

Committed by Maxim Patlasov on Oct 02, 2013

BDI_WRITTEN counter is used to estimate bdi bandwidth.  It must be
incremented every time as bdi ends page writeback.  No matter whether it
was fulfilled by actual write or by discarding the request (e.g. due to
shrunk i_size).

Note that even before writepages patches, the case "Got truncated off
completely" was handled in fuse_send_writepage() by calling
fuse_writepage_finish() which updated BDI_WRITTEN unconditionally.
Signed-off-by: Maxim Patlasov <MPatlasov@parallels.com>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

fuse: writepages: crop secondary requests · 6eaf4782

Committed by Maxim Patlasov on Oct 02, 2013

If writeback happens while fuse is in FUSE_NOWRITE condition, the request
will be queued but not processed immediately (see fuse_flush_writepages()).
Until FUSE_NOWRITE becomes relaxed, more writebacks can happen. They will
be queued as "secondary" requests to that first ("primary") request.

Existing implementation crops only primary request. This is not correct
because a subsequent extending write(2) may increase i_size and then
secondary requests won't be cropped properly. The result would be stale
data written to the server to a file offset where zeros must be.

Similar problem may happen if secondary requests are attached to an
in-flight request that was already cropped.

The patch solves the issue by cropping all secondary requests in
fuse_writepage_end(). Thanks to Miklos for idea.
Signed-off-by: Maxim Patlasov <MPatlasov@parallels.com>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
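
The cropping this commit message refers to comes down to clamping each queued write against the current file size. The arithmetic for a single request might look like the sketch below; `sketch_crop_write()` is a hypothetical name, and the real fix applies the same idea to every secondary request attached to a primary one.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Hypothetical helper: crop a queued write of `len` bytes at file offset
 * `pos` so that it does not extend past `i_size`. Returns the number of
 * bytes that should still be written (0 if the range was truncated off
 * completely).
 */
static size_t sketch_crop_write(uint64_t pos, size_t len, uint64_t i_size)
{
    if (pos >= i_size)
        return 0;

    uint64_t room = i_size - pos;
    size_t cropped = room < len ? (size_t)room : len;

    assert(cropped <= len);   /* cropping never grows a request */
    return cropped;
}
```

Applying the crop to every queued request, rather than only the first, keeps stale data from being written beyond the size the file had when writeback completes.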

fuse: writepages: roll back changes if request not found · f6011081

Committed by Maxim Patlasov on Oct 02, 2013

fuse_writepage_in_flight() returns false if it fails to find request with
given index in fi->writepages.  Then the caller proceeds with populating
data->orig_pages[] and incrementing req->num_pages.  Hence,
fuse_writepage_in_flight() must revert changes it made in request before
returning false.
Signed-off-by: Maxim Patlasov <MPatlasov@parallels.com>
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>

01 Oct, 2013 1 commit

fuse: writepage: skip already in flight · ff17be08

Committed by Miklos Szeredi on Oct 01, 2013

If ->writepage() tries to write back a page whose copy is still in flight,
then just skip by calling redirty_page_for_writepage().

This is OK, since now ->writepage() should never be called for data
integrity sync.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
