Commits · 612645f7cfe1221f8066b87d62089719bcf43f68 · openeuler / raspberrypi-kernel

18 May, 2016 2 commits

pnfs: rework LAYOUTGET retry handling · 183d9e7b

Authored by Jeff Layton on May 17, 2016

There are several problems in the way a stateid is selected for a
LAYOUTGET operation:

We pick a stateid to use in the RPC prepare op, but that makes
it difficult to serialize LAYOUTGETs that use the open stateid. That
serialization is done in pnfs_update_layout, which occurs well before
the rpc_prepare operation.

Between those two events, the i_lock is dropped and reacquired.
pnfs_update_layout can find that the list has lsegs in it and not do any
serialization, but then later pnfs_choose_layoutget_stateid ends up
choosing the open stateid.

This patch changes the client to select the stateid to use in the
LAYOUTGET earlier, when we're searching for a usable layout segment.
This way we can do it all while holding the i_lock the first time, and
ensure that we serialize any LAYOUTGET call that uses a non-layout
stateid.

This also means a rework of how LAYOUTGET replies are handled, as we
must now get the latest stateid if we want to retransmit in response
to a retryable error.

Most of those errors boil down to the fact that the layout state has
changed in some fashion. Thus, what we really want to do is to re-search
for a layout when it fails with a retryable error, so that we can avoid
reissuing the RPC at all if possible.

While the LAYOUTGET RPC is async, the initiating thread always waits for
it to complete, so it's effectively synchronous anyway. Currently, when
we need to retry a LAYOUTGET because of an error, we drive that retry
via the rpc state machine.

This means that once the call has been submitted, it runs until it
completes. So, we must move the error handling for this RPC out of the
rpc_call_done operation and into the caller.

In order to handle errors like NFS4ERR_DELAY properly, we must also
pass a pointer to the sliding timeout, which is now moved to the stack
in pnfs_update_layout.

The complicating errors are -NFS4ERR_RECALLCONFLICT and
-NFS4ERR_LAYOUTTRYLATER, as those involve a timeout after which we give
up and return NULL back to the caller. So, there is some special
handling for those errors to ensure that the layers driving the retries
can handle that appropriately.
Signed-off-by: Jeff Layton <jeff.layton@primarydata.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

183d9e7b
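
The control flow this commit describes can be sketched in user-space C. This is a simplified illustration under stated assumptions, not the kernel code: the server is replaced by a scripted array of replies, `cached_at` is a hypothetical knob for "a usable segment shows up locally at this attempt", and the give-up timeout is modelled as a simple count (`max_trylater`). The point is that the caller re-searches before every attempt and only sends a call when the search comes up empty.

```c
#include <stddef.h>

/* Hypothetical reply classes standing in for the NFS4ERR_* values. */
enum lg_status { LG_OK, LG_DELAY, LG_TRYLATER, LG_FATAL };

struct lg_result {
    int got_layout;   /* 1 if a layout segment was obtained */
    int rpcs_sent;    /* number of simulated LAYOUTGET calls */
};

/*
 * Retry loop driven by the caller rather than by the RPC state machine.
 * replies[i] is the scripted answer to the i-th attempt; cached_at is the
 * attempt index at which a usable segment appears locally (-1 for never).
 */
struct lg_result layoutget_retry(const enum lg_status *replies, size_t n,
                                 int cached_at, int max_trylater)
{
    struct lg_result res = { 0, 0 };
    int trylater = 0;

    for (size_t attempt = 0; ; attempt++) {
        /* Step 1: re-search for a usable segment before sending anything. */
        if (cached_at >= 0 && attempt >= (size_t)cached_at) {
            res.got_layout = 1;
            return res;
        }
        /* Step 2: nothing usable, so send one call and wait for it. */
        if (attempt >= n)
            return res;              /* script exhausted: treat as failure */
        res.rpcs_sent++;
        switch (replies[attempt]) {
        case LG_OK:
            res.got_layout = 1;
            return res;
        case LG_DELAY:
            break;                   /* retryable: loop and re-search */
        case LG_TRYLATER:
            if (++trylater >= max_trylater)
                return res;          /* give up: caller sees "no layout" */
            break;
        default:
            return res;              /* non-retryable error */
        }
    }
}
```

With the script `{ LG_DELAY, LG_DELAY, LG_OK }` and no cached segment, the loop sends three calls. With `cached_at = 1` it sends one and then finds the segment on the re-search, which is the saving the commit message aims for.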

NFS: Add COPY nfs operation · 2e72448b

Authored by Anna Schumaker on May 21, 2013

This adds the copy_range file_ops function pointer used by the
sys_copy_range() function call. This patch only implements sync copies,
so if an async copy happens we decode the stateid and ignore it.
Signed-off-by: Anna Schumaker <bjschuma@netapp.com>

2e72448b

09 May, 2016 2 commits

nfs: per-name sillyunlink exclusion · 884be175

Authored by Al Viro on Apr 28, 2016

Use d_alloc_parallel() for sillyunlink/lookup exclusion and an
explicit rwsem (nfs_rmdir() being a writer and nfs_call_unlink()
a reader) for the rmdir/sillyunlink one.

That ought to make lookup/readdir/!O_CREAT atomic_open really
parallel on NFS.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

884be175

NFS: Save struct inode * inside nfs_commit_info to clarify usage of i_lock · fe238e60

Authored by Dave Wysochanski on Apr 01, 2016

Commit ea2cf228 created nfs_commit_info and saved &inode->i_lock inside
this NFS specific structure.  This obscures the usage of i_lock.
Instead, save struct inode * so later it's clear the spinlock taken is
i_lock.

Should be no functional change.
Signed-off-by: Dave Wysochanski <dwysocha@redhat.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

fe238e60

18 Feb, 2016 1 commit

pnfs/blocklayout: fix a memory leak when using vmalloc_to_page · c8975706

Authored by Kinglong Mee on Feb 01, 2016

unreferenced object 0xffffc90000abf000 (size 16900):
  comm "fsync02", pid 15765, jiffies 4297431627 (age 423.772s)
  hex dump (first 32 bytes):
    00 00 00 00 00 00 00 00 00 a0 c2 19 00 88 ff ff  ................
    00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
  backtrace:
    [<ffffffff8174d54e>] kmemleak_alloc+0x4e/0xb0
    [<ffffffff811b9b91>] __vmalloc_node_range+0x231/0x280
    [<ffffffff811b9c2a>] __vmalloc+0x4a/0x50
    [<ffffffffa02c9ec1>] ext_tree_prepare_commit+0x231/0x2e0 [blocklayoutdriver]
    [<ffffffffa02c700e>] bl_prepare_layoutcommit+0xe/0x10 [blocklayoutdriver]
    [<ffffffffa0596a6c>] pnfs_layoutcommit_inode+0x29c/0x330 [nfsv4]
    [<ffffffffa0596b13>] pnfs_generic_sync+0x13/0x20 [nfsv4]
    [<ffffffffa0585188>] nfs4_file_fsync+0x58/0x150 [nfsv4]
    [<ffffffff81228e5b>] vfs_fsync_range+0x4b/0xb0
    [<ffffffff81228f1d>] do_fsync+0x3d/0x70
    [<ffffffff812291d0>] SyS_fsync+0x10/0x20
    [<ffffffff81757def>] entry_SYSCALL_64_fastpath+0x12/0x76
    [<ffffffffffffffff>] 0xffffffffffffffff

v2, add missing include header
Signed-off-by: Kinglong Mee <kinglongmee@gmail.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

c8975706

01 Jan, 2016 3 commits

NFSv4.1/pNFS: Don't queue up a new commit if the layout segment is invalid · b20135d0

Authored by Trond Myklebust on Dec 31, 2015

If the layout segment is invalid, then we should not be adding more
write requests to the commit list. Instead, those writes should be
replayed after requesting a new layout.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

b20135d0

NFS: Allow multiple commit requests in flight per file · af7cf057

Authored by Trond Myklebust on Sep 29, 2015

Allow synchronous RPC calls to wait for pending RPC calls to finish,
but also allow asynchronous ones to just fire off another commit.

With this patch, the xfstests generic/074 test completes in 226s
instead of 242s.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

af7cf057

NFS/pNFS: Fix up pNFS write reschedule layering violations and bugs · dc602dd7

Authored by Trond Myklebust on Dec 31, 2015

The flexfiles layout, in particular, seems to want to poke around in the
O_DIRECT flags when retransmitting.
This patch sets up an interface to allow it to call back into O_DIRECT
to handle retransmission correctly. It also fixes a potential bug whereby
we could change the behaviour of O_DIRECT if an error is already pending.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

dc602dd7

29 Dec, 2015 1 commit

pNFS: Add flag to track if we've called nfs4_ff_layout_stat_io_start_read/write · 37e9ed22

Authored by Trond Myklebust on Dec 22, 2015

Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

37e9ed22

24 Nov, 2015 1 commit

nfs: use sliding delay when LAYOUTGET gets NFS4ERR_DELAY · 91ab4b4d

Authored by Jeff Layton on Nov 19, 2015

When LAYOUTGET gets NFS4ERR_DELAY, we currently will wait 15s before
retrying the call. That is a _very_ long time, so add a timeout value to
struct nfs4_layoutget and pass nfs4_async_handle_error a pointer to it.
This allows the RPC engine to use a sliding delay window, instead of a
15s delay.
Signed-off-by: Jeff Layton <jeff.layton@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

91ab4b4d
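
A sliding delay window of the kind described above can be sketched as follows. This is a minimal illustration, not the kernel's implementation: the 15-second cap comes from the commit message, while the 100 ms starting value, the millisecond units, and the names `RETRY_MIN_MS`, `RETRY_MAX_MS`, and `sliding_delay` are assumptions made for this example.

```c
/* Assumed bounds in milliseconds; only the 15 s cap is from the text above. */
#define RETRY_MIN_MS 100L
#define RETRY_MAX_MS 15000L

/*
 * Return how long to wait before this retry and widen *timeout for the
 * next one. A zero or negative *timeout means "first retry".
 */
long sliding_delay(long *timeout)
{
    long wait;

    if (*timeout <= 0)
        *timeout = RETRY_MIN_MS;
    if (*timeout > RETRY_MAX_MS)
        *timeout = RETRY_MAX_MS;
    wait = *timeout;
    *timeout <<= 1;     /* double the window for the next retry */
    return wait;
}
```

Starting from a zeroed timeout, successive calls wait 100, 200, 400 ms and so on, until the window reaches the 15-second cap. Because the state lives behind a pointer, whoever owns the retry loop decides how long the window persists.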

04 Nov, 2015 1 commit

nfs: Remove unused xdr page offsets in getacl/setacl arguments · 8fbcf237

Authored by Andreas Gruenbacher on Nov 03, 2015

The arguments passed around for getacl and setacl xdr encoding, struct
nfs_setaclargs and struct nfs_getaclargs, both contain an array of
pages, an offset into the first page, and the length of the page data.
The offset is unused as it is always zero; remove it.
Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

8fbcf237

16 Oct, 2015 2 commits

nfs: get clone_blksize when probing fsinfo · 2a92ee92

Authored by Peng Tao on Sep 26, 2015

The NFSv4.2 CLONE operation is supposed to respect it.
Signed-off-by: Peng Tao <tao.peng@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

2a92ee92

nfs42: add CLONE xdr functions · 36022770

Authored by Peng Tao on Sep 26, 2015

xdr definitions per draft-ietf-nfsv4-minorversion2-38.txt
Signed-off-by: Peng Tao <tao.peng@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

36022770

08 Oct, 2015 1 commit

NFSv4: nfs4_async_handle_error should take a non-const nfs_server · 516285eb

Authored by Trond Myklebust on Sep 20, 2015

For symmetry with the synchronous handler, and so that we can potentially
handle errors such as NFS4ERR_BADNAME.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

516285eb

08 Sep, 2015 1 commit

NFSv4: Express delegation limit in units of pages · 7d160a6c

Authored by Trond Myklebust on Sep 05, 2015

Since we're tracking modifications to the page cache on a per-page
basis, it makes sense to express the limit to how much we may cache
in units of pages.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

7d160a6c
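
As a rough illustration of expressing a byte limit in page units, the sketch below converts once and then compares page counts directly. The 4 KiB page size, the round-down behaviour, and every name here are assumptions of this example, not details taken from the patch.

```c
#include <stdint.h>

#define SKETCH_PAGE_SHIFT 12   /* assumes 4 KiB pages */

/* Convert a byte limit to whole pages, rounding down. */
uint64_t limit_bytes_to_pages(uint64_t limit_bytes)
{
    return limit_bytes >> SKETCH_PAGE_SHIFT;
}

/* Nonzero if dirtying one more page would exceed the page limit. */
int over_page_limit(uint64_t dirty_pages, uint64_t limit_pages)
{
    return dirty_pages + 1 > limit_pages;
}
```

Once the limit is stored in pages, the check on the write path is a plain counter comparison with no per-write byte arithmetic.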

28 Aug, 2015 2 commits

NFS: Send attributes in OPEN request for NFS4_CREATE_EXCLUSIVE4_1 · 5334c5bd

Authored by Kinglong Mee on Aug 26, 2015

The client sends a SETATTR request after OPEN to update attributes.
When a file is created with S_ISGID set, the NFS server ignores the
S_ISGID bit in that SETATTR, treating it as a chmod without permission.

v3, same as v2.
Signed-off-by: Kinglong Mee <kinglongmee@gmail.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

5334c5bd

NFS: Get suppattr_exclcreat when getting server capabilities · 8c61282f

Authored by Kinglong Mee on Aug 26, 2015

Creating a file with attributes in NFS4_CREATE_EXCLUSIVE4_1 mode
depends on the suppattr_exclcreat attribute.

v3, same as v2.
Signed-off-by: Kinglong Mee <kinglongmee@gmail.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

8c61282f

20 Aug, 2015 1 commit

NFSv4: Enable delegated opens even when reboot recovery is pending · 2a606188

Authored by Trond Myklebust on Aug 19, 2015

Unlike the previous attempt, this takes into account the fact that
we may be calling it from the recovery thread itself. Detect this
by looking at what kind of open we're doing, and checking the state
of the NFS_DELEGATION_NEED_RECLAIM if it turns out we're doing a
reboot reclaim-type open.

Cc: Olga Kornievskaia <aglo@umich.edu>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

2a606188

24 Jun, 2015 1 commit

NFSv4.2/pnfs Add a LAYOUTSTATS rpc function · be3a5d23

Authored by Trond Myklebust on Jun 23, 2015

Reviewed-by: Jeff Layton <jeff.layton@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>
Signed-off-by: Peng Tao <tao.peng@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

be3a5d23

16 Jun, 2015 3 commits

nfs: make nfs4_init_uniform_client_string use a dynamically allocated buffer · 873e3851

Authored by Jeff Layton on Jun 09, 2015

Change the uniform client string generator to dynamically allocate the
NFSv4 client name string buffer. With this patch, we can eliminate the
buffers that are embedded within the "args" structs and simply use the
name string that is hanging off the client.

This uniform string case is a little simpler than the nonuniform since
we don't need to deal with RCU, but we do have two different cases,
depending on whether there is a uniquifier or not.
Signed-off-by: Jeff Layton <jeff.layton@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

873e3851
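
The idea of sizing the client name buffer at run time, with one branch for each of the two cases mentioned above, can be sketched in user-space C. The format strings and the names `format_owner` and `make_client_string` are hypothetical and chosen for illustration; the kernel's actual strings and helpers may differ.

```c
#include <stdio.h>
#include <stdlib.h>

/* Format the owner string; the two branches mirror the two cases
 * (with and without a uniquifier). Format strings are illustrative. */
static int format_owner(char *buf, size_t size,
                        unsigned int major, unsigned int minor,
                        const char *nodename, const char *uniquifier)
{
    if (uniquifier && uniquifier[0] != '\0')
        return snprintf(buf, size, "Linux NFSv%u.%u %s/%s",
                        major, minor, uniquifier, nodename);
    return snprintf(buf, size, "Linux NFSv%u.%u %s",
                    major, minor, nodename);
}

/*
 * Build the string in a heap buffer sized to fit exactly.
 * Returns NULL on failure; the caller frees the result.
 */
char *make_client_string(unsigned int major, unsigned int minor,
                         const char *nodename, const char *uniquifier)
{
    /* First pass with a NULL buffer only measures the required length. */
    int len = format_owner(NULL, 0, major, minor, nodename, uniquifier);
    char *buf;

    if (len < 0)
        return NULL;
    buf = malloc((size_t)len + 1);
    if (!buf)
        return NULL;
    format_owner(buf, (size_t)len + 1, major, minor, nodename, uniquifier);
    return buf;
}
```

Measuring first and allocating second means a long hostname can never overflow a fixed-size array, which is the class of problem the neighbouring EXCHANGE_ID buffer commit addresses.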

nfs: convert setclientid and exchange_id encoders to use clp->cl_owner_id · 3a6bb738

Authored by Jeff Layton on Jun 09, 2015

...instead of buffers that are part of their arg structs. We already
hold a reference to the client, so we might as well use the allocated
buffer. In the event that we can't allocate the clp->cl_owner_id, then
just return -ENOMEM.

Note too that we switch from a GFP_KERNEL allocation here to GFP_NOFS.
It's possible we could end up trying to do a SETCLIENTID or EXCHANGE_ID
in order to reclaim some memory, and the GFP_KERNEL allocations in the
existing code could cause recursion back into NFS reclaim.
Signed-off-by: Jeff Layton <jeff.layton@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

3a6bb738

nfs: increase size of EXCHANGE_ID name string buffer · 764ad8ba

Authored by Jeff Layton on Jun 09, 2015

The current buffer is much too small if you have a relatively long
hostname. Bring it up to the size of the one that SETCLIENTID has.

Cc: <stable@vger.kernel.org>
Reported-by: Michael Skralivetsky <michael.skralivetsky@primarydata.com>
Signed-off-by: Jeff Layton <jeff.layton@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

764ad8ba

24 Apr, 2015 1 commit

NFS: Don't zap caches on fallocate() · 9a51940b

Authored by Anna Schumaker on Mar 16, 2015

This patch adds a GETATTR to the end of ALLOCATE and DEALLOCATE
operations so we can set the updated inode size and change attribute
directly.  DEALLOCATE will still need to release pagecache pages, so
nfs42_proc_deallocate() now calls truncate_pagecache_range() before
contacting the server.
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

9a51940b

28 Mar, 2015 1 commit

NFSv4.1: Allow getdeviceinfo to return notification info back to caller · 4e590803

Authored by Trond Myklebust on Mar 09, 2015

We are only allowed to cache deviceinfo if the server supports notifications
and actually promises to call us back when changes occur. Right now, we
request those notifications, but then we don't check the server's reply.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

4e590803
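
The check the commit message calls for, comparing what the client requested with what the server actually promised, might look like the sketch below. The bit values and names are hypothetical, and the policy "cache only if every requested notification type was granted" is an assumption of this example.

```c
#include <stdint.h>

/* Hypothetical notification bits, for illustration only. */
#define SKETCH_NOTIFY_CHANGE (1u << 1)
#define SKETCH_NOTIFY_DELETE (1u << 2)

/*
 * Device info may be cached only if the server's reply promises every
 * notification type the client asked for. Returns nonzero if cacheable.
 */
int deviceinfo_cacheable(uint32_t requested, uint32_t granted)
{
    return requested != 0 && (granted & requested) == requested;
}
```

A caller would pass the mask it sent in the request and the mask decoded from the reply, then skip caching whenever the function returns zero.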

19 Feb, 2015 2 commits

NFSv4.1: Clean up bind_conn_to_session · 71a097c6

Authored by Trond Myklebust on Feb 18, 2015

We don't need to fake up an entire session in order to retrieve the arguments.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

71a097c6

NFSv4.1: Clean up create_session · 79969dd1

Authored by Trond Myklebust on Feb 18, 2015

Don't decode directly into the shared struct session.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

79969dd1

14 Feb, 2015 1 commit

NFS: struct nfs_commit_info.lock must always point to inode->i_lock · f4086a3d

Authored by Trond Myklebust on Feb 13, 2015

Commit 411a99ad (nfs: clear_request_commit while holding i_lock)
assumes that the nfs_commit_info always points to the inode->i_lock.
For historical reasons, that is not the case for O_DIRECT writes.

Cc: Weston Andros Adamson <dros@primarydata.com>
Fixes: 411a99ad ("nfs: clear_request_commit while holding i_lock")
Cc: stable@vger.kernel.org # 3.17.x
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

f4086a3d

06 Feb, 2015 2 commits

NFSv4.1: Pin the inode and super block in asynchronous layoutreturns · 5a0ec8ac

Authored by Trond Myklebust on Feb 05, 2015

If we're sending an asynchronous layoutreturn, then we need to ensure
that the inode and the super block remain pinned.

Cc: Peng Tao <tao.peng@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>
Reviewed-by: Peng Tao <tao.peng@primarydata.com>

5a0ec8ac

NFSv4.1: Pin the inode and super block in asynchronous layoutcommit · 472e2594

Authored by Trond Myklebust on Feb 05, 2015

If we're sending an asynchronous layoutcommit, then we need to ensure
that the inode and the super block remain pinned.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>
Reviewed-by: Peng Tao <tao.peng@primarydata.com>

472e2594

04 Feb, 2015 6 commits

NFSv4.1: Ask for no delegation on OPEN if using O_DIRECT · 6ae37339

Authored by Trond Myklebust on Jan 30, 2015

If we're using NFSv4.1, then we have the ability to let the server know
whether or not we believe that returning a delegation as part of our OPEN
request would be useful.
The feature needs to be used with care, since the client sending the request
doesn't necessarily know how other clients are using that file, and how
they may be affected by the delegation.
For this reason, our initial use of the feature will be to let the server
know when the client believes that handing out a delegation would not be
useful.
The first application for this function is when opening the file using
O_DIRECT.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

6ae37339

nfs41: add range to layoutreturn args · 15eb67c1

Authored by Peng Tao on Nov 17, 2014

So that callers can specify which range to return.
Signed-off-by: Peng Tao <tao.peng@primarydata.com>
Signed-off-by: Tom Haynes <loghyr@primarydata.com>

15eb67c1

nfs: add mirroring support to pgio layer · a7d42ddb

Authored by Weston Andros Adamson on Sep 19, 2014

This patch adds mirrored write support to the pgio layer. The default
is to use one mirror, but pgio callers may define callbacks to change
this to any value up to the (arbitrarily selected) limit of 16.

The basic idea is to break out members of nfs_pageio_descriptor that cannot
be shared between mirrored DSes and put them in a new structure.
Signed-off-by: Weston Andros Adamson <dros@primarydata.com>

a7d42ddb
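
The structural split described above, per-mirror state pulled out of the descriptor with a default of one mirror and a cap of 16, can be sketched like this. All type, field, and function names are hypothetical stand-ins; only the default of one and the limit of 16 come from the commit message.

```c
#include <stddef.h>
#include <string.h>

#define PGIO_MIRROR_MAX 16   /* the arbitrarily selected limit named above */

/* State that cannot be shared between mirrored data servers. */
struct pgio_mirror {
    size_t bytes_queued;     /* bytes queued for this mirror so far */
};

/* Descriptor: shared fields plus one pgio_mirror slot per mirror. */
struct pgio_desc {
    unsigned int       mirror_count;
    struct pgio_mirror mirrors[PGIO_MIRROR_MAX];
};

/* Initialise with the default of a single mirror. */
void pgio_init(struct pgio_desc *desc)
{
    memset(desc, 0, sizeof(*desc));
    desc->mirror_count = 1;
}

/* Apply a count chosen by a caller callback; -1 if it is out of range. */
int pgio_set_mirror_count(struct pgio_desc *desc, unsigned int count)
{
    if (count == 0 || count > PGIO_MIRROR_MAX)
        return -1;
    desc->mirror_count = count;
    return 0;
}

/* Queue the same write to every active mirror. */
void pgio_queue_write(struct pgio_desc *desc, size_t len)
{
    for (unsigned int i = 0; i < desc->mirror_count; i++)
        desc->mirrors[i].bytes_queued += len;
}
```

Keeping the per-mirror fields in their own array lets the non-mirrored path stay a one-element special case of the same code.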

nfs: rename pgio header ds_idx to ds_commit_idx · 6cccbb6f

Authored by Weston Andros Adamson on Sep 16, 2014

'ds_commit_idx' is a better name - it is used to select the right
commit bucket for pnfs.
Signed-off-by: Weston Andros Adamson <dros@primarydata.com>

6cccbb6f

nfs41: pass iomode through layoutreturn args · 4579d6b8

Authored by Peng Tao on Sep 06, 2014

So that it is possible to return layouts of a specific iomode.
Signed-off-by: Peng Tao <tao.peng@primarydata.com>
Signed-off-by: Tom Haynes <Thomas.Haynes@primarydata.com>
4579d6b8

nfs: save server READ/WRITE/COMMIT status · aabff4dd

Authored by Peng Tao on Aug 27, 2014

The flexfiles layout wants to use them to report DS I/O status.
Signed-off-by: Peng Tao <tao.peng@primarydata.com>
Signed-off-by: Tom Haynes <Thomas.Haynes@primarydata.com>
aabff4dd

25 Jan, 2015 2 commits

NFSv4: Update of VFS byte range lock must be atomic with the stateid update · c69899a1

Authored by Trond Myklebust on Jan 24, 2015

Ensure that we test that the lock stateid remained unchanged while we
were updating the VFS tracking of the byte range lock. Have the process
replay the lock to the server if we detect that was not the case.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

c69899a1

NFSv4: Fix lock on-wire reordering issues · 425c1d4e

Authored by Trond Myklebust on Jan 24, 2015

This patch ensures that the server cannot reorder our LOCK/LOCKU
requests if they are sent in parallel on the wire.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

425c1d4e

24 Jan, 2015 1 commit

NFSv4: Fix an atomicity problem in CLOSE · 566fcec6

Authored by Trond Myklebust on Jan 23, 2015

If we are to remove the serialisation of OPEN/CLOSE, then we need to
ensure that the stateid sent as part of a CLOSE operation does not
change after we test the state in nfs4_close_prepare.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

566fcec6

26 Nov, 2014 1 commit

nfs: Add ALLOCATE support · f4ac1674

Authored by Anna Schumaker on Nov 25, 2014

This patch adds support for using the NFS v4.2 operation ALLOCATE to
preallocate data in a file.
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

f4ac1674

13 Nov, 2014 1 commit

nfs: fix pnfs direct write memory leak · 8c393f9a

Authored by Peng Tao on Nov 05, 2014

For pNFS direct writes, the layout driver may dynamically allocate
ds_cinfo.buckets, so we need to take care to free them when freeing dreq.

Ideally this would be done inside the layout driver, where ds_cinfo.buckets
are allocated. But the buckets are attached to dreq and reused across LD IO
iterations, so I feel it's OK to free them in the generic layer.

Cc: stable@vger.kernel.org [v3.4+]
Signed-off-by: Peng Tao <tao.peng@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

8c393f9a
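
The ownership pattern in this fix, where the layout driver allocates and the generic layer frees, can be sketched in user-space C. The struct and function names below are hypothetical simplifications; only `ds_cinfo.buckets` and `dreq` echo names from the commit message.

```c
#include <stdlib.h>

struct commit_bucket { int pending; };

struct ds_commit_info {
    int nbuckets;
    struct commit_bucket *buckets;   /* allocated lazily by the layout driver */
};

struct direct_req {
    struct ds_commit_info ds_cinfo;
};

/* Layout-driver side: allocate once, then reuse across IO iterations. */
int ld_alloc_buckets(struct direct_req *dreq, int n)
{
    if (dreq->ds_cinfo.buckets)
        return 0;                    /* already attached to this request */
    dreq->ds_cinfo.buckets = calloc((size_t)n, sizeof(struct commit_bucket));
    if (!dreq->ds_cinfo.buckets)
        return -1;
    dreq->ds_cinfo.nbuckets = n;
    return 0;
}

/* Generic side: free the request together with any buckets attached to it. */
void direct_req_free(struct direct_req *dreq)
{
    if (!dreq)
        return;
    free(dreq->ds_cinfo.buckets);    /* free(NULL) is a no-op */
    free(dreq);
}
```

Because the buckets outlive any single driver iteration, the only point where it is known to be safe to release them is when the request itself goes away, which is why the free sits in the generic teardown path.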