提交 · d07fbb8fdf57c4af28679ad7c4b2542e78ca7218 · openanolis / cloud-kernel

16 2月, 2016 2 次提交

pNFS: Always set NFS_LAYOUT_RETURN_REQUESTED with lo->plh_return_iomode · e0fa0d01

由 Trond Myklebust 提交于 2月 15, 2016

When setting the layout return mode, we must always also set the
NFS_LAYOUT_RETURN_REQUESTED flag to ensure that we send a layoutreturn.
Otherwise pnfs_error_mark_layout_for_return() could set the mode, but
fail to send the layoutreturn because another is already in flight.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

e0fa0d01

pNFS: Fix pnfs_mark_matching_lsegs_return() · 2f215968

由 Trond Myklebust 提交于 2月 15, 2016

We don't need to schedule a layoutreturn if the layout segment can
be freed immediately.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2f215968

28 1月, 2016 1 次提交

NFS: Cleanup - rename NFS_LAYOUT_RETURN_BEFORE_CLOSE · 2370abda

由 Trond Myklebust 提交于 1月 27, 2016

NFS_LAYOUT_RETURN_BEFORE_CLOSE is being used to signal that a
layoutreturn is needed, either due to a layout recall or to a
layout error. Rename it to NFS_LAYOUT_RETURN_REQUESTED in order
to clarify its purpose.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2370abda

27 1月, 2016 1 次提交

pNFS: Fix missing layoutreturn calls · 13c13a6a

由 Trond Myklebust 提交于 1月 26, 2016

The layoutreturn code currently relies on pnfs_put_lseg() to initiate the
RPC call when conditions are right. A problem arises when we want to
free the layout segment from inside an inode->i_lock section (e.g. in
pnfs_clear_request_commit()), since we cannot sleep.

The workaround is to move the actual call to pnfs_send_layoutreturn()
to pnfs_put_layout_hdr(), which doesn't have this restriction.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

13c13a6a

05 1月, 2016 7 次提交
- T
  NFSv4.1/pNFS: Cleanup constify struct pnfs_layout_range arguments · 506c0d68
  由 Trond Myklebust 提交于 1月 04, 2016
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  506c0d68
- T
  NFSv4.1/pnfs: Cleanup copying of pnfs_layout_range structures · e144e539
  由 Trond Myklebust 提交于 1月 04, 2016
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  e144e539
- T
  NFSv4.1/pNFS: Cleanup pnfs_mark_matching_lsegs_invalid() · 71b39854
  由 Trond Myklebust 提交于 1月 04, 2016
```
Make it more obvious what we're returning...
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  71b39854
- T
  NFSv4.1/pNFS: pnfs_error_mark_layout_for_return() must always return layout · 10335556
  由 Trond Myklebust 提交于 1月 04, 2016
```
Fix a bug whereby if all the layout segments could be immediately freed,
the call to pnfs_error_mark_layout_for_return() would never result in
a layoutreturn.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  10335556
- T
  NFSv4.1/pNFS: pnfs_mark_matching_lsegs_return() should set the iomode · 5c97f5de
  由 Trond Myklebust 提交于 1月 04, 2016
```
If pnfs_mark_matching_lsegs_return() needs to mark a layout segment for
return, then it must also set the return iomode.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  5c97f5de
- T
  NFSv4.1/pNFS: Use nfs4_stateid_copy for copying stateids · 50f563ef
  由 Trond Myklebust 提交于 1月 04, 2016
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  50f563ef
- T
  NFSv4.1/pNFS: Don't pass stateids by value to pnfs_send_layoutreturn() · ed429d6b
  由 Trond Myklebust 提交于 1月 04, 2016
```
A stateid is a structure, pass it as a pointer.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  ed429d6b
01 1月, 2016 1 次提交

NFSv4.1/pNFS: Don't queue up a new commit if the layout segment is invalid · b20135d0

由 Trond Myklebust 提交于 12月 31, 2015

If the layout segment is invalid, then we should not be adding more
write requests to the commit list. Instead, those writes should be
replayed after requesting a new layout.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

b20135d0

29 12月, 2015 7 次提交

pNFS: If we have to delay the layout callback, mark the layout for return · fc7ff367

由 Trond Myklebust 提交于 12月 28, 2015

If the client needs to delay the layout callback, then speed up the recall
process by marking the remaining layout segments to be actively returned
by the client.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

fc7ff367

NFSv4.1/pNFS: Add a helper to mark the layout as returned · 0654cc72

由 Trond Myklebust 提交于 12月 28, 2015

This ensures that we don't reuse the stateid if a layout return or
implied layout return means that we've returned all layout segments
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

0654cc72

pNFS/flexfiles: Don't mark the entire layout as failed, when returning it · b9fc773e

由 Trond Myklebust 提交于 12月 15, 2015

In pNFS/flexfiles, we want to return the layout without necessarily marking
it as having completely failed. We therefore move the call to
pnfs_layout_io_set_failed() out of pnfs_error_mark_layout_for_return(),
and then ensura that pNFS/files layout calls it separately.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

b9fc773e

pNFS/flexfiles: Don't prevent flexfiles client from retrying LAYOUTGET · 2e5b29f0

由 Trond Myklebust 提交于 12月 14, 2015

Fix a bug in which flexfiles clients are falling back to I/O through the
MDS even when the FF_FLAGS_NO_IO_THRU_MDS flag is set.

The flexfiles client will always report errors through the LAYOUTRETURN
and/or LAYOUTERROR mechanisms, so it should normally be safe for it
to retry the LAYOUTGET until it fails or succeeds.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2e5b29f0

nfs: handle request add failure properly · 0bcbf039

由 Peng Tao 提交于 12月 05, 2015

When we fail to queue a read page to IO descriptor,
we need to clean it up otherwise it is hanging around
preventing nfs module from being removed.

When we fail to queue a write page to IO descriptor,
we need to clean it up and also save the failure status
to open context. Then at file close, we can try to write
pages back again and drop the page if it fails to writeback
in .launder_page, which will be done in the next patch.
Signed-off-by: NPeng Tao <tao.peng@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

0bcbf039

nfs: centralize pgio error cleanup · 2bff2288

由 Peng Tao 提交于 12月 05, 2015

In case we fail during setting things up for read/write IO, set
pg_error in IO descriptor and do the cleanup in nfs_pageio_add_request,
where we clean up all pages that are still hanging around on the IO
descriptor.
Signed-off-by: NPeng Tao <tao.peng@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2bff2288

NFS41: pop some layoutget errors to application · d600ad1f

由 Peng Tao 提交于 12月 04, 2015

For ERESTARTSYS/EIO/EROFS/ENOSPC/E2BIG in layoutget, we
should just bail out instead of hiding the error and
retrying inband IO.

Change all the call sites to pop the error all the way up.
Signed-off-by: NPeng Tao <tao.peng@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

d600ad1f

28 12月, 2015 2 次提交

pNFS: Modify pnfs_update_layout tracepoints to use layout stateid · f4848303

由 Trond Myklebust 提交于 12月 26, 2015

Instead of displaying a layout segment pointer in these tracepoints,
let's use the layout stateid, now that Olga gave us a set of tools for
displaying them.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

f4848303

nfs: add new tracepoint for pnfs_update_layout · 9a4bf31d

由 Jeff Layton 提交于 12月 10, 2015

pnfs_update_layout is really the "nexus" of layout handling. If it
returns NULL then we end up going through the MDS. This patch adds
some tracepoints to that function that allow us to determine the
cause when we end up going through the MDS unexpectedly.
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

9a4bf31d

14 12月, 2015 1 次提交

sched/wait: Fix the signal handling fix · dfd01f02

由 Peter Zijlstra 提交于 12月 13, 2015

Jan Stancek reported that I wrecked things for him by fixing things for
Vladimir :/

His report was due to an UNINTERRUPTIBLE wait getting -EINTR, which
should not be possible, however my previous patch made this possible by
unconditionally checking signal_pending().

We cannot use current->state as was done previously, because the
instruction after the store to that variable it can be changed.  We must
instead pass the initial state along and use that.

Fixes: 68985633 ("sched/wait: Fix signal handling in bit wait helpers")
Reported-by: NJan Stancek <jstancek@redhat.com>
Reported-by: NChris Mason <clm@fb.com>
Tested-by: NJan Stancek <jstancek@redhat.com>
Tested-by: NVladimir Murzin <vladimir.murzin@arm.com>
Tested-by: NChris Mason <clm@fb.com>
Reviewed-by: NPaul Turner <pjt@google.com>
Cc: Ingo Molnar <mingo@kernel.org>
Cc: tglx@linutronix.de
Cc: Oleg Nesterov <oleg@redhat.com>
Cc: hpa@zytor.com
Signed-off-by: NPeter Zijlstra (Intel) <peterz@infradead.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

dfd01f02

26 11月, 2015 1 次提交

nfs4: resend LAYOUTGET when there is a race that changes the seqid · 4f2e9dce

由 Jeff Layton 提交于 11月 25, 2015

pnfs_layout_process will check the returned layout stateid against what
the kernel has in-core. If it turns out that the stateid we received is
older, then we should resend the LAYOUTGET instead of falling back to
MDS I/O.
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Cc: stable@vger.kernel.org # 3.18+
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

4f2e9dce

22 10月, 2015 1 次提交

NFSv4.1/pnfs: Retry through MDS when getting bad length of data · f8417b48

由 Kinglong Mee 提交于 10月 16, 2015

If non rpc-based layout driver return bad length of data, nfs retries
by calling rpc_restart_call_prepare() that cause an NULL reference panic.

This patch lets nfs retry through MDS for non rpc-based layout driver
return bad length of data.

[13034.883329] BUG: unable to handle kernel NULL pointer dereference at           (null)
[13034.884902] IP: [<ffffffffa00db372>] rpc_restart_call_prepare+0x62/0x90 [sunrpc]
[13034.886558] PGD 0
[13034.888126] Oops: 0000 [#1] KASAN
[13034.889710] Modules linked in: blocklayoutdriver(OE) nfsv4(OE) nfs(OE) fscache(E) nfsd(OE) xfs libcrc32c coretemp btrfs crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel ppdev vmw_balloon auth_rpcgss shpchp nfs_acl lockd vmw_vmci parport_pc xor raid6_pq grace parport sunrpc i2c_piix4 vmwgfx drm_kms_helper ttm drm mptspi e1000 serio_raw scsi_transport_spi mptscsih mptbase ata_generic pata_acpi [last unloaded: fscache]
[13034.898260] CPU: 0 PID: 10112 Comm: kworker/0:1 Tainted: G           OE   4.3.0-rc5+ #279
[13034.899932] Hardware name: VMware, Inc. VMware Virtual Platform/440BX Desktop Reference Platform, BIOS 6.00 07/02/2015
[13034.903342] Workqueue: events bl_read_cleanup [blocklayoutdriver]
[13034.905059] task: ffff88006a9148c0 ti: ffff880035e90000 task.ti: ffff880035e90000
[13034.906827] RIP: 0010:[<ffffffffa00db372>]  [<ffffffffa00db372>] rpc_restart_call_prepare+0x62/0x90 [sunrpc]
[13034.910522] RSP: 0018:ffff880035e97b58  EFLAGS: 00010282
[13034.912378] RAX: fffffbfff04a5a94 RBX: ffff880068fe4858 RCX: 0000000000000003
[13034.914339] RDX: dffffc0000000000 RSI: 0000000000000003 RDI: 0000000000000282
[13034.916236] RBP: ffff880035e97b68 R08: 0000000000000001 R09: 0000000000000001
[13034.918229] R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000000000
[13034.920007] R13: ffff880068fe4858 R14: ffff880068fe4a60 R15: 0000000000001000
[13034.921845] FS:  0000000000000000(0000) GS:ffffffff82247000(0000) knlGS:0000000000000000
[13034.923645] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[13034.925525] CR2: 0000000000000000 CR3: 00000000063dd000 CR4: 00000000001406f0
[13034.932808] Stack:
[13034.934813]  ffff880068fe4780 0000000000001000 ffff880035e97ba8 ffffffffa08800d2
[13034.936675]  ffffffffa088029d ffff880068fe4780 ffff880068fe4858 ffffffffa089c0a0
[13034.938593]  ffff880068fe47e0 ffff88005d59faf0 ffff880035e97be0 ffffffffa087e08f
[13034.940454] Call Trace:
[13034.942388]  [<ffffffffa08800d2>] nfs_readpage_result+0x112/0x200 [nfs]
[13034.944317]  [<ffffffffa088029d>] ? nfs_readpage_done+0xdd/0x160 [nfs]
[13034.946267]  [<ffffffffa087e08f>] nfs_pgio_result+0x9f/0x120 [nfs]
[13034.948166]  [<ffffffffa09266cc>] pnfs_ld_read_done+0x7c/0x1e0 [nfsv4]
[13034.950247]  [<ffffffffa03b07ee>] bl_read_cleanup+0x2e/0x60 [blocklayoutdriver]
[13034.952156]  [<ffffffff810ebf62>] process_one_work+0x412/0x870
[13034.954102]  [<ffffffff810ebe84>] ? process_one_work+0x334/0x870
[13034.955949]  [<ffffffff810ebb50>] ? queue_delayed_work_on+0x40/0x40
[13034.957985]  [<ffffffff810ec441>] worker_thread+0x81/0x6a0
[13034.959817]  [<ffffffff810ec3c0>] ? process_one_work+0x870/0x870
[13034.961785]  [<ffffffff810f43bd>] kthread+0x17d/0x1a0
[13034.963544]  [<ffffffff810f4240>] ? kthread_create_on_node+0x330/0x330
[13034.965479]  [<ffffffff81100428>] ? finish_task_switch+0x88/0x220
[13034.967223]  [<ffffffff810f4240>] ? kthread_create_on_node+0x330/0x330
[13034.968929]  [<ffffffff81b6ae5f>] ret_from_fork+0x3f/0x70
[13034.970534]  [<ffffffff810f4240>] ? kthread_create_on_node+0x330/0x330
[13034.972176] Code: c7 43 50 40 84 0d a0 e8 3d fe 1c e1 48 8d 7b 58 c7 83 e4 00 00 00 00 00 00 00 e8 ca fe 1c e1 4c 8b 63 58 4c 89 e7 e8 be fe 1c e1 <49> 83 3c 24 00 74 12 48 c7 43 50 f0 a2 0e a0 b8 01 00 00 00 5b
[13034.977148] RIP  [<ffffffffa00db372>] rpc_restart_call_prepare+0x62/0x90 [sunrpc]
[13034.978780]  RSP <ffff880035e97b58>
[13034.980399] CR2: 0000000000000000
Signed-off-by: NKinglong Mee <kinglongmee@gmail.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

f8417b48

23 9月, 2015 1 次提交

NFS41: make close wait for layoutreturn · 500d701f

由 Peng Tao 提交于 9月 22, 2015

If we send a layoutreturn asynchronously before close, the close
might reach server first and layoutreturn would fail with BADSTATEID
because there is nothing keeping the layout stateid alive.

Also do not pretend sending layoutreturn if we are not.
Signed-off-by: NPeng Tao <tao.peng@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

500d701f

31 8月, 2015 2 次提交

NFSv4.1/pNFS: Don't request a minimal read layout beyond the end of file · 2d89a1d3

由 Trond Myklebust 提交于 8月 31, 2015

If we have a read layout, then sanity check the minimal layout length
so that it does not extend beyond the end of file.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2d89a1d3

T
NFSv4.1/pnfs: Don't ask for a read layout for an empty file. · 4ae93560
由 Trond Myklebust 提交于 8月 31, 2015
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
4ae93560

28 8月, 2015 1 次提交

NFSv4.1/pNFS: pnfs_mark_matching_lsegs_return must notify of layout return · 0bdb8fa6

由 Trond Myklebust 提交于 8月 27, 2015

It's not sufficient to just mark the layout segment for layout return. We
also need to set the NFS_LAYOUT_RETURN_BEFORE_CLOSE flag in the layout header.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

0bdb8fa6

26 8月, 2015 5 次提交

NFSv4.1/pnfs: Allow pNFS device drivers to customise layout segment insertion · 03772d2f

由 Trond Myklebust 提交于 8月 25, 2015

This is needed in order to allow merging of contiguous layout segments,
and also to correct the ordering of layouts for those device drivers that
don't necessarily want to place the read-write layouts first.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

03772d2f

T
NFSv4.1/pnfs: Add sanity check for the layout range returned by the server · 540d9864
由 Trond Myklebust 提交于 8月 25, 2015
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
540d9864

NFSv4.2/pnfs: Make the layoutstats timer configurable · bbf58bf3

由 Trond Myklebust 提交于 8月 24, 2015

Allow advanced users to set the layoutstats timer in order to lengthen
or shorten the period between layoutstat transmissions to the server.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

bbf58bf3

NFS41: remove NFS_LAYOUT_ROC flag · 3976143b

由 Peng Tao 提交于 8月 21, 2015

If we return delegation before closing, we fail to do roc check
during close because NFS_LAYOUT_ROC is cleared by delegreturn
and it causes layouts to be still hanging around after delegreturn
+ close, which is a voilation against protocol.
Signed-off-by: NPeng Tao <tao.peng@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

3976143b

T
NFSv4.1/pnfs: Add a tracepoint for return-on-close events · 6a463beb
由 Trond Myklebust 提交于 8月 20, 2015
```
Allow tracing of return-on-close.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
6a463beb

20 8月, 2015 1 次提交
- T
  pNFS: Fix an unused variable warning in pnfs_roc_get_barrier · c7406249
  由 Trond Myklebust 提交于 8月 19, 2015
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  c7406249
19 8月, 2015 2 次提交

NFS41: make sure sending LAYOUTRETURN before close if marked so · e755d638

由 Peng Tao 提交于 8月 19, 2015

If layout is marked by NFS_LAYOUT_RETURN_BEFORE_CLOSE, we should always
send LAYOUTRETURN before close, and we don't need to do ROC drain if we
do send LAYOUTRETURN.
Signed-off-by: NPeng Tao <tao.peng@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

e755d638

NFSv4.1/pnfs: Fix a close/delegreturn hang when return-on-close is set · 4ff376fe

由 Trond Myklebust 提交于 8月 18, 2015

The helper pnfs_roc() has already verified that we have no delegations,
and no further open files, hence no outstanding I/O and it has marked
all the return-on-close lsegs as being invalid.
Furthermore, it sets the NFS_LAYOUT_RETURN bit, thus serialising the
close/delegreturn with all future layoutget calls on this inode.

The checks in pnfs_roc_drain() for valid layout segments are therefore
redundant: those cannot exist until another layoutget completes.
The other check for whether or not NFS_LAYOUT_RETURN is set, actually
causes a hang, since we already know that we hold that flag.

To fix, we therefore strip out all the functionality in pnfs_roc_drain()
except the retrieval of the barrier state, and then rename the function
accordingly.
Reported-by: NChristoph Hellwig <hch@infradead.org>
Fixes: 5c4a79fb ("Don't prevent layoutgets when doing return-on-close")
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

4ff376fe

13 8月, 2015 4 次提交

NFSv4.1/pnfs: Remove redundant wakeup in pnfs_send_layoutreturn() · 58830550

由 Trond Myklebust 提交于 8月 04, 2015

pnfs_clear_layoutreturn_waitbit() should already be calling
rpc_wake_up(&NFS_SERVER(ino)->roc_rpcwaitq) for us.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

58830550

NFSv4.1/pnfs: Remove redundant check in pnfs_layoutgets_blocked() · e1c06f80

由 Trond Myklebust 提交于 8月 04, 2015

layoutget now should already be serialised w.r.t. layout returns
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

e1c06f80

NFSv4.1/pnfs: Remove redundant lo->plh_block_lgets in layoutreturn · 2d8ae84f

由 Trond Myklebust 提交于 8月 04, 2015

The NFS_LAYOUT_RETURN bit already suffices to ensure that layoutget
is blocked.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2d8ae84f

NFSv4.1/pnfs: Don't prevent layoutgets when doing return-on-close · 5c4a79fb

由 Trond Myklebust 提交于 8月 04, 2015

If there is an outstanding return-on-close, then we just want new
layoutget requests to wait rather than fail.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

5c4a79fb

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功