提交 · ecebb80bf3ee8c5f3172f00bb17ba55f9e3ae24f · openanolis / cloud-kernel

18 5月, 2016 2 次提交

pnfs: only tear down lsegs that precede seqid in LAYOUTRETURN args · 6d597e17

由 Jeff Layton 提交于 5月 17, 2016

LAYOUTRETURN is "special" in that servers and clients are expected to
work with old stateids. When the client sends a LAYOUTRETURN with an old
stateid in it then the server is expected to only tear down layout
segments that were present when that seqid was current. Ensure that the
client handles its accounting accordingly.
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

6d597e17

Fixing oops in callback path · c2985d00

由 Olga Kornievskaia 提交于 5月 10, 2016

Commit 80f96427 ("NFSv4.x: Enforce the ca_maxreponsesize_cached
on the back channel") causes an oops when it receives a callback with
cachethis=yes.

[  109.667378] BUG: unable to handle kernel NULL pointer dereference at 00000000000002c8
[  109.669476] IP: [<ffffffffa08a3e68>] nfs4_callback_compound+0x4f8/0x690 [nfsv4]
[  109.671216] PGD 0
[  109.671736] Oops: 0000 [#1] SMP
[  109.705427] CPU: 1 PID: 3579 Comm: nfsv4.1-svc Not tainted 4.5.0-rc1+ #1
[  109.706987] Hardware name: VMware, Inc. VMware Virtual Platform/440BX Desktop Reference Platform, BIOS 6.00 05/20/2014
[  109.709468] task: ffff8800b4408000 ti: ffff88008448c000 task.ti: ffff88008448c000
[  109.711207] RIP: 0010:[<ffffffffa08a3e68>]  [<ffffffffa08a3e68>] nfs4_callback_compound+0x4f8/0x690 [nfsv4]
[  109.713521] RSP: 0018:ffff88008448fca0  EFLAGS: 00010286
[  109.714762] RAX: ffff880081ee202c RBX: ffff8800b7b5b600 RCX: 0000000000000001
[  109.716427] RDX: 0000000000000008 RSI: 0000000000000008 RDI: 0000000000000000
[  109.718091] RBP: ffff88008448fda8 R08: 0000000000000000 R09: 000000000b000000
[  109.719757] R10: ffff880137786000 R11: ffff8800b7b5b600 R12: 0000000001000000
[  109.721415] R13: 0000000000000002 R14: 0000000053270000 R15: 000000000000000b
[  109.723061] FS:  0000000000000000(0000) GS:ffff880139640000(0000) knlGS:0000000000000000
[  109.724931] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  109.726278] CR2: 00000000000002c8 CR3: 0000000034d50000 CR4: 00000000001406e0
[  109.727972] Stack:
[  109.728465]  ffff880081ee202c ffff880081ee201c 000000008448fcc0 ffff8800baccb800
[  109.730349]  ffff8800baccc800 ffffffffa08d0380 0000000000000000 0000000000000000
[  109.732211]  ffff8800b7b5b600 0000000000000001 ffffffff81d073c0 ffff880081ee3090
[  109.734056] Call Trace:
[  109.734657]  [<ffffffffa03795d4>] svc_process_common+0x5c4/0x6c0 [sunrpc]
[  109.736267]  [<ffffffffa0379a4c>] bc_svc_process+0x1fc/0x360 [sunrpc]
[  109.737775]  [<ffffffffa08a2c2c>] nfs41_callback_svc+0x10c/0x1d0 [nfsv4]
[  109.739335]  [<ffffffff810cb380>] ? prepare_to_wait_event+0xf0/0xf0
[  109.740799]  [<ffffffffa08a2b20>] ? nfs4_callback_svc+0x50/0x50 [nfsv4]
[  109.742349]  [<ffffffff810a6998>] kthread+0xd8/0xf0
[  109.743495]  [<ffffffff810a68c0>] ? kthread_park+0x60/0x60
[  109.744776]  [<ffffffff816abc4f>] ret_from_fork+0x3f/0x70
[  109.746037]  [<ffffffff810a68c0>] ? kthread_park+0x60/0x60
[  109.747324] Code: cc 45 31 f6 48 8b 85 00 ff ff ff 44 89 30 48 8b 85 f8 fe ff ff 44 89 20 48 8b 9d 38 ff ff ff 48 8b bd 30 ff ff ff 48 85 db 74 4c <4c> 8b af c8 02 00 00 4d 8d a5 08 02 00 00 49 81 c5 98 02 00 00
[  109.754361] RIP  [<ffffffffa08a3e68>] nfs4_callback_compound+0x4f8/0x690 [nfsv4]
[  109.756123]  RSP <ffff88008448fca0>
[  109.756951] CR2: 00000000000002c8
[  109.757738] ---[ end trace 2b8555511ab5dfb4 ]---
[  109.758819] Kernel panic - not syncing: Fatal exception
[  109.760126] Kernel Offset: disabled
[  118.938934] ---[ end Kernel panic - not syncing: Fatal exception

It doesn't unlock the table nor does it set the cps->clp pointer which
is later needed by nfs4_cb_free_slot().

Fixes: 80f96427 ("NFSv4.x: Enforce the ca_maxresponsesize_cached ...")
CC: stable@vger.kernel.org
Signed-off-by: NOlga Kornievskaia <kolga@netapp.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

c2985d00

02 2月, 2016 1 次提交

NFSv4.x: Fix NFS4ERR_RETRY_UNCACHED_REP in nfs4_callback_sequence · e5003b2f

由 Trond Myklebust 提交于 2月 01, 2016

We need to initialize cb_sequenceres information when reporting a
NFS4ERR_RETRY_UNCACHED_REP error, since that will apply to the
next operation, not to the CB_SEQUENCE itself.
Reported-by: NKinglong Mee <kinglongmee@gmail.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

e5003b2f

25 1月, 2016 5 次提交

NFSv4.x: Allow multiple callbacks in flight · 810d82e6

由 Trond Myklebust 提交于 1月 23, 2016

Hook the callback channel into the same session management machinery
as we use for the forward channel.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

810d82e6

NFSv4.x: Fix wraparound issues when validing the callback sequence id · 5f83d86c

由 Trond Myklebust 提交于 1月 23, 2016

We need to make sure that we don't allow args->csa_sequenceid == 0.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

5f83d86c

NFSv4.x: Enforce the ca_maxresponsesize_cached on the back channel · 80f96427

由 Trond Myklebust 提交于 1月 23, 2016

We have no duplicate reply cache, so we always set the back channel
ca_maxresponsesize_cached to zero when negotiating the session.
That means we should always error out as soon as we see the server
set args->csa_cachethis.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

80f96427

NFSv4.x: CB_SEQUENCE should return NFS4ERR_DELAY if still executing · f74a834a

由 Trond Myklebust 提交于 1月 23, 2016

See RFC5661 Section 2.10.6.2: if retrying a request, and the old one is
still in progress, we must return NFS4ERR_DELAY as the reply to sequence.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

f74a834a

NFSv4.x: Remove hard coded slotids in callback channel · f4f58ed1

由 Trond Myklebust 提交于 1月 23, 2016

Instead, use the values encoded in the slot table itself.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

f4f58ed1

05 1月, 2016 1 次提交

NFSv4.1/pNFS: Fix a race in initiate_file_draining() · 4b0934ba

由 Trond Myklebust 提交于 1月 04, 2016

Peng Tao points out that the call to pnfs_mark_matching_lsegs_return()
could race with pnfs_put_lseg(), in which case the layout segment is
cleared, but no layoutreturn will be sent.
Fix is to replace the call to pnfs_mark_matching_lsegs_invalid().
Reported-by: NPeng Tao <tao.peng@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

4b0934ba

01 1月, 2016 1 次提交

NFSv4.1/pNFS: Don't queue up a new commit if the layout segment is invalid · b20135d0

由 Trond Myklebust 提交于 12月 31, 2015

If the layout segment is invalid, then we should not be adding more
write requests to the commit list. Instead, those writes should be
replayed after requesting a new layout.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

b20135d0

29 12月, 2015 5 次提交

T
NFSv4: List stateid information in the callback tracepoints · e07db907
由 Trond Myklebust 提交于 12月 28, 2015
```
The stateid is extremely valuable when debugging.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
e07db907

NFSv4.1/pNFS: Don't return NFS4ERR_DELAY unnecessarily in CB_LAYOUTRECALL · e0d92430

由 Trond Myklebust 提交于 12月 28, 2015

If the client is promising to return the layout ASAP, then there is no
need to return DELAY and have the server retry. Instead default to the
normal procedure described in RFC5661.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

e0d92430

NFSv4.1/pNFS: Ensure we enforce RFC5661 Section 12.5.5.2.1 · 41c9127d

由 Trond Myklebust 提交于 12月 28, 2015

The RFC requires us to check if the server is recalling a stateid that we
haven't yet received. If so, tell it to wait.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

41c9127d

pNFS: If we have to delay the layout callback, mark the layout for return · fc7ff367

由 Trond Myklebust 提交于 12月 28, 2015

If the client needs to delay the layout callback, then speed up the recall
process by marking the remaining layout segments to be actively returned
by the client.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

fc7ff367

NFSv4.1/pNFS: Add a helper to mark the layout as returned · 0654cc72

由 Trond Myklebust 提交于 12月 28, 2015

This ensures that we don't reuse the stateid if a layout return or
implied layout return means that we've returned all layout segments
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

0654cc72

22 10月, 2015 1 次提交

NFS: Remove unneeded NFS_DEBUG checking before define NFSDBG_FACILITY · 39de493e

由 Kinglong Mee 提交于 9月 24, 2015

It's not needed to checking NFS_DEBUG before define NFSDBG_FACILITY, remove it.
Signed-off-by: NKinglong Mee <kinglongmee@gmail.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

39de493e

26 8月, 2015 2 次提交
- T
  NFSv4: Add a tracepoint for CB_LAYOUTRECALL · 249b2eef
  由 Trond Myklebust 提交于 8月 20, 2015
```
Only support for single file layoutrecall for now.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  249b2eef
- T
  NFSv4: Add a tracepoint for CB_GETATTR · 7cd14861
  由 Trond Myklebust 提交于 8月 20, 2015
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  7cd14861
18 8月, 2015 1 次提交

NFS: Remove nfs41_server_notify_{target|highest}_slotid_update() · 3f10a6af

由 Anna Schumaker 提交于 7月 13, 2015

All these functions do is call nfs41_ping_server() without adding
anything.  Let's remove them and give nfs41_ping_server() a better name
instead.
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

3f10a6af

12 6月, 2015 3 次提交

NFS: Ensure that we update the sequence id under the slot table lock · 4e54ab8d

由 Trond Myklebust 提交于 6月 11, 2015

Fix a callback slot table regression.

Fixes: e937ee71 ("nfs: Only update callback sequnce id when CB_SEQUENCE success")
Cc: Kinglong Mee <kinglongmee@gmail.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

4e54ab8d

nfs: Initialize cb_sequenceres information before validate_seqid() · 0579c8d2

由 Kinglong Mee 提交于 6月 02, 2015

For a cb_layoutrecall replay, nfsd got CB_SEQUENCE status of zero,
but all informations of cb_sequenceres are zero too !!!

validate_seqid() return NFS4ERR_RETRY_UNCACHED_REP for a replay,
and skip the initlize cb_sequenceres.
Signed-off-by: NKinglong Mee <kinglongmee@gmail.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

0579c8d2

nfs: Only update callback sequnce id when CB_SEQUENCE success · e937ee71

由 Kinglong Mee 提交于 6月 02, 2015

When testing pnfs layout, nfsd got error NFS4ERR_SEQ_MISORDERED.
It is caused by nfs return NFS4ERR_DELAY before validate_seqid(),
don't update the sequnce id, but nfsd updates the sequnce id !!!

According to RFC5661 20.9.3,
" If CB_SEQUENCE returns an error, then the state of the slot
  (sequence ID, cached reply) MUST NOT change. "
Signed-off-by: NKinglong Mee <kinglongmee@gmail.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

e937ee71

19 2月, 2015 1 次提交

NFSv4.1: Don't set up a backchannel if the server didn't agree to do so · b1c0df5f

由 Trond Myklebust 提交于 2月 18, 2015

If the server doesn't agree to out backchannel setup request, then
don't set one up.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

b1c0df5f

25 11月, 2014 1 次提交

NFS: fix subtle change in COMMIT behavior · cb1410c7

由 Weston Andros Adamson 提交于 11月 12, 2014

Recent work in the pgio layer made it possible for there to be more than one
request per page. This caused a subtle change in commit behavior, because
write.c:nfs_commit_unstable_pages compares the number of *pages* waiting for
writeback against the number of requests on a commit list to choose when to
send a COMMIT in a non-blocking flush.

This is probably hard to hit in normal operation - you have to be using
rsize/wsize < PAGE_SIZE, or pnfs with lots of boundaries that are not page
aligned to have a noticeable change in behavior.
Signed-off-by: NWeston Andros Adamson <dros@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

cb1410c7

13 9月, 2014 1 次提交

pnfs: enable CB_NOTIFY_DEVICEID support · 84c9dee3

由 Christoph Hellwig 提交于 9月 10, 2014

This code has been around for a while, but never was enabled, although
it is in a working shape.

Note that we implement NOTIFY_DEVICEID4_CHANGE identical to
NOTIFY_DEVICEID4_DELETE.  Given that in either case we can't do anything
but preventing further lookups of a given device ID there isn't much difference
in semantics for the two.  For the delete case the server MUST ensure that
there are no outstanding layouts, while for the change case it doesn't, but
that has little relevance to the client.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

84c9dee3

11 9月, 2014 2 次提交

pnfs: add return_range method · c88953d8

由 Christoph Hellwig 提交于 9月 10, 2014

If a layout driver keeps per-inode state outside of the layout segments it
needs to be notified of any layout returns or recalls on an inode, and not
just about the freeing of layout segments. Add a method to acomplish this,
which will allow the block layout driver to handle the case of truncated
and re-expanded files properly.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

c88953d8

pnfs: force a layout commit when encountering busy segments during recall · 7c5d1875

由 Christoph Hellwig 提交于 9月 10, 2014

Expedite layout recall processing by forcing a layout commit when
we see busy segments.  Without it the layout recall might have to wait
until the VM decided to start writeback for the file, which can introduce
long delays.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

7c5d1875

20 2月, 2014 2 次提交

NFSv4.1: Minor optimisation in get_layout_by_fh_locked() · 9a7fe9e8

由 Trond Myklebust 提交于 2月 12, 2014

If the filehandles match, but the igrab() fails, or the layout is
freed before we can get it, then just return NULL.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

9a7fe9e8

NFSv4.1: Ensure that the layout recall callback matches layout stateids · 27999f25

由 Trond Myklebust 提交于 2月 12, 2014

It is not sufficient to compare filehandles when we receive a layout
recall from the server; we also need to check that the layout stateids
match.
Reported-by: Nshaobingqing <shaobingqing@bwstor.com.cn>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

27999f25

04 9月, 2013 1 次提交

NFS: When displaying session slot numbers, use "%u" consistently · e8d92382

由 Chuck Lever 提交于 8月 09, 2013

Clean up, since slot and sequence numbers are all unsigned anyway.

Among other things, squelch compiler warnings:

linux/fs/nfs/nfs4proc.c: In function ‘nfs4_setup_sequence’:
linux/fs/nfs/nfs4proc.c:703:2: warning: signed and unsigned type in
	conditional expression [-Wsign-compare]

and

linux/fs/nfs/nfs4session.c: In function ‘nfs4_alloc_slot’:
linux/fs/nfs/nfs4session.c:151:31: warning: signed and unsigned type in
	conditional expression [-Wsign-compare]
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

e8d92382

22 8月, 2013 2 次提交

NFSv4.1: Add tracepoints for debugging slot table operations · 2f92ae34

由 Trond Myklebust 提交于 8月 14, 2013

Add tracepoints to nfs41_setup_sequence and nfs41_sequence_done
to track session and slot table state changes.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

2f92ae34

NFSv4: Add tracepoints for debugging delegations · ca8acf8d

由 Trond Myklebust 提交于 8月 13, 2013

Set up tracepoints to track when delegations are set, reclaimed,
returned by the client, or recalled by the server.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

ca8acf8d

09 6月, 2013 1 次提交

NFS: Make callbacks minor version generic · 459de2ed

由 Bryan Schumaker 提交于 6月 05, 2013

I found a few places that hardcode the minor version number rather than
making it dependent on the protocol the callback came in over. This
patch makes it easier to add new minor versions in the future.
Signed-off-by: NBryan Schumaker <bjschuma@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

459de2ed

21 5月, 2013 1 次提交

NFSv4.1 Fix a pNFS session draining deadlock · 774d5f14

由 Andy Adamson 提交于 5月 20, 2013

On a CB_RECALL the callback service thread flushes the inode using
filemap_flush prior to scheduling the state manager thread to return the
delegation. When pNFS is used and I/O has not yet gone to the data server
servicing the inode, a LAYOUTGET can preceed the I/O. Unlike the async
filemap_flush call, the LAYOUTGET must proceed to completion.

If the state manager starts to recover data while the inode flush is sending
the LAYOUTGET, a deadlock occurs as the callback service thread holds the
single callback session slot until the flushing is done which blocks the state
manager thread, and the state manager thread has set the session draining bit
which puts the inode flush LAYOUTGET RPC to sleep on the forechannel slot
table waitq.

Separate the draining of the back channel from the draining of the fore channel
by moving the NFS4_SESSION_DRAINING bit from session scope into the fore
and back slot tables. Drain the back channel first allowing the LAYOUTGET
call to proceed (and fail) so the callback service thread frees the callback
slot. Then proceed with draining the forechannel.
Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

774d5f14

06 4月, 2013 1 次提交
- T
  NFSv4: Fix CB_RECALL_ANY to only return delegations that are not in use · 826e0013
  由 Trond Myklebust 提交于 4月 03, 2013
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
  826e0013
15 2月, 2013 1 次提交

NFSv4.1: Fix bulk recall and destroy of layouts · fd9a8d71

由 Trond Myklebust 提交于 2月 12, 2013

The current code in pnfs_destroy_all_layouts() assumes that removing
the layout from the server->layouts list is sufficient to make it
invisible to other processes. This ignores the fact that most
users access the layout through the nfs_inode->layout...
There is further breakage due to lack of reference counting of the
layouts, meaning that the whole thing Oopses at the drop of a hat.

The code in initiate_bulk_draining() is almost correct, and can be
used as a model for pnfs_destroy_all_layouts(), so move that
code to pnfs.c, and refactor the code to allow us to choose between
a single filesystem bulk recall, and a recall of all layouts.
Also note that initiate_bulk_draining() currently calls iput() while
holding locks. Fix that too.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
Cc: stable@vger.kernel.org

fd9a8d71

06 1月, 2013 1 次提交

nfs: avoid dereferencing null pointer in initiate_bulk_draining · ecf0eb9e

由 Nickolai Zeldovich 提交于 1月 05, 2013

Fix an inverted null pointer check in initiate_bulk_draining().
Signed-off-by: NNickolai Zeldovich <nickolai@csail.mit.edu>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
Cc: stable@vger.kernel.org [>= 3.7]

ecf0eb9e

06 12月, 2012 3 次提交

NFSv4.1: Cleanup move session slot management to fs/nfs/nfs4session.c · 73e39aaa

由 Trond Myklebust 提交于 11月 26, 2012

NFSv4.1 session management is getting complex enough to deserve
a separate file.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

73e39aaa

NFSv4.1: CB_RECALL_SLOT must schedule a sequence op after updating targets · ac074835

由 Trond Myklebust 提交于 11月 21, 2012

RFC5661 requires us to make sure that the server knows we've updated
our slot table size by sending at least one SEQUENCE op containing the
new 'highest_slotid' value.
We can do so using the 'CHECK_LEASE' functionality of the state
manager.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

ac074835

NFSv4.1: Remove the state manager code to resize the slot table · afa29610

由 Trond Myklebust 提交于 11月 20, 2012

The state manager no longer needs any special machinery to stop the
session flow and resize the slot table. It is all done on the fly by
the SEQUENCE op code now.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

afa29610

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功