提交 · 9a0fe86745b8e95f7ea39933a956f5771332c430 · openanolis / cloud-kernel

20 8月, 2016 1 次提交

pNFS: Handle NFS4ERR_OLD_STATEID correctly in LAYOUTSTAT calls · 9a0fe867

由 Trond Myklebust 提交于 8月 19, 2016

We normally want to update the stateid and then retry,
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

9a0fe867

25 7月, 2016 12 次提交

pNFS: Remove redundant smp_mb() from pnfs_init_lseg() · 01d7b29f

由 Trond Myklebust 提交于 7月 24, 2016

It's not visible yet, and won't be until after we grab the inode->i_lock.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

01d7b29f

pNFS: Cleanup - do layout segment initialisation in one place · 119cef97

由 Trond Myklebust 提交于 7月 24, 2016

...instead of splitting the initialisation over init_lseg() and
pnfs_layout_process().
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

119cef97

pNFS: Remove redundant stateid invalidation · 28c1acff

由 Trond Myklebust 提交于 7月 21, 2016

The layout stateid will be invalidated once it holds no more layout
segments anyway.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

28c1acff

pNFS: Remove redundant pnfs_mark_layout_returned_if_empty() · f71dfe8f

由 Trond Myklebust 提交于 7月 24, 2016

That's already being taken care of in pnfs_layout_remove_lseg().
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

f71dfe8f

pNFS: Clear the layout metadata if the server changed the layout stateid · d9b61708

由 Trond Myklebust 提交于 7月 24, 2016

If the server changed the layout stateid's "other" field, then
we should treat the old layout as being completely gone. In that
case, we want to clear the metadata such as scheduled layoutreturns.

Do this by calling pnfs_mark_layout_stateid_invalid().
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

d9b61708

pNFS: Cleanup - don't open code pnfs_mark_layout_stateid_invalid() · 5f46be04

由 Trond Myklebust 提交于 7月 22, 2016

Ensure nfs42_layoutstat_done() layoutget don't open code layout stateid
invalidation.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

5f46be04

NFS: pnfs_mark_matching_lsegs_return() should match the layout sequence id · e036f464

由 Trond Myklebust 提交于 7月 22, 2016

When determining which layout segments to return, we do want
pnfs_mark_matching_lsegs_return to check that they match the layout
sequence id. This ensures that we don't waste time if the server
is replaying a layout recall that has already been satisfied.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

e036f464

pNFS: Do not set plh_return_seq for non-callback related layoutreturns · 2d6cf5ab

由 Trond Myklebust 提交于 7月 21, 2016

In cases where we need to send a layoutreturn in order to propagate
an error, we should not tie that to a specific layout stateid.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2d6cf5ab

pNFS: Ensure layoutreturn acts as a completion for layout callbacks · e5fd1904

由 Trond Myklebust 提交于 7月 21, 2016

When we return NFS_OK to the CB_LAYOUTRECALL, we are required to
send a layoutreturn that "completes" that layout recall request, using
the correct stateid.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

e5fd1904

pNFS: Always update the layout barrier seqid on LAYOUTGET · ecebb80b

由 Trond Myklebust 提交于 7月 24, 2016

Currently, pnfs_set_layout_stateid() will update the layout sequence
id barrier only if the stateid itself is newer than the current
layout stateid. However in a situation where multiple LAYOUTGET calls
and a LAYOUTRETURN raced, it is entirely possible for one of the
LAYOUTGET to set the current stateid to something newer than the
LAYOUTRETURN that needs to set the barrier.

The fix is to allow the "update_barrier" flag to force a check as to
whether or not the barrier needs to be updated.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

ecebb80b

pNFS: Always update the layout stateid if NFS_LAYOUT_INVALID_STID is set · 13bede18

由 Trond Myklebust 提交于 7月 24, 2016

If the layout stateid is invalid, then pnfs_set_layout_stateid() must
always initialise it.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

13bede18

pNFS: Clear the layout return tracking on layout reinitialisation · 8e0acf90

由 Trond Myklebust 提交于 7月 21, 2016

Ensure that we don't carry over layoutreturn info from a previous
incarnation of this layout.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

8e0acf90

20 7月, 2016 3 次提交

pNFS: Handle NFS4ERR_RECALLCONFLICT correctly in LAYOUTGET · 66b53f32

由 Trond Myklebust 提交于 7月 14, 2016

Instead of giving up altogether and falling back to doing I/O
through the MDS, which may make the situation worse, wait for
2 lease periods for the callback to resolve itself, and then
try destroying the existing layout.

Only if this was an attempt at getting a first layout, do we
give up altogether, as the server is clearly crazy.

Fixes: 183d9e7b ("pnfs: rework LAYOUTGET retry handling")
Cc: stable@vger.kernel.org # 4.7
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Reviewed-by: NJeff Layton <jlayton@redhat.com>

66b53f32

pNFS: Separate handling of NFS4ERR_LAYOUTTRYLATER and RECALLCONFLICT · e85d7ee4

由 Trond Myklebust 提交于 7月 14, 2016

They are not the same error, and need to be handled differently.

Fixes: 183d9e7b ("pnfs: rework LAYOUTGET retry handling")
Cc: stable@vger.kernel.org # 4.7
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Reviewed-by: NJeff Layton <jlayton@redhat.com>

e85d7ee4

pNFS: Fix post-layoutget error handling in pnfs_update_layout() · 56b38a1f

由 Trond Myklebust 提交于 7月 14, 2016

The non-retry error path is currently broken and ends up releasing the
reference to the layout twice. It also can end up clearing the
NFS_LAYOUT_FIRST_LAYOUTGET flag twice, causing a race.

In addition, the retry path will fail to decrement the plh_outstanding
counter.

Fixes: 183d9e7b ("pnfs: rework LAYOUTGET retry handling")
Cc: stable@vger.kernel.org # 4.7
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Reviewed-by: NJeff Layton <jlayton@redhat.com>

56b38a1f

06 7月, 2016 1 次提交

pNFS: Files and flexfiles always need to commit before layoutcommit · 2e18d4d8

由 Trond Myklebust 提交于 6月 26, 2016

So ensure that we mark the layout for commit once the write is done,
and then ensure that the commit to ds is finished before sending
layoutcommit.

Note that by doing this, we're able to optimise away the commit
for the case of servers that don't need layoutcommit in order to
return updated attributes.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2e18d4d8

25 6月, 2016 3 次提交

NFSv4.1/pnfs: Mark the layout stateid invalid when all segments are removed · 2d148c7e

由 Trond Myklebust 提交于 6月 17, 2016

According to RFC5661, section 12.5.3. the layout stateid is no longer
valid once the client no longer holds any layout segments. Ensure that
we mark it invalid.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

2d148c7e

NFSv4.1/pnfs: Add sparse lock annotations for pnfs_find_alloc_layout · e5241e43

由 Trond Myklebust 提交于 6月 17, 2016

Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Reviewed-by: NJeff Layton <jlayton@poochiereds.net>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

e5241e43

NFSv4.1/pnfs: Layout stateids start out as being invalid · 67a3b721

由 Trond Myklebust 提交于 6月 17, 2016

Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Reviewed-by: NJeff Layton <jlayton@poochiereds.net>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

67a3b721

26 5月, 2016 1 次提交

pnfs: pnfs_update_layout needs to consider if strict iomode checking is on · c7d73af2

由 Tom Haynes 提交于 5月 25, 2016

As flexfiles has FF_FLAGS_NO_READ_IO, there is a need to generically
support enforcing that a IOMODE_RW segment will not allow READ I/O.
Signed-off-by: NTom Haynes <loghyr@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

c7d73af2

18 5月, 2016 8 次提交

pnfs: make pnfs_layout_process more robust · 1b3c6d07

由 Jeff Layton 提交于 5月 17, 2016

It can return NULL if layoutgets are blocked currently. Fix it to return
-EAGAIN in that case, so we can properly handle it in pnfs_update_layout.

Also, clean up and simplify the error handling -- eliminate "status" and
just use "lseg".
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

1b3c6d07

pnfs: rework LAYOUTGET retry handling · 183d9e7b

由 Jeff Layton 提交于 5月 17, 2016

There are several problems in the way a stateid is selected for a
LAYOUTGET operation:

We pick a stateid to use in the RPC prepare op, but that makes
it difficult to serialize LAYOUTGETs that use the open stateid. That
serialization is done in pnfs_update_layout, which occurs well before
the rpc_prepare operation.

Between those two events, the i_lock is dropped and reacquired.
pnfs_update_layout can find that the list has lsegs in it and not do any
serialization, but then later pnfs_choose_layoutget_stateid ends up
choosing the open stateid.

This patch changes the client to select the stateid to use in the
LAYOUTGET earlier, when we're searching for a usable layout segment.
This way we can do it all while holding the i_lock the first time, and
ensure that we serialize any LAYOUTGET call that uses a non-layout
stateid.

This also means a rework of how LAYOUTGET replies are handled, as we
must now get the latest stateid if we want to retransmit in response
to a retryable error.

Most of those errors boil down to the fact that the layout state has
changed in some fashion. Thus, what we really want to do is to re-search
for a layout when it fails with a retryable error, so that we can avoid
reissuing the RPC at all if possible.

While the LAYOUTGET RPC is async, the initiating thread always waits for
it to complete, so it's effectively synchronous anyway. Currently, when
we need to retry a LAYOUTGET because of an error, we drive that retry
via the rpc state machine.

This means that once the call has been submitted, it runs until it
completes. So, we must move the error handling for this RPC out of the
rpc_call_done operation and into the caller.

In order to handle errors like NFS4ERR_DELAY properly, we must also
pass a pointer to the sliding timeout, which is now moved to the stack
in pnfs_update_layout.

The complicating errors are -NFS4ERR_RECALLCONFLICT and
-NFS4ERR_LAYOUTTRYLATER, as those involve a timeout after which we give
up and return NULL back to the caller. So, there is some special
handling for those errors to ensure that the layers driving the retries
can handle that appropriately.
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

183d9e7b

pnfs: lift retry logic from send_layoutget to pnfs_update_layout · 83026d80

由 Jeff Layton 提交于 5月 17, 2016

If we get back something like NFS4ERR_OLD_STATEID, that will be
translated into -EAGAIN, and the do/while loop in send_layoutget
will drive the call again.

This is not quite what we want, I think. An error like that is a
sign that something has changed. That something could have been a
concurrent LAYOUTGET that would give us a usable lseg.

Lift the retry logic into pnfs_update_layout instead. That allows
us to redo the layout search, and may spare us from having to issue
an RPC.
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

83026d80

pnfs: fix bad error handling in send_layoutget · d03ab29d

由 Jeff Layton 提交于 5月 17, 2016

Currently, the code will clear the fail bit if we get back a fatal
error. I don't think that's correct -- we want to clear that bit
if we do not get a fatal error.

Fixes: 0bcbf039 (nfs: handle request add failure properly)
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

d03ab29d

pnfs: only tear down lsegs that precede seqid in LAYOUTRETURN args · 6d597e17

由 Jeff Layton 提交于 5月 17, 2016

LAYOUTRETURN is "special" in that servers and clients are expected to
work with old stateids. When the client sends a LAYOUTRETURN with an old
stateid in it then the server is expected to only tear down layout
segments that were present when that seqid was current. Ensure that the
client handles its accounting accordingly.
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

6d597e17

pnfs: keep track of the return sequence number in pnfs_layout_hdr · 3982a6a2

由 Jeff Layton 提交于 5月 17, 2016

When we want to selectively do a LAYOUTRETURN, we need to specify a
stateid that represents most recent layout acquisition that is to be
returned.

When we mark a layout stateid to be returned, we update the return
sequence number in the layout header with that value, if it's newer
than the existing one. Then, when we go to do a LAYOUTRETURN on
layout header put, we overwrite the seqid in the stateid with the
saved one, and then zero it out.
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

3982a6a2

pnfs: record sequence in pnfs_layout_segment when it's created · 66755283

由 Jeff Layton 提交于 5月 17, 2016

In later patches, we're going to teach the client to be more selective
about how it returns layouts. This means keeping a record of what the
stateid's seqid was at the time that the server handed out a layout
segment.
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

66755283

pNFS: Fix a leaked layoutstats flag · f538d0ba

由 Trond Myklebust 提交于 5月 16, 2016

Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

f538d0ba

09 5月, 2016 1 次提交

pnfs: set NFS_IOHDR_REDO in pnfs_read_resend_pnfs · 1b1bc66b

由 Weston Andros Adamson 提交于 4月 01, 2016

Like other resend paths, mark the (old) hdr as NFS_IOHDR_REDO. This
ensures the hdr completion function will not count the (old) hdr
as good bytes.

Also, vector the error back through the hdr->task.tk_status like other
retry calls.

This fixes a bug with the FlexFiles layout where libaio was reporting more
bytes read than requested.
Signed-off-by: NWeston Andros Adamson <dros@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

1b1bc66b

05 4月, 2016 1 次提交

mm, fs: get rid of PAGE_CACHE_* and page_cache_{get,release} macros · 09cbfeaf

由 Kirill A. Shutemov 提交于 4月 01, 2016

PAGE_CACHE_{SIZE,SHIFT,MASK,ALIGN} macros were introduced *long* time
ago with promise that one day it will be possible to implement page
cache with bigger chunks than PAGE_SIZE.

This promise never materialized.  And unlikely will.

We have many places where PAGE_CACHE_SIZE assumed to be equal to
PAGE_SIZE.  And it's constant source of confusion on whether
PAGE_CACHE_* or PAGE_* constant should be used in a particular case,
especially on the border between fs and mm.

Global switching to PAGE_CACHE_SIZE != PAGE_SIZE would cause to much
breakage to be doable.

Let's stop pretending that pages in page cache are special.  They are
not.

The changes are pretty straight-forward:

 - <foo> << (PAGE_CACHE_SHIFT - PAGE_SHIFT) -> <foo>;

 - <foo> >> (PAGE_CACHE_SHIFT - PAGE_SHIFT) -> <foo>;

 - PAGE_CACHE_{SIZE,SHIFT,MASK,ALIGN} -> PAGE_{SIZE,SHIFT,MASK,ALIGN};

 - page_cache_get() -> get_page();

 - page_cache_release() -> put_page();

This patch contains automated changes generated with coccinelle using
script below.  For some reason, coccinelle doesn't patch header files.
I've called spatch for them manually.

The only adjustment after coccinelle is revert of changes to
PAGE_CAHCE_ALIGN definition: we are going to drop it later.

There are few places in the code where coccinelle didn't reach.  I'll
fix them manually in a separate patch.  Comments and documentation also
will be addressed with the separate patch.

virtual patch

@@
expression E;
@@
- E << (PAGE_CACHE_SHIFT - PAGE_SHIFT)
+ E

@@
expression E;
@@
- E >> (PAGE_CACHE_SHIFT - PAGE_SHIFT)
+ E

@@
@@
- PAGE_CACHE_SHIFT
+ PAGE_SHIFT

@@
@@
- PAGE_CACHE_SIZE
+ PAGE_SIZE

@@
@@
- PAGE_CACHE_MASK
+ PAGE_MASK

@@
expression E;
@@
- PAGE_CACHE_ALIGN(E)
+ PAGE_ALIGN(E)

@@
expression E;
@@
- page_cache_get(E)
+ get_page(E)

@@
expression E;
@@
- page_cache_release(E)
+ put_page(E)
Signed-off-by: NKirill A. Shutemov <kirill.shutemov@linux.intel.com>
Acked-by: NMichal Hocko <mhocko@suse.com>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

09cbfeaf

23 2月, 2016 2 次提交

NFSv4.x/pnfs: Fix a race between layoutget and bulk recalls · 9fd4b9fc

由 Trond Myklebust 提交于 2月 22, 2016

Replace another case where the layout 'plh_block_lgets' can trigger
infinite loops in send_layoutget().
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

9fd4b9fc

NFSv4.x/pnfs: Fix a race between layoutget and pnfs_destroy_layout · 2454dfea

由 Trond Myklebust 提交于 2月 22, 2016

If the server reboots while there is a layoutget outstanding, then
the call to pnfs_choose_layoutget_stateid() will fail with an EAGAIN
error, which causes an infinite loop in send_layoutget(). The reason
why we never break out of the loop is that the layout 'plh_block_lgets'
field is never cleared.

Fix is to replace plh_block_lgets with NFS_LAYOUT_INVALID_STID, which
can be reset after a new layoutget.

Fixes: ab7d763e ("pNFS: Ensure nfs4_layoutget_prepare returns...")
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2454dfea

16 2月, 2016 2 次提交

pNFS: Always set NFS_LAYOUT_RETURN_REQUESTED with lo->plh_return_iomode · e0fa0d01

由 Trond Myklebust 提交于 2月 15, 2016

When setting the layout return mode, we must always also set the
NFS_LAYOUT_RETURN_REQUESTED flag to ensure that we send a layoutreturn.
Otherwise pnfs_error_mark_layout_for_return() could set the mode, but
fail to send the layoutreturn because another is already in flight.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

e0fa0d01

pNFS: Fix pnfs_mark_matching_lsegs_return() · 2f215968

由 Trond Myklebust 提交于 2月 15, 2016

We don't need to schedule a layoutreturn if the layout segment can
be freed immediately.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2f215968

28 1月, 2016 1 次提交

NFS: Cleanup - rename NFS_LAYOUT_RETURN_BEFORE_CLOSE · 2370abda

由 Trond Myklebust 提交于 1月 27, 2016

NFS_LAYOUT_RETURN_BEFORE_CLOSE is being used to signal that a
layoutreturn is needed, either due to a layout recall or to a
layout error. Rename it to NFS_LAYOUT_RETURN_REQUESTED in order
to clarify its purpose.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2370abda

27 1月, 2016 1 次提交

pNFS: Fix missing layoutreturn calls · 13c13a6a

由 Trond Myklebust 提交于 1月 26, 2016

The layoutreturn code currently relies on pnfs_put_lseg() to initiate the
RPC call when conditions are right. A problem arises when we want to
free the layout segment from inside an inode->i_lock section (e.g. in
pnfs_clear_request_commit()), since we cannot sleep.

The workaround is to move the actual call to pnfs_send_layoutreturn()
to pnfs_put_layout_hdr(), which doesn't have this restriction.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

13c13a6a

05 1月, 2016 3 次提交
- T
  NFSv4.1/pNFS: Cleanup constify struct pnfs_layout_range arguments · 506c0d68
  由 Trond Myklebust 提交于 1月 04, 2016
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  506c0d68
- T
  NFSv4.1/pnfs: Cleanup copying of pnfs_layout_range structures · e144e539
  由 Trond Myklebust 提交于 1月 04, 2016
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  e144e539
- T
  NFSv4.1/pNFS: Cleanup pnfs_mark_matching_lsegs_invalid() · 71b39854
  由 Trond Myklebust 提交于 1月 04, 2016
```
Make it more obvious what we're returning...
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  71b39854

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功