提交 · e036f46453f252539cb62bf91d82c3d08e37e73c · openanolis / cloud-kernel

25 7月, 2016 6 次提交

NFS: pnfs_mark_matching_lsegs_return() should match the layout sequence id · e036f464

由 Trond Myklebust 提交于 7月 22, 2016

When determining which layout segments to return, we do want
pnfs_mark_matching_lsegs_return to check that they match the layout
sequence id. This ensures that we don't waste time if the server
is replaying a layout recall that has already been satisfied.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

e036f464

pNFS: Do not set plh_return_seq for non-callback related layoutreturns · 2d6cf5ab

由 Trond Myklebust 提交于 7月 21, 2016

In cases where we need to send a layoutreturn in order to propagate
an error, we should not tie that to a specific layout stateid.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2d6cf5ab

pNFS: Ensure layoutreturn acts as a completion for layout callbacks · e5fd1904

由 Trond Myklebust 提交于 7月 21, 2016

When we return NFS_OK to the CB_LAYOUTRECALL, we are required to
send a layoutreturn that "completes" that layout recall request, using
the correct stateid.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

e5fd1904

pNFS: Always update the layout barrier seqid on LAYOUTGET · ecebb80b

由 Trond Myklebust 提交于 7月 24, 2016

Currently, pnfs_set_layout_stateid() will update the layout sequence
id barrier only if the stateid itself is newer than the current
layout stateid. However in a situation where multiple LAYOUTGET calls
and a LAYOUTRETURN raced, it is entirely possible for one of the
LAYOUTGET to set the current stateid to something newer than the
LAYOUTRETURN that needs to set the barrier.

The fix is to allow the "update_barrier" flag to force a check as to
whether or not the barrier needs to be updated.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

ecebb80b

pNFS: Always update the layout stateid if NFS_LAYOUT_INVALID_STID is set · 13bede18

由 Trond Myklebust 提交于 7月 24, 2016

If the layout stateid is invalid, then pnfs_set_layout_stateid() must
always initialise it.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

13bede18

pNFS: Clear the layout return tracking on layout reinitialisation · 8e0acf90

由 Trond Myklebust 提交于 7月 21, 2016

Ensure that we don't carry over layoutreturn info from a previous
incarnation of this layout.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

8e0acf90

25 6月, 2016 3 次提交

NFSv4.1/pnfs: Mark the layout stateid invalid when all segments are removed · 2d148c7e

由 Trond Myklebust 提交于 6月 17, 2016

According to RFC5661, section 12.5.3. the layout stateid is no longer
valid once the client no longer holds any layout segments. Ensure that
we mark it invalid.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

2d148c7e

NFSv4.1/pnfs: Add sparse lock annotations for pnfs_find_alloc_layout · e5241e43

由 Trond Myklebust 提交于 6月 17, 2016

Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Reviewed-by: NJeff Layton <jlayton@poochiereds.net>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

e5241e43

NFSv4.1/pnfs: Layout stateids start out as being invalid · 67a3b721

由 Trond Myklebust 提交于 6月 17, 2016

Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Reviewed-by: NJeff Layton <jlayton@poochiereds.net>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

67a3b721

26 5月, 2016 1 次提交

pnfs: pnfs_update_layout needs to consider if strict iomode checking is on · c7d73af2

由 Tom Haynes 提交于 5月 25, 2016

As flexfiles has FF_FLAGS_NO_READ_IO, there is a need to generically
support enforcing that a IOMODE_RW segment will not allow READ I/O.
Signed-off-by: NTom Haynes <loghyr@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

c7d73af2

18 5月, 2016 8 次提交

pnfs: make pnfs_layout_process more robust · 1b3c6d07

由 Jeff Layton 提交于 5月 17, 2016

It can return NULL if layoutgets are blocked currently. Fix it to return
-EAGAIN in that case, so we can properly handle it in pnfs_update_layout.

Also, clean up and simplify the error handling -- eliminate "status" and
just use "lseg".
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

1b3c6d07

pnfs: rework LAYOUTGET retry handling · 183d9e7b

由 Jeff Layton 提交于 5月 17, 2016

There are several problems in the way a stateid is selected for a
LAYOUTGET operation:

We pick a stateid to use in the RPC prepare op, but that makes
it difficult to serialize LAYOUTGETs that use the open stateid. That
serialization is done in pnfs_update_layout, which occurs well before
the rpc_prepare operation.

Between those two events, the i_lock is dropped and reacquired.
pnfs_update_layout can find that the list has lsegs in it and not do any
serialization, but then later pnfs_choose_layoutget_stateid ends up
choosing the open stateid.

This patch changes the client to select the stateid to use in the
LAYOUTGET earlier, when we're searching for a usable layout segment.
This way we can do it all while holding the i_lock the first time, and
ensure that we serialize any LAYOUTGET call that uses a non-layout
stateid.

This also means a rework of how LAYOUTGET replies are handled, as we
must now get the latest stateid if we want to retransmit in response
to a retryable error.

Most of those errors boil down to the fact that the layout state has
changed in some fashion. Thus, what we really want to do is to re-search
for a layout when it fails with a retryable error, so that we can avoid
reissuing the RPC at all if possible.

While the LAYOUTGET RPC is async, the initiating thread always waits for
it to complete, so it's effectively synchronous anyway. Currently, when
we need to retry a LAYOUTGET because of an error, we drive that retry
via the rpc state machine.

This means that once the call has been submitted, it runs until it
completes. So, we must move the error handling for this RPC out of the
rpc_call_done operation and into the caller.

In order to handle errors like NFS4ERR_DELAY properly, we must also
pass a pointer to the sliding timeout, which is now moved to the stack
in pnfs_update_layout.

The complicating errors are -NFS4ERR_RECALLCONFLICT and
-NFS4ERR_LAYOUTTRYLATER, as those involve a timeout after which we give
up and return NULL back to the caller. So, there is some special
handling for those errors to ensure that the layers driving the retries
can handle that appropriately.
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

183d9e7b

pnfs: lift retry logic from send_layoutget to pnfs_update_layout · 83026d80

由 Jeff Layton 提交于 5月 17, 2016

If we get back something like NFS4ERR_OLD_STATEID, that will be
translated into -EAGAIN, and the do/while loop in send_layoutget
will drive the call again.

This is not quite what we want, I think. An error like that is a
sign that something has changed. That something could have been a
concurrent LAYOUTGET that would give us a usable lseg.

Lift the retry logic into pnfs_update_layout instead. That allows
us to redo the layout search, and may spare us from having to issue
an RPC.
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

83026d80

pnfs: fix bad error handling in send_layoutget · d03ab29d

由 Jeff Layton 提交于 5月 17, 2016

Currently, the code will clear the fail bit if we get back a fatal
error. I don't think that's correct -- we want to clear that bit
if we do not get a fatal error.

Fixes: 0bcbf039 (nfs: handle request add failure properly)
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

d03ab29d

pnfs: only tear down lsegs that precede seqid in LAYOUTRETURN args · 6d597e17

由 Jeff Layton 提交于 5月 17, 2016

LAYOUTRETURN is "special" in that servers and clients are expected to
work with old stateids. When the client sends a LAYOUTRETURN with an old
stateid in it then the server is expected to only tear down layout
segments that were present when that seqid was current. Ensure that the
client handles its accounting accordingly.
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

6d597e17

pnfs: keep track of the return sequence number in pnfs_layout_hdr · 3982a6a2

由 Jeff Layton 提交于 5月 17, 2016

When we want to selectively do a LAYOUTRETURN, we need to specify a
stateid that represents most recent layout acquisition that is to be
returned.

When we mark a layout stateid to be returned, we update the return
sequence number in the layout header with that value, if it's newer
than the existing one. Then, when we go to do a LAYOUTRETURN on
layout header put, we overwrite the seqid in the stateid with the
saved one, and then zero it out.
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

3982a6a2

pnfs: record sequence in pnfs_layout_segment when it's created · 66755283

由 Jeff Layton 提交于 5月 17, 2016

In later patches, we're going to teach the client to be more selective
about how it returns layouts. This means keeping a record of what the
stateid's seqid was at the time that the server handed out a layout
segment.
Signed-off-by: NJeff Layton <jeff.layton@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

66755283

pNFS: Fix a leaked layoutstats flag · f538d0ba

由 Trond Myklebust 提交于 5月 16, 2016

Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

f538d0ba

09 5月, 2016 1 次提交

pnfs: set NFS_IOHDR_REDO in pnfs_read_resend_pnfs · 1b1bc66b

由 Weston Andros Adamson 提交于 4月 01, 2016

Like other resend paths, mark the (old) hdr as NFS_IOHDR_REDO. This
ensures the hdr completion function will not count the (old) hdr
as good bytes.

Also, vector the error back through the hdr->task.tk_status like other
retry calls.

This fixes a bug with the FlexFiles layout where libaio was reporting more
bytes read than requested.
Signed-off-by: NWeston Andros Adamson <dros@primarydata.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

1b1bc66b

05 4月, 2016 1 次提交

mm, fs: get rid of PAGE_CACHE_* and page_cache_{get,release} macros · 09cbfeaf

由 Kirill A. Shutemov 提交于 4月 01, 2016

PAGE_CACHE_{SIZE,SHIFT,MASK,ALIGN} macros were introduced *long* time
ago with promise that one day it will be possible to implement page
cache with bigger chunks than PAGE_SIZE.

This promise never materialized.  And unlikely will.

We have many places where PAGE_CACHE_SIZE assumed to be equal to
PAGE_SIZE.  And it's constant source of confusion on whether
PAGE_CACHE_* or PAGE_* constant should be used in a particular case,
especially on the border between fs and mm.

Global switching to PAGE_CACHE_SIZE != PAGE_SIZE would cause to much
breakage to be doable.

Let's stop pretending that pages in page cache are special.  They are
not.

The changes are pretty straight-forward:

 - <foo> << (PAGE_CACHE_SHIFT - PAGE_SHIFT) -> <foo>;

 - <foo> >> (PAGE_CACHE_SHIFT - PAGE_SHIFT) -> <foo>;

 - PAGE_CACHE_{SIZE,SHIFT,MASK,ALIGN} -> PAGE_{SIZE,SHIFT,MASK,ALIGN};

 - page_cache_get() -> get_page();

 - page_cache_release() -> put_page();

This patch contains automated changes generated with coccinelle using
script below.  For some reason, coccinelle doesn't patch header files.
I've called spatch for them manually.

The only adjustment after coccinelle is revert of changes to
PAGE_CAHCE_ALIGN definition: we are going to drop it later.

There are few places in the code where coccinelle didn't reach.  I'll
fix them manually in a separate patch.  Comments and documentation also
will be addressed with the separate patch.

virtual patch

@@
expression E;
@@
- E << (PAGE_CACHE_SHIFT - PAGE_SHIFT)
+ E

@@
expression E;
@@
- E >> (PAGE_CACHE_SHIFT - PAGE_SHIFT)
+ E

@@
@@
- PAGE_CACHE_SHIFT
+ PAGE_SHIFT

@@
@@
- PAGE_CACHE_SIZE
+ PAGE_SIZE

@@
@@
- PAGE_CACHE_MASK
+ PAGE_MASK

@@
expression E;
@@
- PAGE_CACHE_ALIGN(E)
+ PAGE_ALIGN(E)

@@
expression E;
@@
- page_cache_get(E)
+ get_page(E)

@@
expression E;
@@
- page_cache_release(E)
+ put_page(E)
Signed-off-by: NKirill A. Shutemov <kirill.shutemov@linux.intel.com>
Acked-by: NMichal Hocko <mhocko@suse.com>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

09cbfeaf

23 2月, 2016 2 次提交

NFSv4.x/pnfs: Fix a race between layoutget and bulk recalls · 9fd4b9fc

由 Trond Myklebust 提交于 2月 22, 2016

Replace another case where the layout 'plh_block_lgets' can trigger
infinite loops in send_layoutget().
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

9fd4b9fc

NFSv4.x/pnfs: Fix a race between layoutget and pnfs_destroy_layout · 2454dfea

由 Trond Myklebust 提交于 2月 22, 2016

If the server reboots while there is a layoutget outstanding, then
the call to pnfs_choose_layoutget_stateid() will fail with an EAGAIN
error, which causes an infinite loop in send_layoutget(). The reason
why we never break out of the loop is that the layout 'plh_block_lgets'
field is never cleared.

Fix is to replace plh_block_lgets with NFS_LAYOUT_INVALID_STID, which
can be reset after a new layoutget.

Fixes: ab7d763e ("pNFS: Ensure nfs4_layoutget_prepare returns...")
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2454dfea

16 2月, 2016 2 次提交

pNFS: Always set NFS_LAYOUT_RETURN_REQUESTED with lo->plh_return_iomode · e0fa0d01

由 Trond Myklebust 提交于 2月 15, 2016

When setting the layout return mode, we must always also set the
NFS_LAYOUT_RETURN_REQUESTED flag to ensure that we send a layoutreturn.
Otherwise pnfs_error_mark_layout_for_return() could set the mode, but
fail to send the layoutreturn because another is already in flight.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

e0fa0d01

pNFS: Fix pnfs_mark_matching_lsegs_return() · 2f215968

由 Trond Myklebust 提交于 2月 15, 2016

We don't need to schedule a layoutreturn if the layout segment can
be freed immediately.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2f215968

28 1月, 2016 1 次提交

NFS: Cleanup - rename NFS_LAYOUT_RETURN_BEFORE_CLOSE · 2370abda

由 Trond Myklebust 提交于 1月 27, 2016

NFS_LAYOUT_RETURN_BEFORE_CLOSE is being used to signal that a
layoutreturn is needed, either due to a layout recall or to a
layout error. Rename it to NFS_LAYOUT_RETURN_REQUESTED in order
to clarify its purpose.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2370abda

27 1月, 2016 1 次提交

pNFS: Fix missing layoutreturn calls · 13c13a6a

由 Trond Myklebust 提交于 1月 26, 2016

The layoutreturn code currently relies on pnfs_put_lseg() to initiate the
RPC call when conditions are right. A problem arises when we want to
free the layout segment from inside an inode->i_lock section (e.g. in
pnfs_clear_request_commit()), since we cannot sleep.

The workaround is to move the actual call to pnfs_send_layoutreturn()
to pnfs_put_layout_hdr(), which doesn't have this restriction.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

13c13a6a

05 1月, 2016 7 次提交
- T
  NFSv4.1/pNFS: Cleanup constify struct pnfs_layout_range arguments · 506c0d68
  由 Trond Myklebust 提交于 1月 04, 2016
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  506c0d68
- T
  NFSv4.1/pnfs: Cleanup copying of pnfs_layout_range structures · e144e539
  由 Trond Myklebust 提交于 1月 04, 2016
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  e144e539
- T
  NFSv4.1/pNFS: Cleanup pnfs_mark_matching_lsegs_invalid() · 71b39854
  由 Trond Myklebust 提交于 1月 04, 2016
```
Make it more obvious what we're returning...
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  71b39854
- T
  NFSv4.1/pNFS: pnfs_error_mark_layout_for_return() must always return layout · 10335556
  由 Trond Myklebust 提交于 1月 04, 2016
```
Fix a bug whereby if all the layout segments could be immediately freed,
the call to pnfs_error_mark_layout_for_return() would never result in
a layoutreturn.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  10335556
- T
  NFSv4.1/pNFS: pnfs_mark_matching_lsegs_return() should set the iomode · 5c97f5de
  由 Trond Myklebust 提交于 1月 04, 2016
```
If pnfs_mark_matching_lsegs_return() needs to mark a layout segment for
return, then it must also set the return iomode.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  5c97f5de
- T
  NFSv4.1/pNFS: Use nfs4_stateid_copy for copying stateids · 50f563ef
  由 Trond Myklebust 提交于 1月 04, 2016
```
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  50f563ef
- T
  NFSv4.1/pNFS: Don't pass stateids by value to pnfs_send_layoutreturn() · ed429d6b
  由 Trond Myklebust 提交于 1月 04, 2016
```
A stateid is a structure, pass it as a pointer.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
```
  ed429d6b
01 1月, 2016 1 次提交

NFSv4.1/pNFS: Don't queue up a new commit if the layout segment is invalid · b20135d0

由 Trond Myklebust 提交于 12月 31, 2015

If the layout segment is invalid, then we should not be adding more
write requests to the commit list. Instead, those writes should be
replayed after requesting a new layout.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

b20135d0

29 12月, 2015 6 次提交

pNFS: If we have to delay the layout callback, mark the layout for return · fc7ff367

由 Trond Myklebust 提交于 12月 28, 2015

If the client needs to delay the layout callback, then speed up the recall
process by marking the remaining layout segments to be actively returned
by the client.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

fc7ff367

NFSv4.1/pNFS: Add a helper to mark the layout as returned · 0654cc72

由 Trond Myklebust 提交于 12月 28, 2015

This ensures that we don't reuse the stateid if a layout return or
implied layout return means that we've returned all layout segments
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

0654cc72

pNFS/flexfiles: Don't mark the entire layout as failed, when returning it · b9fc773e

由 Trond Myklebust 提交于 12月 15, 2015

In pNFS/flexfiles, we want to return the layout without necessarily marking
it as having completely failed. We therefore move the call to
pnfs_layout_io_set_failed() out of pnfs_error_mark_layout_for_return(),
and then ensura that pNFS/files layout calls it separately.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

b9fc773e

pNFS/flexfiles: Don't prevent flexfiles client from retrying LAYOUTGET · 2e5b29f0

由 Trond Myklebust 提交于 12月 14, 2015

Fix a bug in which flexfiles clients are falling back to I/O through the
MDS even when the FF_FLAGS_NO_IO_THRU_MDS flag is set.

The flexfiles client will always report errors through the LAYOUTRETURN
and/or LAYOUTERROR mechanisms, so it should normally be safe for it
to retry the LAYOUTGET until it fails or succeeds.
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2e5b29f0

nfs: handle request add failure properly · 0bcbf039

由 Peng Tao 提交于 12月 05, 2015

When we fail to queue a read page to IO descriptor,
we need to clean it up otherwise it is hanging around
preventing nfs module from being removed.

When we fail to queue a write page to IO descriptor,
we need to clean it up and also save the failure status
to open context. Then at file close, we can try to write
pages back again and drop the page if it fails to writeback
in .launder_page, which will be done in the next patch.
Signed-off-by: NPeng Tao <tao.peng@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

0bcbf039

nfs: centralize pgio error cleanup · 2bff2288

由 Peng Tao 提交于 12月 05, 2015

In case we fail during setting things up for read/write IO, set
pg_error in IO descriptor and do the cleanup in nfs_pageio_add_request,
where we clean up all pages that are still hanging around on the IO
descriptor.
Signed-off-by: NPeng Tao <tao.peng@primarydata.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>

2bff2288

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功