Commits · 6712007734cbd64ff924af16fc236751d47ff80b · openanolis / cloud-kernel

06 Jul, 2016 1 commit

pNFS: pnfs_layoutcommit_outstanding() is no longer used when !CONFIG_NFS_V4_1 · 67120077

Committed by Trond Myklebust on Jul 05, 2016

Cleanup...
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

26 May, 2016 1 commit

pnfs: pnfs_update_layout needs to consider if strict iomode checking is on · c7d73af2

Committed by Tom Haynes on May 25, 2016

As flexfiles has FF_FLAGS_NO_READ_IO, there is a need to generically
support enforcing that an IOMODE_RW segment will not allow READ I/O.
Signed-off-by: Tom Haynes <loghyr@primarydata.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

18 May, 2016 4 commits

pnfs: rework LAYOUTGET retry handling · 183d9e7b

Committed by Jeff Layton on May 17, 2016

There are several problems in the way a stateid is selected for a
LAYOUTGET operation:

We pick a stateid to use in the RPC prepare op, but that makes
it difficult to serialize LAYOUTGETs that use the open stateid. That
serialization is done in pnfs_update_layout, which occurs well before
the rpc_prepare operation.

Between those two events, the i_lock is dropped and reacquired.
pnfs_update_layout can find that the list has lsegs in it and not do any
serialization, but then later pnfs_choose_layoutget_stateid ends up
choosing the open stateid.

This patch changes the client to select the stateid to use in the
LAYOUTGET earlier, when we're searching for a usable layout segment.
This way we can do it all while holding the i_lock the first time, and
ensure that we serialize any LAYOUTGET call that uses a non-layout
stateid.

This also means a rework of how LAYOUTGET replies are handled, as we
must now get the latest stateid if we want to retransmit in response
to a retryable error.

Most of those errors boil down to the fact that the layout state has
changed in some fashion. Thus, what we really want to do is to re-search
for a layout when it fails with a retryable error, so that we can avoid
reissuing the RPC at all if possible.

While the LAYOUTGET RPC is async, the initiating thread always waits for
it to complete, so it's effectively synchronous anyway. Currently, when
we need to retry a LAYOUTGET because of an error, we drive that retry
via the rpc state machine.

This means that once the call has been submitted, it runs until it
completes. So, we must move the error handling for this RPC out of the
rpc_call_done operation and into the caller.

In order to handle errors like NFS4ERR_DELAY properly, we must also
pass a pointer to the sliding timeout, which is now moved to the stack
in pnfs_update_layout.

The complicating errors are -NFS4ERR_RECALLCONFLICT and
-NFS4ERR_LAYOUTTRYLATER, as those involve a timeout after which we give
up and return NULL back to the caller. So, there is some special
handling for those errors to ensure that the layers driving the retries
can handle that appropriately.
Signed-off-by: Jeff Layton <jeff.layton@primarydata.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
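
The caller-driven retry described in this commit message can be illustrated with a small user-space simulation. This is only a sketch under stated assumptions, not the kernel code: `struct sim`, `sim_find_cached()`, `sim_send_layoutget()`, `update_layout_sketch()` and the `LG_*` status names are all hypothetical, and the time-based giving up on NFS4ERR_RECALLCONFLICT / NFS4ERR_LAYOUTTRYLATER is approximated by a simple attempt counter. The point it shows is that every retry first searches for a usable segment again, so the RPC is only reissued when the search still fails.

```c
#include <stddef.h>

/* Hypothetical outcomes of one simulated LAYOUTGET attempt. */
enum lg_status { LG_OK, LG_RETRY, LG_TRYLATER, LG_FATAL };

struct lseg_sketch { int id; };

/* Scripted stand-in for the layout segment list and the RPC layer. */
struct sim {
    const enum lg_status *replies; /* reply to each LAYOUTGET, in order */
    size_t nreplies;
    size_t next;
    int cache_hit_at;   /* search number from which a segment is found; -1 = never */
    int searches;
    int rpcs_sent;
    struct lseg_sketch seg;
};

/* Simulated search of the layout for a usable segment. */
static struct lseg_sketch *sim_find_cached(struct sim *s)
{
    s->searches++;
    if (s->cache_hit_at >= 0 && s->searches >= s->cache_hit_at)
        return &s->seg;
    return NULL;
}

/* Simulated LAYOUTGET: consumes the next scripted reply. */
static enum lg_status sim_send_layoutget(struct sim *s, struct lseg_sketch **out)
{
    enum lg_status st;

    s->rpcs_sent++;
    if (s->next >= s->nreplies)
        return LG_FATAL;            /* script exhausted */
    st = s->replies[s->next++];
    if (st == LG_OK)
        *out = &s->seg;
    return st;
}

/* Retry loop driven by the caller rather than by the RPC state machine. */
struct lseg_sketch *update_layout_sketch(struct sim *s, int max_trylater)
{
    int trylater = 0;

    for (;;) {
        /* Re-search first: the layout state may have changed meanwhile. */
        struct lseg_sketch *lseg = sim_find_cached(s);

        if (lseg)
            return lseg;

        switch (sim_send_layoutget(s, &lseg)) {
        case LG_OK:
            return lseg;
        case LG_RETRY:
            continue;               /* retryable: search again */
        case LG_TRYLATER:
            if (++trylater > max_trylater)
                return NULL;        /* give up, caller falls back */
            continue;
        default:
            return NULL;
        }
    }
}
```

In this sketch a retryable error followed by a successful search returns the segment without sending a second RPC, which is the behaviour the message argues for.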

pnfs: only tear down lsegs that precede seqid in LAYOUTRETURN args · 6d597e17

Committed by Jeff Layton on May 17, 2016

LAYOUTRETURN is "special" in that servers and clients are expected to
work with old stateids. When the client sends a LAYOUTRETURN with an old
stateid in it then the server is expected to only tear down layout
segments that were present when that seqid was current. Ensure that the
client handles its accounting accordingly.
Signed-off-by: Jeff Layton <jeff.layton@primarydata.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
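
The accounting rule above (tear down only the segments that existed when the returned seqid was current) can be sketched as a filter over segments that each remember the seqid under which they were handed out. The names `lseg_sketch`, `seqid_is_newer()` and `invalidate_lsegs_up_to()` are hypothetical, and treating a return seqid of 0 as "no limit" is an assumption of this sketch, not something stated in the commit message.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Wrap-safe "a is newer than b" for 32-bit sequence ids.
 * Relies on the usual two's-complement conversion to int32_t.
 */
static bool seqid_is_newer(uint32_t a, uint32_t b)
{
    return (int32_t)(a - b) > 0;
}

struct lseg_sketch {
    uint32_t seq;   /* stateid seqid when the segment was handed out */
    bool valid;
};

/*
 * Invalidate every valid segment that is not newer than return_seq.
 * Assumption of this sketch: return_seq == 0 means "tear down all".
 * Returns the number of segments torn down.
 */
size_t invalidate_lsegs_up_to(struct lseg_sketch *segs, size_t n,
                              uint32_t return_seq)
{
    size_t torn_down = 0;

    for (size_t i = 0; i < n; i++) {
        if (!segs[i].valid)
            continue;
        if (return_seq != 0 && seqid_is_newer(segs[i].seq, return_seq))
            continue;   /* handed out after the returned seqid: keep it */
        segs[i].valid = false;
        torn_down++;
    }
    return torn_down;
}
```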

pnfs: keep track of the return sequence number in pnfs_layout_hdr · 3982a6a2

Committed by Jeff Layton on May 17, 2016

When we want to selectively do a LAYOUTRETURN, we need to specify a
stateid that represents the most recent layout acquisition that is to be
returned.

When we mark a layout stateid to be returned, we update the return
sequence number in the layout header with that value, if it's newer
than the existing one. Then, when we go to do a LAYOUTRETURN on
layout header put, we overwrite the seqid in the stateid with the
saved one, and then zero it out.
Signed-off-by: Jeff Layton <jeff.layton@primarydata.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
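
A minimal sketch of the bookkeeping this message describes, with hypothetical names (`layout_hdr_sketch`, `mark_for_return()`, `prepare_layoutreturn()`). It assumes that 0 means "no return sequence recorded" and reads "zero it out" as clearing the saved return sequence in the header; both are interpretations made for illustration.

```c
#include <stdbool.h>
#include <stdint.h>

/* Wrap-safe comparison of 32-bit sequence ids. */
static bool seq_newer(uint32_t a, uint32_t b)
{
    return (int32_t)(a - b) > 0;
}

struct stateid_sketch { uint32_t seqid; };

struct layout_hdr_sketch {
    struct stateid_sketch stateid;  /* current layout stateid */
    uint32_t return_seq;            /* 0 = nothing recorded (assumption) */
};

/* Remember the newest seqid that has been marked for return. */
void mark_for_return(struct layout_hdr_sketch *lo, uint32_t seq)
{
    if (lo->return_seq == 0 || seq_newer(seq, lo->return_seq))
        lo->return_seq = seq;
}

/*
 * Build the stateid for a LAYOUTRETURN: copy the current stateid,
 * overwrite its seqid with the saved one, then clear the saved value.
 */
struct stateid_sketch prepare_layoutreturn(struct layout_hdr_sketch *lo)
{
    struct stateid_sketch arg = lo->stateid;

    if (lo->return_seq != 0) {
        arg.seqid = lo->return_seq;
        lo->return_seq = 0;
    }
    return arg;
}
```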

pnfs: record sequence in pnfs_layout_segment when it's created · 66755283

Committed by Jeff Layton on May 17, 2016

In later patches, we're going to teach the client to be more selective
about how it returns layouts. This means keeping a record of what the
stateid's seqid was at the time that the server handed out a layout
segment.
Signed-off-by: Jeff Layton <jeff.layton@primarydata.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

09 May, 2016 1 commit

pnfs: set NFS_IOHDR_REDO in pnfs_read_resend_pnfs · 1b1bc66b

Committed by Weston Andros Adamson on Apr 01, 2016

Like other resend paths, mark the (old) hdr as NFS_IOHDR_REDO. This
ensures the hdr completion function will not count the (old) hdr
as good bytes.

Also, vector the error back through the hdr->task.tk_status like other
retry calls.

This fixes a bug with the FlexFiles layout where libaio was reporting more
bytes read than requested.
Signed-off-by: Weston Andros Adamson <dros@primarydata.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
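
The double counting that this fix addresses can be illustrated with a toy model. The names `io_hdr_sketch`, `IOHDR_REDO`, `resend_read()` and `hdr_completed_bytes()` are hypothetical stand-ins, not the kernel's definitions; the sketch only shows why an old header that is being redone must not contribute to the byte count.

```c
#include <stddef.h>

/* Hypothetical flag bit standing in for NFS_IOHDR_REDO. */
enum { IOHDR_REDO = 1 << 0 };

struct io_hdr_sketch {
    unsigned flags;
    size_t good_bytes;  /* bytes this header believes it transferred */
};

/* Completion: a header marked for redo contributes no bytes. */
size_t hdr_completed_bytes(const struct io_hdr_sketch *hdr)
{
    return (hdr->flags & IOHDR_REDO) ? 0 : hdr->good_bytes;
}

/*
 * Resend path: mark the old header as redone and hand back a fresh
 * header describing the retried I/O.
 */
struct io_hdr_sketch resend_read(struct io_hdr_sketch *old_hdr,
                                 size_t bytes_read_on_retry)
{
    struct io_hdr_sketch fresh = { 0, bytes_read_on_retry };

    old_hdr->flags |= IOHDR_REDO;
    return fresh;
}
```

Without the flag, a 4096-byte read that is resent once would be reported as 8192 bytes in this model, which mirrors the "more bytes read than requested" symptom.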

28 Jan, 2016 1 commit

NFS: Cleanup - rename NFS_LAYOUT_RETURN_BEFORE_CLOSE · 2370abda

Committed by Trond Myklebust on Jan 27, 2016

NFS_LAYOUT_RETURN_BEFORE_CLOSE is being used to signal that a
layoutreturn is needed, either due to a layout recall or to a
layout error. Rename it to NFS_LAYOUT_RETURN_REQUESTED in order
to clarify its purpose.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

05 Jan, 2016 3 commits

NFSv4.1/pNFS: Cleanup constify struct pnfs_layout_range arguments · 506c0d68

Committed by Trond Myklebust on Jan 04, 2016

Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

NFSv4.1/pnfs: Cleanup copying of pnfs_layout_range structures · e144e539

Committed by Trond Myklebust on Jan 04, 2016

Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

NFSv4.1/pNFS: pnfs_error_mark_layout_for_return() must always return layout · 10335556

Committed by Trond Myklebust on Jan 04, 2016

Fix a bug whereby if all the layout segments could be immediately freed,
the call to pnfs_error_mark_layout_for_return() would never result in
a layoutreturn.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

01 Jan, 2016 1 commit

NFSv4.1/pNFS: Don't queue up a new commit if the layout segment is invalid · b20135d0

Committed by Trond Myklebust on Dec 31, 2015

If the layout segment is invalid, then we should not be adding more
write requests to the commit list. Instead, those writes should be
replayed after requesting a new layout.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

29 Dec, 2015 3 commits

pNFS: If we have to delay the layout callback, mark the layout for return · fc7ff367

Committed by Trond Myklebust on Dec 28, 2015

If the client needs to delay the layout callback, then speed up the recall
process by marking the remaining layout segments to be actively returned
by the client.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

NFSv4.1/pNFS: Add a helper to mark the layout as returned · 0654cc72

Committed by Trond Myklebust on Dec 28, 2015

This ensures that we don't reuse the stateid if a layout return or
implied layout return means that we've returned all layout segments.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

pNFS/flexfiles: Don't prevent flexfiles client from retrying LAYOUTGET · 2e5b29f0

Committed by Trond Myklebust on Dec 14, 2015

Fix a bug in which flexfiles clients are falling back to I/O through the
MDS even when the FF_FLAGS_NO_IO_THRU_MDS flag is set.

The flexfiles client will always report errors through the LAYOUTRETURN
and/or LAYOUTERROR mechanisms, so it should normally be safe for it
to retry the LAYOUTGET until it fails or succeeds.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

23 Sep, 2015 1 commit

NFS41: make close wait for layoutreturn · 500d701f

Committed by Peng Tao on Sep 22, 2015

If we send a layoutreturn asynchronously before close, the close
might reach server first and layoutreturn would fail with BADSTATEID
because there is nothing keeping the layout stateid alive.

Also, do not pretend to be sending a layoutreturn if we are not.
Signed-off-by: Peng Tao <tao.peng@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

26 Aug, 2015 5 commits

NFSv4.1/flexfiles: Allow coalescing of new layout segments and existing ones · 0762ed2c

Committed by Trond Myklebust on Aug 25, 2015

In order to ensure atomicity of updates, we merge the old layout segments
into the new ones, and then invalidate the old ones.

Also ensure that we order the list of layout segments so that
RO segments are preferred over RW.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>
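
The ordering rule in this message (read-only segments ahead of read-write ones) can be sketched with a comparator. This is an array-based illustration with hypothetical names (`lseg_sketch`, `lseg_cmp()`, `order_lsegs()`); the real driver works on a linked list at insertion time, and breaking ties by offset is an assumption of this sketch.

```c
#include <stdint.h>
#include <stdlib.h>

/* Illustrative I/O modes; the names mirror the ones used in the log. */
enum iomode_sketch { IOMODE_READ = 1, IOMODE_RW = 2 };

struct lseg_sketch {
    enum iomode_sketch iomode;
    uint64_t offset;
};

/* Read-only segments sort first; ties are broken by offset (assumption). */
static int lseg_cmp(const void *pa, const void *pb)
{
    const struct lseg_sketch *a = pa;
    const struct lseg_sketch *b = pb;

    if (a->iomode != b->iomode)
        return a->iomode == IOMODE_READ ? -1 : 1;
    return (a->offset > b->offset) - (a->offset < b->offset);
}

void order_lsegs(struct lseg_sketch *segs, size_t n)
{
    qsort(segs, n, sizeof(*segs), lseg_cmp);
}
```

With this order, a front-to-back search for a segment usable for a read reaches the RO segments before any RW segment covering the same range.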

NFSv4.1/pnfs: Allow pNFS device drivers to customise layout segment insertion · 03772d2f

Committed by Trond Myklebust on Aug 25, 2015

This is needed in order to allow merging of contiguous layout segments,
and also to correct the ordering of layouts for those device drivers that
don't necessarily want to place the read-write layouts first.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

NFSv4.1/pnfs Improve the packing of struct pnfs_layout_hdr · 82714bd1

Committed by Trond Myklebust on Aug 25, 2015

Eliminate a couple of holes in the structure, and move the 2 atomics
into the same cacheline.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>
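
To illustrate what "eliminating holes" means, here is a generic sketch that does not reproduce the actual fields of struct pnfs_layout_hdr. Both structs and their member names are hypothetical, and plain `int` stands in for an atomic counter. On a typical LP64 ABI the first layout would be expected to take 32 bytes and the second 24, but the exact sizes depend on the platform.

```c
#include <stddef.h>

/* Counters interleaved with pointers: padding follows each int on LP64. */
struct hdr_loose {
    int   refcount;     /* stand-in for an atomic counter */
    void *segs;
    int   outstanding;  /* second counter, far from the first */
    void *inode;
};

/* Counters grouped together: no holes, and both share one cache line. */
struct hdr_packed {
    int   refcount;
    int   outstanding;
    void *segs;
    void *inode;
};

/* Bytes saved by the reordering on the current platform. */
size_t bytes_saved(void)
{
    return sizeof(struct hdr_loose) - sizeof(struct hdr_packed);
}
```

Tools such as pahole report holes like these directly from debug information, which is usually how such repacking opportunities are found.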

NFSv4.2/pnfs: Make the layoutstats timer configurable · bbf58bf3

Committed by Trond Myklebust on Aug 24, 2015

Allow advanced users to set the layoutstats timer in order to lengthen
or shorten the period between layoutstat transmissions to the server.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

NFS41: remove NFS_LAYOUT_ROC flag · 3976143b

Committed by Peng Tao on Aug 21, 2015

If we return the delegation before closing, we fail to do the roc check
during close because NFS_LAYOUT_ROC is cleared by delegreturn.
This leaves layouts hanging around after delegreturn
+ close, which is a violation of the protocol.
Signed-off-by: Peng Tao <tao.peng@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

19 Aug, 2015 1 commit

NFSv4.1/pnfs: Fix a close/delegreturn hang when return-on-close is set · 4ff376fe

Committed by Trond Myklebust on Aug 18, 2015

The helper pnfs_roc() has already verified that we have no delegations,
and no further open files, hence no outstanding I/O and it has marked
all the return-on-close lsegs as being invalid.
Furthermore, it sets the NFS_LAYOUT_RETURN bit, thus serialising the
close/delegreturn with all future layoutget calls on this inode.

The checks in pnfs_roc_drain() for valid layout segments are therefore
redundant: those cannot exist until another layoutget completes.
The other check for whether or not NFS_LAYOUT_RETURN is set, actually
causes a hang, since we already know that we hold that flag.

To fix, we therefore strip out all the functionality in pnfs_roc_drain()
except the retrieval of the barrier state, and then rename the function
accordingly.
Reported-by: Christoph Hellwig <hch@infradead.org>
Fixes: 5c4a79fb ("Don't prevent layoutgets when doing return-on-close")
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

13 Aug, 2015 1 commit

NFSv4.2/pnfs: Use GFP_NOIO for layoutstat reporting in the writeback path · c8ad8894

Committed by Trond Myklebust on Aug 05, 2015

Prevent a potential deadlock.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

27 Jun, 2015 1 commit

nfs: provide pnfs_report_layoutstat when NFS42 is disabled · 865a7ecb

Committed by Peng Tao on Jun 25, 2015

kbuild test robot reported:
   fs/built-in.o: In function `pnfs_report_layoutstat':
>> (.text+0x151a1c): undefined reference to `nfs42_proc_layoutstats_generic'
Reported-by: kbuild test robot <fengguang.wu@intel.com>
Signed-off-by: Peng Tao <tao.peng@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

24 Jun, 2015 1 commit

pnfs: add pnfs_report_layoutstat helper function · 8733408d

Committed by Peng Tao on Jun 23, 2015

Reviewed-by: Jeff Layton <jeff.layton@primarydata.com>
Signed-off-by: Peng Tao <tao.peng@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

16 Apr, 2015 1 commit

VFS: normal filesystems (and lustre): d_inode() annotations · 2b0143b5

Committed by David Howells on Mar 17, 2015

that's the bulk of filesystem drivers dealing with inodes of their own
Signed-off-by: David Howells <dhowells@redhat.com>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

28 Mar, 2015 4 commits

NFSv4.1/pnfs: Separate out metadata and data consistency for pNFS · 5bb89b47

Committed by Trond Myklebust on Mar 25, 2015

The LAYOUTCOMMIT operation means different things to different layout types.
For blocks and objects, it is both a data and metadata consistency operation.
For files and flexfiles, it is only a metadata consistency operation.

This patch separates out the 2 cases, allowing the files/flexfiles layout
drivers to optimise away the data consistency calls to layoutcommit.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>
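
The split described in this message can be summarised as a small decision table. The sketch below encodes only what the message states: blocks and objects need LAYOUTCOMMIT for both data and metadata consistency, while files and flexfiles need it for metadata only. The enum and function names are hypothetical and do not correspond to kernel symbols.

```c
#include <stdbool.h>

/* Hypothetical identifiers for the four layout types mentioned above. */
enum layout_type_sketch { LT_FILES, LT_FLEXFILES, LT_BLOCKS, LT_OBJECTS };

/* Which kind of consistency the caller is trying to achieve. */
enum sync_kind_sketch { SYNC_DATA, SYNC_METADATA };

/* Does this layout type need a LAYOUTCOMMIT for the given purpose? */
bool layoutcommit_needed(enum layout_type_sketch type,
                         enum sync_kind_sketch kind)
{
    if (kind == SYNC_METADATA)
        return true;        /* every layout type commits metadata */

    switch (type) {
    case LT_BLOCKS:
    case LT_OBJECTS:
        return true;        /* data consistency also goes via LAYOUTCOMMIT */
    default:
        return false;       /* files/flexfiles can skip it for data */
    }
}
```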

NFSv4.1/pnfs: Refactor pnfs_set_layoutcommit() · 67af7611

Committed by Trond Myklebust on Mar 25, 2015

pnfs_set_layoutcommit() and pnfs_commit_set_layoutcommit() are 100% identical
except for the function arguments. Refactor to eliminate the difference.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

NFSv4.1: Don't cache deviceids that have no notifications · df52699e

Committed by Trond Myklebust on Mar 09, 2015

The spec says that once all layouts that reference a given deviceid
have been returned, then we are only allowed to continue to cache
the deviceid if the metadata server supports notifications.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

NFSv4.1: Convert pNFS deviceid to use kfree_rcu() · 84a80f62

Committed by Trond Myklebust on Mar 09, 2015

Use of synchronize_rcu() when unmounting and potentially freeing a lot
of deviceids is problematic. There really is no reason why we can't just
use kfree_rcu() here.
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

18 Feb, 2015 1 commit

pnfs: Refactor the *_layout_mark_request_commit to use pnfs_layout_mark_request_commit · 338d00cf

Committed by Tom Haynes on Feb 17, 2015

The File Layout's filelayout_mark_request_commit() is almost the same as
the Flex File Layout's ff_layout_mark_request_commit(). And that can
be reduced by calling into nfs_request_add_commit_list().
Signed-off-by: Tom Haynes <loghyr@primarydata.com>
Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>

04 Feb, 2015 9 commits

pnfs/flexfiles: Add the FlexFile Layout Driver · d67ae825

Committed by Tom Haynes on Dec 11, 2014

The flexfile layout is a new layout that extends the
file layout. It is currently being drafted as a specification at
https://datatracker.ietf.org/doc/draft-ietf-nfsv4-layout-types/
Signed-off-by: Weston Andros Adamson <dros@primarydata.com>
Signed-off-by: Tom Haynes <loghyr@primarydata.com>
Signed-off-by: Tao Peng <bergwolf@primarydata.com>

nfs41: wait for LAYOUTRETURN before retrying LAYOUTGET · aa8a45ee

Committed by Peng Tao on Dec 01, 2014

Also take care to stop waiting if someone clears the retry bit.
Signed-off-by: Peng Tao <tao.peng@primarydata.com>

nfs41: add NFS_LAYOUT_RETRY_LAYOUTGET to layout header flags · c829013d

Committed by Peng Tao on Dec 01, 2014

Use it to indicate that LD wants to retry layoutget. LD can set
it whenever it wants the common pnfs code to return and retry
pnfs path through a new layout.

The bit gets cleared when client does a new layoutget, when client
closes the file (ROC case), or when kernel needs to evict the inode
(non-ROC case).
Signed-off-by: Peng Tao <tao.peng@primarydata.com>

nfs41: introduce NFS_LAYOUT_RETURN_BEFORE_CLOSE · 193e3aa2

Committed by Peng Tao on Nov 17, 2014

When it is set, generic pnfs will try to send a layoutreturn right
before the last close/delegation_return, regardless of whether
NFS_LAYOUT_ROC is set or not. LD can then make sure layoutreturn is
always sent rather than being omitted.

The difference against NFS_LAYOUT_RETURN is that
NFS_LAYOUT_RETURN_BEFORE_CLOSE does not block usage of the layout so
LD can set it and expect generic layer to try pnfs path at the
same time.
Signed-off-by: Peng Tao <tao.peng@primarydata.com>
Signed-off-by: Tom Haynes <loghyr@primarydata.com>

nfs41: allow async version layoutreturn · 6c16605d

Committed by Peng Tao on Nov 17, 2014

Signed-off-by: Peng Tao <tao.peng@primarydata.com>
Signed-off-by: Tom Haynes <loghyr@primarydata.com>

pnfs: allow LD to ask to resend read through pnfs · ceb11e13

Committed by Peng Tao on Nov 10, 2014

If current IO cannot be completed due to some transient errors,
LD may want to ask generic layer to resend the request through
pnfs again.
Signed-off-by: Peng Tao <tao.peng@primarydata.com>
Signed-off-by: Tom Haynes <loghyr@primarydata.com>

pnfs: pass ds_commit_idx through the commit path · b57ff130

Committed by Weston Andros Adamson on Sep 05, 2014

Pass ds_commit_idx through the nfs commit path. It's used to select
the commit bucket when using pnfs and is ignored when not using pnfs.
Several functions had to be changed: nfs_retry_commit,
nfs_mark_request_commit, pnfs_mark_request_commit and the pnfs layout
driver .mark_request_commit functions.
Signed-off-by: Tom Haynes <loghyr@primarydata.com>

pnfs: release lseg in pnfs_generic_pg_cleanup · 180bb5ec

Committed by Weston Andros Adamson on Sep 10, 2014

This is needed to support mirrored writes - the first write can't just
trash the lseg, we need to keep it around until all mirrors have
written.
Signed-off-by: Weston Andros Adamson <dros@primarydata.com>

nfs41: don't use a layout if it is marked for returning · ce6ab4f2

Committed by Peng Tao on Sep 06, 2014

And if we are to return the same type of layouts, don't bother
sending more layoutgets.
Signed-off-by: NPeng Tao <tao.peng@primarydata.com>
Signed-off-by: NTom Haynes <Thomas.Haynes@primarydata.com>

ce6ab4f2

openanolis / cloud-kernel: synced successfully more than 1 year ago