提交 · 541846502f4fe826cd7c16e4784695ac90736585 · openeuler / Kernel

03 8月, 2022 2 次提交

mm/migrate: Convert migrate_page() to migrate_folio() · 54184650

由 Matthew Wilcox (Oracle) 提交于 6月 06, 2022

Convert all callers to pass a folio.  Most have the folio
already available.  Switch all users from aops->migratepage to
aops->migrate_folio.  Also turn the documentation into kerneldoc.
Signed-off-by: NMatthew Wilcox (Oracle) <willy@infradead.org>
Reviewed-by: NChristoph Hellwig <hch@lst.de>
Acked-by: NDavid Sterba <dsterba@suse.com>

54184650

nfs: Convert to migrate_folio · 4ae84a80

由 Matthew Wilcox (Oracle) 提交于 6月 06, 2022

Use a folio throughout this function.  migrate_page() will be converted
later.
Signed-off-by: NMatthew Wilcox (Oracle) <willy@infradead.org>
Acked-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>
Reviewed-by: NChristoph Hellwig <hch@lst.de>

4ae84a80

01 6月, 2022 1 次提交

NFSv4.1 mark qualified async operations as MOVEABLE tasks · 118f09ed

由 Olga Kornievskaia 提交于 5月 25, 2022

Mark async operations such as RENAME, REMOVE, COMMIT MOVEABLE
for the nfsv4.1+ sessions.

Fixes: 85e39fee ("NFSv4.1 identify and mark RPC tasks that can move between transports")
Signed-off-by: NOlga Kornievskaia <kolga@netapp.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

118f09ed

18 5月, 2022 3 次提交

NFS: Further fixes to the writeback error handling · c6fd3511

由 Trond Myklebust 提交于 5月 14, 2022

When we handle an error by redirtying the page, we're not corrupting the
mapping, so we don't want the error to be recorded in the mapping.
If the caller has specified a sync_mode of WB_SYNC_NONE, we can just
return AOP_WRITEPAGE_ACTIVATE. However if we're dealing with
WB_SYNC_ALL, we need to ensure that retries happen when the errors are
non-fatal.
Reported-by: NOlga Kornievskaia <aglo@umich.edu>
Fixes: 8fc75bed ("NFS: Fix up return value on fatal errors in nfs_page_async_flush()")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

c6fd3511

NFS: Don't report errors from nfs_pageio_complete() more than once · c5e483b7

由 Trond Myklebust 提交于 5月 14, 2022

Since errors from nfs_pageio_complete() are already being reported
through nfs_async_write_error(), we should not be returning them to the
callers of do_writepages() as well. They will end up being reported
through the generic mechanism instead.

Fixes: 6fbda89b ("NFS: Replace custom error reporting mechanism with generic one")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

c5e483b7

NFS: Do not report EINTR/ERESTARTSYS as mapping errors · cea9ba72

由 Trond Myklebust 提交于 5月 14, 2022

If the attempt to flush data was interrupted due to a local signal, then
just requeue the writes back for I/O.

cea9ba72

23 3月, 2022 3 次提交

nfs: remove reliance on bdi congestion · 6df25e58

由 NeilBrown 提交于 3月 22, 2022

The bdi congestion tracking in not widely used and will be removed.

NFS is one of a small number of filesystems that uses it, setting just
the async (write) congestion flag at what it determines are appropriate
times.

The only remaining effect of the async flag is to cause (some)
WB_SYNC_NONE writes to be skipped.

So instead of setting the flag, set an internal flag and change:

 - .writepages to do nothing if WB_SYNC_NONE and the flag is set

 - .writepage to return AOP_WRITEPAGE_ACTIVATE if WB_SYNC_NONE and the
   flag is set.

The writepages change causes a behavioural change in that pageout() can
now return PAGE_ACTIVATE instead of PAGE_KEEP, so SetPageActive() will be
called on the page which (I think) wil further delay the next attempt at
writeout.  This might be a good thing.

Link: https://lkml.kernel.org/r/164549983738.9187.3972219847989393182.stgit@noble.brownSigned-off-by: NNeilBrown <neilb@suse.de>
Cc: Anna Schumaker <Anna.Schumaker@Netapp.com>
Cc: Chao Yu <chao@kernel.org>
Cc: Darrick J. Wong <djwong@kernel.org>
Cc: Ilya Dryomov <idryomov@gmail.com>
Cc: Jaegeuk Kim <jaegeuk@kernel.org>
Cc: Jan Kara <jack@suse.cz>
Cc: Jeff Layton <jlayton@kernel.org>
Cc: Jens Axboe <axboe@kernel.dk>
Cc: Lars Ellenberg <lars.ellenberg@linbit.com>
Cc: Miklos Szeredi <miklos@szeredi.hu>
Cc: Paolo Valente <paolo.valente@linaro.org>
Cc: Philipp Reisner <philipp.reisner@linbit.com>
Cc: Ryusuke Konishi <konishi.ryusuke@gmail.com>
Cc: Trond Myklebust <trond.myklebust@hammerspace.com>
Cc: Wu Fengguang <fengguang.wu@intel.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

6df25e58

NFS: Avoid writeback threads getting stuck in mempool_alloc() · 0bae835b

由 Trond Myklebust 提交于 3月 21, 2022

In a low memory situation, allow the NFS writeback code to fail without
getting stuck in infinite loops in mempool_alloc().
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

0bae835b

NFS: nfsiod should not block forever in mempool_alloc() · 515dcdcd

由 Trond Myklebust 提交于 3月 21, 2022

The concern is that since nfsiod is sometimes required to kick off a
commit, it can get locked up waiting forever in mempool_alloc() instead
of failing gracefully and leaving the commit until later.

Try to allocate from the slab first, with GFP_KERNEL | __GFP_NORETRY,
then fall back to a non-blocking attempt to allocate from the memory
pool.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

515dcdcd

15 3月, 2022 1 次提交

nfs: Convert from invalidatepage to invalidate_folio · 6d740c76

由 Matthew Wilcox (Oracle) 提交于 2月 09, 2022

Print the folio index instead of the pointer, since this is more
useful.  We also don't need to use page_file_mapping() as we do not
invalidate swapcache pages.  Since this is the only caller of
nfs_wb_page_cancel(), convert it to nfs_wb_folio_cancel().
Signed-off-by: NMatthew Wilcox (Oracle) <willy@infradead.org>
Tested-by: NDamien Le Moal <damien.lemoal@opensource.wdc.com>
Acked-by: NDamien Le Moal <damien.lemoal@opensource.wdc.com>
Tested-by: Mike Marshall <hubcap@omnibond.com> # orangefs
Tested-by: David Howells <dhowells@redhat.com> # afs

6d740c76

14 3月, 2022 1 次提交

SUNRPC: improve 'swap' handling: scheduling and PF_MEMALLOC · 8db55a03

由 NeilBrown 提交于 3月 07, 2022

rpc tasks can be marked as RPC_TASK_SWAPPER.  This causes GFP_MEMALLOC
to be used for some allocations.  This is needed in some cases, but not
in all where it is currently provided, and in some where it isn't
provided.

Currently *all* tasks associated with a rpc_client on which swap is
enabled get the flag and hence some GFP_MEMALLOC support.

GFP_MEMALLOC is provided for ->buf_alloc() but only swap-writes need it.
However xdr_alloc_bvec does not get GFP_MEMALLOC - though it often does
need it.

xdr_alloc_bvec is called while the XPRT_LOCK is held.  If this blocks,
then it blocks all other queued tasks.  So this allocation needs
GFP_MEMALLOC for *all* requests, not just writes, when the xprt is used
for any swap writes.

Similarly, if the transport is not connected, that will block all
requests including swap writes, so memory allocations should get
GFP_MEMALLOC if swap writes are possible.

So with this patch:
 1/ we ONLY set RPC_TASK_SWAPPER for swap writes.
 2/ __rpc_execute() sets PF_MEMALLOC while handling any task
    with RPC_TASK_SWAPPER set, or when handling any task that
    holds the XPRT_LOCKED lock on an xprt used for swap.
    This removes the need for the RPC_IS_SWAPPER() test
    in ->buf_alloc handlers.
 3/ xprt_prepare_transmit() sets PF_MEMALLOC after locking
    any task to a swapper xprt.  __rpc_execute() will clear it.
 3/ PF_MEMALLOC is set for all the connect workers.

Reviewed-by: Chuck Lever <chuck.lever@oracle.com> (for xprtrdma parts)
Signed-off-by: NNeilBrown <neilb@suse.de>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

8db55a03

26 2月, 2022 2 次提交

NFS: Use of mapping_set_error() results in spurious errors · 6c984083

由 Trond Myklebust 提交于 2月 15, 2022

The use of mapping_set_error() in conjunction with calls to
filemap_check_errors() is problematic because every error gets reported
as either an EIO or an ENOSPC by filemap_check_errors() in functions
such as filemap_write_and_wait() or filemap_write_and_wait_range().
In almost all cases, we prefer to use the more nuanced wb errors.

Fixes: b8946d7b ("NFS: Revalidate the file mapping on all fatal writeback errors")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

6c984083

NFS: Replace last uses of NFS_INO_REVAL_PAGECACHE · 88a6099f

由 Trond Myklebust 提交于 2月 09, 2022

Now that we have more fine grained attribute revalidation, let's just
get rid of NFS_INO_REVAL_PAGECACHE.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

88a6099f

10 1月, 2022 2 次提交

nfs: Implement cache I/O by accessing the cache directly · 16f2f4e6

由 David Howells 提交于 8月 27, 2021

Move NFS to using fscache DIO API instead of the old upstream I/O API as
that has been removed.  This is a stopgap solution as the intention is that
at sometime in the future, the cache will move to using larger blocks and
won't be able to store individual pages in order to deal with the potential
for data corruption due to the backing filesystem being able insert/remove
bridging blocks of zeros into its extent list[1].

NFS then reads and writes cache pages synchronously and one page at a time.

The preferred change would be to use the netfs lib, but the new I/O API can
be used directly.  It's just that as the cache now needs to track data for
itself, caching blocks may exceed page size...

This code is somewhat borrowed from my "fallback I/O" patchset[2].

Changes
=======
ver #3:
 - Restore lost =n fallback for nfs_fscache_release_page()[2].
Signed-off-by: NDavid Howells <dhowells@redhat.com>
Tested-by: NDave Wysochanski <dwysocha@redhat.com>
Acked-by: NJeff Layton <jlayton@kernel.org>
cc: Trond Myklebust <trond.myklebust@hammerspace.com>
cc: Anna Schumaker <anna.schumaker@netapp.com>
cc: linux-nfs@vger.kernel.org
cc: linux-cachefs@redhat.com
Link: https://lore.kernel.org/r/YO17ZNOcq+9PajfQ@mit.edu [1]
Link: https://lore.kernel.org/r/202112100957.2oEDT20W-lkp@intel.com/ [2]
Link: https://lore.kernel.org/r/163189108292.2509237.12615909591150927232.stgit@warthog.procyon.org.uk/ [2]
Link: https://lore.kernel.org/r/163906981318.143852.17220018647843475985.stgit@warthog.procyon.org.uk/ # v2
Link: https://lore.kernel.org/r/163967184451.1823006.6450645559828329590.stgit@warthog.procyon.org.uk/ # v3
Link: https://lore.kernel.org/r/164021577632.640689.11069627070150063812.stgit@warthog.procyon.org.uk/ # v4

16f2f4e6

nfs: Convert to new fscache volume/cookie API · a6b5a28e

由 Dave Wysochanski 提交于 11月 14, 2020

Change the nfs filesystem to support fscache's indexing rewrite and
reenable caching in nfs.

The following changes have been made:

 (1) The fscache_netfs struct is no more, and there's no need to register
     the filesystem as a whole.

 (2) The session cookie is now an fscache_volume cookie, allocated with
     fscache_acquire_volume().  That takes three parameters: a string
     representing the "volume" in the index, a string naming the cache to
     use (or NULL) and a u64 that conveys coherency metadata for the
     volume.

     For nfs, I've made it render the volume name string as:

        "nfs,<ver>,<family>,<address>,<port>,<fsidH>,<fsidL>*<,param>[,<uniq>]"

 (3) The fscache_cookie_def is no more and needed information is passed
     directly to fscache_acquire_cookie().  The cache no longer calls back
     into the filesystem, but rather metadata changes are indicated at
     other times.

     fscache_acquire_cookie() is passed the same keying and coherency
     information as before.

 (4) fscache_enable/disable_cookie() have been removed.

     Call fscache_use_cookie() and fscache_unuse_cookie() when a file is
     opened or closed to prevent a cache file from being culled and to keep
     resources to hand that are needed to do I/O.

     If a file is opened for writing, we invalidate it with
     FSCACHE_INVAL_DIO_WRITE in lieu of doing writeback to the cache,
     thereby making it cease caching until all currently open files are
     closed.  This should give the same behaviour as the uptream code.
     Making the cache store local modifications isn't straightforward for
     NFS, so that's left for future patches.

 (5) fscache_invalidate() now needs to be given uptodate auxiliary data and
     a file size.  It also takes a flag to indicate if this was due to a
     DIO write.

 (6) Call nfs_fscache_invalidate() with FSCACHE_INVAL_DIO_WRITE on a file
     to which a DIO write is made.

 (7) Call fscache_note_page_release() from nfs_release_page().

 (8) Use a killable wait in nfs_vm_page_mkwrite() when waiting for
     PG_fscache to be cleared.

 (9) The functions to read and write data to/from the cache are stubbed out
     pending a conversion to use netfslib.

Changes
=======
ver #3:
 - Added missing =n fallback for nfs_fscache_release_file()[1][2].

ver #2:
 - Use gfpflags_allow_blocking() rather than using flag directly.
 - fscache_acquire_volume() now returns errors.
 - Remove NFS_INO_FSCACHE as it's no longer used.
 - Need to unuse a cookie on file-release, not inode-clear.
Signed-off-by: NDave Wysochanski <dwysocha@redhat.com>
Co-developed-by: NDavid Howells <dhowells@redhat.com>
Signed-off-by: NDavid Howells <dhowells@redhat.com>
Tested-by: NDave Wysochanski <dwysocha@redhat.com>
Acked-by: NJeff Layton <jlayton@kernel.org>
cc: Trond Myklebust <trond.myklebust@hammerspace.com>
cc: Anna Schumaker <anna.schumaker@netapp.com>
cc: linux-nfs@vger.kernel.org
cc: linux-cachefs@redhat.com
Link: https://lore.kernel.org/r/202112100804.nksO8K4u-lkp@intel.com/ [1]
Link: https://lore.kernel.org/r/202112100957.2oEDT20W-lkp@intel.com/ [2]
Link: https://lore.kernel.org/r/163819668938.215744.14448852181937731615.stgit@warthog.procyon.org.uk/ # v1
Link: https://lore.kernel.org/r/163906979003.143852.2601189243864854724.stgit@warthog.procyon.org.uk/ # v2
Link: https://lore.kernel.org/r/163967182112.1823006.7791504655391213379.stgit@warthog.procyon.org.uk/ # v3
Link: https://lore.kernel.org/r/164021575950.640689.12069642327533368467.stgit@warthog.procyon.org.uk/ # v4

a6b5a28e

22 10月, 2021 1 次提交

NFS: Remove redundant call to __set_page_dirty_nobuffers · 4cd27df8

由 Trond Myklebust 提交于 10月 21, 2021

Remove a redundant call in nfs_updatepage(). nfs_writepage_setup() will
have already called nfs_mark_request_dirty() on success.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

4cd27df8

21 10月, 2021 2 次提交

SUNRPC: Trace calls to .rpc_call_done · b40887e1

由 Chuck Lever 提交于 10月 16, 2021

Introduce a single tracepoint that can replace simple dprintk call
sites in upper layer "rpc_call_done" callbacks. Example:

kworker/u24:2-1254 [001] 771.026677: rpc_stats_latency: task:00000001@00000002 xid=0x16a6f3c0 rpcbindv2 GETPORT backlog=446 rtt=101 execute=555
kworker/u24:2-1254 [001] 771.026677: rpc_task_call_done: task:00000001@00000002 flags=ASYNC|DYNAMIC|SOFT|SOFTCONN|SENT runstate=RUNNING|ACTIVE status=0 action=rpcb_getport_done
kworker/u24:2-1254 [001] 771.026678: rpcb_setport: task:00000001@00000002 status=0 port=20048
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

b40887e1

NFS: Fix up commit deadlocks · 133a48ab

由 Trond Myklebust 提交于 10月 04, 2021

If O_DIRECT bumps the commit_info rpcs_out field, then that could lead
to fsync() hangs. The fix is to ensure that O_DIRECT calls
nfs_commit_end().

Fixes: 723c921e ("sched/wait, fs/nfs: Convert wait_on_atomic_t() usage to the new wait_var_event() API")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

133a48ab

10 10月, 2021 2 次提交

NFS: Fix deadlocks in nfs_scan_commit_list() · 64a93dbf

由 Trond Myklebust 提交于 10月 04, 2021

Partially revert commit 2ce209c4 ("NFS: Wait for requests that are
locked on the commit list"), since it can lead to deadlocks between
commit requests and nfs_join_page_group().
For now we should assume that any locked requests on the commit list are
either about to be removed and committed by another task, or the writes
they describe are about to be retransmitted. In either case, we should
not need to worry.

Fixes: 2ce209c4 ("NFS: Wait for requests that are locked on the commit list")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

64a93dbf

NFS: Instrument i_size_write() · 110cb2d2

由 Chuck Lever 提交于 10月 04, 2021

Generate a trace event whenever the NFS client modifies the size of
a file. These new events aid troubleshooting workloads that trigger
races around size updates.

There are four new trace points, all named nfs_size_something so
they are easy to grep for or enable as a group with a single glob.

Size updated on the server:

kworker/u24:10-194 [010] 369.939174: nfs_size_update: fileid=00:28:2 fhandle=0x36fbbe51 version=1752899344277980615 cursize=250471 newsize=172083

Server-side size update reported via NFSv3 WCC attributes:

fsx-1387 [006] 380.760686: nfs_size_wcc: fileid=00:28:2 fhandle=0x36fbbe51 version=1752899355909932456 cursize=146792 newsize=171216

File has been truncated locally:

fsx-1387 [007] 369.437421: nfs_size_truncate: fileid=00:28:2 fhandle=0x36fbbe51 version=1752899231200117272 cursize=215244 newsize=0

File has been extended locally:

fsx-1387 [007] 369.439213: nfs_size_grow: fileid=00:28:2 fhandle=0x36fbbe51 version=1752899343704248410 cursize=258048 newsize=262144
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

110cb2d2

04 10月, 2021 1 次提交

NFS: Fix up nfs_ctx_key_to_expire() · ca05cbae

由 Trond Myklebust 提交于 7月 10, 2021

If the cached credential exists but doesn't have any expiration callback
then exit early.
Fix up atomicity issues when replacing the credential with a new one
since the existing code could lead to refcount leaks.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

ca05cbae

09 7月, 2021 1 次提交

NFSv4.1 identify and mark RPC tasks that can move between transports · 85e39fee

由 Olga Kornievskaia 提交于 6月 23, 2021

In preparation for when we can re-try a task on a different transport,
identify and mark such RPC tasks as moveable. Only 4.1+ operarations can
be re-tried on a different transport.
Signed-off-by: NOlga Kornievskaia <kolga@netapp.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

85e39fee

13 4月, 2021 3 次提交

NFSv4: Don't modify the change attribute cached in the inode · 993e2d4b

由 Trond Myklebust 提交于 4月 12, 2021

When the client is caching data and a write delegation is held, then the
server may send a CB_GETATTR to query the attributes. When this happens,
the client is supposed to bump the change attribute value that it
returns if it holds cached data.
However that process uses a value that is stored in the delegation. We
do not want to bump the change attribute held in the inode.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

993e2d4b

NFS: Separate tracking of file mode cache validity from the uid/gid · 720869eb

由 Trond Myklebust 提交于 4月 13, 2021

chown()/chgrp() and chmod() are separate operations, and in addition,
there are mode operations that are performed automatically by the
server. So let's track mode validity separately from the file ownership
validity.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

720869eb

NFS: Replace use of NFS_INO_REVAL_PAGECACHE when checking cache validity · 13c0b082

由 Trond Myklebust 提交于 3月 25, 2021

When checking cache validity, be more specific than just 'we want to
check the page cache validity'. In almost all cases, we want to check
that change attribute, and possibly also the size.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

13c0b082

09 3月, 2021 1 次提交

NFS: Fix open coded versions of nfs_set_cache_invalid() · ac46b3d7

由 Trond Myklebust 提交于 3月 08, 2021

nfs_set_cache_invalid() has code to handle delegations, and other
optimisations, so let's use it when appropriate.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

ac46b3d7

17 2月, 2021 1 次提交

NFS: Add support for eager writes · ed7bcdb3

由 Trond Myklebust 提交于 2月 12, 2021

Support eager writing to the server, meaning that we write the data to
cache on the server, and wait for that to complete. This ensures that we
see ENOSPC errors immediately.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

ed7bcdb3

09 2月, 2021 1 次提交

NFS: Optimise sparse writes past the end of file · fc9dc401

由 Trond Myklebust 提交于 2月 08, 2021

If we're doing a write, and the entire page lies beyond the end-of-file,
then we can assume the write can be extended to cover the beginning of
the page, since we know the data in that region will be all zeros.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

fc9dc401

03 6月, 2020 1 次提交

mm/writeback: discard NR_UNSTABLE_NFS, use NR_WRITEBACK instead · 8d92890b

由 NeilBrown 提交于 6月 01, 2020

After an NFS page has been written it is considered "unstable" until a
COMMIT request succeeds.  If the COMMIT fails, the page will be
re-written.

These "unstable" pages are currently accounted as "reclaimable", either
in WB_RECLAIMABLE, or in NR_UNSTABLE_NFS which is included in a
'reclaimable' count.  This might have made sense when sending the COMMIT
required a separate action by the VFS/MM (e.g.  releasepage() used to
send a COMMIT).  However now that all writes generated by ->writepages()
will automatically be followed by a COMMIT (since commit 919e3bd9
("NFS: Ensure we commit after writeback is complete")) it makes more
sense to treat them as writeback pages.

So this patch removes NR_UNSTABLE_NFS and accounts unstable pages in
NR_WRITEBACK and WB_WRITEBACK.

A particular effect of this change is that when
wb_check_background_flush() calls wb_over_bg_threshold(), the latter
will report 'true' a lot less often as the 'unstable' pages are no
longer considered 'dirty' (as there is nothing that writeback can do
about them anyway).

Currently wb_check_background_flush() will trigger writeback to NFS even
when there are relatively few dirty pages (if there are lots of unstable
pages), this can result in small writes going to the server (10s of
Kilobytes rather than a Megabyte) which hurts throughput.  With this
patch, there are fewer writes which are each larger on average.

Where the NR_UNSTABLE_NFS count was included in statistics
virtual-files, the entry is retained, but the value is hard-coded as
zero.  static trace points and warning printks which mentioned this
counter no longer report it.

[akpm@linux-foundation.org: re-layout comment]
[akpm@linux-foundation.org: fix printk warning]
Signed-off-by: NNeilBrown <neilb@suse.de>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Reviewed-by: NJan Kara <jack@suse.cz>
Reviewed-by: NChristoph Hellwig <hch@lst.de>
Acked-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Acked-by: Michal Hocko <mhocko@suse.com>	[mm]
Cc: Christoph Hellwig <hch@lst.de>
Cc: Chuck Lever <chuck.lever@oracle.com>
Link: http://lkml.kernel.org/r/87d06j7gqa.fsf@notabene.neil.brown.nameSigned-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

8d92890b

13 5月, 2020 1 次提交

NFS/pnfs: Don't use RPC_TASK_CRED_NOREF with pnfs · 4fa7ef69

由 Trond Myklebust 提交于 5月 13, 2020

When we're doing pnfs then the credential being used for the RPC call
is not necessarily the same as the one used in the open context, so
don't use RPC_TASK_CRED_NOREF.

Fixes: 61296507 ("NFSv4: Avoid referencing the cred unnecessarily during NFSv4 I/O")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

4fa7ef69

02 4月, 2020 4 次提交

NFS: Try to join page groups before an O_DIRECT retransmission · ed5d588f

由 Trond Myklebust 提交于 3月 30, 2020

If we have to retransmit requests, try to join their page groups
first.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

ed5d588f

NFS: Refactor nfs_lock_and_join_requests() · e00ed89d

由 Trond Myklebust 提交于 3月 30, 2020

Refactor nfs_lock_and_join_requests() in order to separate out the
subrequest merging into its own function nfs_lock_and_join_group()
that can be used by O_DIRECT.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

e00ed89d

NFS: Clean up nfs_lock_and_join_requests() · a62f8e3b

由 Trond Myklebust 提交于 3月 30, 2020

Clean up nfs_lock_and_join_requests() to simplify the calculation
of the range covered by the page group, taking into account the
presence of mirrors.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

a62f8e3b

NFS: Fix races nfs_page_group_destroy() vs nfs_destroy_unlinked_subrequests() · 08ca8b21

由 Trond Myklebust 提交于 4月 01, 2020

When a subrequest is being detached from the subgroup, we want to
ensure that it is not holding the group lock, or in the process
of waiting for the group lock.

Fixes: 5b2b5187 ("NFS: Fix nfs_page_group_destroy() and nfs_lock_and_join_requests() race cases")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

08ca8b21

01 4月, 2020 1 次提交

NFS: Fix a page leak in nfs_destroy_unlinked_subrequests() · add42de3

由 Trond Myklebust 提交于 4月 01, 2020

When we detach a subrequest from the list, we must also release the
reference it holds to the parent.

Fixes: 5b2b5187 ("NFS: Fix nfs_page_group_destroy() and nfs_lock_and_join_requests() race cases")
Cc: stable@vger.kernel.org # v4.14+
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

add42de3

28 3月, 2020 1 次提交

NFS: Fix O_DIRECT commit verifier handling · 1f28476d

由 Trond Myklebust 提交于 3月 21, 2020

Instead of trying to save the commit verifiers and checking them against
previous writes, adopt the same strategy as for buffered writes, of
just checking the verifiers at commit time.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

1f28476d

26 3月, 2020 1 次提交

NFS/pNFS: Refactor pnfs_generic_commit_pagelist() · 19573c93

由 Trond Myklebust 提交于 3月 19, 2020

Refactor pnfs_generic_commit_pagelist() to simplify the conversion
to layout segment based commit lists.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

19573c93

16 3月, 2020 1 次提交

NFS: Assume cred is pinned by open context in I/O requests · 542b994b

由 Trond Myklebust 提交于 2月 07, 2020

In read/write/commit, we should be able to assume that the cred is
pinned by the open context.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

542b994b

15 1月, 2020 2 次提交

NFS: When resending after a short write, reset the reply count to zero · 8c9cb714

由 Trond Myklebust 提交于 1月 06, 2020

If we're resending a write due to a short read or write, ensure we
reset the reply count to zero.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

8c9cb714

NFS: Clean up generic file commit tracepoint · 7bdd297e

由 Trond Myklebust 提交于 1月 06, 2020

Clean up the generic file commit tracepoints to use a 64-bit value
for the verifier, and to display the pNFS filehandle, if it exists.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: NAnna Schumaker <Anna.Schumaker@Netapp.com>

7bdd297e

openeuler / Kernel 接近 2 年 前同步成功

openeuler / Kernel
接近 2 年前同步成功