Commits · c917cfaf9bbef44dd35b75b8fb772a44798a1cf2 · openeuler / Kernel

26 Apr, 2019 · 9 commits

NFS: Fix up NFS I/O subrequest creation · c917cfaf

Committed by Trond Myklebust on Apr 07, 2019

We require all NFS I/O subrequests to duplicate the lock context as well
as the open context.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

NFS: Replace custom error reporting mechanism with generic one · 6fbda89b

Committed by Trond Myklebust on Apr 07, 2019

Replace the NFS custom error reporting mechanism with the generic
mapping_set_error().

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

NFS: Don't inadvertently clear writeback errors · aded8d7b

Committed by Trond Myklebust on Apr 07, 2019

vfs_fsync() has the side effect of clearing unreported writeback errors,
so we need to make sure that we do not abuse it in situations where
applications might not normally expect us to report those errors.

The solution is to replace calls to vfs_fsync() with calls to nfs_wb_all().

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

NFS: Don't call generic_error_remove_page() while holding locks · 22876f54

Committed by Trond Myklebust on Apr 07, 2019

The NFS read code can trigger writeback while holding the page lock.
If an error then triggers a call to nfs_write_error_remove_page(),
we can deadlock.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

NFS: Don't interrupt file writeout due to fatal errors · 14bebe3c

Committed by Trond Myklebust on Apr 07, 2019

When flushing out dirty pages, the fact that we may hit fatal errors
is not a reason to stop writeback. Those errors are reported through
fsync(), not through the flush mechanism.

Fixes: a6598813 ("NFS: Don't write back further requests if there...")
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

NFS: Add a mount option "softerr" to allow clients to see ETIMEDOUT errors · 91a575e1

Committed by Trond Myklebust on Apr 07, 2019

Add a mount option that exposes the ETIMEDOUT errors that occur during
soft timeouts to the application. This allows aware applications to
distinguish between server disk IO errors and client timeout errors.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
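
As a rough illustration of what an "aware" application might do with this distinction, the sketch below maps an errno value from a failed read(), write() or fsync() onto a coarse category. The enum and the function name are hypothetical, and the sketch assumes the mount uses `softerr`, so that a client timeout surfaces as ETIMEDOUT rather than EIO.

```c
#include <errno.h>

/* Hypothetical categories for illustration; not a kernel or libc API. */
enum io_failure {
    IO_FAILURE_NONE,           /* no error */
    IO_FAILURE_CLIENT_TIMEOUT, /* soft timeout reported as ETIMEDOUT */
    IO_FAILURE_SERVER_IO,      /* EIO, e.g. a server-side disk error */
    IO_FAILURE_OTHER           /* anything else */
};

/* Classify a positive errno value as seen by a user-space caller. */
enum io_failure classify_io_errno(int err)
{
    switch (err) {
    case 0:
        return IO_FAILURE_NONE;
    case ETIMEDOUT:
        return IO_FAILURE_CLIENT_TIMEOUT;
    case EIO:
        return IO_FAILURE_SERVER_IO;
    default:
        return IO_FAILURE_OTHER;
    }
}
```

A caller would typically pass `errno` right after the failing system call and decide, for example, to retry on a timeout but to report a server I/O error immediately.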

NFS: Consider ETIMEDOUT to be a fatal error · 11982a7c

Committed by Trond Myklebust on Apr 07, 2019

When we introduce the 'softerr' mount option, we will see the RPC
layer returning ETIMEDOUT errors if the server is unresponsive. We
want to consider those errors to be fatal on par with the EIO errors
that are returned by ordinary 'soft' timeouts.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
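
The idea of treating both error codes alike can be sketched as a small predicate. This is a simplified user-space model with a hypothetical name; the set of codes is limited to the two that the message names and should not be read as the kernel's actual list.

```c
#include <errno.h>
#include <stdbool.h>

/*
 * Hypothetical helper for illustration only.
 * Takes a negative errno value, as kernel code conventionally does.
 */
bool io_error_is_fatal_sketch(int err)
{
    switch (err) {
    case -EIO:       /* returned by an ordinary 'soft' timeout */
    case -ETIMEDOUT: /* returned for a timeout under 'softerr' */
        return true;
    default:
        return false;
    }
}
```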

SUNRPC: Add function rpc_sleep_on_timeout() · 6b2e6856

Committed by Trond Myklebust on Apr 07, 2019

Clean up the RPC task sleep interfaces by replacing the task->tk_timeout
'hidden parameter' to rpc_sleep_on() with a new function that takes an
absolute timeout.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

SUNRPC: Remove unused argument 'action' from rpc_sleep_on_priority() · 8357a9b6

Committed by Trond Myklebust on Apr 07, 2019

None of the callers set the 'action' argument, so let's just remove it.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>

12 Apr, 2019 · 3 commits

NFSv4.1 fix incorrect return value in copy_file_range · 0769663b

Committed by Olga Kornievskaia on Apr 11, 2019

According to the NFSv4.2 spec, if the input and output files are the
same file, the operation should fail with EINVAL. However, the Linux
copy_file_range() system call has no such restriction. Therefore,
in that case let's return EOPNOTSUPP and allow the VFS to fall back
to do_splice_direct(). Also, when copy_file_range() is called
on an NFSv4.0 or 4.1 mount (i.e., a server that doesn't support
COPY functionality), we need to return EOPNOTSUPP and
fall back to a regular copy.

Fixes xfstest generic/075, generic/091, generic/112, generic/263
for all NFSv4.x versions.

Signed-off-by: Olga Kornievskaia <kolga@netapp.com>
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
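
The two fallback conditions described above can be sketched as a simple check. This is a user-space model, not the patch itself: the structure, its fields and the function name are all hypothetical.

```c
#include <errno.h>
#include <stdbool.h>

/* Hypothetical, simplified view of a copy request; not a kernel structure. */
struct copy_request_sketch {
    unsigned long src_inode;   /* identity of the source file */
    unsigned long dst_inode;   /* identity of the destination file */
    bool server_supports_copy; /* false on NFSv4.0/4.1 mounts */
};

/*
 * Return 0 if a server-side COPY may be attempted, or -EOPNOTSUPP to ask
 * the caller to fall back to an ordinary copy.
 */
int copy_offload_check(const struct copy_request_sketch *req)
{
    if (!req->server_supports_copy)
        return -EOPNOTSUPP;
    if (req->src_inode == req->dst_inode)
        return -EOPNOTSUPP; /* not -EINVAL, so a same-file copy still works */
    return 0;
}
```

Returning the same code in both cases lets the caller use one fallback path instead of having to tell the two situations apart.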

NFS: Fix handling of reply page vector · 29e7ca71

Committed by Chuck Lever on Apr 09, 2019

NFSv4 GETACL and FS_LOCATIONS requests stopped working in v5.1-rc.

These two need the extra padding to be added directly to the reply
length.

Reported-by: Olga Kornievskaia <aglo@umich.edu>
Fixes: 02ef04e4 ("NFS: Account for XDR pad of buf->pages")
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
Tested-by: Olga Kornievskaia <aglo@umich.edu>
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFS: Forbid setting AF_INET6 to "struct sockaddr_in"->sin_family. · 7c2bd9a3

Committed by Tetsuo Handa on Mar 30, 2019

syzbot is reporting an uninitialized value at rpc_sockaddr2uaddr() [1]. This
is because syzbot is setting AF_INET6 to "struct sockaddr_in"->sin_family
(which is embedded into the user-visible "struct nfs_mount_data" structure)
even though nfs23_validate_mount_data() cannot pass sizeof(struct sockaddr_in6)
bytes of AF_INET6 address to rpc_sockaddr2uaddr().

Since the "struct nfs_mount_data" structure is user-visible, we can't change
"struct nfs_mount_data" to use "struct sockaddr_storage". Therefore,
assuming that everybody is using the AF_INET family when passing an address via
"struct nfs_mount_data"->addr, reject it if its sin_family is not AF_INET.

[1] https://syzkaller.appspot.com/bug?id=599993614e7cbbf66bc2656a919ab2a95fb5d75c

Reported-by: syzbot <syzbot+047a11c361b872896a4f@syzkaller.appspotmail.com>
Signed-off-by: Tetsuo Handa <penguin-kernel@I-love.SAKURA.ne.jp>
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
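
The check described above amounts to refusing any address whose family does not match the structure that carries it. A minimal user-space sketch follows; the function name is hypothetical, and the -EINVAL return value is an assumption, since the message only says the address is rejected.

```c
#include <errno.h>
#include <netinet/in.h>
#include <sys/socket.h>

/*
 * Hypothetical validation helper for illustration only.
 * Returning -EINVAL is an assumption, not taken from the patch.
 */
int check_mount_addr_family(const struct sockaddr_in *sin)
{
    /* struct sockaddr_in only has room for an IPv4 address. */
    if (sin->sin_family != AF_INET)
        return -EINVAL;
    return 0;
}
```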

24 Mar, 2019 · 2 commits

pNFS/flexfiles: Fix layoutstats handling during read failovers · 166bd5b8

Committed by Trond Myklebust on Mar 22, 2019

During a read failover, we may end up changing the value of
the pgio_mirror_idx, so make sure that we record the layout
stats before that update.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFS: Fix a typo in nfs_init_timeout_values() · 5a698243

Committed by Trond Myklebust on Mar 21, 2019

Specifying a retrans=0 mount parameter on an NFS/TCP mount
inadvertently causes the NFS client to rewrite any specified
timeout parameter to the default of 60 seconds.

Fixes: a956beda ("NFS: Allow the mount option retrans=0")
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

20 Mar, 2019 · 1 commit

NFSv4.1 don't free interrupted slot on open · 0cb98abb

Committed by Olga Kornievskaia on Mar 19, 2019

Allow the async RPC task to finish and update the open state if needed,
then free the slot. Otherwise, the async RPC task is unable to decode the reply.

Signed-off-by: Olga Kornievskaia <kolga@netapp.com>
Fixes: ae55e59d ("pnfs: Don't release the sequence slot...")
Cc: stable@vger.kernel.org # v4.18+
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

19 Mar, 2019 · 1 commit

NFS: Fix nfs4_lock_state refcounting in nfs4_alloc_{lock,unlock}data() · 3028efe0

Committed by Catalin Marinas on Mar 18, 2019

Commit 7b587e1a ("NFS: use locks_copy_lock() to copy locks.")
changed the lock copying from memcpy() to the dedicated
locks_copy_lock() function. The latter correctly increments the
nfs4_lock_state.ls_count via nfs4_fl_copy_lock(), however, this refcount
has already been incremented in the nfs4_alloc_{lock,unlock}data().
Kmemleak subsequently reports an unreferenced nfs4_lock_state object as
below (arm64 platform):

unreferenced object 0xffff8000fce0b000 (size 256):
  comm "systemd-sysuser", pid 1608, jiffies 4294892825 (age 32.348s)
  hex dump (first 32 bytes):
    20 57 4c fb 00 80 ff ff 20 57 4c fb 00 80 ff ff   WL..... WL.....
    00 57 4c fb 00 80 ff ff 01 00 00 00 00 00 00 00  .WL.............
  backtrace:
    [<000000000d15010d>] kmem_cache_alloc+0x178/0x208
    [<00000000d7c1d264>] nfs4_set_lock_state+0x124/0x1f0
    [<000000009c867628>] nfs4_proc_lock+0x90/0x478
    [<000000001686bd74>] do_setlk+0x64/0xe8
    [<00000000e01500d4>] nfs_lock+0xe8/0x1f0
    [<000000004f387d8d>] vfs_lock_file+0x18/0x40
    [<00000000656ab79b>] do_lock_file_wait+0x68/0xf8
    [<00000000f17c4a4b>] fcntl_setlk+0x224/0x280
    [<0000000052a242c6>] do_fcntl+0x418/0x730
    [<000000004f47291a>] __arm64_sys_fcntl+0x84/0xd0
    [<00000000d6856e01>] el0_svc_common+0x80/0xf0
    [<000000009c4bd1df>] el0_svc_handler+0x2c/0x80
    [<00000000b1a0d479>] el0_svc+0x8/0xc
    [<0000000056c62a0f>] 0xffffffffffffffff

This patch removes the original refcount_inc(&lsp->ls_count) that was
paired with the memcpy() lock copying.

Fixes: 7b587e1a ("NFS: use locks_copy_lock() to copy locks.")
Cc: <stable@vger.kernel.org> # 5.0.x-
Cc: NeilBrown <neilb@suse.com>
Signed-off-by: Catalin Marinas <catalin.marinas@arm.com>
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
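
The leak can be reproduced in a small user-space model of the reference counting. All types and function names below are hypothetical stand-ins; the only point is that a copy hook which already takes a reference, combined with an extra explicit increment, leaves the count one too high after the copy is released.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-ins for nfs4_lock_state and struct file_lock. */
struct lock_state_model {
    int ls_count; /* reference count */
};

struct file_lock_model {
    struct lock_state_model *owner;
};

/* Copy hook: the copy takes its own reference on the lock state. */
void model_copy_lock(struct file_lock_model *dst,
                     const struct file_lock_model *src)
{
    assert(src->owner != NULL);
    dst->owner = src->owner;
    dst->owner->ls_count++;
}

/* Release hook: drops the reference taken by model_copy_lock(). */
void model_release_lock(struct file_lock_model *fl)
{
    fl->owner->ls_count--;
    fl->owner = NULL;
}

/*
 * Copy and then release a lock, optionally taking the extra reference that
 * the patch removes. Returns the final count; a value above the initial
 * count means a reference was leaked.
 */
int model_alloc_and_free(struct lock_state_model *lsp, int extra_ref)
{
    struct file_lock_model src = { .owner = lsp };
    struct file_lock_model copy = { .owner = NULL };

    if (extra_ref)
        lsp->ls_count++;          /* the redundant increment */
    model_copy_lock(&copy, &src); /* already takes a reference */
    model_release_lock(&copy);    /* drops exactly one reference */
    return lsp->ls_count;
}
```

Starting from a count of 1, the model returns 1 without the extra increment and 2 with it, which corresponds to the object that kmemleak reports as unreferenced but never freed.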

13 Mar, 2019 · 1 commit

pNFS: Fix a typo in pnfs_update_layout · 400417b0

Committed by Trond Myklebust on Mar 12, 2019

We're supposed to wait for the outstanding layout count to go to zero,
but that got lost somehow.

Fixes: d03360aa ("pNFS: Ensure we return the error if someone...")
Reported-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

03 Mar, 2019 · 1 commit

NFSv4.1: Bump the default callback session slot count to 16 · 067c4696

Committed by Trond Myklebust on Mar 02, 2019

Users can still control this value explicitly using the
max_session_cb_slots module parameter, but let's bump the default
up to 16 for now.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

02 Mar, 2019 · 22 commits

NFS/flexfiles: Clean up mirror DS initialisation · cefa587a

Committed by Trond Myklebust on Feb 28, 2019

Get rid of the redundant parameter and rename the function
ff_layout_mirror_valid() to ff_layout_init_mirror_ds() for clarity.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFS/flexfiles: Remove dead code in ff_layout_mirror_valid() · 29a23909

Committed by Trond Myklebust on Feb 28, 2019

nfs4_ff_alloc_deviceid_node() guarantees that if mirror->mirror_ds is
a valid pointer, then so is mirror->mirror_ds->ds.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFS/flexfile: Simplify nfs4_ff_layout_select_ds_stateid() · 4cbc8a57

Committed by Trond Myklebust on Feb 28, 2019

Pass in a pointer to the mirror rather than forcing another
array access.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFS/flexfile: Simplify nfs4_ff_layout_ds_version() · 626d48b1

Committed by Trond Myklebust on Feb 28, 2019

Pass in a pointer to the mirror rather than forcing another
array access.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFS/flexfiles: Simplify ff_layout_get_ds_cred() · 312cd4cb

Committed by Trond Myklebust on Feb 28, 2019

Pass in a pointer to the mirror rather than forcing another
array access.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFS/flexfiles: Simplify nfs4_ff_find_or_create_ds_client() · 561d6f8a

Committed by Trond Myklebust on Feb 28, 2019

Pass in a pointer to the mirror rather than forcing another
array access.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFS/flexfiles: Simplify nfs4_ff_layout_select_ds_fh() · 749da527

Committed by Trond Myklebust on Feb 28, 2019

Pass in a pointer to the mirror rather than having to retrieve it from
the array and then verify the resulting pointer.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFS/flexfiles: Speed up read failover when DSes are down · 76c66905

Committed by Trond Myklebust on Feb 14, 2019

If we notice that a DS may be down, we should attempt to read from the
other mirrors first before we go back to retry the dead DS.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
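
The selection policy described here can be sketched as a loop that prefers any mirror not marked down and only falls back to the original one when none is left. This is a hypothetical user-space model; the function name and the boolean array are assumptions made for illustration and do not reflect the flexfiles driver's data structures.

```c
#include <stdbool.h>
#include <stddef.h>

/*
 * Hypothetical mirror picker. 'down[i]' is true when mirror i's data
 * server is suspected to be down. Starting at 'start', return the first
 * mirror not marked down; if every mirror is marked down, return 'start'
 * so that the caller retries the original one. 'count' must be non-zero.
 */
size_t pick_read_mirror(const bool *down, size_t count, size_t start)
{
    for (size_t i = 0; i < count; i++) {
        size_t idx = (start + i) % count;

        if (!down[idx])
            return idx;
    }
    return start % count;
}
```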

NFS/flexfiles: Don't invalidate DS deviceids for being unresponsive · 17aaec81

Committed by Trond Myklebust on Feb 26, 2019

If the DS is unresponsive, we want to just mark it as such, while
reporting the errors. If the server later returns the same deviceid
in a new layout, then we don't want to have to look it up again.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFS/flexfiles: Remove bogus checks for invalid deviceids · d082d4b5

Committed by Trond Myklebust on Feb 26, 2019

We already check the deviceids before we start the RPC call.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFS/flexfiles: Avoid unnecessary layout invalidations · 0a156dd5

Committed by Trond Myklebust on Feb 27, 2019

In ff_layout_mirror_valid() we may not want to invalidate the layout
segment despite the call to GETDEVICEINFO failing. The reason is that
a read may still be able to make progress on another mirror.

So instead we let the caller (in this case nfs4_ff_layout_prepare_ds())
decide whether or not it needs to invalidate.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFS/flexfiles: refactor calls to fs4_ff_layout_prepare_ds() · 2444ff27

Committed by Trond Myklebust on Feb 14, 2019

While we may want to skip attempting to connect to a downed mirror
when we're deciding which mirror to select for a read, we do not
want to do so once we've committed to attempting the I/O in
ff_layout_read/write_pagelist(), or ff_layout_initiate_commit().

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFSv4: Handle early exit in layoutget by returning an error · 18c0778a

Committed by Trond Myklebust on Feb 13, 2019

If the LAYOUTGET RPC call exits early without an error, convert it to
EAGAIN.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFS/flexfiles: Send LAYOUTERROR when failing over mirrored reads · f0922a6c

Committed by Trond Myklebust on Feb 10, 2019

When a read to the preferred mirror returns an error, the flexfiles
driver records the error in the inode list and currently marks the
layout for return before failing over the attempted read to the next
mirror.
What we actually want to do is fire off a LAYOUTERROR to notify the
MDS that there is an issue with the preferred mirror, then we fail
over. Only once we've failed to read from all mirrors should we
return the layout.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFSv4.2: Add client support for the generic 'layouterror' RPC call · 3eb86093

Committed by Trond Myklebust on Feb 08, 2019

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFSv4/flexfiles: Abort I/O early if the layout segment was invalidated · a79f194a

Committed by Trond Myklebust on Feb 27, 2019

If a layout segment gets invalidated while a pNFS I/O operation
is queued for transmission, then we ideally want to abort
immediately. This is particularly the case when there is a large
number of I/O related RPCs queued in the RPC layer, and the layout
segment gets invalidated due to an ENOSPC error, or an EACCES (because
the client was fenced). We may end up forced to spam the MDS with a
lot of otherwise unnecessary LAYOUTERRORs after that I/O fails.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFSv4/pnfs: Fix barriers in nfs4_mark_deviceid_unavailable() · 39a5201a

Committed by Trond Myklebust on Feb 26, 2019

Fix the memory barriers in nfs4_mark_deviceid_unavailable() and
nfs4_test_deviceid_unavailable().

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFS/flexfiles: Fix up sparse RCU annotations · 762bb7e9

Committed by Trond Myklebust on Feb 26, 2019

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFSv4/flexfiles: Fix invalid deref in FF_LAYOUT_DEVID_NODE() · 108bb4af

Committed by Trond Myklebust on Feb 26, 2019

If the attempt to instantiate the mirror's layout DS pointer failed,
then that pointer may hold a value of type ERR_PTR(), so we need
to check that before we dereference it.

Fixes: 65990d1a ("pNFS/flexfiles: Fix a deadlock on LAYOUTGET")
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
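
The pattern at issue, a pointer that may carry an encoded error instead of a valid address, can be modelled in user space as follows. The `model_` helpers imitate the kernel's ERR_PTR()/IS_ERR() idiom, and the structures and `mirror_devid()` are hypothetical simplifications, not the flexfiles driver's definitions.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define MODEL_MAX_ERRNO 4095

/* User-space models of the kernel's ERR_PTR()/IS_ERR() idiom. */
static inline void *model_err_ptr(long error)
{
    return (void *)(intptr_t)error;
}

static inline bool model_is_err(const void *ptr)
{
    /* Error values occupy the top MODEL_MAX_ERRNO addresses. */
    return (uintptr_t)ptr >= (uintptr_t)(-MODEL_MAX_ERRNO);
}

/* Hypothetical, simplified mirror structures. */
struct ds_model {
    int devid;
};

struct mirror_model {
    struct ds_model *mirror_ds; /* valid pointer, NULL, or encoded error */
};

/* Return the device id, or NULL if mirror_ds is unset or an error value. */
const int *mirror_devid(const struct mirror_model *mirror)
{
    if (mirror->mirror_ds == NULL || model_is_err(mirror->mirror_ds))
        return NULL;
    return &mirror->mirror_ds->devid;
}
```

A plain NULL check is not enough here, because an encoded error is a non-NULL value that still must not be dereferenced.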

NFS: Add missing encode / decode sequence_maxsz to v4.2 operations · 1a3466ae

Committed by Anna Schumaker on Mar 01, 2019

These really should have been there from the beginning, but we never
noticed because there was enough slack in the RPC request for the extra
bytes. Chuck's recent patch to use au_cslack and au_rslack to compute
buffer size shrank the buffer enough that this became a problem for
SEEK operations on my test client.

Fixes: f4ac1674 ("nfs: Add ALLOCATE support")
Fixes: 2e72448b ("NFS: Add COPY nfs operation")
Fixes: cb95deea ("NFS OFFLOAD_CANCEL xdr")
Fixes: 624bd5b7 ("nfs: Add DEALLOCATE support")
Fixes: 1c6dcbe5 ("NFS: Implement SEEK")
Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>

NFSv4.1: Don't process the sequence op more than once. · c71c46f0

Committed by Trond Myklebust on Mar 01, 2019

Ensure that if we call nfs41_sequence_process() a second time for the
same rpc_task, then we only process the results once.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
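
A process-once guard of this kind can be sketched with a flag on the result structure. The message does not say how the kernel tracks this, so the `done` flag, the structure and the function name below are all assumptions made for illustration.

```c
#include <stdbool.h>

/* Hypothetical, simplified stand-in for per-request sequence results. */
struct seq_result_model {
    bool done;     /* set once the results have been processed */
    int seq_bumps; /* how many times the slot sequence was advanced */
};

/*
 * Process the results at most once. Returns true if this call did the
 * work, false if it was a repeat call and nothing was changed.
 */
bool seq_process_once(struct seq_result_model *res)
{
    if (res->done)
        return false;
    res->done = true;
    res->seq_bumps++;
    return true;
}
```

Calling the function twice on the same structure advances the counter only once, which is the property the commit message asks for.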

NFSv4.1: Reinitialise sequence results before retransmitting a request · c1dffe0b

Committed by Trond Myklebust on Mar 01, 2019

If we have to retransmit a request, we should ensure that we reinitialise
the sequence results structure, since in the event of a signal
we need to treat the request as if it had not been sent.

Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
Cc: stable@vger.kernel.org
