提交 · c21443c2c792cd9b463646d982b0fe48aa6feb0f · openeuler / raspberrypi-kernel

12 2月, 2013 5 次提交

NFSv4: Fix a reboot recovery race when opening a file · c21443c2

由 Trond Myklebust 提交于 2月 07, 2013

If the server reboots after it has replied to our OPEN, but before we
call nfs4_opendata_to_nfs4_state(), then the reboot recovery thread
will not see a stateid for this open, and so will fail to recover it.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

c21443c2

NFSv4: Ensure delegation recall and byte range lock removal don't conflict · 65b62a29

由 Trond Myklebust 提交于 2月 07, 2013

Add a mutex to the struct nfs4_state_owner to ensure that delegation
recall doesn't conflict with byte range lock removal.

Note that we nest the new mutex _outside_ the state manager reclaim
protection (nfsi->rwsem) in order to avoid deadlocks.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

65b62a29

NFSv4: Fix up the return values of nfs4_open_delegation_recall · 37380e42

由 Trond Myklebust 提交于 2月 05, 2013

Adjust the return values so that they return EAGAIN to the caller in
cases where we might want to retry the delegation recall after
the state recovery has run.
Note that we can't wait and retry in this routine, because the caller
may be the state manager thread.

If delegation recall fails due to a session or reboot related issue,
also ensure that we mark the stateid as delegated so that
nfs_delegation_claim_opens can find it again later.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

37380e42

NFSv4.1: Don't lose locks when a server reboots during delegation return · d25be546

由 Trond Myklebust 提交于 2月 05, 2013

If the server reboots while we are converting a delegation into
OPEN/LOCK stateids as part of a delegation return, the current code
will simply exit with an error. This causes us to lose both
delegation state and locking state (i.e. locking atomicity).

Deal with this by exposing the delegation stateid during delegation
return, so that we can recover the delegation, and then resume
open/lock recovery.

Note that not having to hold the nfs_inode->rwsem across the
calls to nfs_delegation_claim_opens() also fixes a deadlock against
the NFSv4.1 reboot recovery code.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

d25be546

NFSv4.1: Prevent deadlocks between state recovery and file locking · 9a99af49

由 Trond Myklebust 提交于 2月 04, 2013

We currently have a deadlock in which the state recovery thread
ends up blocking due to one of the locks which it is trying to
recover holding the nfs_inode->rwsem.
The situation is as follows: the state recovery thread is
scheduled in order to recover from a reboot. It immediately
drains the session, forcing all ordinary NFSv4.1 calls to
nfs41_setup_sequence() to be put to sleep.  This includes the
file locking process that holds the nfs_inode->rwsem.
When the thread gets to nfs4_reclaim_locks(), it tries to
grab a write lock on nfs_inode->rwsem, and boom...

Fix is to have the lock drop the nfs_inode->rwsem while it is
doing RPC calls. We use a sequence lock in order to signal to
the locking process whether or not a state recovery thread has
run on that inode, in which case it should retry the lock.
Reported-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

9a99af49

01 2月, 2013 1 次提交

Revert "NFS: add nfs_sb_deactive_async to avoid deadlock" · 322b2b90

由 Trond Myklebust 提交于 1月 11, 2013

This reverts commit 324d003b.

The deadlock turned out to be caused by a workqueue limitation that has
now been worked around in the RPC code (see comment in rpc_free_task).
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

322b2b90

04 1月, 2013 1 次提交

NFS: Fix access to suid/sgid executables · f8d9a897

由 Weston Andros Adamson 提交于 1月 03, 2013

nfs_open_permission_mask() should only check MAY_EXEC for files that
are opened with __FMODE_EXEC.

Also fix NFSv4 access-in-open path in a similar way -- openflags must be
used because fmode will not always have FMODE_EXEC set.

This patch fixes https://bugzilla.kernel.org/show_bug.cgi?id=49101Signed-off-by: NWeston Andros Adamson <dros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
Cc: stable@vger.kernel.org

f8d9a897

21 12月, 2012 1 次提交

NFS: Use FS-Cache invalidation · de242c0b

由 David Howells 提交于 12月 20, 2012

Use the new FS-Cache invalidation facility from NFS to deal with foreign
changes being detected on the server rather than attempting to retire the old
cookie and get a new one.

The problem with the old method was that NFS did not wait for all outstanding
storage and retrieval ops on the cache to complete.  There was no automatic
wait between the calls to ->readpages() and calls to invalidate_inode_pages2()
as the latter can only wait on locked pages that have been added to the
pagecache (which they haven't yet on entry to ->readpages()).

This was leading to oopses like the one below when an outstanding read got cut
off from its cookie by a premature release.

BUG: unable to handle kernel NULL pointer dereference at 00000000000000a8
IP: [<ffffffffa0075118>] __fscache_read_or_alloc_pages+0x1dd/0x315 [fscache]
PGD 15889067 PUD 15890067 PMD 0
Oops: 0000 [#1] SMP
CPU 0
Modules linked in: cachefiles nfs fscache auth_rpcgss nfs_acl lockd sunrpc

Pid: 4544, comm: tar Not tainted 3.1.0-rc4-fsdevel+ #1064                  /DG965RY
RIP: 0010:[<ffffffffa0075118>]  [<ffffffffa0075118>] __fscache_read_or_alloc_pages+0x1dd/0x315 [fscache]
RSP: 0018:ffff8800158799e8  EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff8800070d41e0 RCX: ffff8800083dc1b0
RDX: 0000000000000000 RSI: ffff880015879960 RDI: ffff88003e627b90
RBP: ffff880015879a28 R08: 0000000000000002 R09: 0000000000000002
R10: 0000000000000001 R11: ffff880015879950 R12: ffff880015879aa4
R13: 0000000000000000 R14: ffff8800083dc158 R15: ffff880015879be8
FS:  00007f671e9d87c0(0000) GS:ffff88003bc00000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 00000000000000a8 CR3: 000000001587f000 CR4: 00000000000006f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process tar (pid: 4544, threadinfo ffff880015878000, task ffff880015875040)
Stack:
 ffffffffa00b1759 ffff8800070dc158 ffff8800000213da ffff88002a286508
 ffff880015879aa4 ffff880015879be8 0000000000000001 ffff88002a2866e8
 ffff880015879a88 ffffffffa00b20be 00000000000200da ffff880015875040
Call Trace:
 [<ffffffffa00b1759>] ? nfs_fscache_wait_bit+0xd/0xd [nfs]
 [<ffffffffa00b20be>] __nfs_readpages_from_fscache+0x7e/0x13f [nfs]
 [<ffffffff81095fe7>] ? __alloc_pages_nodemask+0x156/0x662
 [<ffffffffa0098763>] nfs_readpages+0xee/0x187 [nfs]
 [<ffffffff81098a5e>] __do_page_cache_readahead+0x1be/0x267
 [<ffffffff81098942>] ? __do_page_cache_readahead+0xa2/0x267
 [<ffffffff81098d7b>] ra_submit+0x1c/0x20
 [<ffffffff8109900a>] ondemand_readahead+0x28b/0x29a
 [<ffffffff810990ce>] page_cache_sync_readahead+0x38/0x3a
 [<ffffffff81091d8a>] generic_file_aio_read+0x2ab/0x67e
 [<ffffffffa008cfbe>] nfs_file_read+0xa4/0xc9 [nfs]
 [<ffffffff810c22c4>] do_sync_read+0xba/0xfa
 [<ffffffff810a62c9>] ? might_fault+0x4e/0x9e
 [<ffffffff81177a47>] ? security_file_permission+0x7b/0x84
 [<ffffffff810c25dd>] ? rw_verify_area+0xab/0xc8
 [<ffffffff810c29a4>] vfs_read+0xaa/0x13a
 [<ffffffff810c2a79>] sys_read+0x45/0x6c
 [<ffffffff813ac37b>] system_call_fastpath+0x16/0x1b
Reported-by: NMark Moseley <moseleymark@gmail.com>
Signed-off-by: NDavid Howells <dhowells@redhat.com>

de242c0b

16 12月, 2012 3 次提交

NFSv4.1: Deal effectively with interrupted RPC calls. · ac20d163

由 Trond Myklebust 提交于 12月 15, 2012

If an RPC call is interrupted, assume that the server hasn't processed
the RPC call so that the next time we use the slot, we know that if we
get a NFS4ERR_SEQ_MISORDERED or NFS4ERR_SEQ_FALSE_RETRY, we just have
to bump the sequence number.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

ac20d163

NFSv4.1: Move the RPC timestamp out of the slot. · 8e63b6a8

由 Trond Myklebust 提交于 12月 15, 2012

Shave a few bytes off the slot table size by moving the RPC timestamp
into the sequence results.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

8e63b6a8

NFSv4.1: Try to deal with NFS4ERR_SEQ_MISORDERED. · e8794440

由 Trond Myklebust 提交于 12月 15, 2012

If the server returns NFS4ERR_SEQ_MISORDERED, it could be a sign
that the slot was retired at some point. Retry the attempt after
reinitialising the slot sequence number to 1.

Also add a handler for NFS4ERR_SEQ_FALSE_RETRY. Just bump the slot
sequence number and retry...
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

e8794440

13 12月, 2012 1 次提交

SUNRPC handle EKEYEXPIRED in call_refreshresult · eb96d5c9

由 Andy Adamson 提交于 11月 27, 2012

Currently, when an RPCSEC_GSS context has expired or is non-existent
and the users (Kerberos) credentials have also expired or are non-existent,
the client receives the -EKEYEXPIRED error and tries to refresh the context
forever. If an application is performing I/O, or other work against the share,
the application hangs, and the user is not prompted to refresh/establish their
credentials. This can result in a denial of service for other users.

Users are expected to manage their Kerberos credential lifetimes to mitigate
this issue.

Move the -EKEYEXPIRED handling into the RPC layer. Try tk_cred_retry number
of times to refresh the gss_context, and then return -EACCES to the application.
Signed-off-by: NAndy Adamson <andros@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

eb96d5c9

11 12月, 2012 2 次提交

NFSv4.1: Handle NFS4ERR_BADSLOT errors correctly · 85563073

由 Trond Myklebust 提交于 12月 11, 2012

Most (all) NFS4ERR_BADSLOT errors are due to the client failing to
respect the server's sr_highest_slotid limit. This mainly happens
due to reordered RPC requests.
The way to handle it is simply to drop the slot that we're using,
and retry using the new highest_slotid limits.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

85563073

NFSv4: Check for buffer length in __nfs4_get_acl_uncached · 7d3e91a8

由 Sven Wegener 提交于 12月 08, 2012

Commit 1f1ea6c2 "NFSv4: Fix buffer overflow checking in
__nfs4_get_acl_uncached" accidently dropped the checking for too small
result buffer length.

If someone uses getxattr on "system.nfs4_acl" on an NFSv4 mount
supporting ACLs, the ACL has not been cached and the buffer suplied is
too short, we still copy the complete ACL, resulting in kernel and user
space memory corruption.
Signed-off-by: NSven Wegener <sven.wegener@stealer.net>
Cc: stable@kernel.org
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

7d3e91a8

06 12月, 2012 20 次提交

NFSv4.1: Ensure smooth handover of slots from one task to the next waiting · b75ad4cd

由 Trond Myklebust 提交于 11月 29, 2012

Currently, we see a lot of bouncing for the value of highest_used_slotid
due to the fact that slots are getting freed, instead of getting instantly
transmitted to the next waiting task.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

b75ad4cd

NFSv4.1: Don't mess with task priorities in nfs41_setup_sequence · 1e1093c7

由 Trond Myklebust 提交于 11月 01, 2012

We want to preserve the rpc_task priority for things like writebacks,
that may have differing levels of urgency.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

1e1093c7

NFS: Remove _nfs_call_sync_session · 104287cd

由 Bryan Schumaker 提交于 11月 12, 2012

All it does is pass its arguments through to another function.  Let's
cut out the middleman...
Signed-off-by: NBryan Schumaker <bjschuma@netapp.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

104287cd

NFSv4: Clean up handling of privileged operations · 8fe72bac

由 Trond Myklebust 提交于 10月 29, 2012

Privileged rpc calls are those that are run by the state recovery thread,
in cases where we're trying to recover the system after a server reboot
or a network partition. In those cases, we want to fence off all other
rpc calls (see nfs4_begin_drain_session()) so that they don't end up
using stateids or clientids that are in the process of being recovered.

Prior to this patch, we had to set up special callback functions in
order to declare an rpc call as being privileged.
By adding a new field to the sequence arguments, this patch simplifies
things considerably, and allows us to declare the rpc call as privileged
before it is run.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

8fe72bac

NFSv4.1: Remove the 'FIFO' behaviour for nfs41_setup_sequence · 275e7e20

由 Trond Myklebust 提交于 11月 01, 2012

It is more important to preserve the task priority behaviour, which ensures
that things like reclaim writes take precedence over background and kupdate
writes.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

275e7e20

NFSv4.1: Clean up nfs41_setup_sequence · 7b939a3f

由 Trond Myklebust 提交于 11月 01, 2012

Move all the sleep-and-exit cases into a single section of code.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

7b939a3f

NFSv4: Simplify the NFSv4/v4.1 synchronous call switch · fd0c0953

由 Trond Myklebust 提交于 11月 01, 2012

We shouldn't need to pass the 'cache_reply' parameter if we
initialise the sequence_args/sequence_res in the caller.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

fd0c0953

NFSv4.1: Simplify the sequence setup · d9afbd1b

由 Trond Myklebust 提交于 10月 22, 2012

Nobody calls nfs4_setup_sequence or nfs41_setup_sequence without
also calling rpc_call_start() on success. This commit therefore
folds the rpc_call_start call into nfs41_setup_sequence().
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

d9afbd1b

NFSv4.1: Use nfs41_setup_sequence where appropriate · 6ba7db34

由 Trond Myklebust 提交于 10月 22, 2012

There is no point in using nfs4_setup_sequence or nfs4_sequence_done
in pure NFSv4.1 functions. We already know that those have sessions...
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

6ba7db34

NFSv4.1: Ping server when our session table limits are too high · c10e4498

由 Trond Myklebust 提交于 11月 26, 2012

If the server requests a lower target_highest_slotid, then ensure
that we ping it with at least one RPC call containing an
appropriate SEQUENCE op. This ensures that the server won't need to
send a recall callback in order to shrink the slot table.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

c10e4498

NFSv4.1: Cleanup move session slot management to fs/nfs/nfs4session.c · 73e39aaa

由 Trond Myklebust 提交于 11月 26, 2012

NFSv4.1 session management is getting complex enough to deserve
a separate file.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

73e39aaa

NFSv4: Move nfs4_wait_clnt_recover and nfs4_client_recover_expired_lease · 33021279

由 Trond Myklebust 提交于 11月 26, 2012

nfs4_wait_clnt_recover and nfs4_client_recover_expired_lease are both
generic state related functions. As such, they belong in nfs4state.c,
and not nfs4proc.c
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

33021279

NFSv4.1: Clean up session draining · 5d63360d

由 Trond Myklebust 提交于 11月 23, 2012

Coalesce nfs4_check_drain_bc_complete and nfs4_check_drain_fc_complete
into a single function that can be called when the slot table is known
to be empty, then change nfs4_callback_free_slot() and nfs4_free_slot()
to use it.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

5d63360d

NFSv4.1: If slot allocation fails due to OOM, retry more quickly · 69d206b5

由 Trond Myklebust 提交于 11月 22, 2012

If the NFSv4.1 session slot allocation fails due to an ENOMEM condition,
then set the task->tk_timeout to 1/4 second to ensure that we do retry
the slot allocation more quickly.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

69d206b5

NFSv4.1: Remove the state manager code to resize the slot table · afa29610

由 Trond Myklebust 提交于 11月 20, 2012

The state manager no longer needs any special machinery to stop the
session flow and resize the slot table. It is all done on the fly by
the SEQUENCE op code now.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

afa29610

NFSv4.1: Allow SEQUENCE to resize the slot table on the fly · 87dda67e

由 Trond Myklebust 提交于 11月 20, 2012

Instead of an array of slots, use a singly linked list of slots that
can be dynamically appended to or shrunk.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

87dda67e

NFSv4.1: Support dynamic resizing of the session slot table · 97e548a9

由 Trond Myklebust 提交于 11月 20, 2012

Allow the server to control the size of the session slot table
by adjusting the value of sr_target_max_slots in the reply to the
SEQUENCE operation.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

97e548a9

NFSv4.1: Fix nfs4_callback_recallslot to work with dynamic slot allocation · ce008c4b

由 Trond Myklebust 提交于 11月 20, 2012

Ensure that the NFSv4.1 CB_RECALL_SLOT callback updates the slot table
target max slotid safely.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

ce008c4b

NFSv4.1: Reset the sequence number for slots that have been deallocated · da0507b7

由 Trond Myklebust 提交于 11月 20, 2012

When the server tells us that it is dynamically resizing the session
replay cache, we should reset the sequence number for those slots
that have been deallocated.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

da0507b7

NFSv4.1: Ensure that the client tracks the server target_highest_slotid · 464ee9f9

由 Trond Myklebust 提交于 11月 20, 2012

Dynamic slot allocation in NFSv4.1 depends on the client being able to
track the server's target value for the highest slotid in the
slot table.  See the reference in Section 2.10.6.1 of RFC5661.

To avoid ordering problems in the case where 2 SEQUENCE replies contain
conflicting updates to this target value, we also introduce a generation
counter, to track whether or not an RPC containing a SEQUENCE operation
was launched before or after the last update.

Also rename the nfs4_slot_table target_max_slots field to
'target_highest_slotid' to avoid confusion with a slot
table size or number of slots.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

464ee9f9

27 11月, 2012 5 次提交

NFSv4.1: Clean up nfs4_free_slot · f4af6e2a

由 Trond Myklebust 提交于 11月 20, 2012

Change the argument to take the pointer to the slot, instead of
just the slotid.

We know that the new value of highest_used_slot must be less than
the current value. No need to scan the whole table.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

f4af6e2a

NFSv4.1: Simplify slot allocation · 2dc03b7f

由 Trond Myklebust 提交于 11月 16, 2012

Clean up the NFSv4.1 slot allocation by replacing nfs_find_slot() with
a function nfs_alloc_slot() that returns a pointer to the nfs4_slot
instead of an offset into the slot table.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

2dc03b7f

NFSv4.1: Simplify struct nfs4_sequence_args too · 2b2fa717

由 Trond Myklebust 提交于 11月 16, 2012

Replace the session pointer + slotid with a pointer to the
allocated slot.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

2b2fa717

NFSv4.1: Label each entry in the session slot tables with its slot number · df2fabff

由 Trond Myklebust 提交于 11月 16, 2012

Instead of doing slot table pointer gymnastics every time we want to
know which slot we're using.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

df2fabff

NFSv4.1: Shrink struct nfs4_sequence_res by moving the session pointer · e3725ec0

由 Trond Myklebust 提交于 11月 16, 2012

Move the session pointer into the slot table, then have struct nfs4_slot
point to that slot table.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

e3725ec0

21 11月, 2012 1 次提交
- T
  NFSv4.1: Shrink struct nfs4_sequence_res by moving sr_renewal_time · 933602e3
  由 Trond Myklebust 提交于 11月 16, 2012
```
Store the renewal time inside the session slot instead.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
  933602e3