Commits · 59db4a0c102e0de226a3395dbf25ea51bf845937 · openeuler / raspberrypi-kernel

23 Jul, 2010 7 commits

nfsd: move more into nfsd_startup() · 59db4a0c

Authored by J. Bruce Fields on Jul 21, 2010

This is just cleanup: it's harmless to call nfsd_racache_init,
nfsd_init_socks, and nfsd_reset_versions more than once.  But there's no
point to it.
Signed-off-by: J. Bruce Fields <bfields@redhat.com>

59db4a0c

nfsd: just keep single lockd reference for nfsd · ac77efbe

Authored by Jeff Layton on Jul 20, 2010

Right now, nfsd keeps a lockd reference for each socket that it has
open. This is unnecessary and complicates the error handling on
startup and shutdown. Change it to just do a lockd_up when starting
the first nfsd thread and a single lockd_down when taking down the
last nfsd thread. Because of the strange way the sv_count is handled,
this requires an extra flag to tell whether the nfsd_serv holds a
reference for lockd or not.
Signed-off-by: Jeff Layton <jlayton@redhat.com>
Signed-off-by: J. Bruce Fields <bfields@redhat.com>

ac77efbe
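The single-reference scheme described in ac77efbe can be sketched in userspace C. This is only an illustration of the idea, not the kernel code: `fake_lockd_up`, `fake_lockd_down`, `struct fake_nfsd`, and `fake_nfsd_set_threads` are hypothetical names invented here, and the "lockd" is just a counter.

```c
#include <stdbool.h>

/* Hypothetical stand-ins for lockd_up()/lockd_down(); they only count users. */
static int lockd_users;

static int fake_lockd_up(void)    { lockd_users++; return 0; }
static void fake_lockd_down(void) { lockd_users--; }

/* Hypothetical model of the nfsd service state. */
struct fake_nfsd {
    int  nthreads;     /* number of running nfsd threads */
    bool holds_lockd;  /* the extra flag: do we hold our one lockd reference? */
};

/*
 * Change the thread count. The single lockd reference is taken when the
 * count goes from zero to nonzero and dropped when it returns to zero.
 */
static int fake_nfsd_set_threads(struct fake_nfsd *n, int want)
{
    if (want < 0)
        return -1;

    if (want > 0 && !n->holds_lockd) {
        int err = fake_lockd_up();
        if (err)
            return err;        /* nothing to undo: the flag was never set */
        n->holds_lockd = true;
    }

    n->nthreads = want;

    if (want == 0 && n->holds_lockd) {
        fake_lockd_down();
        n->holds_lockd = false;
    }
    return 0;
}
```

Because the flag, rather than the thread count, records whether the reference is held, raising the count from 4 to 8 leaves `lockd_users` at 1, and an error path never drops a reference it did not take.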

nfsd: clean up nfsd_create_serv error handling · 628b3687

Authored by Jeff Layton on Jul 21, 2010

There doesn't seem to be any need to reset the nfssvc_boot time if the
nfsd startup failed.
Signed-off-by: Jeff Layton <jlayton@redhat.com>
Signed-off-by: J. Bruce Fields <bfields@redhat.com>

628b3687

nfsd: fix error handling in __write_ports_addxprt · 0cd14a06

Authored by Jeff Layton on Jul 19, 2010

__write_ports_addxprt calls nfsd_create_serv. That increases the
refcount of nfsd_serv (which is tracked in sv_nrthreads). The service
only decrements the thread count on error, not on success like
__write_ports_addfd does, so using this interface leaves the nfsd
thread count high.

Fix this by having this function call svc_destroy() on error to release
the reference (and possibly to tear down the service) and simply
decrement the refcount without tearing down the service on success.

This makes the sv_nrthreads handling work basically the same in both
__write_ports_addxprt and __write_ports_addfd.
Signed-off-by: Jeff Layton <jlayton@redhat.com>
Signed-off-by: J. Bruce Fields <bfields@redhat.com>

0cd14a06
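The reference handling described in 0cd14a06 can be modelled with a small userspace sketch. It is an assumption-laden toy, not the nfsd code: `struct fake_serv`, `fake_create_serv`, `fake_svc_destroy`, and `fake_addxprt` are hypothetical names, and the `xprt_ok` parameter simply simulates whether creating the transport succeeded.

```c
#include <stdbool.h>

/* Hypothetical model of the service; nrthreads doubles as the refcount. */
struct fake_serv {
    int  nrthreads;
    bool alive;
};

/* Creating (or re-using) the service takes one reference. */
static void fake_create_serv(struct fake_serv *s)
{
    s->alive = true;
    s->nrthreads++;
}

/* Drop one reference and tear the service down if it was the last one. */
static void fake_svc_destroy(struct fake_serv *s)
{
    if (--s->nrthreads == 0)
        s->alive = false;
}

/* Add a transport; xprt_ok simulates whether the transport setup worked. */
static int fake_addxprt(struct fake_serv *s, bool xprt_ok)
{
    fake_create_serv(s);

    if (!xprt_ok) {
        /* Error: release the reference, possibly tearing the service down. */
        fake_svc_destroy(s);
        return -1;
    }

    /* Success: drop our reference but leave the service in place. */
    s->nrthreads--;
    return 0;
}
```

On both paths the count returns to its value before the call, so repeated use of the interface no longer leaves the thread count high. Only the error path may destroy the service.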

nfsd: fix error handling when starting nfsd with rpcbind down · 78a8d7c8

Authored by Jeff Layton on Jul 19, 2010

The refcounting for nfsd is a little goofy. What happens is that we
create the nfsd RPC service, attach sockets to it but don't actually
start the threads until someone writes to the "threads" procfile. To do
this, __write_ports_addfd will create the nfsd service and then will
decrement the refcount when exiting but won't actually destroy the
service.

This is fine when there aren't errors, but when there are this can
cause later attempts to start nfsd to fail. nfsd_serv will be set,
and that causes __write_versions to return EBUSY.

Fix this by calling svc_destroy on nfsd_serv when this function is
going to return an error.
Signed-off-by: Jeff Layton <jlayton@redhat.com>
Signed-off-by: J. Bruce Fields <bfields@redhat.com>

78a8d7c8

nfsd4: fix v4 state shutdown error paths · 4ad9a344

Authored by Jeff Layton on Jul 19, 2010

If someone tries to shut down the laundry_wq while it isn't up it'll
cause an oops.

This can happen because write_ports can create a nfsd_svc before we
really start the nfs server, and we may fail before the server is ever
started.

Also make sure state is shut down on error paths in nfsd_svc().

Use a common global nfsd_up flag instead of nfs4_init, and create common
helper functions for nfsd start/shutdown, as there will be other work
that we want done only when the number of nfsd threads transitions
between zero and nonzero.
Signed-off-by: J. Bruce Fields <bfields@redhat.com>

4ad9a344

nfsd: remove unused assignment from nfsd_link · 55b13354

Authored by J. Bruce Fields on Jul 19, 2010

Trivial cleanup, since "dest" is never used.
Reported-by: Anshul Madan <Anshul.Madan@netapp.com>
Signed-off-by: J. Bruce Fields <bfields@redhat.com>

55b13354

08 Jul, 2010 1 commit

NFSD: Fill in WCC data for REMOVE, RMDIR, MKNOD, and MKDIR · 43a9aa64

Authored by Chuck Lever on Jul 06, 2010

Some well-known NFSv3 clients drop their directory entry caches when
they receive replies with no WCC data.  Without this data, they
employ extra READ, LOOKUP, and GETATTR requests to ensure their
directory entry caches are up to date, causing performance to suffer
needlessly.

In order to return WCC data, our server has to have both the pre-op
and the post-op attribute data on hand when a reply is XDR encoded.
The pre-op data is filled in when the incoming fh is locked, and the
post-op data is filled in when the fh is unlocked.

Unfortunately, for REMOVE, RMDIR, MKNOD, and MKDIR, the directory fh
is not unlocked until well after the reply has been XDR encoded.  This
means that encode_wcc_data() does not have wcc_data for the parent
directory, so none is returned to the client after these operations
complete.

By unlocking the parent directory fh immediately after the internal
operations for each NFS procedure are complete, the post-op data is
filled in before XDR encoding starts, so it can be returned to the
client properly.
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu>

43a9aa64

07 Jul, 2010 1 commit

nfsd4: comment nitpick · 6a85d6c7

Authored by J. Bruce Fields on Jul 06, 2010

Reported-by: "Madan, Anshul" <Anshul.Madan@netapp.com>
Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu>

6a85d6c7

25 Jun, 2010 2 commits

nfsd4: fix delegation recall race use-after-free · cba9ba4b

Authored by J. Bruce Fields on Jun 01, 2010

When the rarely-used callback-connection-changing setclientid occurs
simultaneously with a delegation recall, we rerun the recall by
requeueing it on a workqueue. But we also need to take a reference on
the delegation in that case, since the delegation held by the rpc itself
will be released by the rpc_release callback.
Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu>

cba9ba4b

nfsd4: fix deleg leak on callback error · ac94bf58

Authored by J. Bruce Fields on May 31, 2010

Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu>

ac94bf58

23 Jun, 2010 4 commits

nfsd4: remove some debugging code · ec8acac8

Authored by J. Bruce Fields on Jun 16, 2010

This is overkill.
Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu>

ec8acac8

nfsd: nfs4callback encode_stateid helper function · 9303bbd3

Authored by Benny Halevy on May 25, 2010

To be used also for the pnfs cb_layoutrecall callback.
Signed-off-by: Benny Halevy <bhalevy@panasas.com>
[nfsd4: fix cb_recall encoding]
    "nfsd: nfs4callback encode_stateid helper function" forgot to reserve
    more space after return from the new helper.
Reported-by: Michael Groshans <groshans@citi.umich.edu>
Signed-off-by: Benny Halevy <bhalevy@panasas.com>
Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu>

9303bbd3

nfsd4: translate memory errors to delay, not serverfault · 4731030d

Authored by J. Bruce Fields on Jun 22, 2010

If the server is out of memory, it is better for clients to back off and
retry than to just error out.
Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu>

4731030d
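As a rough illustration of the mapping described in 4731030d, the sketch below translates a kernel-style negative errno into an NFSv4 status. The function name `sketch_errno_to_nfs4` and the `SKETCH_` constants are hypothetical. The numeric values are the NFS4ERR_SERVERFAULT and NFS4ERR_DELAY codes from the NFSv4 protocol specification, and the real nfsd translation table is not reproduced here.

```c
#include <errno.h>

/* Status values as defined by the NFSv4 protocol; names are local to this sketch. */
enum {
    SKETCH_NFS4_OK              = 0,
    SKETCH_NFS4ERR_SERVERFAULT  = 10006,
    SKETCH_NFS4ERR_DELAY        = 10008
};

/* Map a negative errno to an NFSv4 status (only the cases relevant here). */
static int sketch_errno_to_nfs4(int err)
{
    switch (err) {
    case 0:
        return SKETCH_NFS4_OK;
    case -ENOMEM:
        /* Transient condition: ask the client to back off and retry. */
        return SKETCH_NFS4ERR_DELAY;
    default:
        /* Anything unrecognised is still reported as a server fault. */
        return SKETCH_NFS4ERR_SERVERFAULT;
    }
}
```

A client that receives the delay status is expected to retry later, whereas a server fault typically surfaces to the application as a hard error.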

nfsd4: fix session reference count leak · 76407f76

Authored by J. Bruce Fields on Jun 22, 2010

Note the session has to be put() here regardless of what happens to the
client.
Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu>

76407f76

01 Jun, 2010 4 commits

nfsd4: don't bother storing callback reply tag · 68a4b48c

Authored by J. Bruce Fields on May 27, 2010

We don't use this, and probably never will.
Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu>

68a4b48c

nfsd4: fix use of op_share_access · 24a0111e

Authored by J. Bruce Fields on May 18, 2010

NFSv4.1 adds additional flags to the share_access argument of the open
call.  These flags need to be masked out in some of the existing code,
but current code does that inconsistently.
Tested-by: Michael Groshans <groshans@citi.umich.edu>
Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu>

24a0111e

nfsd4: treat more recall errors as failures · 172c85dd

Authored by J. Bruce Fields on May 30, 2010

If a recall fails for some unexpected reason, instead of ignoring it and
treating it like a success, it's safer to treat it as a failure,
preventing further delegation grants and returning CB_PATH_DOWN.

Also put the switches in a (to me) more logical order, with the normal
case first.
Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu>

172c85dd

nfsd4: remove extra put() on callback errors · 378b7d37

Authored by J. Bruce Fields on May 25, 2010

Since rpc_call_async() guarantees that the release method will be called
even on failure, this put is wrong.
Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu>

378b7d37

30 May, 2010 10 commits

ceph: clean up on forwarded aborted mds request · 2a8e5e36

Authored by Sage Weil on May 28, 2010

If an mds request is aborted (timeout, SIGKILL), it is left registered to
keep our state in sync with the mds.  If we get a forward notification,
though, we know the request didn't succeed and we can unregister it
safely.  We were trying to resend it, but then bailing out (and not
unregistering) in __do_request.
Signed-off-by: Sage Weil <sage@newdream.net>

2a8e5e36

ceph: fix leak of osd authorizer · 79494d1b

Authored by Sage Weil on May 27, 2010

Release the ceph_authorizer when releasing osd state.
Signed-off-by: Sage Weil <sage@newdream.net>

79494d1b

ceph: close out mds, osd connections before stopping auth · a922d38f

Authored by Sage Weil on May 29, 2010

The auth module (part of the mon_client) is needed to free any
ceph_authorizer(s) used by the mds and osd connections. Flush the msgr
workqueue before stopping monc to ensure that the destroy_authorizer
auth op is available when those connections are closed out.
Signed-off-by: Sage Weil <sage@newdream.net>

a922d38f

ceph: make lease code DN specific · dd1c9057

Authored by Sage Weil on May 25, 2010

The lease code includes a mask in the CEPH_LOCK_* namespace, but that
namespace is changing, and only one mask (formerly _DN == 1) is used, so
hard-code that value for now.

If we ever extend this code to handle leases over different data types we
can extend it accordingly.
Signed-off-by: Sage Weil <sage@newdream.net>

dd1c9057

fs/ceph: Use ERR_CAST · 7e34bc52

Authored by Julia Lawall on May 22, 2010

Use ERR_CAST(x) rather than ERR_PTR(PTR_ERR(x)).  The former makes the
purpose of the operation clearer; otherwise it looks like a no-op.

In the case of fs/ceph/inode.c, ERR_CAST is not needed, because the type of
the returned value is the same as the type of the enclosing function.

The semantic patch that makes this change is as follows:
(http://coccinelle.lip6.fr/)

// <smpl>
@@
type T;
T x;
identifier f;
@@

T f (...) { <+...
- ERR_PTR(PTR_ERR(x))
+ x
 ...+> }

@@
expression x;
@@

- ERR_PTR(PTR_ERR(x))
+ ERR_CAST(x)
// </smpl>
Signed-off-by: Julia Lawall <julia@diku.dk>
Signed-off-by: Sage Weil <sage@newdream.net>

7e34bc52
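To make the ERR_CAST idiom from 7e34bc52 concrete outside the kernel, the following userspace sketch re-implements the error-pointer helpers under `sketch_` names. The helpers mirror the idea behind the kernel's ERR_PTR, PTR_ERR, IS_ERR, and ERR_CAST but are written here from scratch, and `struct sketch_inode`, `struct sketch_dentry`, `sketch_get_inode`, and `sketch_lookup` are hypothetical.

```c
#include <errno.h>
#include <stdint.h>

/* Error codes are encoded in the top 4095 values of the address space. */
#define SKETCH_MAX_ERRNO 4095

static inline void *sketch_err_ptr(long error)
{
    return (void *)(intptr_t)error;
}

static inline long sketch_ptr_err(const void *ptr)
{
    return (long)(intptr_t)ptr;
}

static inline int sketch_is_err(const void *ptr)
{
    return (uintptr_t)ptr >= (uintptr_t)-SKETCH_MAX_ERRNO;
}

/* Pass an error pointer through to a different pointer type. */
static inline void *sketch_err_cast(const void *ptr)
{
    return (void *)ptr;
}

struct sketch_inode  { int ino; };
struct sketch_dentry { struct sketch_inode *inode; };

/* Returns an inode pointer, or an encoded -ENOENT for unknown numbers. */
static struct sketch_inode *sketch_get_inode(int ino)
{
    static struct sketch_inode root = { 1 };

    if (ino != 1)
        return sketch_err_ptr(-ENOENT);
    return &root;
}

/* Return type differs from the callee's, so the error is cast, not re-encoded. */
static struct sketch_dentry *sketch_lookup(int ino)
{
    static struct sketch_dentry d;
    struct sketch_inode *inode = sketch_get_inode(ino);

    if (sketch_is_err(inode))
        return sketch_err_cast(inode);  /* instead of err_ptr(ptr_err(inode)) */

    d.inode = inode;
    return &d;
}
```

When the caller and callee return the same pointer type, even the cast is unnecessary and the pointer can be returned directly, which is the fs/ceph/inode.c case mentioned in the commit message.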

ceph: renew auth tickets before they expire · a41359fa

Authored by Sage Weil on May 25, 2010

We were only requesting renewal after our tickets expire; do so before
that.  Most of the low-level logic for this was already there; just use
it.
Signed-off-by: Sage Weil <sage@newdream.net>

a41359fa

ceph: do not resend mon requests on auth ticket renewal · 09c4d6a7

Authored by Sage Weil on May 25, 2010

We only want to send pending mon requests when we successfully
authenticate.  If we are already authenticated, like when we renew our
ticket, there is no need to resend pending requests.
Signed-off-by: Sage Weil <sage@newdream.net>

09c4d6a7

ceph: removed duplicated #includes · 984c7690

Authored by Andrea Gelmini on May 23, 2010

fs/ceph/auth.c: linux/slab.h is included more than once.
fs/ceph/super.h: linux/slab.h is included more than once.
Acked-by: Christoph Lameter <cl@linux-foundation.org>
Signed-off-by: Andrea Gelmini <andrea.gelmini@gelma.net>
Signed-off-by: Sage Weil <sage@newdream.net>

984c7690

ceph: avoid possible null dereference · e95e9a7a

Authored by Sage Weil on May 25, 2010

ac->ops may be null; use protocol id in error message instead.
Reported-by: Dan Carpenter <error27@gmail.com>
Signed-off-by: Sage Weil <sage@newdream.net>

e95e9a7a

ceph: make mds requests killable, not interruptible · aa91647c

Authored by Sage Weil on May 24, 2010

The underlying problem is that many mds requests can't be restarted.  For
example, a restarted create() would return -EEXIST if the original request
succeeds.  However, we do not want a hung MDS to hang the client too.  So,
use the _killable wait_for_completion variants to abort on SIGKILL but
nothing else.
Signed-off-by: Sage Weil <sage@newdream.net>

aa91647c

28 May, 2010 11 commits

remove detritus left by "mm: make read_cache_page synchronous" · 49837a80

Authored by Al Viro on May 28, 2010

Gets minix get_dir_page() in sync with its analogs.  Back in 2007
Nick switched read_cache_page() and friends to sync behaviour
(i.e. they wait for the page to get unlocked, check if it's uptodate
and, if it isn't, return ERR_PTR(-EIO) instead) and removed the
duplicate logic from the callers.  In the case of fs/minix/dir.c he
removed only half of that...
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

49837a80

fix fs/sysv s_dirt handling · 4c9002de

Authored by Al Viro on May 27, 2010

Got broken by the ->sync_fs() conversion a year ago; nobody noticed...
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

4c9002de

fat: convert to use the new truncate convention. · 459f6ed3

Authored by npiggin@suse.de on May 27, 2010

Cc: OGAWA Hirofumi <hirofumi@mail.parknet.co.jp>
Cc: Christoph Hellwig <hch@lst.de>
Signed-off-by: Nick Piggin <npiggin@suse.de>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

459f6ed3

ext2: convert to use the new truncate convention. · 737f2e93

Authored by npiggin@suse.de on May 27, 2010

I have also added a comment, marked with XXX, about a possible bug in the
existing ext2 code.

Cc: linux-ext4@vger.kernel.org
Cc: Christoph Hellwig <hch@lst.de>
Signed-off-by: Nick Piggin <npiggin@suse.de>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

737f2e93

fs: convert simple fs to new truncate · 3322e79a

Authored by Nick Piggin on May 27, 2010

Convert simple filesystems: ramfs, configfs, sysfs, block_dev to new truncate
sequence.

Cc: Christoph Hellwig <hch@lst.de>
Signed-off-by: Nick Piggin <npiggin@suse.de>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

3322e79a

kill spurious reference to vmtruncate · 15c6fd97

Authored by npiggin@suse.de on May 27, 2010

Lots of filesystems call vmtruncate despite not implementing the old
->truncate method.  Switch them to use simple_setsize and add some
comments about the truncate code where it seems fitting.
Signed-off-by: Christoph Hellwig <hch@lst.de>
Signed-off-by: Nick Piggin <npiggin@suse.de>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

15c6fd97

fs: introduce new truncate sequence · 7bb46a67

Authored by npiggin@suse.de on May 27, 2010

Introduce a new truncate calling sequence into fs/mm subsystems. Rather than
setattr > vmtruncate > truncate, have filesystems call their truncate sequence
from ->setattr if filesystem specific operations are required. vmtruncate is
deprecated, and truncate_pagecache and inode_newsize_ok helpers introduced
previously should be used.

simple_setattr is introduced for simple in-ram filesystems to implement
the new truncate sequence. Eventually all filesystems should be converted
to implement a setattr, and the default code in notify_change should go
away.

simple_setsize is also introduced to perform just the ATTR_SIZE portion
of simple_setattr (i.e. changing i_size and trimming pagecache).

To implement the new truncate sequence:
- filesystem specific manipulations (e.g. freeing blocks) must be done in
  the setattr method rather than ->truncate.
- vmtruncate cannot be used by core code to trim blocks past i_size in
  the event of write failure after allocation, so this must be performed
  in the fs code.
- convert usage of helpers block_write_begin, nobh_write_begin,
  cont_write_begin, and *blockdev_direct_IO* to use _newtrunc postfixed
  variants. These avoid calling vmtruncate to trim blocks (see previous).
- inode_setattr should not be used. generic_setattr is a new function
  to be used to copy simple attributes into the generic inode.
- make use of the better opportunity to handle errors with the new sequence.

The big problem with the previous calling sequence: the filesystem is not
called until i_size has already changed.  This means it is not allowed to
fail the call, and also it does not know what the previous i_size was. Also,
generic code calling vmtruncate to truncate allocated blocks in case of error
had no good way to return a meaningful error (or, for example, atomically
handle block deallocation).

Cc: Christoph Hellwig <hch@lst.de>
Acked-by: Jan Kara <jack@suse.cz>
Signed-off-by: Nick Piggin <npiggin@suse.de>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

7bb46a67
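The motivation given in 7bb46a67 (the filesystem should see the size change before i_size is updated, know the old size, and be able to fail) can be illustrated with a toy userspace model. Everything below is hypothetical: `struct sketch_inode`, its fields, the 1 KB block size, and the `sketch_` functions are invented for this sketch and do not correspond to the kernel helpers named in the commit message.

```c
#include <errno.h>

/* Hypothetical in-memory inode for the sketch. */
struct sketch_inode {
    long i_size;    /* current file size in bytes */
    long blocks;    /* allocated 1 KB blocks (assumed block size) */
    long max_size;  /* assumed per-filesystem size limit */
};

/* Validate the requested size before anything is modified. */
static int sketch_newsize_ok(const struct sketch_inode *inode, long newsize)
{
    if (newsize < 0)
        return -EINVAL;
    if (newsize > inode->max_size)
        return -EFBIG;
    return 0;
}

/* Filesystem-specific step: release blocks beyond the new size. */
static int sketch_free_blocks(struct sketch_inode *inode, long newsize)
{
    long needed = (newsize + 1023) / 1024;

    if (needed < inode->blocks)
        inode->blocks = needed;
    return 0;
}

/* Size change driven from a setattr-style entry point. */
static int sketch_setattr_size(struct sketch_inode *inode, long newsize)
{
    long oldsize = inode->i_size;  /* still known to the filesystem */
    int err;

    err = sketch_newsize_ok(inode, newsize);
    if (err)
        return err;                /* i_size has not been touched */

    if (newsize < oldsize) {
        err = sketch_free_blocks(inode, newsize);
        if (err)
            return err;            /* failure can be reported to the caller */
    }

    inode->i_size = newsize;       /* commit the new size last */
    return 0;
}
```

In this model a rejected request leaves `i_size` untouched, which is the property the old setattr > vmtruncate > truncate ordering could not offer.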

fs/super: fix kernel-doc warning · 7000d3c4

Authored by Randy Dunlap on May 24, 2010

Fix fs/super.c kernel-doc warning and function notation:
Warning(fs/super.c:957): No description found for parameter 'sb'
Signed-off-by: Randy Dunlap <randy.dunlap@oracle.com>
Cc: Alexander Viro <viro@zeniv.linux.org.uk>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

7000d3c4

fs/minix: bugfix, number of indirect block ptrs per block depends on block size · 0ab7620a

Authored by Erik van der Kouwe on May 26, 2010

The MINIX filesystem driver used a constant number of indirect block
pointers in an indirect block. This worked only for filesystems with 1 KB
blocks, while the MINIX default block size is now 4 KB. As a consequence,
large files were read incorrectly on such filesystems and writing a
large file would cause the filesystem to become corrupted. This patch
computes the number of indirect block pointers based on the block size,
making the driver work for each block size.

I would like to thank Feiran Zheng ('Fam') for pointing out the cause
of the corruption.
Signed-off-by: Erik van der Kouwe <vdkouwe@cs.vu.nl>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

0ab7620a
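The arithmetic behind 0ab7620a is small enough to sketch directly. The example assumes 4-byte block pointers, which is an assumption of this sketch and not stated in the commit message. `sketch_indir_count` and `sketch_double_indir_blocks` are hypothetical helper names, not the driver's.

```c
#include <stdint.h>

/* Assumed on-disk block pointer width for this sketch: 32 bits. */
typedef uint32_t sketch_block_t;

/* Number of block pointers that fit in one indirect block. */
static unsigned long sketch_indir_count(unsigned long block_size)
{
    return block_size / sizeof(sketch_block_t);
}

/* Number of data blocks reachable through one double-indirect block. */
static unsigned long sketch_double_indir_blocks(unsigned long block_size)
{
    unsigned long n = sketch_indir_count(block_size);

    return n * n;
}
```

Under that assumption a 1 KB block holds 256 pointers and a 4 KB block holds 1024. A driver that always uses the 1 KB figure would resolve any block beyond the first 256 entries of an indirect block to the wrong place on a 4 KB filesystem, which matches the misreads and corruption the commit describes.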

rename the generic fsync implementations · 1b061d92

Authored by Christoph Hellwig on May 26, 2010

We don't name our generic fsync implementations very well currently.
The no-op implementation for in-memory filesystems is currently called
simple_sync_file, which doesn't make too much sense to start with, and
the generic one for simple filesystems is called simple_fsync,
which can lead to some confusion.

This patch renames the generic file fsync method to generic_file_fsync
to match the other generic_file_* routines it is supposed to be used
with, and the no-op implementation to noop_fsync to make it obvious
what to expect.  In addition add some documentation for both methods.
Signed-off-by: Christoph Hellwig <hch@lst.de>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

1b061d92

drop unused dentry argument to ->fsync · 7ea80859

Authored by Christoph Hellwig on May 26, 2010

Signed-off-by: Christoph Hellwig <hch@lst.de>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

7ea80859