Commits · 3925675cb37cc9c3fd1d3f56ce0fa729f995f863 · openeuler / Kernel

06 Dec, 2006 17 commits

NFS: Fix up the dirty page accounting · 3925675c

Committed by Trond Myklebust on Dec 05, 2006

There is now no reason to account for the dirty pages in the NFS code,
since the VM code will now do it for us via __set_page_dirty_nobuffers(),
and set_page_writeback().

We still need to keep the accounting of stable writes, though.
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

3925675c

NFS: Ensure the inode is marked as dirty if we break out of nfs_wb_all() · e507d9eb

Committed by Trond Myklebust on Dec 05, 2006

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

e507d9eb

NFS: Ensure we only call set_page_writeback() under the page lock · 61822ab5

Committed by Trond Myklebust on Dec 05, 2006

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

61822ab5

NFS: Make nfs_updatepage() mark the page as dirty. · e261f51f

Committed by Trond Myklebust on Dec 05, 2006

This will ensure that we can call set_page_writeback() from within
nfs_writepage(), which is always called with the page lock set.
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

e261f51f

NFS: Ensure that nfs_wb_page() calls writepage when necessary. · 4d770ccf

Committed by Trond Myklebust on Dec 05, 2006

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

4d770ccf

NFS: Add nfs_set_page_dirty() · 1a54533e

Committed by Trond Myklebust on Dec 05, 2006

We will want to allow nfs_writepage() to distinguish between pages that
have been marked as dirty by the VM, and those that have been marked as
dirty by nfs_updatepage().
In the former case, the entire page will want to be written out, and so any
requests that were pending need to be flushed out first.
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

1a54533e

NFS: Remove nfs_writepage_sync() · 200baa21

Committed by Trond Myklebust on Dec 05, 2006

Maintaining two parallel ways of doing synchronous writes is rather
pointless. This patch gets rid of the legacy nfs_writepage_sync(), and
replaces it with the faster asynchronous writes.
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

200baa21

NFS: More cleanups of fs/nfs/write.c · e21195a7

Committed by Trond Myklebust on Dec 05, 2006

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

e21195a7

NFS: Remove call to igrab() from nfs_writepage() · 87a4ce16

Committed by Trond Myklebust on Dec 05, 2006

We always ensure that the nfs_open_context holds a reference to the dentry,
so the test in nfs_writepage() for whether or not the inode is referenced
is redundant.
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

87a4ce16

NFS: Cleanup: add common helper nfs_page_length() · 49a70f27

Committed by Trond Myklebust on Dec 05, 2006

Clean up a lot of ad-hoc page length calculations in fs/nfs/write.c
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

49a70f27
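
The kind of calculation that a common page-length helper centralises can be sketched in userspace as follows. This is only an illustration under stated assumptions: the name `page_length_sketch`, the fixed 4 KiB page size, and the plain integer arguments are hypothetical stand-ins, whereas the kernel helper works on a `struct page` and the inode size.

```c
#include <stdint.h>

#define SKETCH_PAGE_SHIFT 12u                        /* assumption: 4 KiB pages */
#define SKETCH_PAGE_SIZE  (1u << SKETCH_PAGE_SHIFT)

/*
 * Number of bytes of page `index` that lie inside a file of `i_size` bytes.
 * Hypothetical userspace stand-in, not the kernel function itself.
 */
static unsigned int page_length_sketch(uint64_t i_size, uint64_t index)
{
	if (i_size > 0) {
		/* Index of the page that holds the last byte of the file. */
		uint64_t end_index = (i_size - 1) >> SKETCH_PAGE_SHIFT;

		if (index < end_index)
			return SKETCH_PAGE_SIZE;     /* page is entirely inside the file */
		if (index == end_index)          /* last page: possibly partial */
			return (unsigned int)((i_size - 1) & (SKETCH_PAGE_SIZE - 1)) + 1;
	}
	return 0;                                /* empty file or page beyond EOF */
}
```

With a 5000-byte file, page 0 is fully valid and page 1 holds the remaining 904 bytes; any later page has length 0.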

NFS: Store pointer to the nfs_page in page->private · 277459d2

Committed by Trond Myklebust on Dec 05, 2006

This will allow fast lookup of the nfs_page from the struct page instead of
having to search the radix tree.
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

277459d2
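
The idea of keeping a back-pointer in the page so that the request can be found without a tree search can be sketched in userspace like this. All names here (`demo_page`, `demo_request`, `private_data` and the three helpers) are hypothetical stand-ins rather than the kernel's structures, and locking and reference counting are left out entirely.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for the per-page write request (the nfs_page). */
struct demo_request {
	unsigned int offset;
	unsigned int bytes;
};

/* Hypothetical stand-in for struct page with an opaque private word. */
struct demo_page {
	uint64_t  index;
	uintptr_t private_data;
};

/* Remember the request in the page itself. */
static void demo_attach_request(struct demo_page *page, struct demo_request *req)
{
	page->private_data = (uintptr_t)req;
}

/* Constant-time lookup: no search structure is consulted. */
static struct demo_request *demo_page_request(const struct demo_page *page)
{
	return (struct demo_request *)page->private_data;
}

/* Drop the back-pointer once the request is gone. */
static void demo_detach_request(struct demo_page *page)
{
	page->private_data = 0;
}
```

A lookup on a page with nothing attached simply yields `NULL`, which is the case where no request is pending for that page.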

NFS: cleanup of nfs_sync_inode_wait() · 1c75950b

Committed by Trond Myklebust on Oct 09, 2006

Allow callers to directly pass it a struct writeback_control.
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

1c75950b

NFS: Clean up nfs_scan_dirty() · 3f442547

Committed by Trond Myklebust on Sep 17, 2006

Pass down struct writeback_control.
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

3f442547

NFS: Clean up nfs_flush_inode() · 28c6925f

Committed by Trond Myklebust on Sep 16, 2006

Make it take a struct writepages argument, and rename to
nfs_flush_mapping().
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

28c6925f

NFS: Remove use of the Big Kernel Lock around calls to rpc_execute. · a99b71c9

Committed by Frank Filz on Oct 17, 2006

Remove use of the Big Kernel Lock around calls to rpc_execute.
Signed-off-by: Frank Filz <ffilz@us.ibm.com>
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

a99b71c9

NFS: Fix nfs_sync_inode_wait(FLUSH_INVALIDATE) · e8e058e8

Committed by Trond Myklebust on Nov 15, 2006

Currently nfs_sync_inode_wait() will fail to loop correctly when we call
nfs_sync_inode_wait with the FLUSH_INVALIDATE argument.
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

e8e058e8

SUNRPC: Fix a potential race in rpc_wake_up_task() · 8aca67f0

Committed by Trond Myklebust on Nov 13, 2006

Use RCU to ensure that we can safely call rpc_finish_wakeup after we've
called __rpc_do_wake_up_task. If not, there is a theoretical race, in which
the rpc_task finishes executing, and gets freed first.
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

8aca67f0

21 Oct, 2006 2 commits

[PATCH] NFS: Fix oops in nfs_cancel_commit_list · b6dff26a

Committed by Trond Myklebust on Oct 19, 2006

Fix two bugs:
 - nfs_inode_remove_request will call nfs_clear_request, so we cannot
   reference req->wb_page after it. Move the call to dec_zone_page_state so
   that it occurs while req->wb_page is still valid.
 - Calling nfs_clear_page_writeback is unnecessary since the radix tree
   tags will have been cleared by the call to nfs_inode_remove_request.
   Replace with a simple call to nfs_unlock_request.
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

b6dff26a

[PATCH] separate bdi congestion functions from queue congestion functions · 3fcfab16

Committed by Andrew Morton on Oct 19, 2006

Separate out the concept of "queue congestion" from "backing-dev congestion".
Congestion is a backing-dev concept, not a queue concept.

The blk_* congestion functions are retained, as wrappers around the core
backing-dev congestion functions.

This proper layering is needed so that NFS can cleanly use the congestion
functions, and so that CONFIG_BLOCK=n actually links.

Cc: "Thomas Maier" <balagi@justmail.de>
Cc: "Jens Axboe" <jens.axboe@oracle.com>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Cc: David Howells <dhowells@redhat.com>
Cc: Peter Osterlund <petero2@telia.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

3fcfab16
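
The layering described in this commit, with queue-level functions kept as thin wrappers around backing-dev functions, can be sketched in userspace as below. Every identifier with a `demo_` prefix is hypothetical and the state is reduced to two bits; the point is only that a non-block user such as NFS can call the backing-dev layer directly without needing a queue.

```c
#include <stdbool.h>

/* Hypothetical direction selector: one congestion bit per direction. */
enum demo_rw { DEMO_READ = 0, DEMO_WRITE = 1 };

/* Stand-in for the backing device: owns the congestion state. */
struct demo_bdi {
	unsigned long state;
};

/* Stand-in for a block request queue: merely embeds a backing device. */
struct demo_queue {
	struct demo_bdi backing_dev_info;
};

/* Core layer: operates on the backing device only. */
static void demo_bdi_set_congested(struct demo_bdi *bdi, enum demo_rw rw)
{
	bdi->state |= 1UL << rw;
}

static void demo_bdi_clear_congested(struct demo_bdi *bdi, enum demo_rw rw)
{
	bdi->state &= ~(1UL << rw);
}

static bool demo_bdi_congested(const struct demo_bdi *bdi, enum demo_rw rw)
{
	return ((bdi->state >> rw) & 1UL) != 0;
}

/* Block layer: retained as wrappers that forward to the core layer. */
static void demo_blk_set_queue_congested(struct demo_queue *q, enum demo_rw rw)
{
	demo_bdi_set_congested(&q->backing_dev_info, rw);
}

static void demo_blk_clear_queue_congested(struct demo_queue *q, enum demo_rw rw)
{
	demo_bdi_clear_congested(&q->backing_dev_info, rw);
}
```

In this arrangement a filesystem without a request queue would hold a `struct demo_bdi` of its own and use the core functions, while block drivers keep calling the queue wrappers unchanged.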

01 Oct, 2006 1 commit

[PATCH] BLOCK: Remove no-longer necessary linux/mpage.h inclusions [try ] · 4cb50dc2

Committed by David Howells on Aug 29, 2006

Remove inclusions of linux/mpage.h that are no longer necessary due to the
transfer of generic_writepages().
Signed-Off-By: David Howells <dhowells@redhat.com>
Signed-off-by: Jens Axboe <axboe@kernel.dk>

4cb50dc2

27 Sep, 2006 1 commit

[PATCH] Really ignore kmem_cache_destroy return value · 1a1d92c1

Committed by Alexey Dobriyan on Sep 27, 2006

* Roughly half of callers already do it by not checking return value
* Code in drivers/acpi/osl.c does the following to be sure:

	(void)kmem_cache_destroy(cache);

* Those who check it printk something, however, slab_error already printed
  the name of failed cache.
* XFS BUGs on failed kmem_cache_destroy which is not the decision
  low-level filesystem driver should make. Converted to ignore.
Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

1a1d92c1

23 Sep, 2006 3 commits

NFS: add comments clarifying the use of nfs_post_op_update() · f551e44f

Committed by Chuck Lever on Sep 20, 2006

Comments-only change to clarify a detail of the NFS protocol and how it is
implemented in Linux.

Test plan:
None.
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

f551e44f

Add a real API for dealing with blk_congestion_wait() · 275a082f

Committed by Trond Myklebust on Aug 22, 2006

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

275a082f

NFS: Share NFS superblocks per-protocol per-server per-FSID · 54ceac45

Committed by David Howells on Aug 22, 2006

The attached patch makes NFS share superblocks between mounts from the same
server and FSID over the same protocol.

It does this by creating each superblock with a false root and returning the
real root dentry in the vfsmount presented by get_sb(). The root dentry set
starts off as an anonymous dentry if we don't already have the dentry for its
inode, otherwise it simply returns the dentry we already have.

We may thus end up with several trees of dentries in the superblock, and if at
some later point one of anonymous tree roots is discovered by normal filesystem
activity to be located in another tree within the superblock, the anonymous
root is named and materialises attached to the second tree at the appropriate
point.

Why do it this way? Why not pass an extra argument to the mount() syscall to
indicate the subpath and then pathwalk from the server root to the desired
directory? You can't guarantee this will work for two reasons:

 (1) The root and intervening nodes may not be accessible to the client.

     With NFS2 and NFS3, for instance, mountd is called on the server to get
     the filehandle for the tip of a path. mountd won't give us handles for
     anything we don't have permission to access, and so we can't set up NFS
     inodes for such nodes, and so can't easily set up dentries (we'd have to
     have ghost inodes or something).

     With this patch we don't actually create dentries until we get handles
     from the server that we can use to set up their inodes, and we don't
     actually bind them into the tree until we know for sure where they go.

 (2) Inaccessible symbolic links.

     If we're asked to mount two exports from the server, eg:

	mount warthog:/warthog/aaa/xxx /mmm
	mount warthog:/warthog/bbb/yyy /nnn

     We may not be able to access anything nearer the root than xxx and yyy,
     but we may find out later that /mmm/www/yyy, say, is actually the same
     directory as the one mounted on /nnn. What we might then find out, for
     example, is that /warthog/bbb was actually a symbolic link to
     /warthog/aaa/xxx/www, but we can't actually determine that by talking to
     the server until /warthog is made available by NFS.

     This would lead to having constructed an erroneous dentry tree which we
     can't easily fix. We can end up with a dentry marked as a directory when
     it should actually be a symlink, or we could end up with an apparently
     hardlinked directory.

     With this patch we need not make assumptions about the type of a dentry
     for which we can't retrieve information, nor need we assume we know its
     place in the grand scheme of things until we actually see that place.

This patch reduces the possibility of aliasing in the inode and page caches for
inodes that may be accessed by more than one NFS export. It also reduces the
number of superblocks required for NFS where there are many NFS exports being
used from a server (home directory server + autofs for example).

This in turn makes it simpler to do local caching of network filesystems, as it
can then be guaranteed that there won't be links from multiple inodes in
separate superblocks to the same cache file.

Obviously, cache aliasing between different levels of NFS protocol could still
be a problem, but at least that gives us another key to use when indexing the
cache.

This patch makes the following changes:

 (1) The server record construction/destruction has been abstracted out into
     its own set of functions to make things easier to get right.  These have
     been moved into fs/nfs/client.c.

     All the code in fs/nfs/client.c has to do with the management of
     connections to servers, and doesn't touch superblocks in any way; the
     remaining code in fs/nfs/super.c has to do with VFS superblock management.

 (2) The sequence of events undertaken by NFS mount is now reordered:

     (a) A volume representation (struct nfs_server) is allocated.

     (b) A server representation (struct nfs_client) is acquired.  This may be
     	 allocated or shared, and is keyed on server address, port and NFS
     	 version.

     (c) If allocated, the client representation is initialised.  The state
     	 member variable of nfs_client is used to prevent a race during
     	 initialisation from two mounts.

     (d) For NFS4 a simple pathwalk is performed, walking from FH to FH to find
     	 the root filehandle for the mount (fs/nfs/getroot.c).  For NFS2/3 we
     	 are given the root FH in advance.

     (e) The volume FSID is probed for on the root FH.

     (f) The volume representation is initialised from the FSINFO record
     	 retrieved on the root FH.

     (g) sget() is called to acquire a superblock.  This may be allocated or
     	 shared, keyed on client pointer and FSID.

     (h) If allocated, the superblock is initialised.

     (i) If the superblock is shared, then the new nfs_server record is
     	 discarded.

     (j) The root dentry for this mount is looked up from the root FH.

     (k) The root dentry for this mount is assigned to the vfsmount.

 (3) nfs_readdir_lookup() creates dentries for each of the entries readdir()
     returns; this function now attaches disconnected trees from alternate
     roots that happen to be discovered attached to a directory being read (in
     the same way nfs_lookup() is made to do for lookup ops).

     The new d_materialise_unique() function is now used to do this, thus
     permitting the whole thing to be done under one set of locks, and thus
     avoiding any race between mount and lookup operations on the same
     directory.

 (4) The client management code uses a new debug facility: NFSDBG_CLIENT which
     is set by echoing 1024 to /proc/net/sunrpc/nfs_debug.

 (5) Clone mounts are now called xdev mounts.

 (6) Use the dentry passed to the statfs() op as the handle for retrieving fs
     statistics rather than the root dentry of the superblock (which is now a
     dummy).
Signed-Off-By: David Howells <dhowells@redhat.com>
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

54ceac45
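
The "allocated or shared" step of the mount sequence, where a superblock is keyed on the client pointer and the FSID, can be sketched in userspace as a lookup-or-allocate over a small table. This is a deliberately simplified illustration: `demo_sb_get`, `demo_fsid`, the fixed-size table and the plain reference count are all hypothetical, and none of the locking that the real superblock lookup needs is shown.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for the filesystem ID reported by the server. */
struct demo_fsid {
	uint64_t major;
	uint64_t minor;
};

/* Hypothetical stand-in for a superblock, keyed on (client, fsid). */
struct demo_superblock {
	const void      *client;   /* stands in for the nfs_client pointer */
	struct demo_fsid fsid;
	int              refcount;
};

#define DEMO_MAX_SB 16
static struct demo_superblock demo_sb_table[DEMO_MAX_SB];
static size_t demo_sb_count;

/*
 * Return the superblock for (client, fsid), sharing an existing one when the
 * key matches and allocating a new slot otherwise. *allocated reports which
 * case occurred, so that the caller knows whether to initialise the result.
 */
static struct demo_superblock *demo_sb_get(const void *client,
					   struct demo_fsid fsid,
					   int *allocated)
{
	*allocated = 0;
	for (size_t i = 0; i < demo_sb_count; i++) {
		struct demo_superblock *sb = &demo_sb_table[i];

		if (sb->client == client &&
		    sb->fsid.major == fsid.major &&
		    sb->fsid.minor == fsid.minor) {
			sb->refcount++;          /* shared: second mount of same volume */
			return sb;
		}
	}
	if (demo_sb_count == DEMO_MAX_SB)
		return NULL;                     /* table full in this sketch */

	struct demo_superblock *sb = &demo_sb_table[demo_sb_count++];
	sb->client = client;
	sb->fsid = fsid;
	sb->refcount = 1;
	*allocated = 1;
	return sb;
}
```

Two mounts from the same client with the same FSID get the same entry back, while a different client or a different FSID yields a separate one, which mirrors steps (g) to (i) of the sequence above.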

19 Sep, 2006 1 commit

NFS: Fix nfs_page use after free issues in fs/nfs/write.c · 5c2d97cb

Committed by Trond Myklebust on Sep 18, 2006

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

5c2d97cb

09 Sep, 2006 1 commit

[PATCH] NFS: large non-page-aligned direct I/O clobbers memory · e9f7bee1

Committed by Trond Myklebust on Sep 08, 2006

The logic in nfs_direct_read_schedule and nfs_direct_write_schedule can
allow data->npages to be one larger than rpages.  This causes a page
pointer to be written beyond the end of the pagevec in nfs_read_data (or
nfs_write_data).

Fix this by making nfs_(read|write)_alloc() calculate the size of the
pagevec array, and initialise data->npages.

Also get rid of the redundant argument to nfs_commit_alloc().
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
Cc: Chuck Lever <chuck.lever@oracle.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

e9f7bee1
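
Why a non-page-aligned buffer can need one more page slot than its length alone suggests can be shown with a small userspace calculation. The function name `pages_spanned_sketch` and the fixed 4 KiB page size are assumptions made for illustration; this is not the code from the patch, only the arithmetic that sizing a page vector from both the starting offset and the byte count relies on.

```c
#include <stddef.h>
#include <stdint.h>

#define DIO_PAGE_SHIFT 12u                           /* assumption: 4 KiB pages */
#define DIO_PAGE_SIZE  ((size_t)1 << DIO_PAGE_SHIFT)

/*
 * Number of pages touched by `count` bytes starting at `user_addr`.
 * The offset of the first byte within its page must be included, otherwise
 * an unaligned buffer is undercounted by one page.
 */
static size_t pages_spanned_sketch(uintptr_t user_addr, size_t count)
{
	size_t offset = (size_t)(user_addr & (DIO_PAGE_SIZE - 1));

	return (offset + count + DIO_PAGE_SIZE - 1) >> DIO_PAGE_SHIFT;
}
```

A 4096-byte transfer starting on a page boundary touches one page, but the same transfer starting one byte later touches two, so an array sized from the length alone would be one entry short.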

04 Aug, 2006 1 commit

NFS: make 2 functions static · e4e20512

Committed by Adrian Bunk on Aug 03, 2006

nfs_writedata_free() and nfs_readdata_free() can now become static.
Signed-off-by: Adrian Bunk <bunk@stusta.de>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
(cherry picked from 5e1ce40f0c3c8f67591aff17756930d7a18ceb1a commit)

e4e20512

06 Jul, 2006 1 commit

NFS: Fix NFS page_state usage · 83715ad5

Committed by Trond Myklebust on Jul 05, 2006

The introduction of the FLUSH_INVALIDATE argument to nfs_sync_inode_wait()
does not clear the nr_unstable page state counter for pages that are being
released.

Also fix a longstanding similar bug when nfs_commit_list() fails.
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

83715ad5

01 Jul, 2006 3 commits

[PATCH] zoned vm counters: conversion of nr_unstable to per zone counter · fd39fc85

Committed by Christoph Lameter on Jun 30, 2006

Conversion of nr_unstable to a per zone counter

We need to do some special modifications to the nfs code since there are
multiple cases of disposition and we need to have a page ref for proper
accounting.

This converts the last critical page state of the VM and therefore we need to
remove several functions that were depending on GET_PAGE_STATE_LAST in order
to make the kernel compile again.  We are only left with event type counters
in page state.

[akpm@osdl.org: bugfixes]
Signed-off-by: Christoph Lameter <clameter@sgi.com>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

fd39fc85

[PATCH] zoned vm counters: conversion of nr_dirty to per zone counter · b1e7a8fd

Committed by Christoph Lameter on Jun 30, 2006

This makes nr_dirty a per zone counter.  Looping over all processors is
avoided during writeback state determination.

The counter aggregation for nr_dirty had to be undone in the NFS layer since
we summed up the page counts from multiple zones.  Someone more familiar with
NFS should probably review what I have done.

[akpm@osdl.org: bugfix]
Signed-off-by: Christoph Lameter <clameter@sgi.com>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

b1e7a8fd

Remove obsolete #include <linux/config.h> · 6ab3d562

Committed by Jörn Engel on Jun 30, 2006

Signed-off-by: Jörn Engel <joern@wohnheim.fh-wedel.de>
Signed-off-by: Adrian Bunk <bunk@stusta.de>

6ab3d562

28 Jun, 2006 1 commit

[PATCH] fix static linking of NFS · 266bee88

Committed by David Brownell on Jun 27, 2006

Builds on ARM report link problems with common configurations like
statically linked NFS (for nfsroot).  The symptom is that __init
section code references __exit section code; that won't work since
the exit sections are discarded (since they can never be called).

The best fix for these particular cases would be an "__init_or_exit"
section annotation.
Signed-off-by: David Brownell <dbrownell@users.sourceforge.net>
Acked-by: Trond Myklebust <trond.myklebust@fys.uio.no>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

266bee88

09 Jun, 2006 3 commits

NFS: Split fs/nfs/inode.c · f7b422b1

Committed by David Howells on Jun 09, 2006

As fs/nfs/inode.c is rather large, heterogeneous and unwieldy, the attached
patch splits it up into a number of files:

 (*) fs/nfs/inode.c

     Strictly inode specific functions.

 (*) fs/nfs/super.c

     Superblock management functions for NFS and NFS4, normal access, clones
     and referrals.  The NFS4 superblock functions _could_ move out into a
     separate conditionally compiled file, but it's probably not worth it as
     there're so many common bits.

 (*) fs/nfs/namespace.c

     Some namespace-specific functions have been moved here.

 (*) fs/nfs/nfs4namespace.c

     NFS4-specific namespace functions (this could be merged into the previous
     file).  This file is conditionally compiled.

 (*) fs/nfs/internal.h

     Inter-file declarations, plus a few simple utility functions moved from
     fs/nfs/inode.c.

     Additionally, all the in-.c-file externs have been moved here, and those
     files they were moved from now include this file.

For the most part, the functions have not been changed, only some multiplexor
functions have changed significantly.

I've also:

 (*) Added some extra banner comments above some functions.

 (*) Rearranged the function order within the files to be more logical and
     better grouped (IMO), though someone may prefer a different order.

 (*) Reduced the number of #ifdefs in .c files.

 (*) Added missing __init and __exit directives.
Signed-Off-By: David Howells <dhowells@redhat.com>

f7b422b1

NFS: Flesh out nfs_invalidate_page() · d2ccddf0

Committed by Trond Myklebust on May 31, 2006

In the case of a call to truncate_inode_pages(), we should really try to
cancel any pending writes on the page.
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

d2ccddf0

NFS: Optimize allocation of nfs_read/write_data structures · 0d0b5cb3

Committed by Chuck Lever on May 25, 2006

Clean up use of page_array, and fix an off-by-one error noticed by Tom
Talpey which causes kmalloc calls in cases where using the page_array
is sufficient.

Test plan:
Normal client functional testing with r/wsize=32768.
Signed-off-by: Chuck Lever <cel@netapp.com>
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

0d0b5cb3

27 Mar, 2006 1 commit

[PATCH] mempool: use mempool_create_slab_pool() · 93d2341c

Committed by Matthew Dobson on Mar 26, 2006

Modify well over a dozen mempool users to call mempool_create_slab_pool()
rather than calling mempool_create() with extra arguments, saving about 30
lines of code and increasing readability.
Signed-off-by: Matthew Dobson <colpatch@us.ibm.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

93d2341c

21 Mar, 2006 4 commits

NFS: Fix a race in nfs_sync_inode() · c42de9dd

Committed by Trond Myklebust on Mar 20, 2006

Kudos to Neil Brown for spotting the problem:

"in nfs_sync_inode, there is effectively the sequence:

   nfs_wait_on_requests
   nfs_flush_inode
   nfs_commit_inode

 This seems a bit racy to me as if the only requests are on the
 ->commit list, and nfs_commit_inode is called separately after
 nfs_wait_on_requests completes, and before nfs_commit_inode start
 (say: by nfs_write_inode) then none of these function will return
 >0, yet there will be some pending request that aren't waited for."

The solution is to search for requests to wait upon, search for dirty
requests, and search for uncommitted requests while holding the
nfsi->req_lock

The patch also cleans up nfs_sync_inode(), getting rid of the redundant
FLUSH_WAIT flag. It turns out that we were always setting it.
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

c42de9dd

NFS: Clean up nfs_flush_list() · 7d46a49f

Committed by Trond Myklebust on Mar 20, 2006

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

7d46a49f

NFS: Fix a race with PG_private and nfs_release_page() · deb7d638

Committed by Trond Myklebust on Mar 20, 2006

We don't need to set PG_private for readahead pages, since they never get
unlocked while I/O is in progress. However there is a small race in
nfs_readpage_release() whereby the page may be unlocked, and have
PG_private set.

Fix is to have PG_private set only for the case of writes...

Also fix a bug in nfs_clear_page_writeback(): Don't attempt to clear the
radix_tree tag if we've already deleted the radix tree entry.
Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

deb7d638

NFS: Uninline nfs_writedata_(alloc|free) and nfs_readdata_(alloc|free) · 3feb2d49

Committed by Trond Myklebust on Mar 20, 2006

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>

3feb2d49
