提交 · e7d39069e387a12d4c57f4067d9f48c1d29ea900 · openanolis / cloud-kernel

10 7月, 2008 5 次提交

NFS: Clean up nfs_update_request() · e7d39069

由 Trond Myklebust 提交于 6月 13, 2008

Simplify the loop in nfs_update_request by moving into a separate function
the code that attempts to update an existing cached NFS write.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

e7d39069

NFS: Fix trace debugging nits in write.c · 48186c7d

由 Chuck Lever 提交于 6月 11, 2008

Clean up: fix a few dprintk messages that still need to show the RPC task ID
correctly, and be sure we use the preferred %lld or %llu instead of %Ld or
%Lu.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

48186c7d

NFS: Revert commit · 7e5f6146

由 Trond Myklebust 提交于 5月 25, 2008

Revert commit 44dd151d "NFS: Don't mark a written page as uptodate until it
is on disk". While it is true that the write may fail, that is always the
case. There is no reason why we should treat data on pages that are not
already marked as PG_uptodate as being special. The only thing we gain is a
noticeable slowdown when re-reading these pages.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

7e5f6146

NFS: Optimise append writes with holes · efc91ed0

由 Trond Myklebust 提交于 6月 10, 2008

If a file is being extended, and we're creating a hole, we might as well
declare the entire page to be up to date.

This patch significantly improves the write performance for sparse files
in the case where lseek(SEEK_END) is used to append several non-contiguous
writes at intervals of < PAGE_SIZE.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

efc91ed0

NFS: Fix a preemption count leak in nfs_update_request · f3d47a3a

由 Trond Myklebust 提交于 6月 05, 2008

The commit 27852596 (nfs: use GFP_NOFS
preloads for radix-tree insertion) appears to have introduced a bug:
We only want to call radix_tree_preload() once after creating a request.
Calling it every time we loop after we created the request, will cause
preemption count leaks.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
Cc: Nick Piggin <npiggin@suse.de>

f3d47a3a

24 6月, 2008 1 次提交
- T
  NFS: nfs_updatepage(): don't mark page as dirty if an error occurred · 03fa9e84
  由 Trond Myklebust 提交于 6月 05, 2008
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
  03fa9e84
17 5月, 2008 1 次提交

nfs: fix race in nfs_dirty_request · 38def50f

由 Fred Isaman 提交于 5月 01, 2008

When called from nfs_flush_incompatible, the req is not locked, so
req->wb_page might be set to NULL before it is used by PageWriteback.
Signed-off-by: NFred Isaman <iisaman@citi.umich.edu>
Signed-off-by: NBenny Halevy <bhalevy@panasas.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

38def50f

20 4月, 2008 3 次提交

T
NFS: Ensure that rpc_run_task() errors are propagated back to the caller · dbae4c73
由 Trond Myklebust 提交于 4月 14, 2008
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
dbae4c73
T
NFS: Ensure that the write code cleans up properly when rpc_run_task() fails · c9d8f89d
由 Trond Myklebust 提交于 4月 15, 2008
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
c9d8f89d

NFS: Fix nfs_wb_page() to always exit with an error or a clean page · 73e3302f

由 Trond Myklebust 提交于 4月 11, 2008

It is possible for nfs_wb_page() to sometimes exit with 0 return value, yet
the page is left in a dirty state.
For instance in the case where the server rebooted, and the COMMIT request
failed, then all the previously "clean" pages which were cached by the
server, but were not guaranteed to have been writted out to disk,
have to be redirtied and resent to the server.
The fix is to have nfs_wb_page_priority() check that the page is clean
before it exits...

This fixes a condition that triggers the BUG_ON(PagePrivate(page)) in
nfs_create_request() when we're in the nfs_readpage() path.

Also eliminate a redundant BUG_ON(!PageLocked(page)) while we're at it. It
turns out that clear_page_dirty_for_io() has the exact same test.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

73e3302f

20 3月, 2008 2 次提交

nfs: nfs_redirty_request · 6d884e8f

由 Fred 提交于 3月 19, 2008

Both flush functions have the same error handling routine.  Pull
it out as a function.
Signed-off-by: NFred Isaman <iisaman@citi.umich.edu>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

6d884e8f

nfs: don't ignore return value from nfs_pageio_add_request · f8512ad0

由 Fred Isaman 提交于 3月 19, 2008

Ignoring the return value from nfs_pageio_add_request can cause deadlocks.

In read path:
  call nfs_pageio_add_request from readpage_async_filler
  assume at this point that there are requests already in desc, that
    can't be merged with the current request.
  so nfs_pageio_doio is fired up to clear out desc.
  assume something goes wrong in setting up the io, so desc->pg_error is set.
  This causes nfs_pageio_add_request to return 0, *WITHOUT* adding the original
    request.
  BUT, since return code is ignored, readpage_async_filler assumes it has
    been added, and does nothing further, leaving page locked.
  do_generic_mapping_read will eventually call lock_page, resulting in deadlock

In write path:
  page is marked dirty by generic_perform_write
  nfs_writepages is called
  call nfs_pageio_add_request from nfs_page_async_flush
  assume at this point that there are requests already in desc, that
    can't be merged with the current request.
  so nfs_pageio_doio is fired up to clear out desc.
  assume something goes wrong in setting up the io, so desc->pg_error is set.
  This causes nfs_page_async_flush to return 0, *WITHOUT* adding the original
    request, yet marking the request as locked (PG_BUSY) and in writeback,
    clearing dirty marks.
  The next time a write is done to the page, deadlock will result as
    nfs_write_end calls nfs_update_request
Signed-off-by: NFred Isaman <iisaman@citi.umich.edu>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

f8512ad0

08 3月, 2008 1 次提交

NFS: Fix an f_mode/f_flags confusion in fs/nfs/write.c · af1b8c2f

由 Trond Myklebust 提交于 2月 25, 2008

O_SYNC is stored in filp->f_flags.
Thanks to Al Viro for pointing out the bug.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

af1b8c2f

29 2月, 2008 1 次提交

SUNRPC: Remove now-redundant RCU-safe rpc_task free path · 5e4424af

由 Trond Myklebust 提交于 2月 25, 2008

Now that we've tightened up the locking rules for RPC queue wakeups, we can
remove the RCU-safe kfree calls...
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

5e4424af

26 2月, 2008 3 次提交

NFS: Ensure that the asynchronous RPC calls complete on nfsiod. · 101070ca

由 Trond Myklebust 提交于 2月 19, 2008

We want to ensure that rpc_call_ops that involve mntput() are run on nfsiod
rather than on rpciod, so that they don't deadlock when the resulting
umount calls rpc_shutdown_client(). Hence we specify that read, write and
commit calls must complete on nfsiod.
Ditto for NFSv4 open, lock, locku and close asynchronous calls.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

101070ca

NFS: Fix a deadlock with lazy umount · 383ba719

由 Trond Myklebust 提交于 2月 19, 2008

We can't allow rpc callback functions like task->tk_ops->rpc_call_prepare()
and task->tk_ops->rpc_call_done() to call mntput() in any way, since
that will cause a deadlock when the call to rpc_shutdown_client() attempts
to wait on 'task' to complete.

We can avoid the above deadlock by moving calls to mntput to
task->tk_ops->rpc_release() callback, since at that time the task will be
marked as completed, and so rpc_shutdown_client won't attempt to wait on
it.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

383ba719

NFS: Fix an f_mode/f_flags confusion in fs/nfs/write.c · 4b5621f6

由 Trond Myklebust 提交于 2月 25, 2008

O_SYNC is stored in filp->f_flags.
Thanks to Al Viro for pointing out the bug.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

4b5621f6

14 2月, 2008 1 次提交

nfs: use GFP_NOFS preloads for radix-tree insertion · 27852596

由 Nick Piggin 提交于 2月 04, 2008

NFS should use GFP_NOFS mode radix tree preloads rather than GFP_ATOMIC
allocations at radix-tree insertion-time.  This is important to reduce the
atomic memory requirement.
Signed-off-by: NNick Piggin <npiggin@suse.de>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Cc: "J. Bruce Fields" <bfields@fieldses.org>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

27852596

08 2月, 2008 1 次提交

NFS: Fix a potential file corruption issue when writing · 5d47a356

由 Trond Myklebust 提交于 2月 07, 2008

If the inode is flagged as having an invalid mapping, then we can't rely on
the PageUptodate() flag. Ensure that we don't use the "anti-fragmentation"
write optimisation in nfs_updatepage(), since that will cause NFS to write
out areas of the page that are no longer guaranteed to be up to date.

A potential corruption could occur in the following scenario:

client 1			client 2
===============			===============
				fd=open("f",O_CREAT|O_WRONLY,0644);
				write(fd,"fubar\n",6);	// cache last page
				close(fd);
fd=open("f",O_WRONLY|O_APPEND);
write(fd,"foo\n",4);
close(fd);

				fd=open("f",O_WRONLY|O_APPEND);
				write(fd,"bar\n",4);
				close(fd);
-----
The bug may lead to the file "f" reading 'fubar\n\0\0\0\nbar\n' because
client 2 does not update the cached page after re-opening the file for
write. Instead it keeps it marked as PageUptodate() until someone calls
invaldate_inode_pages2() (typically by calling read()).
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

5d47a356

06 2月, 2008 1 次提交

Pagecache zeroing: zero_user_segment, zero_user_segments and zero_user · eebd2aa3

由 Christoph Lameter 提交于 2月 04, 2008

Simplify page cache zeroing of segments of pages through 3 functions

zero_user_segments(page, start1, end1, start2, end2)

        Zeros two segments of the page. It takes the position where to
        start and end the zeroing which avoids length calculations and
	makes code clearer.

zero_user_segment(page, start, end)

        Same for a single segment.

zero_user(page, start, length)

        Length variant for the case where we know the length.

We remove the zero_user_page macro. Issues:

1. Its a macro. Inline functions are preferable.

2. The KM_USER0 macro is only defined for HIGHMEM.

   Having to treat this special case everywhere makes the
   code needlessly complex. The parameter for zeroing is always
   KM_USER0 except in one single case that we open code.

Avoiding KM_USER0 makes a lot of code not having to be dealing
with the special casing for HIGHMEM anymore. Dealing with
kmap is only necessary for HIGHMEM configurations. In those
configurations we use KM_USER0 like we do for a series of other
functions defined in highmem.h.

Since KM_USER0 is depends on HIGHMEM the existing zero_user_page
function could not be a macro. zero_user_* functions introduced
here can be be inline because that constant is not used when these
functions are called.

Also extract the flushing of the caches to be outside of the kmap.

[akpm@linux-foundation.org: fix nfs and ntfs build]
[akpm@linux-foundation.org: fix ntfs build some more]
Signed-off-by: NChristoph Lameter <clameter@sgi.com>
Cc: Steven French <sfrench@us.ibm.com>
Cc: Michael Halcrow <mhalcrow@us.ibm.com>
Cc: <linux-ext4@vger.kernel.org>
Cc: Steven Whitehouse <swhiteho@redhat.com>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Cc: "J. Bruce Fields" <bfields@fieldses.org>
Cc: Anton Altaparmakov <aia21@cantab.net>
Cc: Mark Fasheh <mark.fasheh@oracle.com>
Cc: David Chinner <dgc@sgi.com>
Cc: Michael Halcrow <mhalcrow@us.ibm.com>
Cc: Steven French <sfrench@us.ibm.com>
Cc: Steven Whitehouse <swhiteho@redhat.com>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

eebd2aa3

30 1月, 2008 6 次提交

NFS: Fix minor mixed sign comparison in NFS client's write logic · bf4285e7

由 Chuck Lever 提交于 12月 20, 2007

Clean up: PAGE_CACHE_SIZE is unsigned, and nfs_pageio_init() takes a size_t.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

bf4285e7

T
NFS/SUNRPC: Convert users of rpc_init_task+rpc_execute to rpc_run_task() · 07737691
由 Trond Myklebust 提交于 10月 25, 2007
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
07737691

NFS: Clean up the (commit|read|write)_setup() callback routines · bdc7f021

由 Trond Myklebust 提交于 7月 14, 2007

Move the common code for setting up the nfs_write_data and nfs_read_data
structures into fs/nfs/read.c, fs/nfs/write.c and fs/nfs/direct.c.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

bdc7f021

SUNRPC: Clean up the initialisation of priority queue scheduling info. · 3ff7576d

由 Trond Myklebust 提交于 7月 14, 2007

We want the default scheduling priority (priority == 0) to remain
RPC_PRIORITY_NORMAL.

Also ensure that the priority wait queue scheduling is per process id
instead of sometimes being per thread, and sometimes being per inode.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

3ff7576d

T
SUNRPC: Cleanup of rpc_task initialisation · 84115e1c
由 Trond Myklebust 提交于 7月 14, 2007
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
84115e1c

NFS: Clean up the write request locking. · acee478a

由 Trond Myklebust 提交于 1月 22, 2008

Ensure that we set/clear NFS_PAGE_TAG_LOCKED when the nfs_page is hashed.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

acee478a

07 12月, 2007 1 次提交

NFS: Switch from intr mount option to TASK_KILLABLE · 150030b7

由 Matthew Wilcox 提交于 12月 06, 2007

By using the TASK_KILLABLE infrastructure, we can get rid of the 'intr'
mount option.  We have to use _killable everywhere instead of _interruptible
as we get rid of rpc_clnt_sigmask/sigunmask.
Signed-off-by: NLiam R. Howlett <howlett@gmail.com>
Signed-off-by: NMatthew Wilcox <willy@linux.intel.com>

150030b7

27 11月, 2007 1 次提交

NFS: make nfs_wb_page_priority() static · 5334eb13

由 Adrian Bunk 提交于 11月 21, 2007

nfs_wb_page_priority() can now become static.
Signed-off-by: NAdrian Bunk <bunk@kernel.org>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Cc: "J. Bruce Fields" <bfields@fieldses.org>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

5334eb13

20 10月, 2007 1 次提交

NFS: Fix a writeback race... · 61e930a9

由 Trond Myklebust 提交于 10月 18, 2007

This patch fixes a regression that was introduced by commit
44dd151d

We cannot zero the user page in nfs_mark_uptodate() any more, since

  a) We'd be modifying the page without holding the page lock
  b) We can race with other updates of the page, most notably
     because of the call to nfs_wb_page() in nfs_writepage_setup().

Instead, we do the zeroing in nfs_update_request() if we see that we're
creating a request that might potentially be marked as up to date.

Thanks to Olivier Paquet for reporting the bug and providing a test-case.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

61e930a9

17 10月, 2007 2 次提交

mm: count reclaimable pages per BDI · c9e51e41

由 Peter Zijlstra 提交于 10月 16, 2007

Count per BDI reclaimable pages; nr_reclaimable = nr_dirty + nr_unstable.
Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

c9e51e41

nfs: remove congestion_end() · c4dc4bee

由 Peter Zijlstra 提交于 10月 16, 2007

These patches aim to improve balance_dirty_pages() and directly address three
issues:
  1) inter device starvation
  2) stacked device deadlocks
  3) inter process starvation

1 and 2 are a direct result from removing the global dirty limit and using
per device dirty limits. By giving each device its own dirty limit is will
no longer starve another device, and the cyclic dependancy on the dirty limit
is broken.

In order to efficiently distribute the dirty limit across the independant
devices a floating proportion is used, this will allocate a share of the total
limit proportional to the device's recent activity.

3 is done by also scaling the dirty limit proportional to the current task's
recent dirty rate.

This patch:

nfs: remove congestion_end().  It's redundant, clear_bdi_congested() already
wakes the waiters.
Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Cc: "J. Bruce Fields" <bfields@fieldses.org>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

c4dc4bee

10 10月, 2007 7 次提交

NFS: Remove nfs_begin_data_update/nfs_end_data_update · 60ccd4ec

由 Trond Myklebust 提交于 9月 29, 2007

The lower level routines in fs/nfs/proc.c, fs/nfs/nfs3proc.c and
fs/nfs/nfs4proc.c should already be dealing with the revalidation issues.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

60ccd4ec

T
NFS: Replace file->private_data with calls to nfs_file_open_context() · cd3758e3
由 Trond Myklebust 提交于 8月 10, 2007
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
cd3758e3

NFS: Fall back to synchronous writes when a background write errors... · 7b159fc1

由 Trond Myklebust 提交于 7月 25, 2007

This helps prevent huge queues of background writes from building up
whenever the server runs out of disk or quota space, or if someone changes
the file access modes behind our backs.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

7b159fc1

NFS: Writeback optimisation · 34901f70

由 Trond Myklebust 提交于 7月 25, 2007

Schedule writes using WB_SYNC_NONE first, then come back for a second pass
using WB_SYNC_ALL.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

34901f70

NFS: Clean up NFS writeback flush code · ed90ef51

由 Trond Myklebust 提交于 7月 20, 2007

The only user of nfs_sync_mapping_range() is nfs_getattr(), which uses it
to flush out the entire inode without sending a commit. We therefore
replace nfs_sync_mapping_range with a more appropriate helper.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

ed90ef51

NFS: Clean up nfs_writepages() · f758c885

由 Trond Myklebust 提交于 7月 22, 2007

Just call write_cache_pages directly instead of hacking the writeback
control structure in order to find out if we were called from writepages()
or directly from the VM.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

f758c885

NFS: Clean up write code... · 9cccef95

由 Trond Myklebust 提交于 7月 22, 2007

The addition of nfs_page_mkwrite means that We should no longer need to
create requests inside nfs_writepage()
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

9cccef95

01 9月, 2007 1 次提交

NFS: Fix a write request leak in nfs_invalidate_page() · 1b3b4a1a

由 Trond Myklebust 提交于 8月 28, 2007

Ryusuke Konishi says:

The recent truncate_complete_page() clears the dirty flag from a page
before calling a_ops->invalidatepage(),
^^^^^^
static void
truncate_complete_page(struct address_space *mapping, struct page *page)
{
        ...
        cancel_dirty_page(page, PAGE_CACHE_SIZE);  <--- Inserted here at
kernel 2.6.20

        if (PagePrivate(page))
                do_invalidatepage(page, 0);   ---> will call
a_ops->invalidatepage()
        ...
}

and this is disturbing nfs_wb_page_priority() from calling 
nfs_writepage_locked() that is expected to handle the pending
request (=nfs_page) associated with the page.

int nfs_wb_page_priority(struct inode *inode, struct page *page, int how)
{
        ...
        if (clear_page_dirty_for_io(page)) {
                ret = nfs_writepage_locked(page, &wbc);
                if (ret < 0)
                        goto out;
        }
        ...
}

Since truncate_complete_page() will get rid of the page after
a_ops->invalidatepage() returns, the request (=nfs_page) associated
with the page becomes a garbage in nfs_inode->nfs_page_tree.
------------------------

Fix this by ensuring that nfs_wb_page_priority() recognises that it may
also need to clear out non-dirty pages that have an nfs_page associated
with them.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

1b3b4a1a

20 7月, 2007 1 次提交

mm: Remove slab destructors from kmem_cache_create(). · 20c2df83

由 Paul Mundt 提交于 7月 20, 2007

Slab destructors were no longer supported after Christoph's
c59def9f change. They've been
BUGs for both slab and slub, and slob never supported them
either.

This rips out support for the dtor pointer from kmem_cache_create()
completely and fixes up every single callsite in the kernel (there were
about 224, not including the slab allocator definitions themselves,
or the documentation references).
Signed-off-by: NPaul Mundt <lethal@linux-sh.org>

20c2df83

openanolis / cloud-kernel 大约 1 年 前同步成功

openanolis / cloud-kernel
大约 1 年前同步成功