提交 · 464ad6b1ade186b53a1dae863361853326b85694 · OpenHarmony / kernel_linux

30 1月, 2008 20 次提交

NFS: Change sign of some loop indices in nfs4xdr.c · 464ad6b1

由 Chuck Lever 提交于 10月 26, 2007

Nit: Eliminate some mixed sign comparisons in loop indices.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

464ad6b1

NFS: Use unsigned intermediates for manipulating header lengths (NFSv4 XDR) · bcecff77

由 Chuck Lever 提交于 10月 26, 2007

Clean up: prevent length underflow and mixed sign comparison when
unmarshalling NFS version 4 getacl, readdir, and readlink replies.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

bcecff77

NFS: Use unsigned intermediates for manipulating header lengths (NFSv3 XDR) · c957c526

由 Chuck Lever 提交于 10月 26, 2007

Clean up: prevent length underflow and mixed sign comparisons when
unmarshalling NFS version 3 read, readdir, and readlink replies.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

c957c526

NFS: Use unsigned intermediates for manipulating header lengths (NFSv2 XDR) · 6232dbbc

由 Chuck Lever 提交于 10月 26, 2007

Clean up: prevent length underflow and mixed sign comparisons when
unmarshalling NFS version 2 read, readdir, and readlink replies.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

6232dbbc

NFS: Ensure nfs_wcc_update_inode always converts file size to loff_t · 8a8c74bf

由 Chuck Lever 提交于 10月 26, 2007

The nfs_wcc_update_inode() function omits logic to convert the type of
the NFS on-the-wire value of a file's size (__u64) to the type of file
size value stored in struct inode (loff_t, which is signed).

Everywhere else in the NFS client I checked already correctly converts the
file size type.

This effects only very large files.
Signed-off-by: NChuck Lever <chuck.lever@oracle.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

8a8c74bf

T
NFS/SUNRPC: Convert users of rpc_init_task+rpc_execute to rpc_run_task() · 07737691
由 Trond Myklebust 提交于 10月 25, 2007
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
07737691

NFS/SUNRPC: Convert all users of rpc_call_setup() · 5138fde0

由 Trond Myklebust 提交于 7月 14, 2007

Replace use of rpc_call_setup() with rpc_init_task(), and in cases where we
need to initialise task->tk_action, with rpc_call_start().
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

5138fde0

NFS: Clean up the (commit|read|write)_setup() callback routines · bdc7f021

由 Trond Myklebust 提交于 7月 14, 2007

Move the common code for setting up the nfs_write_data and nfs_read_data
structures into fs/nfs/read.c, fs/nfs/write.c and fs/nfs/direct.c.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

bdc7f021

SUNRPC: Clean up the initialisation of priority queue scheduling info. · 3ff7576d

由 Trond Myklebust 提交于 7月 14, 2007

We want the default scheduling priority (priority == 0) to remain
RPC_PRIORITY_NORMAL.

Also ensure that the priority wait queue scheduling is per process id
instead of sometimes being per thread, and sometimes being per inode.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

3ff7576d

SUNRPC: Clean up rpc_run_task · c970aa85

由 Trond Myklebust 提交于 7月 14, 2007

Make it use the new task initialiser structure instead of acting as a
wrapper.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

c970aa85

T
SUNRPC: Cleanup of rpc_task initialisation · 84115e1c
由 Trond Myklebust 提交于 7月 14, 2007
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
84115e1c

NFS: Stop sillyname renames and unmounts from racing · ef818a28

由 Steve Dickson 提交于 11月 08, 2007

Added an active/deactive mechanism to the nfs_server structure
allowing async operations to hold off umount until the
operations are done.
Signed-off-by: NSteve Dickson <steved@redhat.com>
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

ef818a28

NFSv4: Clean up the OPEN/CLOSE serialisation code · 2f74c0a0

由 Trond Myklebust 提交于 1月 08, 2008

Reduce the time spent locking the rpc_sequence structure by queuing the
nfs_seqid only when we are ready to take the lock (when calling
nfs_wait_on_sequence).
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

2f74c0a0

NFS: Clean up the write request locking. · acee478a

由 Trond Myklebust 提交于 1月 22, 2008

Ensure that we set/clear NFS_PAGE_TAG_LOCKED when the nfs_page is hashed.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

acee478a

NFS: Optimise nfs_vm_page_mkwrite() · 8b1f9ee5

由 Trond Myklebust 提交于 1月 22, 2008

The current model locks the page twice for no good reason. Optimise by
inlining the parts of nfs_write_begin()/nfs_write_end() that we care about.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

8b1f9ee5

T
NFS: Ensure that we eject stale inodes as soon as possible · 77f11192
由 Trond Myklebust 提交于 1月 28, 2008
```
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>
```
77f11192

NFS: Handle -ENOENT errors in unlink()/rmdir()/rename() · d45b9d8b

由 Trond Myklebust 提交于 1月 28, 2008

If the server returns an ENOENT error, we still need to do a d_delete() in
order to ensure that the dentry is deleted.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

d45b9d8b

NFS: Sillyrename: in the case of a race, check aliases are really positive · 609005c3

由 Trond Myklebust 提交于 1月 28, 2008

In nfs_do_call_unlink() we check that we haven't raced, and that lookup()
hasn't created an aliased dentry to our sillydeleted dentry. If somebody
has deleted the file on the server and the lookup() resulted in a negative
dentry, then ignore...
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

609005c3

NFS: Fix a sillyrename race... · fccca7fc

由 Trond Myklebust 提交于 1月 26, 2008

Ensure that readdir revalidates its data cache after blocking on
sillyrename.

Also fix a typo in nfs_do_call_unlink(): swap the ^= for an |=. The result
is the same, since we've already checked that the flag is unset, but it
makes the code more readable.
Signed-off-by: NTrond Myklebust <Trond.Myklebust@netapp.com>

fccca7fc

splice: fix problem with atime not being updated · 9e97198d

由 Jens Axboe 提交于 1月 29, 2008

A bug report on nfsd that states that since it was switched to use
splice instead of sendfile, the atime was no longer being updated
on the input file. do_generic_mapping_read() does this when accessing
the file, make splice do it for the direct splice handler.
Signed-off-by: NJens Axboe <jens.axboe@oracle.com>

9e97198d

29 1月, 2008 20 次提交

jbd2: sparse pointer use of zero as null · 4019191b

由 Mingming Cao 提交于 1月 28, 2008

Get rid of sparse related warnings from places that use integer as NULL
pointer.  (Ported from upstream ext3/jbd changes.)
Signed-off-by: NMingming Cao <cmm@us.ibm.com>
Signed-off-by: N"Theodore Ts'o" <tytso@mit.edu>

4019191b

jbd2: Use round-jiffies() function for the "5 second" ext4/jbd2 wakeup · db857da3

由 Mingming Cao 提交于 1月 28, 2008

While "every 5 seconds" doesn't sound as a problem, there can be many
of these (and these timers do add up over all the kernel).  The "5
second" wakeup isn't really timing sensitive; in addition even with
rounding it'll still happen every 5 seconds (with the exception of the
very first time, which is likely to be rounded up to somewhere closer
to 6 seconds)

(Ported from similar JBD patch made by Arjan van de Ven to
fs/jbd/transaction.c)

Cc: Arjan van de Ven <arjan@linux.intel.com>
Cc: Andrew Morton <akpm@osdl.org>
Signed-off-by: NMingming Cao <cmm@us.ibm.com>
Signed-off-by: N"Theodore Ts'o" <tytso@mit.edu>

db857da3

jbd2: Mark jbd2 slabs as SLAB_TEMPORARY · 77160957

由 Mingming Cao 提交于 1月 28, 2008

This patch marks slab allocations by jbd2 as short-lived in support of
Mel Gorman's "Group short-lived and reclaimable kernel allocations"
patch.  (Ported from similar changes made to fs/jbd/journal.c and
fs/jbd/revoke.c in Mel's patch.)

Cc: Mel Gorman <mel@csn.ul.ie>
Cc: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: NMingming Cao <cmm@us.ibm.com>
Signed-off-by: N"Theodore Ts'o" <tytso@mit.edu>

77160957

jbd2: add lockdep support · 7b751066

由 Mingming Cao 提交于 1月 28, 2008

Ported from similar patch for the jbd layer.
Signed-off-by: NMingming Cao <cmm@us.ibm.com>
Signed-off-by: N"Theodore Ts'o" <tytso@mit.edu>

7b751066

ext4: Use the ext4_ext_actual_len() helper function · b939e376

由 Aneesh Kumar K.V 提交于 1月 28, 2008

ext4 uses the high bit of the extent length to encode whether the extent
is intialized or not. The helper function ext4_ext_get_actual_len should
be used to get the actual length of the extent.

This addresses the kernel bug documented here: 
     http://bugzilla.kernel.org/show_bug.cgi?id=9732

kernel BUG at fs/ext4/extents.c:1056!
....
Call Trace:
[<ffffffff88366073>] :ext4dev:ext4_ext_get_blocks+0x5ba/0x8c1
[<ffffffff81053c91>] lock_release_holdtime+0x27/0x49
[<ffffffff812748f6>] _spin_unlock+0x17/0x20
[<ffffffff883400a6>] :jbd2:start_this_handle+0x4e0/0x4fe
[<ffffffff88366564>] :ext4dev:ext4_fallocate+0x175/0x39a
[<ffffffff81053c91>] lock_release_holdtime+0x27/0x49
[<ffffffff81056480>] __lock_acquire+0x4e7/0xc4d
[<ffffffff81053c91>] lock_release_holdtime+0x27/0x49
[<ffffffff810a8de7>] sys_fallocate+0xe4/0x10d
[<ffffffff8100c043>] tracesys+0xd5/0xda
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: N"Theodore Ts'o" <tytso@mit.edu>

b939e376

ext4: fix uniniatilized extent splitting error · dbf9d7da

由 Dmitry Monakhov 提交于 1月 28, 2008

Fix bug reported by Dmitry Monakhov caused by lost error code

    Testcase: 

    blksize = 0x1000;
    fd = open(argv[1], O_RDWR|O_CREAT, 0700);
    unsigned long long sz = 0x10000000UL;
    /* allocating big blocks chunk */
    syscall(__NR_fallocate, fd, 0, 0UL, sz)

    /* grab all other available filesystem space */
    tfd = open("tmp", O_RDWR|O_CREAT|O_DIRECT, 0700);
    while( write(tfd, buf, 4096) > 0); /* loop untill ENOSPC */
    fsync(fd); /* just in case */
    while (pos < sz) {
    	/* each seek+ write operation result in splits uninitialized extent
    	in three extents. Splitting may result in new extent allocation
    	which probably will fail because of ENOSPC*/

    	lseek(fd, blksize*2 -1, SEEK_CUR);
    	if ((ret = write(fd, 'a', 1)) != 1)
    		exit(1);
    	pos += blksize * 2;
    }
Signed-off-by: NDmitry Monakhov <dmonakhov@openvz.org>
Signed-off-by: NMingming Cao <cmm@us.ibm.com>
Signed-off-by: N"Theodore Ts'o" <tytso@mit.edu>

dbf9d7da

ext4: Check for return value from sb_set_blocksize · ce40733c

由 Aneesh Kumar K.V 提交于 1月 28, 2008

sb_set_blocksize validates whether the specfied block size can be used by
the file system. Make sure we fail mounting the file system if the
blocksize specfied cannot be used.
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NMingming Cao <cmm@us.ibm.com>

ce40733c

ext4: Add stripe= option to /proc/mounts · cb45bbe4

由 Miklos Szeredi 提交于 1月 28, 2008

Add stripe= option to /proc/mounts for ext4 filesystems.
Signed-off-by: NMiklos Szeredi <mszeredi@suse.cz>
Signed-off-by: NMingming Cao <cmm@us.ibm.com>

cb45bbe4

ext4: Enable the multiblock allocator by default · 3dbd0ede

由 Aneesh Kumar K.V 提交于 1月 28, 2008

Enable the multiblock allocator by default.

Fix ext4_show_options() so if it is not enabled, the nomballoc option
included in /proc/mounts.
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Acked-by: NEric Sandeen <sandeen@redhat.com>
Signed-off-by: NMingming Cao <cmm@us.ibm.com>
Signed-off-by: N"Theodore Ts'o" <tytso@mit.edu>

3dbd0ede

ext4: Add multi block allocator for ext4 · c9de560d

由 Alex Tomas 提交于 1月 29, 2008

Signed-off-by: NAlex Tomas <alex@clusterfs.com>
Signed-off-by: NAndreas Dilger <adilger@clusterfs.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NEric Sandeen <sandeen@redhat.com>
Signed-off-by: N"Theodore Ts'o" <tytso@mit.edu>

c9de560d

ext4: Add new functions for searching extent tree · 1988b51e

由 Alex Tomas 提交于 1月 28, 2008

Add the functions ext4_ext_search_left() and ext4_ext_search_right(),
which are used by mballoc during ext4_ext_get_blocks to decided whether
to merge extent information.
Signed-off-by: NAlex Tomas <alex@clusterfs.com>
Signed-off-by: NAndreas Dilger <adilger@clusterfs.com>
Signed-off-by: NJohann Lombardi <johann@clusterfs.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: N"Theodore Ts'o" <tytso@mit.edu>

1988b51e

ext4: fix up EXT4FS_DEBUG builds · c549a95d

由 Eric Sandeen 提交于 1月 28, 2008

Builds with EXT4FS_DEBUG defined (to enable ext4_debug()) fail
without these changes.  Clean up some format warnings too.
Signed-off-by: NEric Sandeen <sandeen@redhat.com>
Signed-off-by: NMingming Cao <cmm@us.ibm.com>

c549a95d

ext4: Fix ext4_show_options to show the correct mount options. · aa22df2c

由 Aneesh Kumar K.V 提交于 1月 28, 2008

We need to look at the default value and make sure
the mount options are not set via default value
before showing them via ext4_show_options
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>

aa22df2c

ext4: Add EXT4_IOC_MIGRATE ioctl · c14c6fd5

由 Aneesh Kumar K.V 提交于 1月 28, 2008

The below patch add ioctl for migrating ext3 indirect block mapped inode
to ext4 extent mapped inode.
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>

c14c6fd5

ext4: Add inode version support in ext4 · 25ec56b5

由 Jean Noel Cordenner 提交于 1月 28, 2008

This patch adds 64-bit inode version support to ext4. The lower 32 bits
are stored in the osd1.linux1.l_i_version field while the high 32 bits
are stored in the i_version_hi field newly created in the ext4_inode.
This field is incremented in case the ext4_inode is large enough. A
i_version mount option has been added to enable the feature.
Signed-off-by: NMingming Cao <cmm@us.ibm.com>
Signed-off-by: NAndreas Dilger <adilger@clusterfs.com>
Signed-off-by: NKalpak Shah <kalpak@clusterfs.com>
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>
Signed-off-by: NJean Noel Cordenner <jean-noel.cordenner@bull.net>

25ec56b5

vfs: Add 64 bit i_version support · 7a224228

由 Jean Noel Cordenner 提交于 1月 28, 2008

The i_version field of the inode is changed to be a 64-bit counter that
is set on every inode creation and that is incremented every time the
inode data is modified (similarly to the "ctime" time-stamp).
The aim is to fulfill a NFSv4 requirement for rfc3530.
This first part concerns the vfs, it converts the 32-bit i_version in
the generic inode to a 64-bit, a flag is added in the super block in
order to check if the feature is enabled and the i_version is
incremented in the vfs.
Signed-off-by: NMingming Cao <cmm@us.ibm.com>
Signed-off-by: NJean Noel Cordenner <jean-noel.cordenner@bull.net>
Signed-off-by: NKalpak Shah <kalpak@clusterfs.com>

7a224228

ext4: Add the journal checksum feature · 818d276c

由 Girish Shilamkar 提交于 1月 28, 2008

The journal checksum feature adds two new flags i.e
JBD2_FEATURE_INCOMPAT_ASYNC_COMMIT and JBD2_FEATURE_COMPAT_CHECKSUM.

JBD2_FEATURE_CHECKSUM flag indicates that the commit block contains the
checksum for the blocks described by the descriptor blocks.
Due to checksums, writing of the commit record no longer needs to be
synchronous. Now commit record can be sent to disk without waiting for
descriptor blocks to be written to disk. This behavior is controlled
using JBD2_FEATURE_ASYNC_COMMIT flag. Older kernels/e2fsck should not be
able to recover the journal with _ASYNC_COMMIT hence it is made
incompat.
The commit header has been extended to hold the checksum along with the
type of the checksum.

For recovery in pass scan checksums are verified to ensure the sanity
and completeness(in case of _ASYNC_COMMIT) of every transaction.
Signed-off-by: NAndreas Dilger <adilger@clusterfs.com>
Signed-off-by: NGirish Shilamkar <girish@clusterfs.com>
Signed-off-by: NDave Kleikamp <shaggy@linux.vnet.ibm.com>
Signed-off-by: NMingming Cao <cmm@us.ibm.com>

818d276c

jbd2: jbd2 stats through procfs · 8e85fb3f

由 Johann Lombardi 提交于 1月 28, 2008

The patch below updates the jbd stats patch to 2.6.20/jbd2.
The initial patch was posted by Alex Tomas in December 2005
(http://marc.info/?l=linux-ext4&m=113538565128617&w=2).
It provides statistics via procfs such as transaction lifetime and size.

Sometimes, investigating performance problems, i find useful to have
stats from jbd about transaction's lifetime, size, etc. here is a
patch for review and inclusion probably.

for example, stats after creation of 3M files in htree directory:

[root@bob ~]# cat /proc/fs/jbd/sda/history
R/C  tid   wait  run   lock  flush log   hndls  block inlog ctime write drop  close
R    261   8260  2720  0     0     750   9892   8170  8187
C    259                                                    750   0     4885  1
R    262   20    2200  10    0     770   9836   8170  8187
R    263   30    2200  10    0     3070  9812   8170  8187
R    264   0     5000  10    0     1340  0      0     0
C    261                                                    8240  3212  4957  0
R    265   8260  1470  0     0     4640  9854   8170  8187
R    266   0     5000  10    0     1460  0      0     0
C    262                                                    8210  2989  4868  0
R    267   8230  1490  10    0     4440  9875   8171  8188
R    268   0     5000  10    0     1260  0      0     0
C    263                                                    7710  2937  4908  0
R    269   7730  1470  10    0     3330  9841   8170  8187
R    270   0     5000  10    0     830   0      0     0
C    265                                                    8140  3234  4898  0
C    267                                                    720   0     4849  1
R    271   8630  2740  20    0     740   9819   8170  8187
C    269                                                    800   0     4214  1
R    272   40    2170  10    0     830   9716   8170  8187
R    273   40    2280  0     0     3530  9799   8170  8187
R    274   0     5000  10    0     990   0      0     0


where,

R     - line for transaction's life from T_RUNNING to T_FINISHED
C     - line for transaction's checkpointing
tid   - transaction's id
wait  - for how long we were waiting for new transaction to start
         (the longest period journal_start() took in this transaction)
run   - real transaction's lifetime (from T_RUNNING to T_LOCKED
lock  - how long we were waiting for all handles to close
         (time the transaction was in T_LOCKED)
flush - how long it took to flush all data (data=ordered)
log   - how long it took to write the transaction to the log
hndls - how many handles got to the transaction
block - how many blocks got to the transaction
inlog - how many blocks are written to the log (block + descriptors)
ctime - how long it took to checkpoint the transaction
write - how many blocks have been written during checkpointing
drop  - how many blocks have been dropped during checkpointing
close - how many running transactions have been closed to checkpoint this one

all times are in msec.


[root@bob ~]# cat /proc/fs/jbd/sda/info
280 transaction, each upto 8192 blocks
average:
  1633ms waiting for transaction
  3616ms running transaction
  5ms transaction was being locked
  1ms flushing data (in ordered mode)
  1799ms logging transaction
  11781 handles per transaction
  5629 blocks per transaction
  5641 logged blocks per transaction
Signed-off-by: NJohann Lombardi <johann.lombardi@bull.net>
Signed-off-by: NMariusz Kozlowski <m.kozlowski@tuxland.pl>
Signed-off-by: NMingming Cao <cmm@us.ibm.com>
Signed-off-by: NEric Sandeen <sandeen@redhat.com>

8e85fb3f

ext4: Take read lock during overwrite case. · 4df3d265

由 Aneesh Kumar K.V 提交于 1月 28, 2008

When we are overwriting a file and not actually allocating new file system
blocks we need to take only the read lock on i_data_sem.
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>

4df3d265

ext4: Convert truncate_mutex to read write semaphore. · 0e855ac8

由 Aneesh Kumar K.V 提交于 1月 28, 2008

We are currently taking the truncate_mutex for every read. This would have
performance impact on large CPU configuration. Convert the lock to read write
semaphore and take read lock when we are trying to read the file.
Signed-off-by: NAneesh Kumar K.V <aneesh.kumar@linux.vnet.ibm.com>

0e855ac8

OpenHarmony / kernel_linux 上一次同步 3 年多

OpenHarmony / kernel_linux
上一次同步 3 年多