提交 · eee194e76c681dbdbf5024b889fda1181b66ef57 · openanolis / cloud-kernel

27 9月, 2006 6 次提交

[PATCH] ext3: inode numbers are unsigned long · eee194e7

由 Eric Sandeen 提交于 9月 27, 2006

This is primarily format string fixes, with changes to ialloc.c where large
inode counts could overflow, and also pass around journal_inum as an
unsigned long, just to be pedantic about it....
Signed-off-by: NEric Sandeen <esandeen@redhat.com>
Cc: Mingming Cao <cmm@us.ibm.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

eee194e7

[PATCH] ext2: fix mounts at 16T · 41f04d85

由 Eric Sandeen 提交于 9月 27, 2006

Signed-off-by: NEric Sandeen <esandeen@redhat.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

41f04d85

[PATCH] fix ext3 mounts at 16T · 855565e8

由 Eric Sandeen 提交于 9月 27, 2006

I need to do some actual IO testing now, but this gets things mounting for
a 16T ext3 filesystem.  (patched up e2fsprogs is needed too, I'll send that
off the kernel list)

This patch fixes these issues in the kernel:

o sbi->s_groups_count overflows in ext3_fill_super()

	sbi->s_groups_count = (le32_to_cpu(es->s_blocks_count) -
			       le32_to_cpu(es->s_first_data_block) +
			       EXT3_BLOCKS_PER_GROUP(sb) - 1) /
			      EXT3_BLOCKS_PER_GROUP(sb);

  at 16T, s_blocks_count is already maxed out; adding
  EXT3_BLOCKS_PER_GROUP(sb) overflows it and groups_count comes out to 0.
  Not really what we want, and causes a failed mount.

  Feel free to check my math (actually, please do!), but changing it this
  way should work & avoid the overflow:

  (A + B - 1)/B changed to: ((A - 1)/B) + 1

o ext3_check_descriptors() overflows range checks

  ext3_check_descriptors() iterates over all block groups making sure
  that various bits are within the right block ranges...  on the last pass
  through, it is checking the error case

   [item] >= block + EXT3_BLOCKS_PER_GROUP(sb)

  where "block" is the first block in the last block group.  The last
  block in this group (and the last one that will fit in 32 bits) is block
  + EXT3_BLOCKS_PER_GROUP(sb)- 1.  block + EXT3_BLOCKS_PER_GROUP(sb) wraps
  back around to 0.

  so, make things clearer with "first_block" and "last_block" where those
  are first and last, inclusive, and use <, > rather than <, >=.

  Finally, the last block group may be smaller than the rest, so account
  for this on the last pass through: last_block = sb->s_blocks_count - 1;

(a similar patch could be done for ext2; does anyone in their right mind
use ext2 at 16T?  I'll send an ext2 patch doing the same thing if that's
warranted)
Signed-off-by: NEric Sandeen <esandeen@redhat.com>
Cc: Mingming Cao <cmm@us.ibm.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

855565e8

[PATCH] jbd: use BUILD_BUG_ON in journal init · 2aed3484

由 Alexey Dobriyan 提交于 9月 27, 2006

Signed-off-by: NAlexey Dobriyan <adobriyan@gmail.com>
Acked-by: NStephen Tweedie <sct@redhat.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

2aed3484

[PATCH] ext3 and jbd cleanup: remove whitespace · ae6ddcc5

由 Mingming Cao 提交于 9月 27, 2006

Remove whitespace from ext3 and jbd, before we clone ext4.

Signed-off-by: Mingming Cao<cmm@us.ibm.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

ae6ddcc5

[PATCH] jbd: add lock annotation to jbd_sync_bh · e7ab8d65

由 Josh Triplett 提交于 9月 27, 2006

jbd_sync_bh releases journal->j_list_lock.  Add a lock annotation to this
function so that sparse can check callers for lock pairing, and so that
sparse will not complain about this function since it intentionally uses
the lock in this manner.
Signed-off-by: NJosh Triplett <josh@freedesktop.org>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

e7ab8d65

26 9月, 2006 13 次提交

[PATCH] binfmt_elf: consistently use loff_t · 8d6b5eee

由 Andrew Morton 提交于 9月 25, 2006

As David Howells <dhowells@redhat.com> points out, binfmt_elf sometimes uses
off_t, sometimes uses loff_t.  Use loff_t throughout.
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

8d6b5eee

[PATCH] ZVC: Support NR_SLAB_RECLAIMABLE / NR_SLAB_UNRECLAIMABLE · 972d1a7b

由 Christoph Lameter 提交于 9月 25, 2006

Remove the atomic counter for slab_reclaim_pages and replace the counter
and NR_SLAB with two ZVC counter that account for unreclaimable and
reclaimable slab pages: NR_SLAB_RECLAIMABLE and NR_SLAB_UNRECLAIMABLE.

Change the check in vmscan.c to refer to to NR_SLAB_RECLAIMABLE. The
intend seems to be to check for slab pages that could be freed.
Signed-off-by: NChristoph Lameter <clameter@sgi.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

972d1a7b

[PATCH] reduce MAX_NR_ZONES: make display of highmem counters conditional on CONFIG_HIGHMEM · 182e8e23

由 Christoph Lameter 提交于 9月 25, 2006

Do not display HIGHMEM memory sizes if CONFIG_HIGHMEM is not set.

Make HIGHMEM dependent texts and make display of highmem counters optional

Some texts are depending on CONFIG_HIGHMEM.

Remove those strings and remove the display of highmem counter values if
CONFIG_HIGHMEM is not set.

[akpm@osdl.org: remove some ifdefs]
Signed-off-by: NChristoph Lameter <clameter@sgi.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

182e8e23

[PATCH] mm: tracking shared dirty pages · d08b3851

由 Peter Zijlstra 提交于 9月 25, 2006

Tracking of dirty pages in shared writeable mmap()s.

The idea is simple: write protect clean shared writeable pages, catch the
write-fault, make writeable and set dirty.  On page write-back clean all the
PTE dirty bits and write protect them once again.

The implementation is a tad harder, mainly because the default
backing_dev_info capabilities were too loosely maintained.  Hence it is not
enough to test the backing_dev_info for cap_account_dirty.

The current heuristic is as follows, a VMA is eligible when:
 - its shared writeable
    (vm_flags & (VM_WRITE|VM_SHARED)) == (VM_WRITE|VM_SHARED)
 - it is not a 'special' mapping
    (vm_flags & (VM_PFNMAP|VM_INSERTPAGE)) == 0
 - the backing_dev_info is cap_account_dirty
    mapping_cap_account_dirty(vma->vm_file->f_mapping)
 - f_op->mmap() didn't change the default page protection

Page from remap_pfn_range() are explicitly excluded because their COW
semantics are already horrid enough (see vm_normal_page() in do_wp_page()) and
because they don't have a backing store anyway.

mprotect() is taught about the new behaviour as well.  However it overrides
the last condition.

Cleaning the pages on write-back is done with page_mkclean() a new rmap call.
It can be called on any page, but is currently only implemented for mapped
pages, if the page is found the be of a VMA that accounts dirty pages it will
also wrprotect the PTE.

Finally, in fs/buffers.c:try_to_free_buffers(); remove clear_page_dirty() from
under ->private_lock.  This seems to be safe, since ->private_lock is used to
serialize access to the buffers, not the page itself.  This is needed because
clear_page_dirty() will call into page_mkclean() and would thereby violate
locking order.

[dhowells@redhat.com: Provide a page_mkclean() implementation for NOMMU]
Signed-off-by: NPeter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Hugh Dickins <hugh@veritas.com>
Signed-off-by: NDavid Howells <dhowells@redhat.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

d08b3851

[PATCH] jbd: fix commit of ordered data buffers · 3998b930

由 Jan Kara 提交于 9月 25, 2006

Original commit code assumes, that when a buffer on BJ_SyncData list is
locked, it is being written to disk.  But this is not true and hence it can
lead to a potential data loss on crash.  Also the code didn't count with
the fact that journal_dirty_data() can steal buffers from committing
transaction and hence could write buffers that no longer belong to the
committing transaction.  Finally it could possibly happen that we tried
writing out one buffer several times.

The patch below tries to solve these problems by a complete rewrite of the
data commit code.  We go through buffers on t_sync_datalist, lock buffers
needing write out and store them in an array.  Buffers are also immediately
refiled to BJ_Locked list or unfiled (if the write out is completed).  When
the array is full or we have to block on buffer lock, we submit all
accumulated buffers for IO.

[suitable for 2.6.18.x around the 2.6.19-rc2 timeframe]
Signed-off-by: NJan Kara <jack@suse.cz>
Cc: Badari Pulavarty <pbadari@us.ibm.com>
Cc: <stable@kernel.org>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

3998b930

[PATCH] Check return value of copy_to_user in compat_sys_pselect7 · 75833345

由 Andi Kleen 提交于 9月 26, 2006

Fix

linux/fs/compat.c: In function compat_sys_pselect7
linux/fs/compat.c:1869: warning: ignoring return value of copy_to_user, declared with attribute warn_unused_result

To make it easier to handle I changed to semantics to not try to
write out a timespec if an error occurred. I hope that's ok.

Cc: dwmw2@infradead.org
Signed-off-by: NAndi Kleen <ak@suse.de>

75833345

A
[PATCH] i386/x86-64: Don't randomize stack top when no randomization personality is set · c16b63e0
由 Andi Kleen 提交于 9月 26, 2006
```
Based on patch from Frank van Maarseveen <frankvm@frankvm.com>, but
extended.
Signed-off-by: NAndi Kleen <ak@suse.de>
```
c16b63e0

sysfs: add proper sysfs_init() prototype · f20a9ead

由 Andrew Morton 提交于 8月 14, 2006

Don't be crufty.  Mark it __must_check too.

Cc: "Randy.Dunlap" <rdunlap@xenotime.net>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NGreg Kroah-Hartman <gregkh@suse.de>

f20a9ead

sysfs_remove_bin_file: no return value, dump_stack on error · 995982ca

由 Randy.Dunlap 提交于 7月 10, 2006

Make sysfs_remove_bin_file() void.  If it detects an error,
printk the file name and call dump_stack().

sysfs_hash_and_remove() now returns an error code indicating
its success or failure so that sysfs_remove_bin_file() can
know success/failure.

Convert the only driver that checked the return value of
sysfs_remove_bin_file().
Signed-off-by: NRandy Dunlap <rdunlap@xenotime.net>
Signed-off-by: NGreg Kroah-Hartman <gregkh@suse.de>

995982ca

G
SYSFS: allow sysfs_create_link to create symlinks in the root of sysfs · ceeee1fb
由 Greg Kroah-Hartman 提交于 4月 09, 2002
```
This is needed to make the compatible link for /sys/block in the future.
Signed-off-by: NGreg Kroah-Hartman <gregkh@suse.de>
```
ceeee1fb

Debugfs: kernel-doc fixes for debugfs · 6468b3af

由 Randy Dunlap 提交于 7月 20, 2006

Fix kernel-doc and typos/spellos in fs/debugfs/.
Signed-off-by: NRandy Dunlap <rdunlap@xenotime.net>
Signed-off-by: NGreg Kroah-Hartman <gregkh@suse.de>

6468b3af

sysfs: Make poll behaviour consistent · eea3f891

由 Juha Yrjl 提交于 8月 03, 2006

When no events have been reported by sysfs_notify(), sd->s_events
was previously set to zero.  The initial value for new readers is
also zero, so poll was blocking, regardless of whether the attribute
was read by the process or not.

Make poll behave consistently by setting the initial value of
sd->s_events to non-zero.
Signed-off-by: NJuha Yrjola <juha.yrjola@solidboot.com>
Signed-off-by: NGreg Kroah-Hartman <gregkh@suse.de>

eea3f891

[PATCH] autofs4: zero timeout prevents shutdown · c0ba7e51

由 Ian Kent 提交于 9月 25, 2006

If the timeout of an autofs mount is set to zero then umounts are disabled.
 This works fine, however the kernel module checks the expire timeout and
goes no further if it is zero.  This is not the right thing to do at
shutdown as the module is passed an option to expire mounts regardless of
their timeout setting.

This patch allows autofs to honor the force expire option.
Signed-off-by: NIan Kent <raven@themaw.net>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

c0ba7e51

25 9月, 2006 21 次提交

ocfs2: Teach ocfs2_drop_lock() to use ->set_lvb() callback · 0d5dc6c2

由 Mark Fasheh 提交于 9月 14, 2006

With this, we don't need to pass an additional struct with function pointer.

Now that the callbacks are fully used, comment the remaining API.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

0d5dc6c2

ocfs2: Remove ->unblock lockres operation · b5e500e2

由 Mark Fasheh 提交于 9月 13, 2006

Have ocfs2_process_blocked_lock() call ocfs2_generic_unblock_lock(), which
gets to be ocfs2_unblock_lock() now that it's the only possible unblock
function.

Remove the ->unblock() callback from the structure, and all lock type
specific unblock functions.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

b5e500e2

ocfs2: move downconvert worker to lockres ops · cc567d89

由 Mark Fasheh 提交于 9月 13, 2006

This way lock types don't have to manually pass it to
ocfs2_generic_unblock_lock().
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

cc567d89

ocfs2: Remove unused dlmglue functions · 08280f11

由 Mark Fasheh 提交于 9月 13, 2006

The meta data unblocking code no longer needs ocfs2_do_unblock_meta() or
ocfs2_can_downconvert_meta_lock(), so remove them.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

08280f11

ocfs2: Have the metadata lock use generic dlmglue functions · 810d5aeb

由 Mark Fasheh 提交于 9月 13, 2006

Fill in the ->check_downconvert and ->set_lvb callbacks with meta data
specific operations and switch ocfs2_unblock_meta() to call
ocfs2_generic_unblock_lock()
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

810d5aeb

ocfs2: Add ->set_lvb callback in dlmglue · 5ef0d4ea

由 Mark Fasheh 提交于 9月 13, 2006

This allows a lock type to set the value block before downconvert.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

5ef0d4ea

ocfs2: Add ->check_downconvert callback in dlmglue · 16d5b956

由 Mark Fasheh 提交于 9月 13, 2006

This will allow lock types to force a requeue of a lock downconvert.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

16d5b956

M
ocfs2: Check for refreshing locks in generic unblock function · f7fbfdd1
由 Mark Fasheh 提交于 9月 13, 2006
```
Tidy up the exit path a bit too.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>
```
f7fbfdd1

ocfs2: don't unconditionally pass LVB flags · b80fc012

由 Mark Fasheh 提交于 9月 12, 2006

Allow a lock type to specifiy whether it makes use of the LVB. The only type
which does this right now is the meta data lock. This should save us some
space on network messages since they won't have to needlessly transmit value
blocks.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

b80fc012

ocfs2: combine inode and generic blocking AST functions · aa2623ad

由 Mark Fasheh 提交于 9月 12, 2006

There is extremely little difference between the two now. We can remove the
callback from ocfs2_lock_res_ops as well.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

aa2623ad

ocfs2: Add ->get_osb() dlmglue locking operation · 54a7e755

由 Mark Fasheh 提交于 9月 12, 2006

Will be used to find the ocfs2_super structure from a given lockres.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

54a7e755

ocfs2: remove ->unlock_ast() callback from ocfs2_lock_res_ops · 2a45f2d1

由 Mark Fasheh 提交于 9月 12, 2006

This was always defined to the same function in all locks, so clean things
up by removing and passing ocfs2_unlock_ast() directly to the DLM.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

2a45f2d1

ocfs2: combine inode and generic AST functions · e92d57df

由 Mark Fasheh 提交于 9月 12, 2006

There is extremely little difference between the two now. We can remove the
callback from ocfs2_lock_res_ops as well.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

e92d57df

ocfs2: Clean up lock resource refresh flags · f625c979

由 Mark Fasheh 提交于 9月 12, 2006

Use of the refresh mechanism is lock-type wide, so move knowledge of that to
the ocfs2_lock_res_ops structure.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

f625c979

ocfs2: Remove i_generation from inode lock names · 24c19ef4

由 Mark Fasheh 提交于 9月 22, 2006

OCFS2 puts inode meta data in the "lock value block" provided by the DLM.
Typically, i_generation is encoded in the lock name so that a deleted inode
on and a new one in the same block don't share the same lvb.

Unfortunately, that scheme means that the read in ocfs2_read_locked_inode()
is potentially thrown away as soon as the meta data lock is taken - we
cannot encode the lock name without first knowing i_generation, which
requires a disk read.

This patch encodes i_generation in the inode meta data lvb, and removes the
value from the inode meta data lock name. This way, the read can be covered
by a lock, and at the same time we can distinguish between an up to date and
a stale LVB.

This will help cold-cache stat(2) performance in particular.

Since this patch changes the protocol version, we take the opportunity to do
a minor re-organization of two of the LVB fields.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

24c19ef4

ocfs2: Encode i_generation in the meta data lvb · f9e2d82e

由 Mark Fasheh 提交于 9月 12, 2006

When i_generation is removed from the lockname, this will help us determine
whether a meta data lvb has information that is in sync with the local
struct inode.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

f9e2d82e

ocfs2: Free up some space in the lvb · 4d3b83f7

由 Mark Fasheh 提交于 9月 12, 2006

lvb_version doesn't need to be a whole 32 bits. Make it an 8 bit field to
free up some space. This should be backwards compatible until we use one of
the fields, in which case we'd bump the lvb version anyway.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

4d3b83f7

ocfs2: Remove special casing for inode creation in ocfs2_dentry_attach_lock() · 0027dd5b

由 Mark Fasheh 提交于 9月 21, 2006

We can't use LKM_LOCAL for new dentry locks because an unlink and subsequent
re-create of a name/inode pair may result in the lock still being mastered
somewhere in the cluster.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

0027dd5b

ocfs2: manually d_move() during ocfs2_rename() · 1ba9da2f

由 Mark Fasheh 提交于 9月 08, 2006

Make use of FS_RENAME_DOES_D_MOVE to avoid a race condition that can occur
during ->rename() if we d_move() outside of the parent directory cluster
locks, and another node discovers the new name (created during the rename)
and unlinks it. d_move() will unconditionally rehash a dentry - which will
leave stale data in the system.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

1ba9da2f

[PATCH] Allow file systems to manually d_move() inside of ->rename() · 349457cc

由 Mark Fasheh 提交于 9月 08, 2006

Some file systems want to manually d_move() the dentries involved in a
rename.  We can do this by making use of the FS_ODD_RENAME flag if we just
have nfs_rename() unconditionally do the d_move().  While there, we rename
the flag to be more descriptive.

OCFS2 uses this to protect that part of the rename operation with a cluster
lock.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Cc: Al Viro <viro@zeniv.linux.org.uk>
Cc: Christoph Hellwig <hch@lst.de>
Signed-off-by: NAndrew Morton <akpm@osdl.org>

349457cc

M
ocfs2: Remove the dentry vote · 1390334b
由 Mark Fasheh 提交于 9月 08, 2006
```
This is unused now.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>
```
1390334b

openanolis / cloud-kernel 大约 1 年 前同步成功

openanolis / cloud-kernel
大约 1 年前同步成功