提交 · b657c95c11088d77fc1bfc9c84d940f778bf9d12 · openanolis / cloud-kernel

06 1月, 2009 1 次提交

ocfs2: Wrap inode block reads in a dedicated function. · b657c95c

由 Joel Becker 提交于 11月 13, 2008

The ocfs2 code currently reads inodes off disk with a simple
ocfs2_read_block() call.  Each place that does this has a different set
of sanity checks it performs.  Some check only the signature.  A couple
validate the block number (the block read vs di->i_blkno).  A couple
others check for VALID_FL.  Only one place validates i_fs_generation.  A
couple check nothing.  Even when an error is found, they don't all do
the same thing.

We wrap inode reading into ocfs2_read_inode_block().  This will validate
all the above fields, going readonly if they are invalid (they never
should be).  ocfs2_read_inode_block_full() is provided for the places
that want to pass read_block flags.  Every caller is passing a struct
inode with a valid ip_blkno, so we don't need a separate blkno argument
either.

We will remove the validation checks from the rest of the code in a
later commit, as they are no longer necessary.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

b657c95c

15 10月, 2008 3 次提交

ocfs2: Make cached block reads the common case. · d4a8c93c

由 Joel Becker 提交于 10月 09, 2008

ocfs2_read_blocks() currently requires the CACHED flag for cached I/O.
However, that's the common case.  Let's flip it around and provide an
IGNORE_CACHE flag for the special users.  This has the added benefit of
cleaning up the code some (ignore_cache takes on its special meaning
earlier in the loop).
Signed-off-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

d4a8c93c

ocfs2: Simplify ocfs2_read_block() · 0fcaa56a

由 Joel Becker 提交于 10月 09, 2008

More than 30 callers of ocfs2_read_block() pass exactly OCFS2_BH_CACHED.
Only six pass a different flag set. Rather than have every caller care,
let's make ocfs2_read_block() take no flags and always do a cached read.
The remaining six places can call ocfs2_read_blocks() directly.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

0fcaa56a

ocfs2: Require an inode for ocfs2_read_block(s)(). · 31d33073

由 Joel Becker 提交于 10月 09, 2008

Now that synchronous readers are using ocfs2_read_blocks_sync(), all
callers of ocfs2_read_blocks() are passing an inode.  Use it
unconditionally.  Since it's there, we don't need to pass the
ocfs2_super either.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

31d33073

14 10月, 2008 6 次提交

M
ocfs2: Don't check for NULL before brelse() · a81cb88b
由 Mark Fasheh 提交于 10月 07, 2008
```
This is pointless as brelse() already does the check.

Signed-off-by: Mark Fasheh
```
a81cb88b

ocfs2: make la_debug_mutex static · 4cc81245

由 Mark Fasheh 提交于 10月 07, 2008

It can also be moved into ocfs2_la_debug_read().
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

4cc81245

ocfs2: Limit inode allocation to 32bits. · 1187c968

由 Joel Becker 提交于 9月 03, 2008

ocfs2 inode numbers are block numbers.  For any filesystem with less
than 2^32 blocks, this is not a problem.  However, when ocfs2 starts
using JDB2, it will be able to support filesystems with more than 2^32
blocks.  This would result in inode numbers higher than 2^32.

The problem is that stat(2) can't handle those numbers on 32bit
machines.  The simple solution is to have ocfs2 allocate all inodes
below that boundary.

The suballoc code is changed to honor an optional block limit.  Only the
inode suballocator sets that limit - all other allocations stay unlimited.

The biggest trick is to grow the inode suballocator beneath that limit.
There's no point in allocating block groups that are above the limit,
then rejecting their elements later on.  We want to prevent the inode
allocator from ever having block groups above the limit.  This involves
a little gyration with the local alloc code.  If the local alloc window
is above the limit, it signals the caller to try the global bitmap but
does not disable the local alloc file (which can be used for other
allocations).

[ Minor cleanup - removed an ML_NOTICE comment. --Mark ]
Signed-off-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

1187c968

ocfs2: track local alloc state via debugfs · 9a8ff578

由 Mark Fasheh 提交于 7月 29, 2008

A per-mount debugfs file, "local_alloc" is created which when read will
expose live state of the nodes local alloc file. Performance impact is
minimal, only a bit of memory overhead per mount point. Still, the code is
hidden behind CONFIG_OCFS2_FS_STATS. This feature will help us debug
local alloc performance problems on a live system.
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

9a8ff578

ocfs2: throttle back local alloc when low on disk space · 9c7af40b

由 Mark Fasheh 提交于 7月 28, 2008

Ocfs2's local allocator disables itself for the duration of a mount point
when it has trouble allocating a large enough area from the primary bitmap.
That can cause performance problems, especially for disks which were only
temporarily full or fragmented. This patch allows for the allocator to
shrink it's window first, before being disabled. Later, it can also be
re-enabled so that any performance drop is minimized.

To do this, we allow the value of osb->local_alloc_bits to be shrunk when
needed. The default value is recorded in a mostly read-only variable so that
we can re-initialize when required.

Locking had to be updated so that we could protect changes to
local_alloc_bits. Mostly this involves protecting various local alloc values
with the osb spinlock. A new state is also added, OCFS2_LA_THROTTLED, which
is used when the local allocator is has shrunk, but is not disabled. If the
available space dips below 1 megabyte, the local alloc file is disabled. In
either case, local alloc is re-enabled 30 seconds after the event, or when
an appropriate amount of bits is seen in the primary bitmap.
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

9c7af40b

ocfs2: Track local alloc bits internally · ebcee4b5

由 Mark Fasheh 提交于 7月 28, 2008

Do this instead of tracking absolute local alloc size. This avoids
needless re-calculatiion of bits from bytes in localalloc.c. Additionally,
the value is now in a more natural unit for internal file system bitmap
work.
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

ebcee4b5

15 7月, 2008 1 次提交

ocfs2: Fix CONFIG_OCFS2_DEBUG_FS #ifdefs · e407e397

由 Joel Becker 提交于 6月 12, 2008

A couple places use OCFS2_DEBUG_FS where they really mean
CONFIG_OCFS2_DEBUG_FS.
Reported-by: NRobert P. J. Day <rpjday@crashcourse.ca>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

e407e397

01 5月, 2008 1 次提交

ocfs2: Use GFP_NOFS in kmalloc during localalloc window move · 4ba1c5bf

由 Sunil Mushran 提交于 4月 18, 2008

kmalloc() during a localalloc window move can trigger the mm to prune
the dcache which inturn can trigger the fs to delete an inode causing
it start a recursive transaction.

The fix also makes the change in kmalloc during localalloc shutdown
just to be safe.

Fixes oss bugzilla#901
http://oss.oracle.com/bugzilla/show_bug.cgi?id=901Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

4ba1c5bf

18 4月, 2008 2 次提交

ocfs2: Add inode stealing for ocfs2_reserve_new_inode · 4d0ddb2c

由 Tao Ma 提交于 3月 05, 2008

Inode allocation is modified to look in other nodes allocators during
extreme out of space situations. We retry our own slot when space is freed
back to the global bitmap, or whenever we've allocated more than 1024 inodes
from another slot.
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

4d0ddb2c

ocfs2: Add ac_alloc_slot in ocfs2_alloc_context · a4a48911

由 Tao Ma 提交于 3月 03, 2008

In inode stealing, we no longer restrict the allocation to
happen in the local node. So it is neccessary for us to add
a new member in ocfs2_alloc_context to indicate which slot
we are using for allocation. We also modify the process of
local alloc so that this member can be used there also.
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

a4a48911

04 3月, 2008 2 次提交

[PATCH] ocfs2: le*_add_cpu conversion · 0dd3256e

由 Marcin Slusarz 提交于 2月 13, 2008

replace all:
little_endian_variable = cpu_to_leX(leX_to_cpu(little_endian_variable) +
					expression_in_cpu_byteorder);
with:
	leX_add_cpu(&little_endian_variable, expression_in_cpu_byteorder);
generated with semantic patch
Signed-off-by: NMarcin Slusarz <marcin.slusarz@gmail.com>
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

0dd3256e

ocfs2: Enable localalloc for local mounts · 7ad8b3d3

由 Sunil Mushran 提交于 2月 06, 2008

Commit 2fbe8d1e disabled localalloc
for local mounts. This caused issues as ocfs2 uses localalloc to
provide write locality. This patch enables localalloc for local mounts.
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

7ad8b3d3

26 1月, 2008 2 次提交

ocfs2: Local alloc window size changeable via mount option · 2fbe8d1e

由 Sunil Mushran 提交于 12月 20, 2007

Local alloc is a performance optimization in ocfs2 in which a node
takes a window of bits from the global bitmap and then uses that for
all small local allocations. This window size is fixed to 8MB currently.
This patch allows users to specify the window size in MB including
disabling it by passing in 0. If the number specified is too large,
the fs will use the default value of 8MB.

mount -o localalloc=X /dev/sdX /mntpoint
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

2fbe8d1e

ocfs2: Rename ocfs2_meta_[un]lock · e63aecb6

由 Mark Fasheh 提交于 10月 18, 2007

Call this the "inode_lock" now, since it covers both data and meta data.
This patch makes no functional changes.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

e63aecb6

28 11月, 2007 1 次提交

[PATCH] ocfs2: Remove expensive bitmap scanning · 5a58c3ef

由 Jan Kara 提交于 11月 13, 2007

Enable expensive bitmap scanning only if DEBUG option is enabled.
The bitmap scanning quite loads the CPU and on my machine the write
throughput of dd if=/dev/zero of=/ocfs2/file bs=1M count=500 conv=sync
improves from 37 MB/s to 45.4 MB/s in local mode...
Signed-off-by: NJan Kara <jack@suse.cz>
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

5a58c3ef

04 10月, 2007 1 次提交

ocfs2: Unlock mutex in local alloc failure case · bda0233b

由 Sunil Mushran 提交于 9月 21, 2007

The fs was not unlocking the local alloc inode mutex in the code path in
which it failed to find a window of free bits in the global bitmap.
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

bda0233b

21 9月, 2007 1 次提交

ocfs2: Allow smaller allocations during large writes · 415cb800

由 Mark Fasheh 提交于 9月 16, 2007

The ocfs2 write code loops through a page much like the block code, except
that ocfs2 allocation units can be any size, including larger than page
size. Typically it's equal to or larger than page size - most kernels run 4k
pages, the minimum ocfs2 allocation (cluster) size.

Some changes introduced during 2.6.23 changed the way writes to pages are
handled, and inadvertantly broke support for > 4k page size. Instead of just
writing one cluster at a time, we now handle the whole page in one pass.

This means that multiple (small) seperate allocations might happen in the
same pass. The allocation code howver typically optimizes by getting the
maximum which was reserved. This triggered a BUG_ON in the extend code where
it'd ask for a single bit (for one part of a > 4k page) and get back more
than it asked for.

Fix this by providing a variant of the high level allocation function which
allows the caller to specify a maximum. The traditional function remains and
just calls the new one with a maximum determined from the initial
reservation.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

415cb800

26 5月, 2007 1 次提交

ocfs2: fix inode leak · 8fccfc82

由 Mark Fasheh 提交于 5月 09, 2007

We weren't cleaning up our inode reference on error in
ocfs2_reserve_local_alloc_bits(). Add a check for error return and iput() if
need be. Move the code to set the alloc context inode info to the end of the
function so we don't have any possibility of passing back a bad pointer.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

8fccfc82

14 12月, 2006 1 次提交

[PATCH] Fix numerous kcalloc() calls, convert to kzalloc() · cd861280

由 Robert P. J. Day 提交于 12月 13, 2006

All kcalloc() calls of the form "kcalloc(1,...)" are converted to the
equivalent kzalloc() calls, and a few kcalloc() calls with the incorrect
ordering of the first two arguments are fixed.
Signed-off-by: NRobert P. J. Day <rpjday@mindspring.com>
Cc: Jeff Garzik <jeff@garzik.org>
Cc: Alan Cox <alan@lxorguk.ukuu.org.uk>
Cc: Dominik Brodowski <linux@dominikbrodowski.net>
Cc: Adam Belay <ambx1@neo.rr.com>
Cc: James Bottomley <James.Bottomley@steeleye.com>
Cc: Greg KH <greg@kroah.com>
Cc: Mark Fasheh <mark.fasheh@oracle.com>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Cc: Neil Brown <neilb@suse.de>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

cd861280

02 12月, 2006 7 次提交

ocfs2: Remove struct ocfs2_journal_handle in favor of handle_t · 1fabe148

由 Mark Fasheh 提交于 10月 09, 2006

This is mostly a search and replace as ocfs2_journal_handle is now no more
than a container for a handle_t pointer.

ocfs2_commit_trans() becomes very straight forward, and we remove some out
of date comments / code.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

1fabe148

ocfs2: remove handle argument to ocfs2_start_trans() · 65eff9cc

由 Mark Fasheh 提交于 10月 09, 2006

All callers either pass in NULL directly, or a local variable that is
already set to NULL.

The internals of ocfs2_start_trans() get a nice cleanup as a result.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

65eff9cc

M
ocfs2: pass ocfs2_super * into ocfs2_commit_trans() · 02dc1af4
由 Mark Fasheh 提交于 10月 09, 2006
```
This sets us up to remove handle->journal.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>
```
02dc1af4

ocfs2: remove unused handle argument from ocfs2_meta_lock_full() · 4bcec184

由 Mark Fasheh 提交于 10月 09, 2006

Now that this is unused and all callers pass NULL, we can safely remove it.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

4bcec184

ocfs2: don't use handle for locking in allocation functions · da5cbf2f

由 Mark Fasheh 提交于 10月 06, 2006

Instead we record our state on the allocation context structure which all
callers already know about and lifetime correctly. This means the
reservation functions don't need a handle passed in any more, and we can
also take it off the alloc context.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

da5cbf2f

M
ocfs2: don't pass handle to ocfs2_meta_lock() in localalloc.c · 8898a5a5
由 Mark Fasheh 提交于 10月 05, 2006
```
Take and drop the locks directly.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>
```
8898a5a5

ocfs2: remove ocfs2_journal_handle flags field · c161f89b

由 Mark Fasheh 提交于 10月 05, 2006

Callers can set h_sync directly on the handle_t, whether a transaction has
been started or not can be determined via the existence of the handle_t on
the struct ocfs2_journal_handle.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

c161f89b

08 8月, 2006 1 次提交

ocfs2: allocation hints · 883d4cae

由 Mark Fasheh 提交于 6月 05, 2006

Record the most recently used allocation group on the allocation context, so
that subsequent allocations can attempt to optimize for contiguousness.
Local alloc especially should benefit from this as the current chain search
tends to let it spew across the disk.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>

883d4cae

25 3月, 2006 1 次提交
- M
  ocfs2: don't use MLF* in the file system · b0697053
  由 Mark Fasheh 提交于 3月 03, 2006
```
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>
```
  b0697053
10 1月, 2006 1 次提交

[PATCH] mutex subsystem, semaphore to mutex: VFS, ->i_sem · 1b1dcc1b

由 Jes Sorensen 提交于 1月 09, 2006

This patch converts the inode semaphore to a mutex. I have tested it on
XFS and compiled as much as one can consider on an ia64. Anyway your
luck with it might be different.
Modified-by: NIngo Molnar <mingo@elte.hu>

(finished the conversion)
Signed-off-by: NJes Sorensen <jes@sgi.com>
Signed-off-by: NIngo Molnar <mingo@elte.hu>

1b1dcc1b

04 1月, 2006 1 次提交

[PATCH] OCFS2: The Second Oracle Cluster Filesystem · ccd979bd

由 Mark Fasheh 提交于 12月 15, 2005

The OCFS2 file system module.
Signed-off-by: NMark Fasheh <mark.fasheh@oracle.com>
Signed-off-by: NKurt Hackel <kurt.hackel@oracle.com>

ccd979bd

openanolis / cloud-kernel 大约 1 年 前同步成功

openanolis / cloud-kernel
大约 1 年前同步成功