提交 · d1f3e68efb4c98fa229b39ff09a8984ef16cafa4 · openanolis / cloud-kernel

24 9月, 2010 1 次提交

ocfs2: Use cpu_to_le16 for e_leaf_clusters in ocfs2_bg_discontig_add_extent. · 47dea423

由 Tao Ma 提交于 9月 13, 2010

e_leaf_clusters is a le16, so use cpu_to_le16 instead
of cpu_to_le32.

What's more, we change 'clusters' to unsigned int to
signify that the size of 'clusters' isn't important here.
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

47dea423

08 9月, 2010 4 次提交

ocfs2: allow return of new inode block location before allocation of the inode · e49e2767

由 Mark Fasheh 提交于 8月 13, 2010

This allows code which needs to know the eventual block number of an inode
but can't allocate it yet due to transaction or lock ordering. For example,
ocfs2_create_inode_in_orphan() currently gives a junk blkno for preparation
of the orphan dir because it can't yet know where the actual inode is placed
- that code is actually in ocfs2_mknod_locked. This is a problem when the
orphan dirs are indexed as the junk inode number will create an index entry
which goes unused (and fails the later removal from the orphan dir).  Now
with these interfaces, ocfs2_create_inode_in_orphan() can run the block
group search (and get back the inode block number) *before* any actual
allocation occurs.
Signed-off-by: NMark Fasheh <mfasheh@suse.com>
Signed-off-by: NTao Ma <tao.ma@oracle.com>

e49e2767

ocfs2: use ocfs2_alloc_dinode_update_counts() instead of open coding · d5134982

由 Mark Fasheh 提交于 8月 13, 2010

ocfs2_search_chain() makes the same updates as
ocfs2_alloc_dinode_update_counts to the alloc inode. Instead of open coding
the bitmap update, use our helper function.
Signed-off-by: NMark Fasheh <mfasheh@suse.com>
Signed-off-by: NTao Ma <tao.ma@oracle.com>

d5134982

ocfs2: properly set and use inode group alloc hint · b2b6ebf5

由 Mark Fasheh 提交于 8月 26, 2010

We were setting ac->ac_last_group in ocfs2_claim_suballoc_bits from
res->sr_bg_blkno.  Unfortunately, res->sr_bg_blkno is going to be zero under
normal (non-fragmented) circumstances. The discontig block group patches
effectively turned off that feature. Fix this by correctly calculating what
the next group hint should be.
Acked-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>
Tested-by: NGoldwyn Rodrigues <rgoldwyn@suse.de>
Signed-off-by: NTao Ma <tao.ma@oracle.com>

b2b6ebf5

ocfs2: Use the right group in nfs sync check. · 889f004a

由 Tao Ma 提交于 9月 02, 2010

We have added discontig block group now, and now an inode
can be allocated in an discontig block group. So get
it in ocfs2_get_suballoc_slot_bit.

The old ocfs2_test_suballoc_bit gets group block no
from the allocation inode which is wrong. Fix it by
passing the right group.
Acked-by: NMark Fasheh <mfasheh@suse.com>
Signed-off-by: NTao Ma <tao.ma@oracle.com>

889f004a

13 7月, 2010 1 次提交

ocfs2: Remove the redundant cpu_to_le64. · 0a463b74

由 Tao Ma 提交于 7月 08, 2010

In ocfs2_block_group_alloc, we set c_blkno by bg->bg_blkno.
But actually bg->bg_blkno is already changed to little endian
in ocfs2_block_group_fill. So remove the extra cpu_to_le64.
Reported-by: NMarcos Matsunaga <Marcos.Matsunaga@oracle.com>
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

0a463b74

19 5月, 2010 1 次提交

ocfs2: Silence a gcc warning. · 18d3a98f

由 Joel Becker 提交于 5月 18, 2010

ocfs2_block_group_claim_bits() is never called with min_bits=0, but we
shouldn't leave status undefined if it ever is.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

18d3a98f

06 5月, 2010 4 次提交

ocfs2: remove ocfs2_local_alloc_in_range() · a57c8fd2

由 Mark Fasheh 提交于 3月 16, 2010

Inodes are always allocated from the global bitmap now so we don't need this
any more. Also, the existing implementation bounces reservations around
needlessly.
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

a57c8fd2

ocfs2: allocate btree internal block groups from the global bitmap · 33d5d380

由 Mark Fasheh 提交于 2月 24, 2010

Otherwise, the need for a very large contiguous allocation tends to
wreak havoc on many inode allocation reservations on the local alloc, thus
ruining any chances for contiguousness.
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

33d5d380

ocfs2: use allocation reservations for directory data · e3b4a97d

由 Mark Fasheh 提交于 12月 07, 2009

Use the reservations system for unindexed dir tree allocations. We don't
bother with the indexed tree as reads from it are mostly random anyway.
Directory reservations are marked seperately, to allow the reservations code
a chance to optimize their window sizes. This patch allocates only 8 bits
for directory windows as they generally are not expected to grow as quickly
as file data. Future improvements to dir window sizing can trivially be
made.
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

e3b4a97d

ocfs2: Make ocfs2_journal_dirty() void. · ec20cec7

由 Joel Becker 提交于 3月 19, 2010

jbd[2]_journal_dirty_metadata() only returns 0.  It's been returning 0
since before the kernel moved to git.  There is no point in checking
this error.

ocfs2_journal_dirty() has been faithfully returning the status since the
beginning.  All over ocfs2, we have blocks of code checking this can't
fail status.  In the past few years, we've tried to avoid adding these
checks, because they are pointless.  But anyone who looks at our code
assumes they are needed.

Finally, ocfs2_journal_dirty() is made a void function.  All error
checking is removed from other files.  We'll BUG_ON() the status of
jbd2_journal_dirty_metadata() just in case they change it someday.  They
won't.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

ec20cec7

24 3月, 2010 1 次提交

ocfs2: Clear undo bits when local alloc is freed · b4414eea

由 Mark Fasheh 提交于 3月 11, 2010

When the local alloc file changes windows, unused bits are freed back to the
global bitmap. By defnition, those bits can not be in use by any file. Also,
the local alloc will never have been able to allocate those bits if they
were part of a previous truncate. Therefore it makes sense that we should
clear unused local alloc bits in the undo buffer so that they can be used
immediatly.

[ Modified to call it ocfs2_release_clusters() -- Joel ]
Signed-off-by: NMark Fasheh <mfasheh@suse.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

b4414eea

27 4月, 2010 1 次提交

ocfs2: Set ac_last_group properly with discontig group. · abf1b3cb

由 Tao Ma 提交于 4月 27, 2010

ac_last_group is used to record the last block group we
used during allocation. But the initialization process
only calls ocfs2_which_suballoc_group and fails to
use suballoc_loc properly. So let us do it.
Another function ocfs2_test_suballoc_bit also needs fix.

I have searched all the callers of ocfs2_which_suballoc_group,
and all the callers notices suballoc_loc now.
Signed-off-by: NTao Ma <tao.ma@oracle.com>

abf1b3cb

22 3月, 2010 1 次提交

ocfs2: Free block to the right block group. · 74380c47

由 Tao Ma 提交于 3月 22, 2010

In case the block we are going to free is allocated from
a discontiguous block group, we have to use suballoc_loc
to be the right group.
Signed-off-by: NTao Ma <tao.ma@oracle.com>

74380c47

13 4月, 2010 1 次提交

ocfs2: ocfs2_group_bitmap_size has to handle old volume. · 8571882c

由 Tao Ma 提交于 4月 13, 2010

ocfs2_group_bitmap_size has to handle the case when the
volume don't have discontiguous block group support. So
pass the feature_incompat in and check it.
Signed-off-by: NTao Ma <tao.ma@oracle.com>

8571882c

22 4月, 2010 1 次提交

ocfs2: Some tiny bug fixes for discontiguous block allocation. · 4711954e

由 Tao Ma 提交于 4月 22, 2010

The fixes include:
1. some endian problems.
2. we should use bit/bpc in ocfs2_block_group_grow_discontig to
   allocate clusters.
3. set num_clusters properly in __ocfs2_claim_clusters.
4. change name from ocfs2_supports_discontig_bh to
   ocfs2_supports_discontig_bg.
Signed-off-by: NTao Ma <tao.ma@oracle.com>

4711954e

26 3月, 2010 4 次提交

ocfs2: Don't relink cluster groups when allocating discontig block groups · 95ec0adf

由 Joel Becker 提交于 3月 26, 2010

We don't have enough credits, and the filesystem is in a full state
anyway.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

95ec0adf

ocfs2: Grow discontig block groups in one transaction. · 8b06bc59

由 Joel Becker 提交于 3月 26, 2010

Rather than extending the transaction every time we add an extent to a
discontiguous block group, we grab enough credits to fill the extent
list up front.  This means we can free the bits in the same transaction
if we end up not getting enough space.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

8b06bc59

ocfs2: Set suballoc_loc on allocated metadata. · 2b6cb576

由 Joel Becker 提交于 3月 26, 2010

Get the suballoc_loc from ocfs2_claim_new_inode() or
ocfs2_claim_metadata().  Store it on the appropriate field of the block
we just allocated.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

2b6cb576

ocfs2: Return allocated metadata blknos on the ocfs2_suballoc_result. · ba206635

由 Joel Becker 提交于 3月 26, 2010

Rather than calculating the resulting block number, return it on the
ocfs2_suballoc_result structure.  This way we can calculate block
numbers for discontiguous block groups.

Cluster groups keep doing it the old way.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

ba206635

06 5月, 2010 1 次提交

ocfs2: ocfs2_claim_*() don't need an ocfs2_super argument. · 1ed9b777

由 Joel Becker 提交于 5月 06, 2010

They all take an ocfs2_alloc_context, which has the allocation inode.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NTao Ma <tao.ma@oracle.com>

1ed9b777

26 3月, 2010 2 次提交

ocfs2: Trim suballocations if they cross discontiguous regions · 13e434cf

由 Joel Becker 提交于 3月 26, 2010

A discontiguous block group can find a range of free bits that straddle
more than one region of its space. Callers can't handle that, so we
trim the returned bits until they fit within one region.

Only cluster allocations ask for min_bits>1. Discontiguous block groups
are only for block allocations. So min_bits doesn't matter here.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

13e434cf

J
ocfs2: ocfs2_claim_suballoc_bits() doesn't need an osb argument. · aa8f8e93
由 Joel Becker 提交于 3月 26, 2010
```
It's contained on ac->ac_inode->i_sb anyway.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>
```
aa8f8e93

13 4月, 2010 3 次提交

ocfs2: Pass suballocation results back via a structure. · 7d1fe093

由 Joel Becker 提交于 4月 13, 2010

We're going to be adding more info to a suballocator allocation.  Rather
than growing every function in the chain, let's pass a result structure
around.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NTao Ma <tao.ma@oracle.com>

7d1fe093

ocfs2: Allocate discontiguous block groups. · 798db35f

由 Joel Becker 提交于 4月 13, 2010

If we cannot get a contiguous region for a block group, allocate a
discontiguous one when the filesystem supports it.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NTao Ma <tao.ma@oracle.com>

798db35f

ocfs2: Define data structures for discontiguous block groups. · 4cbe4249

由 Joel Becker 提交于 4月 13, 2010

Defines the OCFS2_FEATURE_INCOMPAT_DISCONTIG_BG feature bit and modifies
struct ocfs2_group_desc for the feature.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NTao Ma <tao.ma@oracle.com>

4cbe4249

18 3月, 2010 1 次提交

ocfs2: Change bg_chain check for ocfs2_validate_gd_parent. · 78c37eb0

由 Tao Ma 提交于 3月 03, 2010

In ocfs2_validate_gd_parent, we check bg_chain against the
cl_next_free_rec of the dinode. Actually in resize, we have
the chance of bg_chain == cl_next_free_rec. So add some
additional condition check for it.

I also rename paramter "clean_error" to "resize", since the
old one is not clearly enough to indicate that we should only
meet with this case in resize.

btw, the correpsonding bug is
http://oss.oracle.com/bugzilla/show_bug.cgi?id=1230.
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

78c37eb0

27 2月, 2010 1 次提交

ocfs2: add extent block stealing for ocfs2 v5 · b89c5428

由 Tiger Yang 提交于 1月 25, 2010

This patch add extent block (metadata) stealing mechanism for
extent allocation. This mechanism is same as the inode stealing.
if no room in slot specific extent_alloc, we will try to
allocate extent block from the next slot.
Signed-off-by: NTiger Yang <tiger.yang@oracle.com>
Acked-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

b89c5428

05 9月, 2009 3 次提交

ocfs2: Pass ocfs2_caching_info to ocfs2_read_extent_block(). · 3d03a305

由 Joel Becker 提交于 2月 12, 2009

extent blocks belong to btrees on more than just inodes, so we want to
pass the ocfs2_caching_info structure directly to
ocfs2_read_extent_block().  A number of places in alloc.c can now drop
struct inode from their argument list.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

3d03a305

ocfs2: Pass struct ocfs2_caching_info to the journal functions. · 0cf2f763

由 Joel Becker 提交于 2月 12, 2009

The next step in divorcing metadata I/O management from struct inode is
to pass struct ocfs2_caching_info to the journal functions.  Thus the
journal locks a metadata cache with the cache io_lock function.  It also
can compare ci_last_trans and ci_created_trans directly.

This is a large patch because of all the places we change
ocfs2_journal_access..(handle, inode, ...) to
ocfs2_journal_access..(handle, INODE_CACHE(inode), ...).
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

0cf2f763

ocfs2: Take the inode out of the metadata read/write paths. · 8cb471e8

由 Joel Becker 提交于 2月 10, 2009

We are really passing the inode into the ocfs2_read/write_blocks()
functions to get at the metadata cache.  This commit passes the cache
directly into the metadata block functions, divorcing them from the
inode.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

8cb471e8

23 6月, 2009 1 次提交

ocfs2: Pin journal head before accessing jh->b_committed_data · 94e41ecf

由 Sunil Mushran 提交于 6月 19, 2009

This patch adds jbd_lock_bh_state() and jbd_unlock_bh_state() around accessses
to jh->b_committed_data.

Fixes oss bugzilla#1131
http://oss.oracle.com/bugzilla/show_bug.cgi?id=1131Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

94e41ecf

22 4月, 2009 2 次提交

ocfs2: Fix some printk() warnings. · 5b09b507

由 Joel Becker 提交于 4月 21, 2009

The old %llu vs u64 battle.  Cast them correctly.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

5b09b507

ocfs2: Fix 2 warning during ocfs2 make. · 0fba8137

由 Tao Ma 提交于 3月 19, 2009

fs/ocfs2/dir.c: In function ‘ocfs2_extend_dir’:
fs/ocfs2/dir.c:2700: warning: ‘ret’ may be used uninitialized in this function

fs/ocfs2/suballoc.c: In function ‘ocfs2_get_suballoc_slot_bit’:
fs/ocfs2/suballoc.c:2216: warning: comparison is always true due to limited range of data type
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

0fba8137

04 4月, 2009 4 次提交

ocfs2: fix rare stale inode errors when exporting via nfs · 6ca497a8

由 wengang wang 提交于 3月 06, 2009

For nfs exporting, ocfs2_get_dentry() returns the dentry for fh.
ocfs2_get_dentry() may read from disk when the inode is not in memory,
without any cross cluster lock. this leads to the file system loading a
stale inode.

This patch fixes above problem.

Solution is that in case of inode is not in memory, we get the cluster
lock(PR) of alloc inode where the inode in question is allocated from (this
causes node on which deletion is done sync the alloc inode) before reading
out the inode itsself. then we check the bitmap in the group (the inode in
question allcated from) to see if the bit is clear. if it's clear then it's
stale. if the bit is set, we then check generation as the existing code
does.

We have to read out the inode in question from disk first to know its alloc
slot and allot bit. And if its not stale we read it out using ocfs2_iget().
The second read should then be from cache.

And also we have to add a per superblock nfs_sync_lock to cover the lock for
alloc inode and that for inode in question. this is because ocfs2_get_dentry()
and ocfs2_delete_inode() lock on them in reverse order. nfs_sync_lock is locked
in EX mode in ocfs2_get_dentry() and in PR mode in ocfs2_delete_inode(). so
that mutliple ocfs2_delete_inode() can run concurrently in normal case.

[mfasheh@suse.com: build warning fixes and comment cleanups]
Signed-off-by: NWengang Wang <wen.gang.wang@oracle.com>
Acked-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

6ca497a8

ocfs2: Optimize inode group allocation by recording last used group. · feb473a6

由 Tao Ma 提交于 2月 25, 2009

In ocfs2, the block group search looks for the "emptiest" group
to allocate from. So if the allocator has many equally(or almost
equally) empty groups, new block group will tend to get spread
out amongst them.

So we add osb_inode_alloc_group in ocfs2_super to record the last
used inode allocation group.
For more details, please see
http://oss.oracle.com/osswiki/OCFS2/DesignDocs/InodeAllocationStrategy.

I have done some basic test and the results are a ten times improvement on
some cold-cache stat workloads.
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

feb473a6

ocfs2: Allocate inode groups from global_bitmap. · 60ca81e8

由 Tao Ma 提交于 2月 25, 2009

Inode groups used to be allocated from local alloc file,
but since we want all inodes to be contiguous enough, we
will try to allocate them directly from global_bitmap.
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

60ca81e8

ocfs2: Optimize inode allocation by remembering last group · 13821151

由 Tao Ma 提交于 2月 25, 2009

In ocfs2, the inode block search looks for the "emptiest" inode
group to allocate from. So if an inode alloc file has many equally
(or almost equally) empty groups, new inodes will tend to get
spread out amongst them, which in turn can put them all over the
disk. This is undesirable because directory operations on conceptually
"nearby" inodes force a large number of seeks.

So we add ip_last_used_group in core directory inodes which records
the last used allocation group. Another field named ip_last_used_slot
is also added in case inode stealing happens. When claiming new inode,
we passed in directory's inode so that the allocation can use this
information.
For more details, please see
http://oss.oracle.com/osswiki/OCFS2/DesignDocs/InodeAllocationStrategy.
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

13821151

06 1月, 2009 2 次提交

ocfs2: Use metadata-specific ocfs2_journal_access_*() functions. · 13723d00

由 Joel Becker 提交于 10月 17, 2008

The per-metadata-type ocfs2_journal_access_*() functions hook up jbd2
commit triggers and allow us to compute metadata ecc right before the
buffers are written out.  This commit provides ecc for inodes, extent
blocks, group descriptors, and quota blocks.  It is not safe to use
extened attributes and metaecc at the same time yet.

The ocfs2_extent_tree and ocfs2_path abstractions in alloc.c both hide
the type of block at their root.  Before, it didn't matter, but now the
root block must use the appropriate ocfs2_journal_access_*() function.
To keep this abstract, the structures now have a pointer to the matching
journal_access function and a wrapper call to call it.

A few places use naked ocfs2_write_block() calls instead of adding the
blocks to the journal.  We make sure to calculate their checksum and ecc
before the write.

Since we pass around the journal_access functions.  Let's typedef them
in ocfs2.h.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

13723d00

ocfs2: block read meta ecc. · d6b32bbb

由 Joel Becker 提交于 10月 17, 2008

Add block check calls to the read_block validate functions. This is the
almost all of the read-side checking of metaecc. xattr buckets are not checked
yet. Writes are also unchecked, and so a read-write mount will quickly fail.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

d6b32bbb

openanolis / cloud-kernel 大约 1 年 前同步成功

openanolis / cloud-kernel
大约 1 年前同步成功