提交 · c5d95df5f78312c879f3058059c98a08821897a5 · openanolis / cloud-kernel

27 2月, 2010 15 次提交

ocfs2: Let ocfs2_xa_prepare_entry() do space checks. · c5d95df5

由 Joel Becker 提交于 8月 18, 2009

ocfs2_xattr_set_in_bucket() doesn't need to do its own hacky space
checking.  Let's let ocfs2_xa_prepare_entry() (via ocfs2_xa_set()) do
the more accurate work.  Whenever it doesn't have space,
ocfs2_xattr_set_in_bucket() can try to get more space.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

c5d95df5

ocfs2: Gell into ocfs2_xa_set() · bca5e9bd

由 Joel Becker 提交于 8月 18, 2009

ocfs2_xa_set() wraps the ocfs2_xa_prepare_entry()/ocfs2_xa_store_value()
logic.  Both callers can now use the same routine.  ocfs2_xa_remove()
moves directly into ocfs2_xa_set().
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

bca5e9bd

ocfs2: Allocation in ocfs2_xa_prepare_entry(), values in ocfs2_xa_store_value() · 73857ee0

由 Joel Becker 提交于 8月 18, 2009

ocfs2_xa_prepare_entry() gets all the logic to add, remove, or modify
external value trees.  Now, when it exits, the entry is ready to receive
a value of any size.

ocfs2_xa_remove() is added to handle the complete removal of an entry.
It truncates the external value tree before calling
ocfs2_xa_remove_entry().

ocfs2_xa_store_inline_value() becomes ocfs2_xa_store_value().  It can
store any value.

ocfs2_xattr_set_entry() loses all the allocation logic and just uses
these functions.  ocfs2_xattr_set_value_outside() disappears.

ocfs2_xattr_set_in_bucket() uses these functions and makes
ocfs2_xattr_set_entry_in_bucket() obsolete.  That goes away, as does
ocfs2_xattr_bucket_set_value_outside() and
ocfs2_xattr_bucket_value_truncate().
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

73857ee0

ocfs2: Teach ocfs2_xa_loc how to do its own journal work · cf2bc809

由 Joel Becker 提交于 8月 18, 2009

We're going to want to make sure our buffers get accessed and dirtied
correctly.  So have the xa_loc do the work.  This includes storing the
inode on ocfs2_xa_loc.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

cf2bc809

ocfs2: Provide ocfs2_xa_fill_value_buf() for external value processing · 3fc12afa

由 Joel Becker 提交于 8月 18, 2009

We use the ocfs2_xattr_value_buf structure to manage external values.
It lets the value tree code do its work regardless of the containing
storage.  ocfs2_xa_fill_value_buf() initializes a value buf from an
ocfs2_xa_loc entry.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

3fc12afa

ocfs2: Handle value tree roots in ocfs2_xa_set_inline_value() · 9dc47400

由 Joel Becker 提交于 8月 17, 2009

Previously the xattr code would send in a fake value, containing a tree
root, to the function that installed name+value pairs. Instead, we pass
the real value to ocfs2_xa_set_inline_value(), and it notices that the
value cannot fit. Thus, it installs a tree root.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

9dc47400

ocfs2: Set the xattr name+value pair in one place · 69a3e539

由 Joel Becker 提交于 8月 17, 2009

We create two new functions on ocfs2_xa_loc, ocfs2_xa_prepare_entry()
and ocfs2_xa_store_inline_value().

ocfs2_xa_prepare_entry() makes sure that the xl_entry field of
ocfs2_xa_loc is ready to receive an xattr.  The entry will point to an
appropriately sized name+value region in storage.  If an existing entry
can be reused, it will be.  If no entry already exists, it will be
allocated.  If there isn't space to allocate it, -ENOSPC will be
returned.

ocfs2_xa_store_inline_value() stores the data that goes into the 'value'
part of the name+value pair.  For values that don't fit directly, this
stores the value tree root.

A number of operations are added to ocfs2_xa_loc_operations to support
these functions.  This reflects the disparate behaviors of xattr blocks
and buckets.

With these functions, the overlapping ocfs2_xattr_set_entry_local() and
ocfs2_xattr_set_entry_normal() can be replaced with a single call
scheme.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

69a3e539

ocfs2: Wrap calculation of name+value pair size. · 199799a3

由 Joel Becker 提交于 8月 14, 2009

An ocfs2 xattr entry stores the text name and value as a pair in the
storage area.  Obviously names and values can be variable-sized.  If a
value is too large for the entry storage, a tree root is stored instead.
The name+value pair is also padded.

Because of this, there are a million places in the code that do:

	if (needs_external_tree(value_size)
		namevalue_size = pad(name_size) + tree_root_size;
	else
		namevalue_size = pad(name_size) + pad(value_size);

Let's create some convenience functions to make the code more readable.
There are three forms.  The first takes the raw sizes.  The second takes
an ocfs2_xattr_info structure.  The third takes an existing
ocfs2_xattr_entry.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

199799a3

ocfs2: Add a name_len field to ocfs2_xattr_info. · 18853b95

由 Joel Becker 提交于 8月 14, 2009

Rather than calculating strlen all over the place, let's store the
name length directly on ocfs2_xattr_info.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

18853b95

ocfs2: Prefix the member fields of struct ocfs2_xattr_info. · 6b240ff6

由 Joel Becker 提交于 8月 14, 2009

struct ocfs2_xattr_info is a useful structure describing an xattr
you'd like to set.  Let's put prefixes on the member fields so it's
easier to read and use.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

6b240ff6

ocfs2: Remove xattrs via ocfs2_xa_loc · bde1e540

由 Joel Becker 提交于 8月 14, 2009

Add ocfs2_xa_remove_entry(), which will remove an xattr entry from its
storage via the ocfs2_xa_loc descriptor.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

bde1e540

ocfs2: Introduce ocfs2_xa_loc · 11179f2c

由 Joel Becker 提交于 8月 14, 2009

The ocfs2 extended attribute (xattr) code is very flexible.  It can
store xattrs in the inode itself, in an external block, or in a tree of
data structures.  This allows the number of xattrs to be bounded by the
filesystem size.

However, the code that manages each possible storage location is
different.  Maintaining the ocfs2 xattr code requires changing each hunk
separately.

This patch is the start of a series introducing the ocfs2_xa_loc
structure.  This structure wraps the on-disk details of an xattr
entry.  The goal is that the generic xattr routines can use
ocfs2_xa_loc without knowing the underlying storage location.

This first pass merely implements the basic structure, initializing it,
and wiping the name+value pair of the entry.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

11179f2c

ocfs2: Add current->comm in trace output · 8545e03d

由 Sunil Mushran 提交于 2月 12, 2010

Add current->comm to the standard mlog() output to help with debugging.
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

8545e03d

ocfs2: Clean up the checks for CoW and direct I/O. · 96a1cc73

由 Wengang Wang 提交于 2月 09, 2010

When ocfs2 has to do CoW for refcounted extents, we disable direct I/O
and go through the buffered I/O path.  This makes the combined check
easier to read.
Signed-off-by: NWengang Wang <wen.gang.wang@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

96a1cc73

ocfs2: add extent block stealing for ocfs2 v5 · b89c5428

由 Tiger Yang 提交于 1月 25, 2010

This patch add extent block (metadata) stealing mechanism for
extent allocation. This mechanism is same as the inode stealing.
if no room in slot specific extent_alloc, we will try to
allocate extent block from the next slot.
Signed-off-by: NTiger Yang <tiger.yang@oracle.com>
Acked-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

b89c5428

09 2月, 2010 2 次提交

ocfs2/cluster: Make o2net connect messages KERN_NOTICE · 6efd8066

由 Sunil Mushran 提交于 2月 05, 2010

Connect and disconnect messages are more than informational as they are required
during root cause analysis for failures. This patch changes them from KERN_INFO
to KERN_NOTICE.
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Acked-by: NMark Faseh <mfasheh@suse.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

6efd8066

ocfs2/dlm: Fix printing of lockname · 86a06aba

由 Sunil Mushran 提交于 2月 05, 2010

The debug call printing the name of the lock resource was chopping
off the last character. This patch fixes the problem.
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Acked-by: NMark Fasheh <mfasheh@suse.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

86a06aba

06 2月, 2010 1 次提交

ocfs2: Fix contiguousness check in ocfs2_try_to_merge_extent_map() · bd6b0bf8

由 Roel Kluin 提交于 2月 05, 2010

The wrong member was compared in the continguousness check.
Acked-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NRoel Kluin <roel.kluin@gmail.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

bd6b0bf8

04 2月, 2010 2 次提交

ocfs2/dlm: Remove BUG_ON in dlm recovery when freeing locks of a dead node · cda70ba8

由 Sunil Mushran 提交于 2月 01, 2010

During recovery, the dlm frees the locks for the dead node. If it finds a
lock in a resource for the dead node, it expects that node to also have a
ref in that lock resource. If not, it BUGs.

ossbz#1175 was filed with the above BUG. Now, while it is correct that we
should be expecting the ref, I see no reason why we have to BUG. After all,
we are freeing up the lock and clearing the ref.

This patch replaces the BUG_ON with a printk(). Hopefully, that will give
us more clues next time this happens.

http://oss.oracle.com/bugzilla/show_bug.cgi?id=1175Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Acked-by: NMark Fasheh <mfasheh@suse.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

cda70ba8

ocfs2: Plugs race between the dc thread and an unlock ast message · 079b8057

由 Sunil Mushran 提交于 2月 03, 2010

This patch plugs a race between the downconvert thread and an unlock ast message.
Specifically, after the downconvert worker has done its task, the dc thread needs
to check whether an unlock ast made the downconvert moot.
Reported-by: NDavid Teigland <teigland@redhat.com>
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Acked-by: NMark Fasheh <mfasheh@sus.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

079b8057

03 2月, 2010 9 次提交

ocfs2: Remove overzealous BUG_ON during blocked lock processing · db0f6ce6

由 Sunil Mushran 提交于 2月 01, 2010

During blocked lock processing, we should consider the possibility that the
lock is no longer blocking.

Joel Becker <joel.becker@oracle.com> assisted in fixing this issue.
Reported-by: NDavid Teigland <teigland@redhat.com>
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

db0f6ce6

ocfs2: Do not downconvert if the lock level is already compatible · 0d74125a

由 Sunil Mushran 提交于 1月 29, 2010

During upconvert, if the master were to send a BAST, dlmglue will detect the
upconversion in process and send a cancel convert to the master. Upon receiving
the AST for the cancel convert, it will re-process the lock resource to determine
whether it needs downconverting. Say, the up was from PR to EX and the BAST was
for EX. After the cancel convert, it will need to downconvert to NL.

However, if the node was originally upconverting from NL to EX, then there would
be no reason to downconvert (assuming the same message sequence).

This patch makes dlmglue consider the possibility that the current lock level
is already compatible and that downconverting is not required.

Joel Becker <joel.becker@oracle.com> assisted in fixing this issue.

Fixes ossbz#1178
http://oss.oracle.com/bugzilla/show_bug.cgi?id=1178Reported-by: NColy Li <coly.li@suse.de>
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

0d74125a

ocfs2: Prevent a livelock in dlmglue · a1912826

由 Sunil Mushran 提交于 1月 21, 2010

There is possibility of a livelock in __ocfs2_cluster_lock(). If a node were
to get an ast for an upconvert request, followed immediately by a bast,
there is a small window where the fs may downconvert the lock before the
process requesting the upconvert is able to take the lock.

This patch adds a new flag to indicate that the upconvert is still in
progress and that the dc thread should not downconvert it right now.

Wengang Wang <wen.gang.wang@oracle.com> and Joel Becker
<joel.becker@oracle.com> contributed heavily to this patch.
Reported-by: NDavid Teigland <teigland@redhat.com>
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

a1912826

ocfs2: Fix setting of OCFS2_LOCK_BLOCKED during bast · 0b94a909

由 Wengang Wang 提交于 1月 21, 2010

During bast, set the OCFS2_LOCK_BLOCKED flag only if the lock needs to
downconverted.
Signed-off-by: NWengang Wang <wen.gang.wang@oracle.com>
Acked-by: NSunil Mushran <sunil.mushran@oracle.com>
Acked-by: NMark Fasheh <mfasheh@suse.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

0b94a909

ocfs2: Use compat_ptr in reflink_arguments. · 34e6c59a

由 Tao Ma 提交于 1月 27, 2010

Although we use u64 to pass userspace pointers to the kernel
to avoid compat_ioctl, it doesn't work in some ppc platform.
So wrap them with compat_ptr and add compat_ioctl.

The detailed discussion about compat_ptr can be found in thread
http://lkml.org/lkml/2009/10/27/423.

We indeed met with a bug when testing on ppc(-EFAULT is returned
when using old_path). This patch try to fix this.
I have tested in ppc64(with 32 bit reflink) and x86_64(with i686
reflink), both works.
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

34e6c59a

ocfs2/dlm: Handle EAGAIN for compatibility - v2 · cd34edd8

由 Sunil Mushran 提交于 1月 25, 2010

Mainline commit aad1b153 made the
dlm_begin_reco_handler() return -EAGAIN instead of EAGAIN.

As this error is transmitted over the wire, we want the receiver,
dlm_send_begin_reco_message(), to understand both the older EAGAIN and
the newer -EAGAIN, to allow rolling upgrade of the cluster nodes.
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

cd34edd8

ocfs2: Add parenthesis to wrap the check for O_DIRECT. · 60c48674

由 Tao Ma 提交于 2月 03, 2010

Add parenthesis to wrap the check for O_DIRECT.
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

60c48674

ocfs2: Only bug out when page size is larger than cluster size. · 0a1ea437

由 Tao Ma 提交于 2月 01, 2010

In CoW, we have to make sure that the page is already written
out to the disk. So we have a BUG_ON(PageDirty(page)).

In ppc platform we have pagesize=64K, so if the cs=4K, if the
file have fragmented clusters, we will map the page many times.
See this file as an example.
Tree Depth: 0   Count: 19   Next Free Rec: 14
	## Offset        Clusters       Block#          Flags
	0  0             4              2164864         0x2 Refcounted
	1  4             2              9302792         0x2 Refcounted
...

We have to replace the extent recs one by one, so the page with index 0
will be mapped and dirtied twice.

I'd like to leave the BUG_ON there while adding a check so that in
case we meet with an error in other platforms, we can find it easily.
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

0a1ea437

ocfs2: Fix memory overflow in cow_by_page. · d622b89a

由 Tao Ma 提交于 1月 30, 2010

In ocfs2_duplicate_clusters_by_page, we calculate map_end
by shifting page_index. But actually in case we meet with
a large offset(say in a i686 box, poff_t is only 32 bits
and page_index=2056240), we will overflow. So change the
type of page_index to loff_t.
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

d622b89a

26 1月, 2010 5 次提交

ocfs2/dlm: Print more messages during lock migration · 26636bf6

由 Sunil Mushran 提交于 1月 25, 2010

When a lock resource is migrated, the dlm compares the migrated
locks with that that was already existing on the new node. If the
comparison fails, it BUGs. This patch prints more messages when the
comparison fails inorder to help with the root cause analyis.

http://oss.oracle.com/bugzilla/show_bug.cgi?id=1206
This does not fix bz1206. However, if we run into it again, we will
have more information to chew on.
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

26636bf6

ocfs2/dlm: Ignore LVBs of locks in the Blocked list · 71656fa6

由 Sunil Mushran 提交于 1月 25, 2010

During lock resource migration, o2dlm fills the packet with a LVB from the
first valid lock. For sanity, it ensures that the other valid locks have the
same LVB. If not, it BUGs.

The valid locks are ones that have granted EX or PR lock levels and are either
on the Granted or Converting lists. Locks in the Blocked list cannot have a
valid LVB.

This patch ensures that we skip the locks in the Blocked list.

Fixes oss bugzilla#1202
http://oss.oracle.com/bugzilla/show_bug.cgi?id=1202Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

71656fa6

ocfs2/trivial: Remove trailing whitespaces · 2bd63216

由 Sunil Mushran 提交于 1月 25, 2010

Patch removes trailing whitespaces.
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

2bd63216

ocfs2: fix a misleading variable name · e5f2cb2b

由 Wengang Wang 提交于 1月 22, 2010

a local variable "dlm_version" is used as a fs locking version.
rename it fs_version.
Signed-off-by: NWengang Wang <wen.gang.wang@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

e5f2cb2b

ocfs2: Sync max_inline_data_with_xattr from tools. · 1097df3f

由 Tao Ma 提交于 1月 20, 2010

In ocfs2-tools, we have added ocfs2_max_inline_data_with_xattr,
so add it in the kernel's ocfs2_fs.h.
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

1097df3f

12 1月, 2010 1 次提交

ocfs2: Fix refcnt leak on ocfs2_fast_follow_link() error path · 1dd473fd

由 OGAWA Hirofumi 提交于 1月 12, 2010

If ->follow_link handler returns an error, it should decrement
nd->path refcnt. But ocfs2_fast_follow_link() doesn't decrement.

This patch fixes the problem by using nd_set_link() style error handling
instead of playing with nd->path.
Signed-off-by: NOGAWA Hirofumi <hirofumi@mail.parknet.co.jp>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

1dd473fd

31 12月, 2009 1 次提交

ocfs2: Handle O_DIRECT when writing to a refcounted cluster. · 86470e98

由 Tao Ma 提交于 12月 03, 2009

In case of writing to a refcounted cluster with O_DIRECT,
we need to fall back to buffer write. And when it is finished,
we need to flush the page and the journal as we did for other
O_DIRECT writes.

This patch fix oss bug 1191.
http://oss.oracle.com/bugzilla/show_bug.cgi?id=1191Signed-off-by: NTao Ma <tao.ma@oracle.com>
Tested-by: NTristan Ye <tristan.ye@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

86470e98

24 12月, 2009 3 次提交

ocfs2/trivial: Use le16_to_cpu for a disk value in xattr.c · 8ff6af88

由 Tao Ma 提交于 12月 23, 2009

In ocfs2_value_metas_in_xattr_header, we should Use
le16_to_cpu for ocfs2_extent_list.l_next_free_rec.
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

8ff6af88

ocfs2/trivial: Use proper mask for 2 places in hearbeat.c · b31d308d

由 Tao Ma 提交于 12月 22, 2009

I just noticed today that there are 2 places of "mlog(0,...)"
in  fs/ocfs2/cluster/heartbeat.c, but actually have no default
mask prefix in that file.
So change them to mlog(ML_HEARTBEAT,...).
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

b31d308d

Ocfs2: Let ocfs2 support fiemap for symlink and fast symlink. · 86239d59

由 Tristan Ye 提交于 12月 22, 2009

For fast symlink, it can be treated the same as inlined files since
the data extent we want to return of both case all were stored in
metadata block. For symlink, it can be simply treated the same as we
did for regular files.
Signed-off-by: NTristan Ye <tristan.ye@oracle.com>
Acked-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

86239d59

19 12月, 2009 1 次提交

ocfs2: Set i_nlink properly during reflink. · 10cf1a02

由 Tao Ma 提交于 12月 18, 2009

We create a file in orphan dir for reflink so that if there
is any error, we don't create any wrong dentry in the dir.
But actually the file in orphan dir should be i_nlink = 0
so that it can be replayed and freed successfully.

This patch first set i_nlink to 0 when creating the file in
orphan dir and then set it to 1(reflink now only works for
regular file) when we move it to the dest dir.
Signed-off-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

10cf1a02

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功