提交 · db71922217a214e5c9268448e537b54fc1f301ea · openanolis / cloud-kernel

05 10月, 2010 1 次提交

BKL: Explicitly add BKL around get_sb/fill_super · db719222

由 Jan Blunck 提交于 8月 15, 2010

This patch is a preparation necessary to remove the BKL from do_new_mount().
It explicitly adds calls to lock_kernel()/unlock_kernel() around
get_sb/fill_super operations for filesystems that still uses the BKL.

I've read through all the code formerly covered by the BKL inside
do_kern_mount() and have satisfied myself that it doesn't need the BKL
any more.

do_kern_mount() is already called without the BKL when mounting the rootfs
and in nfsctl. do_kern_mount() calls vfs_kern_mount(), which is called
from various places without BKL: simple_pin_fs(), nfs_do_clone_mount()
through nfs_follow_mountpoint(), afs_mntpt_do_automount() through
afs_mntpt_follow_link(). Both later functions are actually the filesystems
follow_link inode operation. vfs_kern_mount() is calling the specified
get_sb function and lets the filesystem do its job by calling the given
fill_super function.

Therefore I think it is safe to push down the BKL from the VFS to the
low-level filesystems get_sb/fill_super operation.

[arnd: do not add the BKL to those file systems that already
       don't use it elsewhere]
Signed-off-by: NJan Blunck <jblunck@infradead.org>
Signed-off-by: NArnd Bergmann <arnd@arndb.de>
Cc: Matthew Wilcox <matthew@wil.cx>
Cc: Christoph Hellwig <hch@infradead.org>

db719222

10 8月, 2010 1 次提交
- A
  convert ocfs2 to ->evict_inode() · 066d92dc
  由 Al Viro 提交于 6月 08, 2010
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  066d92dc
17 6月, 2010 1 次提交

fix typos concerning "initiali[zs]e" · 421f91d2

由 Uwe Kleine-König 提交于 6月 11, 2010

Signed-off-by: NUwe Kleine-König <u.kleine-koenig@pengutronix.de>
Signed-off-by: NJiri Kosina <jkosina@suse.cz>

421f91d2

24 5月, 2010 4 次提交

quota: rename default quotactl methods to dquot_ · 287a8095

由 Christoph Hellwig 提交于 5月 19, 2010

Follow the dquot_* style used elsewhere in dquot.c.

[Jan Kara: Fixed up missing conversion of ext2]
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NJan Kara <jack@suse.cz>

287a8095

quota: drop remount argument to ->quota_on and ->quota_off · 307ae18a

由 Christoph Hellwig 提交于 5月 19, 2010

Remount handling has fully moved into the filesystem, so all this is
superflous now.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NJan Kara <jack@suse.cz>

307ae18a

quota: kill the vfs_dq_off and vfs_dq_quota_on_remount wrappers · 0f0dd62f

由 Christoph Hellwig 提交于 5月 19, 2010

Instead of having wrappers in the VFS namespace export the dquot_suspend
and dquot_resume helpers directly.  Also rename vfs_quota_disable to
dquot_disable while we're at it.

[Jan Kara: Moved dquot_suspend to quotaops.h and made it inline]
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NJan Kara <jack@suse.cz>

0f0dd62f

ocfs2: Fix use after free on remount read-only · eea7feb0

由 Jan Kara 提交于 5月 13, 2010

We also have to cancel quota syncing thread on remount read only because
at that moment quota is being turned off. Otherwise quota syncing thread
will try to access already freed quota structures.
Signed-off-by: NJan Kara <jack@suse.cz>

eea7feb0

22 5月, 2010 1 次提交

ocfs2: Fix lock inversion in quotas during umount · c06bcbfa

由 Jan Kara 提交于 5月 13, 2010

We cannot cancel delayed work from ocfs2_local_free_info because that is called
with dqonoff_mutex held and the work it cancels requires dqonoff_mutex to
finish. Cancel the work before acquiring dqonoff_mutex.
Acked-by: NJoel Becker <Joel.Becker@oracle.com>
Signed-off-by: NJan Kara <jack@suse.cz>

c06bcbfa

11 5月, 2010 1 次提交

ocfs2: Wrap signal blocking in void functions. · e4b963f1

由 Joel Becker 提交于 9月 02, 2009

ocfs2 sometimes needs to block signals around dlm operations, but it
currently does it with sigprocmask().  Even worse, it's checking the
error code of sigprocmask().  The in-kernel sigprocmask() can only error
if you get the SIG_* argument wrong.  We don't.

Wrap the sigprocmask() calls with ocfs2_[un]block_signals().  These
functions are void, but they will BUG() if somehow sigprocmask() returns
an error.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

e4b963f1

06 5月, 2010 6 次提交

ocfs2: Make nointr a default mount option · 4b37fcb7

由 Sunil Mushran 提交于 4月 13, 2010

OCFS2 has never really supported intr. This patch acknowledges this reality
and makes nointr the default mount option. In a later patch, we intend to
support intr.
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

4b37fcb7

ocfs2: Add dir_resv_level mount option · 83f92318

由 Mark Fasheh 提交于 4月 05, 2010

The default behavior for directory reservations stays the same, but we add a
mount option so people can tweak the size of directory reservations
according to their workloads.
Signed-off-by: NMark Fasheh <mfasheh@suse.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

83f92318

ocfs2: increase the default size of local alloc windows · 6b82021b

由 Mark Fasheh 提交于 4月 05, 2010

I have observed that the current size of 8M gives us pretty poor
fragmentation on multi-threaded workloads which do lots of writes.

Generally, I can increase the size of local alloc windows and observe a
marked decrease in fragmentation, even up and beyond window sizes of 512
megabytes. This makes sense for a couple reasons - larger local alloc means
more room for reservation windows. On multi-node workloads the larger local
alloc helps as well because we don't have to do window slides as often.

Also, I removed the OCFS2_DEFAULT_LOCAL_ALLOC_SIZE constant as it is no
longer used and the comment above it was out of date.

To test fragmentation, I used a workload which launched 4 threads that did
4k writes into a series of about 140 alternating files.

With resv_level=2, and a 4k/4k file system I observed the following average
fragmentation for various localalloc= parameters:

localalloc=	avg. fragmentation
	8		48
	32		16
	64		10
	120		7

On larger cluster sizes, the difference is more dramatic.

The new default size top out at 256M, which we'll only get for cluster
sizes of 32K and above.
Signed-off-by: NMark Fasheh <mfasheh@suse.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

6b82021b

ocfs2: clean up localalloc mount option size parsing · 73c8a800

由 Mark Fasheh 提交于 4月 05, 2010

This patch pulls the local alloc sizing code into localalloc.c and provides
a callout to it from ocfs2_fill_super(). Behavior is essentially unchanged
except that I correctly calculate the maximum local alloc size. The old code
in ocfs2_parse_options() calculated the max size as:

ocfs2_local_alloc_size(sb) * 8

which is correct, in bits. Unfortunately though the option passed in is in
megabytes. Ultimately, this bug made no real difference - the shrink code
would catch a too-large size and bring it down to something reasonable.
Still, it's less than efficient as-is.
Signed-off-by: NMark Fasheh <mfasheh@suse.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

73c8a800

ocfs2: use allocation reservations during file write · 4fe370af

由 Mark Fasheh 提交于 12月 07, 2009

Add a per-inode reservations structure and pass it through to the
reservations code.
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

4fe370af

ocfs2: allocation reservations · d02f00cc

由 Mark Fasheh 提交于 12月 07, 2009

This patch improves Ocfs2 allocation policy by allowing an inode to
reserve a portion of the local alloc bitmap for itself. The reserved
portion (allocation window) is advisory in that other allocation
windows might steal it if the local alloc bitmap becomes
full. Otherwise, the reservations are honored and guaranteed to be
free. When the local alloc window is moved to a different portion of
the bitmap, existing reservations are discarded.

Reservation windows are represented internally by a red-black
tree. Within that tree, each node represents the reservation window of
one inode. An LRU of active reservations is also maintained. When new
data is written, we allocate it from the inodes window. When all bits
in a window are exhausted, we allocate a new one as close to the
previous one as possible. Should we not find free space, an existing
reservation is pulled off the LRU and cannibalized.
Signed-off-by: NMark Fasheh <mfasheh@suse.com>

d02f00cc

13 4月, 2010 2 次提交

ocfs2: ocfs2_group_bitmap_size has to handle old volume. · 8571882c

由 Tao Ma 提交于 4月 13, 2010

ocfs2_group_bitmap_size has to handle the case when the
volume don't have discontiguous block group support. So
pass the feature_incompat in and check it.
Signed-off-by: NTao Ma <tao.ma@oracle.com>

8571882c

ocfs2: Define data structures for discontiguous block groups. · 4cbe4249

由 Joel Becker 提交于 4月 13, 2010

Defines the OCFS2_FEATURE_INCOMPAT_DISCONTIG_BG feature bit and modifies
struct ocfs2_group_desc for the feature.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>
Signed-off-by: NTao Ma <tao.ma@oracle.com>

4cbe4249

27 2月, 2010 1 次提交

ocfs2: add extent block stealing for ocfs2 v5 · b89c5428

由 Tiger Yang 提交于 1月 25, 2010

This patch add extent block (metadata) stealing mechanism for
extent allocation. This mechanism is same as the inode stealing.
if no room in slot specific extent_alloc, we will try to
allocate extent block from the next slot.
Signed-off-by: NTiger Yang <tiger.yang@oracle.com>
Acked-by: NTao Ma <tao.ma@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

b89c5428

26 1月, 2010 1 次提交

ocfs2/trivial: Remove trailing whitespaces · 2bd63216

由 Sunil Mushran 提交于 1月 25, 2010

Patch removes trailing whitespaces.
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

2bd63216

30 10月, 2009 1 次提交

ocfs2: return f_fsid info in ocfs2_statfs() · 837711f8

由 Coly Li 提交于 1月 16, 2009

Currently the f_fsid of struct kstatfs returned from ocfs2_statfs() is
undefined (vfs layer fills in 0 as default). Since in some conditions,
f_fsid value might be used in a (f_fsid, ino) pair to uniquely identify
a file, ocfs2 should return a unique defined f_fsid value from
ocfs2_statfs().

Because uuid_str is the same on big or litlle endian machine, it's
endian consistent to use osb->uuid_str to generate f_fsid value.
Signed-off-by: NColy Li <coly.li@suse.de>
Cc: Sunil Mushran <sunil.mushran@oracle.com>
Cc: Mark Fasheh <mfasheh@suse.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

837711f8

29 10月, 2009 4 次提交

ocfs2: Set MS_POSIXACL on remount · 57b09bb5

由 Jan Kara 提交于 10月 15, 2009

We have to set MS_POSIXACL on remount as well. Otherwise VFS
would not know we started supporting ACLs after remount and
thus ACLs would not work.
Signed-off-by: NJan Kara <jack@suse.cz>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

57b09bb5

ocfs2: Make acl use the default · 5297aad8

由 Jan Kara 提交于 10月 15, 2009

Change acl mount options handling to match the one of XFS and BTRFS and
hopefully it is also easier to use now. When admin does not specify any
acl mount option, acls are enabled if and only if the filesystem has
xattr feature enabled. If admin specifies 'acl' mount option, we fail
the mount if the filesystem does not have xattr feature and thus acls
cannot be enabled.
Signed-off-by: NJan Kara <jack@suse.cz>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

5297aad8

ocfs2: Always include ACL support · e6aabe0c

由 Jan Kara 提交于 10月 15, 2009

To become consistent with filesystems such as XFS or BTRFS, make posix
ACLs always available. This also reduces possibility of
misconfiguration on admin's side.
Signed-off-by: NJan Kara <jack@suse.cz>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

e6aabe0c

ocfs2: Return -EINVAL when a device is not ocfs2. · fb5cbe9e

由 Joel Becker 提交于 10月 28, 2009

In case of non-modular kernels the root filesystem is mounted by trying
several filesystems. If ocfs2 was tried before the actual filesystem
type, the mount would fail because ocfs2_sb_probe() returns -EAGAIN
instead of -EINVAL. ocfs2 will now return -EINVAL properly.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>
Reported-by: NLaszlo Attila Toth <panther@balabit.hu>

fb5cbe9e

02 10月, 2009 1 次提交

const: constify remaining file_operations · 828c0950

由 Alexey Dobriyan 提交于 10月 01, 2009

[akpm@linux-foundation.org: fix KVM]
Signed-off-by: NAlexey Dobriyan <adobriyan@gmail.com>
Acked-by: NMike Frysinger <vapier@gentoo.org>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

828c0950

24 9月, 2009 1 次提交

headers: utsname.h redux · 2bcd57ab

由 Alexey Dobriyan 提交于 9月 24, 2009

* remove asm/atomic.h inclusion from linux/utsname.h --
   not needed after kref conversion
 * remove linux/utsname.h inclusion from files which do not need it

NOTE: it looks like fs/binfmt_elf.c do not need utsname.h, however
due to some personality stuff it _is_ needed -- cowardly leave ELF-related
headers and files alone.
Signed-off-by: NAlexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

2bcd57ab

23 9月, 2009 2 次提交

ocfs2: __ocfs2_abort() should not enable panic for local mounts · a2f2ddbf

由 Sunil Mushran 提交于 8月 19, 2009

In a clustered setup, we have to panic the box on journal abort. This is
because we don't have the facility to go hard readonly. With hard ro, another
node would detect node failure and initiate recovery.

Having said that, we shouldn't force panic if the volume is mounted locally.
This patch defers the handling to the mount option, errors.
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

a2f2ddbf

ocfs2: Add refcount tree lock mechanism. · 374a263e

由 Tao Ma 提交于 8月 24, 2009

Implement locking around struct ocfs2_refcount_tree.  This protects
all read/write operations on refcount trees.  ocfs2_refcount_tree
has its own lock and its own caching_info, protecting buffers among
multiple nodes.

User must call ocfs2_lock_refcount_tree before his operation on
the tree and unlock it after that.

ocfs2_refcount_trees are referenced by the block number of the
refcount tree root block, So we create an rb-tree on the ocfs2_super
to look them up.
Signed-off-by: NTao Ma <tao.ma@oracle.com>

374a263e

22 9月, 2009 1 次提交

const: make struct super_block::s_qcop const · 0d54b217

由 Alexey Dobriyan 提交于 9月 21, 2009

Signed-off-by: NAlexey Dobriyan <adobriyan@gmail.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

0d54b217

05 9月, 2009 5 次提交

ocfs2: move ip_created_trans to struct ocfs2_caching_info · 292dd27e

由 Joel Becker 提交于 2月 12, 2009

Similar ip_last_trans, ip_created_trans tracks the creation of a journal
managed inode.  This specifically tracks what transaction created the
inode.  This is so the code can know if the inode has ever been written
to disk.

This behavior is desirable for any journal managed object.  We move it
to struct ocfs2_caching_info as ci_created_trans so that any object
using ocfs2_caching_info can rely on this behavior.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

292dd27e

ocfs2: move ip_last_trans to struct ocfs2_caching_info · 66fb345d

由 Joel Becker 提交于 2月 12, 2009

We have the read side of metadata caching isolated to struct
ocfs2_caching_info, now we need the write side.  This means the journal
functions.  The journal only does a couple of things with struct inode.

This change moves the ip_last_trans field onto struct
ocfs2_caching_info as ci_last_trans.  This field tells the journal
whether a pending journal flush is required.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

66fb345d

ocfs2: Take the inode out of the metadata read/write paths. · 8cb471e8

由 Joel Becker 提交于 2月 10, 2009

We are really passing the inode into the ocfs2_read/write_blocks()
functions to get at the metadata cache.  This commit passes the cache
directly into the metadata block functions, divorcing them from the
inode.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

8cb471e8

ocfs2: Change metadata caching locks to an operations structure. · 6e5a3d75

由 Joel Becker 提交于 2月 10, 2009

We don't really want to cart around too many new fields on the
ocfs2_caching_info structure.  So let's wrap all our access of the
parent object in a set of operations.  One pointer on caching_info, and
more flexibility to boot.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

6e5a3d75

ocfs2: Make the ocfs2_caching_info structure self-contained. · 47460d65

由 Joel Becker 提交于 2月 10, 2009

We want to use the ocfs2_caching_info structure in places that are not
inodes.  To do that, it can no longer rely on referencing the inode
directly.

This patch moves the flags to ocfs2_caching_info->ci_flags, stores
pointers to the parent's locks on the ocfs2_caching_info, and renames
the constants and flags to reflect its independant state.
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

47460d65

18 8月, 2009 1 次提交

ocfs2: Don't oops in ocfs2_kill_sb on a failed mount · 5fd13189

由 Jan Kara 提交于 7月 30, 2009

If we fail to mount the filesystem, we have to be careful not to dereference
uninitialized structures in ocfs2_kill_sb.
Signed-off-by: NJan Kara <jack@suse.cz>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

5fd13189

24 7月, 2009 1 次提交

ocfs2: Fix initialization of blockcheck stats · 1c1d9793

由 Jan Kara 提交于 7月 22, 2009

We just set blockcheck stats to zeros but we should also
properly initialize the spinlock there.
Signed-off-by: NJan Kara <jack@suse.cz>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

1c1d9793

22 7月, 2009 1 次提交

ocfs2: Fix deadlock on umount · f7b1aa69

由 Jan Kara 提交于 7月 20, 2009

In commit ea455f8a, we moved the dentry lock
put process into ocfs2_wq. This causes problems during umount because ocfs2_wq
can drop references to inodes while they are being invalidated by
invalidate_inodes() causing all sorts of nasty things (invalidate_inodes()
ending in an infinite loop, "Busy inodes after umount" messages etc.).

We fix the problem by stopping ocfs2_wq from doing any further releasing of
inode references on the superblock being unmounted, wait until it finishes
the current round of releasing and finally cleaning up all the references in
dentry_lock_list from ocfs2_put_super().

The issue was tracked down by Tao Ma <tao.ma@oracle.com>.
Signed-off-by: NJan Kara <jack@suse.cz>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

f7b1aa69

09 7月, 2009 1 次提交

ocfs2: Fixup orphan scan cleanup after failed mount · 8b712cd5

由 Jeff Mahoney 提交于 7月 07, 2009

If the mount fails for any reason, ocfs2_dismount_volume calls
ocfs2_orphan_scan_stop. It requires that ocfs2_orphan_scan_init
be called to setup the mutex and work queues, but that doesn't
happen if the mount has failed and we oops accessing an uninitialized
work queue.

This patch splits the init and startup of the orphan scan, eliminating
the oops.
Signed-off-by: NJeff Mahoney <jeffm@suse.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

8b712cd5

23 6月, 2009 2 次提交

ocfs2: Disable orphan scanning for local and hard-ro mounts · df152c24

由 Sunil Mushran 提交于 6月 22, 2009

Local and Hard-RO mounts do not need orphan scanning.
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

df152c24

ocfs2: Stop orphan scan as early as possible during umount · 692684e1

由 Sunil Mushran 提交于 6月 19, 2009

Currently if the orphan scan fires a tick before the user issues the umount,
the umount will wait for the queued orphan scan tasks to complete.

This patch makes the umount stop the orphan scan as early as possible so as
to reduce the probability of the queued tasks slowing down the umount.
Signed-off-by: NSunil Mushran <sunil.mushran@oracle.com>
Signed-off-by: NJoel Becker <joel.becker@oracle.com>

692684e1

openanolis / cloud-kernel 大约 1 年 前同步成功

openanolis / cloud-kernel
大约 1 年前同步成功