Commits · 4b2fecc84655055a6a1fe9151786992ac04b56ce · openeuler / raspberrypi-kernel

01 Oct, 2014 2 commits

f2fs: introduce FITRIM in f2fs_ioctl · 4b2fecc8

Authored by Jaegeuk Kim on Sep 20, 2014

This patch introduces FITRIM in f2fs_ioctl.
In this case, f2fs will issue as many small discards and prefree discards as
possible for the given area.
Reviewed-by: Chao Yu <chao2.yu@samsung.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>
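
From user space, a FITRIM request like the one this patch handles could be issued roughly as follows. This is a minimal sketch, not taken from the patch: it assumes the `struct fstrim_range` layout (three 64-bit fields: start, len, minlen) and the `_IOWR('X', 121, ...)` encoding from the Linux `linux/fs.h` header, with the direction bits used on common architectures such as x86 and ARM. The helper names `iowr` and `trim` are hypothetical.

```python
import fcntl
import os
import struct

# Assumed layout of struct fstrim_range: start, len, minlen (u64 each).
FSTRIM_RANGE_FMT = "=QQQ"


def iowr(type_char: str, nr: int, size: int) -> int:
    """Encode an _IOWR request number (direction bits as on x86/ARM)."""
    return (3 << 30) | (size << 16) | (ord(type_char) << 8) | nr


# Assumed definition: FITRIM = _IOWR('X', 121, struct fstrim_range)
FITRIM = iowr("X", 121, struct.calcsize(FSTRIM_RANGE_FMT))


def trim(mount_point: str, start: int = 0,
         length: int = 2**64 - 1, minlen: int = 0) -> int:
    """Ask the filesystem mounted at mount_point to discard free space.

    Returns the byte count the kernel writes back into the range.
    Needs sufficient privileges and a device that supports discard.
    """
    buf = bytearray(struct.pack(FSTRIM_RANGE_FMT, start, length, minlen))
    fd = os.open(mount_point, os.O_RDONLY)
    try:
        # The mutable buffer lets the kernel update the range in place.
        fcntl.ioctl(fd, FITRIM, buf)
    finally:
        os.close(fd)
    _, trimmed, _ = struct.unpack(FSTRIM_RANGE_FMT, buf)
    return trimmed
```

Calling `trim("/mnt/f2fs")` (a hypothetical mount point) would cover the whole filesystem; narrowing `start` and `length` restricts the request to "the given area" mentioned above.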

f2fs: introduce cp_control structure · 75ab4cb8

Authored by Jaegeuk Kim on Sep 20, 2014

This patch adds a new data structure to control checkpoint parameters.
Currently, it presents the reason for the checkpoint, such as is_umount or a
normal sync.
Reviewed-by: Chao Yu <chao2.yu@samsung.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

24 Sep, 2014 3 commits

f2fs: remove redundant operation during roll-forward recovery · c52e1b10

Authored by Jaegeuk Kim on Sep 11, 2014

If the same data is updated multiple times, we don't need to redo all the
operations.
Let's just update the latest one.
Reviewed-by: Chao Yu <chao2.yu@samsung.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: fix conditions to remain recovery information in f2fs_sync_file · 88bd02c9

Authored by Jaegeuk Kim on Sep 15, 2014

This patch revisits all the recovery information used during f2fs_sync_file.

In this patch, three pieces of information are used to make a decision.

a) IS_CHECKPOINTED,	/* is it checkpointed before? */
b) HAS_FSYNCED_INODE,	/* is the inode fsynced before? */
c) HAS_LAST_FSYNC,	/* has the latest node fsync mark? */

And, the scenarios for our rule are based on:

[Term] F: fsync_mark, D: dentry_mark

1. inode(x) | CP | inode(x) | dnode(F)
2. inode(x) | CP | inode(F) | dnode(F)
3. inode(x) | CP | dnode(F) | inode(x) | inode(F)
4. inode(x) | CP | dnode(F) | inode(F)
5. CP | inode(x) | dnode(F) | inode(DF)
6. CP | inode(DF) | dnode(F)
7. CP | dnode(F) | inode(DF)
8. CP | dnode(F) | inode(x) | inode(DF)

For example, in scenario #3, the three conditions should change as follows.

   inode(x) | CP | dnode(F) | inode(x) | inode(F)
a)    x       o      o          o          o
b)    x       x      x          x          o
c)    x       o      o          x          o

If f2fs_sync_file stops   ------^,
 it should write inode(F)    --------------^

So, the need_inode_block_update should return true, since
 c) get_nat_flag(e, HAS_LAST_FSYNC), is false.

For example, in scenario #8,
      CP | alloc | dnode(F) | inode(x) | inode(DF)
a)    o      x        x          x          x
b)    x               x          x          o
c)    o               o          x          o

If f2fs_sync_file stops   -------^,
 it should write inode(DF)    --------------^

Note that the roll-forward policy should follow this rule, which means that
if there are any missing blocks, we don't need to recover that inode.
Signed-off-by: Huang Ying <ying.huang@intel.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>
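
The decision described above can be modelled in a few lines. This is a sketch under assumptions: the message only states that need_inode_block_update returns true when c) is false, so the way a) and b) are combined here is a guess, and the `NatFlags` class is a hypothetical stand-in for the flags kept in the NAT cache entry.

```python
from dataclasses import dataclass


@dataclass
class NatFlags:
    """Stand-in for the three per-inode flags named in the message."""
    is_checkpointed: bool     # a) IS_CHECKPOINTED
    has_fsynced_inode: bool   # b) HAS_FSYNCED_INODE
    has_last_fsync: bool      # c) HAS_LAST_FSYNC


def need_inode_block_update(flags: NatFlags) -> bool:
    """Assumed rule: the inode write can be skipped only if the latest
    node has an fsync mark AND the inode was checkpointed or fsynced."""
    if flags.has_last_fsync and (flags.is_checkpointed
                                 or flags.has_fsynced_inode):
        return False
    return True


# Scenario #3 at the point where f2fs_sync_file stops: a=o, b=x, c=x.
scenario3_at_stop = NatFlags(True, False, False)
# Scenario #3 after inode(F) has been written: a=o, b=o, c=o.
scenario3_done = NatFlags(True, True, True)
# Scenario #8 at the point where f2fs_sync_file stops: a=x, b=x, c=x.
scenario8_at_stop = NatFlags(False, False, False)
```

With these inputs, the two "stop" states ask for an inode block write and the completed state does not, which matches the arrows in the tables above.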

f2fs: use meta_inode cache to improve roll-forward speed · 4c521f49

Authored by Jaegeuk Kim on Sep 11, 2014

Previously, all the dnode pages had to be read during roll-forward recovery.
Even worse, the whole chain was traversed twice.
This patch removes those redundant and costly read operations by using the page
cache of meta_inode together with the readahead function.
Reviewed-by: Chao Yu <chao2.yu@samsung.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

16 Sep, 2014 2 commits

f2fs: give an option to enable in-place-updates during fsync to users · c1ce1b02

Authored by Jaegeuk Kim on Sep 10, 2014

If the user writes F2FS_IPU_FSYNC:4 to /sys/fs/f2fs/ipu_policy, in-place-updates
are attempted only from f2fs_sync_file.
And, if the number of dirty pages is over /sys/fs/f2fs/min_fsync_blocks, it
keeps writing in the out-of-order manner. Otherwise, it triggers in-place-updates.

This may be used by storage showing very high random write performance.

For example, it can be used when,

Seq. writes (Data) + wait + Seq. writes (Node)

is considerably slower than,

Rand. writes (Data)
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>
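
The threshold rule above can be written as a small predicate. This is only an illustrative model: the function name and parameters are hypothetical, and the policy is reduced to a single boolean rather than the real ipu_policy encoding.

```python
def fsync_uses_in_place_update(ipu_fsync_enabled: bool,
                               dirty_pages: int,
                               min_fsync_blocks: int) -> bool:
    """Model of the choice made at fsync time.

    In-place updates are tried only when the fsync policy is enabled and
    the number of dirty pages is not over min_fsync_blocks.
    """
    if not ipu_fsync_enabled:
        return False
    return dirty_pages <= min_fsync_blocks


# Example with an arbitrary threshold of 8 blocks.
small_fsync = fsync_uses_in_place_update(True, 4, 8)
large_fsync = fsync_uses_in_place_update(True, 64, 8)
```

A small fsync therefore becomes a handful of random data writes, while a large one keeps the existing write path.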

f2fs: expand counting dirty pages in the inode page cache · a7ffdbe2

Authored by Jaegeuk Kim on Sep 12, 2014

Previously f2fs only counted dirty dentry pages, but there is no reason not to
expand the scope.

This patch changes the names used for managing dirty pages and also counts
dirty pages in each inode info.
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

10 Sep, 2014 4 commits

f2fs: use lock-less list(llist) to simplify the flush cmd management · 721bd4d5

Authored by Gu Zheng on Sep 05, 2014

We use flush cmd control to collect many flush cmds, and flush them
together. In this case, we use two lists to manage the flush cmds
(collect and dispatch), and one spin lock is used to protect them.
In fact, the lock-less list(llist) is very suitable for this case,
and we use it to simplify this routine.

-
v2:
-use llist_for_each_entry_safe to fix possible use-after-free issue.
-remove the unused field from struct flush_cmd.
Thanks for Yu's suggestion.
-
Signed-off-by: Gu Zheng <guz.fnst@cn.fujitsu.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: refactor flush_sit_entries codes for reducing SIT writes · 184a5cd2

Authored by Chao Yu on Sep 04, 2014

In commit aec71382 ("f2fs: refactor flush_nat_entries codes for reducing NAT
writes"), we described the issue as below:

"Although building the NAT journal in cursum reduces the read/write work for NAT
blocks, the previous design leaves us with lower performance when checkpoints are
written frequently, in these cases:
1. If the journal in cursum is already full, it is a bit of a waste that we flush
   all nat entries to pages for persistence without caching any entries.
2. If the journal in cursum is not full, we fill nat entries into the journal until
   the journal is full, then flush the remaining dirty entries to disk without
   merging the journaled entries, so these journaled entries may be flushed to disk
   at the next checkpoint, having lost the chance to be flushed last time."

Actually, we have the same problem in using the SIT journal area.

In this patch, we first update the sit journal with as many dirty entries as
possible. Second, if there is no space in the sit journal, we remove all
entries from the journal and walk through the whole dirty entry bitmap of sit,
accounting dirty sit entries located in the same SIT block to a sit entry set. All
entry sets are linked to the list sit_entry_set in sm_info, sorted in ascending
order by the count of entries in each set. Later we flush entries from the sets
which have the fewest entries into the journal, as many as we can, and then flush
the dense sets with merged entries to disk.

In this way we can use the sit journal area more effectively and reduce
SIT updates, resulting in a performance gain and a longer lifetime for the flash
device.

In my testing environment, this patch noticeably reduces SIT block
updates.

virtual machine + hard disk:
fsstress -p 20 -n 400 -l 5
		sit page num	cp count	sit pages/cp
based		2006.50		1349.75		1.486
patched		1566.25		1463.25		1.070

Our latency of merging op is small when handling a great number of dirty SIT
entries in flush_sit_entries:
latency(ns)	dirty sit count
36038		2151
49168		2123
37174		2232
Signed-off-by: Chao Yu <chao2.yu@samsung.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>
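
The grouping and ordering step described in this message can be illustrated with a small model. It is a sketch under assumptions, not the kernel code: it starts from an already emptied journal, the function name `plan_sit_flush` is hypothetical, and the value 55 for entries per SIT block is an assumption used only to make the example concrete.

```python
from collections import defaultdict


def plan_sit_flush(dirty_segnos, entries_per_block, journal_free_slots):
    """Split dirty SIT entries between the journal and on-disk SIT blocks.

    Returns (journaled_segnos, block_numbers_to_write).
    """
    # Account dirty entries that live in the same SIT block to one set.
    sets = defaultdict(list)
    for segno in sorted(dirty_segnos):
        sets[segno // entries_per_block].append(segno)

    # Sort the sets in ascending order of entry count (sparse sets first).
    ordered = sorted(sets.items(), key=lambda kv: (len(kv[1]), kv[0]))

    journaled, blocks = [], []
    for block_no, segnos in ordered:
        if len(segnos) <= journal_free_slots:
            # Sparse set: cheaper to keep in the journal than to rewrite
            # a whole SIT block for a few entries.
            journaled.extend(segnos)
            journal_free_slots -= len(segnos)
        else:
            # Dense set: one block write covers all merged entries.
            blocks.append(block_no)
    return journaled, blocks


journaled, blocks = plan_sit_flush(
    dirty_segnos=[0, 1, 2, 3, 60, 120, 121],
    entries_per_block=55,      # assumed value for illustration
    journal_free_slots=3,
)
```

Here the sets for blocks 1 and 2 (one and two entries) fit into the journal, and only block 0 with four merged entries is written to disk.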

f2fs: need fsck.f2fs when f2fs_bug_on is triggered · 9850cf4a

Authored by Jaegeuk Kim on Sep 02, 2014

If any f2fs_bug_on is triggered, fsck.f2fs is needed.
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: retain inconsistency information to initiate fsck.f2fs · 2ae4c673

Authored by Jaegeuk Kim on Sep 02, 2014

This patch adds sbi->need_fsck to conduct fsck.f2fs later.
This flag can only be removed by fsck.f2fs.
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

04 Sep, 2014 1 commit

f2fs: introduce F2FS_I_SB, F2FS_M_SB, and F2FS_P_SB · 4081363f

Authored by Jaegeuk Kim on Sep 02, 2014

This patch adds three inline functions to clean up dirty casting code.
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

22 Aug, 2014 5 commits

f2fs: remove rewrite_node_page · 202095a7

Authored by Jaegeuk Kim on Aug 15, 2014

I think we need to let the dirty node pages remain in the page cache instead
of rewriting them in their places.
So, after a successful recovery, write_checkpoint will flush all of them
through the normal write path.
Through this, we can avoid potential error cases in terms of block allocation.
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: avoid double lock in truncate_blocks · 764aa3e9

Authored by Jaegeuk Kim on Aug 14, 2014

init_inode_metadata calls truncate_blocks when an error occurs.
The callers hold f2fs_lock_op, so we should not call it again in
truncate_blocks.
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: add WARN_ON in f2fs_bug_on · b3fe0a0d

Authored by Jaegeuk Kim on Aug 13, 2014

This patch adds WARN_ON when f2fs_bug_on is disabled, so that kernel messages can be seen.
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: introduce f2fs_cp_error for readability · 1e968fdf

Authored by Jaegeuk Kim on Aug 11, 2014

This patch adds f2fs_cp_error for readability.
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: trigger release_dirty_inode in f2fs_put_super · 6f12ac25

Authored by Jaegeuk Kim on Aug 19, 2014

generic_shutdown_super calls sync_filesystem, evict_inode, and then
f2fs_put_super. In f2fs_evict_inode, we retain some dirty inode information,
so we should release it in f2fs_put_super.
Reviewed-by: Chao Yu <chao2.yu@samsung.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

20 Aug, 2014 4 commits

f2fs: fix to recover inline_xattr/data and blocks · 1c35a90e

Authored by Jaegeuk Kim on Aug 07, 2014

This patch fixes the code so that xattr recovery is not skipped, and fixes the
inline xattr/data recovery order.
Reviewed-by: Chao Yu <chao2.yu@samsung.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: make clear on test condition and return types · 0342fd30

Authored by Jaegeuk Kim on Aug 07, 2014

This patch adds parentheses to make the condition check clear.
It also changes the return type to convey its meaning better.
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: should convert inline_data during the mkwrite · b067ba1f

Authored by Jaegeuk Kim on Aug 07, 2014

If mkwrite is called on an inode having inline_data, it can overwrite the data
index space as NEW_ADDR (e.g., when the first 4 bytes are coincidentally zero).
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: fix typo · e1c42045

Authored by arter97 on Aug 06, 2014

Fix typo and some grammatical errors.

The words "filesystem" and "readahead" are being used without the space treewide.
Signed-off-by: Park Ju Hyung <qkrwngud825@gmail.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

02 Aug, 2014 1 commit

f2fs: avoid skipping recover_inline_xattr after recover_inline_data · 70cfed88

Authored by Chao Yu on Aug 02, 2014

When we recover the data of an inode in the roll-forward procedure, and the inode
has both inline data and inline xattr, we may skip recovering the inline xattr if
we recover the inline data from the node page first.
This patch fixes the problem that we lose inline xattr data in the above
scenario.
Signed-off-by: Chao Yu <chao2.yu@samsung.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

31 Jul, 2014 5 commits

f2fs: reduce competition among node page writes · b3582c68

Authored by Chao Yu on Jul 03, 2014

We do not need to block on ->node_write among different node page writers e.g.
fsync/flush, unless we have a node page writer from write_checkpoint.
So it's better to use an rw_semaphore instead of a mutex for ->node_write to
improve performance.
Signed-off-by: Chao Yu <chao2.yu@samsung.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: fix coding style · 65b85ccc

Authored by Jaegeuk Kim on Jul 30, 2014

This patch fixes wrong coding style.
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: avoid retrying wrong recovery routine when error was occurred · cf2271e7

Authored by Jaegeuk Kim on Jul 25, 2014

This patch eliminates the propagation of recovery errors to the next mount.
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: test before set/clear bits · 61e0f2d0

Authored by Jaegeuk Kim on Jul 25, 2014

If the bit is already set, we don't need to set it again, and vice versa,
because we don't need to make the caches dirty for that.
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>
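
The idea can be shown with a toy bitmap that counts how often its storage is actually written. This is an illustrative model only; the class `DirtyTrackingBitmap` and its `writes` counter are hypothetical and merely stand in for the cache lines that a redundant store would dirty.

```python
class DirtyTrackingBitmap:
    """Bitmap that tests a bit before writing it and counts real writes."""

    def __init__(self, nbits: int) -> None:
        self.bits = bytearray((nbits + 7) // 8)
        self.writes = 0  # number of stores that changed the bitmap

    def test(self, n: int) -> bool:
        return bool(self.bits[n >> 3] & (1 << (n & 7)))

    def set(self, n: int) -> bool:
        """Set bit n; return True only if a store was needed."""
        if self.test(n):
            return False  # already set: skip the write entirely
        self.bits[n >> 3] |= 1 << (n & 7)
        self.writes += 1
        return True

    def clear(self, n: int) -> bool:
        """Clear bit n; return True only if a store was needed."""
        if not self.test(n):
            return False  # already clear: skip the write entirely
        self.bits[n >> 3] &= ~(1 << (n & 7)) & 0xFF
        self.writes += 1
        return True


bitmap = DirtyTrackingBitmap(16)
first = bitmap.set(5)    # performs a store
second = bitmap.set(5)   # no store, the bit is already set
```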

f2fs: enable in-place-update for fdatasync · ea1aa12c

Authored by Jaegeuk Kim on Jul 24, 2014

This patch enforces in-place-updates only when fdatasync is requested.
If we adopt in-place-updates for fdatasync, we can skip writing the
recovery information.
Reviewed-by: Chao Yu <chao2.yu@samsung.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

29 Jul, 2014 4 commits

f2fs: add info of appended or updated data writes · fff04f90

Authored by Jaegeuk Kim on Jul 25, 2014

This patch introduces an inode number list which represents inodes having
appended data writes or updated data writes since the last checkpoint.
This will be used at fsync to determine whether the recovery information
should be written or not.
Reviewed-by: Chao Yu <chao2.yu@samsung.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: use radix_tree for ino management · 39efac41

Authored by Jaegeuk Kim on Jul 24, 2014

For better ino management, this patch changes the data structure from a list
to a radix tree.
Reviewed-by: Chao Yu <chao2.yu@samsung.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: add infra for ino management · 6451e041

Authored by Jaegeuk Kim on Jul 25, 2014

This patch renames the orphan-related data structures so that they can be used
for globally managed inode numbers.
Later, we can use this facility for managing any inode number lists.
Reviewed-by: Chao Yu <chao2.yu@samsung.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: add nobarrier mount option · 0f7b2abd

Authored by Jaegeuk Kim on Jul 23, 2014

This patch adds a mount option, nobarrier, in f2fs.
The assumption here is that the file system keeps the IO ordering, but
doesn't care about cache flushes inside the storage device.
Reviewed-by: Chao Yu <chao2.yu@samsung.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

12 Jul, 2014 1 commit

f2fs: remove the unused stat_lock · 4b2868aa

Authored by Gu Zheng on Jul 11, 2014

Signed-off-by: Gu Zheng <guz.fnst@cn.fujitsu.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

10 Jul, 2014 4 commits

f2fs: arguments cleanup of finding file flow functions · eee6160f

Authored by Gu Zheng on Jun 24, 2014

Signed-off-by: Gu Zheng <guz.fnst@cn.fujitsu.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: refactor flush_nat_entries codes for reducing NAT writes · aec71382

Authored by Chao Yu on Jun 24, 2014

Although building the NAT journal in cursum reduces the read/write work for NAT
blocks, the previous design leaves us with lower performance when checkpoints are
written frequently, in these cases:
1. If the journal in cursum is already full, it is a bit of a waste that we flush
   all nat entries to pages for persistence without caching any entries.
2. If the journal in cursum is not full, we fill nat entries into the journal until
   the journal is full, then flush the remaining dirty entries to disk without
   merging the journaled entries, so these journaled entries may be flushed to disk
   at the next checkpoint, having lost the chance to be flushed last time.

In this patch we merge dirty entries located in the same NAT block into a nat entry
set, and link all sets to a list, sorted in ascending order by the entry count of
each set. Later we flush entries from the sparse sets into the journal, as many as
we can, and then flush the merged entries to disk. In this way we not only gain
performance, but also extend the lifetime of the flash device.

In my testing environment, this patch noticeably reduces NAT block
writes. In the hard disk test case, the time taken by fsstress is consistently
reduced by about 5%.

1. virtual machine + hard disk:
fsstress -p 20 -n 200 -l 5
		node num	cp count	nodes/cp
based		4599.6		1803.0		2.551
patched		2714.6		1829.6		1.483

2. virtual machine + 32g micro SD card:
fsstress -p 20 -n 200 -l 1 -w -f chown=0 -f creat=4 -f dwrite=0
-f fdatasync=4 -f fsync=4 -f link=0 -f mkdir=4 -f mknod=4 -f rename=5
-f rmdir=5 -f symlink=0 -f truncate=4 -f unlink=5 -f write=0 -S

		node num	cp count	nodes/cp
based		84.5		43.7		1.933
patched		49.2		40.0		1.23

The latency of the merging op is acceptable even in an extreme case such as
merging a great number of dirty nats:
latency(ns)	dirty nat count
3089219		24922
5129423		27422
4000250		24523

change log from v1:
 o fix wrong logic in add_nat_entry when grabbing a new nat entry set.
 o switch to creating the slab cache in create_node_manager_caches.
 o use GFP_ATOMIC instead of GFP_NOFS to avoid potential long latency.

change log from v2:
 o make comment position more appropriate, as suggested by Jaegeuk Kim.
Signed-off-by: Chao Yu <chao2.yu@samsung.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: clean up an unused parameter and assignment · a014e037

Authored by Jaegeuk Kim on Jun 20, 2014

This patch cleans up some simple unnecessary code.
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

f2fs: introduce f2fs_do_tmpfile for code consistency · b97a9b5d

Authored by Jaegeuk Kim on Jun 20, 2014

This patch adds f2fs_do_tmpfile to eliminate the redundant init_inode_metadata
flow.
Through this, we can provide consistent lock usage, e.g., fi->i_sem, and
this will enable better debugging.
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

09 Jul, 2014 2 commits

f2fs: check lower bound nid value in check_nid_range · d6b7d4b3

Authored by Chao Yu on Jun 12, 2014

This patch adds lower bound verification for nid in check_nid_range, so that
reserved nids such as 0, node, and meta passed by the caller can be checked there.

check_nid_range can then be used in f2fs_nfs_get_inode to simplify the
code.
Signed-off-by: Chao Yu <chao2.yu@samsung.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>
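
A range check with both bounds might look like the sketch below. The concrete numbers are assumptions for illustration: it supposes that nid 0 is invalid, that the node and meta inodes use nids 1 and 2, and that the root inode at nid 3 is the first nid a caller may legitimately pass. The real values come from the superblock.

```python
# Assumed reserved layout: 0 = invalid, 1 = node inode, 2 = meta inode.
ASSUMED_ROOT_INO = 3


def check_nid_range(nid: int, max_nid: int,
                    root_ino: int = ASSUMED_ROOT_INO) -> bool:
    """Return True if nid is neither reserved nor beyond the NAT range."""
    if nid < root_ino:      # lower bound: rejects 0, node, and meta nids
        return False
    if nid >= max_nid:      # upper bound check
        return False
    return True
```

A caller such as an NFS file-handle lookup can then rely on this single helper instead of open-coding comparisons against each reserved nid.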

f2fs: remove unused variables in f2fs_sm_info · 8bc6f60e

Authored by Chao Yu on Jun 11, 2014

Remove unused variables in struct f2fs_sm_info.
Signed-off-by: Chao Yu <chao2.yu@samsung.com>
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

08 Jun, 2014 1 commit

f2fs: support f2fs_fiemap · 9ab70134

Authored by Jaegeuk Kim on Jun 08, 2014

This patch links f2fs_fiemap with the generic function, using get_block.
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>

04 Jun, 2014 1 commit

f2fs: fix to recover data written by dio · b6fe5873

Authored by Jaegeuk Kim on Jun 04, 2014

If data are overwritten through dio, f2fs previously did not leave the fsync mark,
since there were no additional node writes.

Note that this patch should resolve the xfstests:311.
Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>