提交 · a82afa20197a2ed289dd8fd18208a9e8b9af0130 · openeuler / raspberrypi-kernel

04 11月, 2014 15 次提交

f2fs: reuse room_for_filename for inline dentry operation · a82afa20

由 Jaegeuk Kim 提交于 10月 13, 2014

This patch introduces to reuse the existing room_for_filename for inline dentry
operation.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

a82afa20

f2fs: enable inline dir handling · 622f28ae

由 Chao Yu 提交于 9月 24, 2014

Add inline dir functions into normal dir ops' function to handle inline ops.
Besides, we enable inline dir mode when a new dir inode is created if
inline_data option is on.
Signed-off-by: NChao Yu <chao2.yu@samsung.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

622f28ae

f2fs: add key function to handle inline dir · 201a05be

由 Chao Yu 提交于 9月 24, 2014

Adds Functions to implement inline dir init/lookup/insert/delete/convert ops.
Signed-off-by: NChao Yu <chao2.yu@samsung.com>
[Jaegeuk Kim: remove needless reserved area copy, pointed by Dan Carpenter]
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

201a05be

f2fs: export dir operations for inline dir · dbeacf02

由 Chao Yu 提交于 9月 24, 2014

This patch exports some dir operations for inline dir, additionally introduces
f2fs_drop_nlink from f2fs_delete_entry for reusing by inline dir function.
Signed-off-by: NChao Yu <chao2.yu@samsung.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

dbeacf02

f2fs: add a new mount option for inline dir · 5efd3c6f

由 Chao Yu 提交于 9月 24, 2014

Adds a new mount option 'inline_dentry' for inline dir.
Signed-off-by: NChao Yu <chao2.yu@samsung.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

5efd3c6f

f2fs: add infra struct and helper for inline dir · 34d67deb

由 Chao Yu 提交于 9月 24, 2014

This patch defines macro/inline dentry structure, and adds some helpers for
inline dir infrastructure.
Signed-off-by: NChao Yu <chao2.yu@samsung.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

34d67deb

f2fs: avoid infinite loop at cp_error · af41d3ee

由 Jaegeuk Kim 提交于 10月 17, 2014

This patch avoids an infinite loop in sync_dirty_inode_page when -EIO was
detected.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

af41d3ee

f2fs: avoid build warning · 4a257ed6

由 Jaegeuk Kim 提交于 10月 16, 2014

This patch removes build warning.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

4a257ed6

f2fs: fix to call f2fs_unlock_op · 13fd8f89

由 Jaegeuk Kim 提交于 10月 19, 2014

This patch fixes to call f2fs_unlock_op, which was missing before.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

13fd8f89

f2fs: avoid to allocate when inline_data was written · 9ba69cf9

由 Jaegeuk Kim 提交于 10月 17, 2014

The sceanrio is like this.
inline_data   i_size     page                 write_begin/vm_page_mkwrite
  X             30       dirty_page
  X             30                            write to #4096 position
  X             30       get_dnode_of_data    wait for get_dnode_of_data
  O             30       write inline_data
  O             30                            get_dnode_of_data
  O             30                            reserve data block
..

In this case, we have #0 = NEW_ADDR and inline_data as well.
We should not allow this condition for further access.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

9ba69cf9

f2fs: use highmem for directory pages · a78186eb

由 Jaegeuk Kim 提交于 10月 17, 2014

This patch fixes to use highmem for directory pages.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

a78186eb

f2fs: fix race conditon on truncation with inline_data · 1ce86bf6

由 Jaegeuk Kim 提交于 10月 15, 2014

Let's consider the following scenario.

blkaddr[0] inline_data i_size  i_blocks writepage           truncate
  NEW        X        4096        2    dirty page #0
  NEW        X         0                                    change i_size
  NEW        X         0          2    f2fs_write_inline_data
  NEW        X         0          2    get_dnode_of_data
  NEW        X         0          2    truncate_data_blocks_range
  NULL       O         0          1    memcpy(inline_data)
  NULL       O         0          1    f2fs_put_dnode
  NULL       O         0          1                         f2fs_truncate
  NULL       O         0          1                         get_dnode_of_data
  NULL       O         0          1                       *invalid block addr*

This patch adds checking inline_data flag during f2fs_truncate not to refer
corrupted block indices.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

1ce86bf6

f2fs: should truncate any allocated block for inline_data write · c08a690b

由 Jaegeuk Kim 提交于 10月 15, 2014

When trying to write inline_data, we should truncate any data block allocated
and pointed by the inode block.
We should consider the data index is not 0.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

c08a690b

f2fs: invalidate inmemory page · cbcb2872

由 Jaegeuk Kim 提交于 10月 09, 2014

If user truncates file's data, we should truncate inmemory pages too.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

cbcb2872

f2fs: do not make dirty any inmemory pages · 34ba94ba

由 Jaegeuk Kim 提交于 10月 09, 2014

This patch let inmemory pages be clean all the time.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

34ba94ba

08 10月, 2014 1 次提交

f2fs: support volatile operations for transient data · 02a1335f

由 Jaegeuk Kim 提交于 10月 06, 2014

This patch adds support for volatile writes which keep data pages in memory
until f2fs_evict_inode is called by iput.

For instance, we can use this feature for the sqlite database as follows.
While supporting atomic writes for main database file, we can keep its journal
data temporarily in the page cache by the following sequence.

1. open
 -> ioctl(F2FS_IOC_START_VOLATILE_WRITE);
2. writes
 : keep all the data in the page cache.
3. flush to the database file with atomic writes
  a. ioctl(F2FS_IOC_START_ATOMIC_WRITE);
  b. writes
  c. ioctl(F2FS_IOC_COMMIT_ATOMIC_WRITE);
4. close
 -> drop the cached data
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

02a1335f

07 10月, 2014 1 次提交

f2fs: support atomic writes · 88b88a66

由 Jaegeuk Kim 提交于 10月 06, 2014

This patch introduces a very limited functionality for atomic write support.
In order to support atomic write, this patch adds two ioctls:
 o F2FS_IOC_START_ATOMIC_WRITE
 o F2FS_IOC_COMMIT_ATOMIC_WRITE

The database engine should be aware of the following sequence.
1. open
 -> ioctl(F2FS_IOC_START_ATOMIC_WRITE);
2. writes
  : all the written data will be treated as atomic pages.
3. commit
 -> ioctl(F2FS_IOC_COMMIT_ATOMIC_WRITE);
  : this flushes all the data blocks to the disk, which will be shown all or
  nothing by f2fs recovery procedure.
4. repeat to #2.

The IO pattens should be:

  ,- START_ATOMIC_WRITE                  ,- COMMIT_ATOMIC_WRITE
 CP | D D D D D D | FSYNC | D D D D | FSYNC ...
                      `- COMMIT_ATOMIC_WRITE
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

88b88a66

06 10月, 2014 1 次提交

f2fs: remove unused return value · 120c2cba

由 Jaegeuk Kim 提交于 10月 03, 2014

Don't return any value without any usage.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

120c2cba

01 10月, 2014 7 次提交

f2fs: clean up f2fs_ioctl functions · 52656e6c

由 Jaegeuk Kim 提交于 9月 24, 2014

This patch cleans up f2fs_ioctl functions for better readability.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

52656e6c

f2fs: potential shift wrapping buf in f2fs_trim_fs() · 8a21984d

由 Dan Carpenter 提交于 9月 25, 2014

My static checker complains that segment is a u64 but only the lower 31
bits can be used before we hit a shift wrapping bug.
Signed-off-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

8a21984d

f2fs: call f2fs_unlock_op after error was handled · 44c16156

由 Jaegeuk Kim 提交于 9月 25, 2014

This patch relocates f2fs_unlock_op in every directory operations to be called
after any error was processed.
Otherwise, the checkpoint can be entered with valid node ids without its
dentry when -ENOSPC is occurred.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

44c16156

f2fs: check the use of macros on block counts and addresses · 7cd8558b

由 Jaegeuk Kim 提交于 9月 23, 2014

This patch cleans up the existing and new macros for readability.

Rule is like this.

         ,-----------------------------------------> MAX_BLKADDR -,
         |  ,------------- TOTAL_BLKS ----------------------------,
         |  |                                                     |
         |  ,- seg0_blkaddr   ,----- sit/nat/ssa/main blkaddress  |
block    |  | (SEG0_BLKADDR)  | | | |   (e.g., MAIN_BLKADDR)      |
address  0..x................ a b c d .............................
            |                                                     |
global seg# 0...................... m .............................
            |                       |                             |
            |                       `------- MAIN_SEGS -----------'
            `-------------- TOTAL_SEGS ---------------------------'
                                    |                             |
 seg#                               0..........xx..................

= Note =
 o GET_SEGNO_FROM_SEG0 : blk address -> global segno
 o GET_SEGNO           : blk address -> segno
 o START_BLOCK         : segno -> starting block address
Reviewed-by: NChao Yu <chao2.yu@samsung.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

7cd8558b

f2fs: refactor flush_nat_entries to remove costly reorganizing ops · 309cc2b6

由 Jaegeuk Kim 提交于 9月 22, 2014

Previously, f2fs tries to reorganize the dirty nat entries into multiple sets
according to its nid ranges. This can improve the flushing nat pages, however,
if there are a lot of cached nat entries, it becomes a bottleneck.

This patch introduces a new set management flow by removing dirty nat list and
adding a series of set operations when the nat entry becomes dirty.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

309cc2b6

f2fs: introduce FITRIM in f2fs_ioctl · 4b2fecc8

由 Jaegeuk Kim 提交于 9月 20, 2014

This patch introduces FITRIM in f2fs_ioctl.
In this case, f2fs will issue small discards and prefree discards as many as
possible for the given area.
Reviewed-by: NChao Yu <chao2.yu@samsung.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

4b2fecc8

f2fs: introduce cp_control structure · 75ab4cb8

由 Jaegeuk Kim 提交于 9月 20, 2014

This patch add a new data structure to control checkpoint parameters.
Currently, it presents the reason of checkpoint such as is_umount and normal
sync.
Reviewed-by: NChao Yu <chao2.yu@samsung.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

75ab4cb8

24 9月, 2014 15 次提交

f2fs: use more free segments until SSR is activated · 95dd8973

由 Jaegeuk Kim 提交于 9月 17, 2014

Previously, f2fs activates SSR if the # of free segments reaches to the # of
overprovisioned segments.
In this case, SSR starts to use dirty segments only, so that the overprovisoned
space cannot be selected for new data.
This means that we have no chance to utilizae the overprovisioned space at all.

This patch fixes that by allowing LFS allocations until the # of free segments
reaches to the last threshold, reserved space.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

95dd8973

f2fs: change the ipu_policy option to enable combinations · 9b5f136f

由 Jaegeuk Kim 提交于 9月 16, 2014

This patch changes the ipu_policy setting to use any combination of orthogonal policies.
Signed-off-by: NChangman Lee <cm224.lee@samsung.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

9b5f136f

f2fs: fix to search whole dirty segmap when get_victim · 210f41bc

由 Chao Yu 提交于 9月 15, 2014

In ->get_victim we get max_search value from dirty_i->nr_dirty without
protection of seglist_lock, after that, nr_dirty can be increased/decreased
before we hold seglist_lock lock.
Then in main loop we attempt to traverse all dirty section one time to find
victim section, but it's not accurate to use max_search as the total loop count,
because we might lose checking several sections or check sections redundantly
for the case of nr_dirty are increased or decreased previously.
Signed-off-by: NChao Yu <chao2.yu@samsung.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

210f41bc

f2fs: fix to clean previous mount option when remount_fs · 26666c8a

由 Chao Yu 提交于 9月 15, 2014

In manual of mount, we descript remount as below:

"mount -o remount,rw /dev/foo /dir
After  this call all old mount options are replaced and arbitrary stuff from
fstab is ignored, except the loop= option which is internally generated and
maintained by the mount command."

Previously f2fs do not clear up old mount options when remount_fs, so we have no
chance of disabling previous option (e.g. flush_merge). Fix it.
Signed-off-by: NChao Yu <chao2.yu@samsung.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

26666c8a

f2fs: skip punching hole in special condition · 14cecc5c

由 Chao Yu 提交于 9月 15, 2014

Now punching hole in directory is not supported in f2fs, so let's limit file
type in punch_hole().

In addition, in punch_hole if offset is exceed file size, we should skip
punching hole.
Signed-off-by: NChao Yu <chao2.yu@samsung.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

14cecc5c

f2fs: support large sector size · 55cf9cb6

由 Chao Yu 提交于 9月 15, 2014

Block size in f2fs is 4096 bytes, so theoretically, f2fs can support 4096 bytes
sector device at maximum. But now f2fs only support 512 bytes size sector, so
block device such as zRAM which uses page cache as its block storage space will
not be mounted successfully as mismatch between sector size of zRAM and sector
size of f2fs supported.

In this patch we support large sector size in f2fs, so block device with sector
size of 512/1024/2048/4096 bytes can be supported in f2fs.
Signed-off-by: NChao Yu <chao2.yu@samsung.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

55cf9cb6

f2fs: fix to truncate blocks past EOF in ->setattr · 09db6a2e

由 Chao Yu 提交于 9月 15, 2014

By using FALLOC_FL_KEEP_SIZE in ->fallocate of f2fs, we can fallocate block past
EOF without changing i_size of inode. These blocks past EOF will not be
truncated in ->setattr as we truncate them only when change the file size.

We should give a chance to truncate blocks out of filesize in setattr().
Signed-off-by: NChao Yu <chao2.yu@samsung.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

09db6a2e

f2fs: update i_size when __allocate_data_block · 976e4c50

由 Jaegeuk Kim 提交于 9月 15, 2014

The f2fs_direct_IO uses __allocate_data_block, but inside the allocation path,
we should update i_size at the changed time to update its inode page.
Otherwise, we can get wrong i_size after roll-forward recovery.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

976e4c50

f2fs: use MAX_BIO_BLOCKS(sbi) · 90a893c7

由 Jaegeuk Kim 提交于 9月 22, 2014

This patch cleans up a simple macro.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

90a893c7

f2fs: remove redundant operation during roll-forward recovery · c52e1b10

由 Jaegeuk Kim 提交于 9月 11, 2014

If same data is updated multiple times, we don't need to redo whole the
operations.
Let's just update the lastest one.
Reviewed-by: NChao Yu <chao2.yu@samsung.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

c52e1b10

f2fs: do not skip latest inode information · 19c9c466

由 Jaegeuk Kim 提交于 9月 10, 2014

In f2fs_sync_file, if there is no written appended writes, it skips
to write its node blocks.
But, if there is up-to-date inode page, we should write it to update
its metadata during the roll-forward recovery.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

19c9c466

f2fs: fix roll-forward missing scenarios · 441ac5cb

由 Jaegeuk Kim 提交于 9月 15, 2014

We can summarize the roll forward recovery scenarios as follows.

[Term] F: fsync_mark, D: dentry_mark

1. inode(x) | CP | inode(x) | dnode(F)
-> Update the latest inode(x).

2. inode(x) | CP | inode(F) | dnode(F)
-> No problem.

3. inode(x) | CP | dnode(F) | inode(x)
-> Recover to the latest dnode(F), and drop the last inode(x)

4. inode(x) | CP | dnode(F) | inode(F)
-> No problem.

5. CP | inode(x) | dnode(F)
-> The inode(DF) was missing. Should drop this dnode(F).

6. CP | inode(DF) | dnode(F)
-> No problem.

7. CP | dnode(F) | inode(DF)
-> If f2fs_iget fails, then goto next to find inode(DF).

8. CP | dnode(F) | inode(x)
-> If f2fs_iget fails, then goto next to find inode(DF).
   But it will fail due to no inode(DF).

So, this patch adds some missing points such as #1, #5, #7, and #8.
Signed-off-by: NHuang Ying <ying.huang@intel.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

441ac5cb

f2fs: fix conditions to remain recovery information in f2fs_sync_file · 88bd02c9

由 Jaegeuk Kim 提交于 9月 15, 2014

This patch revisited whole the recovery information during the f2fs_sync_file.

In this patch, there are three information to make a decision.

a) IS_CHECKPOINTED,	/* is it checkpointed before? */
b) HAS_FSYNCED_INODE,	/* is the inode fsynced before? */
c) HAS_LAST_FSYNC,	/* has the latest node fsync mark? */

And, the scenarios for our rule are based on:

[Term] F: fsync_mark, D: dentry_mark

1. inode(x) | CP | inode(x) | dnode(F)
2. inode(x) | CP | inode(F) | dnode(F)
3. inode(x) | CP | dnode(F) | inode(x) | inode(F)
4. inode(x) | CP | dnode(F) | inode(F)
5. CP | inode(x) | dnode(F) | inode(DF)
6. CP | inode(DF) | dnode(F)
7. CP | dnode(F) | inode(DF)
8. CP | dnode(F) | inode(x) | inode(DF)

For example, #3, the three conditions should be changed as follows.

   inode(x) | CP | dnode(F) | inode(x) | inode(F)
a)    x       o      o          o          o
b)    x       x      x          x          o
c)    x       o      o          x          o

If f2fs_sync_file stops   ------^,
 it should write inode(F)    --------------^

So, the need_inode_block_update should return true, since
 c) get_nat_flag(e, HAS_LAST_FSYNC), is false.

For example, #8,
      CP | alloc | dnode(F) | inode(x) | inode(DF)
a)    o      x        x          x          x
b)    x               x          x          o
c)    o               o          x          o

If f2fs_sync_file stops   -------^,
 it should write inode(DF)    --------------^

Note that, the roll-forward policy should follow this rule, which means,
if there are any missing blocks, we doesn't need to recover that inode.
Signed-off-by: NHuang Ying <ying.huang@intel.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

88bd02c9

f2fs: introduce a flag to represent each nat entry information · 7ef35e3b

由 Jaegeuk Kim 提交于 9月 15, 2014

This patch introduces a flag in the nat entry structure to merge various
information such as checkpointed and fsync_done marks.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

7ef35e3b

f2fs: use meta_inode cache to improve roll-forward speed · 4c521f49

由 Jaegeuk Kim 提交于 9月 11, 2014

Previously, all the dnode pages should be read during the roll-forward recovery.
Even worsely, whole the chain was traversed twice.
This patch removes that redundant and costly read operations by using page cache
of meta_inode and readahead function as well.
Reviewed-by: NChao Yu <chao2.yu@samsung.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

4c521f49