提交 · 26a28a0c1eb756ba18bfb1f93309c4b4406b9cd9 · openeuler / Kernel

29 1月, 2017 12 次提交

f2fs: show the max number of atomic operations · 26a28a0c

由 Jaegeuk Kim 提交于 12月 28, 2016

This patch adds to show the max number of atomic operations which are
conducting concurrently.
Reviewed-by: NChao Yu <yuchao0@huawei.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

26a28a0c

f2fs: get io size bit from mount option · ec91538d

由 Jaegeuk Kim 提交于 12月 21, 2016

This patch adds to set io_size_bits from mount option.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

ec91538d

f2fs: support IO alignment for DATA and NODE writes · 0a595eba

由 Jaegeuk Kim 提交于 12月 14, 2016

This patch implements IO alignment by filling dummy blocks in DATA and NODE
write bios. If we can guarantee, for example, 32KB or 64KB for such the IOs,
we can eliminate underlying dummy page problem which FTL conducts in order to
close MLC or TLC partial written pages.

Note that,
 - it requires "-o mode=lfs".
 - IO size should be power of 2, not exceed BIO_MAX_PAGES, 256.
 - read IO is still 4KB.
 - do checkpoint at fsync, if dummy NODE page was written.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

0a595eba

f2fs: add submit_bio tracepoint · 554b5125

由 Jaegeuk Kim 提交于 12月 21, 2016

This patch adds final submit_bio() tracepoint.
Reviewed-by: NChao Yu <yuchao0@huawei.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

554b5125

f2fs: reassign new segment for mode=lfs · 9d52a504

由 Jaegeuk Kim 提交于 12月 21, 2016

Otherwise we can remain wrong curseg->next_blkoff, resulting in fsck failure.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

9d52a504

f2fs: fix a missing discard prefree segments · 650d3c4e

由 Yunlei He 提交于 12月 22, 2016

If userspace issue a fstrim with a range not involve prefree segments,
it will reuse these segments without discard. This patch fix it.
Signed-off-by: NYunlei He <heyunlei@huawei.com>
Reviewed-by: NChao Yu <yuchao0@huawei.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

650d3c4e

f2fs: use rb_entry_safe · ed0b5620

由 Geliang Tang 提交于 12月 20, 2016

Use rb_entry_safe() instead of open-coding it.
Signed-off-by: NGeliang Tang <geliangtang@gmail.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

ed0b5620

f2fs: add a case of no need to read a page in write begin · 746e2403

由 Yunlei He 提交于 12月 20, 2016

If the range we write cover the whole valid data in the last page,
we do not need to read it.
Signed-off-by: NYunlei He <heyunlei@huawei.com>
[Jaegeuk Kim: nullify the remaining area (fix: xfstests/f2fs/001)]
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

746e2403

f2fs: fix a problem of using memory after free · 7855eba4

由 Yunlei He 提交于 12月 19, 2016

This patch fix a problem of using memory after free
in function __try_merge_extent_node.

Fixes: 0f825ee6 ("f2fs: add new interfaces for extent tree")
Cc: <stable@vger.kernel.org>
Signed-off-by: NYunlei He <heyunlei@huawei.com>
Reviewed-by: NChao Yu <yuchao0@huawei.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

7855eba4

f2fs: remove unneeded condition · 07fe8d44

由 Dan Carpenter 提交于 12月 16, 2016

We checked that "inode" is not an error pointer earlier so there is
no need to check again here.
Signed-off-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

07fe8d44

f2fs: don't cache nat entry if out of memory · 5c9e4184

由 Chao Yu 提交于 12月 13, 2016

If we run out of memory, in cache_nat_entry, it's better to avoid loop
for allocating memory to cache nat entry, so in low memory scenario, for
read path of node block, I expect this can avoid unneeded latency.
Signed-off-by: NChao Yu <yuchao0@huawei.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

5c9e4184

f2fs: remove unused values in recover_fsync_data · fed24668

由 Yunlei He 提交于 12月 13, 2016

This patch remove unused values in function recover_fsync_data
Signed-off-by: NYunlei He <heyunlei@huawei.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

fed24668

12 1月, 2017 1 次提交

block: Rename blk_queue_zone_size and bdev_zone_size · f99e8648

由 Damien Le Moal 提交于 1月 12, 2017

All block device data fields and functions returning a number of 512B
sectors are by convention named xxx_sectors while names in the form
xxx_size are generally used for a number of bytes. The blk_queue_zone_size
and bdev_zone_size functions were not following this convention so rename
them.

No functional change is introduced by this patch.
Signed-off-by: NDamien Le Moal <damien.lemoal@wdc.com>

Collapsed the two patches, they were nonsensically split and broke
bisection.
Signed-off-by: NJens Axboe <axboe@fb.com>

f99e8648

13 12月, 2016 1 次提交

f2fs: fix a missing size change in f2fs_setattr · c0ed4405

由 Yunlei He 提交于 12月 11, 2016

This patch fix a missing size change in f2fs_setattr
Signed-off-by: NYunlei He <heyunlei@huawei.com>
Reviewed-by: NChao Yu <yuchao0@huawei.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

c0ed4405

12 12月, 2016 2 次提交

fscrypt: Cleanup page locking requirements for fscrypt_{decrypt,encrypt}_page() · bd7b8290

由 David Gstir 提交于 12月 06, 2016

Rename the FS_CFLG_INPLACE_ENCRYPTION flag to FS_CFLG_OWN_PAGES which,
when set, indicates that the fs uses pages under its own control as
opposed to writeback pages which require locking and a bounce buffer for
encryption.
Signed-off-by: NDavid Gstir <david@sigma-star.at>
Signed-off-by: NTheodore Ts'o <tytso@mit.edu>

bd7b8290

fscrypto: move ioctl processing more fully into common code · db717d8e

由 Eric Biggers 提交于 11月 26, 2016

Multiple bugs were recently fixed in the "set encryption policy" ioctl.
To make it clear that fscrypt_process_policy() and fscrypt_get_policy()
implement ioctls and therefore their implementations must take standard
security and correctness precautions, rename them to
fscrypt_ioctl_set_policy() and fscrypt_ioctl_get_policy(). Make the
latter take in a struct file * to make it consistent with the former.
Signed-off-by: NEric Biggers <ebiggers@google.com>
Signed-off-by: NTheodore Ts'o <tytso@mit.edu>

db717d8e

09 12月, 2016 1 次提交

vfs: remove ".readlink = generic_readlink" assignments · dfeef688

由 Miklos Szeredi 提交于 12月 09, 2016

If .readlink == NULL implies generic_readlink().

Generated by:

to_del="\.readlink.*=.*generic_readlink"
for i in `git grep -l $to_del`; do sed -i "/$to_del"/d $i; done
Signed-off-by: NMiklos Szeredi <mszeredi@redhat.com>

dfeef688

08 12月, 2016 3 次提交

f2fs: fix to access nullified flush_cmd_control pointer · 5eba8c5d

由 Jaegeuk Kim 提交于 12月 07, 2016

f2fs_sync_file()             remount_ro
 - f2fs_readonly
                               - destroy_flush_cmd_control
 - f2fs_issue_flush
   - no fcc pointer!

So, this patch doesn't free fcc in this case, but just stop its kernel thread
which sends flush commands.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

5eba8c5d

f2fs: free meta pages if sanity check for ckpt is failed · a2125ff7

由 Jaegeuk Kim 提交于 12月 05, 2016

This fixes missing freeing meta pages in the error case.
Tested-by: NEric Biggers <ebiggers@google.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

a2125ff7

f2fs: detect wrong layout · 2040fce8

由 Jaegeuk Kim 提交于 12月 05, 2016

Previous mkfs.f2fs allows small partition inappropriately, so f2fs should detect
that as well.

Refer this in f2fs-tools.

mkfs.f2fs: detect small partition by overprovision ratio and # of segments
Reported-and-Tested-by: NEric Biggers <ebiggers@google.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

2040fce8

06 12月, 2016 2 次提交

f2fs: call sync_fs when f2fs is idle · f455c8a5

由 Jaegeuk Kim 提交于 12月 05, 2016

The sync_fs in f2fs_balance_fs_bg must avoid interrupting current user requests.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

f455c8a5

Revert "f2fs: use percpu_counter for # of dirty pages in inode" · 204706c7

由 Jaegeuk Kim 提交于 12月 02, 2016

This reverts commit 1beba1b3.

The perpcu_counter doesn't provide atomicity in single core and consume more
DRAM. That incurs fs_mark test failure due to ENOMEM.

Cc: stable@vger.kernel.org # 4.7+
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

204706c7

30 11月, 2016 2 次提交

f2fs: return AOP_WRITEPAGE_ACTIVATE for writepage · 0002b61b

由 Chao Yu 提交于 11月 28, 2016

We should use AOP_WRITEPAGE_ACTIVATE when we bypass writing pages.
Signed-off-by: NChao Yu <yuchao0@huawei.com>
Signed-off-by: NMiao Xie <miaoxie@huawei.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

0002b61b

f2fs: do not activate auto_recovery for fallocated i_size · 26787236

由 Jaegeuk Kim 提交于 11月 28, 2016

If a file needs to keep its i_size by fallocate, we need to turn off auto
recovery during roll-forward recovery.

This will resolve the below scenario.

1. xfs_io -f /mnt/f2fs/file -c "pwrite 0 4096" -c "fsync"
2. xfs_io -f /mnt/f2fs/file -c "falloc -k 4096 4096" -c "fsync"
3. md5sum /mnt/f2fs/file;
4. godown /mnt/f2fs/
5. umount /mnt/f2fs/
6. mount -t f2fs /dev/sdx /mnt/f2fs
7. md5sum /mnt/f2fs/file
Reported-by: NChao Yu <chao@kernel.org>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

26787236

29 11月, 2016 1 次提交

f2fs: fix to determine start_cp_addr by sbi->cur_cp_pack · 8508e44a

由 Jaegeuk Kim 提交于 11月 24, 2016

We don't guarantee cp_addr is fixed by cp_version.
This is to sync with f2fs-tools.

Cc: stable@vger.kernel.org
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

8508e44a

26 11月, 2016 15 次提交

f2fs: fix 32-bit build · 19c52651

由 Arnd Bergmann 提交于 11月 22, 2016

The addition of multiple-device support broke CONFIG_BLK_DEV_ZONED
on 32-bit machines because of a 64-bit division:

fs/f2fs/f2fs.o: In function `__issue_discard_async':
extent_cache.c:(.text.__issue_discard_async+0xd4): undefined reference to `__aeabi_uldivmod'

Fortunately, bdev_zone_size() is guaranteed to return a power-of-two
number, so we can replace the % operator with a cheaper bit mask.

Fixes: 792b84b74b54 ("f2fs: support multiple devices")
Signed-off-by: NArnd Bergmann <arnd@arndb.de>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

19c52651

f2fs: set ->owner for debugfs status file's file_operations · 05e6ea26

由 Nicolai Stange 提交于 11月 20, 2016

The struct file_operations instance serving the f2fs/status debugfs file
lacks an initialization of its ->owner.

This means that although that file might have been opened, the f2fs module
can still get removed. Any further operation on that opened file, releasing
included,  will cause accesses to unmapped memory.

Indeed, Mike Marshall reported the following:

  BUG: unable to handle kernel paging request at ffffffffa0307430
  IP: [<ffffffff8132a224>] full_proxy_release+0x24/0x90
  <...>
  Call Trace:
   [] __fput+0xdf/0x1d0
   [] ____fput+0xe/0x10
   [] task_work_run+0x8e/0xc0
   [] do_exit+0x2ae/0xae0
   [] ? __audit_syscall_entry+0xae/0x100
   [] ? syscall_trace_enter+0x1ca/0x310
   [] do_group_exit+0x44/0xc0
   [] SyS_exit_group+0x14/0x20
   [] do_syscall_64+0x61/0x150
   [] entry_SYSCALL64_slow_path+0x25/0x25
  <...>
  ---[ end trace f22ae883fa3ea6b8 ]---
  Fixing recursive fault but reboot is needed!

Fix this by initializing the f2fs/status file_operations' ->owner with
THIS_MODULE.

This will allow debugfs to grab a reference to the f2fs module upon any
open on that file, thus preventing it from getting removed.

Fixes: 902829aa ("f2fs: move proc files to debugfs")
Reported-by: NMike Marshall <hubcap@omnibond.com>
Reported-by: NMartin Brandenburg <martin@omnibond.com>
Cc: stable@vger.kernel.org
Signed-off-by: NNicolai Stange <nicstange@gmail.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

05e6ea26

f2fs: fix incorrect free inode count in ->statfs · b08b12d2

由 Chao Yu 提交于 11月 18, 2016

While calculating inode count that we can create at most in the left space,
we should consider space which data/node blocks occupied, since we create
data/node mixly in main area. So fix the wrong calculation in ->statfs.
Signed-off-by: NChao Yu <yuchao0@huawei.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

b08b12d2

f2fs: drop duplicate header timer.h · b4ceec29

由 Geliang Tang 提交于 11月 18, 2016

Drop duplicate header timer.h from segment.c.
Signed-off-by: NGeliang Tang <geliangtang@gmail.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

b4ceec29

f2fs: fix wrong AUTO_RECOVER condition · 97dd26ad

由 Jaegeuk Kim 提交于 11月 16, 2016

If i_size is not aligned to the f2fs's block size, we should not skip inode
update during fsync.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

97dd26ad

f2fs: do not recover i_size if it's valid · 3a3a5ead

由 Jaegeuk Kim 提交于 11月 16, 2016

If i_size is already valid during roll_forward recovery, we should not update
it according to the block alignment.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

3a3a5ead

f2fs: fix fdatasync · 281518c6

由 Chao Yu 提交于 11月 17, 2016

For below two cases, we can't guarantee data consistence:

a)
1. xfs_io "pwrite 0 4195328" "fsync"
2. xfs_io "pwrite 4195328 1024" "fdatasync"
3. godown
4. umount & mount
--> isize we updated before fdatasync won't be recovered

b)
1. xfs_io "pwrite -S 0xcc 0 4202496" "fsync"
2. xfs_io "fpunch 4194304 4096" "fdatasync"
3. godown
4. umount & mount
--> dnode we punched before fdatasync won't be recovered

The reason is that normally fdatasync won't be aware of modification
of metadata in file, e.g. isize changing, dnode updating, so in ->fsync
we will skip flushing node pages for above cases, result in making
fdatasynced file being lost during recovery.

Currently we have introduced DIRTY_META global list in sbi for tracking
dirty inode selectively, so in fdatasync we can choose to flush nodes
depend on dirty state of current inode in the list.
Signed-off-by: NChao Yu <yuchao0@huawei.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

281518c6

f2fs: fix to account total free nid correctly · 04d47e67

由 Chao Yu 提交于 11月 17, 2016

Thread A		Thread B		Thread C
- f2fs_create
 - f2fs_new_inode
  - f2fs_lock_op
   - alloc_nid
    alloc last nid
  - f2fs_unlock_op
			- f2fs_create
			 - f2fs_new_inode
			  - f2fs_lock_op
			   - alloc_nid
			    as node count still not
			    be increased, we will
			    loop in alloc_nid
						- f2fs_write_node_pages
						 - f2fs_balance_fs_bg
						  - f2fs_sync_fs
						   - write_checkpoint
						    - block_operations
						     - f2fs_lock_all
 - f2fs_lock_op

While creating new inode, we do not allocate and account nid atomically,
so that when there is almost no free nids left, we may encounter deadloop
like above stack.

In order to avoid that, reuse nm_i::available_nids for accounting free nids
and make nid allocation and counting being atomical during node creation.
Signed-off-by: NChao Yu <yuchao0@huawei.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

04d47e67

f2fs: fix an infinite loop when flush nodes in cp · d40a43af

由 Yunlei He 提交于 11月 16, 2016

Thread A			Thread B

- write_checkpoint
 - block_operations
   -blk_start_plug
    -sync_node_pages		- f2fs_do_sync_file
				 - fsync_node_pages
				  - f2fs_wait_on_page_writeback

Thread A wait for global F2FS_DIRTY_NODES decreased to zero,
it start a plug list, some requests have been added to this list.
Thread B lock one dirty node page, and wait this page write back.
But this page has been in plug list of thread A with PG_writeback flag.
Thread A keep on running and its plug list has no chance to finish,
so it seems a deadlock between cp and fsync path.

This patch add a wait on page write back before set node page dirty
to avoid this problem.
Signed-off-by: NYunlei He <heyunlei@huawei.com>
Signed-off-by: NPengyang Hou <houpengyang@huawei.com>
Reviewed-by: NChao Yu <yuchao0@huawei.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

d40a43af

f2fs: don't wait writeback for datas during checkpoint · 36951b38

由 Chao Yu 提交于 11月 16, 2016

Normally, while committing checkpoint, we will wait on all pages to be
writebacked no matter the page is data or metadata, so in scenario where
there are lots of data IO being submitted with metadata, we may suffer
long latency for waiting writeback during checkpoint.

Indeed, we only care about persistence for pages with metadata, but not
pages with data, as file system consistent are only related to metadate,
so in order to avoid encountering long latency in above scenario, let's
recognize and reference metadata in submitted IOs, wait writeback only
for metadatas.
Signed-off-by: NChao Yu <yuchao0@huawei.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

36951b38

f2fs: fix wrong written_valid_blocks counting · c79b7ff1

由 Jaegeuk Kim 提交于 11月 14, 2016

Previously, written_valid_blocks was got by ckpt->valid_block_count. But if
the last checkpoint has some NEW_ADDR due to power-cut, we can get wrong value.
Fix it to get the number from actual written block count from sit entries.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

c79b7ff1

f2fs: avoid BG_GC in f2fs_balance_fs · 7702bdbe

由 Jaegeuk Kim 提交于 11月 14, 2016

If many threads hit has_not_enough_free_secs() in f2fs_balance_fs() at the same
time, all the threads would do FG_GC or BG_GC.
In this critical path, we totally don't need to do BG_GC at all.
Let's avoid that.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

7702bdbe

f2fs: fix redundant block allocation · c040ff9d

由 Jaegeuk Kim 提交于 11月 11, 2016

In direct_IO path of f2fs_file_write_iter(),
1. f2fs_preallocate_blocks(F2FS_GET_BLOCK_PRE_DIO)
   -> allocate LBA X
2. f2fs_direct_IO()
   -> return 0;

Then,
f2fs_write_data_page() will allocate another LBA X+1.

This makes EIO triggered by HM-SMR.
Reviewed-by: NChao Yu <yuchao0@huawei.com>
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

c040ff9d

J
f2fs: use err for f2fs_preallocate_blocks · a7de6086
由 Jaegeuk Kim 提交于 11月 11, 2016
```
This patch has no functional change.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>
```
a7de6086

f2fs: support multiple devices · 3c62be17

由 Jaegeuk Kim 提交于 10月 06, 2016

This patch implements multiple devices support for f2fs.
Given multiple devices by mkfs.f2fs, f2fs shows them entirely as one big
volume under one f2fs instance.

Internal block management is very simple, but we will modify block allocation
and background GC policy to boost IO speed by exploiting them accoording to
each device speed.
Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>

3c62be17

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功