1. 24 8月, 2017 1 次提交
    • C
      block: replace bi_bdev with a gendisk pointer and partitions index · 74d46992
      Christoph Hellwig 提交于
      This way we don't need a block_device structure to submit I/O.  The
      block_device has different life time rules from the gendisk and
      request_queue and is usually only available when the block device node
      is open.  Other callers need to explicitly create one (e.g. the lightnvm
      passthrough code, or the new nvme multipathing code).
      
      For the actual I/O path all that we need is the gendisk, which exists
      once per block device.  But given that the block layer also does
      partition remapping we additionally need a partition index, which is
      used for said remapping in generic_make_request.
      
      Note that all the block drivers generally want request_queue or
      sometimes the gendisk, so this removes a layer of indirection all
      over the stack.
      Signed-off-by: NChristoph Hellwig <hch@lst.de>
      Signed-off-by: NJens Axboe <axboe@kernel.dk>
      74d46992
  2. 09 7月, 2017 1 次提交
    • C
      f2fs: support plain user/group quota · 0abd675e
      Chao Yu 提交于
      This patch adds to support plain user/group quota.
      
      Change Note by Jaegeuk Kim.
      
      - Use f2fs page cache for quota files in order to consider garbage collection.
        so, quota files are not tolerable for sudden power-cuts, so user needs to do
        quotacheck.
      
      - setattr() calls dquot_transfer which will transfer inode->i_blocks.
        We can't reclaim that during f2fs_evict_inode(). So, we need to count
        node blocks as well in order to match i_blocks with dquot's space.
      
        Note that, Chao wrote a patch to count inode->i_blocks without inode block.
        (f2fs: don't count inode block in in-memory inode.i_blocks)
      
      - in f2fs_remount, we need to make RW in prior to dquot_resume.
      
      - handle fault_injection case during f2fs_quota_off_umount
      
      - TODO: Project quota
      Signed-off-by: NChao Yu <yuchao0@huawei.com>
      Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>
      0abd675e
  3. 08 7月, 2017 3 次提交
  4. 04 7月, 2017 1 次提交
    • Q
      f2fs: dax: fix races between page faults and truncating pages · 5a3a2d83
      Qiuyang Sun 提交于
      Currently in F2FS, page faults and operations that truncate the pagecahe
      or data blocks, are completely unsynchronized. This can result in page
      fault faulting in a page into a range that we are changing after
      truncating, and thus we can end up with a page mapped to disk blocks that
      will be shortly freed. Filesystem corruption will shortly follow.
      
      This patch fixes the problem by creating new rw semaphore i_mmap_sem in
      f2fs_inode_info and grab it for functions removing blocks from extent tree
      and for read over page faults. The mechanism is similar to that in ext4.
      Signed-off-by: NQiuyang Sun <sunqiuyang@huawei.com>
      Reviewed-by: NChao Yu <yuchao0@huawei.com>
      Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>
      5a3a2d83
  5. 09 6月, 2017 1 次提交
  6. 24 5月, 2017 4 次提交
  7. 04 5月, 2017 2 次提交
  8. 03 5月, 2017 3 次提交
  9. 25 4月, 2017 1 次提交
    • J
      f2fs: fix out-of free segments · a7881893
      Jaegeuk Kim 提交于
      This patch also reverts d0db7703 ("f2fs: do SSR in higher priority").
      
      This patch fixes out of free segments caused by many small file creation by
      1) mkfs -s 1 2G
      2) mount
      3) untar
       - preoduce 60000 small files burstly
      4) sync
       - flush node pages
       - flush imeta
      
      Here, when we do f2fs_balance_fs, we missed # of imeta blocks, resulting in
      skipping to check has_not_enough_free_secs.
      
      Another test is done by
      1) mkfs -s 12 2G
      2) mount
      3) untar
       - preoduce 60000 small files burstly
      4) sync
       - flush node pages
       - flush imeta
      
      In this case, this patch also fixes wrong block allocation under large section
      size.
      Reported-by: NWilliam Brana <wbrana@gmail.com>
      Cc: <stable@vger.kernel.org>
      Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>
      a7881893
  10. 20 4月, 2017 2 次提交
    • H
      f2fs: introduce async IPU policy · 04485987
      Hou Pengyang 提交于
      This patch introduces an ASYNC IPU policy.
      
      Under senario of large # of async updating(e.g. log writing in Android),
      disk would be seriously fragmented, and higher frequent gc would be triggered.
      
      This patch uses IPU to rewrite the async update writting, since async is
      NOT sensitive to io latency.
      Signed-off-by: NHou Pengyang <houpengyang@huawei.com>
      04485987
    • C
      f2fs: unlock cp_rwsem early for IPU writes · 001c584c
      Chao Yu 提交于
      For IPU writes, there won't be any udpates in dnode page since we
      will reuse old block address instead of allocating new one, so we
      don't need to lock cp_rwsem during IPU IO submitting.
      Signed-off-by: NChao Yu <yuchao0@huawei.com>
      001c584c
  11. 11 4月, 2017 1 次提交
  12. 06 4月, 2017 3 次提交
  13. 25 3月, 2017 1 次提交
  14. 22 3月, 2017 2 次提交
    • K
      f2fs: fix bad prefetchw of NULL page · a83d50bc
      Kinglong Mee 提交于
      For f2fs_read_data_pages, the f2fs_mpage_readpages gets "page == NULL",
      so that, the prefetchw(&page->flags) is operated on NULL.
      
      Fixes: f1e88660 ("f2fs: expose f2fs_mpage_readpages")
      Signed-off-by: NKinglong Mee <kinglongmee@gmail.com>
      Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>
      a83d50bc
    • J
      f2fs: fix stale ATOMIC_WRITTEN_PAGE private pointer · 8c242db9
      Jaegeuk Kim 提交于
      When I forced to enable atomic operations intentionally, I could hit the below
      panic, since we didn't clear page->private in f2fs_invalidate_page called by
      file truncation.
      
      The panic occurs due to NULL mapping having page->private.
      
      BUG: unable to handle kernel paging request at ffffffffffffffff
      IP: drop_buffers+0x38/0xe0
      PGD 5d00c067
      PUD 5d00e067
      PMD 0
      CPU: 3 PID: 1648 Comm: fsstress Tainted: G      D    OE   4.10.0+ #5
      Hardware name: innotek GmbH VirtualBox/VirtualBox, BIOS VirtualBox 12/01/2006
      task: ffff9151952863c0 task.stack: ffffaaec40db4000
      RIP: 0010:drop_buffers+0x38/0xe0
      RSP: 0018:ffffaaec40db74c8 EFLAGS: 00010292
      Call Trace:
       ? page_referenced+0x8b/0x170
       try_to_free_buffers+0xc5/0xe0
       try_to_release_page+0x49/0x50
       shrink_page_list+0x8bc/0x9f0
       shrink_inactive_list+0x1dd/0x500
       ? shrink_active_list+0x2c0/0x430
       shrink_node_memcg+0x5eb/0x7c0
       shrink_node+0xe1/0x320
       do_try_to_free_pages+0xef/0x2e0
       try_to_free_pages+0xe9/0x190
       __alloc_pages_slowpath+0x390/0xe70
       __alloc_pages_nodemask+0x291/0x2b0
       alloc_pages_current+0x95/0x140
       __page_cache_alloc+0xc4/0xe0
       pagecache_get_page+0xab/0x2a0
       grab_cache_page_write_begin+0x20/0x40
       get_read_data_page+0x2e6/0x4c0 [f2fs]
       ? f2fs_mark_inode_dirty_sync+0x16/0x30 [f2fs]
       ? truncate_data_blocks_range+0x238/0x2b0 [f2fs]
       get_lock_data_page+0x30/0x190 [f2fs]
       __exchange_data_block+0xaaf/0xf40 [f2fs]
       f2fs_fallocate+0x418/0xd00 [f2fs]
       vfs_fallocate+0x157/0x220
       SyS_fallocate+0x48/0x80
      Signed-off-by: NYunlei He <heyunlei@huawei.com>
      Signed-off-by: NChao Yu <yuchao0@huawei.com>
      [Chao Yu: use INMEM_INVALIDATE for better tracing]
      Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>
      8c242db9
  15. 02 3月, 2017 1 次提交
  16. 28 2月, 2017 5 次提交
  17. 24 2月, 2017 5 次提交
  18. 23 2月, 2017 3 次提交
    • W
      f2fs: fix a dead loop in f2fs_fiemap() · b86e3307
      Wei Fang 提交于
      A dead loop can be triggered in f2fs_fiemap() using the test case
      as below:
      
      	...
      	fd = open();
      	fallocate(fd, 0, 0, 4294967296);
      	ioctl(fd, FS_IOC_FIEMAP, fiemap_buf);
      	...
      
      It's caused by an overflow in __get_data_block():
      	...
      	bh->b_size = map.m_len << inode->i_blkbits;
      	...
      map.m_len is an unsigned int, and bh->b_size is a size_t which is 64 bits
      on 64 bits archtecture, type conversion from an unsigned int to a size_t
      will result in an overflow.
      
      In the above-mentioned case, bh->b_size will be zero, and f2fs_fiemap()
      will call get_data_block() at block 0 again an again.
      
      Fix this by adding a force conversion before left shift.
      Signed-off-by: NWei Fang <fangwei1@huawei.com>
      Acked-by: NChao Yu <yuchao0@huawei.com>
      Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>
      b86e3307
    • J
      f2fs: do not preallocate blocks which has wrong buffer · dc91de78
      Jaegeuk Kim 提交于
      Sheng Yong reports needless preallocation if write(small_buffer, large_size)
      is called.
      
      In that case, f2fs preallocates large_size, but vfs returns early due to
      small_buffer size. Let's detect it before preallocation phase in f2fs.
      Reported-by: NSheng Yong <shengyong1@huawei.com>
      Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>
      dc91de78
    • C
      f2fs: introduce FI_ATOMIC_COMMIT · 5fe45743
      Chao Yu 提交于
      This patch introduces a new flag to indicate inode status of doing atomic
      write committing, so that, we can keep atomic write status for inode
      during atomic committing, then we can skip GCing pages of atomic write inode,
      that avoids random GCed datas being mixed with current transaction, so
      isolation of transaction can be kept.
      Signed-off-by: NChao Yu <yuchao0@huawei.com>
      Signed-off-by: NJaegeuk Kim <jaegeuk@kernel.org>
      5fe45743