提交 · 8605cf0e852af3b2c771c18417499dc4ceed03d5 · openeuler / Kernel

23 3月, 2020 1 次提交

NFS: direct.c: Fix memory leak of dreq when nfs_get_lock_context fails · 8605cf0e

由 Misono Tomohiro 提交于 8月 28, 2019

When dreq is allocated by nfs_direct_req_alloc(), dreq->kref is
initialized to 2. Therefore we need to call nfs_direct_req_release()
twice to release the allocated dreq. Usually it is called in
nfs_file_direct_{read, write}() and nfs_direct_complete().

However, current code only calls nfs_direct_req_relese() once if
nfs_get_lock_context() fails in nfs_file_direct_{read, write}().
So, that case would result in memory leak.

Fix this by adding the missing call.
Signed-off-by: NMisono Tomohiro <misono.tomohiro@jp.fujitsu.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

8605cf0e

18 3月, 2020 1 次提交
- T
  nfs: Fix up documentation in nfs_follow_referral() and nfs_do_submount() · 3cab1854
  由 Trond Myklebust 提交于 3月 16, 2020
```
Fallout from the mount patches.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
```
  3cab1854
16 3月, 2020 22 次提交

nfs: Replace zero-length array with flexible-array member · 5601cda8

由 Gustavo A. R. Silva 提交于 3月 09, 2020

The current codebase makes use of the zero-length array language
extension to the C90 standard, but the preferred mechanism to declare
variable-length types such as these ones is a flexible array member[1][2],
introduced in C99:

struct foo {
        int stuff;
        struct boo array[];
};

By making use of the mechanism above, we will get a compiler warning
in case the flexible array does not occur last in the structure, which
will help us prevent some kind of undefined behavior bugs from being
inadvertently introduced[3] to the codebase from now on.

Also, notice that, dynamic memory allocations won't be affected by
this change:

"Flexible array members have incomplete type, and so the sizeof operator
may not be applied. As a quirk of the original implementation of
zero-length arrays, sizeof evaluates to zero."[1]

This issue was found with the help of Coccinelle.

[1] https://gcc.gnu.org/onlinedocs/gcc/Zero-Length.html
[2] https://github.com/KSPP/linux/issues/21
[3] commit 76497732 ("cxgb3/l2t: Fix undefined behaviour")
Signed-off-by: NGustavo A. R. Silva <gustavo@embeddedor.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

5601cda8

NFSv4.2: error out when relink swapfile · f5fdf124

由 Murphy Zhou 提交于 2月 14, 2020

This fixes xfstests generic/356 failure on NFSv4.2.
Signed-off-by: NMurphy Zhou <jencce.kernel@gmail.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

f5fdf124

NFS:remove redundant call to nfs_do_access · eb095c14

由 Zhouyi Zhou 提交于 3月 06, 2020

In function nfs_permission:
1. the rcu_read_lock and rcu_read_unlock around nfs_do_access
is unnecessary because the rcu critical data structure is already
protected in subsidiary function nfs_access_get_cached_rcu. No other
data structure needs rcu_read_lock in nfs_do_access.

2. call nfs_do_access once is enough, because:
2-1. when mask has MAY_NOT_BLOCK bit
The second call to nfs_do_access will not happen.

2-2. when mask has no MAY_NOT_BLOCK bit
The second call to nfs_do_access will happen if res == -ECHILD, which
means the first nfs_do_access goes out after statement if (!may_block).
The second call to nfs_do_access will go through this procedure once
again except continue the work after if (!may_block).
But above work can be performed by only one call to nfs_do_access
without mangling the mask flag.

Tested in x86_64
Signed-off-by: NZhouyi Zhou <zhouzhouyi@gmail.com>
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

eb095c14

NFSv4: Add support for CB_RECALL_ANY for flexfiles layouts · b5fdf841

由 Trond Myklebust 提交于 2月 18, 2020

When we receive a CB_RECALL_ANY that asks us to return flexfiles
layouts, we iterate through all the layouts and look at whether or
not there are active open file descriptors that might need them
for I/O. If there are no such descriptors, we return the layouts.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

b5fdf841

NFSv4: Clean up nfs_delegation_reap_expired() · 7f156ef0

由 Trond Myklebust 提交于 2月 27, 2020

Convert to use nfs_client_for_each_server() for efficiency.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

7f156ef0

NFSv4: Clean up nfs_delegation_reap_unclaimed() · 1bba38b2

由 Trond Myklebust 提交于 2月 27, 2020

Convert nfs_delegation_reap_unclaimed() to use nfs_client_for_each_server()
for efficiency.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

1bba38b2

NFSv4: Clean up nfs_client_return_marked_delegations() · af3b61bf

由 Trond Myklebust 提交于 2月 27, 2020

Convert it to use the nfs_client_for_each_server() helper, and
make it more efficient by skipping delegations for inodes we
know are in the process of being freed. Also improve the efficiency
of the cursor by skipping delegations that are being freed.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

af3b61bf

NFS: Add a helper nfs_client_for_each_server() · 3c9e502b

由 Trond Myklebust 提交于 2月 26, 2020

Add a helper nfs_client_for_each_server() to iterate through all the
filesystems that are attached to a struct nfs_client, and apply
a function to all the active ones.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

3c9e502b

NFSv4/pnfs: Clean up nfs_layout_find_inode() · 58ac3e59

由 Trond Myklebust 提交于 2月 27, 2020

Now that we can rely on just the rcu_read_lock(), remove the
clp->cl_lock and clean up.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

58ac3e59

T
NFSv4: Ensure layout headers are RCU safe · cf6605d1
由 Trond Myklebust 提交于 2月 18, 2020
```
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>
```
cf6605d1

NFSv4/pnfs: Return valid stateids in nfs_layout_find_inode_by_stateid() · d911c57a

由 Trond Myklebust 提交于 2月 27, 2020

Make sure to test the stateid for validity so that we catch instances
where the server may have been reusing stateids in
nfs_layout_find_inode_by_stateid().

Fixes: 7b410d9c ("pNFS: Delay getting the layout header in CB_LAYOUTRECALL handlers")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

d911c57a

pNFS/flexfiles: Report DELAY and GRACE errors from the DS to the server · 194a0dc8

由 Trond Myklebust 提交于 2月 10, 2020

Ensure that if the DS is returning too many DELAY and GRACE errors, we
also report that to the MDS through the layouterror mechanism.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

194a0dc8

NFS: Limit the size of the access cache by default · a8b373ee

由 Trond Myklebust 提交于 2月 08, 2020

Currently, we have no real limit on the access cache size (we set it
to ULONG_MAX). That can lead to credentials getting pinned for a
very long time on lots of files if you have a system with a lot of
memory.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

a8b373ee

NFS: Avoid referencing the cred twice in async rename/unlink · 49cd3254

由 Trond Myklebust 提交于 2月 07, 2020

In both async rename and rename, we take a reference to the
cred in the call arguments.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

49cd3254

NFSv4: Avoid unnecessary credential references in layoutget · 63ec2b69

由 Trond Myklebust 提交于 2月 07, 2020

Layoutget is just using the credential attached to the open context.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

63ec2b69

NFSv4: Avoid referencing the cred unnecessarily during NFSv4 I/O · 61296507

由 Trond Myklebust 提交于 2月 07, 2020

Avoid unnecessary references to the cred when we have already referenced
it through the open context or the open owner.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

61296507

NFS: Assume cred is pinned by open context in I/O requests · 542b994b

由 Trond Myklebust 提交于 2月 07, 2020

In read/write/commit, we should be able to assume that the cred is
pinned by the open context.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

542b994b

NFS: alloc_nfs_open_context() must use the file cred when available · 1d179d6b

由 Trond Myklebust 提交于 2月 07, 2020

If we're creating a nfs_open_context() for a specific file pointer,
we must use the cred assigned to that file.

Fixes: a52458b4 ("NFS/NFSD/SUNRPC: replace generic creds with 'struct cred'.")
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

1d179d6b

NFS: Ensure we time out if a delegreturn does not complete · 244fcd2f

由 Trond Myklebust 提交于 12月 20, 2019

We can't allow delegreturn to hold up nfs4_evict_inode() forever,
since that can cause the memory shrinkers to block. This patch
therefore ensures that we eventually time out, and complete the
reclaim of the inode.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

244fcd2f

NFSv4/pnfs: pnfs_set_layout_stateid() should update the layout cred · 59b56394

由 Trond Myklebust 提交于 1月 27, 2020

If the cred assigned to the layout that we're updating differs from
the one used to retrieve the new layout segment, then we need to
update the layout plh_lc_cred field.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

59b56394

NFSv4: nfs_update_inplace_delegation() should update delegation cred · 57f188e0

由 Trond Myklebust 提交于 1月 27, 2020

If the cred assigned to the delegation that we're updating differs
from the one we're updating too, then we need to update that field
too.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

57f188e0

NFS: Use the 64-bit server readdir cookies when possible · 59e356a9

由 Trond Myklebust 提交于 2月 03, 2020

When we're running as a 64-bit architecture and are not running in
32-bit compatibility mode, it is better to use the 64-bit readdir
cookies that supplied by the server. Doing so improves the accuracy
of telldir()/seekdir(), particularly when the directory is changing,
for instance, when doing 'rm -rf'.

We still fall back to using the 32-bit offsets on 32-bit architectures
and when in compatibility mode.
Signed-off-by: NTrond Myklebust <trond.myklebust@hammerspace.com>

59e356a9

14 3月, 2020 1 次提交

afs: Use kfree_rcu() instead of casting kfree() to rcu_callback_t · ddd2b85f

由 Jann Horn 提交于 3月 12, 2020

afs_put_addrlist() casts kfree() to rcu_callback_t. Apart from being wrong
in theory, this might also blow up when people start enforcing function
types via compiler instrumentation, and it means the rcu_head has to be
first in struct afs_addr_list.

Use kfree_rcu() instead, it's simpler and more correct.
Signed-off-by: NJann Horn <jannh@google.com>
Signed-off-by: NDavid Howells <dhowells@redhat.com>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

ddd2b85f

13 3月, 2020 4 次提交

ovl: fix lockdep warning for async write · c8536804

由 Miklos Szeredi 提交于 3月 13, 2020

Lockdep reports "WARNING: lock held when returning to user space!" due to
async write holding freeze lock over the write.  Apparently aio.c already
deals with this by lying to lockdep about the state of the lock.

Do the same here.  No need to check for S_IFREG() here since these file ops
are regular-only.

Reported-by: syzbot+9331a354f4f624a52a55@syzkaller.appspotmail.com
Fixes: 2406a307 ("ovl: implement async IO routines")
Signed-off-by: NMiklos Szeredi <mszeredi@redhat.com>

c8536804

ovl: fix some xino configurations · 53afcd31

由 Amir Goldstein 提交于 2月 21, 2020

Fix up two bugs in the coversion to xino_mode:
1. xino=off does not always end up in disabled mode
2. xino=auto on 32bit arch should end up in disabled mode

Take a proactive approach to disabling xino on 32bit kernel:
1. Disable XINO_AUTO config during build time
2. Disable xino with a warning on mount time

As a by product, xino=on on 32bit arch also ends up in disabled mode.
We never intended to enable xino on 32bit arch and this will make the
rest of the logic simpler.

Fixes: 0f831ec8 ("ovl: simplify ovl_same_sb() helper")
Signed-off-by: NAmir Goldstein <amir73il@gmail.com>
Signed-off-by: NMiklos Szeredi <mszeredi@redhat.com>

53afcd31

cifs_atomic_open(): fix double-put on late allocation failure · d9a9f484

由 Al Viro 提交于 3月 12, 2020

several iterations of ->atomic_open() calling conventions ago, we
used to need fput() if ->atomic_open() failed at some point after
successful finish_open().  Now (since 2016) it's not needed -
struct file carries enough state to make fput() work regardless
of the point in struct file lifecycle and discarding it on
failure exits in open() got unified.  Unfortunately, I'd missed
the fact that we had an instance of ->atomic_open() (cifs one)
that used to need that fput(), as well as the stale comment in
finish_open() demanding such late failure handling.  Trivially
fixed...

Fixes: fe9ec829 "do_last(): take fput() on error after opening to out:"
Cc: stable@kernel.org # v4.7+
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

d9a9f484

gfs2_atomic_open(): fix O_EXCL|O_CREAT handling on cold dcache · 21039132

由 Al Viro 提交于 3月 10, 2020

with the way fs/namei.c:do_last() had been done, ->atomic_open()
instances needed to recognize the case when existing file got
found with O_EXCL|O_CREAT, either by falling back to finish_no_open()
or failing themselves.  gfs2 one didn't.

Fixes: 6d4ade98 (GFS2: Add atomic_open support)
Cc: stable@kernel.org # v3.11
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

21039132

12 3月, 2020 1 次提交

ovl: fix lock in ovl_llseek() · 531d3040

由 Amir Goldstein 提交于 3月 02, 2020

ovl_inode_lock() is interruptible. When inode_lock() in ovl_llseek()
was replaced with ovl_inode_lock(), we did not add a check for error.

Fix this by making ovl_inode_lock() uninterruptible and change the
existing call sites to use an _interruptible variant.

Reported-by: syzbot+66a9752fa927f745385e@syzkaller.appspotmail.com
Fixes: b1f9d385 ("ovl: use ovl_inode_lock in ovl_llseek()")
Signed-off-by: NAmir Goldstein <amir73il@gmail.com>
Signed-off-by: NMiklos Szeredi <mszeredi@redhat.com>

531d3040

09 3月, 2020 1 次提交

io_uring: ensure RCU callback ordering with rcu_barrier() · 805b13ad

由 Jens Axboe 提交于 3月 08, 2020

After more careful studying, Paul informs me that we cannot rely on
ordering of RCU callbacks in the way that the the tagged commit did.
The current construct looks like this:

	void C(struct rcu_head *rhp)
	{
		do_something(rhp);
		call_rcu(&p->rh, B);
	}

	call_rcu(&p->rh, A);
	call_rcu(&p->rh, C);

and we're relying on ordering between A and B, which isn't guaranteed.
Make this explicit instead, and have a work item issue the rcu_barrier()
to ensure that A has run before we manually execute B.

While thorough testing never showed this issue, it's dependent on the
per-cpu load in terms of RCU callbacks. The updated method simplifies
the code as well, and eliminates the need to maintain an rcu_head in
the fileset data.

Fixes: c1e2148f ("io_uring: free fixed_file_data after RCU grace period")
Reported-by: NPaul E. McKenney <paulmck@kernel.org>
Signed-off-by: NJens Axboe <axboe@kernel.dk>

805b13ad

08 3月, 2020 1 次提交

fscrypt: don't evict dirty inodes after removing key · 2b4eae95

由 Eric Biggers 提交于 3月 05, 2020

After FS_IOC_REMOVE_ENCRYPTION_KEY removes a key, it syncs the
filesystem and tries to get and put all inodes that were unlocked by the
key so that unused inodes get evicted via fscrypt_drop_inode().
Normally, the inodes are all clean due to the sync.

However, after the filesystem is sync'ed, userspace can modify and close
one of the files. (Userspace is *supposed* to close the files before
removing the key. But it doesn't always happen, and the kernel can't
assume it.) This causes the inode to be dirtied and have i_count == 0.
Then, fscrypt_drop_inode() failed to consider this case and indicated
that the inode can be dropped, causing the write to be lost.

On f2fs, other problems such as a filesystem freeze could occur due to
the inode being freed while still on f2fs's dirty inode list.

Fix this bug by making fscrypt_drop_inode() only drop clean inodes.

I've written an xfstest which detects this bug on ext4, f2fs, and ubifs.

Fixes: b1c0ec35 ("fscrypt: add FS_IOC_REMOVE_ENCRYPTION_KEY ioctl")
Cc: <stable@vger.kernel.org> # v5.4+
Link: https://lore.kernel.org/r/20200305084138.653498-1-ebiggers@kernel.orgSigned-off-by: NEric Biggers <ebiggers@google.com>

2b4eae95

07 3月, 2020 3 次提交

io_uring: fix lockup with timeouts · f0e20b89

由 Pavel Begunkov 提交于 3月 07, 2020

There is a recipe to deadlock the kernel: submit a timeout sqe with a
linked_timeout (e.g.  test_single_link_timeout_ception() from liburing),
and SIGKILL the process.

Then, io_kill_timeouts() takes @ctx->completion_lock, but the timeout
isn't flagged with REQ_F_COMP_LOCKED, and will try to double grab it
during io_put_free() to cancel the linked timeout. Probably, the same
can happen with another io_kill_timeout() call site, that is
io_commit_cqring().
Signed-off-by: NPavel Begunkov <asml.silence@gmail.com>
Signed-off-by: NJens Axboe <axboe@kernel.dk>

f0e20b89

io_uring: free fixed_file_data after RCU grace period · c1e2148f

由 Jens Axboe 提交于 3月 04, 2020

The percpu refcount protects this structure, and we can have an atomic
switch in progress when exiting. This makes it unsafe to just free the
struct normally, and can trigger the following KASAN warning:

BUG: KASAN: use-after-free in percpu_ref_switch_to_atomic_rcu+0xfa/0x1b0
Read of size 1 at addr ffff888181a19a30 by task swapper/0/0

CPU: 0 PID: 0 Comm: swapper/0 Not tainted 5.6.0-rc4+ #5747
Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.10.2-1ubuntu1 04/01/2014
Call Trace:
 <IRQ>
 dump_stack+0x76/0xa0
 print_address_description.constprop.0+0x3b/0x60
 ? percpu_ref_switch_to_atomic_rcu+0xfa/0x1b0
 ? percpu_ref_switch_to_atomic_rcu+0xfa/0x1b0
 __kasan_report.cold+0x1a/0x3d
 ? percpu_ref_switch_to_atomic_rcu+0xfa/0x1b0
 percpu_ref_switch_to_atomic_rcu+0xfa/0x1b0
 rcu_core+0x370/0x830
 ? percpu_ref_exit+0x50/0x50
 ? rcu_note_context_switch+0x7b0/0x7b0
 ? run_rebalance_domains+0x11d/0x140
 __do_softirq+0x10a/0x3e9
 irq_exit+0xd5/0xe0
 smp_apic_timer_interrupt+0x86/0x200
 apic_timer_interrupt+0xf/0x20
 </IRQ>
RIP: 0010:default_idle+0x26/0x1f0

Fix this by punting the final exit and free of the struct to RCU, then
we know that it's safe to do so. Jann suggested the approach of using a
double rcu callback to achieve this. It's important that we do a nested
call_rcu() callback, as otherwise the free could be ordered before the
atomic switch, even if the latter was already queued.

Reported-by: syzbot+e017e49c39ab484ac87a@syzkaller.appspotmail.com
Suggested-by: NJann Horn <jannh@google.com>
Reviewed-by: NPaul E. McKenney <paulmck@kernel.org>
Signed-off-by: NJens Axboe <axboe@kernel.dk>

c1e2148f

locks: fix a potential use-after-free problem when wakeup a waiter · 6d390e4b

由 yangerkun 提交于 3月 04, 2020

'16306a61 ("fs/locks: always delete_block after waiting.")' add the
logic to check waiter->fl_blocker without blocked_lock_lock. And it will
trigger a UAF when we try to wakeup some waiter：

Thread 1 has create a write flock a on file, and now thread 2 try to
unlock and delete flock a, thread 3 try to add flock b on the same file.

Thread2                         Thread3
                                flock syscall(create flock b)
	                        ...flock_lock_inode_wait
				    flock_lock_inode(will insert
				    our fl_blocked_member list
				    to flock a's fl_blocked_requests)
				   sleep
flock syscall(unlock)
...flock_lock_inode_wait
    locks_delete_lock_ctx
    ...__locks_wake_up_blocks
        __locks_delete_blocks(
	b->fl_blocker = NULL)
	...
                                   break by a signal
				   locks_delete_block
				    b->fl_blocker == NULL &&
				    list_empty(&b->fl_blocked_requests)
	                            success, return directly
				 locks_free_lock b
	wake_up(&b->fl_waiter)
	trigger UAF

Fix it by remove this logic, and this patch may also fix CVE-2019-19769.

Cc: stable@vger.kernel.org
Fixes: 16306a61 ("fs/locks: always delete_block after waiting.")
Signed-off-by: Nyangerkun <yangerkun@huawei.com>
Signed-off-by: NJeff Layton <jlayton@kernel.org>

6d390e4b

06 3月, 2020 2 次提交

fat: fix uninit-memory access for partial initialized inode · bc87302a

由 OGAWA Hirofumi 提交于 3月 05, 2020

When get an error in the middle of reading an inode, some fields in the
inode might be still not initialized. And then the evict_inode path may
access those fields via iput().

To fix, this makes sure that inode fields are initialized.

Reported-by: syzbot+9d82b8de2992579da5d0@syzkaller.appspotmail.com
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NOGAWA Hirofumi <hirofumi@mail.parknet.co.jp>
Cc: <stable@vger.kernel.org>
Link: http://lkml.kernel.org/r/871rqnreqx.fsf@mail.parknet.co.jpSigned-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

bc87302a

futex: Fix inode life-time issue · 8019ad13

由 Peter Zijlstra 提交于 3月 04, 2020

As reported by Jann, ihold() does not in fact guarantee inode
persistence. And instead of making it so, replace the usage of inode
pointers with a per boot, machine wide, unique inode identifier.

This sequence number is global, but shared (file backed) futexes are
rare enough that this should not become a performance issue.
Reported-by: NJann Horn <jannh@google.com>
Suggested-by: NLinus Torvalds <torvalds@linux-foundation.org>
Signed-off-by: NPeter Zijlstra (Intel) <peterz@infradead.org>

8019ad13

03 3月, 2020 3 次提交

fcntl: Distribute switch variables for initialization · 0a68ff5e

由 Kees Cook 提交于 2月 19, 2020

Variables declared in a switch statement before any case statements
cannot be automatically initialized with compiler instrumentation (as
they are not part of any execution flow). With GCC's proposed automatic
stack variable initialization feature, this triggers a warning (and they
don't get initialized). Clang's automatic stack variable initialization
(via CONFIG_INIT_STACK_ALL=y) doesn't throw a warning, but it also
doesn't initialize such variables[1]. Note that these warnings (or silent
skipping) happen before the dead-store elimination optimization phase,
so even when the automatic initializations are later elided in favor of
direct initializations, the warnings remain.

To avoid these problems, move such variables into the "case" where
they're used or lift them up into the main function body.

fs/fcntl.c: In function ‘send_sigio_to_task’:
fs/fcntl.c:738:20: warning: statement will never be executed [-Wswitch-unreachable]
  738 |   kernel_siginfo_t si;
      |                    ^~

[1] https://bugs.llvm.org/show_bug.cgi?id=44916Signed-off-by: NKees Cook <keescook@chromium.org>
Signed-off-by: NJeff Layton <jlayton@kernel.org>

0a68ff5e

btrfs: fix RAID direct I/O reads with alternate csums · e7a04894

由 Omar Sandoval 提交于 3月 02, 2020

btrfs_lookup_and_bind_dio_csum() does pointer arithmetic which assumes
32-bit checksums. If using a larger checksum, this leads to spurious
failures when a direct I/O read crosses a stripe. This is easy
to reproduce:

  # mkfs.btrfs -f --checksum blake2 -d raid0 /dev/vdc /dev/vdd
  ...
  # mount /dev/vdc /mnt
  # cd /mnt
  # dd if=/dev/urandom of=foo bs=1M count=1 status=none
  # dd if=foo of=/dev/null bs=1M iflag=direct status=none
  dd: error reading 'foo': Input/output error
  # dmesg | tail -1
  [  135.821568] BTRFS warning (device vdc): csum failed root 5 ino 257 off 421888 ...

Fix it by using the actual checksum size.

Fixes: 1e25a2e3 ("btrfs: don't assume ordered sums to be 4 bytes")
CC: stable@vger.kernel.org # 5.4+
Reviewed-by: NJohannes Thumshirn <johannes.thumshirn@wdc.com>
Signed-off-by: NOmar Sandoval <osandov@fb.com>
Reviewed-by: NDavid Sterba <dsterba@suse.com>
Signed-off-by: NDavid Sterba <dsterba@suse.com>

e7a04894

io-wq: remove io_wq_flush and IO_WQ_WORK_INTERNAL · 80ad8943

由 Pavel Begunkov 提交于 3月 02, 2020

io_wq_flush() is buggy, during cancelation of a flush, the associated
work may be passed to the caller's (i.e. io_uring) @match callback. That
callback is expecting it to be embedded in struct io_kiocb. Cancelation
of internal work probably doesn't make a lot of sense to begin with.

As the flush helper is no longer used, just delete it and the associated
work flag.
Signed-off-by: NPavel Begunkov <asml.silence@gmail.com>
Signed-off-by: NJens Axboe <axboe@kernel.dk>

80ad8943

openeuler / Kernel 接近 2 年 前同步成功

openeuler / Kernel
接近 2 年前同步成功