- 13 April 2022, 2 commits
-
-
Submitted by Dylan Yudaken
Verify that the user does not pass in anything but 0 for this field.
Fixes: 992da01a ("io_uring: change registration/upd/rsrc tagging ABI")
Signed-off-by: Dylan Yudaken <dylany@fb.com>
Link: https://lore.kernel.org/r/20220412163042.2788062-3-dylany@fb.com
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
Submitted by Dylan Yudaken
Move validation to be more consistent: do it straight after copy_from_user. This is already done in io_register_rsrc_update, so that redundant check can be removed.
Signed-off-by: Dylan Yudaken <dylany@fb.com>
Link: https://lore.kernel.org/r/20220412163042.2788062-2-dylany@fb.com
Signed-off-by: Jens Axboe <axboe@kernel.dk>
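As an illustration of the pattern, here is a minimal sketch of copying a registration struct from userspace and rejecting non-zero reserved fields immediately afterwards. The struct and field names follow the io_uring uapi as I understand it; treat the exact layout as an assumption, not the patched code.

    #include <linux/uaccess.h>
    #include <uapi/linux/io_uring.h>

    /* Copy the registration argument and reject non-zero reserved fields
     * right away, before anything else looks at the struct. */
    static int copy_and_check_update(struct io_uring_rsrc_update2 *up,
                                     const void __user *arg, size_t size)
    {
            if (copy_from_user(up, arg, size))
                    return -EFAULT;
            if (up->resv || up->resv2)
                    return -EINVAL;
            return 0;
    }

Validating directly after the copy keeps the "reserved must be zero" rule in one obvious place instead of being scattered across callers.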
-
- 12 April 2022, 4 commits
-
-
Submitted by Pavel Begunkov
The io-wq work cancellation path can't take uring_lock the way file assignment does, so we have to handle IO_WQ_WORK_CANCEL first; this fixes the hangs that were encountered.
Fixes: 6bf9c47a ("io_uring: defer file assignment")
Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
Link: https://lore.kernel.org/r/0d9b9f37841645518503f6a207e509d14a286aba.1649773463.git.asml.silence@gmail.com
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
Submitted by Jens Axboe
There are two reasons why this isn't the best idea:
- It's an odd place to grab a bit of storage space from.
- It puts the 3rd io_kiocb cacheline into the hot path, where the normal hot path only needs the first two.
Use 'cflags' for joint fd/cflags storage. We only need fd until we successfully issue, and we only need cflags once a request is done and is completed.
Fixes: 6bf9c47a ("io_uring: defer file assignment")
Signed-off-by: Jens Axboe <axboe@kernel.dk>
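The underlying idea is ordinary field reuse across disjoint lifetime phases; below is a hedged sketch of that idea with invented field names, not the actual io_kiocb layout.

    struct request_like {
            /* ... hot submission/completion fields ... */
            union {
                    int fd;               /* only meaningful before the request is issued */
                    unsigned int cflags;  /* only meaningful once the request completes */
            };
    };

Because the two members are never live at the same time, the union costs nothing and avoids dragging an extra cacheline into the hot path just to park the fd.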
-
Submitted by Jens Axboe
In preparation for fixing a regression with pulling in an extra cacheline for IO that doesn't usually touch the last cacheline of the io_kiocb, move the cached location of apoll->events to space shared with some other completion data. Like cflags, this isn't used until after the request has been completed, so we can piggyback on top of comp_list.
Fixes: 81459350 ("io_uring: cache req->apoll->events in req->cflags")
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
Submitted by Jens Axboe
-1 tells us to use the current position, but we check if the file is a stream regardless of that. Fix up io_kiocb_update_pos() to only dip into the file if we need to. This is both more efficient and also drops 12 bytes of text on aarch64 and 64 bytes on x86-64.
Fixes: b4aec400 ("io_uring: do not recalculate ppos unnecessarily")
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
- 11 April 2022, 1 commit
-
-
Submitted by Jens Axboe
Give applications a way to tell if the kernel supports sane linked files, as in files being assigned at the right time to be able to reliably do <open file direct into slot X><read file from slot X> while using IOSQE_IO_LINK to order them. Not really a bug fix, but flag it as such so that it gets pulled in with backports of the deferred file assignment.
Fixes: 6bf9c47a ("io_uring: defer file assignment")
Signed-off-by: Jens Axboe <axboe@kernel.dk>
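On the application side, a capability like this is typically advertised through the features field of struct io_uring_params. A hedged liburing sketch follows; the exact flag name, assumed here to be IORING_FEAT_LINKED_FILE, is my reading of this change rather than something stated in the message.

    #include <liburing.h>
    #include <stdio.h>

    int main(void)
    {
            struct io_uring ring;
            struct io_uring_params p = { 0 };

            if (io_uring_queue_init_params(8, &ring, &p) < 0)
                    return 1;
    #ifdef IORING_FEAT_LINKED_FILE
            if (p.features & IORING_FEAT_LINKED_FILE)
                    printf("kernel defers file assignment for linked requests\n");
    #endif
            io_uring_queue_exit(&ring);
            return 0;
    }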
-
- 09 April 2022, 1 commit
-
-
Submitted by Jens Axboe
io_flush_timeouts() assumes the timeout isn't in the process of triggering or being removed/canceled, so it unconditionally removes it from the timeout list and attempts to cancel it. Leave it on the list and let the normal timeout cancelation take care of it.
Cc: stable@vger.kernel.org # 5.5+
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
- 08 April 2022, 9 commits
-
-
Submitted by Pavel Begunkov
There are still several places that use pre-array_index_nospec() indexes; fix them up.
Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
Link: https://lore.kernel.org/r/b01ef5ee83f72ed35ad525912370b729f5d145f4.1649336342.git.asml.silence@gmail.com
Signed-off-by: Jens Axboe <axboe@kernel.dk>
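For context, the general pattern is to clamp a user-controlled index with array_index_nospec() before it is used for any dereference, so even speculative execution cannot reach out of bounds. A hedged sketch of that pattern (the array and bound names are illustrative, not the actual io_uring fields):

    #include <linux/nospec.h>

    /* Clamp a user-supplied index before using it, so that speculative
     * execution cannot index past the end of the array. */
    static struct file *lookup_user_file(struct file **files, unsigned nr_files,
                                         unsigned idx)
    {
            if (idx >= nr_files)
                    return NULL;
            idx = array_index_nospec(idx, nr_files);
            return files[idx];
    }

The bug class this commit addresses is using the raw index again after the clamped copy was computed, which silently discards the protection.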
-
Submitted by Pavel Begunkov
Automatically default the rsrc tag in io_queue_rsrc_removal(); it's safer than leaving it there and relying on the rest of the code to behave and not use it.
Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
Link: https://lore.kernel.org/r/1cf262a50df17478ea25b22494dcc19f3a80301f.1649336342.git.asml.silence@gmail.com
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
Submitted by Pavel Begunkov
It's safer not to touch scm_fp_list after we have queued an skb to which it was assigned; there might be races lurking if we screw up subtle sync guarantees on the io_uring side.
Fixes: 6b06314c ("io_uring: add file set registration")
Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
Submitted by Pavel Begunkov
Don't forget to array_index_nospec() indexes before updating rsrc tags in __io_sqe_files_update(); just use the already safe and precalculated index @i.
Fixes: c3bdad02 ("io_uring: add generic rsrc update with tags")
Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
Submitted by Eugene Syromiatnikov
Similarly to the way it is done in the mbind syscall.
Cc: stable@vger.kernel.org # 5.14
Fixes: fe76421d ("io_uring: allow user configurable IO thread CPU affinity")
Signed-off-by: Eugene Syromiatnikov <esyr@redhat.com>
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
Submitted by Jens Axboe
This reverts commit adc8682e. There's some discussion about the API not being as good as it can be. Rather than ship something and be stuck with it forever, let's revert the NAPI support for now and work on getting something sorted out for the next kernel release instead.
Link: https://lore.kernel.org/io-uring/b7bbc124-8502-0ee9-d4c8-7c41b4487264@kernel.dk/
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
Submitted by Jens Axboe
io_uring tracks requests that reference an io_uring descriptor so it can cancel without worrying about loops in the references. Since we now assign the file at execution time, the easier approach is to drop a potentially problematic reference before we punt the request. This eliminates the need to special case these types of files beyond just marking them as such, and it simplifies cancelation quite a bit. It also fixes a recent issue where an async punted tee operation with the io_uring descriptor as the output file would crash when attempting to get a reference to the file from the io-wq worker. We could have worked around that, but this is the much cleaner fix.
Fixes: 6bf9c47a ("io_uring: defer file assignment")
Reported-by: syzbot+c4b9303500a21750b250@syzkaller.appspotmail.com
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
Submitted by Jens Axboe
If an application uses direct open or accept, it knows in advance what direct descriptor value it will get, as it picks it itself. This allows combined requests such as:

    sqe = io_uring_get_sqe(ring);
    io_uring_prep_openat_direct(sqe, ..., file_slot);
    sqe->flags |= IOSQE_IO_LINK | IOSQE_CQE_SKIP_SUCCESS;

    sqe = io_uring_get_sqe(ring);
    io_uring_prep_read(sqe, file_slot, buf, buf_size, 0);
    sqe->flags |= IOSQE_FIXED_FILE;

    io_uring_submit(ring);

where we prepare both a file open and a read, and only get a completion event for the read when both have completed successfully. Currently links are fully prepared before the head is issued, but that fails if the dependent link needs a file assigned that isn't valid until the head has completed. Conversely, if the same chain is performed but the fixed file slot is already valid, then we would unexpectedly return data from the old file slot rather than the newly opened one. Make sure we're consistent here. Allow deferral of file setup, which makes this documented case work.
Cc: stable@vger.kernel.org # v5.15+
Signed-off-by: Jens Axboe <axboe@kernel.dk>
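A self-contained version of that chain, including completion handling, might look like the sketch below. Error handling is trimmed, and the path, slot number, and buffer size are placeholders I chose for illustration.

    #include <liburing.h>
    #include <fcntl.h>

    int read_via_fixed_slot(struct io_uring *ring)
    {
            char buf[4096];
            unsigned file_slot = 0;   /* arbitrary registered-file slot */
            struct io_uring_sqe *sqe;
            struct io_uring_cqe *cqe;

            sqe = io_uring_get_sqe(ring);
            io_uring_prep_openat_direct(sqe, AT_FDCWD, "data.txt", O_RDONLY, 0, file_slot);
            sqe->flags |= IOSQE_IO_LINK | IOSQE_CQE_SKIP_SUCCESS;

            sqe = io_uring_get_sqe(ring);
            io_uring_prep_read(sqe, file_slot, buf, sizeof(buf), 0);
            sqe->flags |= IOSQE_FIXED_FILE;

            io_uring_submit(ring);

            /* Only the read posts a CQE on success, thanks to CQE_SKIP_SUCCESS. */
            io_uring_wait_cqe(ring, &cqe);
            int res = cqe->res;
            io_uring_cqe_seen(ring, cqe);
            return res;
    }

Note that the ring needs a registered file table before direct descriptors can be used, for example via io_uring_register_files_sparse() in newer liburing releases.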
-
Submitted by Jens Axboe
We'll need this in a future patch, when we could be assigning the file after the prep stage. While at it, get rid of the io_file_get() helper; it just makes the code harder to read.
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
- 05 April 2022, 2 commits
-
-
Submitted by Jens Axboe
In preparation for not necessarily having a file assigned at prep time, defer any initialization associated with the file to when the opcode handler is run.
Cc: stable@vger.kernel.org # v5.15+
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
Submitted by Jens Axboe
In preparation for not using the file at prep time, defer checking whether this file refers to a valid io_uring instance until issue time. This also means we can get rid of the cleanup flag for splice and tee.
Cc: stable@vger.kernel.org # v5.15+
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
- 04 April 2022, 1 commit
-
-
Submitted by Jens Axboe
This is a leftover from the really old days, when we weren't able to track and error out early if we needed a file and it wasn't assigned. Kill the check.
Cc: stable@vger.kernel.org # v5.15+
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
- 30 March 2022, 2 commits
-
-
Submitted by Jens Axboe
In preparation for not using the file at prep time, defer checking whether this file refers to a valid io_uring instance until issue time.
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
Submitted by Jens Axboe
We must always call req_set_fail() if the request is failed, otherwise we won't sever links for dependent chains correctly.
Fixes: 4f57f06c ("io_uring: add support for IORING_OP_MSG_RING command")
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
- 26 March 2022, 1 commit
-
-
Submitted by Pavel Begunkov
When there are no files for __io_sqe_files_scm() to process in the range, it frees everything and returns. However, it forgets to put the uid.
Fixes: 08a45173 ("io_uring: allow sparse fixed file sets")
Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
Link: https://lore.kernel.org/r/accee442376f33ce8aaebb099d04967533efde92.1648226048.git.asml.silence@gmail.com
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
- 25 March 2022, 6 commits
-
-
Submitted by Pavel Begunkov
io_put_kbuf_comp() should only be called while holding ->completion_lock; however, there is no such assumption in io_clean_op(), and thus it can corrupt ->io_buffer_comp. Take the lock there, and work around the only user of io_clean_op() that calls it with locks held. Not the prettiest solution, but it's easier to refactor it for-next.
Fixes: cc3cec83 ("io_uring: speedup provided buffer handling")
Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
Link: https://lore.kernel.org/r/743e2130b73ec6d48c4c5dd15db896c433431e6d.1648212967.git.asml.silence@gmail.com
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
Submitted by Pavel Begunkov
io_req_complete_failed() doesn't require callers to hold ->uring_lock, so use the IO_URING_F_UNLOCKED version of io_put_kbuf(). The only affected place is the fail path of io_apoll_task_func(). Also add a lockdep annotation to catch such bugs in the future.
Fixes: 3b2b78a8 ("io_uring: extend provided buf return to fails")
Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
Link: https://lore.kernel.org/r/ccf602dbf8df3b6a8552a262d8ee0a13a086fbc7.1648212967.git.asml.silence@gmail.com
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
Submitted by Pavel Begunkov
Move a misplaced comment about req->creds and add a line with the assumptions about req->link.
Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
Link: https://lore.kernel.org/r/1e51d1e6b1f3708c2d4127b4e371f9daa4c5f859.1648209006.git.asml.silence@gmail.com
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
Submitted by Dylan Yudaken
When polling sockets for accept, use EPOLLEXCLUSIVE. This is helpful when multiple accept SQEs are submitted. For O_NONBLOCK sockets, multiple queued SQEs would previously all have completed at once, most of them with -EAGAIN as the result. Now only one wakes up and completes. For sockets without O_NONBLOCK there is no user-facing change, but internally the extra requests would previously wake up with no connection waiting and be punted to a worker thread. Now they do not wake up unnecessarily.
Co-developed-by: Jens Axboe <axboe@kernel.dk>
Signed-off-by: Dylan Yudaken <dylany@fb.com>
Link: https://lore.kernel.org/r/20220325093755.4123343-1-dylany@fb.com
Signed-off-by: Jens Axboe <axboe@kernel.dk>
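The scenario being optimized looks roughly like the sketch below: several accept requests queued against one listening socket, each intended to complete with its own connection. This is a hedged illustration of the usage pattern, not code from the patch.

    #include <liburing.h>

    /* Queue several accepts on one listening socket; with EPOLLEXCLUSIVE
     * behind the scenes, an incoming connection wakes only one of them. */
    static void queue_accepts(struct io_uring *ring, int listen_fd, int count)
    {
            for (int i = 0; i < count; i++) {
                    struct io_uring_sqe *sqe = io_uring_get_sqe(ring);
                    io_uring_prep_accept(sqe, listen_fd, NULL, NULL, 0);
                    sqe->user_data = i;   /* identify which accept completed */
            }
            io_uring_submit(ring);
    }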
-
Submitted by Jens Axboe
While profiling task_work intensive workloads, I noticed that most of the time in tctx_task_work() is spent stalled on loading 'req'. This is one of the unfortunate side effects of using linked lists, particularly when they end up being passed around. Prefetch the next request, if there is one. There's a sufficient amount of work in between that this makes it available for the next loop. While fiddling with the cache layout, move the link outside of the hot completion cacheline. It's rarely used in hot workloads, so it's better to bring in kbuf, which is used for networked loads with provided buffers. This reduces tctx_task_work() overhead from ~3% to 1-1.5% in my testing.
Signed-off-by: Jens Axboe <axboe@kernel.dk>
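The prefetch-ahead idea itself is generic; here is a hedged sketch of walking a singly linked list while prefetching the next node, so the memory load overlaps with the work done on the current one (plain C, not the io_uring code).

    struct node {
            struct node *next;
            int payload;
    };

    void process(struct node *n);   /* whatever per-node work is being done */

    void walk(struct node *head)
    {
            for (struct node *n = head; n; ) {
                    struct node *next = n->next;
                    if (next)
                            __builtin_prefetch(next);  /* start fetching the next node early */
                    process(n);                        /* enough work here to hide the cache miss */
                    n = next;
            }
    }

The technique only pays off when the per-node work is long enough to cover the memory latency, which is exactly the condition the commit message calls out.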
-
Submitted by Dylan Yudaken
Do not set REQ_F_NOWAIT if the socket is non-blocking. When set, this causes the accept to immediately post a CQE with -EAGAIN, which means you cannot perform an accept SQE on a non-blocking socket asynchronously. By removing the flag when there is no pending accept, poll is armed as usual and the CQE is posted when a connection comes in.
Signed-off-by: Dylan Yudaken <dylany@fb.com>
Link: https://lore.kernel.org/r/20220324143435.2875844-1-dylany@fb.com
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
- 24 March 2022, 3 commits
-
-
Submitted by Jens Axboe
This was introduced with the message ring opcode, but isn't strictly required for the request itself. The sender can encode what is needed in user_data, which is passed to the receiver. It's unclear if having a separate flag that essentially says "this CQE did not originate from an SQE on this ring" provides any real utility to applications. While we can always re-introduce a flag to provide this information, we cannot take it away at a later point in time. Remove the flag while we still can, before it's in a released kernel.
Signed-off-by: Jens Axboe <axboe@kernel.dk>
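Encoding the provenance in user_data is purely an application-side convention. Below is a hedged liburing sketch of a sender tagging its message-ring CQEs; the tag scheme is invented for illustration, and io_uring_prep_msg_ring being available depends on your liburing version.

    #include <liburing.h>

    #define MSG_FROM_OTHER_RING (1ULL << 63)   /* app-chosen tag bit */

    /* Post a CQE to another ring, marking its origin in user_data. */
    static int send_ring_msg(struct io_uring *ring, int target_ring_fd, __u64 id)
    {
            struct io_uring_sqe *sqe = io_uring_get_sqe(ring);
            if (!sqe)
                    return -1;
            io_uring_prep_msg_ring(sqe, target_ring_fd, 0,
                                   MSG_FROM_OTHER_RING | id, 0);
            return io_uring_submit(ring);
    }

On the target ring, the posted CQE carries this value in user_data, so the receiver can tell it apart from completions for its own submissions without any dedicated CQE flag.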
-
Submitted by Jens Axboe
If we need to continue doing this IO, then we don't want a potentially selected buffer recycled. Add a flag for that. Set this for recv/recvmsg if they do partial IO.
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
Submitted by Jens Axboe
We currently don't attempt to get the full asked-for length, even if MSG_WAITALL is set, if we get a partial receive. If we do see a partial receive, then just note how many bytes we did receive and return -EAGAIN to get it retried. The iov is advanced appropriately for the vector-based case, and we manually bump the buffer and remainder for the non-vector case.
Cc: stable@vger.kernel.org
Reported-by: Constantine Gavrilov <constantine.gavrilov@gmail.com>
Signed-off-by: Jens Axboe <axboe@kernel.dk>
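From the application's point of view, the contract is simply that a MSG_WAITALL receive should complete with the full requested length unless the connection ends or errors. A hedged liburing sketch of issuing such a receive:

    #include <liburing.h>
    #include <sys/socket.h>

    /* Ask for exactly `len` bytes; with MSG_WAITALL the CQE result should be
     * len on success, and short only on EOF or error. */
    static void queue_recv_all(struct io_uring *ring, int sockfd,
                               void *buf, size_t len)
    {
            struct io_uring_sqe *sqe = io_uring_get_sqe(ring);
            io_uring_prep_recv(sqe, sockfd, buf, len, MSG_WAITALL);
            io_uring_submit(ring);
    }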
-
- 23 March 2022, 3 commits
-
-
Submitted by Jens Axboe
We only really need to recycle the buffer when going async for a file type that has an indefinite response time (e.g. non-file/bdev). And for files that do arm poll, the async worker will arm poll anyway and the buffer will get recycled there. In that latter case, we're not holding ctx->uring_lock. Ensure we take issue_flags into account and acquire the lock if we need to.
Fixes: b1c62645 ("io_uring: recycle provided buffers if request goes async")
Reported-by: Stefan Roesch <shr@fb.com>
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
Submitted by Jens Axboe
syzbot reports a recent regression:

BUG: KASAN: use-after-free in __wake_up_common+0x637/0x650 kernel/sched/wait.c:101
Read of size 8 at addr ffff888011e8a130 by task syz-executor413/3618

CPU: 0 PID: 3618 Comm: syz-executor413 Tainted: G W 5.17.0-syzkaller-01402-g8565d644 #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011
Call Trace:
 <TASK>
 __dump_stack lib/dump_stack.c:88 [inline]
 dump_stack_lvl+0xcd/0x134 lib/dump_stack.c:106
 print_address_description.constprop.0.cold+0x8d/0x303 mm/kasan/report.c:255
 __kasan_report mm/kasan/report.c:442 [inline]
 kasan_report.cold+0x83/0xdf mm/kasan/report.c:459
 __wake_up_common+0x637/0x650 kernel/sched/wait.c:101
 __wake_up_common_lock+0xd0/0x130 kernel/sched/wait.c:138
 tty_release+0x657/0x1200 drivers/tty/tty_io.c:1781
 __fput+0x286/0x9f0 fs/file_table.c:317
 task_work_run+0xdd/0x1a0 kernel/task_work.c:164
 exit_task_work include/linux/task_work.h:32 [inline]
 do_exit+0xaff/0x29d0 kernel/exit.c:806
 do_group_exit+0xd2/0x2f0 kernel/exit.c:936
 __do_sys_exit_group kernel/exit.c:947 [inline]
 __se_sys_exit_group kernel/exit.c:945 [inline]
 __x64_sys_exit_group+0x3a/0x50 kernel/exit.c:945
 do_syscall_x64 arch/x86/entry/common.c:50 [inline]
 do_syscall_64+0x35/0xb0 arch/x86/entry/common.c:80
 entry_SYSCALL_64_after_hwframe+0x44/0xae
RIP: 0033:0x7f439a1fac69

which is due to mistakenly leaving the request on the waitqueue. The reproducer is using a tty device, which means we end up arming the same poll waitqueue twice (it uses the same poll waitqueue for both), but in io_poll_wake() we always just clear REQ_F_SINGLE_POLL regardless of which entry triggered. This leaves one waitqueue potentially armed after we're done, which then blows up in tty when the waitqueue removal is attempted. We have no room to store this information, so simply encode it in the wait_queue_entry->private where we store the io_kiocb request pointer.
Fixes: 91eac1c6 ("io_uring: cache poll/double-poll state with a request flag")
Reported-by: syzbot+09ad4050dd3a120bfccd@syzkaller.appspotmail.com
Signed-off-by: Jens Axboe <axboe@kernel.dk>
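Stashing a flag in the spare low bits of an aligned pointer is a common kernel trick; here is a hedged, simplified sketch of the idea with invented names, not the actual io_uring code.

    #include <linux/wait.h>
    #include <linux/types.h>

    /* io_kiocb is at least 4-byte aligned, so bit 0 of the pointer is free. */
    #define WQE_IS_DOUBLE_POLL  1UL

    static void set_wait_private(struct wait_queue_entry *wqe,
                                 void *req, bool double_poll)
    {
            wqe->private = (void *)((unsigned long)req |
                                    (double_poll ? WQE_IS_DOUBLE_POLL : 0));
    }

    static void *wait_to_req(struct wait_queue_entry *wqe, bool *double_poll)
    {
            unsigned long v = (unsigned long)wqe->private;

            *double_poll = v & WQE_IS_DOUBLE_POLL;
            return (void *)(v & ~WQE_IS_DOUBLE_POLL);
    }

With the entry tagged this way, the wakeup handler can tell which of the two armed waitqueues fired and clear only the matching state.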
-
Submitted by Jens Axboe
The previous commit: 1bc84c40088 ("io_uring: remove poll entry from list when canceling all") removed a potential overflow condition for the poll references. They are currently limited to 20 bits, even though we have 31 bits available; the upper bit is used to mark for cancelation. Bump the poll ref space to 31 bits, making that kind of situation much harder to trigger in general. We'll separately add overflow checking and handling.
Fixes: aa43477b ("io_uring: poll rework")
Signed-off-by: Jens Axboe <axboe@kernel.dk>
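In terms of layout, the idea is a 32-bit atomic where the top bit flags cancelation and the remaining 31 bits count references. The sketch below is how I would expect such a split to look; the macro names and helper are assumptions, not quoted from the patch.

    #include <linux/bits.h>
    #include <linux/atomic.h>

    #define IO_POLL_CANCEL_FLAG   BIT(31)          /* request is being canceled */
    #define IO_POLL_REF_MASK      GENMASK(30, 0)   /* up to 2^31 - 1 references */

    /* Take a reference unless cancelation has already been flagged. */
    static inline bool poll_get_ref(atomic_t *refs)
    {
            return !(atomic_fetch_inc(refs) & IO_POLL_CANCEL_FLAG);
    }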
-
- 22 March 2022, 1 commit
-
-
Submitted by Jens Axboe
When the ring is exiting, poll requests are removed as part of the shutdown. But io_poll_remove_all() does not remove entries when it finds them, and since completions are done out-of-band, we can find and remove the same entry multiple times. We do guard poll execution by poll ownership, but that does not exclude us from reissuing a new one once the previous removal's ownership goes away. This can race with poll execution as well, where we then end up seeing req->apoll be NULL because a previous task_work requeue finished the request. Remove the poll entry when we find it and get ownership of it. This prevents multiple invocations from finding it.
Fixes: aa43477b ("io_uring: poll rework")
Reported-by: Dylan Yudaken <dylany@fb.com>
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
- 21 March 2022, 2 commits
-
-
Submitted by Almog Khaikin
Without a full memory barrier between the store to the flags and the load of the SQ tail, the two operations can be reordered, and this can lead to a situation where the SQPOLL thread goes to sleep while the application writes to the SQ tail and doesn't see the wakeup flag. This memory barrier pairs with a full memory barrier in the application between its store to the SQ tail and its load of the flags.
Signed-off-by: Almog Khaikin <almogkh@gmail.com>
Link: https://lore.kernel.org/r/20220321090059.46313-1-almogkh@gmail.com
Signed-off-by: Jens Axboe <axboe@kernel.dk>
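The application-side half of that pairing, for code driving the SQ ring by hand rather than through liburing, might look like the sketch below (C11 atomics; the ring mmap bookkeeping is assumed to be in place and is not shown).

    #include <stdatomic.h>
    #include <linux/io_uring.h>
    #include <sys/syscall.h>
    #include <unistd.h>

    /* sq_tail and sq_flags point into the mmap'ed SQ ring. */
    static void publish_sqe(int ring_fd, _Atomic unsigned *sq_tail,
                            _Atomic unsigned *sq_flags, unsigned new_tail)
    {
            /* Make the SQE visible, then publish the new tail. */
            atomic_store_explicit(sq_tail, new_tail, memory_order_release);

            /* Full barrier: pairs with the kernel's full barrier on the SQPOLL side. */
            atomic_thread_fence(memory_order_seq_cst);

            if (atomic_load_explicit(sq_flags, memory_order_relaxed) &
                IORING_SQ_NEED_WAKEUP)
                    syscall(__NR_io_uring_enter, ring_fd, 0, 0,
                            IORING_ENTER_SQ_WAKEUP, NULL, 0);
    }

Without the fence, the flags load could be satisfied before the tail store becomes visible, and both sides could end up waiting on each other.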
-
Submitted by Jens Axboe
Ensure that we call fsnotify_modify() if we write a file, and fsnotify_access() if we read it. This enables anyone using inotify on the file to get notified. Ditto for fallocate: ensure that fsnotify_modify() is called.
Cc: stable@vger.kernel.org
Signed-off-by: Jens Axboe <axboe@kernel.dk>
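The observable effect is on the inotify side: with this change, a watcher like the hedged sketch below should also see IN_ACCESS/IN_MODIFY events for reads and writes issued through io_uring (the watched path is a placeholder).

    #include <sys/inotify.h>
    #include <stdio.h>
    #include <unistd.h>

    int watch_file(const char *path)
    {
            char buf[4096];
            int fd = inotify_init1(0);

            if (fd < 0 || inotify_add_watch(fd, path, IN_ACCESS | IN_MODIFY) < 0)
                    return -1;

            ssize_t len = read(fd, buf, sizeof(buf));   /* blocks until an event arrives */
            for (char *p = buf; len > 0 && p < buf + len; ) {
                    struct inotify_event *ev = (struct inotify_event *)p;
                    printf("event mask=0x%x\n", ev->mask);
                    p += sizeof(*ev) + ev->len;
            }
            close(fd);
            return 0;
    }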
-
- 20 March 2022, 1 commit
-
-
Submitted by Jens Axboe
We currently have a race where we recycle the selected buffer if poll returns IO_APOLL_OK. But that's too late, as the poll could already be triggering or have triggered. If that race happens, then we're putting a buffer that's already being used. Fix this by recycling before we arm poll. This does mean that we'll sometimes almost instantly re-select the buffer, but it's rare enough in testing that it should not pose a performance issue.
Fixes: b1c62645 ("io_uring: recycle provided buffers if request goes async")
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-
- 19 March 2022, 1 commit
-
-
Submitted by Jens Axboe
The fix for not advancing the iterator if we're using fixed buffers is broken in that it can hit a condition where we don't terminate the loop. This results in io-wq looping forever, asking to read (or write) 0 bytes for every subsequent loop.
Reported-by: Joel Jaeschke <joel.jaeschke@gmail.com>
Link: https://github.com/axboe/liburing/issues/549
Fixes: 16c8d2df ("io_uring: ensure symmetry in handling iter types in loop_rw_iter()")
Signed-off-by: Jens Axboe <axboe@kernel.dk>
-