提交 · 930218affeadd1325ea17e053f0dcecf218f5a4f · openeuler / raspberrypi-kernel

23 5月, 2018 1 次提交

uio, lib: Fix CONFIG_ARCH_HAS_UACCESS_MCSAFE compilation · 522239b4

由 Dan Williams 提交于 5月 22, 2018

Add a common Kconfig CONFIG_ARCH_HAS_UACCESS_MCSAFE that archs can
optionally select, and fixup the declaration of _copy_to_iter_mcsafe().

Fixes: 8780356e ("x86/asm/memcpy_mcsafe: Define copy_to_iter_mcsafe()")
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

522239b4

15 5月, 2018 1 次提交

x86/asm/memcpy_mcsafe: Define copy_to_iter_mcsafe() · 8780356e

由 Dan Williams 提交于 5月 03, 2018

Use the updated memcpy_mcsafe() implementation to define
copy_user_mcsafe() and copy_to_iter_mcsafe(). The most significant
difference from typical copy_to_iter() is that the ITER_KVEC and
ITER_BVEC iterator types can fail to complete a full transfer.
Signed-off-by: NDan Williams <dan.j.williams@intel.com>
Cc: Al Viro <viro@zeniv.linux.org.uk>
Cc: Andrew Morton <akpm@linux-foundation.org>
Cc: Andy Lutomirski <luto@amacapital.net>
Cc: Borislav Petkov <bp@alien8.de>
Cc: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: Tony Luck <tony.luck@intel.com>
Cc: hch@lst.de
Cc: linux-fsdevel@vger.kernel.org
Cc: linux-nvdimm@lists.01.org
Link: http://lkml.kernel.org/r/152539239150.31796.9189779163576449784.stgit@dwillia2-desk3.amr.corp.intel.comSigned-off-by: NIngo Molnar <mingo@kernel.org>

8780356e

12 10月, 2017 2 次提交

new primitive: iov_iter_for_each_range() · 09cf698a

由 Al Viro 提交于 2月 18, 2017

For kvec and bvec: feeds segments to given callback as long as it
returns 0.  For iovec and pipe: fails.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

09cf698a

A
kill iov_shorten() · faea1329
由 Al Viro 提交于 9月 24, 2017
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
faea1329

10 7月, 2017 1 次提交

fix brown paperbag bug in inlined copy_..._iter() · c43aeb19

由 Al Viro 提交于 7月 10, 2017

"copied nothing" == "return 0", not "return full size".

Fixes: aa28de27 "iov_iter/hardening: move object size checks to inlined part"
Spotted-by: NArnd Bergmann <arnd@arndb.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

c43aeb19

30 6月, 2017 1 次提交

iov_iter/hardening: move object size checks to inlined part · aa28de27

由 Al Viro 提交于 6月 29, 2017

There we actually have useful information about object sizes.
Note: this patch has them done for all iov_iter flavours.
Right now we do them twice in iovec case, but that'll change
very shortly.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

aa28de27

10 6月, 2017 1 次提交

x86, uaccess: introduce copy_from_iter_flushcache for pmem / cache-bypass operations · 0aed55af

由 Dan Williams 提交于 5月 29, 2017

The pmem driver has a need to transfer data with a persistent memory
destination and be able to rely on the fact that the destination writes are not
cached. It is sufficient for the writes to be flushed to a cpu-store-buffer
(non-temporal / "movnt" in x86 terms), as we expect userspace to call fsync()
to ensure data-writes have reached a power-fail-safe zone in the platform. The
fsync() triggers a REQ_FUA or REQ_FLUSH to the pmem driver which will turn
around and fence previous writes with an "sfence".

Implement a __copy_from_user_inatomic_flushcache, memcpy_page_flushcache, and
memcpy_flushcache, that guarantee that the destination buffer is not dirty in
the cpu cache on completion. The new copy_from_iter_flushcache and sub-routines
will be used to replace the "pmem api" (include/linux/pmem.h +
arch/x86/include/asm/pmem.h). The availability of copy_from_iter_flushcache()
and memcpy_flushcache() are gated by the CONFIG_ARCH_HAS_UACCESS_FLUSHCACHE
config symbol, and fallback to copy_from_iter_nocache() and plain memcpy()
otherwise.

This is meant to satisfy the concern from Linus that if a driver wants to do
something beyond the normal nocache semantics it should be something private to
that driver [1], and Al's concern that anything uaccess related belongs with
the rest of the uaccess code [2].

The first consumer of this interface is a new 'copy_from_iter' dax operation so
that pmem can inject cache maintenance operations without imposing this
overhead on other dax-capable drivers.

[1]: https://lists.01.org/pipermail/linux-nvdimm/2017-January/008364.html
[2]: https://lists.01.org/pipermail/linux-nvdimm/2017-April/009942.html

Cc: <x86@kernel.org>
Cc: Jan Kara <jack@suse.cz>
Cc: Jeff Moyer <jmoyer@redhat.com>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Christoph Hellwig <hch@lst.de>
Cc: Toshi Kani <toshi.kani@hpe.com>
Cc: "H. Peter Anvin" <hpa@zytor.com>
Cc: Al Viro <viro@zeniv.linux.org.uk>
Cc: Thomas Gleixner <tglx@linutronix.de>
Cc: Matthew Wilcox <mawilcox@microsoft.com>
Reviewed-by: NRoss Zwisler <ross.zwisler@linux.intel.com>
Signed-off-by: NDan Williams <dan.j.williams@intel.com>

0aed55af

03 4月, 2017 1 次提交

[iov_iter] new privimitive: iov_iter_revert() · 27c0e374

由 Al Viro 提交于 2月 17, 2017

opposite to iov_iter_advance(); the caller is responsible for never
using it to move back past the initial position.

Cc: stable@vger.kernel.org
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

27c0e374

06 12月, 2016 1 次提交

[iov_iter] new primitives - copy_from_iter_full() and friends · cbbd26b8

由 Al Viro 提交于 11月 01, 2016

copy_from_iter_full(), copy_from_iter_full_nocache() and
csum_and_copy_from_iter_full() - counterparts of copy_from_iter()
et.al., advancing iterator only in case of successful full copy
and returning whether it had been successful or not.

Convert some obvious users.  *NOTE* - do not blindly assume that
something is a good candidate for those unless you are sure that
not advancing iov_iter in failure case is the right thing in
this case.  Anything that does short read/short write kind of
stuff (or is in a loop, etc.) is unlikely to be a good one.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

cbbd26b8

01 11月, 2016 1 次提交

fs: decouple READ and WRITE from the block layer ops · d3849953

由 Christoph Hellwig 提交于 11月 01, 2016

Move READ and WRITE to kernel.h and don't define them in terms of block
layer ops; they are our generic data direction indicators these days
and have no more resemblance with the block layer ops.
Signed-off-by: NChristoph Hellwig <hch@lst.de>
Signed-off-by: NJens Axboe <axboe@fb.com>

d3849953

11 10月, 2016 1 次提交
- A
  constify iov_iter_count() and iter_is_iovec() · b57332b4
  由 Al Viro 提交于 10月 10, 2016
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  b57332b4
06 10月, 2016 1 次提交

new iov_iter flavour: pipe-backed · 241699cd

由 Al Viro 提交于 9月 22, 2016

iov_iter variant for passing data into pipe.  copy_to_iter()
copies data into page(s) it has allocated and stuffs them into
the pipe; copy_page_to_iter() stuffs there a reference to the
page given to it.  Both will try to coalesce if possible.
iov_iter_zero() is similar to copy_to_iter(); iov_iter_get_pages()
and friends will do as copy_to_iter() would have and return the
pages where the data would've been copied.  iov_iter_advance()
will truncate everything past the spot it has advanced to.

New primitive: iov_iter_pipe(), used for initializing those.
pipe should be locked all along.

Running out of space acts as fault would for iovec-backed ones;
in other words, giving it to ->read_iter() may result in short
read if the pipe overflows, or -EFAULT if it happens with nothing
copied there.

In other words, ->read_iter() on those acts pretty much like
->splice_read().  Moreover, all generic_file_splice_read() users,
as well as many other ->splice_read() instances can be switched
to that scheme - that'll happen in the next commit.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

241699cd

28 9月, 2016 1 次提交

get rid of separate multipage fault-in primitives · 4bce9f6e

由 Al Viro 提交于 9月 17, 2016

* the only remaining callers of "short" fault-ins are just as happy with generic
variants (both in lib/iov_iter.c); switch them to multipage variants, kill the
"short" ones
* rename the multipage variants to now available plain ones.
* get rid of compat macro defining iov_iter_fault_in_multipage_readable by
expanding it in its only user.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

4bce9f6e

18 9月, 2016 1 次提交

fix iov_iter_fault_in_readable() · d4690f1e

由 Al Viro 提交于 9月 16, 2016

... by turning it into what used to be multipages counterpart

Cc: stable@vger.kernel.org
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

d4690f1e

09 4月, 2016 1 次提交
- A
  fix the copy vs. map logics in blk_rq_map_user_iov() · 357f435d
  由 Al Viro 提交于 4月 08, 2016
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  357f435d
07 12月, 2015 1 次提交
- A
  iov_iter: constify {csum_and_,}copy_to_iter() · 36f7a8a4
  由 Al Viro 提交于 12月 06, 2015
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  36f7a8a4
12 4月, 2015 2 次提交

new helper: iov_iter_rw() · bd8e0ff9

由 Omar Sandoval 提交于 3月 17, 2015

Get either READ or WRITE out of iter->type.
Signed-off-by: NOmar Sandoval <osandov@osandov.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

bd8e0ff9

VFS: Add iov_iter_fault_in_multipages_readable() · 171a0203

由 Anton Altaparmakov 提交于 3月 11, 2015

simillar to iov_iter_fault_in_readable() but differs in that it is
not limited to faulting in the first iovec and instead faults in
"bytes" bytes iterating over the iovecs as necessary.

Also, instead of only faulting in the first and last page of the
range, all pages are faulted in.

This function is needed by NTFS when it does multi page file
writes.
Signed-off-by: NAnton Altaparmakov <anton@tuxera.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

171a0203

30 3月, 2015 1 次提交

saner iov_iter initialization primitives · bc917be8

由 Al Viro 提交于 3月 21, 2015

iovec-backed iov_iter instances are assumed to satisfy several properties:
	* no more than UIO_MAXIOV elements in iovec array
	* total size of all ranges is no more than MAX_RW_COUNT
	* all ranges pass access_ok().

The problem is, invariants of data structures should be established in the
primitives creating those data structures, not in the code using those
primitives.  And iov_iter_init() violates that principle.  For a while we
managed to get away with that, but once the use of iov_iter started to
spread, it didn't take long for shit to hit the fan - missed check in
sys_sendto() had introduced a roothole.

We _do_ have primitives for importing and validating iovecs (both native and
compat ones) and those primitives are almost always followed by shoving the
resulting iovec into iov_iter.  Life would be considerably simpler (and safer)
if we combined those primitives with initializing iov_iter.

That gives us two new primitives - import_iovec() and compat_import_iovec().
Calling conventions:
	iovec = iov_array;
	err = import_iovec(direction, uvec, nr_segs,
			   ARRAY_SIZE(iov_array), &iovec,
			   &iter);
imports user vector into kernel space (into iov_array if it fits, allocated
if it doesn't fit or if iovec was NULL), validates it and sets iter up to
refer to it.  On success 0 is returned and allocated kernel copy (or NULL
if the array had fit into caller-supplied one) is returned via iovec.
On failure all allocations are undone and -E... is returned.  If the total
size of ranges exceeds MAX_RW_COUNT, the excess is silently truncated.

compat_import_iovec() expects uvec to be a pointer to user array of compat_iovec;
otherwise it's identical to import_iovec().

Finally, import_single_range() sets iov_iter backed by single-element iovec
covering a user-supplied range -

	err = import_single_range(direction, address, size, iovec, &iter);

does validation and sets iter up.  Again, size in excess of MAX_RW_COUNT gets
silently truncated.

Next commits will be switching the things up to use of those and reducing
the amount of iov_iter_init() instances.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

bc917be8

18 2月, 2015 1 次提交

new helper: dup_iter() · 4b8164b9

由 Al Viro 提交于 1月 31, 2015

Copy iter and kmemdup the underlying array for the copy.  Returns
a pointer to result of kmemdup() to be kfree()'d later.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

4b8164b9

04 2月, 2015 3 次提交

vhost: vhost_scsi_handle_vq() should just use copy_from_user() · 57dd8a07

由 Al Viro 提交于 12月 10, 2014

it has just verified that it asks no more than the length of the
first segment of iovec.

And with that the last user of stuff in lib/iovec.c is gone.
RIP.

Cc: Michael S. Tsirkin <mst@redhat.com>
Cc: Nicholas A. Bellinger <nab@linux-iscsi.org>
Cc: kvm@vger.kernel.org
Cc: virtualization@lists.linux-foundation.org
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

57dd8a07

vhost: don't bother copying iovecs in handle_rx(), kill memcpy_toiovecend() · ba7438ae

由 Al Viro 提交于 12月 10, 2014

Cc: Michael S. Tsirkin <mst@redhat.com>
Cc: kvm@vger.kernel.org
Cc: virtualization@lists.linux-foundation.org
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

ba7438ae

vhost: switch vhost get_indirect() to iov_iter, kill memcpy_fromiovec() · aad9a1ce

由 Al Viro 提交于 12月 10, 2014

Cc: Michael S. Tsirkin <mst@redhat.com>
Cc: kvm@vger.kernel.org
Cc: virtualization@lists.linux-foundation.org
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

aad9a1ce

29 1月, 2015 1 次提交

new helper: iov_iter_bvec() · 05afcb77

由 Al Viro 提交于 1月 23, 2015

similar to iov_iter_kvec(), for ITER_BVEC ones
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

05afcb77

17 12月, 2014 1 次提交
- A
  new helper: iter_is_iovec() · 777eda2c
  由 Al Viro 提交于 12月 17, 2014
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  777eda2c
10 12月, 2014 1 次提交
- A
  bury memcpy_toiovec() · 218321e7
  由 Al Viro 提交于 11月 24, 2014
```
no users left
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  218321e7
09 12月, 2014 4 次提交
- A
  copy_from_iter_nocache() · aa583096
  由 Al Viro 提交于 11月 27, 2014
```
BTW, do we want memcpy_nocache()?
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  aa583096
- A
  new helper: iov_iter_kvec() · abb78f87
  由 Al Viro 提交于 11月 24, 2014
```
initialization of kvec-backed iov_iter
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  abb78f87
- A
  csum_and_copy_..._iter() · a604ec7e
  由 Al Viro 提交于 11月 24, 2014
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  a604ec7e
- A
  iov_iter.c: handle ITER_KVEC directly · a280455f
  由 Al Viro 提交于 11月 27, 2014
```
... without bothering with copy_..._user()
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  a280455f
09 10月, 2014 1 次提交

Add copy_to_iter(), copy_from_iter() and iov_iter_zero() · c35e0248

由 Matthew Wilcox 提交于 8月 01, 2014

For DAX, we want to be able to copy between iovecs and kernel addresses
that don't necessarily have a struct page.  This is a fairly simple
rearrangement for bvec iters to kmap the pages outside and pass them in,
but for user iovecs it gets more complicated because we might try various
different ways to kmap the memory.  Duplicating the existing logic works
out best in this case.

We need to be able to write zeroes to an iovec for reads from unwritten
ranges in a file.  This is performed by the new iov_iter_zero() function,
again patterned after the existing code that handles iovec iterators.

[AV: and export the buggers...]
Signed-off-by: NMatthew Wilcox <willy@linux.intel.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

c35e0248

27 9月, 2014 1 次提交

fuse: honour max_read and max_write in direct_io mode · 2c80929c

由 Miklos Szeredi 提交于 9月 24, 2014

The third argument of fuse_get_user_pages() "nbytesp" refers to the number of
bytes a caller asked to pack into fuse request. This value may be lesser
than capacity of fuse request or iov_iter.  So fuse_get_user_pages() must
ensure that *nbytesp won't grow.

Now, when helper iov_iter_get_pages() performs all hard work of extracting
pages from iov_iter, it can be done by passing properly calculated
"maxsize" to the helper.

The other caller of iov_iter_get_pages() (dio_refill_pages()) doesn't need
this capability, so pass LONG_MAX as the maxsize argument here.

Fixes: c9c37e2e ("fuse: switch to iov_iter_get_pages()")
Reported-by: NWerner Baumann <werner.baumann@onlinehome.de>
Tested-by: NMaxim Patlasov <mpatlasov@parallels.com>
Signed-off-by: NMiklos Szeredi <mszeredi@suse.cz>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

2c80929c

08 8月, 2014 1 次提交
- A
  switch iov_iter_get_pages() to passing maximal number of pages · c7f3888a
  由 Al Viro 提交于 6月 18, 2014
```
... instead of maximal size.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  c7f3888a
28 6月, 2014 1 次提交

iovec: move memcpy_from/toiovecend to lib/iovec.c · ac5ccdba

由 Michael S. Tsirkin 提交于 6月 19, 2014

ERROR: "memcpy_fromiovecend" [drivers/vhost/vhost_scsi.ko] undefined!

commit 9f977ef7
    vhost-scsi: Include prot_bytes into expected data transfer length
in target-pending makes drivers/vhost/scsi.c call memcpy_fromiovecend().
This function is not available when CONFIG_NET is not enabled.

socket.h already includes uio.h, so no callers need updating.
Reported-by: NRandy Dunlap <rdunlap@infradead.org>
Cc: Stephen Rothwell <sfr@canb.auug.org.au>
Cc: "David S. Miller" <davem@davemloft.net>
Signed-off-by: NDavid S. Miller <davem@davemloft.net>
Signed-off-by: NMichael S. Tsirkin <mst@redhat.com>
Signed-off-by: NNicholas Bellinger <nab@linux-iscsi.org>

ac5ccdba

27 6月, 2014 1 次提交

Fix 32-bit regression in block device read(2) · 0b86dbf6

由 Al Viro 提交于 6月 23, 2014

blkdev_read_iter() wants to cap the iov_iter by the amount of data
remaining to the end of device.  That's what iov_iter_truncate() is for
(trim iter->count if it's above the given limit).  So far, so good, but
the argument of iov_iter_truncate() is size_t, so on 32bit boxen (in
case of a large device) we end up with that upper limit truncated down
to 32 bits *before* comparing it with iter->count.

Easily fixed by making iov_iter_truncate() take 64bit argument - it does
the right thing after such change (we only reach the assignment in there
when the current value of iter->count is greater than the limit, i.e.
for anything that would get truncated we don't reach the assignment at
all) and that argument is not the new value of iter->count - it's an
upper limit for such.

The overhead of passing u64 is not an issue - the thing is inlined, so
callers passing size_t won't pay any penalty.
Reported-and-tested-by: NTheodore Tso <tytso@mit.edu>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
Tested-by: NAlan Cox <gnomes@lxorguk.ukuu.org.uk>
Tested-by: NBruno Wolff III <bruno@wolff.to>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

0b86dbf6

07 5月, 2014 5 次提交

bio_vec-backed iov_iter · 62a8067a

由 Al Viro 提交于 4月 04, 2014

New variant of iov_iter - ITER_BVEC in iter->type, backed with
bio_vec array instead of iovec one.  Primitives taught to deal
with such beasts, __swap_write() switched to using that kind
of iov_iter.

Note that bio_vec is just a <page, offset, length> triple - there's
nothing block-specific about it.  I've left the definition where it
was, but took it from under ifdef CONFIG_BLOCK.

Next target: ->splice_write()...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

62a8067a

lustre: get rid of messing with iovecs · b42b15fd

由 Al Viro 提交于 4月 04, 2014

* switch to ->read_iter/->write_iter
* keep a pointer to iov_iter instead of iov/nr_segs
* do not modify iovecs; use iov_iter_truncate()/iov_iter_advance() and
a new primitive - iov_iter_reexpand() (expand previously truncated
iterator) istead.
* (racy) check for lustre VMAs intersecting with iovecs kept for now as
for_each_iov() loop.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

b42b15fd

new helper: copy_page_from_iter() · f0d1bec9

由 Al Viro 提交于 4月 03, 2014

parallel to copy_page_to_iter().  pipe_write() switched to it (and became
->write_iter()).
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

f0d1bec9

iov_iter_truncate() · 0c949334

由 Al Viro 提交于 3月 22, 2014

Now It Can Be Done(tm) - we don't need to do iov_shorten() in
generic_file_direct_write() anymore, now that all ->direct_IO()
instances are converted to proper iov_iter methods and honour
iter->count and iter->iov_offset properly.

Get rid of count/ocount arguments of generic_file_direct_write(),
while we are at it.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

0c949334

new helper: iov_iter_get_pages_alloc() · 91f79c43

由 Al Viro 提交于 3月 21, 2014

same as iov_iter_get_pages(), except that pages array is allocated
(kmalloc if possible, vmalloc if that fails) and left for caller to
free.  Lustre and NFS ->direct_IO() switched to it.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

91f79c43