提交 · 49a57857aeea06ca831043acbb0fa5e0f50602fd · openeuler / Kernel

08 1月, 2019 2 次提交

ceph: use vmf_error() in ceph_filemap_fault() · c64a2b05

由 Souptick Joarder 提交于 1月 05, 2019

This code is converted to use vmf_error().
Signed-off-by: NSouptick Joarder <jrdr.linux@gmail.com>
Reviewed-by: NIlya Dryomov <idryomov@gmail.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

c64a2b05

libceph: allow setting abort_on_full for rbd · 02b2f549

由 Dongsheng Yang 提交于 12月 18, 2018

Introduce a new option abort_on_full, default to false. Then
we can get -ENOSPC when the pool is full, or reaches quota.

[ Don't show abort_on_full in /proc/mounts. ]
Signed-off-by: NDongsheng Yang <dongsheng.yang@easystack.cn>
Reviewed-by: NIlya Dryomov <idryomov@gmail.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

02b2f549

05 1月, 2019 1 次提交

fs: don't open code lru_to_page() · f86196ea

由 Nikolay Borisov 提交于 1月 03, 2019

Multiple filesystems open code lru_to_page().  Rectify this by moving
the macro from mm_inline (which is specific to lru stuff) to the more
generic mm.h header and start using the macro where appropriate.

No functional changes.

Link: http://lkml.kernel.org/r/20181129104810.23361-1-nborisov@suse.com
Link: https://lkml.kernel.org/r/20181129075301.29087-1-nborisov@suse.comSigned-off-by: NNikolay Borisov <nborisov@suse.com>
Acked-by: NMichal Hocko <mhocko@suse.com>
Reviewed-by: NDavid Hildenbrand <david@redhat.com>
Reviewed-by: NMike Rapoport <rppt@linux.ibm.com>
Acked-by: NPankaj gupta <pagupta@redhat.com>
Acked-by: "Yan, Zheng" <zyan@redhat.com>		[ceph]
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

f86196ea

29 12月, 2018 1 次提交

mm: convert totalram_pages and totalhigh_pages variables to atomic · ca79b0c2

由 Arun KS 提交于 12月 28, 2018

totalram_pages and totalhigh_pages are made static inline function.

Main motivation was that managed_page_count_lock handling was complicating
things.  It was discussed in length here,
https://lore.kernel.org/patchwork/patch/995739/#1181785 So it seemes
better to remove the lock and convert variables to atomic, with preventing
poteintial store-to-read tearing as a bonus.

[akpm@linux-foundation.org: coding style fixes]
Link: http://lkml.kernel.org/r/1542090790-21750-4-git-send-email-arunks@codeaurora.orgSigned-off-by: NArun KS <arunks@codeaurora.org>
Suggested-by: NMichal Hocko <mhocko@suse.com>
Suggested-by: NVlastimil Babka <vbabka@suse.cz>
Reviewed-by: NKonstantin Khlebnikov <khlebnikov@yandex-team.ru>
Reviewed-by: NPavel Tatashin <pasha.tatashin@soleen.com>
Acked-by: NMichal Hocko <mhocko@suse.com>
Acked-by: NVlastimil Babka <vbabka@suse.cz>
Cc: David Hildenbrand <david@redhat.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

ca79b0c2

26 12月, 2018 7 次提交

ceph: don't encode inode pathes into reconnect message · 5ccedf1c

由 Yan, Zheng 提交于 12月 13, 2018

mds hasn't used inode pathes since introducing inode backtrace.
Signed-off-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

5ccedf1c

ceph: update wanted caps after resuming stale session · d2f8bb27

由 Yan, Zheng 提交于 12月 10, 2018

mds contains an optimization, it does not re-issue stale caps if
client does not want any cap.

A special case of the optimization is that client wants some caps,
but skipped updating 'wanted'. For this case, client needs to update
'wanted' when stale session get renewed.
Signed-off-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

d2f8bb27

ceph: skip updating 'wanted' caps if caps are already issued · fdac94fa

由 Yan, Zheng 提交于 11月 22, 2018

When reading cached inode that already has Fscr caps, this can avoid
two cap messages (one updats 'wanted' caps, one clears 'wanted' caps).
Signed-off-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

fdac94fa

Y
ceph: don't request excl caps when mount is readonly · 8a2ac3a8
由 Yan, Zheng 提交于 12月 05, 2018
```
Signed-off-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
```
8a2ac3a8

ceph: don't update importing cap's mseq when handing cap export · 3c1392d4

由 Yan, Zheng 提交于 11月 29, 2018

Updating mseq makes client think importer mds has accepted all prior
cap messages and importer mds knows what caps client wants. Actually
some cap messages may have been dropped because of mseq mismatch.

If mseq is left untouched, importing cap's mds_wanted later will get
reset by cap import message.

Cc: stable@vger.kernel.org
Signed-off-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

3c1392d4

ceph: remove redundant assignment · 0cab9f33

由 Chengguang Xu 提交于 11月 15, 2018

There is redundant assighment of variable i in
ceph_mdsmap_get_random_mds(), just remvoe it.
Signed-off-by: NChengguang Xu <cgxu519@gmx.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

0cab9f33

ceph: cleanup splice_dentry() · 2bf996ac

由 Yan, Zheng 提交于 10月 25, 2018

splice_dentry() may drop the original dentry and return other
dentry. It relies on its caller to update pointer that points
to the dropped dentry. This is error-prone.
Signed-off-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

2bf996ac

12 12月, 2018 1 次提交

ceph: make 'nocopyfrom' a default mount option · 6f9718fe

由 Luis Henriques 提交于 12月 10, 2018

Since we found a problem with the 'copy-from' operation after objects have
been truncated, offloading object copies to OSDs should be discouraged
until the issue is fixed.

Thus, this patch adds the 'nocopyfrom' mount option to the default mount
options which effectily means that remote copies won't be done in
copy_file_range unless they are explicitly enabled at mount time.

[ Adjust ceph_show_options() accordingly. ]

Link: https://tracker.ceph.com/issues/37378Signed-off-by: NLuis Henriques <lhenriques@suse.com>
Reviewed-by: NIlya Dryomov <idryomov@gmail.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

6f9718fe

09 11月, 2018 3 次提交

libceph: assume argonaut on the server side · 23c625ce

由 Ilya Dryomov 提交于 11月 08, 2018

No one is running pre-argonaut.  In addition one of the argonaut
features (NOSRCADDR) has been required since day one (and a half,
2.6.34 vs 2.6.35) of the kernel client.

Allow for the possibility of reusing these feature bits later.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
Reviewed-by: NSage Weil <sage@redhat.com>

23c625ce

ceph: quota: fix null pointer dereference in quota check · 71f2cc64

由 Luis Henriques 提交于 11月 05, 2018

This patch fixes a possible null pointer dereference in
check_quota_exceeded, detected by the static checker smatch, with the
following warning:

   fs/ceph/quota.c:240 check_quota_exceeded()
    error: we previously assumed 'realm' could be null (see line 188)

Fixes: b7a29217 ("ceph: quota: support for ceph.quota.max_files")
Reported-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NLuis Henriques <lhenriques@suse.com>
Reviewed-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

71f2cc64

ceph: add destination file data sync before doing any remote copy · c2c6d3ce

由 Luis Henriques 提交于 10月 23, 2018

If we try to copy into a file that was just written, any data that is
remote copied will be overwritten by our buffered writes once they are
flushed.  When this happens, the call to invalidate_inode_pages2_range
will also return a -EBUSY error.

This patch fixes this by also sync'ing the destination file before
starting any copy.

Fixes: 503f82a9 ("ceph: support copy_file_range file operation")
Signed-off-by: NLuis Henriques <lhenriques@suse.com>
Reviewed-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

c2c6d3ce

24 10月, 2018 2 次提交

iov_iter: Separate type from direction and use accessor functions · aa563d7b

由 David Howells 提交于 10月 20, 2018

In the iov_iter struct, separate the iterator type from the iterator
direction and use accessor functions to access them in most places.

Convert a bunch of places to use switch-statements to access them rather
then chains of bitwise-AND statements. This makes it easier to add further
iterator types. Also, this can be more efficient as to implement a switch
of small contiguous integers, the compiler can use ~50% fewer compare
instructions than it has to use bitwise-and instructions.

Further, cease passing the iterator type into the iterator setup function.
The iterator function can set that itself. Only the direction is required.
Signed-off-by: NDavid Howells <dhowells@redhat.com>

aa563d7b

iov_iter: Use accessor function · 00e23707

由 David Howells 提交于 10月 22, 2018

Use accessor functions to access an iterator's type and direction. This
allows for the possibility of using some other method of determining the
type of iterator than if-chains with bitwise-AND conditions.
Signed-off-by: NDavid Howells <dhowells@redhat.com>

00e23707

22 10月, 2018 16 次提交

ceph: new mount option to disable usage of copy-from op · ea4cdc54

由 Luis Henriques 提交于 10月 15, 2018

Add a new mount option 'nocopyfrom' that will prevent the usage of the
RADOS 'copy-from' operation in cephfs.  This could be useful, for example,
for an administrator to temporarily mitigate any possible bugs in the
'copy-from' implementation.

Currently, only copy_file_range uses this RADOS operation.  Setting this
mount option will result in this syscall reverting to the default VFS
implementation, i.e. to perform the copies locally instead of doing remote
object copies.
Signed-off-by: NLuis Henriques <lhenriques@suse.com>
Reviewed-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

ea4cdc54

ceph: support copy_file_range file operation · 503f82a9

由 Luis Henriques 提交于 10月 15, 2018

This commit implements support for the copy_file_range syscall in cephfs.
It is implemented using the RADOS 'copy-from' operation, which allows to
do a remote object copy, without the need to download/upload data from/to
the OSDs.

Some manual copy may however be required if the source/destination file
offsets aren't object aligned or if the copy length is smaller than the
object size.
Signed-off-by: NLuis Henriques <lhenriques@suse.com>
Reviewed-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

503f82a9

ceph: add non-blocking parameter to ceph_try_get_caps() · 2ee9dd95

由 Luis Henriques 提交于 10月 15, 2018

ceph_try_get_caps currently calls try_get_cap_refs with the nonblock
parameter always set to 'true'.  This change adds a new parameter that
allows to set it's value.  This will be useful for a follow-up patch that
will need to get two sets of capabilities for two different inodes without
risking a deadlock.
Signed-off-by: NLuis Henriques <lhenriques@suse.com>
Reviewed-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

2ee9dd95

libceph: preallocate message data items · 0d9c1ab3

由 Ilya Dryomov 提交于 10月 15, 2018

Currently message data items are allocated with ceph_msg_data_create()
in setup_request_data() inside send_request().  send_request() has never
been allowed to fail, so each allocation is followed by a BUG_ON:

  data = ceph_msg_data_create(...);
  BUG_ON(!data);

It's been this way since support for multiple message data items was
added in commit 6644ed7b ("libceph: make message data be a pointer")
in 3.10.

There is no reason to delay the allocation of message data items until
the last possible moment and we certainly don't need a linked list of
them as they are only ever appended to the end and never erased.  Make
ceph_msg_new2() take max_data_items and adapt the rest of the code.
Reported-by: NJerry Lee <leisurelysw24@gmail.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

0d9c1ab3

libceph, rbd, ceph: move ceph_osdc_alloc_messages() calls · 26f887e0

由 Ilya Dryomov 提交于 10月 15, 2018

The current requirement is that ceph_osdc_alloc_messages() should be
called after oid and oloc are known.  In preparation for preallocating
message data items, move ceph_osdc_alloc_messages() further down, so
that it is called when OSD op codes are known.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

26f887e0

ceph: num_ops is off by one in ceph_aio_retry_work() · 61d2f855

由 Ilya Dryomov 提交于 10月 11, 2018

Two OSD op slots are allocated, but only one is ever used.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

61d2f855

ceph: set timeout conditionally in __cap_delay_requeue · 66802884

由 Xuehan Xu 提交于 10月 11, 2018

__cap_delay_requeue could be invoked through ceph_check_caps when there
exists caps that needs to be sent and are delayed by "i_hold_caps_min"
or "i_hold_caps_max". If __cap_delay_requeue sets timeout unconditionally,
there could be a chance that some "wanted" caps can not be release for a
long since their timeouts are reset every time they get delayed.

Fixes: http://tracker.ceph.com/issues/36369Signed-off-by: NXuehan Xu <xuxuehan@360.cn>
Reviewed-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

66802884

libceph: don't consume a ref on pagelist in ceph_msg_data_add_pagelist() · 89486833

由 Ilya Dryomov 提交于 9月 28, 2018

Because send_mds_reconnect() wants to send a message with a pagelist
and pass the ownership to the messenger, ceph_msg_data_add_pagelist()
consumes a ref which is then put in ceph_msg_data_destroy().  This
makes managing pagelists in the OSD client (where they are wrapped in
ceph_osd_data) unnecessarily hard because the handoff only happens in
ceph_osdc_start_request() instead of when the pagelist is passed to
ceph_osd_data_pagelist_init().  I counted several memory leaks on
various error paths.

Fix up ceph_msg_data_add_pagelist() and carry a pagelist ref in
ceph_osd_data.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

89486833

libceph: introduce ceph_pagelist_alloc() · 33165d47

由 Ilya Dryomov 提交于 9月 28, 2018

struct ceph_pagelist cannot be embedded into anything else because it
has its own refcount. Merge allocation and initialization together.
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

33165d47

ceph: only allow punch hole mode in fallocate · bddff633

由 Luis Henriques 提交于 10月 09, 2018

Current implementation of cephfs fallocate isn't correct as it doesn't
really reserve the space in the cluster, which means that a subsequent
call to a write may actually fail due to lack of space. In fact, it is
currently possible to fallocate an amount space that is larger than the
free space in the cluster. It has behaved this way since the initial
commit ad7a60de ("ceph: punch hole support").

Since there's no easy solution to fix this at the moment, this patch
simply removes support for all fallocate operations but
FALLOC_FL_PUNCH_HOLE (which implies FALLOC_FL_KEEP_SIZE).

Link: https://tracker.ceph.com/issues/36317Signed-off-by: NLuis Henriques <lhenriques@suse.com>
Reviewed-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

bddff633

ceph: refactor ceph_sync_read() · fce7a974

由 Yan, Zheng 提交于 9月 29, 2018

Avoid allocating memory for the entire user request: striped_read()
does a synchronous OSD request per object, so it doesn't need more than
object size worth of pages at a time.

[ Preserve the comment, changelog. ]
Signed-off-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

fce7a974

ceph: check if LOOKUPNAME request was aborted when filling trace · 74c9e6bf

由 Yan, Zheng 提交于 9月 28, 2018

d_lookup()/d_alloc() require parent inode locked. Parent inode is
not locked if request is aborted.
Signed-off-by: N"Yan, Zheng" <zyan@redhat.com>
Reviewed-by: NJeff Layton <jlayton@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

74c9e6bf

ceph: fix dentry leak in ceph_readdir_prepopulate · c58f450b

由 Yan, Zheng 提交于 9月 28, 2018

Signed-off-by: N"Yan, Zheng" <zyan@redhat.com>
Reviewed-by: NJeff Layton <jlayton@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

c58f450b

Revert "ceph: fix dentry leak in splice_dentry()" · efe32823

由 Yan, Zheng 提交于 9月 27, 2018

This reverts commit 8b8f53af.

splice_dentry() is used by three places. For two places, req->r_dentry
is passed to splice_dentry(). In the case of error, req->r_dentry does
not get updated. So splice_dentry() should not drop reference.

Cc: stable@vger.kernel.org # 4.18+
Signed-off-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

efe32823

ceph: check snap first in ceph_set_acl() · 5da20799

由 Chengguang Xu 提交于 9月 02, 2018

Do the snap check first in ceph_set_acl(), so we can avoid
unnecessary operations when the inode has snap.
Signed-off-by: NChengguang Xu <cgxu519@gmx.com>
Reviewed-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

5da20799

ceph: reset cap hold timeout only for requeued inode · 3167893a

由 Chengguang Xu 提交于 7月 30, 2018

__cap_delay_requeue() only requeue inode which does not
have CEPH_I_FLUSH flag, so avoid reset cap hold timeout
for that inode.
Signed-off-by: NChengguang Xu <cgxu519@gmx.com>
Reviewed-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

3167893a

06 9月, 2018 1 次提交

ceph: avoid a use-after-free in ceph_destroy_options() · 8aaff151

由 Ilya Dryomov 提交于 8月 24, 2018

syzbot reported a use-after-free in ceph_destroy_options(), called from
ceph_mount().  The problem was that create_fs_client() consumed the opt
pointer on some errors, but not on all of them.  Make sure it always
consumes both libceph and ceph options.

Reported-by: syzbot+8ab6f1042021b4eed062@syzkaller.appspotmail.com
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>
Reviewed-by: N"Yan, Zheng" <zyan@redhat.com>

8aaff151

13 8月, 2018 6 次提交

ceph: don't drop message if it contains more data than expected · 0fcf6c02

由 Yan, Zheng 提交于 8月 03, 2018

Later version mds may encode more data into messages.
Signed-off-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

0fcf6c02

ceph: support cephfs' own feature bits · 342ce182

由 Yan, Zheng 提交于 5月 11, 2018

Signed-off-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

342ce182

ceph: refactor error handling code in ceph_reserve_caps() · e5bc08d0

由 Chengguang Xu 提交于 7月 28, 2018

Call new helper __ceph_unreserve_caps() to reduce duplicated code.
Signed-off-by: NChengguang Xu <cgxu519@gmx.com>
Reviewed-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

e5bc08d0

ceph: refactor ceph_unreserve_caps() · 7bf8f736

由 Chengguang Xu 提交于 7月 28, 2018

The code of ceph_unreserve_caps() and error handling in
ceph_reserve_caps() are duplicated, so introduce a helper
__ceph_unreserve_caps() to reduce duplicated code.
Signed-off-by: NChengguang Xu <cgxu519@gmx.com>
Reviewed-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

7bf8f736

ceph: change to void return type for __do_request() · d5548492

由 Chengguang Xu 提交于 7月 28, 2018

We do not check return code for __do_request() in all callers,
so change to void return type.
Signed-off-by: NChengguang Xu <cgxu519@gmx.com>
Reviewed-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

d5548492

ceph: compare fsc->max_file_size and inode->i_size for max file size limit · 9da12e3a

由 Chengguang Xu 提交于 7月 19, 2018

In ceph_llseek(), we compare fsc->max_file_size and inode->i_size to
choose max file size limit.
Signed-off-by: NChengguang Xu <cgxu519@gmx.com>
Reviewed-by: N"Yan, Zheng" <zyan@redhat.com>
Signed-off-by: NIlya Dryomov <idryomov@gmail.com>

9da12e3a

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功