提交 · 293542d8e501dc47e32ca82276aa9a0a5d9358b5 · openanolis / cloud-kernel

23 5月, 2018 20 次提交

A
hfsplus: switch to d_splice_alias() · 293542d8
由 Al Viro 提交于 5月 03, 2018
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
293542d8

hfs: don't allow mounting over .../rsrc · 0e5c56fd

由 Al Viro 提交于 4月 30, 2018

That's one case when unlink() destroys a subtree, thanks to "resource
fork" idiocy.  We might forcibly evict that shit on unlink(2), but
for now let's just disallow overmounting; as it is, anything that
plays games with those would leak mounts.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

0e5c56fd

A
hfs: use d_splice_alias() · 6b9cceea
由 Al Viro 提交于 4月 30, 2018
```
code is simpler that way
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
6b9cceea
A
omfs_lookup(): report IO errors, use d_splice_alias() · 18fbbfc2
由 Al Viro 提交于 4月 30, 2018
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
18fbbfc2

orangefs_lookup: simplify · 04bb1ba1

由 Al Viro 提交于 4月 30, 2018

d_splice_alias() can handle NULL and ERR_PTR() for inode just fine...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

04bb1ba1

A
openpromfs: switch to d_splice_alias() · 0ed883fd
由 Al Viro 提交于 5月 03, 2018
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
0ed883fd

xfs_vn_lookup: simplify a bit · b113a6d3

由 Al Viro 提交于 4月 30, 2018

have all post-xfs_lookup() branches converge on d_splice_alias()

Cc: Darrick J. Wong <darrick.wong@oracle.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

b113a6d3

A
adfs_lookup: do not fail with ENOENT on negatives, use d_splice_alias() · 9a7dddca
由 Al Viro 提交于 4月 30, 2018
```
Cc: Russell King <linux@armlinux.org.uk>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
9a7dddca
A
adfs_lookup_byname: .. *is* taken care of in fs/namei.c · 686bb96d
由 Al Viro 提交于 4月 30, 2018
```
Cc: Russell King <linux@armlinux.org.uk>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
686bb96d
A
romfs_lookup: switch to d_splice_alias() · 8130c151
由 Al Viro 提交于 4月 30, 2018
```
... and hash negative lookups
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
8130c151
A
qnx6_lookup: switch to d_splice_alias() · c1481700
由 Al Viro 提交于 4月 30, 2018
```
... and hash negative lookups
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
c1481700

ubifs_lookup: use d_splice_alias() · 191ac107

由 Al Viro 提交于 4月 30, 2018

code is simpler that way
Acked-by: NRichard Weinberger <richard@nod.at>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

191ac107

sysv_lookup: use d_splice_alias() · 5bf35449

由 Al Viro 提交于 4月 30, 2018

code is simpler that way

Cc: Christoph Hellwig <hch@lst.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

5bf35449

qnx4_lookup: use d_splice_alias() · b135dcea

由 Al Viro 提交于 4月 30, 2018

code is simpler that way
Acked-by: NAnders Larsen <al@alarsen.net>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

b135dcea

A
minix_lookup: use d_splice_alias() · b0149516
由 Al Viro 提交于 4月 30, 2018
```
code is simpler that way
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
b0149516

freevxfs_lookup(): use d_splice_alias() · 72ff0b03

由 Al Viro 提交于 4月 30, 2018

code is simpler that way
Cc: Christoph Hellwig <hch@lst.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

72ff0b03

cramfs_lookup(): use d_splice_alias() · d023b3a1

由 Al Viro 提交于 4月 30, 2018

simpler code that way, actually
Acked-by: NNicolas Pitre <nico@linaro.org>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

d023b3a1

bfs_add_entry: pass name/len as qstr pointer · b455ecd4

由 Al Viro 提交于 4月 30, 2018

same story as with bfs_find_entry()

Cc: "Tigran A. Aivazian" <aivazian.tigran@gmail.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

b455ecd4

bfs_find_entry: pass name/len as qstr pointer · 33ebdebe

由 Al Viro 提交于 4月 30, 2018

all callers feed something->name/something->len anyway

Cc: "Tigran A. Aivazian" <aivazian.tigran@gmail.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

33ebdebe

bfs_lookup(): use d_splice_alias() · a596a23b

由 Al Viro 提交于 4月 30, 2018

code is actually simpler that way.
Acked-by: N"Tigran A. Aivazian" <aivazian.tigran@gmail.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

a596a23b

22 5月, 2018 11 次提交

A

Merge branch 'work.misc' into work.lookup · 837f3ec6
由 Al Viro 提交于 5月 21, 2018

837f3ec6

aio: fix io_destroy(2) vs. lookup_ioctx() race · baf10564

由 Al Viro 提交于 5月 20, 2018

kill_ioctx() used to have an explicit RCU delay between removing the
reference from ->ioctx_table and percpu_ref_kill() dropping the refcount.
At some point that delay had been removed, on the theory that
percpu_ref_kill() itself contained an RCU delay. Unfortunately, that was
the wrong kind of RCU delay and it didn't care about rcu_read_lock() used
by lookup_ioctx(). As the result, we could get ctx freed right under
lookup_ioctx(). Tejun has fixed that in a6d7cff4 ("fs/aio: Add explicit
RCU grace period when freeing kioctx"); however, that fix is not enough.

Suppose io_destroy() from one thread races with e.g. io_setup() from another;
CPU1 removes the reference from current->mm->ioctx_table[...] just as CPU2
has picked it (under rcu_read_lock()). Then CPU1 proceeds to drop the
refcount, getting it to 0 and triggering a call of free_ioctx_users(),
which proceeds to drop the secondary refcount and once that reaches zero
calls free_ioctx_reqs(). That does
INIT_RCU_WORK(&ctx->free_rwork, free_ioctx);
queue_rcu_work(system_wq, &ctx->free_rwork);
and schedules freeing the whole thing after RCU delay.

In the meanwhile CPU2 has gotten around to percpu_ref_get(), bumping the
refcount from 0 to 1 and returned the reference to io_setup().

Tejun's fix (that queue_rcu_work() in there) guarantees that ctx won't get
freed until after percpu_ref_get(). Sure, we'd increment the counter before
ctx can be freed. Now we are out of rcu_read_lock() and there's nothing to
stop freeing of the whole thing. Unfortunately, CPU2 assumes that since it
has grabbed the reference, ctx is *NOT* going away until it gets around to
dropping that reference.

The fix is obvious - use percpu_ref_tryget_live() and treat failure as miss.
It's not costlier than what we currently do in normal case, it's safe to
call since freeing *is* delayed and it closes the race window - either
lookup_ioctx() comes before percpu_ref_kill() (in which case ctx->users
won't reach 0 until the caller of lookup_ioctx() drops it) or lookup_ioctx()
fails, ctx->users is unaffected and caller of lookup_ioctx() doesn't see
the object in question at all.

Cc: stable@kernel.org
Fixes: a6d7cff4 "fs/aio: Add explicit RCU grace period when freeing kioctx"
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

baf10564

ext2: fix a block leak · 5aa1437d

由 Al Viro 提交于 5月 17, 2018

open file, unlink it, then use ioctl(2) to make it immutable or
append only.  Now close it and watch the blocks *not* freed...

Immutable/append-only checks belong in ->setattr().
Note: the bug is old and backport to anything prior to 737f2e93
("ext2: convert to use the new truncate convention") will need
these checks lifted into ext2_setattr().

Cc: stable@kernel.org
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

5aa1437d

nfsd: vfs_mkdir() might succeed leaving dentry negative unhashed · 3819bb0d

由 Al Viro 提交于 5月 11, 2018

That can (and does, on some filesystems) happen - ->mkdir() (and thus
vfs_mkdir()) can legitimately leave its argument negative and just
unhash it, counting upon the lookup to pick the object we'd created
next time we try to look at that name.

Some vfs_mkdir() callers forget about that possibility...
Acked-by: NJ. Bruce Fields <bfields@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

3819bb0d

cachefiles: vfs_mkdir() might succeed leaving dentry negative unhashed · 9c3e9025

由 Al Viro 提交于 5月 10, 2018

That can (and does, on some filesystems) happen - ->mkdir() (and thus
vfs_mkdir()) can legitimately leave its argument negative and just
unhash it, counting upon the lookup to pick the object we'd created
next time we try to look at that name.

Some vfs_mkdir() callers forget about that possibility...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

9c3e9025

unfuck sysfs_mount() · 7b745a4e

由 Al Viro 提交于 5月 14, 2018

new_sb is left uninitialized in case of early failures in kernfs_mount_ns(),
and while IS_ERR(root) is true in all such cases, using IS_ERR(root) || !new_sb
is not a solution - IS_ERR(root) is true in some cases when new_sb is true.

Make sure new_sb is initialized (and matches the reality) in all cases and
fix the condition for dropping kobj reference - we want it done precisely
in those situations where the reference has not been transferred into a new
super_block instance.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

7b745a4e

kernfs: deal with kernfs_fill_super() failures · 82382ace

由 Al Viro 提交于 4月 03, 2018

make sure that info->node is initialized early, so that kernfs_kill_sb()
can list_del() it safely.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

82382ace

cramfs: Fix IS_ENABLED typo · 08a8f308

由 Joe Perches 提交于 5月 13, 2018

There's an extra C here...

Fixes: 99c18ce5 ("cramfs: direct memory access support")
Acked-by: NNicolas Pitre <nico@linaro.org>
Signed-off-by: NJoe Perches <joe@perches.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

08a8f308

befs_lookup(): use d_splice_alias() · f4e4d434

由 Al Viro 提交于 4月 30, 2018

RTFS(Documentation/filesystems/nfs/Exporting) if you try to make
something exportable.

Fixes: ac632f5b "befs: add NFS export support"
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

f4e4d434

affs_lookup: switch to d_splice_alias() · 87fbd639

由 Al Viro 提交于 5月 06, 2018

Making something exportable takes more than providing ->s_export_ops.
In particular, ->lookup() *MUST* use d_splice_alias() instead of
d_add().

Reading Documentation/filesystems/nfs/Exporting would've been a good idea;
as it is, exporting AFFS is badly (and exploitably) broken.

Partially-Fixes: ed4433d7 "fs/affs: make affs exportable"
Acked-by: NDavid Sterba <dsterba@suse.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

87fbd639

affs_lookup(): close a race with affs_remove_link() · 30da870c

由 Al Viro 提交于 5月 06, 2018

we unlock the directory hash too early - if we are looking at secondary
link and primary (in another directory) gets removed just as we unlock,
we could have the old primary moved in place of the secondary, leaving
us to look into freed entry (and leaving our dentry with ->d_fsdata
pointing to a freed entry).

Cc: stable@vger.kernel.org # 2.4.4+
Acked-by: NDavid Sterba <dsterba@suse.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

30da870c

18 5月, 2018 2 次提交

vfs: namei: use path_equal() in follow_dotdot() · 030c7e0b

由 Danilo Krummrich 提交于 4月 23, 2018

Use path_equal() to detect whether we're already in root.
Signed-off-by: NDanilo Krummrich <danilokrummrich@dk-develop.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

030c7e0b

fs.h: fix outdated comment about file flags · 75abe329

由 Li Qiang 提交于 5月 17, 2018

The __dentry_open function was removed in
commit <2a027e7a>("fold __dentry_open() into its sole caller").
Signed-off-by: NLi Qiang <liq3ea@gmail.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

75abe329

14 5月, 2018 5 次提交

A
__inode_security_revalidate() never gets NULL opt_dentry · e9193288
由 Al Viro 提交于 4月 24, 2018
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
e9193288
A
make xattr_getsecurity() static · 2220c5b0
由 Al Viro 提交于 4月 24, 2018
```
many years overdue...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
2220c5b0

fix breakage caused by d_find_alias() semantics change · b127125d

由 Al Viro 提交于 4月 25, 2018

"VFS: don't keep disconnected dentries on d_anon" had a non-trivial
side-effect - d_unhashed() now returns true for those dentries,
making d_find_alias() skip them altogether.  For most of its callers
that's fine - we really want a connected alias there.  However,
there is a codepath where we relied upon picking such aliases
if nothing else could be found - selinux delayed initialization
of contexts for inodes on already mounted filesystems used to
rely upon that.

Cc: stable@kernel.org # f1ee6162 "VFS: don't keep disconnected dentries on d_anon"
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

b127125d

vfat: simplify checks in vfat_lookup() · f6ddc161

由 Al Viro 提交于 4月 25, 2018

vfat_d_anon_disconn() is called only if alias->d_parent is equal to
dentry->d_parent *and* it returns false unless alias->d_parent == alias.
But in that case alias is the directory we are doing lookup in, and
d_splice_alias() would've done the right thing.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

f6ddc161

get rid of dead code in d_find_alias() · 61fec493

由 Al Viro 提交于 4月 25, 2018

All "try disconnected alias if nothing else fits" logics in d_find_alias()
got accidentally disabled by Neil a while ago; for most of the callers it
was the right thing to do, so fixes belong in few callers that *do* want
disconnected aliases.  This just takes the now-dead code in d_find_alias()
out.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

61fec493

12 5月, 2018 2 次提交

fs: don't scan the inode cache before SB_BORN is set · 79f546a6

由 Dave Chinner 提交于 5月 11, 2018

We recently had an oops reported on a 4.14 kernel in
xfs_reclaim_inodes_count() where sb->s_fs_info pointed to garbage
and so the m_perag_tree lookup walked into lala land.  It produces
an oops down this path during the failed mount:

  radix_tree_gang_lookup_tag+0xc4/0x130
  xfs_perag_get_tag+0x37/0xf0
  xfs_reclaim_inodes_count+0x32/0x40
  xfs_fs_nr_cached_objects+0x11/0x20
  super_cache_count+0x35/0xc0
  shrink_slab.part.66+0xb1/0x370
  shrink_node+0x7e/0x1a0
  try_to_free_pages+0x199/0x470
  __alloc_pages_slowpath+0x3a1/0xd20
  __alloc_pages_nodemask+0x1c3/0x200
  cache_grow_begin+0x20b/0x2e0
  fallback_alloc+0x160/0x200
  kmem_cache_alloc+0x111/0x4e0

The problem is that the superblock shrinker is running before the
filesystem structures it depends on have been fully set up. i.e.
the shrinker is registered in sget(), before ->fill_super() has been
called, and the shrinker can call into the filesystem before
fill_super() does it's setup work. Essentially we are exposed to
both use-after-free and use-before-initialisation bugs here.

To fix this, add a check for the SB_BORN flag in super_cache_count.
In general, this flag is not set until ->fs_mount() completes
successfully, so we know that it is set after the filesystem
setup has completed. This matches the trylock_super() behaviour
which will not let super_cache_scan() run if SB_BORN is not set, and
hence will not allow the superblock shrinker from entering the
filesystem while it is being set up or after it has failed setup
and is being torn down.

Cc: stable@kernel.org
Signed-Off-By: NDave Chinner <dchinner@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

79f546a6

do d_instantiate/unlock_new_inode combinations safely · 1e2e547a

由 Al Viro 提交于 5月 04, 2018

For anything NFS-exported we do _not_ want to unlock new inode
before it has grown an alias; original set of fixes got the
ordering right, but missed the nasty complication in case of
lockdep being enabled - unlock_new_inode() does
	lockdep_annotate_inode_mutex_key(inode)
which can only be done before anyone gets a chance to touch
->i_mutex.  Unfortunately, flipping the order and doing
unlock_new_inode() before d_instantiate() opens a window when
mkdir can race with open-by-fhandle on a guessed fhandle, leading
to multiple aliases for a directory inode and all the breakage
that follows from that.

	Correct solution: a new primitive (d_instantiate_new())
combining these two in the right order - lockdep annotate, then
d_instantiate(), then the rest of unlock_new_inode().  All
combinations of d_instantiate() with unlock_new_inode() should
be converted to that.

Cc: stable@kernel.org	# 2.6.29 and later
Tested-by: NMike Marshall <hubcap@omnibond.com>
Reviewed-by: NAndreas Dilger <adilger@dilger.ca>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

1e2e547a

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功