提交 · aa263c43fee300af018aec437dbd64570ec65825 · gsplhtlxg / clone-Linux

20 2月, 2016 1 次提交

fs/pnode.c: treat zero mnt_group_id-s as unequal · 7ae8fd03

由 Maxim Patlasov 提交于 2月 16, 2016

propagate_one(m) calculates "type" argument for copy_tree() like this:

>    if (m->mnt_group_id == last_dest->mnt_group_id) {
>        type = CL_MAKE_SHARED;
>    } else {
>        type = CL_SLAVE;
>        if (IS_MNT_SHARED(m))
>           type |= CL_MAKE_SHARED;
>   }

The "type" argument then governs clone_mnt() behavior with respect to flags
and mnt_master of new mount. When we iterate through a slave group, it is
possible that both current "m" and "last_dest" are not shared (although,
both are slaves, i.e. have non-NULL mnt_master-s). Then the comparison
above erroneously makes new mount shared and sets its mnt_master to
last_source->mnt_master. The patch fixes the problem by handling zero
mnt_group_id-s as though they are unequal.

The similar problem exists in the implementation of "else" clause above
when we have to ascend upward in the master/slave tree by calling:

>    last_source = last_source->mnt_master;
>    last_dest = last_source->mnt_parent;

proper number of times. The last step is governed by
"n->mnt_group_id != last_dest->mnt_group_id" condition that may lie if
both are zero. The patch fixes this case in the same way as the former one.

[AV: don't open-code an obvious helper...]
Signed-off-by: NMaxim Patlasov <mpatlasov@virtuozzo.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

7ae8fd03

03 4月, 2015 5 次提交

mnt: Don't propagate unmounts to locked mounts · 0c56fe31

由 Eric W. Biederman 提交于 1月 05, 2015

If the first mount in shared subtree is locked don't unmount the
shared subtree.

This is ensured by walking through the mounts parents before children
and marking a mount as unmountable if it is not locked or it is locked
but it's parent is marked.

This allows recursive mount detach to propagate through a set of
mounts when unmounting them would not reveal what is under any locked
mount.

Cc: stable@vger.kernel.org
Signed-off-by: N"Eric W. Biederman" <ebiederm@xmission.com>

0c56fe31

mnt: On an unmount propagate clearing of MNT_LOCKED · 5d88457e

由 Eric W. Biederman 提交于 1月 03, 2015

A prerequisite of calling umount_tree is that the point where the tree
is mounted at is valid to unmount.

If we are propagating the effect of the unmount clear MNT_LOCKED in
every instance where the same filesystem is mounted on the same
mountpoint in the mount tree, as we know (by virtue of the fact
that umount_tree was called) that it is safe to reveal what
is at that mountpoint.

Cc: stable@vger.kernel.org
Signed-off-by: N"Eric W. Biederman" <ebiederm@xmission.com>

5d88457e

mnt: Delay removal from the mount hash. · 411a938b

由 Eric W. Biederman 提交于 12月 22, 2014

- Modify __lookup_mnt_hash_last to ignore mounts that have MNT_UMOUNTED set.
- Don't remove mounts from the mount hash table in propogate_umount
- Don't remove mounts from the mount hash table in umount_tree before
  the entire list of mounts to be umounted is selected.
- Remove mounts from the mount hash table as the last thing that
  happens in the case where a mount has a parent in umount_tree.
  Mounts without parents are not hashed (by definition).

This paves the way for delaying removal from the mount hash table even
farther and fixing the MNT_LOCKED vs MNT_DETACH issue.

Cc: stable@vger.kernel.org
Signed-off-by: N"Eric W. Biederman" <ebiederm@xmission.com>

411a938b

mnt: Add MNT_UMOUNT flag · 590ce4bc

由 Eric W. Biederman 提交于 12月 22, 2014

In some instances it is necessary to know if the the unmounting
process has begun on a mount.  Add MNT_UMOUNT to make that reliably
testable.

This fix gets used in fixing locked mounts in MNT_DETACH

Cc: stable@vger.kernel.org
Signed-off-by: N"Eric W. Biederman" <ebiederm@xmission.com>

590ce4bc

mnt: In umount_tree reuse mnt_list instead of mnt_hash · c003b26f

由 Eric W. Biederman 提交于 12月 18, 2014

umount_tree builds a list of mounts that need to be unmounted.
Utilize mnt_list for this purpose instead of mnt_hash.  This begins to
allow keeping a mount on the mnt_hash after it is unmounted, which is
necessary for a properly functioning MNT_LOCKED implementation.

The fact that mnt_list is an ordinary list makding available list_move
is nice bonus.

Cc: stable@vger.kernel.org
Signed-off-by: N"Eric W. Biederman" <ebiederm@xmission.com>

c003b26f

03 12月, 2014 1 次提交

mnt: Move the clear of MNT_LOCKED from copy_tree to it's callers. · 8486a788

由 Eric W. Biederman 提交于 10月 07, 2014

Clear MNT_LOCKED in the callers of copy_tree except copy_mnt_ns, and
collect_mounts.  In copy_mnt_ns it is necessary to create an exact
copy of a mount tree, so not clearing MNT_LOCKED is important.
Similarly collect_mounts is used to take a snapshot of the mount tree
for audit logging purposes and auditing using a faithful copy of the
tree is important.

This becomes particularly significant when we start setting MNT_LOCKED
on rootfs to prevent it from being unmounted.
Signed-off-by: N"Eric W. Biederman" <ebiederm@xmission.com>

8486a788

31 8月, 2014 1 次提交

get rid of propagate_umount() mistakenly treating slaves as busy. · 88b368f2

由 Al Viro 提交于 8月 18, 2014

The check in __propagate_umount() ("has somebody explicitly mounted
something on that slave?") is done *before* taking the already doomed
victims out of the child lists.

Cc: stable@vger.kernel.org
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

88b368f2

02 4月, 2014 1 次提交

smarter propagate_mnt() · f2ebb3a9

由 Al Viro 提交于 2月 27, 2014

The current mainline has copies propagated to *all* nodes, then
tears down the copies we made for nodes that do not contain
counterparts of the desired mountpoint.  That sets the right
propagation graph for the copies (at teardown time we move
the slaves of removed node to a surviving peer or directly
to master), but we end up paying a fairly steep price in
useless allocations.  It's fairly easy to create a situation
where N calls of mount(2) create exactly N bindings, with
O(N^2) vfsmounts allocated and freed in process.

Fortunately, it is possible to avoid those allocations/freeings.
The trick is to create copies in the right order and find which
one would've eventually become a master with the current algorithm.
It turns out to be possible in O(nodes getting propagation) time
and with no extra allocations at all.

One part is that we need to make sure that eventual master will be
created before its slaves, so we need to walk the propagation
tree in a different order - by peer groups.  And iterate through
the peers before dealing with the next group.

Another thing is finding the (earlier) copy that will be a master
of one we are about to create; to do that we are (temporary) marking
the masters of mountpoints we are attaching the copies to.

Either we are in a peer of the last mountpoint we'd dealt with,
or we have the following situation: we are attaching to mountpoint M,
the last copy S_0 had been attached to M_0 and there are sequences
S_0...S_n, M_0...M_n such that S_{i+1} is a master of S_{i},
S_{i} mounted on M{i} and we need to create a slave of the first S_{k}
such that M is getting propagation from M_{k}.  It means that the master
of M_{k} will be among the sequence of masters of M.  On the
other hand, the nearest marked node in that sequence will either
be the master of M_{k} or the master of M_{k-1} (the latter -
in the case if M_{k-1} is a slave of something M gets propagation
from, but in a wrong peer group).

So we go through the sequence of masters of M until we find
a marked one (P).  Let N be the one before it.  Then we go through
the sequence of masters of S_0 until we find one (say, S) mounted
on a node D that has P as master and check if D is a peer of N.
If it is, S will be the master of new copy, if not - the master of S
will be.

That's it for the hard part; the rest is fairly simple.  Iterator
is in next_group(), handling of one prospective mountpoint is
propagate_one().

It seems to survive all tests and gives a noticably better performance
than the current mainline for setups that are seriously using shared
subtrees.

Cc: stable@vger.kernel.org
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

f2ebb3a9

31 3月, 2014 1 次提交

switch mnt_hash to hlist · 38129a13

由 Al Viro 提交于 3月 20, 2014

fixes RCU bug - walking through hlist is safe in face of element moves,
since it's self-terminating.  Cyclic lists are not - if we end up jumping
to another hash chain, we'll loop infinitely without ever hitting the
original list head.

[fix for dumb braino folded]

Spotted by: Max Kellermann <mk@cm4all.com>
Cc: stable@vger.kernel.org
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

38129a13

25 10月, 2013 3 次提交

split __lookup_mnt() in two functions · 474279dc

由 Al Viro 提交于 10月 01, 2013

Instead of passing the direction as argument (and checking it on every
step through the hash chain), just have separate __lookup_mnt() and
__lookup_mnt_last().  And use the standard iterators...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

474279dc

new helpers: lock_mount_hash/unlock_mount_hash · 719ea2fb

由 Al Viro 提交于 9月 29, 2013

aka br_write_{lock,unlock} of vfsmount_lock.  Inlines in fs/mount.h,
vfsmount_lock extern moved over there as well.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

719ea2fb

A
namespace.c: get rid of mnt_ghosts · aba809cf
由 Al Viro 提交于 9月 28, 2013
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
aba809cf

01 6月, 2013 1 次提交

vfs: Fix invalid ida_remove() call · 5d477b60

由 Takashi Iwai 提交于 5月 10, 2013

When the group id of a shared mount is not allocated, the umount still
tries to call mnt_release_group_id(), which eventually hits a kernel
warning at ida_remove() spewing a message like:
  ida_remove called for id=0 which is not allocated.

This patch fixes the bug simply checking the group id in the caller.
Reported-by: NCristian Rodríguez <crrodriguez@opensuse.org>
Signed-off-by: NTakashi Iwai <tiwai@suse.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

5d477b60

10 4月, 2013 2 次提交
- A
  switch unlock_mount() to namespace_unlock(), convert all umount_tree() callers · 328e6d90
  由 Al Viro 提交于 3月 16, 2013
```
which allows to kill the last argument of umount_tree() and make release_mounts()
static.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  328e6d90
- A
  get rid of full-hash scan on detaching vfsmounts · 84d17192
  由 Al Viro 提交于 3月 15, 2013
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  84d17192
27 3月, 2013 1 次提交

vfs: Carefully propogate mounts across user namespaces · 132c94e3

由 Eric W. Biederman 提交于 3月 22, 2013

As a matter of policy MNT_READONLY should not be changable if the
original mounter had more privileges than creator of the mount
namespace.

Add the flag CL_UNPRIVILEGED to note when we are copying a mount from
a mount namespace that requires more privileges to a mount namespace
that requires fewer privileges.

When the CL_UNPRIVILEGED flag is set cause clone_mnt to set MNT_NO_REMOUNT
if any of the mnt flags that should never be changed are set.

This protects both mount propagation and the initial creation of a less
privileged mount namespace.

Cc: stable@vger.kernel.org
Acked-by: NSerge Hallyn <serge.hallyn@canonical.com>
Reported-by: NAndy Lutomirski <luto@amacapital.net>
Signed-off-by: N"Eric W. Biederman" <ebiederm@xmission.com>

132c94e3

14 7月, 2012 1 次提交

VFS: Make clone_mnt()/copy_tree()/collect_mounts() return errors · be34d1a3

由 David Howells 提交于 6月 25, 2012

copy_tree() can theoretically fail in a case other than ENOMEM, but always
returns NULL which is interpreted by callers as -ENOMEM.  Change it to return
an explicit error.

Also change clone_mnt() for consistency and because union mounts will add new
error cases.

Thanks to Andreas Gruenbacher <agruen@suse.de> for a bug fix.
[AV: folded braino fix by Dan Carpenter]

Original-author: Valerie Aurora <vaurora@redhat.com>
Signed-off-by: NDavid Howells <dhowells@redhat.com>
Cc: Valerie Aurora <valerie.aurora@gmail.com>
Cc: Andreas Gruenbacher <agruen@suse.de>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

be34d1a3

30 5月, 2012 1 次提交

brlocks/lglocks: API cleanups · 962830df

由 Andi Kleen 提交于 5月 08, 2012

lglocks and brlocks are currently generated with some complicated macros
in lglock.h.  But there's no reason to not just use common utility
functions and put all the data into a common data structure.

In preparation, this patch changes the API to look more like normal
function calls with pointers, not magic macros.

The patch is rather large because I move over all users in one go to keep
it bisectable.  This impacts the VFS somewhat in terms of lines changed.
But no actual behaviour change.

[akpm@linux-foundation.org: checkpatch fixes]
Signed-off-by: NAndi Kleen <ak@linux.intel.com>
Cc: Al Viro <viro@zeniv.linux.org.uk>
Cc: Rusty Russell <rusty@rustcorp.com.au>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NRusty Russell <rusty@rustcorp.com.au>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

962830df

04 1月, 2012 21 次提交
- A
  vfs: switch pnode.h macros to struct mount * · fc7be130
  由 Al Viro 提交于 11月 25, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  fc7be130
- A
  vfs: move the rest of int fields to struct mount · 863d684f
  由 Al Viro 提交于 11月 25, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  863d684f
- A
  vfs: mnt_id/mnt_group_id moved · 15169fe7
  由 Al Viro 提交于 11月 25, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  15169fe7
- A
  vfs: mnt_ns moved to struct mount · 143c8c91
  由 Al Viro 提交于 11月 25, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  143c8c91
- A
  vfs: take mnt_share/mnt_slave/mnt_slave_list and mnt_expire to struct mount · 6776db3d
  由 Al Viro 提交于 11月 25, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  6776db3d
- A
  vfs: and now we can make ->mnt_master point to struct mount · 32301920
  由 Al Viro 提交于 11月 25, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  32301920
- A
  vfs: take mnt_master to struct mount · d10e8def
  由 Al Viro 提交于 11月 25, 2011
```
make IS_MNT_SLAVE take struct mount * at the same time
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  d10e8def
- A
  vfs: spread struct mount - remaining argument of mnt_set_mountpoint() · 14cf1fa8
  由 Al Viro 提交于 11月 25, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  14cf1fa8
- A
  vfs: spread struct mount - propagate_mnt() · a8d56d8e
  由 Al Viro 提交于 11月 24, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  a8d56d8e
- A
  vfs: spread struct mount - shared subtree iterators · c937135d
  由 Al Viro 提交于 11月 24, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  c937135d
- A
  vfs: spread struct mount - get_dominating_id / do_make_slave · 6fc7871f
  由 Al Viro 提交于 11月 24, 2011
```
next pile of horrors, similar to mnt_parent one; this time it's
mnt_master.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  6fc7871f
- A
  vfs: take mnt_child/mnt_mounts to struct mount · 6b41d536
  由 Al Viro 提交于 11月 24, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  6b41d536
- A
  vfs: spread struct mount - work with counters · 83adc753
  由 Al Viro 提交于 11月 24, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  83adc753
- A
  vfs: move mnt_mountpoint to struct mount · a73324da
  由 Al Viro 提交于 11月 24, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  a73324da
- A
  vfs: now it can be done - make mnt_parent point to struct mount · 0714a533
  由 Al Viro 提交于 11月 24, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  0714a533
- A
  vfs: mnt_parent moved to struct mount · 3376f34f
  由 Al Viro 提交于 11月 24, 2011
```
the second victim...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  3376f34f
- A
  vfs: spread struct mount - is_path_reachable · 643822b4
  由 Al Viro 提交于 11月 24, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  643822b4
- A
  vfs: spread struct mount - do_umount/propagate_mount_busy · 1ab59738
  由 Al Viro 提交于 11月 24, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  1ab59738
- A
  vfs: spread struct mount mnt_set_mountpoint child argument · 44d964d6
  由 Al Viro 提交于 11月 24, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  44d964d6
- A
  vfs: spread struct mount - clone_mnt/copy_tree argument · 87129cc0
  由 Al Viro 提交于 11月 24, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  87129cc0
- A
  vfs: spread struct mount - umount_tree argument · 761d5c38
  由 Al Viro 提交于 11月 24, 2011
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  761d5c38