提交 · 12b9fa6a97b3150477ab182e321be512b59fa899 · xiphi1978 / linux

28 2月, 2016 4 次提交

do_last(): ELOOP failure exit should be done after leaving RCU mode · 5129fa48

由 Al Viro 提交于 2月 27, 2016

... or we risk seeing a bogus value of d_is_symlink() there.

Cc: stable@vger.kernel.org # v4.2+
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

5129fa48

should_follow_link(): validate ->d_seq after having decided to follow · a7f77542

由 Al Viro 提交于 2月 27, 2016

... otherwise d_is_symlink() above might have nothing to do with
the inode value we've got.

Cc: stable@vger.kernel.org # v4.2+
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

a7f77542

namei: ->d_inode of a pinned dentry is stable only for positives · d4565649

由 Al Viro 提交于 2月 27, 2016

both do_last() and walk_component() risk picking a NULL inode out
of dentry about to become positive, *then* checking its flags and
seeing that it's not negative anymore and using (already stale by
then) value they'd fetched earlier.  Usually ends up oopsing soon
after that...

Cc: stable@vger.kernel.org # v3.13+
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

d4565649

do_last(): don't let a bogus return value from ->open() et.al. to confuse us · c80567c8

由 Al Viro 提交于 2月 27, 2016

... into returning a positive to path_openat(), which would interpret that
as "symlink had been encountered" and proceed to corrupt memory, etc.
It can only happen due to a bug in some ->open() instance or in some LSM
hook, etc., so we report any such event *and* make sure it doesn't trick
us into further unpleasantness.

Cc: stable@vger.kernel.org # v3.6+, at least
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

c80567c8

23 1月, 2016 1 次提交

wrappers for ->i_mutex access · 5955102c

由 Al Viro 提交于 1月 22, 2016

parallel to mutex_{lock,unlock,trylock,is_locked,lock_nested},
inode_foo(inode) being mutex_foo(&inode->i_mutex).

Please, use those for access to ->i_mutex; over the coming cycle
->i_mutex will become rwsem, with ->lookup() done with it held
only shared.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

5955102c

09 1月, 2016 1 次提交

nfsd: don't hold i_mutex over userspace upcalls · bbddca8e

由 NeilBrown 提交于 1月 07, 2016

We need information about exports when crossing mountpoints during
lookup or NFSv4 readdir.  If we don't already have that information
cached, we may have to ask (and wait for) rpc.mountd.

In both cases we currently hold the i_mutex on the parent of the
directory we're asking rpc.mountd about.  We've seen situations where
rpc.mountd performs some operation on that directory that tries to take
the i_mutex again, resulting in deadlock.

With some care, we may be able to avoid that in rpc.mountd.  But it
seems better just to avoid holding a mutex while waiting on userspace.

It appears that lookup_one_len is pretty much the only operation that
needs the i_mutex.  So we could just drop the i_mutex elsewhere and do
something like

	mutex_lock()
	lookup_one_len()
	mutex_unlock()

In many cases though the lookup would have been cached and not required
the i_mutex, so it's more efficient to create a lookup_one_len() variant
that only takes the i_mutex when necessary.
Signed-off-by: NNeilBrown <neilb@suse.de>
Signed-off-by: NJ. Bruce Fields <bfields@redhat.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

bbddca8e

04 1月, 2016 1 次提交
- A
  don't carry MAY_OPEN in op->acc_mode · 62fb4a15
  由 Al Viro 提交于 12月 26, 2015
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  62fb4a15
31 12月, 2015 1 次提交
- A
  switch ->get_link() to delayed_call, kill ->put_link() · fceef393
  由 Al Viro 提交于 12月 29, 2015
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  fceef393
09 12月, 2015 3 次提交

teach page_get_link() to work in RCU mode · d3883d4f

由 Al Viro 提交于 11月 17, 2015

more or less along the lines of Neil's patchset, sans the insanity
around kmap().
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

d3883d4f

replace ->follow_link() with new method that could stay in RCU mode · 6b255391

由 Al Viro 提交于 11月 17, 2015

new method: ->get_link(); replacement of ->follow_link().  The differences
are:
	* inode and dentry are passed separately
	* might be called both in RCU and non-RCU mode;
the former is indicated by passing it a NULL dentry.
	* when called that way it isn't allowed to block
and should return ERR_PTR(-ECHILD) if it needs to be called
in non-RCU mode.

It's a flagday change - the old method is gone, all in-tree instances
converted.  Conversion isn't hard; said that, so far very few instances
do not immediately bail out when called in RCU mode.  That'll change
in the next commits.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

6b255391

don't put symlink bodies in pagecache into highmem · 21fc61c7

由 Al Viro 提交于 11月 17, 2015

kmap() in page_follow_link_light() needed to go - allowing to hold
an arbitrary number of kmaps for long is a great way to deadlocking
the system.

new helper (inode_nohighmem(inode)) needs to be used for pagecache
symlinks inodes; done for all in-tree cases.  page_follow_link_light()
instrumented to yell about anything missed.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

21fc61c7

07 12月, 2015 7 次提交

restore_nameidata(): no need to clear now->stack · e1a63bbc

由 Al Viro 提交于 12月 05, 2015

microoptimization: in all callers *now is in the frame we are about to leave.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

e1a63bbc

namei.c: take "jump to root" into a new helper · 248fb5b9

由 Al Viro 提交于 12月 05, 2015

... and use it both in path_init() (for absolute pathnames) and
get_link() (for absolute symlinks).
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

248fb5b9

path_init(): set nd->inode earlier in cwd-relative case · ef55d917

由 Al Viro 提交于 12月 05, 2015

that allows to kill the recheck of nd->seq on the way out in
this case, and this check on the way out is left only for
absolute pathnames.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

ef55d917

A
namei.c: fold set_root_rcu() into set_root() · 9e6697e2
由 Al Viro 提交于 12月 05, 2015
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
9e6697e2
M
typo in fs/namei.c comment · 57e3715c
由 Mike Marshall 提交于 11月 30, 2015
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
57e3715c
A
namei: page_getlink() and page_follow_link_light() are the same thing · aa80deab
由 Al Viro 提交于 11月 16, 2015
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
aa80deab

Don't reset ->total_link_count on nested calls of vfs_path_lookup() · 2788cc47

由 Al Viro 提交于 12月 06, 2015

we already zero it on outermost set_nameidata(), so initialization in
path_init() is pointless and wrong.  The same DoS exists on pre-4.2
kernels, but there a slightly different fix will be needed.

Cc: stable@vger.kernel.org # v4.2
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

2788cc47

07 11月, 2015 1 次提交

mm, fs: introduce mapping_gfp_constraint() · c62d2555

由 Michal Hocko 提交于 11月 06, 2015

There are many places which use mapping_gfp_mask to restrict a more
generic gfp mask which would be used for allocations which are not
directly related to the page cache but they are performed in the same
context.

Let's introduce a helper function which makes the restriction explicit and
easier to track.  This patch doesn't introduce any functional changes.

[akpm@linux-foundation.org: coding-style fixes]
Signed-off-by: NMichal Hocko <mhocko@suse.com>
Suggested-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

c62d2555

28 10月, 2015 1 次提交

namei: permit linking with CAP_FOWNER in userns · f2ca3796

由 Dirk Steinmetz 提交于 10月 20, 2015

Attempting to hardlink to an unsafe file (e.g. a setuid binary) from
within an unprivileged user namespace fails, even if CAP_FOWNER is held
within the namespace. This may cause various failures, such as a gentoo
installation within a lxc container failing to build and install specific
packages.

This change permits hardlinking of files owned by mapped uids, if
CAP_FOWNER is held for that namespace. Furthermore, it improves consistency
by using the existing inode_owner_or_capable(), which is aware of
namespaced capabilities as of 23adbe12 ("fs,userns: Change
inode_capable to capable_wrt_inode_uidgid").
Signed-off-by: NDirk Steinmetz <public@rsjtdrjgfuzkfg.com>

This is hitting us in Ubuntu during some dpkg upgrades in containers.
When upgrading a file dpkg creates a hard link to the old file to back
it up before overwriting it. When packages upgrade suid files owned by a
non-root user the link isn't permitted, and the package upgrade fails.
This patch fixes our problem.
Tested-by: NSeth Forshee <seth.forshee@canonical.com>
Signed-off-by: NEric W. Biederman <ebiederm@xmission.com>

f2ca3796

11 10月, 2015 1 次提交

namei: results of d_is_negative() should be checked after dentry revalidation · daf3761c

由 Trond Myklebust 提交于 10月 09, 2015

Leandro Awa writes:
 "After switching to version 4.1.6, our parallelized and distributed
  workflows now fail consistently with errors of the form:

  T34: ./regex.c:39:22: error: config.h: No such file or directory

  From our 'git bisect' testing, the following commit appears to be the
  possible cause of the behavior we've been seeing: commit 766c4cbf"

Al Viro says:
 "What happens is that 766c4cbf got the things subtly wrong.

  We used to treat d_is_negative() after lookup_fast() as "fall with
  ENOENT".  That was wrong - checking ->d_flags outside of ->d_seq
  protection is unreliable and failing with hard error on what should've
  fallen back to non-RCU pathname resolution is a bug.

  Unfortunately, we'd pulled the test too far up and ran afoul of
  another kind of staleness.  The dentry might have been absolutely
  stable from the RCU point of view (and we might be on UP, etc), but
  stale from the remote fs point of view.  If ->d_revalidate() returns
  "it's actually stale", dentry gets thrown away and the original code
  wouldn't even have looked at its ->d_flags.

  What we need is to check ->d_flags where 766c4cbf does (prior to
  ->d_seq validation) but only use the result in cases where we do not
  discard this dentry outright"
Reported-by: NLeandro Awa <lawa@nvidia.com>
Link: https://bugzilla.kernel.org/show_bug.cgi?id=104911
Fixes: 766c4cbf ("namei: d_is_negative() should be checked...")
Tested-by: NLeandro Awa <lawa@nvidia.com>
Cc: stable@vger.kernel.org # v4.1+
Signed-off-by: NTrond Myklebust <trond.myklebust@primarydata.com>
Acked-by: NAl Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

daf3761c

29 9月, 2015 1 次提交

fs: Drop unlikely before IS_ERR(_OR_NULL) · a1c83681

由 Viresh Kumar 提交于 8月 12, 2015

IS_ERR(_OR_NULL) already contain an 'unlikely' compiler flag and there
is no need to do that again from its callers. Drop it.
Signed-off-by: NViresh Kumar <viresh.kumar@linaro.org>
Reviewed-by: NJeff Layton <jlayton@poochiereds.net>
Reviewed-by: NDavid Howells <dhowells@redhat.com>
Reviewed-by: NSteve French <smfrench@gmail.com>
Signed-off-by: NJiri Kosina <jkosina@suse.cz>

a1c83681

11 9月, 2015 1 次提交

namei: fix warning while make xmldocs caused by namei.c · 2a78b857

由 Masanari Iida 提交于 9月 09, 2015

Fix the following warnings:

Warning(.//fs/namei.c:2422): No description found for parameter 'nd'
Warning(.//fs/namei.c:2422): Excess function parameter 'nameidata'
description in 'path_mountpoint'
Signed-off-by: NMasanari Iida <standby24x7@gmail.com>
Acked-by: NRandy Dunlap <rdunlap@infradead.org>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

2a78b857

21 8月, 2015 1 次提交

vfs: Test for and handle paths that are unreachable from their mnt_root · 397d425d

由 Eric W. Biederman 提交于 8月 15, 2015

In rare cases a directory can be renamed out from under a bind mount.
In those cases without special handling it becomes possible to walk up
the directory tree to the root dentry of the filesystem and down
from the root dentry to every other file or directory on the filesystem.

Like division by zero .. from an unconnected path can not be given
a useful semantic as there is no predicting at which path component
the code will realize it is unconnected.  We certainly can not match
the current behavior as the current behavior is a security hole.

Therefore when encounting .. when following an unconnected path
return -ENOENT.

- Add a function path_connected to verify path->dentry is reachable
  from path->mnt.mnt_root.  AKA to validate that rename did not do
  something nasty to the bind mount.

  To avoid races path_connected must be called after following a path
  component to it's next path component.
Signed-off-by: N"Eric W. Biederman" <ebiederm@xmission.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

397d425d

05 8月, 2015 1 次提交

may_follow_link() should use nd->inode · aa65fa35

由 Al Viro 提交于 8月 04, 2015

Now that we can get there in RCU mode, we shouldn't play with
nd->path.dentry->d_inode - it's not guaranteed to be stable.
Use nd->inode instead.
Reported-by: NHugh Dickins <hughd@google.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

aa65fa35

02 8月, 2015 1 次提交

link_path_walk(): be careful when failing with ENOTDIR · 97242f99

由 Al Viro 提交于 8月 01, 2015

In RCU mode we might end up with dentry evicted just we check
that it's a directory.  In such case we should return ECHILD
rather than ENOTDIR, so that pathwalk would be retries in non-RCU
mode.

Breakage had been introduced in commit b18825a7 - prior to that
we were looking at nd->inode, which had been fetched before
verifying that ->d_seq was still valid.  That form of check
would only be satisfied if at some point the pathname prefix
would indeed have resolved to a non-directory.  The fix consists
of checking ->d_seq after we'd run into a non-directory dentry,
and failing with ECHILD in case of mismatch.

Note that all branches since 3.12 have that problem...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

97242f99

30 6月, 2015 1 次提交

namei: make set_root_rcu() return void · 06d7137e

由 Al Viro 提交于 6月 29, 2015

The only caller that cares about its return value can just
as easily pick it from nd->root_seq itself.  We used to just
calculate it and return to caller, but these days we are
storing it in nd->root_seq in all cases.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

06d7137e

15 5月, 2015 13 次提交

A
turn user_{path_at,path,lpath,path_dir}() into static inlines · b853a161
由 Al Viro 提交于 5月 13, 2015
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
b853a161

namei: move saved_nd pointer into struct nameidata · 9883d185

由 Al Viro 提交于 5月 13, 2015

these guys are always declared next to each other; might as well put
the former (pointer to previous instance) into the latter and simplify
the calling conventions for {set,restore}_nameidata()
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

9883d185

A
inline user_path_create() · 520ae687
由 Al Viro 提交于 5月 13, 2015
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
520ae687
A
inline user_path_parent() · a2ec4a2d
由 Al Viro 提交于 5月 13, 2015
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
a2ec4a2d

namei: trim do_last() arguments · 76ae2a5a

由 Al Viro 提交于 5月 12, 2015

now that struct filename is stashed in nameidata we have no need to
pass it in
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

76ae2a5a

A
namei: stash dfd and name into nameidata · c8a53ee5
由 Al Viro 提交于 5月 12, 2015
```
fewer arguments to pass around...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
c8a53ee5

namei: fold path_cleanup() into terminate_walk() · 102b8af2

由 Al Viro 提交于 5月 12, 2015

they are always called next to each other; moreover,
terminate_walk() is more symmetrical that way.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

102b8af2

namei: saner calling conventions for filename_parentat() · 5c31b6ce

由 Al Viro 提交于 5月 12, 2015

a) make it reject ERR_PTR() for name
b) make it putname(name) on all other failure exits
c) make it return name on success

again, simplifies the callers
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

5c31b6ce

namei: saner calling conventions for filename_create() · 181c37b6

由 Al Viro 提交于 5月 12, 2015

a) make it reject ERR_PTR() for name
b) make it putname(name) upon return in all other cases.

seriously simplifies the callers...
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

181c37b6

A
namei: shift nameidata down into filename_parentat() · 391172c4
由 Al Viro 提交于 5月 09, 2015
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
391172c4
A
namei: make filename_lookup() reject ERR_PTR() passed as name · abc9f5be
由 Al Viro 提交于 5月 12, 2015
```
makes for much easier life in callers
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
abc9f5be

namei: shift nameidata inside filename_lookup() · 9ad1aaa6

由 Al Viro 提交于 5月 12, 2015

pass root instead; non-NULL => copy to nd.root and
set LOOKUP_ROOT in flags
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

9ad1aaa6

A
namei: move putname() call into filename_lookup() · e4bd1c1a
由 Al Viro 提交于 5月 12, 2015
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
e4bd1c1a