提交 · f874e1ac21d7708464dc656a10312542c54719f1 · openanolis / cloud-kernel

28 7月, 2010 21 次提交

inotify: force inotify and fsnotify use same bits · f874e1ac

由 Eric Paris 提交于 7月 28, 2010

inotify uses bits called IN_* and fsnotify uses bits called FS_*.  These
need to line up.  This patch adds build time checks to make sure noone can
change these bits so they are not the same.
Signed-off-by: NEric Paris <eparis@redhat.com>

f874e1ac

inotify: allow users to request not to recieve events on unlinked children · 8c1934c8

由 Eric Paris 提交于 7月 28, 2010

An inotify watch on a directory will send events for children even if those
children have been unlinked.  This patch add a new inotify flag IN_EXCL_UNLINK
which allows a watch to specificy they don't care about unlinked children.
This should fix performance problems seen by tasks which add a watch to
/tmp and then are overrun with events when other processes are reading and
writing to unlinked files they created in /tmp.

https://bugzilla.kernel.org/show_bug.cgi?id=16296Requested-by: NMatthias Clasen <mclasen@redhat.com>
Signed-off-by: NEric Paris <eparis@redhat.com>

8c1934c8

inotify: send IN_UNMOUNT events · 611da04f

由 Eric Paris 提交于 7月 28, 2010

Since the .31 or so notify rewrite inotify has not sent events about
inodes which are unmounted.  This patch restores those events.
Signed-off-by: NEric Paris <eparis@redhat.com>

611da04f

inotify_user.c: make local symbol static · 0a24887a

由 H Hartley Sweeten 提交于 5月 14, 2010

The symbol inotify_max_user_watches is not used outside this
file and should be static.
Signed-off-by: NH Hartley Sweeten <hsweeten@visionengravers.com>
Cc: John McCutchan <john@johnmccutchan.com>
Cc: Robert Love <rlove@rlove.org>
Cc: Eric Paris <eparis@parisplace.org>
Signed-off-by: NEric Paris <eparis@redhat.com>

0a24887a

fsnotify: intoduce a notification merge argument · 6e5f77b3

由 Eric Paris 提交于 12月 17, 2009

Each group can define their own notification (and secondary_q) merge
function. Inotify does tail drop, fanotify does matching and drop which
can actually allocate a completely new event. But for fanotify to properly
deal with permissions events it needs to know the new event which was
ultimately added to the notification queue. This patch just implements a
void ** argument which is passed to the merge function. fanotify can use
this field to pass the new event back to higher layers.
Signed-off-by: NEric Paris <eparis@redhat.com>
for fanotify to properly deal with permissions events

6e5f77b3

fsnotify: allow marks to not pin inodes in core · 90b1e7a5

由 Eric Paris 提交于 12月 17, 2009

inotify marks must pin inodes in core. dnotify doesn't technically need to
since they are closed when the directory is closed. fanotify also need to
pin inodes in core as it works today. But the next step is to introduce
the concept of 'ignored masks' which is actually a mask of events for an
inode of no interest. I claim that these should be liberally sent to the
kernel and should not pin the inode in core. If the inode is brought back
in the listener will get an event it may have thought excluded, but this is
not a serious situation and one any listener should deal with.

This patch lays the ground work for non-pinning inode marks by using lazy
inode pinning. We do not pin a mark until it has a non-zero mask entry. If a
listener new sets a mask we never pin the inode.
Signed-off-by: NEric Paris <eparis@redhat.com>

90b1e7a5

fsnotify: split generic and inode specific mark code · 5444e298

由 Eric Paris 提交于 12月 17, 2009

currently all marking is done by functions in inode-mark.c. Some of this
is pretty generic and should be instead done in a generic function and we
should only put the inode specific code in inode-mark.c
Signed-off-by: NEric Paris <eparis@redhat.com>

5444e298

fsnotify: take inode->i_lock inside fsnotify_find_mark_entry() · 35566087

由 Andreas Gruenbacher 提交于 12月 17, 2009

All callers to fsnotify_find_mark_entry() except one take and
release inode->i_lock around the call.  Take the lock inside
fsnotify_find_mark_entry() instead.
Signed-off-by: NAndreas Gruenbacher <agruen@suse.de>
Signed-off-by: NEric Paris <eparis@redhat.com>

35566087

inotify: rename mark_entry to just mark · 000285de

由 Eric Paris 提交于 12月 17, 2009

rename anything in inotify that deals with mark_entry to just be mark.  It
makes a lot more sense.
Signed-off-by: NEric Paris <eparis@redhat.com>

000285de

E
fsnotify: rename fsnotify_find_mark_entry to fsnotify_find_mark · d0775441
由 Eric Paris 提交于 12月 17, 2009
```
the _entry portion of fsnotify functions is useless.  Drop it.
Signed-off-by: NEric Paris <eparis@redhat.com>
```
d0775441

fsnotify: rename fsnotify_mark_entry to just fsnotify_mark · e61ce867

由 Eric Paris 提交于 12月 17, 2009

The name is long and it serves no real purpose.  So rename
fsnotify_mark_entry to just fsnotify_mark.
Signed-off-by: NEric Paris <eparis@redhat.com>

e61ce867

fsnotify: put inode specific fields in an fsnotify_mark in a union · 2823e04d

由 Eric Paris 提交于 12月 17, 2009

The addition of marks on vfs mounts will be simplified if the inode
specific parts of a mark and the vfsmnt specific parts of a mark are
actually in a union so naming can be easy.  This patch just implements the
inode struct and the union.
Signed-off-by: NEric Paris <eparis@redhat.com>

2823e04d

fsnotify: drop mask argument from fsnotify_alloc_group · 0d2e2a1d

由 Eric Paris 提交于 12月 17, 2009

Nothing uses the mask argument to fsnotify_alloc_group.  This patch drops
that argument.
Signed-off-by: NEric Paris <eparis@redhat.com>

0d2e2a1d

fsnotify: fsnotify_obtain_group should be fsnotify_alloc_group · ffab8340

由 Eric Paris 提交于 12月 17, 2009

fsnotify_obtain_group was intended to be able to find an already existing
group.  Nothing uses that functionality.  This just renames it to
fsnotify_alloc_group so it is clear what it is doing.
Signed-off-by: NEric Paris <eparis@redhat.com>

ffab8340

fsnotify: remove group_num altogether · 74be0cc8

由 Eric Paris 提交于 12月 17, 2009

The original fsnotify interface has a group-num which was intended to be
able to find a group after it was added.  I no longer think this is a
necessary thing to do and so we remove the group_num.
Signed-off-by: NEric Paris <eparis@redhat.com>

74be0cc8

fsnotify: per group notification queue merge types · 74766bbf

由 Eric Paris 提交于 12月 17, 2009

inotify only wishes to merge a new event with the last event on the
notification fifo. fanotify is willing to merge any events including by
means of bitwise OR masks of multiple events together. This patch moves
the inotify event merging logic out of the generic fsnotify notification.c
and into the inotify code. This allows each use of fsnotify to provide
their own merge functionality.
Signed-off-by: NEric Paris <eparis@redhat.com>

74766bbf

inotify: do not spam console without limit · d7f0ce4e

由 Eric Paris 提交于 12月 22, 2009

inotify was supposed to have a dmesg printk ratelimitor which would cause
inotify to only emit one message per boot.  The static bool was never set
so it kept firing messages.  This patch correctly limits warnings in multiple
places.
Signed-off-by: NEric Paris <eparis@redhat.com>

d7f0ce4e

inotify: do not reuse watch descriptors · 7050c488

由 Eric Paris 提交于 12月 17, 2009

Prior to 2.6.31 inotify would not reuse watch descriptors until all of
them had been used at least once. After the rewrite inotify would reuse
watch descriptors. The selinux utility 'restorecond' was found to have
problems when watch descriptors were reused. This patch reverts to the
pre inotify rewrite behavior to not reuse watch descriptors.
Signed-off-by: NEric Paris <eparis@redhat.com>

7050c488

inotify: use container_of instead of casting · 31ddd326

由 Eric Paris 提交于 12月 17, 2009

inotify_free_mark casts directly from an fsnotify_mark_entry to an
inotify_inode_mark_entry.  This works, but should use container_of instead
for future proofing.
Signed-off-by: NEric Paris <eparis@redhat.com>

31ddd326

fsnotify: allow addition of duplicate fsnotify marks · 40554c3d

由 Eric Paris 提交于 12月 17, 2009

This patch allows a task to add a second fsnotify mark to an inode for the
same group. This mark will be added to the end of the inode's list and
this will never be found by the stand fsnotify_find_mark() function. This
is useful if a user wants to add a new mark before removing the old one.
Signed-off-by: NEric Paris <eparis@redhat.com>

40554c3d

inotify: simplify the inotify idr handling · b7ba8371

由 Eric Paris 提交于 12月 17, 2009

This patch moves all of the idr editing operations into their own idr
functions.  It makes it easier to prove locking correctness and to to
understand the code flow.
Signed-off-by: NEric Paris <eparis@redhat.com>

b7ba8371

14 5月, 2010 2 次提交

inotify: race use after free/double free in inotify inode marks · e0873344

由 Eric Paris 提交于 5月 11, 2010

There is a race in the inotify add/rm watch code.  A task can find and
remove a mark which doesn't have all of it's references.  This can
result in a use after free/double free situation.

Task A					Task B
------------				-----------
inotify_new_watch()
 allocate a mark (refcnt == 1)
 add it to the idr
					inotify_rm_watch()
					 inotify_remove_from_idr()
					  fsnotify_put_mark()
					      refcnt hits 0, free
 take reference because we are on idr
 [at this point it is a use after free]
 [time goes on]
 refcnt may hit 0 again, double free

The fix is to take the reference BEFORE the object can be found in the
idr.
Signed-off-by: NEric Paris <eparis@redhat.com>
Cc: <stable@kernel.org>

e0873344

inotify: clean up the inotify_add_watch out path · 3dbc6fb6

由 Eric Paris 提交于 5月 11, 2010

inotify_add_watch explictly frees the unused inode mark, but it can just
use the generic code.  Just do that.
Signed-off-by: NEric Paris <eparis@redhat.com>

3dbc6fb6

19 2月, 2010 1 次提交
- A
  switch inotify_user to anon_inode · c44dcc56
  由 Al Viro 提交于 2月 11, 2010
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  c44dcc56
16 1月, 2010 1 次提交

inotify: do not reuse watch descriptors · 9e572cc9

由 Eric Paris 提交于 1月 15, 2010

Since commit 7e790dd5 ("inotify: fix
error paths in inotify_update_watch") inotify changed the manor in which
it gave watch descriptors back to userspace.  Previous to this commit
inotify acted like the following:

  inotify_add_watch(X, Y, Z) = 1
  inotify_rm_watch(X, 1);
  inotify_add_watch(X, Y, Z) = 2

but after this patch inotify would return watch descriptors like so:

  inotify_add_watch(X, Y, Z) = 1
  inotify_rm_watch(X, 1);
  inotify_add_watch(X, Y, Z) = 1

which I saw as equivalent to opening an fd where

  open(file) = 1;
  close(1);
  open(file) = 1;

seemed perfectly reasonable.  The issue is that quite a bit of userspace
apparently relies on the behavior in which watch descriptors will not be
quickly reused.  KDE relies on it, I know some selinux packages rely on
it, and I have heard complaints from other random sources such as debian
bug 558981.

Although the man page implies what we do is ok, we broke userspace so
this patch almost reverts us to the old behavior.  It is still slightly
racey and I have patches that would fix that, but they are rather large
and this will fix it for all real world cases.  The race is as follows:

 - task1 creates a watch and blocks in idr_new_watch() before it updates
   the hint.
 - task2 creates a watch and updates the hint.
 - task1 updates the hint with it's older wd
 - task removes the watch created by task2
 - task adds a new watch and will reuse the wd originally given to task2

it requires moving some locking around the hint (last_wd) but this should
solve it for the real world and be -stable safe.

As a side effect this patch papers over a bug in the lib/idr code which
is causing a large number WARN's to pop on people's system and many
reports in kerneloops.org.  I'm working on the root cause of that idr
bug seperately but this should make inotify immune to that issue.
Signed-off-by: NEric Paris <eparis@redhat.com>
Cc: stable@kernel.org
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

9e572cc9

17 12月, 2009 2 次提交

switch alloc_file() to passing struct path · 2c48b9c4

由 Al Viro 提交于 8月 09, 2009

... and have the caller grab both mnt and dentry; kill
leak in infiniband, while we are at it.
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

2c48b9c4

A
switched inotify_init1() to alloc_file() · 825f9692
由 Al Viro 提交于 8月 05, 2009
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
825f9692

04 12月, 2009 1 次提交
- G
  inotify: remove superfluous return code check · 336e8683
  由 Giuseppe Scrivano 提交于 12月 03, 2009
```
Signed-off-by: NGiuseppe Scrivano <gscrivano@gnu.org>
Signed-off-by: NJiri Kosina <jkosina@suse.cz>
```
  336e8683
19 11月, 2009 1 次提交

sysctl: Drop & in front of every proc_handler. · 6d456111

由 Eric W. Biederman 提交于 11月 16, 2009

For consistency drop & in front of every proc_handler.  Explicity
taking the address is unnecessary and it prevents optimizations
like stubbing the proc_handlers to NULL.

Cc: Alexey Dobriyan <adobriyan@gmail.com>
Cc: Ingo Molnar <mingo@elte.hu>
Cc: Joe Perches <joe@perches.com>
Signed-off-by: NEric W. Biederman <ebiederm@xmission.com>

6d456111

12 11月, 2009 1 次提交

sysctl fs: Remove dead binary sysctl support · ab09203e

由 Eric W. Biederman 提交于 11月 05, 2009

Now that sys_sysctl is a generic wrapper around /proc/sys  .ctl_name
and .strategy members of sysctl tables are dead code.  Remove them.

Cc: Jan Harkes <jaharkes@cs.cmu.edu>
Signed-off-by: NEric W. Biederman <ebiederm@xmission.com>

ab09203e

29 8月, 2009 1 次提交

inotify: update the group mask on mark addition · 750a8870

由 Eric Paris 提交于 8月 28, 2009

Seperating the addition and update of marks in inotify resulted in a
regression in that inotify never gets events. The inotify group mask is
always 0. This mask should be updated any time a new mark is added.
Signed-off-by: NEric Paris <eparis@redhat.com>

750a8870

28 8月, 2009 2 次提交

inotify: fix length reporting and size checking · 83cb10f0

由 Eric Paris 提交于 8月 28, 2009

0db501bd introduced a regresion in that it now sends a nul
terminator but the length accounting when checking for space or
reporting to userspace did not take this into account.  This corrects
all of the rounding logic.
Signed-off-by: NEric Paris <eparis@redhat.com>

83cb10f0

inotify: do not send a block of zeros when no pathname is available · b962e731

由 Brian Rogers 提交于 8月 28, 2009

When an event has no pathname, there's no need to pad it with a null byte and
therefore generate an inotify_event sized block of zeros. This fixes a
regression introduced by commit 0db501bd where
my system wouldn't finish booting because some process was being confused by
this.
Signed-off-by: NBrian Rogers <brian@xyzw.org>
Signed-off-by: NEric Paris <eparis@redhat.com>

b962e731

27 8月, 2009 3 次提交

inotify: Ensure we alwasy write the terminating NULL. · 0db501bd

由 Eric W. Biederman 提交于 8月 27, 2009

Before the rewrite copy_event_to_user always wrote a terqminating '\0'
byte to user space after the filename.  Since the rewrite that
terminating byte was skipped if your filename is exactly a multiple of
event_size.  Ouch!

So add one byte to name_size before we round up and use clear_user to
set userspace to zero like /dev/zero does instead of copying the
strange nul_inotify_event.  I can't quite convince myself len_to_zero
will never exceed 16 and even if it doesn't clear_user should be more
efficient and a more accurate reflection of what the code is trying to
do.
Signed-off-by: NEric W. Biederman <ebiederm@aristanetworks.com>
Signed-off-by: NEric Paris <eparis@redhat.com>

0db501bd

inotify: fix locking around inotify watching in the idr · dead537d

由 Eric Paris 提交于 8月 24, 2009

The are races around the idr storage of inotify watches. It's possible
that a watch could be found from sys_inotify_rm_watch() in the idr, but it
could be removed from the idr before that code does it's removal. Move the
locking and the refcnt'ing so that these have to happen atomically.
Signed-off-by: NEric Paris <eparis@redhat.com>

dead537d

inotify: seperate new watch creation updating existing watches · 52cef755

由 Eric Paris 提交于 8月 24, 2009

There is nothing known wrong with the inotify watch addition/modification
but this patch seperates the two code paths to make them each easy to
verify as correct.
Signed-off-by: NEric Paris <eparis@redhat.com>

52cef755

18 8月, 2009 2 次提交

inotify: start watch descriptor count at 1 · 08e53fcb

由 Eric Paris 提交于 8月 16, 2009

The inotify_add_watch man page specifies that inotify_add_watch() will
return a non-negative integer.  However, historically the inotify
watches started at 1, not at 0.

Turns out that the inotifywait program provided by the inotify-tools
package doesn't properly handle a 0 watch descriptor.  In 7e790dd5 we
changed from starting at 1 to starting at 0.  This patch starts at 1,
just like in previous kernels, but also just like in previous kernels
it's possible for it to wrap back to 0.  This preserves the kernel
functionality exactly like it was before the patch (neither method broke
the spec)
Signed-off-by: NEric Paris <eparis@redhat.com>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

08e53fcb

notify: unused event private race · eef3a116

由 Eric Paris 提交于 8月 16, 2009

inotify decides if private data it passed to get added to an event was
used by checking list_empty().  But it's possible that the event may
have been dequeued and the private event removed so it would look empty.

The fix is to use the return code from fsnotify_add_notify_event rather
than looking at the list.
Signed-off-by: NEric Paris <eparis@redhat.com>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

eef3a116

22 7月, 2009 2 次提交

inotify: use GFP_NOFS under potential memory pressure · f44aebcc

由 Eric Paris 提交于 7月 15, 2009

inotify can have a watchs removed under filesystem reclaim.

=================================
[ INFO: inconsistent lock state ]
2.6.31-rc2 #16
---------------------------------
inconsistent {IN-RECLAIM_FS-W} -> {RECLAIM_FS-ON-W} usage.
khubd/217 [HC0[0]:SC0[0]:HE1:SE1] takes:
 (iprune_mutex){+.+.?.}, at: [<c10ba899>] invalidate_inodes+0x20/0xe3
{IN-RECLAIM_FS-W} state was registered at:
  [<c10536ab>] __lock_acquire+0x2c9/0xac4
  [<c1053f45>] lock_acquire+0x9f/0xc2
  [<c1308872>] __mutex_lock_common+0x2d/0x323
  [<c1308c00>] mutex_lock_nested+0x2e/0x36
  [<c10ba6ff>] shrink_icache_memory+0x38/0x1b2
  [<c108bfb6>] shrink_slab+0xe2/0x13c
  [<c108c3e1>] kswapd+0x3d1/0x55d
  [<c10449b5>] kthread+0x66/0x6b
  [<c1003fdf>] kernel_thread_helper+0x7/0x10
  [<ffffffff>] 0xffffffff

Two things are needed to fix this.  First we need a method to tell
fsnotify_create_event() to use GFP_NOFS and second we need to stop using
one global IN_IGNORED event and allocate them one at a time.  This solves
current issues with multiple IN_IGNORED on a queue having tail drop
problems and simplifies the allocations since we don't have to worry about
two tasks opperating on the IGNORED event concurrently.
Signed-off-by: NEric Paris <eparis@redhat.com>

f44aebcc

inotify: fix error paths in inotify_update_watch · 7e790dd5

由 Eric Paris 提交于 7月 07, 2009

inotify_update_watch could leave things in a horrid state on a number of
error paths. We could try to remove idr entries that didn't exist, we
could send an IN_IGNORED to userspace for watches that don't exist, and a
bit of other stupidity. Clean these up by doing the idr addition before we
put the mark on the inode since we can clean that up on error and getting
off the inode's mark list is hard.
Signed-off-by: NEric Paris <eparis@redhat.com>

7e790dd5

openanolis / cloud-kernel 12 个月 前同步成功

openanolis / cloud-kernel
12 个月前同步成功