提交 · 0feae5c47aabdde59cbbec32d150e17102de37f0 · openeuler / Kernel

23 6月, 2006 4 次提交

[PATCH] Fix dcache race during umount · 0feae5c4

由 NeilBrown 提交于 6月 22, 2006

The race is that the shrink_dcache_memory shrinker could get called while a
filesystem is being unmounted, and could try to prune a dentry belonging to
that filesystem.

If it does, then it will call in to iput on the inode while the dentry is
no longer able to be found by the umounting process.  If iput takes a
while, generic_shutdown_super could get all the way though
shrink_dcache_parent and shrink_dcache_anon and invalidate_inodes without
ever waiting on this particular inode.

Eventually the superblock gets freed anyway and if the iput tried to touch
it (which some filesystems certainly do), it will lose.  The promised
"Self-destruct in 5 seconds" doesn't lead to a nice day.

The race is closed by holding s_umount while calling prune_one_dentry on
someone else's dentry.  As a down_read_trylock is used,
shrink_dcache_memory will no longer try to prune the dentry of a filesystem
that is being unmounted, and unmount will not be able to start until any
such active prune_one_dentry completes.

This requires that prune_dcache *knows* which filesystem (if any) it is
doing the prune on behalf of so that it can be careful of other
filesystems.  shrink_dcache_memory isn't called it on behalf of any
filesystem, and so is careful of everything.

shrink_dcache_anon is now passed a super_block rather than the s_anon list
out of the superblock, so it can get the s_anon list itself, and can pass
the superblock down to prune_dcache.

If prune_dcache finds a dentry that it cannot free, it leaves it where it
is (at the tail of the list) and exits, on the assumption that some other
thread will be removing that dentry soon.  To try to make sure that some
work gets done, a limited number of dnetries which are untouchable are
skipped over while choosing the dentry to work on.

I believe this race was first found by Kirill Korotaev.

Cc: Jan Blunck <jblunck@suse.de>
Acked-by: NKirill Korotaev <dev@openvz.org>
Cc: Olaf Hering <olh@suse.de>
Acked-by: NBalbir Singh <balbir@in.ibm.com>
Signed-off-by: NNeil Brown <neilb@suse.de>
Signed-off-by: NBalbir Singh <balbir@in.ibm.com>
Acked-by: NDavid Howells <dhowells@redhat.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

0feae5c4

[PATCH] remove steal_locks() · c89681ed

由 Miklos Szeredi 提交于 6月 22, 2006

This patch removes the steal_locks() function.

steal_locks() doesn't work correctly with any filesystem that does it's own
lock management, including NFS, CIFS, etc.

In addition it has weird semantics on local filesystems in case tasks
sharing file-descriptor tables are doing POSIX locking operations in
parallel to execve().

The steal_locks() function has an effect on applications doing:

clone(CLONE_FILES)
  /* in child */
  lock
  execve
  lock

POSIX locks acquired before execve (by "child", "parent" or any further
task sharing files_struct) will after the execve be owned exclusively by
"child".

According to Chris Wright some LSB/LTP kind of suite triggers without the
stealing behavior, but there's no known real-world application that would
also fail.

Apps using NPTL are not affected, since all other threads are killed before
execve.

Apps using LinuxThreads are only affected if they

  - have multiple threads during exec (LinuxThreads doesn't kill other
    threads, the app may do it with pthread_kill_other_threads_np())
  - rely on POSIX locks being inherited across exec

Both conditions are documented, but not their interaction.

Apps using clone() natively are affected if they

  - use clone(CLONE_FILES)
  - rely on POSIX locks being inherited across exec

The above scenarios are unlikely, but possible.

If the patch is vetoed, there's a plan B, that involves mostly keeping the
weird stealing semantics, but changing the way lock ownership is handled so
that network and local filesystems work consistently.

That would add more complexity though, so this solution seems to be
preferred by most people.
Signed-off-by: NMiklos Szeredi <miklos@szeredi.hu>
Cc: Trond Myklebust <trond.myklebust@fys.uio.no>
Cc: Matthew Wilcox <willy@debian.org>
Cc: Chris Wright <chrisw@sous-sol.org>
Cc: Christoph Hellwig <hch@lst.de>
Cc: Steven French <sfrench@us.ibm.com>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

c89681ed

[PATCH] Fix a race condition between ->i_mapping and iput() · 09d967c6

由 OGAWA Hirofumi 提交于 6月 22, 2006

This race became a cause of oops, and can reproduce by the following.

    while true; do
	dd if=/dev/zero of=/dev/.static/dev/hdg1 bs=512 count=1000 & sync
    done

This race condition was between __sync_single_inode() and iput().

          cpu0 (fs's inode)                 cpu1 (bdev's inode)
          -----------------                 -------------------
                                       close("/dev/hda2")
                                       [...]
__sync_single_inode()
   /* copy the bdev's ->i_mapping */
   mapping = inode->i_mapping;

                                       generic_forget_inode()
                                          bdev_clear_inode()
					     /* restre the fs's ->i_mapping */
				             inode->i_mapping = &inode->i_data;
				          /* bdev's inode was freed */
                                          destroy_inode(inode);

   if (wait) {
      /* dereference a freed bdev's mapping->host */
      filemap_fdatawait(mapping);  /* Oops */

Since __sync_single_inode() is only taking a ref-count of fs's inode, the
another process can be close() and freeing the bdev's inode while writing
fs's inode.  So, __sync_signle_inode() accesses the freed ->i_mapping,
oops.

This patch takes a ref-count on the bdev's inode for the fs's inode before
setting a ->i_mapping, and the clear_inode() of the fs's inode does iput() on
the bdev's inode.  So if the fs's inode is still living, bdev's inode
shouldn't be freed.
Signed-off-by: NOGAWA Hirofumi <hirofumi@mail.parknet.co.jp>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

09d967c6

[PATCH] NTFS: Critical bug fix (affects MIPS and possibly others) · f893afbe

由 Anton Altaparmakov 提交于 6月 22, 2006

Many thanks to Pauline Ng for the detailed bug report and analysis!
Signed-off-by: NAnton Altaparmakov <aia21@cantab.net>
Cc: <stable@kernel.org>
Signed-off-by: NAndrew Morton <akpm@osdl.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

f893afbe

22 6月, 2006 1 次提交

[PATCH] Driver core: add generic "subsystem" link to all devices · b9d9c82b

由 Kay Sievers 提交于 6月 15, 2006

Like the SUBSYTEM= key we find in the environment of the uevent, this
creates a generic "subsystem" link in sysfs for every device. Userspace
usually doesn't care at all if its a "class" or a "bus" device. This
provides an unified way to determine the subsytem of a device, regardless
of the way the driver core has created it.
Signed-off-by: NKay Sievers <kay.sievers@suse.de>
Signed-off-by: NGreg Kroah-Hartman <gregkh@suse.de>

b9d9c82b

20 6月, 2006 11 次提交

[PATCH] log more info for directory entry change events · 9c937dcc

由 Amy Griffis 提交于 6月 08, 2006

When an audit event involves changes to a directory entry, include
a PATH record for the directory itself.  A few other notable changes:

    - fixed audit_inode_child() hooks in fsnotify_move()
    - removed unused flags arg from audit_inode()
    - added audit log routines for logging a portion of a string

Here's some sample output.

before patch:
type=SYSCALL msg=audit(1149821605.320:26): arch=40000003 syscall=39 success=yes exit=0 a0=bf8d3c7c a1=1ff a2=804e1b8 a3=bf8d3c7c items=1 ppid=739 pid=800 auid=0 uid=0 gid=0 euid=0 suid=0 fsuid=0 egid=0 sgid=0 fsgid=0 tty=ttyS0 comm="mkdir" exe="/bin/mkdir" subj=root:system_r:unconfined_t:s0-s0:c0.c255
type=CWD msg=audit(1149821605.320:26):  cwd="/root"
type=PATH msg=audit(1149821605.320:26): item=0 name="foo" parent=164068 inode=164010 dev=03:00 mode=040755 ouid=0 ogid=0 rdev=00:00 obj=root:object_r:user_home_t:s0

after patch:
type=SYSCALL msg=audit(1149822032.332:24): arch=40000003 syscall=39 success=yes exit=0 a0=bfdd9c7c a1=1ff a2=804e1b8 a3=bfdd9c7c items=2 ppid=714 pid=777 auid=0 uid=0 gid=0 euid=0 suid=0 fsuid=0 egid=0 sgid=0 fsgid=0 tty=ttyS0 comm="mkdir" exe="/bin/mkdir" subj=root:system_r:unconfined_t:s0-s0:c0.c255
type=CWD msg=audit(1149822032.332:24):  cwd="/root"
type=PATH msg=audit(1149822032.332:24): item=0 name="/root" inode=164068 dev=03:00 mode=040750 ouid=0 ogid=0 rdev=00:00 obj=root:object_r:user_home_dir_t:s0
type=PATH msg=audit(1149822032.332:24): item=1 name="foo" inode=164010 dev=03:00 mode=040755 ouid=0 ogid=0 rdev=00:00 obj=root:object_r:user_home_t:s0
Signed-off-by: NAmy Griffis <amy.griffis@hp.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

9c937dcc

A
[PATCH] proc_loginuid_write() uses simple_strtoul() on non-terminated array · e0182909
由 Al Viro 提交于 5月 18, 2006
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
e0182909
A
[PATCH] execve argument logging · 473ae30b
由 Al Viro 提交于 4月 26, 2006
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
473ae30b

[PATCH] inotify (4/5): allow watch removal from event handler · 3ca10067

由 Amy Griffis 提交于 6月 01, 2006

Allow callers to remove watches from their event handler via
inotify_remove_watch_locked().  This functionality can be used to
achieve IN_ONESHOT-like functionality for a subset of events in the
mask.
Signed-off-by: NAmy Griffis <amy.griffis@hp.com>
Acked-by: NRobert Love <rml@novell.com>
Acked-by: NJohn McCutchan <john@johnmccutchan.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

3ca10067

[PATCH] inotify (3/5): add interfaces to kernel API · a9dc971d

由 Amy Griffis 提交于 6月 01, 2006

Add inotify_init_watch() so caller can use inotify_watch refcounts
before calling inotify_add_watch().

Add inotify_find_watch() to find an existing watch for an (ih,inode)
pair.  This is similar to inotify_find_update_watch(), but does not
update the watch's mask if one is found.

Add inotify_rm_watch() to remove a watch via the watch pointer instead
of the watch descriptor.
Signed-off-by: NAmy Griffis <amy.griffis@hp.com>
Acked-by: NRobert Love <rml@novell.com>
Acked-by: NJohn McCutchan <john@johnmccutchan.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

a9dc971d

[PATCH] inotify (2/5): add name's inode to event handler · 7c297722

由 Amy Griffis 提交于 6月 01, 2006

When an inotify event includes a dentry name, also include the inode
associated with that name.
Signed-off-by: NAmy Griffis <amy.griffis@hp.com>
Acked-by: NRobert Love <rml@novell.com>
Acked-by: NJohn McCutchan <john@johnmccutchan.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

7c297722

[PATCH] inotify (1/5): split kernel API from userspace support · 2d9048e2

由 Amy Griffis 提交于 6月 01, 2006

The following series of patches introduces a kernel API for inotify,
making it possible for kernel modules to benefit from inotify's
mechanism for watching inodes.  With these patches, inotify will
maintain for each caller a list of watches (via an embedded struct
inotify_watch), where each inotify_watch is associated with a
corresponding struct inode.  The caller registers an event handler and
specifies for which filesystem events their event handler should be
called per inotify_watch.
Signed-off-by: NAmy Griffis <amy.griffis@hp.com>
Acked-by: NRobert Love <rml@novell.com>
Acked-by: NJohn McCutchan <john@johnmccutchan.com>
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>

2d9048e2

N
[XFS] Remove files from the build that are now unused. · d8ce7532
由 Nathan Scott 提交于 6月 20, 2006
```
Signed-off-by: NNathan Scott <nathans@sgi.com>
```
d8ce7532
N
[XFS] Fix a Makefile issue related to exports.o handling. · d7b849da
由 Nathan Scott 提交于 6月 20, 2006
```
Signed-off-by: NNathan Scott <nathans@sgi.com>
```
d7b849da
N
[XFS] Remove version 1 directory code. Never functioned on Linux, just · f6c2d1fa
由 Nathan Scott 提交于 6月 20, 2006
```
pure bloat.

SGI-PV: 952969
SGI-Modid: xfs-linux-melb:xfs-kern:26251a
Signed-off-by: NNathan Scott <nathans@sgi.com>
```
f6c2d1fa

[XFS] Map EFSCORRUPTED to an actual error code, not just a made up one · da2f4d67

由 Nathan Scott 提交于 6月 20, 2006

(990).	Turns out some ye-olde unices used EUCLEAN as
Filesystem-needs-cleaning, so now we use that too.

SGI-PV: 953954
SGI-Modid: xfs-linux-melb:xfs-kern:26286a
Signed-off-by: NNathan Scott <nathans@sgi.com>

da2f4d67

19 6月, 2006 7 次提交

[XFS] Kill direct access to ->count in valusema(); all we ever use it for · 0d8fee32

由 Al Viro 提交于 6月 19, 2006

is check if semaphore is actually locked, which can be trivially done in
portable way. Code gets more reabable, while we are at it... 

SGI-PV: 953915
SGI-Modid: xfs-linux-melb:xfs-kern:26274a
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
Signed-off-by: NNathan Scott <nathans@sgi.com>

0d8fee32

N
[XFS] Remove unneeded conditional code on NFS export interface related · a805bad5
由 Nathan Scott 提交于 6月 19, 2006
```
code paths.

SGI-PV: 904196
SGI-Modid: xfs-linux-melb:xfs-kern:26250a
Signed-off-by: NNathan Scott <nathans@sgi.com>
```
a805bad5
N
[XFS] Remove an incorrect use of unlikely() on a relatively likely code · 6fe90e6d
由 Nathan Scott 提交于 6月 19, 2006
```
path.

SGI-PV: 904196
SGI-Modid: xfs-linux-melb:xfs-kern:26249a
Signed-off-by: NNathan Scott <nathans@sgi.com>
```
6fe90e6d
N
[XFS] Push some common code out of write path into core XFS code for · 1e69dd0e
由 Nathan Scott 提交于 6月 19, 2006
```
sharing.

SGI-PV: 904196
SGI-Modid: xfs-linux-melb:xfs-kern:26248a
Signed-off-by: NNathan Scott <nathans@sgi.com>
```
1e69dd0e
N
[XFS] Remove unnecessary local from open_exec dmapi path. · 1d47bec2
由 Nathan Scott 提交于 6月 19, 2006
```
SGI-PV: 904196
SGI-Modid: xfs-linux-melb:xfs-kern:26247a
Signed-off-by: NNathan Scott <nathans@sgi.com>
```
1d47bec2

[JFFS2] Check CRC32 on dirent and data nodes each time they're read · 1046d880

由 David Woodhouse 提交于 6月 18, 2006

Also, make sure dirents are marked REF_UNCHECKED when we 'discover' them
through eraseblock summary.
Signed-off-by: NDavid Woodhouse <dwmw2@infradead.org>

1046d880

[JFFS2] When retiring nextblock, allocate a node_ref for the wasted space · fc6612f6

由 David Woodhouse 提交于 6月 18, 2006

Failing to do so makes the calculated length of the last node incorrect,
when we're not using eraseblock summaries.
Signed-off-by: NDavid Woodhouse <dwmw2@infradead.org>

fc6612f6

18 6月, 2006 3 次提交

D
[JFFS2] Mark XATTR support as experimental, for now · 2ba72cb7
由 David Woodhouse 提交于 6月 18, 2006
```
Signed-off-by: NDavid Woodhouse <dwmw2@infradead.org>
```
2ba72cb7

[JFFS2] Don't trust node headers before the CRC is checked. · 3877f0b6

由 David Woodhouse 提交于 6月 18, 2006

Especially when summary code is used, we can have in-memory data
structures referencing certain nodes without them actually being readable
on the flash. Discard the nodes gracefully in that case, rather than
triggering a BUG().
Signed-off-by: NDavid Woodhouse <dwmw2@infradead.org>

3877f0b6

[PATCH] Fix missing ret assignment in __bio_map_user() error path · 99172157

由 Jens Axboe 提交于 6月 16, 2006

If get_user_pages() returns less pages than what we asked for, we jump
to out_unmap which will return ERR_PTR(ret).  But ret can contain a
positive number just smaller than local_nr_pages, so be sure to set it
to -EFAULT always.

Problem found and diagnosed by Damien Le Moal <damien@sdl.hitachi.co.jp>
Signed-off-by: NJens Axboe <axboe@suse.de>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

99172157

14 6月, 2006 1 次提交

[PATCH] Return error in case flock_lock_file failure · 9cedc194

由 Kirill Korotaev 提交于 6月 14, 2006

If flock_lock_file() failed to allocate flock with locks_alloc_lock()
then "error = 0" is returned. Need to return some non-zero.
Signed-off-by: NPavel Emelianov <xemul@openvz.org>
Signed-off-by: NKirill Korotaev <dev@openvz.org>
Signed-off-by: NLinus Torvalds <torvalds@osdl.org>

9cedc194

13 6月, 2006 1 次提交
- N
  [XFS] Minor XFS documentation updates. · d7ede1aa
  由 Nathan Scott 提交于 6月 13, 2006
```
Signed-off-by: NNathan Scott <nathans@sgi.com>
```
  d7ede1aa
09 6月, 2006 12 次提交

[JFFS2] Fix more breakage caused by janitorial meddling. · 4ed0156f

由 David Woodhouse 提交于 6月 09, 2006

jffs2_zlib_exit() and free_workspaces() shouldn't be marked __exit because
they get called in the error case from the init functions.
Signed-off-by: NDavid Woodhouse <dwmw2@infradead.org>

4ed0156f

N
[XFS] Fix broken const use inside local suffix_strtoul routine. · b190f113
由 Nathan Scott 提交于 6月 09, 2006
```
SGI-PV: 904196
SGI-Modid: xfs-linux-melb:xfs-kern:26201a
Signed-off-by: NNathan Scott <nathans@sgi.com>
```
b190f113

[XFS] Fix nused counter. It's currently getting set to -1 rather than · 477829ef

由 Mandy Kirkconnell 提交于 6月 09, 2006

getting decremented by 1.  Since nused never reaches 0, the "if
(!free->hdr.nused)" check in xfs_dir2_leafn_remove() fails every time and
xfs_dir2_shrink_inode() doesn't get called when it should.  This causes
extra blocks to be left on an empty directory and the directory in unable
to be converted back to inline extent mode.

SGI-PV: 951958
SGI-Modid: xfs-linux-melb:xfs-kern:211382a
Signed-off-by: NMandy Kirkconnell <alkirkco@sgi.com>
Signed-off-by: NNathan Scott <nathans@sgi.com>

477829ef

N
[XFS] Fix mismerge of the fs_writable cleanup patch causing a freeze/thaw · 421ad134
由 Nathan Scott 提交于 6月 09, 2006
```
test hang.

SGI-PV: 953563
SGI-Modid: xfs-linux-melb:xfs-kern:26182a
Signed-off-by: NNathan Scott <nathans@sgi.com>
```
421ad134

[XFS] Fix up debug code so that bulkstat wont generate thousands of · 4d1a2ed3

由 Nathan Scott 提交于 6月 09, 2006

fsstress warnings.

SGI-PV: 904196
SGI-Modid: xfs-linux-melb:xfs-kern:26111a
Signed-off-by: NNathan Scott <nathans@sgi.com>

4d1a2ed3

N
[XFS] Remove unused parameter from di2xflags routine. · a916e2bd
由 Nathan Scott 提交于 6月 09, 2006
```
SGI-PV: 904192
SGI-Modid: xfs-linux-melb:xfs-kern:26110a
Signed-off-by: NNathan Scott <nathans@sgi.com>
```
a916e2bd
N
[XFS] Cleanup a missed porting conversion, and freezing. · 34327e13
由 Nathan Scott 提交于 6月 09, 2006
```
SGI-PV: 953338
SGI-Modid: xfs-linux-melb:xfs-kern:26109a
Signed-off-by: NNathan Scott <nathans@sgi.com>
```
34327e13
N
[XFS] Resolve a namespace collision on remaining vtypes for FreeBSD · 8285fb58
由 Nathan Scott 提交于 6月 09, 2006
```
porters.

SGI-PV: 953338
SGI-Modid: xfs-linux-melb:xfs-kern:26108a
Signed-off-by: NNathan Scott <nathans@sgi.com>
```
8285fb58
N
[XFS] Resolve a namespace collision on vnode/vnodeops for FreeBSD porters. · 67fcaa73
由 Nathan Scott 提交于 6月 09, 2006
```
SGI-PV: 953338
SGI-Modid: xfs-linux-melb:xfs-kern:26107a
Signed-off-by: NNathan Scott <nathans@sgi.com>
```
67fcaa73
N
[XFS] Resolve a namespace collision on vfs/vfsops for FreeBSD porters. · b83bd138
由 Nathan Scott 提交于 6月 09, 2006
```
SGI-PV: 9533338
SGI-Modid: xfs-linux-melb:xfs-kern:26106a
Signed-off-by: NNathan Scott <nathans@sgi.com>
```
b83bd138

[XFS] statvfs component of directory/project quota support, code · 932f2c32

由 Nathan Scott 提交于 6月 09, 2006

originally by Glen.

SGI-PV: 932952
SGI-Modid: xfs-linux-melb:xfs-kern:26105a
Signed-off-by: NNathan Scott <nathans@sgi.com>

932f2c32

N
[XFS] Portability changes: remove prdev, stick to one diagnostic · b6574520
由 Nathan Scott 提交于 6月 09, 2006
```
interface.

SGI-PV: 953338
SGI-Modid: xfs-linux-melb:xfs-kern:26103a
Signed-off-by: NNathan Scott <nathans@sgi.com>
```
b6574520

openeuler / Kernel 11 个月 前同步成功

openeuler / Kernel
11 个月前同步成功