Commits · 294f2ad5a545eb71d397623743ddd8201131bdad · openeuler / raspberrypi-kernel

18 Jul, 2012 1 commit

GFS2: kernel panic with small gfs2 filesystems - 1 RG · 294f2ad5

Authored by Abhijith Das on Jul 18, 2012

In the unlikely setup where there's only one resource group in the gfs2
filesystem, gfs2_rgrpd_get_next() returns a NULL rgd that is not dealt with
properly, causing a kernel NULL ptr dereference. This patch fixes this issue.
Signed-off-by: Abhi Das <adas@redhat.com>
Signed-off-by: Steven Whitehouse <swhiteho@redhat.com>
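
The failure mode described above can be sketched in user-space C. This is a minimal illustration, not the GFS2 code: `struct rgrp`, `rgrp_get_next()` and `rgrp_advance()` are hypothetical names, and the sketch assumes the lookup returns NULL when the "next" group turns out to be the one passed in, which is what happens when only one group exists.

```c
#include <stddef.h>

/* Hypothetical stand-in for a resource group on a circular list. */
struct rgrp {
    int id;
    struct rgrp *next;
};

/* Assumed behaviour: NULL when wrapping around lands on the same group. */
static struct rgrp *rgrp_get_next(struct rgrp *rgd)
{
    return rgd->next == rgd ? NULL : rgd->next;
}

/* Caller-side guard: stay on the current group when there is no other one. */
static struct rgrp *rgrp_advance(struct rgrp *rgd)
{
    struct rgrp *next = rgrp_get_next(rgd);

    return next ? next : rgd;
}
```

With a single group, `rgrp_advance()` hands back the same group instead of a NULL pointer that a later dereference would trip over.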

294f2ad5

28 Jun, 2012 1 commit

GFS2: Fixing double brelse'ing bh allocated in gfs2_meta_read when EIO occurs · 44b8db13

Authored by Masatake YAMATO on Jun 18, 2012

This patch fixes a buffer_head double free in the following code path:

gfs2_block_map
=> gfs2_meta_inode_buffer
 => gfs2_meta_indirect_buffer
  => gfs2_meta_read
=> release_metapath

gfs2_block_map calls gfs2_meta_inode_buffer with &mp.mp_bh[0]
as an argument. mp.mp_bh is zero-filled at the beginning
of gfs2_block_map.

If gfs2_meta_inode_buffer returns a non-zero value, gfs2_block_map
calls release_metapath to free the buffers chained to mp.mp_bh.
release_metapath checks each slot mp.mp_bh[i] and frees it
(with brelse) unless the slot is NULL.

The slot &mp.mp_bh[0] passed to gfs2_meta_inode_buffer is filled in by
gfs2_meta_read. gfs2_meta_read stores a buffer allocated with
gfs2_getbuf there even if EIO occurs. When EIO occurs, the allocated buffer
is brelse'ed, but the stale pointer to the brelse'ed buffer is still
passed back to the caller via the argument bhp.

gfs2_meta_indirect_buffer, the caller, also passes the stale pointer
to its caller along with EIO. Finally, gfs2_block_map gets both EIO and
&mp.mp_bh[0] filled with the stale pointer, and release_metapath
calls brelse again on it.
Signed-off-by: Masatake YAMATO <yamato@redhat.com>
Signed-off-by: Steven Whitehouse <swhiteho@redhat.com>
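
The double release described above can be modelled with a reference count. The following is a hedged user-space sketch, not the kernel code: `struct buf`, `buf_put()`, `meta_read()` and `release_slots()` are hypothetical stand-ins for the buffer head, brelse, gfs2_meta_read and release_metapath, and clearing the out-pointer on error is one plausible shape of the fix.

```c
#include <stddef.h>
#include <errno.h>

/* Hypothetical stand-in for a reference-counted buffer head. */
struct buf {
    int refcount;
};

/* Drop one reference; NULL is ignored, as with brelse(). */
static void buf_put(struct buf *b)
{
    if (b)
        b->refcount--;
}

/* Takes a reference and returns it via bhp; 'fail' simulates an I/O error. */
static int meta_read(struct buf *b, int fail, struct buf **bhp)
{
    b->refcount++;
    *bhp = b;
    if (fail) {
        buf_put(b);
        *bhp = NULL;    /* do not hand a released pointer back */
        return -EIO;
    }
    return 0;
}

/* Cleanup in the style of release_metapath: release every non-NULL slot. */
static void release_slots(struct buf **slots, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        buf_put(slots[i]);
        slots[i] = NULL;
    }
}
```

Because the slot is NULL after a failed read, the cleanup pass skips it and the reference count never goes negative.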

44b8db13

14 Jun, 2012 1 commit

GFS2: Combine functions get_local_rgrp and gfs2_inplace_reserve · 666d1d8a

Authored by Bob Peterson on Jun 13, 2012

This patch combines the rgrp functions get_local_rgrp and
gfs2_inplace_reserve so that the double retry loop is gone.
Signed-off-by: Bob Peterson <rpeterso@redhat.com>
Signed-off-by: Steven Whitehouse <swhiteho@redhat.com>

666d1d8a

13 Jun, 2012 1 commit

GFS2: Add kobject release method · 0d515210

Authored by Bob Peterson on Jun 13, 2012

This patch adds a kobject release function that properly maintains
the kobject use count, so that accesses to the sysfs files do not
cause an access to freed kernel memory after an unmount.
Signed-off-by: Bob Peterson <rpeterso@redhat.com>
Signed-off-by: Steven Whitehouse <swhiteho@redhat.com>

0d515210

11 Jun, 2012 3 commits

GFS2: Size seq_file buffer more carefully · 0fe2f1e9

Authored by Steven Whitehouse on Jun 11, 2012

This places a limit on the buffer size for archs with larger
PAGE_SIZE.
Signed-off-by: Steven Whitehouse <swhiteho@redhat.com>
Reported-by: Eric Dumazet <eric.dumazet@gmail.com>

0fe2f1e9

GFS2: Use seq_vprintf for glocks debugfs file · 1bb49303

Authored by Steven Whitehouse on Jun 11, 2012

Make use of the newly added seq_vprintf() function.
Signed-off-by: Steven Whitehouse <swhiteho@redhat.com>
Reported-by: Eric Dumazet <eric.dumazet@gmail.com>
Acked-by: Al Viro <viro@ZenIV.linux.org.uk>

1bb49303

seq_file: Add seq_vprintf function and export it · a4808147

Authored by Steven Whitehouse on Jun 11, 2012

The existing seq_printf function is rewritten in terms of the new
seq_vprintf which is also exported to modules. This allows GFS2
(and potentially other seq_file users) to have a vprintf based
interface and to avoid an extra copy into a temporary buffer in
some cases.
Signed-off-by: Steven Whitehouse <swhiteho@redhat.com>
Reported-by: Eric Dumazet <eric.dumazet@gmail.com>
Acked-by: Al Viro <viro@ZenIV.linux.org.uk>
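
The printf-in-terms-of-vprintf pattern this commit describes can be shown with a user-space analogue. This is only a sketch under stated assumptions: `struct sbuf`, `sbuf_vprintf()` and `sbuf_printf()` are hypothetical names built on the standard `vsnprintf()`, and they do not reproduce the kernel's seq_file internals.

```c
#include <stdarg.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical fixed-size output buffer standing in for a seq_file. */
struct sbuf {
    char data[64];
    size_t count;   /* bytes used so far */
};

/* va_list variant: appends formatted text, returns 0 or -1 on overflow. */
static int sbuf_vprintf(struct sbuf *s, const char *fmt, va_list args)
{
    size_t room = sizeof(s->data) - s->count;
    int len = vsnprintf(s->data + s->count, room, fmt, args);

    if (len < 0 || (size_t)len >= room) {
        s->count = sizeof(s->data);     /* mark the buffer as full */
        return -1;
    }
    s->count += (size_t)len;
    return 0;
}

/* Variadic variant, written purely in terms of the va_list one. */
static int sbuf_printf(struct sbuf *s, const char *fmt, ...)
{
    va_list args;
    int ret;

    va_start(args, fmt);
    ret = sbuf_vprintf(s, fmt, args);
    va_end(args);
    return ret;
}
```

A caller that already holds a `va_list` (for example, its own variadic logging helper) can call `sbuf_vprintf()` directly instead of formatting into a temporary buffer first and copying the result.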

a4808147

08 Jun, 2012 2 commits

GFS2: Use lvbs for storing rgrp information with mount option · 90306c41

Authored by Benjamin Marzinski on May 29, 2012

Instead of reading in the resource groups when gfs2 is checking
for free space to allocate from, gfs2 can store the necessary information
in the resource group's lvb. Also, instead of searching for unlinked
inodes in every resource group that's checked for free space, gfs2 can
store the number of unlinked inodes in the lvb, and only check for
unlinked inodes if it will find some.

The first time a resource group is locked, the lvb must be initialized.
Since this involves counting the unlinked inodes in the resource group,
this takes a little extra time. But after that, if the resource group
is locked with GL_SKIP, the buffer head won't be read in unless it's
actually needed.

Enabling the resource group lvbs is done via the rgrplvb mount option. If
this option isn't set, the lvbs will still be set and updated, but they won't
be verified or used by the filesystem. To safely turn on this option, all of
the nodes mounting the filesystem must be running code with this patch, and
the filesystem must have been completely unmounted since they were updated.
Signed-off-by: Benjamin Marzinski <bmarzins@redhat.com>
Signed-off-by: Steven Whitehouse <swhiteho@redhat.com>

90306c41

GFS2: Cache last hash bucket for glock seq_files · ba1ddcb6

Authored by Steven Whitehouse on Jun 08, 2012

For the glocks and glstats seq_files, which are exposed via debugfs
we should cache the most recent hash bucket, along with the offset
into that bucket. This allows us to restart from that point, rather
than having to begin at the beginning each time.

This is an idea from Eric Dumazet, however I've slightly extended it
so that if the position from which we are due to start is at any
point beyond the last cached point, we start from the last cached
point, plus whatever is the appropriate offset. I don't really expect
people to be lseeking around these files, but if they did so with only
positive offsets, then we'd still get some of the benefit of using a
cached offset.

With my simple test of around 200k entries in the file, I'm seeing
an approx 10x speed up.

Cc: Eric Dumazet <eric.dumazet@gmail.com>
Signed-off-by: Steven Whitehouse <swhiteho@redhat.com>
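
The caching idea can be sketched as a resumable iterator over hash buckets. This is an illustrative user-space model, not the GFS2 implementation: `struct table`, `struct iter` and `iter_seek()` are hypothetical, and buckets are reduced to entry counts. A seek to a position at or beyond the cached one resumes from the cache; a backward seek restarts from the beginning.

```c
#include <stddef.h>

#define NBUCKETS 4

/* Hypothetical hash table, reduced to the number of entries per bucket. */
struct table {
    int len[NBUCKETS];
};

/* Cached cursor: entry number 'pos' lives at index 'off' of 'bucket'. */
struct iter {
    long pos;
    int bucket;
    int off;
    long steps;     /* entries walked so far, for illustration only */
};

/* Move to entry 'pos'; returns 0 on success, -1 when past the last entry. */
static int iter_seek(const struct table *t, struct iter *it, long pos)
{
    if (pos < it->pos) {            /* backward seek: drop the cache */
        it->pos = 0;
        it->bucket = 0;
        it->off = 0;
    }
    while (it->bucket < NBUCKETS) {
        if (it->off >= t->len[it->bucket]) {    /* bucket exhausted */
            it->bucket++;
            it->off = 0;
            continue;
        }
        if (it->pos == pos)
            return 0;
        it->pos++;
        it->off++;
        it->steps++;
    }
    return -1;
}
```

Reading the file sequentially then costs one walk over the table in total, rather than one walk from the first bucket for every chunk read.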

ba1ddcb6

07 Jun, 2012 1 commit

GFS2: Increase buffer size for glocks and glstats debugfs files · df5d2f55

Authored by Steven Whitehouse on Jun 07, 2012

As per Al Viro's suggestion, this increases the buffer size used
for these two files. This provides a speed up of slightly less than
8x (i.e. proportional to the buffer size) for cases when we have
large numbers of glocks.

Cc: Al Viro <viro@ZenIV.linux.org.uk>
Signed-off-by: Steven Whitehouse <swhiteho@redhat.com>

df5d2f55

06 Jun, 2012 4 commits

GFS2: Fix error handling when reading an invalid block from the journal · 1b8ba31a

Authored by Steven Whitehouse on May 29, 2012

When we read an invalid block from the journal, we should not call
withdraw, but simply print a message and return an error. It is
up to the caller to then handle that error. In the case of mount
that means a failed mount, rather than a withdraw (requiring a
reboot). In the case of recovering another node's journal,
we return an error via the uevent.
Signed-off-by: Steven Whitehouse <swhiteho@redhat.com>

1b8ba31a

GFS2: Add "top dir" flag support · 23d0bb83

Authored by Steven Whitehouse on May 28, 2012

This patch adds support for the "top dir" flag. Currently this is unused
but a subsequent patch is planned which will add support for the
Orlov allocation policy when allocating subdirectories in a parent
with this flag set.

In order to ensure backward compatible behaviour, mkfs.gfs2 does
not currently tag the root directory with this flag; it must always be
set manually.
Signed-off-by: Steven Whitehouse <swhiteho@redhat.com>

23d0bb83

GFS2: Fold quota data into the reservations struct · 5407e242

Authored by Bob Peterson on May 18, 2012

This patch moves the ancillary quota data structures into the
block reservations structure. This saves GFS2 some time and
effort in allocating and deallocating the qadata structure.
Signed-off-by: Bob Peterson <rpeterso@redhat.com>
Signed-off-by: Steven Whitehouse <swhiteho@redhat.com>

5407e242

GFS2: Extend the life of the reservations · 0a305e49

Authored by Bob Peterson on Jun 06, 2012

This patch lengthens the lifespan of the reservations structure for
inodes. Before, they were allocated and deallocated for every write
operation. With this patch, they are allocated when the first write
occurs, and deallocated when the last process closes the file.
It's more efficient to do it this way because it saves GFS2 a lot of
unnecessary allocates and frees. It also gives us more flexibility
for the future: (1) we can now fold the qadata structure back into
the structure and save those alloc/frees, (2) we can use this for
multi-block reservations.
Signed-off-by: Bob Peterson <rpeterso@redhat.com>
Signed-off-by: Steven Whitehouse <swhiteho@redhat.com>

0a305e49

05 Jun, 2012 1 commit

vfs: Fix /proc/<tid>/fdinfo/<fd> file handling · 0640113b

Authored by Linus Torvalds on Jun 04, 2012

Cyrill Gorcunov reports that I broke the fdinfo files with commit
30a08bf2 ("proc: move fd symlink i_mode calculations into
tid_fd_revalidate()"), and he's quite right.

The tid_fd_revalidate() function is not just used for the <tid>/fd
symlinks, it's also used for the <tid>/fdinfo/<fd> files, and the
permission model for those is different.

So do the dynamic symlink permission handling just for symlinks, making
the fdinfo files once more appear as the proper regular files they are.

Of course, Al Viro argued (probably correctly) that we shouldn't do the
symlink permission games at all, and make the symlinks always just be
the normal 'lrwxrwxrwx'. That would have avoided this issue too, but
since somebody noticed that the permissions had changed (which was the
reason for that original commit 30a08bf2 in the first place), people
do apparently use this feature.

[ Basically, you can use the symlink permission data as a cheap "fdinfo"
replacement, since you see whether the file is open for reading and/or
writing by just looking at st_mode of the symlink. So the feature
does make sense, even if the pain it has caused means we probably
shouldn't have done it to begin with. ]
Reported-and-tested-by: Cyrill Gorcunov <gorcunov@openvz.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

0640113b

02 Jun, 2012 25 commits

nls: fix (and rename) mac NLS table files and config options · 8b8c0daa

Authored by Linus Torvalds on Jun 01, 2012

The config options in the Kconfig file (with _CODEPAGE_ in the name)
didn't match the config option name in the Makefile (no _CODEPAGE_).

And both of them were of the hard-to-read MACXYZZY variety, which made
them hard to parse for normal humans: MACROMAN easily reads as "macro
man", not as "Mac Roman".

So rename the options to be consistent, and be NLS_MAC_xyzzy.  Rename
the files to be mac-xyzzy.c too, and drop the "nls" part entirely (it's
already in the directory name).
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

8b8c0daa

fs/nls/Makefile: remove bogus CONFIG_ assignments · 92a89563

Authored by Andrew Morton on Jun 01, 2012

These were debug things which snuck through.
Reported-by: Yinghai Lu <yinghai@kernel.org>
Cc: Vladimir Serbinenko <phcoder@gmail.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

92a89563

CIFS: Move get_next_mid to ops struct · 88257360

Authored by Pavel Shilovsky on May 23, 2012

Reviewed-by: Jeff Layton <jlayton@redhat.com>
Signed-off-by: Pavel Shilovsky <pshilovsky@samba.org>
Signed-off-by: Steve French <sfrench@us.ibm.com>

88257360

CIFS: Make accessing is_valid_oplock/dump_detail ops struct field safe · 7f0adb53

Authored by Pavel Shilovsky on May 28, 2012

Signed-off-by: Pavel Shilovsky <pshilovsky@samba.org>
Signed-off-by: Steve French <sfrench@us.ibm.com>

7f0adb53

CIFS: Improve identation in cifs_unlock_range · ea319d57

Authored by Pavel Shilovsky on May 31, 2012

Signed-off-by: Pavel Shilovsky <pshilovsky@samba.org>
Signed-off-by: Steve French <sfrench@us.ibm.com>

ea319d57

CIFS: Fix possible wrong memory allocation · 0013fb4c

Authored by Pavel Shilovsky on May 31, 2012

This can happen when cifs_reconnect sets maxBuf to 0 and we then try to
calculate the size of the memory we need to store locks.
Signed-off-by: Pavel Shilovsky <pshilovsky@samba.org>
Signed-off-by: Steve French <sfrench@us.ibm.com>
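
The kind of size calculation at risk here can be sketched as follows. This is a hypothetical illustration, not the CIFS code: `max_lock_records()` and its parameters are invented for the example, which assumes the number of lock records is derived by subtracting a header size from the negotiated buffer size. With unsigned arithmetic, a buffer size of 0 would wrap around to a huge value unless it is rejected first.

```c
#include <stddef.h>
#include <errno.h>

/*
 * Number of fixed-size lock records that fit into a buffer of max_buf
 * bytes after a header of hdr_size bytes. Returns -EINVAL when the
 * buffer cannot hold even one record, instead of letting the unsigned
 * subtraction wrap around.
 */
static long max_lock_records(size_t max_buf, size_t hdr_size, size_t rec_size)
{
    if (rec_size == 0 || max_buf < hdr_size + rec_size)
        return -EINVAL;
    return (long)((max_buf - hdr_size) / rec_size);
}
```

Callers would check for a negative result before sizing any allocation from it.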

0013fb4c

HAVE_RESTORE_SIGMASK is defined on all architectures now · 754421c8

Authored by Al Viro on Apr 26, 2012

Everyone either defines it in arch thread_info.h or has TIF_RESTORE_SIGMASK
and picks default set_restore_sigmask() in linux/thread_info.h.  Kill the
ifdefs, slap #error in linux/thread_info.h to catch breakage when new ones
get merged.
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

754421c8

nfs: don't open in ->d_revalidate · 0ef97dcf

Authored by Miklos Szeredi on May 21, 2012

NFSv4 can't do reliable opens in d_revalidate, since it cannot know whether a
mount needs to be followed or not.  It does check d_mountpoint() on the dentry,
which can result in a weird error if the VFS found that the mount does not in
fact need to be followed, e.g.:

  # mount --bind /mnt/nfs /mnt/nfs-clone
  # echo something > /mnt/nfs/tmp/bar
  # echo x > /tmp/file
  # mount --bind /tmp/file /mnt/nfs-clone/tmp/bar
  # cat  /mnt/nfs/tmp/bar
  cat: /mnt/nfs/tmp/bar: Not a directory

Which should, by any sane filesystem, result in "something" being printed.

So instead do the open in f_op->open() and in the unlikely case that the cached
dentry turned out to be invalid, drop the dentry and return EOPENSTALE to let
the VFS retry.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
CC: Trond Myklebust <Trond.Myklebust@netapp.com>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

0ef97dcf

vfs: retry last component if opening stale dentry · 16b1c1cd

Authored by Miklos Szeredi on May 21, 2012

NFS optimizes away d_revalidates for last component of open.  This means that
open itself can find the dentry stale.

This patch allows the filesystem to return EOPENSTALE and the VFS will retry the
lookup on just the last component if possible.

If the lookup was done using RCU mode, including the last component, then this
is not possible since the parent dentry is lost.  In this case fall back to
non-RCU lookup.  Currently this is not used since NFS will always leave RCU
mode.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
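
The retry described above can be modelled in a few lines of user-space C. This is a sketch only: `struct fake_fs` and the `fake_*` helpers are hypothetical, the numeric value given to `EOPENSTALE` is a placeholder for the kernel-internal code, and the retry-once policy with a final `-ESTALE` is an assumption made for the illustration rather than a statement about the VFS code.

```c
#include <errno.h>

/* Placeholder for the kernel-internal error code; user space never sees it. */
#define EOPENSTALE 518

/* Hypothetical filesystem used only to drive the sketch. */
struct fake_fs {
    int stale_opens;    /* how many opens will still report a stale dentry */
    int lookups;        /* how many times the last component was looked up */
};

static int fake_lookup(struct fake_fs *fs)
{
    fs->lookups++;
    return 0;
}

static int fake_open(struct fake_fs *fs)
{
    if (fs->stale_opens > 0) {
        fs->stale_opens--;
        return -EOPENSTALE;
    }
    return 0;
}

/* Look up and open the last component, redoing the lookup once if stale. */
static int open_last_component(struct fake_fs *fs)
{
    int retried = 0;

    for (;;) {
        int err = fake_lookup(fs);

        if (err)
            return err;
        err = fake_open(fs);
        if (err != -EOPENSTALE)
            return err;
        if (retried)
            return -ESTALE;     /* give up after one retry */
        retried = 1;
    }
}
```

Only the last component is looked up again; the rest of the path walk is not repeated.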

16b1c1cd

vfs: nameidata_to_filp(): don't throw away file on error · 50ee93af

Authored by Miklos Szeredi on May 21, 2012

If open fails, don't put the file.  This allows it to be reused if open needs to
be retried.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

50ee93af

vfs: nameidata_to_filp(): inline __dentry_open() · 91daee98

Authored by Miklos Szeredi on May 21, 2012

Copy __dentry_open() into nameidata_to_filp().
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

91daee98

vfs: do_dentry_open(): don't put filp · 78f71eff

Authored by Miklos Szeredi on May 21, 2012

Move put_filp() out to __dentry_open(), the only caller now.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

78f71eff

vfs: split __dentry_open() · 90ad1a8e

Authored by Miklos Szeredi on May 21, 2012

Split __dentry_open() into two functions:

  do_dentry_open() - does most of the actual work, doesn't put file on failure
  open_check_o_direct() - after a successful open, checks direct_IO method

This will allow i_op->atomic_open to do just the file initialization and leave
the direct_IO checking to the VFS.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

90ad1a8e

vfs: do_last() common post lookup · 5f5daac1

Authored by Miklos Szeredi on May 21, 2012

Now the post lookup code can be shared between O_CREAT and plain opens since
they are essentially the same.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

5f5daac1

vfs: do_last(): add audit_inode before open · d7fdd7f6

Authored by Miklos Szeredi on May 21, 2012

This allows this code to be shared between O_CREAT and plain opens.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

d7fdd7f6

vfs: do_last(): only return EISDIR for O_CREAT · 050ac841

Authored by Miklos Szeredi on May 21, 2012

This allows this code to be shared between O_CREAT and plain opens.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

050ac841

vfs: do_last(): check LOOKUP_DIRECTORY · af2f5542

Authored by Miklos Szeredi on May 21, 2012

Check for ENOTDIR before finishing open.  This allows this code to be shared
between O_CREAT and plain opens.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

af2f5542

vfs: do_last(): make ENOENT exit RCU safe · 54c33e7f

Authored by Miklos Szeredi on May 21, 2012

This will allow this code to be used in RCU mode.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

54c33e7f

vfs: make follow_link check RCU safe · d45ea867

Authored by Miklos Szeredi on May 21, 2012

This will allow this code to be used in RCU mode.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

d45ea867

vfs: do_last(): use inode variable · decf3400

Authored by Miklos Szeredi on May 21, 2012

Use helper variable instead of path->dentry->d_inode before complete_walk().
This will allow this code to be used in RCU mode.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

decf3400

vfs: do_last(): inline walk_component() · a1eb3315

Authored by Miklos Szeredi on May 21, 2012

Copy walk_component() into do_lookup().
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

a1eb3315

vfs: do_last(): make exit RCU safe · e276ae67

Authored by Miklos Szeredi on May 21, 2012

Allow returning from do_last() with LOOKUP_RCU still set on the "out:" and
"exit:" labels.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

e276ae67

vfs: split do_lookup() · 697f514d

Authored by Miklos Szeredi on May 21, 2012

Split do_lookup() into two functions:

  lookup_fast() - does cached lookup without i_mutex
  lookup_slow() - does lookup with i_mutex

Both follow managed dentries.

The new functions are needed by atomic_open.
Signed-off-by: Miklos Szeredi <mszeredi@suse.cz>
Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>

697f514d

Btrfs: move over to use ->update_time · e41f941a

Authored by Josef Bacik on Mar 26, 2012

Btrfs had been doing its own file_update_time so we could catch ENOSPC
properly, so just update our btrfs_update_time to work with the new stuff and
then we'll be fancy later.  Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>

e41f941a

fs: introduce inode operation ->update_time · c3b2da31

Authored by Josef Bacik on Mar 26, 2012

Btrfs has to make sure we have space to allocate new blocks in order to modify
the inode, so updating time can fail.  We've gotten around this by having our
own file_update_time but this is kind of a pain, and Christoph has indicated he
would like to make xfs do something different with atime updates.  So introduce
->update_time, where we will deal with i_version and a/m/c time updates and
indicate which changes need to be made.  The normal version just does what it
has always done, updates the time and marks the inode dirty, and then
filesystems can choose to do something different.

I've gone through all of the users of file_update_time and made them check for
errors, with the exception of the fault code, since it's complicated and I wasn't
quite sure what to do there. Also, Jan is going to be pushing the file time
updates into page_mkwrite for those who have it, so that should satisfy btrfs and
make it not a big deal to check the file_update_time() return code in the
generic fault path. Thanks,
Signed-off-by: Josef Bacik <josef@redhat.com>
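
The shape of an optional, fallible operation with a generic default can be sketched as below. This is a simplified user-space model, not the kernel interface: the structures, the `UPD_*` flags and all function names are hypothetical, and time is reduced to a plain `long`.

```c
#include <stddef.h>
#include <errno.h>

struct inode;

/* Hypothetical operations table; a filesystem may leave update_time NULL. */
struct inode_ops {
    int (*update_time)(struct inode *inode, long now, int flags);
};

struct inode {
    long atime, mtime, ctime;
    int dirty;
    const struct inode_ops *ops;
};

/* Hypothetical flags saying which timestamps need to change. */
enum { UPD_ATIME = 1, UPD_MTIME = 2, UPD_CTIME = 4 };

/* Generic behaviour: set the requested times and mark the inode dirty. */
static int default_update_time(struct inode *inode, long now, int flags)
{
    if (flags & UPD_ATIME)
        inode->atime = now;
    if (flags & UPD_MTIME)
        inode->mtime = now;
    if (flags & UPD_CTIME)
        inode->ctime = now;
    inode->dirty = 1;
    return 0;
}

/* Dispatch to the filesystem's method if it has one; errors propagate. */
static int update_time(struct inode *inode, long now, int flags)
{
    if (inode->ops && inode->ops->update_time)
        return inode->ops->update_time(inode, now, flags);
    return default_update_time(inode, now, flags);
}

/* A filesystem whose update can fail, e.g. when no space can be reserved. */
static int nospace_update_time(struct inode *inode, long now, int flags)
{
    (void)inode;
    (void)now;
    (void)flags;
    return -ENOSPC;
}

static const struct inode_ops nospace_ops = { nospace_update_time };
```

Because the dispatcher returns the method's result, callers can check for errors such as `-ENOSPC` instead of assuming a timestamp update always succeeds.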

c3b2da31