提交 · a4ffc152198efba2ed9e6eac0eb97f17bfebce85 · openeuler / raspberrypi-kernel

29 3月, 2012 25 次提交

由 Mikulas Patocka 提交于 3月 28, 2012

This device-mapper target creates a read-only device that transparently
validates the data on one underlying device against a pre-generated tree
of cryptographic checksums stored on a second device.

Two checksum device formats are supported: version 0 which is already
shipping in Chromium OS and version 1 which incorporates some
improvements.
Signed-off-by: NMikulas Patocka <mpatocka@redhat.com>
Signed-off-by: NMandeep Singh Baines <msb@chromium.org>
Signed-off-by: NWill Drewry <wad@chromium.org>
Signed-off-by: NElly Jones <ellyjones@chromium.org>
Cc: Milan Broz <mbroz@redhat.com>
Cc: Olof Johansson <olofj@chromium.org>
Cc: Steffen Klassert <steffen.klassert@secunet.com>
Cc: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

a4ffc152

dm bufio: prefetch · a66cc28f

由 Mikulas Patocka 提交于 3月 28, 2012

This patch introduces a new function dm_bufio_prefetch. It prefetches
the specified range of blocks into dm-bufio cache without waiting
for i/o completion.
Signed-off-by: NMikulas Patocka <mpatocka@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

a66cc28f

dm thin: add pool target flags to control discard · 67e2e2b2

由 Joe Thornber 提交于 3月 28, 2012

Add dm thin target arguments to control discard support.

ignore_discard: Disables discard support

no_discard_passdown: Don't pass discards down to the underlying data
device, but just remove the mapping within the thin provisioning target.
Signed-off-by: NJoe Thornber <ejt@redhat.com>
Signed-off-by: NMike Snitzer <snitzer@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

67e2e2b2

dm thin: support discards · 104655fd

由 Joe Thornber 提交于 3月 28, 2012

Support discards in the thin target.

On discard the corresponding mapping(s) are removed from the thin
device.  If the associated block(s) are no longer shared the discard
is passed to the underlying device.

All bios other than discards now have an associated deferred_entry
that is saved to the 'all_io_entry' in endio_hook.  When non-discard
IO completes and associated mappings are quiesced any discards that
were deferred, via ds_add_work() in process_discard(), will be queued
for processing by the worker thread.
Signed-off-by: NJoe Thornber <ejt@redhat.com>
Signed-off-by: NMike Snitzer <snitzer@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

drivers/md/dm-thin.c |  173 ++++++++++++++++++++++++++++++++++++++++++++++----
 drivers/md/dm-thin.c |  172 ++++++++++++++++++++++++++++++++++++++++++++++-----
 1 file changed, 158 insertions(+), 14 deletions(-)

104655fd

dm thin: prepare to support discard · eb2aa48d

由 Joe Thornber 提交于 3月 28, 2012

This patch contains the ground work needed for dm-thin to support discard.

  - Adds endio function that replaces shared_read_endio.

  - Introduce an explicit 'quiesced' flag into the new_mapping structure.
    Before, this was implicitly indicated by m->list being empty.

  - The map_info->ptr remains constant for the duration of a bio's trip
    through the thin target.  Make it easier to reason about it.
Signed-off-by: NJoe Thornber <ejt@redhat.com>
Signed-off-by: NMike Snitzer <snitzer@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

eb2aa48d

dm thin: use dm_target_offset · 6efd6e83

由 Alasdair G Kergon 提交于 3月 28, 2012

Use dm_target_offset wrapper instead of referencing the awkward ti->begin
explicitly.
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

6efd6e83

dm thin: support read only external snapshot origins · 2dd9c257

由 Joe Thornber 提交于 3月 28, 2012

Support the use of an external _read only_ device as an origin for a thin
device.

Any read to an unprovisioned area of the thin device will be passed
through to the origin.  Writes trigger allocation of new blocks as
usual.

One possible use case for this would be VM hosts that want to run
guests on thinly-provisioned volumes but have the base image on another
device (possibly shared between many VMs).
Signed-off-by: NJoe Thornber <ejt@redhat.com>
Signed-off-by: NMike Snitzer <snitzer@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

2dd9c257

dm thin: relax hard limit on the maximum size of a metadata device · c4a69ecd

由 Mike Snitzer 提交于 3月 28, 2012

The thin metadata format can only make use of a device that is <=
THIN_METADATA_MAX_SECTORS (currently 15.9375 GB).  Therefore, there is no
practical benefit to using a larger device.

However, it may be that other factors impose a certain granularity for
the space that is allocated to a device (E.g. lvm2 can impose a coarse
granularity through the use of large, >= 1 GB, physical extents).

Rather than reject a larger metadata device, during thin-pool device
construction, switch to allowing it but issue a warning if a device
larger than THIN_METADATA_MAX_SECTORS_WARNING (16 GB) is
provided.  Any space over 15.9375 GB will not be used.
Signed-off-by: NMike Snitzer <snitzer@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

c4a69ecd

dm persistent data: remove space map ref_count entries if redundant · 71fd5ae2

由 Joe Thornber 提交于 3月 28, 2012

Save space by removing entries from the space map ref_count tree if
they're no longer needed.

Ref counts are stored in two places: a bitmap if the ref_count is
below 3, or a btree of uint32_t if 3 or above.

When a ref_count that was above 3 drops below we can remove it from
the tree and save some metadata space.  This removal was commented out
before because I was unsure why this was causing under-populated btree
nodes.  Earlier patches have fixed this issue.
Signed-off-by: NJoe Thornber <ejt@redhat.com>
Signed-off-by: NMike Snitzer <snitzer@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

71fd5ae2

dm thin: commit outstanding data every second · 905e51b3

由 Joe Thornber 提交于 3月 28, 2012

Commit unwritten data every second to prevent too much building up.

Released blocks don't become available until after the next commit
(for crash resilience).  Prior to this patch commits were only
triggered by a message to the target or a REQ_{FLUSH,FUA} bio.  This
allowed far too big a position to build up.

The interval is hard-coded to 1 second.  This is a sensible setting.
I'm not making this user configurable, since there isn't much to be
gained by tweaking this - and a lot lost by setting it far too high.
Signed-off-by: NJoe Thornber <ejt@redhat.com>
Signed-off-by: NMike Snitzer <snitzer@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

905e51b3

dm: reject trailing characters in sccanf input · 31998ef1

由 Mikulas Patocka 提交于 3月 28, 2012

Device mapper uses sscanf to convert arguments to numbers. The problem is that
the way we use it ignores additional unmatched characters in the scanned string.

For example, this `if (sscanf(string, "%d", &number) == 1)' will match a number,
but also it will match number with some garbage appended, like "123abc".

As a result, device mapper accepts garbage after some numbers. For example
the command `dmsetup create vg1-new --table "0 16384 linear 254:1bla 34816bla"'
will pass without an error.

This patch fixes all sscanf uses in device mapper. It appends "%c" with
a pointer to a dummy character variable to every sscanf statement.

The construct `if (sscanf(string, "%d%c", &number, &dummy) == 1)' succeeds
only if string is a null-terminated number (optionally preceded by some
whitespace characters). If there is some character appended after the number,
sscanf matches "%c", writes the character to the dummy variable and returns 2.
We check the return value for 1 and consequently reject numbers with some
garbage appended.
Signed-off-by: NMikulas Patocka <mpatocka@redhat.com>
Acked-by: NMike Snitzer <snitzer@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

31998ef1

dm raid: handle failed devices during start up · 0447568f

由 Jonathan E Brassow 提交于 3月 28, 2012

The dm-raid code currently fails to create a RAID array if any of the
superblocks cannot be read.  This was an oversight as there is already
code to handle this case if the values ('- -') were provided for the
failed array position.

With this patch, if a superblock cannot be read, the array position's
fields are initialized as though '- -' was set in the table.  That is,
the device is failed and the position should not be used, but if there
is sufficient redundancy, the array should still be activated.
Signed-off-by: NJonathan Brassow <jbrassow@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

0447568f

dm thin metadata: pass correct space map to dm_sm_root_size · fef838cc

由 Joe Thornber 提交于 3月 28, 2012

Fix a harmless typo.

The root is a chunk of data that gets written to the superblock.  This
data is used to recreate the space map when opening a metadata area.
We have two space maps; one tracking space on the metadata device and
one of the data device.  Both of these use the same format for their
root, so this typo was harmless.
Signed-off-by: NJoe Thornber <ejt@redhat.com>
Signed-off-by: NMike Snitzer <snitzer@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

fef838cc

dm persistent data: remove redundant value_size arg from value_ptr · a3aefb39

由 Joe Thornber 提交于 3月 28, 2012

Now that the value_size is held within every node of the btrees we can
remove this argument from value_ptr().

For the last few months a BUG_ON has been checking this argument is
the same as that held in the node.  No issues were reported.  So this
is a safe change.
Signed-off-by: NJoe Thornber <ejt@redhat.com>
Signed-off-by: NMike Snitzer <snitzer@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

a3aefb39

dm mpath: detect invalid map_context · 466891f9

由 Jun'ichi Nomura 提交于 3月 28, 2012

The map_context pointer should always be set. However, we have reports
that upon requeuing it is not set correctly.  So add set and clear
functions with a BUG_ON() to track the issue properly.
Signed-off-by: NJun'ichi Nomura <j-nomura@ce.jp.nec.com>
Cc: Mike Snitzer <snitzer@redhat.com>
Acked-by: NHannes Reinecke <hare@suse.de>
Tested-by: NHeiko Carstens <heiko.carstens@de.ibm.com>
Acked-by: NDave Wysochanski <dwysocha@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

466891f9

dm: clear bi_end_io on remapping failure · 4d7b38b7

由 Hannes Reinecke 提交于 3月 28, 2012

As a precaution, set bi_end_io to NULL when failing to remap.
Signed-off-by: NHannes Reinecke <hare@suse.de>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

4d7b38b7

dm table: simplify call to free_devices · 574ce07e

由 Hannes Reinecke 提交于 3月 28, 2012

free_devices in dm_table.c already uses list_for_each(), so we don't
need to check if the list is empty.
Signed-off-by: NHannes Reinecke <hare@suse.de>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

574ce07e

dm thin: correct comments · fe878f34

由 Joe Thornber 提交于 3月 28, 2012

Remove documentation for unimplemented 'trim' message.

I'd planned a 'trim' target message for shrinking thin devices, but
this is better handled via the discard ioctl.
Signed-off-by: NJoe Thornber <ejt@redhat.com>
Signed-off-by: NMike Snitzer <snitzer@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

fe878f34

dm raid: no longer experimental · 035220b3

由 Alasdair G Kergon 提交于 3月 28, 2012

The dm raid module (using md) is becoming the preferred way of creating long-lived
mirrors through userspace LVM so remove the EXPERIMENTAL tag.
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

035220b3

dm uevent: no longer experimental · e0b215da

由 Alasdair G Kergon 提交于 3月 28, 2012

Drop EXPERIMENTAL tag from dm-uevent.

It's not changed for a while and some userspace tools are relying upon it.
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

e0b215da

dm persistent data: fix btree rebalancing after remove · b0988900

由 Joe Thornber 提交于 3月 28, 2012

When we remove an entry from a node we sometimes rebalance with it's
two neighbours.  This wasn't being done correctly; in some cases
entries have to move all the way from the right neighbour to the left
neighbour, or vice versa.  This patch pretty much re-writes the
balancing code to fix it.

This code is barely used currently; only when you delete a thin
device, and then only if you have hundreds of them in the same pool.
Once we have discard support, which removes mappings, this will be used
much more heavily.
Signed-off-by: NJoe Thornber <ejt@redhat.com>
Cc: stable@kernel.org
Signed-off-by: NMike Snitzer <snitzer@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

b0988900

dm thin: fix stacked bi_next usage · 6f94a4c4

由 Joe Thornber 提交于 3月 28, 2012

Avoid using the bi_next field for the holder of a cell when deferring
bios because a stacked device below might change it.  Store the
holder in a new field in struct cell instead.

When a cell is created, the bio that triggered creation (the holder) was
added to the same bio list as subsequent bios.  In some cases we pass
this holder bio directly to devices underneath.  If those devices use
the bi_next field there will be trouble...

This also simplifies some code that had to work out which bio was the
holder.
Signed-off-by: NJoe Thornber <ejt@redhat.com>
Cc: stable@kernel.org
Signed-off-by: NMike Snitzer <snitzer@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

6f94a4c4

dm crypt: add missing error handling · 72c6e7af

由 Mikulas Patocka 提交于 3月 28, 2012

Always set io->error to -EIO when an error is detected in dm-crypt.

There were cases where an error code would be set only if we finish
processing the last sector. If there were other encryption operations in
flight, the error would be ignored and bio would be returned with
success as if no error happened.

This bug is present in kcryptd_crypt_write_convert, kcryptd_crypt_read_convert
and kcryptd_async_done.
Signed-off-by: NMikulas Patocka <mpatocka@redhat.com>
Cc: stable@kernel.org
Reviewed-by: NMilan Broz <mbroz@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

72c6e7af

dm crypt: fix mempool deadlock · aeb2deae

由 Mikulas Patocka 提交于 3月 28, 2012

This patch fixes a possible deadlock in dm-crypt's mempool use.

Currently, dm-crypt reserves a mempool of MIN_BIO_PAGES reserved pages.
It allocates first MIN_BIO_PAGES with non-failing allocation (the allocation
cannot fail and waits until the mempool is refilled). Further pages are
allocated with different gfp flags that allow failing.

Because allocations may be done in parallel, this code can deadlock. Example:
There are two processes, each tries to allocate MIN_BIO_PAGES and the processes
run simultaneously.
It may end up in a situation where each process allocates (MIN_BIO_PAGES / 2)
pages. The mempool is exhausted. Each process waits for more pages to be freed
to the mempool, which never happens.

To avoid this deadlock scenario, this patch changes the code so that only
the first page is allocated with non-failing gfp mask. Allocation of further
pages may fail.
Signed-off-by: NMikulas Patocka <mpatocka@redhat.com>
Cc: stable@kernel.org
Signed-off-by: NMilan Broz <mbroz@redhat.com>
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

aeb2deae

dm exception store: fix init error path · aadbe266

由 Andrei Warkentin 提交于 3月 28, 2012

Call the correct exit function on failure in dm_exception_store_init.
Signed-off-by: NAndrei Warkentin <andrey.warkentin@gmail.com>
Acked-by: NMike Snitzer <snitzer@redhat.com>
Cc: stable@kernel.org
Signed-off-by: NAlasdair G Kergon <agk@redhat.com>

aadbe266

22 3月, 2012 15 次提交

B
drm/nouveau/dp: support version 4.0 of DP table · 65445992
由 Ben Skeggs 提交于 3月 11, 2012
```
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
```
65445992
B
drm/nve0/disp: nvidia randomly decided to move the dithering method · e592c73b
由 Ben Skeggs 提交于 3月 05, 2012
```
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
```
e592c73b
B
drm/nve0: initial modesetting support for kepler chipsets · 68455a43
由 Ben Skeggs 提交于 3月 04, 2012
```
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
```
68455a43
B
drm/nouveau: add bios connector type for dms59 · fa2c113a
由 Ben Skeggs 提交于 3月 22, 2012
```
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
```
fa2c113a

drm/nouveau: move out of staging drivers · f3c93cbd

由 Ben Skeggs 提交于 3月 16, 2012

There's really no good reason for us to be in here anymore, we have to
maintain this ABI anyway to avoid angering people.
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>

f3c93cbd

drm/nouveau: bump version to 1.0.0 · f887c425

由 Ben Skeggs 提交于 3月 16, 2012

The time has come to get a proper version number that we can change to
indicate new features etc, rather than the lock-step 0.0.XX that we
previously had.

libdrm has recognised this version as compatible with 0.0.16 since 2.4.22,
so hopefully any breakage people see should be very minimal.
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>

f887c425

drm/nvd0/disp: ignore clock set if no pclk · dd62608b

由 Ben Skeggs 提交于 3月 16, 2012

This happens somehow during init on a machine I have, and leads to a
divide-by-zero.

Lets avoid that...
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>

dd62608b

B
drm/nouveau: oops, increase channel dispc_vma to 4 · 27100ac9
由 Ben Skeggs 提交于 3月 16, 2012
```
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
```
27100ac9

drm/nouveau: inform userspace of new kernel subchannel requirements · 664695ae

由 Ben Skeggs 提交于 3月 21, 2012

All available subchannels are now available for userspace to do with as it
pleases on NVC0+.

On all earlier chipsets, the kernel still uses a software object on subc 0
to implement the page flip completion method.  I hope to find some decent
way of addressing this too, but it's a tad tricker prior to fermi.
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>

664695ae

B
drm/nouveau: remove m2mf creation on userspace channels · 48aca13f
由 Ben Skeggs 提交于 3月 18, 2012
```
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
```
48aca13f
B
drm/nvc0-/disp: reimplement flip completion method as fifo method · d5316e25
由 Ben Skeggs 提交于 3月 21, 2012
```
Removes need for M2MF subchannel usage on NVC0+.
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
```
d5316e25

drm/nouveau: move fence sequence check to start of loop · b08abd4e

由 Ben Skeggs 提交于 3月 21, 2012

I want to be able to use REF_CNT from other places in the kernel without
pushing a fence object onto the list of emitted fences.

The current code makes an assumption that every time the acked sequence is
bumped that there's at least one fence on the list that'll be signalled.

This will no longer be true in the near future.
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>

b08abd4e

B
drm/nouveau: remove subchannel names from places where it doesn't matter · b5b2e598
由 Ben Skeggs 提交于 3月 16, 2012
```
These are FIFO methods, it doesn't matter what subchannel is being used.
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>
```
b5b2e598

drm/nouveau/ttm: always do buffer moves on kernel channel · accf9496

由 Ben Skeggs 提交于 3月 16, 2012

There was once good reasons for wanting the drm to be able to use M2MF etc
on user channels, but they're not relevant anymore. For the general
buffer move case, we've already lost by transferring between vram/sysmem
already so the context switching overhead is minimal in comparison.
Signed-off-by: NBen Skeggs <bskeggs@redhat.com>

accf9496

mm, oom: force oom kill on sysrq+f · 08ab9b10

由 David Rientjes 提交于 3月 21, 2012

The oom killer chooses not to kill a thread if:

 - an eligible thread has already been oom killed and has yet to exit,
   and

 - an eligible thread is exiting but has yet to free all its memory and
   is not the thread attempting to currently allocate memory.

SysRq+F manually invokes the global oom killer to kill a memory-hogging
task.  This is normally done as a last resort to free memory when no
progress is being made or to test the oom killer itself.

For both uses, we always want to kill a thread and never defer.  This
patch causes SysRq+F to always kill an eligible thread and can be used to
force a kill even if another oom killed thread has failed to exit.
Signed-off-by: NDavid Rientjes <rientjes@google.com>
Acked-by: NKOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>
Acked-by: NPekka Enberg <penberg@kernel.org>
Acked-by: NKAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

08ab9b10