提交 · b6809ee573cc2f8de97f7c8f45eacc5db1129060 · openeuler / raspberrypi-kernel

01 3月, 2016 1 次提交

iommu/amd: Detach device from domain before removal · b6809ee5

由 Joerg Roedel 提交于 2月 26, 2016

Detach the device that is about to be removed from its
domain (if it has one) to clear any related state like DTE
entry and device's ATS state.
Reported-by: NKelly Zytaruk <Kelly.Zytaruk@amd.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

b6809ee5

25 2月, 2016 2 次提交

iommu/amd: Apply workaround for ATS write permission check · 358875fd

由 Jay Cornwall 提交于 2月 10, 2016

The AMD Family 15h Models 30h-3Fh (Kaveri) BIOS and Kernel Developer's
Guide omitted part of the BIOS IOMMU L2 register setup specification.
Without this setup the IOMMU L2 does not fully respect write permissions
when handling an ATS translation request.

The IOMMU L2 will set PTE dirty bit when handling an ATS translation with
write permission request, even when PTE RW bit is clear. This may occur by
direct translation (which would cause a PPR) or by prefetch request from
the ATC.

This is observed in practice when the IOMMU L2 modifies a PTE which maps a
pagecache page. The ext4 filesystem driver BUGs when asked to writeback
these (non-modified) pages.

Enable ATS write permission check in the Kaveri IOMMU L2 if BIOS has not.
Signed-off-by: NJay Cornwall <jay@jcornwall.me>
Cc: <stable@vger.kernel.org> # v3.19+
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

358875fd

iommu/amd: Fix boot warning when device 00:00.0 is not iommu covered · 38e45d02

由 Suravee Suthikulpanit 提交于 2月 23, 2016

The setup code for the performance counters in the AMD IOMMU driver
tests whether the counters can be written. It tests to setup a counter
for device 00:00.0, which fails on systems where this particular device
is not covered by the IOMMU.

Fix this by not relying on device 00:00.0 but only on the IOMMU being
present.

Cc: stable@vger.kernel.org
Signed-off-by: NSuravee Suthikulpanit <Suravee.Suthikulpanit@amd.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

38e45d02

15 2月, 2016 1 次提交

iommu/vt-d: Clear PPR bit to ensure we get more page request interrupts · 46924008

由 David Woodhouse 提交于 2月 15, 2016

According to the VT-d specification we need to clear the PPR bit in
the Page Request Status register when handling page requests, or the
hardware won't generate any more interrupts.

This wasn't actually necessary on SKL/KBL (which may well be the
subject of a hardware erratum, although it's harmless enough). But
other implementations do appear to get it right, and we only ever get
one interrupt unless we clear the PPR bit.
Reported-by: NCQ Tang <cq.tang@intel.com>
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
Cc: stable@vger.kernel.org

46924008

29 1月, 2016 3 次提交

iommu/amd: Correct the wrong setting of alias DTE in do_attach · 9b1a12d2

由 Baoquan He 提交于 1月 20, 2016

In below commit alias DTE is set when its peripheral is
setting DTE. However there's a code bug here to wrongly
set the alias DTE, correct it in this patch.

commit e25bfb56
Author: Joerg Roedel <jroedel@suse.de>
Date:   Tue Oct 20 17:33:38 2015 +0200

    iommu/amd: Set alias DTE in do_attach/do_detach
Signed-off-by: NBaoquan He <bhe@redhat.com>
Tested-by: NMark Hounschell <markh@compro.net>
Cc: stable@vger.kernel.org # v4.4
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

9b1a12d2

iommu/vt-d: Don't skip PCI devices when disabling IOTLB · da972fb1

由 Jeremy McNicoll 提交于 1月 14, 2016

Fix a simple typo when disabling IOTLB on PCI(e) devices.

Fixes: b16d0cb9 ("iommu/vt-d: Always enable PASID/PRI PCI capabilities before ATS")
Cc: stable@vger.kernel.org  # v4.4
Signed-off-by: NJeremy McNicoll <jmcnicol@redhat.com>
Reviewed-by: NAlex Williamson <alex.williamson@redhat.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

da972fb1

iommu/io-pgtable-arm: Fix io-pgtable-arm build failure · 8f6aff98

由 Lada Trimasova 提交于 1月 27, 2016

Trying to build a kernel for ARC with both options CONFIG_COMPILE_TEST
and CONFIG_IOMMU_IO_PGTABLE_LPAE enabled (e.g. as a result of "make
allyesconfig") results in the following build failure:

 | CC drivers/iommu/io-pgtable-arm.o
 | linux/drivers/iommu/io-pgtable-arm.c: In
 | function ‘__arm_lpae_alloc_pages’:
 | linux/drivers/iommu/io-pgtable-arm.c:221:3:
 | error: implicit declaration of function ‘dma_map_single’
 | [-Werror=implicit-function-declaration]
 | dma = dma_map_single(dev, pages, size, DMA_TO_DEVICE);
 | ^
 | linux/drivers/iommu/io-pgtable-arm.c:221:42:
 | error: ‘DMA_TO_DEVICE’ undeclared (first use in this function)
 | dma = dma_map_single(dev, pages, size, DMA_TO_DEVICE);
 | ^

Since IOMMU_IO_PGTABLE_LPAE depends on DMA API, io-pgtable-arm.c should
include linux/dma-mapping.h. This fixes the reported failure.

Cc: Alexey Brodkin <abrodkin@synopsys.com>
Cc: Vineet Gupta <vgupta@synopsys.com>
Cc: Joerg Roedel <joro@8bytes.org>
Signed-off-by: NLada Trimasova <ltrimas@synopsys.com>
Signed-off-by: NWill Deacon <will.deacon@arm.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

8f6aff98

14 1月, 2016 2 次提交

iommu/vt-d: Fix 64-bit accesses to 32-bit DMAR_GSTS_REG · fda3bec1

由 CQ Tang 提交于 1月 13, 2016

This is a 32-bit register. Apparently harmless on real hardware, but
causing justified warnings in simulation.
Signed-off-by: NCQ Tang <cq.tang@intel.com>
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
Cc: stable@vger.kernel.org

fda3bec1

iommu/vt-d: Fix mm refcounting to hold mm_count not mm_users · e57e58bd

由 David Woodhouse 提交于 1月 12, 2016

Holding mm_users works OK for graphics, which was the first user of SVM
with VT-d. However, it works less well for other devices, where we actually
do a mmap() from the file descriptor to which the SVM PASID state is tied.

In this case on process exit we end up with a recursive reference count:
 - The MM remains alive until the file is closed and the driver's release()
   call ends up unbinding the PASID.
 - The VMA corresponding to the mmap() remains intact until the MM is
   destroyed.
 - Thus the file isn't closed, even when exit_files() runs, because the
   VMA is still holding a reference to it. And the MM remains alive…

To address this issue, we *stop* holding mm_users while the PASID is bound.
We already hold mm_count by virtue of the MMU notifier, and that can be
made to be sufficient.

It means that for a period during process exit, the fun part of mmput()
has happened and exit_mmap() has been called so the MM is basically
defunct. But the PGD still exists and the PASID is still bound to it.

During this period, we have to be very careful — exit_mmap() doesn't use
mm->mmap_sem because it doesn't expect anyone else to be touching the MM
(quite reasonably, since mm_users is zero). So we also need to fix the
fault handler to just report failure if mm_users is already zero, and to
temporarily bump mm_users while handling any faults.

Additionally, exit_mmap() calls mmu_notifier_release() *before* it tears
down the page tables, which is too early for us to flush the IOTLB for
this PASID. And __mmu_notifier_release() removes every notifier from the
list, so when exit_mmap() finally *does* tear down the mappings and
clear the page tables, we don't get notified. So we work around this by
clearing the PASID table entry in our MMU notifier release() callback.
That way, the hardware *can't* get any pages back from the page tables
before they get cleared.

Hardware designers have confirmed that the resulting 'PASID not present'
faults should be handled just as gracefully as 'page not present' faults,
the important criterion being that they don't perturb the operation for
any *other* PASID in the system.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
Cc: stable@vger.kernel.org

e57e58bd

07 1月, 2016 4 次提交

iommu/vt-d: Fix up error handling in alloc_iommu · bc847454

由 Joerg Roedel 提交于 1月 07, 2016

Only check for error when iommu->iommu_dev has been assigned
and only assign drhd->iommu when the function can't fail
anymore.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

bc847454

iommu/vt-d: Check the return value of iommu_device_create() · 59203379

由 Nicholas Krause 提交于 1月 04, 2016

This adds the proper check to alloc_iommu to make sure that
the call to iommu_device_create has completed successfully
and if not return the error code to the caller after freeing
up resources allocated previously.
Signed-off-by: NNicholas Krause <xerofoify@gmail.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

59203379

iommu/dma: Use correct offset in map_sg · 164afb1d

由 Robin Murphy 提交于 1月 04, 2016

When mapping a non-page-aligned scatterlist entry, we copy the original
offset to the output DMA address before aligning it to hand off to
iommu_map_sg(), then later adding the IOVA page address portion to get
the final mapped address. However, when the IOVA page size is smaller
than the CPU page size, it is the offset within the IOVA page we want,
not that within the CPU page, which can easily be larger than an IOVA
page and thus result in an incorrect final address.

Fix the bug by taking only the IOVA-aligned part of the offset as the
basis of the DMA address, not the whole thing.
Signed-off-by: NRobin Murphy <robin.murphy@arm.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

164afb1d

iommu/amd: Remove an unneeded condition · 1fb260bc

由 Dan Carpenter 提交于 1月 07, 2016

get_device_id() returns an unsigned short device id.  It never fails and
it never returns a negative so we can remove this condition.
Signed-off-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

1fb260bc

29 12月, 2015 26 次提交

iommu/amd: Preallocate dma_ops apertures based on dma_mask · a639a8ee

由 Joerg Roedel 提交于 12月 22, 2015

Preallocate between 4 and 8 apertures when a device gets it
dma_mask. With more apertures we reduce the lock contention
of the domain lock significantly.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

a639a8ee

iommu/amd: Use trylock to aquire bitmap_lock · 7b5e25b8

由 Joerg Roedel 提交于 12月 22, 2015

First search for a non-contended aperture with trylock
before spinning.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

7b5e25b8

iommu/amd: Make dma_ops_domain->next_index percpu · 5f6bed50

由 Joerg Roedel 提交于 12月 22, 2015

Make this pointer percpu so that we start searching for new
addresses in the range we last stopped and which is has a
higher probability of being still in the cache.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

5f6bed50

iommu/amd: Relax locking in dma_ops path · 92d420ec

由 Joerg Roedel 提交于 12月 21, 2015

Remove the long holding times of the domain->lock and rely
on the bitmap_lock instead.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

92d420ec

iommu/amd: Initialize new aperture range before making it visible · a73c1566

由 Joerg Roedel 提交于 12月 21, 2015

Make sure the aperture range is fully initialized before it
is visible to the address allocator.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

a73c1566

iommu/amd: Build io page-tables with cmpxchg64 · 7bfa5bd2

由 Joerg Roedel 提交于 12月 21, 2015

This allows to build up the page-tables without holding any
locks. As a consequence it removes the need to pre-populate
dma_ops page-tables.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

7bfa5bd2

J
iommu/amd: Allocate new aperture ranges in dma_ops_alloc_addresses · 266a3bd2
由 Joerg Roedel 提交于 12月 21, 2015
```
It really belongs there and not in __map_single.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>
```
266a3bd2

iommu/amd: Optimize dma_ops_free_addresses · 4eeca8c5

由 Joerg Roedel 提交于 12月 22, 2015

Don't flush the iommu tlb when we free something behind the
current next_bit pointer. Update the next_bit pointer
instead and let the flush happen on the next wraparound in
the allocation path.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

4eeca8c5

iommu/amd: Remove need_flush from struct dma_ops_domain · ab7032bb

由 Joerg Roedel 提交于 12月 21, 2015

The flushing of iommu tlbs is now done on a per-range basis.
So there is no need anymore for domain-wide flush tracking.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

ab7032bb

iommu/amd: Iterate over all aperture ranges in dma_ops_area_alloc · 2a87442c

由 Joerg Roedel 提交于 12月 21, 2015

This way we don't need to care about the next_index wrapping
around in dma_ops_alloc_addresses.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

2a87442c

iommu/amd: Flush iommu tlb in dma_ops_free_addresses · d41ab098

由 Joerg Roedel 提交于 12月 21, 2015

Instead of setting need_flush, do the flush directly in
dma_ops_free_addresses.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

d41ab098

iommu/amd: Rename dma_ops_domain->next_address to next_index · ebaecb42

由 Joerg Roedel 提交于 12月 21, 2015

It points to the next aperture index to allocate from. We
don't need the full address anymore because this is now
tracked in struct aperture_range.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

ebaecb42

iommu/amd: Remove 'start' parameter from dma_ops_area_alloc · 05ab49e0

由 Joerg Roedel 提交于 12月 21, 2015

Parameter is not needed because the value is part of the
already passed in struct dma_ops_domain.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

05ab49e0

iommu/amd: Flush iommu tlb in dma_ops_aperture_alloc() · ccb50e03

由 Joerg Roedel 提交于 12月 21, 2015

Since the allocator wraparound happens in this function now,
flush the iommu tlb there too.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

ccb50e03

iommu/amd: Retry address allocation within one aperture · 60e6a7cb

由 Joerg Roedel 提交于 12月 21, 2015

Instead of skipping to the next aperture, first try again in
the current one.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

60e6a7cb

iommu/amd: Move aperture_range.offset to another cache-line · ae62d49c

由 Joerg Roedel 提交于 12月 21, 2015

Moving it before the pte_pages array puts in into the same
cache-line as the spin-lock and the bitmap array pointer.
This should safe a cache-miss.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

ae62d49c

iommu/amd: Add dma_ops_aperture_alloc() function · a0f51447

由 Joerg Roedel 提交于 12月 21, 2015

Make this a wrapper around iommu_ops_area_alloc() for now
and add more logic to this function later on.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

a0f51447

iommu/amd: Pass correct shift to iommu_area_alloc() · b57c3c80

由 Joerg Roedel 提交于 12月 21, 2015

The page-offset of the aperture must be passed instead of 0.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

b57c3c80

iommu/amd: Flush the IOMMU TLB before the addresses are freed · 84b3a0bc

由 Joerg Roedel 提交于 12月 21, 2015

This allows to keep the bitmap_lock only for a very short
period of time.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

84b3a0bc

iommu/amd: Flush IOMMU TLB on __map_single error path · 53b3b65a

由 Joerg Roedel 提交于 12月 21, 2015

There have been present PTEs which in theory could have made
it to the IOMMU TLB. Flush the addresses out on the error
path to make sure no stale entries remain.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

53b3b65a

iommu/amd: Introduce bitmap_lock in struct aperture_range · 08c5fb93

由 Joerg Roedel 提交于 12月 21, 2015

This lock only protects the address allocation bitmap in one
aperture.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

08c5fb93

iommu/amd: Move 'struct dma_ops_domain' definition to amd_iommu.c · 007b74ba

由 Joerg Roedel 提交于 12月 21, 2015

It is only used in this file anyway, so keep it there. Same
with 'struct aperture_range'.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

007b74ba

J
iommu/amd: Warn only once on unexpected pte value · a7fb668f
由 Joerg Roedel 提交于 12月 21, 2015
```
This prevents possible flooding of the kernel log.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>
```
a7fb668f

iommu/ipmmu-vmsa: Don't truncate ttbr if LPAE is not enabled · f64232ee

由 Geert Uytterhoeven 提交于 12月 22, 2015

If CONFIG_PHYS_ADDR_T_64BIT=n:

    drivers/iommu/ipmmu-vmsa.c: In function 'ipmmu_domain_init_context':
    drivers/iommu/ipmmu-vmsa.c:434:2: warning: right shift count >= width of type
      ipmmu_ctx_write(domain, IMTTUBR0, ttbr >> 32);
      ^

As io_pgtable_cfg.arm_lpae_s1_cfg.ttbr[] is an array of u64s, assigning
it to a phys_addr_t may truncates it.  Make ttbr u64 to fix this.
Signed-off-by: NGeert Uytterhoeven <geert+renesas@glider.be>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

f64232ee

iommu/dma: Avoid unlikely high-order allocations · 0a9afeda

由 Robin Murphy 提交于 12月 18, 2015

Doug reports that the equivalent page allocator on 32-bit ARM exhibits
particularly pathalogical behaviour under memory pressure when
fragmentation is high, where allocating a 4MB buffer takes tens of
seconds and the number of calls to alloc_pages() is over 9000![1]

We can drastically improve that situation without losing the other
benefits of high-order allocations when they would succeed, by assuming
memory pressure is relatively constant over the course of an allocation,
and not retrying allocations at orders we know to have failed before.
This way, the best-case behaviour remains unchanged, and in the worst
case we should see at most a dozen or so (MAX_ORDER - 1) failed attempts
before falling back to single pages for the remainder of the buffer.

[1]:http://lists.infradead.org/pipermail/linux-arm-kernel/2015-December/394660.htmlReported-by: NDouglas Anderson <dianders@chromium.org>
Signed-off-by: NRobin Murphy <robin.murphy@arm.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

0a9afeda

iommu/dma: Add some missing #includes · 5b11e9cd

由 Robin Murphy 提交于 12月 18, 2015

dma-iommu.c was naughtily relying on an implicit transitive #include of
linux/vmalloc.h, which is apparently not present on some architectures.
Add that, plus a couple more headers for other functions which are used
similarly.
Reported-by: Nkbuild test robot <lkp@intel.com>
Signed-off-by: NRobin Murphy <robin.murphy@arm.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

5b11e9cd

19 12月, 2015 1 次提交

x86/cpufeature: Remove unused and seldomly used cpu_has_xx macros · 362f924b

由 Borislav Petkov 提交于 12月 07, 2015

Those are stupid and code should use static_cpu_has_safe() or
boot_cpu_has() instead. Kill the least used and unused ones.

The remaining ones need more careful inspection before a conversion can
happen. On the TODO.
Signed-off-by: NBorislav Petkov <bp@suse.de>
Link: http://lkml.kernel.org/r/1449481182-27541-4-git-send-email-bp@alien8.de
Cc: David Sterba <dsterba@suse.com>
Cc: Herbert Xu <herbert@gondor.apana.org.au>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Matt Mackall <mpm@selenic.com>
Cc: Chris Mason <clm@fb.com>
Cc: Josef Bacik <jbacik@fb.com>
Signed-off-by: NThomas Gleixner <tglx@linutronix.de>

362f924b