提交 · c8acb28b331364b32a5c81dbfbdfc8475b2f1f27 · openanolis / cloud-kernel

16 8月, 2017 8 次提交

iommu/vt-d: Allow to flush more than 4GB of device TLBs · c8acb28b

由 Joerg Roedel 提交于 8月 11, 2017

The shift qi_flush_dev_iotlb() is done on an int, which
limits the mask to 32 bits. Make the mask 64 bits wide so
that more than 4GB of address range can be flushed at once.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

c8acb28b

iommu/amd: Make use of iova queue flushing · 9003d618

由 Joerg Roedel 提交于 8月 10, 2017

Rip out the implementation in the AMD IOMMU driver and use
the one in the common iova code instead.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

9003d618

iommu/iova: Add flush timer · 9a005a80

由 Joerg Roedel 提交于 8月 10, 2017

Add a timer to flush entries from the Flush-Queues every
10ms. This makes sure that no stale TLB entries remain for
too long after an IOVA has been unmapped.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

9a005a80

iommu/iova: Add locking to Flush-Queues · 8109c2a2

由 Joerg Roedel 提交于 8月 10, 2017

The lock is taken from the same CPU most of the time. But
having it allows to flush the queue also from another CPU if
necessary.

This will be used by a timer to regularily flush any pending
IOVAs from the Flush-Queues.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

8109c2a2

iommu/iova: Add flush counters to Flush-Queue implementation · fb418dab

由 Joerg Roedel 提交于 8月 10, 2017

There are two counters:

	* fq_flush_start_cnt  - Increased when a TLB flush
	                        is started.

	* fq_flush_finish_cnt - Increased when a TLB flush
				is finished.

The fq_flush_start_cnt is assigned to every Flush-Queue
entry on its creation. When freeing entries from the
Flush-Queue, the value in the entry is compared to the
fq_flush_finish_cnt. The entry can only be freed when its
value is less than the value of fq_flush_finish_cnt.

The reason for these counters it to take advantage of IOMMU
TLB flushes that happened on other CPUs. These already
flushed the TLB for Flush-Queue entries on other CPUs so
that they can already be freed without flushing the TLB
again.

This makes it less likely that the Flush-Queue is full and
saves IOMMU TLB flushes.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

fb418dab

iommu/iova: Implement Flush-Queue ring buffer · 19282101

由 Joerg Roedel 提交于 8月 10, 2017

Add a function to add entries to the Flush-Queue ring
buffer. If the buffer is full, call the flush-callback and
free the entries.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

19282101

iommu/iova: Add flush-queue data structures · 42f87e71

由 Joerg Roedel 提交于 8月 10, 2017

This patch adds the basic data-structures to implement
flush-queues in the generic IOVA code. It also adds the
initialization and destroy routines for these data
structures.

The initialization routine is designed so that the use of
this feature is optional for the users of IOVA code.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

42f87e71

iommu/of: Fix of_iommu_configure() for disabled IOMMUs · da4b0275

由 Robin Murphy 提交于 8月 04, 2017

Sudeep reports that the logic got slightly broken when a PCI iommu-map
entry targets an IOMMU marked as disabled in DT, since of_pci_map_rid()
succeeds in following a phandle, and of_iommu_xlate() doesn't return an
error value, but we miss checking whether ops was actually non-NULL.
Whilst this could be solved with a point fix in of_pci_iommu_init(), it
suggests that all the juggling of ERR_PTR values through the ops pointer
is proving rather too complicated for its own good, so let's instead
simplify the whole flow (with a side-effect of eliminating the cause of
the bug).

The fact that we now rely on iommu_fwspec means that we no longer need
to pass around an iommu_ops pointer at all - we can simply propagate a
regular int return value until we know whether we have a viable IOMMU,
then retrieve the ops from the fwspec if and when we actually need them.
This makes everything a bit more uniform and certainly easier to follow.

Fixes: d87beb74 ("iommu/of: Handle PCI aliases properly")
Reported-by: NSudeep Holla <sudeep.holla@arm.com>
Tested-by: NSudeep Holla <sudeep.holla@arm.com>
Signed-off-by: NRobin Murphy <robin.murphy@arm.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

da4b0275

10 8月, 2017 4 次提交

iommu: Finish making iommu_group support mandatory · 05f80300

由 Robin Murphy 提交于 7月 21, 2017

Now that all the drivers properly implementing the IOMMU API support
groups (I'm ignoring the etnaviv GPU MMUs which seemingly only do just
enough to convince the ARM DMA mapping ops), we can remove the FIXME
workarounds from the core code. In the process, it also seems logical to
make the .device_group callback non-optional for drivers calling
iommu_group_get_for_dev() - the current callers all implement it anyway,
and it doesn't make sense for any future callers not to either.
Signed-off-by: NRobin Murphy <robin.murphy@arm.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

05f80300

iommu/tegra-gart: Add iommu_group support · 15f9a310

由 Robin Murphy 提交于 7月 21, 2017

As the last step to making groups mandatory, clean up the remaining
drivers by adding basic support. Whilst it may not perfectly reflect the
isolation capabilities of the hardware, using generic_device_group()
should at least maintain existing behaviour with respect to the API.
Signed-off-by: NRobin Murphy <robin.murphy@arm.com>
Tested-by: NDmitry Osipenko <digetx@gmail.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

15f9a310

iommu/tegra-smmu: Add iommu_group support · d92e1f84

由 Robin Murphy 提交于 7月 21, 2017

As the last step to making groups mandatory, clean up the remaining
drivers by adding basic support. Whilst it may not perfectly reflect
the isolation capabilities of the hardware (tegra_smmu_swgroup sounds
suspiciously like something that might warrant representing at the
iommu_group level), using generic_device_group() should at least
maintain existing behaviour with respect to the API.
Signed-off-by: NRobin Murphy <robin.murphy@arm.com>
Tested-by: NMikko Perttunen <mperttunen@nvidia.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

d92e1f84

iommu/msm: Add iommu_group support · ce2eb8f4

由 Robin Murphy 提交于 7月 21, 2017

As the last step to making groups mandatory, clean up the remaining
drivers by adding basic support. Whilst it may not perfectly reflect the
isolation capabilities of the hardware, using generic_device_group()
should at least maintain existing behaviour with respect to the API.
Signed-off-by: NRobin Murphy <robin.murphy@arm.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

ce2eb8f4

26 7月, 2017 2 次提交

iommu: Convert to using %pOF instead of full_name · 6bd4f1c7

由 Rob Herring 提交于 7月 18, 2017

Now that we have a custom printf format specifier, convert users of
full_name to use %pOF instead. This is preparation to remove storing
of the full path string for each node.
Signed-off-by: NRob Herring <robh@kernel.org>
Cc: Joerg Roedel <joro@8bytes.org>
Cc: Heiko Stuebner <heiko@sntech.de>
Cc: iommu@lists.linux-foundation.org
Cc: linux-arm-kernel@lists.infradead.org
Cc: linux-rockchip@lists.infradead.org
Reviewed-by: NHeiko Stuebner <heiko@sntech.de>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

6bd4f1c7

iommu/of: Handle PCI aliases properly · d87beb74

由 Robin Murphy 提交于 5月 31, 2017

When a PCI device has DMA quirks, we need to ensure that an upstream
IOMMU knows about all possible aliases, since the presence of a DMA
quirk does not preclude the device still also emitting transactions
(e.g. MSIs) on its 'real' RID. Similarly, the rules for bridge aliasing
are relatively complex, and some bridges may only take ownership of
transactions under particular transient circumstances, leading again to
multiple RIDs potentially being seen at the IOMMU for the given device.

Take all this into account in the OF code by translating every RID
produced by the alias walk, not just whichever one comes out last.
Happily, this also makes things tidy enough that we can reduce the
number of both total lines of code, and confusing levels of indirection,
by pulling the "iommus"/"iommu-map" parsing helpers back in-line again.
Signed-off-by: NRobin Murphy <robin.murphy@arm.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

d87beb74

28 6月, 2017 10 次提交

x86: remove arch specific dma_supported implementation · 5860acc1

由 Christoph Hellwig 提交于 5月 22, 2017

And instead wire it up as method for all the dma_map_ops instances.

Note that this also means the arch specific check will be fully instead
of partially applied in the AMD iommu driver.
Signed-off-by: NChristoph Hellwig <hch@lst.de>

5860acc1

iommu/amd: implement ->mapping_error · a869572c

由 Christoph Hellwig 提交于 5月 21, 2017

DMA_ERROR_CODE is going to go away, so don't rely on it.
Signed-off-by: NChristoph Hellwig <hch@lst.de>

a869572c

iommu/amd: Fix interrupt remapping when disable guest_mode · 84a21dbd

由 Suravee Suthikulpanit 提交于 6月 26, 2017

Pass-through devices to VM guest can get updated IRQ affinity
information via irq_set_affinity() when not running in guest mode.
Currently, AMD IOMMU driver in GA mode ignores the updated information
if the pass-through device is setup to use vAPIC regardless of guest_mode.
This could cause invalid interrupt remapping.

Also, the guest_mode bit should be set and cleared only when
SVM updates posted-interrupt interrupt remapping information.
Signed-off-by: NSuravee Suthikulpanit <suravee.suthikulpanit@amd.com>
Cc: Joerg Roedel <jroedel@suse.de>
Fixes: d98de49a ('iommu/amd: Enable vAPIC interrupt remapping mode by default')
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

84a21dbd

iommu/vt-d: Constify intel_dma_ops · 01e1932a

由 Arvind Yadav 提交于 6月 28, 2017

Most dma_map_ops structures are never modified. Constify these
structures such that these can be write-protected.
Signed-off-by: NArvind Yadav <arvind.yadav.cs@gmail.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

01e1932a

iommu: Warn once when device_group callback returns NULL · 72dcac63

由 Joerg Roedel 提交于 6月 28, 2017

This callback should never return NULL. Print a warning if
that happens so that we notice and can fix it.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

72dcac63

iommu/omap: Return ERR_PTR in device_group call-back · 8faf5e5a

由 Joerg Roedel 提交于 6月 28, 2017

Make sure that the device_group callback returns an ERR_PTR
instead of NULL.
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

8faf5e5a

iommu: Return ERR_PTR() values from device_group call-backs · 7f7a2304

由 Joerg Roedel 提交于 6月 28, 2017

The generic device_group call-backs in iommu.c return NULL
in case of error. Since they are getting ERR_PTR values from
iommu_group_alloc(), just pass them up instead.
Reported-by: NGerald Schaefer <gerald.schaefer@de.ibm.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

7f7a2304

iommu/s390: Use iommu_group_get_for_dev() in s390_iommu_add_device() · 0929deca

由 Joerg Roedel 提交于 6月 15, 2017

The iommu_group_get_for_dev() function also attaches the
device to its group, so this code doesn't need to be in the
iommu driver.

Further by using this function the driver can make use of
default domains in the future.
Reviewed-by: NGerald Schaefer <gerald.schaefer@de.ibm.com>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

0929deca

iommu/vt-d: Don't disable preemption while accessing deferred_flush() · 58c4a95f

由 Sebastian Andrzej Siewior 提交于 6月 27, 2017

get_cpu() disables preemption and returns the current CPU number. The
CPU number is only used once while retrieving the address of the local's
CPU deferred_flush pointer.
We can instead use raw_cpu_ptr() while we remain preemptible. The worst
thing that can happen is that flush_unmaps_timeout() is invoked multiple
times: once by taskA after seeing HIGH_WATER_MARK and then preempted to
another CPU and then by taskB which saw HIGH_WATER_MARK on the same CPU
as taskA. It is also likely that ->size got from HIGH_WATER_MARK to 0
right after its read because another CPU invoked flush_unmaps_timeout()
for this CPU.
The access to flush_data is protected by a spinlock so even if we get
migrated to another CPU or preempted - the data structure is protected.

While at it, I marked deferred_flush static since I can't find a
reference to it outside of this file.

Cc: David Woodhouse <dwmw2@infradead.org>
Cc: Joerg Roedel <joro@8bytes.org>
Cc: iommu@lists.linux-foundation.org
Cc: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: NSebastian Andrzej Siewior <bigeasy@linutronix.de>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

58c4a95f

iommu/iova: Don't disable preempt around this_cpu_ptr() · aaffaa8a

由 Sebastian Andrzej Siewior 提交于 6月 27, 2017

Commit 583248e6 ("iommu/iova: Disable preemption around use of
this_cpu_ptr()") disables preemption while accessing a per-CPU variable.
This does keep lockdep quiet. However I don't see the point why it is
bad if we get migrated after its access to another CPU.
__iova_rcache_insert() and __iova_rcache_get() immediately locks the
variable after obtaining it - before accessing its members.
_If_ we get migrated away after retrieving the address of cpu_rcache
before taking the lock then the *other* task on the same CPU will
retrieve the same address of cpu_rcache and will spin on the lock.

alloc_iova_fast() disables preemption while invoking
free_cpu_cached_iovas() on each CPU. The function itself uses
per_cpu_ptr() which does not trigger a warning (like this_cpu_ptr()
does). It _could_ make sense to use get_online_cpus() instead but the we
have a hotplug notifier for CPU down (and none for up) so we are good.

Cc: Joerg Roedel <joro@8bytes.org>
Cc: iommu@lists.linux-foundation.org
Cc: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: NSebastian Andrzej Siewior <bigeasy@linutronix.de>
Signed-off-by: NJoerg Roedel <jroedel@suse.de>

aaffaa8a

24 6月, 2017 16 次提交

iommu/arm-smmu-v3: Add workaround for Cavium ThunderX2 erratum #126 · f935448a

由 Geetha Sowjanya 提交于 6月 23, 2017

Cavium ThunderX2 SMMU doesn't support MSI and also doesn't have unique irq
lines for gerror, eventq and cmdq-sync.

New named irq "combined" is set as a errata workaround, which allows to
share the irq line by register single irq handler for all the interrupts.
Acked-by: NLorenzo Pieralisi <lorenzo.pieralisi@arm.com>
Signed-off-by: NGeetha sowjanya <gakula@caviumnetworks.com>
[will: reworked irq equality checking and added SPI check]
Signed-off-by: NWill Deacon <will.deacon@arm.com>

f935448a

iommu/arm-smmu-v3: Enable ACPI based HiSilicon CMD_PREFETCH quirk(erratum 161010701) · 99caf177

由 shameer 提交于 5月 17, 2017

HiSilicon SMMUv3 on Hip06/Hip07 platforms doesn't support CMD_PREFETCH
command. The dt based support for this quirk is already present in the
driver(hisilicon,broken-prefetch-cmd). This adds ACPI support for the
quirk using the IORT smmu model number.
Signed-off-by: Nshameer <shameerali.kolothum.thodi@huawei.com>
Signed-off-by: Nhanjun <guohanjun@huawei.com>
[will: rewrote patch]
Signed-off-by: NWill Deacon <will.deacon@arm.com>

99caf177

iommu/arm-smmu-v3: Add workaround for Cavium ThunderX2 erratum #74 · e5b829de

由 Linu Cherian 提交于 6月 22, 2017

Cavium ThunderX2 SMMU implementation doesn't support page 1 register space
and PAGE0_REGS_ONLY option is enabled as an errata workaround.
This option when turned on, replaces all page 1 offsets used for
EVTQ_PROD/CONS, PRIQ_PROD/CONS register access with page 0 offsets.

SMMU resource size checks are now based on SMMU option PAGE0_REGS_ONLY,
since resource size can be either 64k/128k.
For this, arm_smmu_device_dt_probe/acpi_probe has been moved before
platform_get_resource call, so that SMMU options are set beforehand.
Signed-off-by: NLinu Cherian <linu.cherian@cavium.com>
Signed-off-by: NGeetha Sowjanya <geethasowjanya.akula@cavium.com>
Signed-off-by: NWill Deacon <will.deacon@arm.com>

e5b829de

iommu/arm-smmu-v3, acpi: Add temporary Cavium SMMU-V3 IORT model number definitions · 12275bf0

由 Robert Richter 提交于 6月 22, 2017

The model number is already defined in acpica and we are actually
waiting for the acpi maintainers to include it:

 https://github.com/acpica/acpica/commit/d00a4eb86e64

Adding those temporary definitions until the change makes it into
include/acpi/actbl2.h. Once that is done this patch can be reverted.
Acked-by: NLorenzo Pieralisi <lorenzo.pieralisi@arm.com>
Signed-off-by: NRobert Richter <rrichter@cavium.com>
Signed-off-by: NWill Deacon <will.deacon@arm.com>

12275bf0

iommu/io-pgtable-arm: Use dma_wmb() instead of wmb() when publishing table · 77f34458

由 Will Deacon 提交于 6月 23, 2017

When writing a new table entry, we must ensure that the contents of the
table is made visible to the SMMU page table walker before the updated
table entry itself.

This is currently achieved using wmb(), which expands to an expensive and
unnecessary DSB instruction. Ideally, we'd just use cmpxchg64_release when
writing the table entry, but this doesn't have memory ordering semantics
on !SMP systems.

Instead, use dma_wmb(), which emits DMB OSHST. Strictly speaking, this
does more than we require (since it targets the outer-shareable domain),
but it's likely to be significantly faster than the DSB approach.
Reported-by: NLinu Cherian <linu.cherian@cavium.com>
Suggested-by: NRobin Murphy <robin.murphy@arm.com>
Signed-off-by: NWill Deacon <will.deacon@arm.com>

77f34458

iommu/io-pgtable: depend on !GENERIC_ATOMIC64 when using COMPILE_TEST with LPAE · c1004803

由 Will Deacon 提交于 6月 23, 2017

The LPAE/ARMv8 page table format relies on the ability to read and write
64-bit page table entries in an atomic fashion. With the move to a lockless
implementation, we also need support for cmpxchg64 to resolve races when
installing table entries concurrently.

Unfortunately, not all architectures support cmpxchg64, so the code can
fail to compiler when building for these architectures using COMPILE_TEST.
Rather than disable COMPILE_TEST altogether, instead check that
GENERIC_ATOMIC64 is not selected, which is a reasonable indication that
the architecture has support for 64-bit cmpxchg.
Reported-by: Nkbuild test robot <fengguang.wu@intel.com>
Signed-off-by: NWill Deacon <will.deacon@arm.com>

c1004803

iommu/arm-smmu-v3: Remove io-pgtable spinlock · 58188afe