提交 · 147202aa772329a02c6e80bc2b7a6b8dd3deac0b · openeuler / Kernel

08 7月, 2009 1 次提交

intel-iommu: Speed up map routines by using cached domain ASAP · 147202aa

由 David Woodhouse 提交于 7月 07, 2009

We did before, in the end -- but it was at the bottom of a long stack of
functions. Add an inline wrapper get_valid_domain_for_dev() which will
use the cached one _first_ and only make the out-of-line call if it's
not already set.

This takes the average time taken for a 1-page intel_map_sg() from 5961
cycles to 4812 cycles on my Lenovo x200s test box -- a modest 20%.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

147202aa

05 7月, 2009 2 次提交

intel-iommu: Don't use identity mapping for PCI devices behind bridges · 3dfc813d

由 David Woodhouse 提交于 7月 04, 2009

Our current strategy for pass-through mode is to put all devices into
the 1:1 domain at startup (which is before we know what their dma_mask
will be), and only _later_ take them out of that domain, if it turns out
that they really can't address all of memory.

However, when there are a bunch of PCI devices behind a bridge, they all
end up with the same source-id on their DMA transactions, and hence in
the same IOMMU domain. This means that we _can't_ easily move them from
the 1:1 domain into their own domain at runtime, because there might be DMA
in-flight from their siblings.

So we have to adjust our pass-through strategy: For PCI devices not on
the root bus, and for the bridges which will take responsibility for
their transactions, we have to start up _out_ of the 1:1 domain, just in
case.

This fixes the BUG() we see when we have 32-bit-capable devices behind a
PCI-PCI bridge, and use the software identity mapping.

It does mean that we might end up using 'normal' mapping mode for some
devices which could actually live with the faster 1:1 mapping -- but
this is only for PCI devices behind bridges, which presumably aren't the
devices for which people are most concerned about performance.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

3dfc813d

intel-iommu: Use iommu_should_identity_map() at startup time too. · 6941af28

由 David Woodhouse 提交于 7月 04, 2009

At boot time, the dma_mask won't have been set on any devices, so we
assume that all devices will be 64-bit capable (and thus get a 1:1 map).
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

6941af28

04 7月, 2009 6 次提交

intel-iommu: No mapping for non-PCI devices · 73676832

由 David Woodhouse 提交于 7月 04, 2009

This should fix kernel.org bug #11821, where the dcdbas driver makes up
a platform device and then uses dma_alloc_coherent() on it, in an
attempt to get memory < 4GiB.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

73676832

intel-iommu: Restore DMAR_BROKEN_GFX_WA option for broken graphics drivers · 62edf5dc

由 David Woodhouse 提交于 7月 04, 2009

We need to give people a little more time to fix the broken drivers.
Re-introduce this, but tied in properly with the 'iommu=pt' support this
time. Change the config option name and make it default to 'no' too.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

62edf5dc

intel-iommu: Add iommu_should_identity_map() function · 40e4aa34

由 David Woodhouse 提交于 7月 04, 2009

We do this twice, and it's about to get more complicated. This makes the
code slightly clearer about what it's doing, too.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

40e4aa34

intel-iommu: Fix reattaching of devices to identity mapping domain · 1b7bc0a1

由 David Woodhouse 提交于 7月 04, 2009

When we reattach a device to the si_domain (because it's been removed
from a VM), we weren't calling domain_context_mapping() to actually tell
the hardware about that.

We should really put the call to domain_context_mapping() into
domain_add_dev_info() -- we never call the latter without also doing the
former, and we can keep the error paths simple that way. But that's a
cleanup which can wait for 2.6.32 now.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

1b7bc0a1

intel-iommu: Don't set identity mapping for bypassed graphics devices · 1e4c64c4

由 David Woodhouse 提交于 7月 04, 2009

We should check iommu_dummy() _first_, because that means it's attached
to an iommu that we've just disabled completely. At the moment, we might
try to put the device into the identity mapping domain.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

1e4c64c4

intel-iommu: Fix dma vs. mm page confusion with aligned_nrpages() · 5a5e02a6

由 David Woodhouse 提交于 7月 04, 2009

The aligned_nrpages() function rounds up to the next VM page, but
returns its result as a number of DMA pages.

Purely theoretical except on IA64, which doesn't boot with VT-d right
now anyway.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

5a5e02a6

02 7月, 2009 6 次提交

intel-iommu: Don't keep freeing page zero in dma_pte_free_pagetable() · 6a43e574

由 David Woodhouse 提交于 7月 02, 2009

Check dma_pte_present() and only free the page if there _is_ one.
Kind of surprising that there was no warning about this.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

6a43e574

intel-iommu: Introduce first_pte_in_page() to simplify PTE-setting loops · 75e6bf96

由 David Woodhouse 提交于 7月 02, 2009

On Wed, 2009-07-01 at 16:59 -0700, Linus Torvalds wrote:
> I also _really_ hate how you do
>
>         (unsigned long)pte >> VTD_PAGE_SHIFT ==
>         (unsigned long)first_pte >> VTD_PAGE_SHIFT

Kill this, in favour of just looking to see if the incremented pte
pointer has 'wrapped' onto the next page. Which means we have to check
it _after_ incrementing it, not before.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

75e6bf96

D
intel-iommu: Use cmpxchg64_local() for setting PTEs · 7766a3fb
由 David Woodhouse 提交于 7月 01, 2009
```
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
```
7766a3fb

intel-iommu: Warn about unmatched unmap requests · 85b98276

由 David Woodhouse 提交于 7月 01, 2009

This would have found the bug in i386 pci_unmap_addr() a long time ago.
We shouldn't just silently return without doing anything.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

85b98276

intel-iommu: Kill superfluous mapping_lock · 206a73c1

由 David Woodhouse 提交于 7月 01, 2009

Since we're using cmpxchg64() anyway (because that's the only way to do
an atomic 64-bit store on i386), we might as well ditch the extra
locking and just use cmpxchg64() to ensure that we don't add the page
twice.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

206a73c1

D
intel-iommu: Ensure that PTE writes are 64-bit atomic, even on i386 · c85994e4
由 David Woodhouse 提交于 7月 01, 2009
```
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
```
c85994e4

30 6月, 2009 5 次提交

intel-iommu: Performance improvement for dma_pte_free_pagetable() · f3a0a52f

由 David Woodhouse 提交于 6月 30, 2009

As with other functions, batch the CPU data cache flushes and don't keep
recalculating PTE addresses.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

f3a0a52f

intel-iommu: Don't free too much in dma_pte_free_pagetable() · 3d7b0e41

由 David Woodhouse 提交于 6月 30, 2009

The loop condition was wrong -- we should free a PMD only if its
_entire_ range is within the range we're intending to clear. The
early-termination condition was right, but not the loop.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

3d7b0e41

D
intel-iommu: dump mappings but don't die on pte already set · 1bf20f0d
由 David Woodhouse 提交于 6月 29, 2009
```
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
```
1bf20f0d
D
intel-iommu: Combine domain_pfn_mapping() and domain_sg_mapping() · 9051aa02
由 David Woodhouse 提交于 6月 29, 2009
```
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
```
9051aa02

intel-iommu: Introduce domain_sg_mapping() to speed up intel_map_sg() · e1605495

由 David Woodhouse 提交于 6月 29, 2009

Instead of calling domain_pfn_mapping() repeatedly with single or
small numbers of pages, just pass the sglist in. It can optimise the
number of cache flushes like domain_pfn_mapping() does, and gives a huge
speedup for large scatterlists.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

e1605495

29 6月, 2009 20 次提交

intel-iommu: Simplify __intel_alloc_iova() · 875764de

由 David Woodhouse 提交于 6月 28, 2009

There's no need for the separate iommu_alloc_iova() function, and
certainly not for it to be global. Remove the underscores while we're at
it.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

875764de

intel-iommu: Performance improvement for domain_pfn_mapping() · 6f6a00e4

由 David Woodhouse 提交于 6月 28, 2009

As with dma_pte_clear_range(), don't keep flushing a single PTE at a
time. And also micro-optimise the setting of PTE values rather than
using the helper functions to do all the masking.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

6f6a00e4

intel-iommu: Performance improvement for dma_pte_clear_range() · 310a5ab9

由 David Woodhouse 提交于 6月 28, 2009

It's a bit silly to repeatedly call domain_flush_cache() for each PTE
individually, as we clear it. Instead, batch them up and flush a whole
range at a time. We might as well refrain from recalculating the PTE
address from scratch each time round the loop too.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

310a5ab9

D
intel-iommu: Clean up iommu_domain_identity_map() · c5395d5c
由 David Woodhouse 提交于 6月 28, 2009
```
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
```
c5395d5c

intel-iommu: Remove last use of PHYSICAL_PAGE_MASK, for reserving PCI BARs · 1a4a4551

由 David Woodhouse 提交于 6月 28, 2009

This is fairly broken anyway -- it doesn't take hotplug into account.
We should probably be checking page_is_ram() instead.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

1a4a4551

intel-iommu: Make iommu_flush_iotlb_psi() take pfn as argument · 03d6a246

由 David Woodhouse 提交于 6月 28, 2009

Most of its callers are having to shift for themselves anyway, so we might
as well do it in iommu_flush_iotlb_psi().
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

03d6a246

D
intel-iommu: Change aligned_size() to aligned_nrpages() · 88cb6a74
由 David Woodhouse 提交于 6月 28, 2009
```
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
```
88cb6a74
D
intel-iommu: Clean up intel_map_sg(), remove domain_page_mapping() · b536d24d
由 David Woodhouse 提交于 6月 28, 2009
```
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
```
b536d24d
D
intel-iommu: Use domain_pfn_mapping() in intel_iommu_map_range() · ad051221
由 David Woodhouse 提交于 6月 28, 2009
```
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
```
ad051221
D
intel-iommu: Use domain_pfn_mapping() in __intel_map_single() · 0ab36de2
由 David Woodhouse 提交于 6月 28, 2009
```
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
```
0ab36de2

intel-iommu: Introduce domain_pfn_mapping() · 61df7443

由 David Woodhouse 提交于 6月 28, 2009

... and use it in the trivial cases; the other callers want individual
(and bisectable) attention, since I screwed them up the first time...

Make the BUG_ON() happen on too-large virtual address rather than
physical address, too. That's the one we care about.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

61df7443

D
intel-iommu: Clean up address handling in domain_page_mapping() · 1c5a46ed
由 David Woodhouse 提交于 6月 28, 2009
```
No more masking and alignment; just use pfns.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
```
1c5a46ed
D
intel-iommu: Change addr_to_dma_pte() to pfn_to_dma_pte() · b026fd28
由 David Woodhouse 提交于 6月 28, 2009
```
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
```
b026fd28

intel-iommu: Clean up intel_iommu_unmap_range() · 163cc52c

由 David Woodhouse 提交于 6月 28, 2009

Use unaligned address for domain->max_addr. That algorithm isn't ideal
anyway -- we should probably just look at the last iova in the tree.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

163cc52c

intel-iommu: Make dma_pte_free_pagetable() take pfns as argument · d794dc9b

由 David Woodhouse 提交于 6月 28, 2009

With some cleanup of intel_unmap_page(), intel_unmap_sg() and
vm_domain_exit() to no longer play with 64-bit addresses.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>

d794dc9b

D
intel-iommu: Make dma_pte_free_pagetable() use pfns · 6660c63a
由 David Woodhouse 提交于 6月 27, 2009
```
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
```
6660c63a
D
intel-iommu: Make dma_pte_clear_range() take pfns as argument · 595badf5
由 David Woodhouse 提交于 6月 27, 2009
```
Noting that this is now an _inclusive_ range.
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
```
595badf5
D
intel-iommu: Make dma_pte_clear_range() use pfns · 04b18e65
由 David Woodhouse 提交于 6月 27, 2009
```
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
```
04b18e65
D
intel-iommu: Don't just mask out too-big physical addresses; BUG() instead · 66eae846
由 David Woodhouse 提交于 6月 27, 2009
```
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
```
66eae846
D
intel-iommu: Make dma_pte_clear_one() take pfn not address · a75f7cf9
由 David Woodhouse 提交于 6月 27, 2009
```
Signed-off-by: NDavid Woodhouse <David.Woodhouse@intel.com>
```
a75f7cf9

openeuler / Kernel 大约 1 年 前同步成功

openeuler / Kernel
大约 1 年前同步成功