提交 · b236fa1d339670cc997b68c31be57855bbabc126 · openeuler / raspberrypi-kernel

16 3月, 2018 2 次提交

drm/amdgpu: Add helper to turn an existing VM into a compute VM · b236fa1d

由 Felix Kuehling 提交于 3月 15, 2018

v2: Removed updating and checking of vm->vm_context
v3: Enable amdgpu_vm_clear_bo in amdgpu_vm_make_compute
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

b236fa1d

drm/amdgpu: Move KFD-specific fields into struct amdgpu_vm · 5b21d3e5

由 Felix Kuehling 提交于 3月 15, 2018

Remove struct amdkfd_vm and move the fields into struct amdgpu_vm.
This will allow turning a VM created by a DRM render node into a
KFD VM.

v2: Removed vm_context field
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

5b21d3e5

20 2月, 2018 1 次提交

drm/amdgpu: reduce reserved VA size · 18d09e63

由 Christian König 提交于 1月 22, 2018

1MB should be more than enough, currently we use about 8K.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Acked-by: NMonk Liu <monk.liu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

18d09e63

28 12月, 2017 2 次提交

drm/amdgpu: drop client_id from VM · 0e36b9b2

由 Christian König 提交于 12月 18, 2017

Use the fence context from the scheduler entity.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NChunming Zhou <david1.zhou@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

0e36b9b2

drm/amdgpu: separate VMID and PASID handling · 620f774f

由 Christian König 提交于 12月 18, 2017

Move both into the new files amdgpu_ids.[ch]. No functional change.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NChunming Zhou <david1.zhou@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

620f774f

19 12月, 2017 1 次提交

drm/amdgpu: implement 2+1 PD support for Raven v3 · 6a42fd6f

由 Christian König 提交于 12月 05, 2017

Instead of falling back to 2 level and very limited address space use
2+1 PD support and 128TB + 512GB of virtual address space.

v2: cleanup defines, rebase on top of level enum
v3: fix inverted check in hardware setup
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-and-Tested-by: NChunming Zhou <david1.zhou@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

6a42fd6f

15 12月, 2017 1 次提交

drm/amdgpu: add enumerate for PDB/PTB v3 · 196f7489

由 Chunming Zhou 提交于 12月 13, 2017

v2:
  remove SUBPTB member
v3:
  remove last_level, use AMDGPU_VM_PTB directly instead.
Signed-off-by: NChunming Zhou <david1.zhou@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

196f7489

13 12月, 2017 2 次提交

drm/amdgpu: remove keeping the addr of the VM PDs · 78eb2f0c

由 Christian König 提交于 11月 30, 2017

No more double house keeping.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NChunming Zhou <david1.zhou@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

78eb2f0c

drm/amdgpu: remove last_entry_used from the VM code · 8f19cd78

由 Christian König 提交于 11月 30, 2017

Not needed any more.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NChunming Zhou <david1.zhou@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

8f19cd78

07 2月, 2018 1 次提交

drm/amdgpu: Fix header file dependencies · 61b100e9

由 Felix Kuehling 提交于 2月 06, 2018

Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NOded Gabbay <oded.gabbay@gmail.com>

61b100e9

08 12月, 2017 1 次提交

drm: move amd_gpu_scheduler into common location · 1b1f42d8

由 Lucas Stach 提交于 12月 06, 2017

This moves and renames the AMDGPU scheduler to a common location in DRM
in order to facilitate re-use by other drivers. This is mostly a straight
forward rename with no code changes.

One notable exception is the function to_drm_sched_fence(), which is no
longer a inline header function to avoid the need to export the
drm_sched_fence_ops_scheduled and drm_sched_fence_ops_finished structures.
Reviewed-by: NChunming Zhou <david1.zhou@amd.com>
Tested-by: NDieter Nützel <Dieter@nuetzel-hh.de>
Acked-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NLucas Stach <l.stach@pengutronix.de>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

1b1f42d8

07 12月, 2017 4 次提交

drm/amdgpu: move validation of the VM size into the VM code · f3368128

由 Christian König 提交于 11月 23, 2017

This moves validation of the VM size parameter into amdgpu_vm_adjust_size().
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

f3368128

drm/amdgpu: unify VM size handling of Vega10 with older generation · b38f41eb

由 Christian König 提交于 11月 22, 2017

One function to rule them all.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

b38f41eb

drm/amdgpu: fix VA hole handling on Vega10 v3 · bb7939b2

由 Christian König 提交于 11月 06, 2017

Similar to the CPU address space the VA on Vega10 has a hole in it.

v2: use dev_dbg instead of dev_err
v3: add some more comments to explain how the hw works
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
CC: stable@vger.kernel.org
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

bb7939b2

drm/amdgpu: cleanup vm_size handling · fdd5faaa

由 Christian König 提交于 11月 04, 2017

It's pointless to have the same value twice, just always use max_pfn.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

fdd5faaa

14 11月, 2017 1 次提交

drm/amdgpu: make AMDGPU_VA_RESERVED_SIZE 64bit · ff4cd389

由 Christian König 提交于 11月 06, 2017

Even when it's a small handle it as 64bit value as well.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

ff4cd389

10 10月, 2017 1 次提交

drm/amdgpu: Set the correct value for PDEs/PTEs of ATC memory on Raven · 6d16dac8

由 Yong Zhao 提交于 8月 31, 2017

Without the additional bits set in PDEs/PTEs, the ATC memory access
would have failed on Raven.
Signed-off-by: NYong Zhao <yong.zhao@amd.com>
Acked-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

6d16dac8

29 9月, 2017 1 次提交

drm/amdgpu: Handle GPUVM fault storms · c98171cc

由 Felix Kuehling 提交于 9月 21, 2017

When many wavefronts cause VM faults at the same time, it can
overwhelm the interrupt handler and cause IH ring overflows before
the driver can notify or kill the faulting application.

As a workaround I'm introducing limited per-VM fault credit. After
that number of VM faults have occurred, further VM faults are
filtered out at the prescreen stage of processing.

This depends on the PASID in the interrupt packet, so it currently
only works for KFD contexts.
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

c98171cc

27 9月, 2017 2 次提交

drm/amdgpu: Track pending retry faults in IH and VM (v2) · a2f14820

由 Felix Kuehling 提交于 8月 26, 2017

IH tracks pending retry faults in a hash table for fast lookup in
interrupt context. Each VM has a short FIFO of pending VM faults for
processing in a bottom half.

The IH prescreening stage adds retry faults and filters out repeated
retry interrupts to minimize the impact of interrupt storms.

It's the VM's responsibility remove pending faults once they are
handled. For now this is only done when the VM is destroyed.

v2:
- Made the hash table smaller and the FIFO longer. I never want the
  FIFO to fill up, because that would make prescreen take longer.
  128 pending page faults should be enough to keep migrations busy.
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Acked-by: Christian König <christian.koenig@amd.com> (v1)
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

a2f14820

drm/amdgpu: Add PASID management · 02208441

由 Felix Kuehling 提交于 8月 25, 2017

Allows assigning a PASID to a VM for identifying VMs involved in page
faults. The global PASID manager is also exported in the KFD
interface so that AMDGPU and KFD can share the PASID space.

PASIDs of different sizes can be requested. On APUs, the PASID size
is deterined by the capabilities of the IOMMU. So KFD must be able
to allocate PASIDs in a smaller range.
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Acked-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: NOded Gabbay <oded.gabbay@gmail.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

02208441

14 9月, 2017 1 次提交

drm/amdgpu: fix amdgpu_vm_handle_moved as well v2 · 4e55eb38

由 Christian König 提交于 9月 11, 2017

There is no guarantee that the last BO_VA actually needed an update.

Additional to that all command submissions must wait for moved BOs to
be cleared, not just the first one.

v2: Don't overwrite any newer fence.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NChunming Zhou <david1.zhou@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

4e55eb38

13 9月, 2017 2 次提交

drm/amdgpu: fix VM sync with always valid BOs v2 · d5884513

由 Christian König 提交于 9月 08, 2017

All users of a VM must always wait for updates with always
valid BOs to be completed.

v2: remove debugging leftovers, rename struct member
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NRoger He <Hongbo.He@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

d5884513

drm/amdgpu: rework amdgpu_cs_find_mapping · aebc5e6f

由 Christian König 提交于 9月 06, 2017

Use the VM instead of the BO list to find the BO for a virtual address.

This fixes UVD/VCE in physical mode with VM local BOs.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Acked-by: NLeo Liu <leo.liu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

aebc5e6f

09 9月, 2017 1 次提交

lib/interval_tree: fast overlap detection · f808c13f

由 Davidlohr Bueso 提交于 9月 08, 2017

Allow interval trees to quickly check for overlaps to avoid unnecesary
tree lookups in interval_tree_iter_first().

As of this patch, all interval tree flavors will require using a
'rb_root_cached' such that we can have the leftmost node easily
available.  While most users will make use of this feature, those with
special functions (in addition to the generic insert, delete, search
calls) will avoid using the cached option as they can do funky things
with insertions -- for example, vma_interval_tree_insert_after().

[jglisse@redhat.com: fix deadlock from typo vm_lock_anon_vma()]
  Link: http://lkml.kernel.org/r/20170808225719.20723-1-jglisse@redhat.com
Link: http://lkml.kernel.org/r/20170719014603.19029-12-dave@stgolabs.netSigned-off-by: NDavidlohr Bueso <dbueso@suse.de>
Signed-off-by: NJérôme Glisse <jglisse@redhat.com>
Acked-by: NChristian König <christian.koenig@amd.com>
Acked-by: NPeter Zijlstra (Intel) <peterz@infradead.org>
Acked-by: NDoug Ledford <dledford@redhat.com>
Acked-by: NMichael S. Tsirkin <mst@redhat.com>
Cc: David Airlie <airlied@linux.ie>
Cc: Jason Wang <jasowang@redhat.com>
Cc: Christian Benvenuti <benve@cisco.com>
Signed-off-by: NAndrew Morton <akpm@linux-foundation.org>
Signed-off-by: NLinus Torvalds <torvalds@linux-foundation.org>

f808c13f

01 9月, 2017 2 次提交

drm/amdgpu: add support for per VM BOs v2 · 73fb16e7

由 Christian König 提交于 8月 16, 2017

Per VM BOs are handled like VM PDs and PTs. They are always valid and don't
need to be specified in the BO lists.

v2: validate PDs/PTs first
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

73fb16e7

drm/amdgpu: rework page directory filling v2 · ea09729c

由 Christian König 提交于 8月 09, 2017

Keep track off relocated PDs/PTs instead of walking and checking all PDs.

v2: fix root PD handling
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com> (v1)
Reviewed-by: NChunming Zhou <david1.zhou@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

ea09729c

30 8月, 2017 4 次提交

drm/amdgpu: track evicted page tables v2 · 3f3333f8

由 Christian König 提交于 8月 03, 2017

Instead of validating all page tables when one was evicted,
track which one needs a validation.

v2: simplify amdgpu_vm_ready as well
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com> (v1)
Reviewed-by: NChunming Zhou <david1.zhou@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

3f3333f8

drm/amdgpu: add bo_va cleared flag again v2 · cb7b6ec2

由 Christian König 提交于 8月 15, 2017

We changed this to use an extra list a while back, but for the next
series I need a separate flag again.

v2: reorder to avoid unlocked list access
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NChunming Zhou <david1.zhou@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

cb7b6ec2

drm/amdgpu: rework moved handling in the VM v2 · 3d7d4d3a

由 Christian König 提交于 8月 23, 2017

Instead of using the vm_state use a separate flag to note
that the BO was moved.

v2: reorder patches to avoid temporary lockless access
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NChunming Zhou <david1.zhou@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

3d7d4d3a

drm/amdgpu: fix and cleanup VM ready check · 34d7be5d

由 Christian König 提交于 8月 24, 2017

Stop checking the mapped BO itself, cause that one is
certainly not a page table.

Additional to that move the code into amdgpu_vm.c
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NChunming Zhou <david1.zhou@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

34d7be5d

18 8月, 2017 7 次提交

drm/amd/amdgpu: expose fragment size as module parameter (v2) · d07f14be

由 Roger He 提交于 8月 15, 2017

Allow overrides on the command line.

v2: agd: sqaush in spelling fix and bogus default value warning
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NRoger He <Hongbo.He@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

d07f14be

drm/amd/amdgpu: store fragment_size in vm_manager · e618d306

由 Roger He 提交于 8月 11, 2017

adds fragment_size in the vm_manager structure and
implements hardware setup for it.
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NRoger He <Hongbo.He@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

e618d306

drm/amdgpu: rename VM invalidated to moved · 27c7b9ae

由 Christian König 提交于 8月 01, 2017

That better describes what happens here with the BO.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

27c7b9ae

drm/amdgpu: separate bo_va structure · ec681545

由 Christian König 提交于 8月 01, 2017

Split that into vm_bo_base and bo_va to allow other uses as well.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

ec681545

drm/amdgpu: drop the extra VM huge page flag v2 · 4ab4016a

由 Christian König 提交于 8月 03, 2017

Just add the flags to the addr field as well.

v2: add some more comments that the flag is for huge pages.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

4ab4016a

drm/amdgpu: cleanup static CSA handling · 0f4b3c68

由 Christian König 提交于 7月 31, 2017

Move the CSA bo_va from the VM to the fpriv structure.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

0f4b3c68

drm/amdgpu: only move VM BOs in the LRU during validation v2 · b6369225

由 Christian König 提交于 8月 03, 2017

This should save us a bunch of command submission overhead.

v2: move the LRU move to the right place to avoid the move for the root BO
    and handle the shadow BOs as well. This turned out to be a bug fix because
    the move needs to happen before the kmap.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Acked-by: NChunming Zhou <david1.zhou@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

b6369225

16 8月, 2017 1 次提交

drm/amdgpu: Support IOMMU on Raven · 51ac7eec

由 Yong Zhao 提交于 7月 27, 2017

We achieved that by setting S(SYSTEM) and P(PDE as PTE) bit to 1 for
PDEs and setting S bit to 1 for PTEs when the corresponding addresses
are not occupied by gpu driver allocated buffers.
Signed-off-by: NYong Zhao <Yong.Zhao@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

51ac7eec

26 7月, 2017 2 次提交

drm/amdgpu: enable huge page handling in the VM v5 · cf2f0a37

由 Alex Deucher 提交于 7月 25, 2017

The hardware can use huge pages to map 2MB of address space with only one PDE.

v2: few cleanups and rebased
v3: skip PT updates if we are using the PDE
v4: rebased, added support for CPU based updates
v5: fix CPU based updates once more
v6: fix ndw estimation
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-and-tested-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

cf2f0a37

drm/amdgpu: increase fragmentation size for Vega10 v2 · 6be7adb3

由 Christian König 提交于 5月 23, 2017

The fragment bits work differently for Vega10 compared to previous generations.

Increase the fragment size to 2MB for now to better handle that.

v2: handle the hardware setup as well
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-and-tested-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

6be7adb3