提交 · 26eedf6daec4e7937c8f0f1dde5e9b8e3dcebfd3 · openeuler / Kernel

10 10月, 2017 3 次提交

drm/amdgpu: introduce AMDGPU_GEM_CREATE_EXPLICIT_SYNC v2 · 177ae09b

由 Andres Rodriguez 提交于 9月 15, 2017

Introduce a flag to signal that access to a BO will be synchronized
through an external mechanism.

Currently all buffers shared between contexts are subject to implicit
synchronization. However, this is only required for protocols that
currently don't support an explicit synchronization mechanism (DRI2/3).

This patch introduces the AMDGPU_GEM_CREATE_EXPLICIT_SYNC, so that
users can specify when it is safe to disable implicit sync.

v2: only disable explicit sync in amdgpu_cs_ioctl
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAndres Rodriguez <andresx7@gmail.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

177ae09b

drm/amdgpu: add helper to convert a ttm bo to amdgpu_bo · b82485fd

由 Andres Rodriguez 提交于 9月 15, 2017

Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAndres Rodriguez <andresx7@gmail.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

b82485fd

drm/amdgpu: Reserve shared memory on VRAM for SR-IOV · a05502e5

由 Horace Chen 提交于 9月 29, 2017

SR-IOV need to reserve a piece of shared VRAM at the exact place
to exchange data betweem PF and VF. The start address and size of
the shared mem are passed to guest through VBIOS structure
VRAM_UsageByFirmware.

VRAM_UsageByFirmware is a general feature in VBIOS, it indicates
that VBIOS need to reserve a piece of memory on the VRAM.

Because the mem address is specified. Reserve it early in
amdgpu_ttm_init to make sure that it can monoplize the space.
Signed-off-by: NHorace Chen <horace.chen@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

a05502e5

27 9月, 2017 5 次提交

drm/amdgpu: Fix a bug in amdgpu_fill_buffer() · 7bdc53f9

由 Yong Zhao 提交于 9月 15, 2017

When max_bytes is not 8 bytes aligned and bo size is larger than
max_bytes, the last 8 bytes in a ttm node may be left unchanged.
For example, on pre SDMA 4.0, max_bytes = 0x1fffff, and the bo size
is 0x200000, the problem will happen.

In order to fix the problem, we separately store the max nums of
PTEs/PDEs a single operation can set in amdgpu_vm_pte_funcs
structure, rather than inferring it from bytes limit of SDMA
constant fill, i.e. fill_max_bytes.

Together with the fix, we replace the hard code value "10" in
amdgpu_vm_bo_update_mapping() with the corresponding values from
structure amdgpu_vm_pte_funcs.
Signed-off-by: NYong Zhao <yong.zhao@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

7bdc53f9

drm/amd/amdgpu: Partial revert of iova debugfs · 10cfafd6

由 Tom St Denis 提交于 9月 19, 2017

We discovered that on some devices even with iommu enabled
you can access all of system memory through the iommu translation.

Therefore, we revert the read method to the translation only service
and drop the write method completely.
Signed-off-by: NTom St Denis <tom.stdenis@amd.com>
Reviewed-by: NChristan König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

10cfafd6

drm/amd/amdgpu: remove usage of ttm trace · 79ba2800

由 Tom St Denis 提交于 9月 18, 2017

Signed-off-by: NTom St Denis <tom.stdenis@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

79ba2800

drm/amd/amdgpu: add support for iova_to_phys to replace TTM trace (v5) · 38290b2c

由 Tom St Denis 提交于 9月 18, 2017

Signed-off-by: NTom St Denis <tom.stdenis@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

(v2): Add domain to iova debugfs
(v3): Add true read/write methods to access system memory of pages
      mapped to the device
(v4): Move get_domain call out of loop and return on error
(v5): Just use kmap/kunmap

38290b2c

drm/amd/amdgpu: Fold TTM debugfs entries into array (v2) · a40cfa0b

由 Tom St Denis 提交于 9月 18, 2017

Signed-off-by: NTom St Denis <tom.stdenis@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

(v2): add domains and avoid strcmp

a40cfa0b

19 9月, 2017 1 次提交

drm/amd/amdgpu: Support VM environments in amdgpu_ttm_access_memory() · 97bae49c

由 Tom St Denis 提交于 9月 14, 2017

Signed-off-by: NTom St Denis <tom.stdenis@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

97bae49c

14 9月, 2017 1 次提交

drm/amd/amdgpu: Change vram debugfs to NO_KIQ for VM environments · c3057281

由 Tom St Denis 提交于 9月 13, 2017

Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NTom St Denis <tom.stdenis@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

c3057281

13 9月, 2017 4 次提交

drm/amdgpu: move userptr BOs to CPU domain during CS v2 · 1b0c0f9d

由 Christian König 提交于 9月 05, 2017

Instead of moving them in the MMU notifier move them during CS.

v2: still mark pages as accessed/dirty
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com> (v1)
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

1b0c0f9d

drm/amdgpu: stop using BO status for user pages · ca666a3c

由 Christian König 提交于 9月 05, 2017

Instead use a counter to figure out if we need to set new pages or not.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

ca666a3c

drm/amdgpu: move taking mmap_sem into get_user_pages v2 · b72cf4fc

由 Christian König 提交于 9月 03, 2017

This didn't helped as intended, just simplify the code.

v2: unlock mmap_sem in the error path as well
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

b72cf4fc

drm/amdgpu: fix userptr put_page handling · a216ab09

由 Christian König 提交于 9月 02, 2017

Move calling put_page into the unpopulate callback. Otherwise we mess up the pages
reference count when it is unbound multiple times.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

a216ab09

02 9月, 2017 1 次提交

drm/amdgpu: fix placement flags in amdgpu_ttm_bind · 70a9c6b9

由 Christian König 提交于 9月 01, 2017

Otherwise we lose the NO_EVICT flag and can try to evict pinned BOs.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

70a9c6b9

30 8月, 2017 5 次提交

drm/amd/amdgpu: Add write() method to VRAM debugfs entry (v2) · 08cab989

由 Tom St Denis 提交于 8月 29, 2017

Allows writing data to vram via debugfs.
Signed-off-by: NTom St Denis <tom.stdenis@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>

(v2):  Call get_user before holding spinlock.
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

08cab989

drm/amd/amdgpu: Use new TTM populate/map helper function · 7405e0da

由 Tom St Denis 提交于 8月 18, 2017

Signed-off-by: NTom St Denis <tom.stdenis@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

7405e0da

drm/amd/amdgpu: Remove AMDGPU tracepoint and use new TTM tracepoint (v2) · ca3670aa

由 Tom St Denis 提交于 8月 23, 2017

Switches the AMDGPU driver over to the TTM tracepoint and removes
our old one.  Now you can enable traces before loading the module
and trace all mappings.
Signed-off-by: NTom St Denis <tom.stdenis@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

(v2): Use struct device instead of pci in trace.

ca3670aa

drm/amdgpu: inline amdgpu_ttm_do_bind again · 1cacc86a

由 Christian König 提交于 8月 22, 2017

The function is called only once and doesn't do anything special.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: NRoger He <Hongbo.He@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

1cacc86a

drm/amdgpu: fix amdgpu_ttm_bind · 9b0655e3

由 Christian König 提交于 8月 22, 2017

Use ttm_bo_mem_space instead of manually allocating GART space.

This allows us to evict BOs when there isn't enought GART space any more.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

9b0655e3

24 8月, 2017 2 次提交

drm/amdgpu: inline amdgpu_ttm_do_bind again · ac7afe6b

由 Christian König 提交于 8月 22, 2017

The function is called only once and doesn't do anything special.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: NRoger He <Hongbo.He@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

ac7afe6b

drm/amdgpu: fix amdgpu_ttm_bind · 1d00402b

由 Christian König 提交于 8月 22, 2017

Use ttm_bo_mem_space instead of manually allocating GART space.

1d00402b

18 8月, 2017 3 次提交

drm/amdgpu: move debug print into the MM managers · 97cbb284

由 Christian König 提交于 8月 07, 2017

Instead of the separate switch/case in the calling function.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

97cbb284

drm/amdgpu: fix incorrect use of the lru_lock · 12d4ac58

由 Christian König 提交于 8月 07, 2017

The BO manager has its own lock and doesn't use the lru_lock.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

12d4ac58

drm/amd/amdgpu: Add tracepoint for DMA page mapping (v4) · aca81718

由 Tom St Denis 提交于 7月 31, 2017

This helps map DMA addresses back to physical addresses.
Signed-off-by: NTom St Denis <tom.stdenis@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

(v2):  Added tracepoints for USERPTR, SG mappings, and
     SWIOTBL mappings.  Reformatted trace call perform
     PCI decoding internal to the trace.

(v3):  Add unmap tracepoints as well

(v4):  Move traces into separate functions

aca81718

16 8月, 2017 4 次提交

drm/amdgpu: Uninitialized variable in amdgpu_ttm_backend_bind() · 2ce3f5dc

由 Dan Carpenter 提交于 8月 09, 2017

My static checker complains that it's possible for "r" to be
uninitialized.  It used to be set to zero so this returns it to the old
behavior.

Fixes: 98a7f88c ("drm/amdgpu: bind BOs with GTT space allocated directly v2")
Reviewed-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NMichel Dänzer <michel.daenzer@amd.com>
Signed-off-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

2ce3f5dc

drm/amdgpu: Fix stolen typo · 5af2c10d

由 Kent Russell 提交于 8月 08, 2017

Change "stollen" to "stolen"
Signed-off-by: NKent Russell <kent.russell@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

5af2c10d

drm/amdgpu: use amdgpu_bo_create_kernel more often · a4a02777

由 Christian König 提交于 7月 27, 2017

Saves us quite a bunch of loc.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

a4a02777

drm/amdgpu: Add support for filling a buffer with 64 bit value · 330df03b

由 Yong Zhao 提交于 7月 20, 2017

That function will be used later to support setting a page table
block with 64 bit value.
Signed-off-by: NYong Zhao <Yong.Zhao@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

330df03b

26 7月, 2017 1 次提交

drm/amdgpu: Implement ttm_bo_driver.access_memory callback v2 · e342610c

由 Felix Kuehling 提交于 7月 03, 2017

Allows gdb to access contents of user mode mapped VRAM BOs.

v2: return error for non-VRAM pools
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: NMichel Dänzer <michel.daenzer@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

e342610c

14 7月, 2017 10 次提交

drm/amdgpu: Try evicting from CPU visible to invisible VRAM first · cb2dd1a6

由 Michel Dänzer 提交于 7月 04, 2017

This gives BOs which haven't been accessed by the CPU since they were
moved to visible VRAM another chance to stay in VRAM when another BO
needs to go to visible VRAM.

This should allow BOs to stay in VRAM longer in some cases.

v2:
* Only do this for BOs which don't have the
  AMDGPU_GEM_CREATE_CPU_ACCESS_REQUIRED flag set.
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NMichel Dänzer <michel.daenzer@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

cb2dd1a6

drm/amdgpu: Set/clear CPU_ACCESS flag on page fault and move to VRAM · 96cf8271

由 John Brooks 提交于 6月 30, 2017

When a BO is moved to VRAM, clear AMDGPU_GEM_CREATE_CPU_ACCESS_REQUIRED.
This allows it to potentially later move to invisible VRAM if the CPU
does not access it again.

Setting the CPU_ACCESS flag in amdgpu_bo_fault_reserve_notify() also means
that we can remove the loop to restrict lpfn to the end of visible VRAM,
because amdgpu_ttm_placement_init() will do it for us.

v3 [Michel Dänzer]
* Use AMDGPU_GEM_CREATE_CPU_ACCESS_REQUIRED instead of a new flag
  (Christian König)
* Clear flag in amdgpu_bo_move instead of amdgpu_move_ram_vram
  (Christian)
* Explicitly mention amdgpu_bo_fault_reserve_notify in amdgpu_bo_move
* Also clear flag in amdgpu_bo_create_restricted
Suggested-by: NMichel Dänzer <michel.daenzer@amd.com>
Signed-off-by: NJohn Brooks <john@fastquake.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NMichel Dänzer <michel.daenzer@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

96cf8271

drm/amdgpu: Add vis_vramlimit module parameter · 218b5dcd

由 John Brooks 提交于 6月 27, 2017

Allow specifying a limit on visible VRAM via a module parameter. This is
helpful for testing performance under visible VRAM pressure.

v2: Add cast to 64-bit (Christian König)
Signed-off-by: NJohn Brooks <john@fastquake.com>
Reviewed-by: NMichel Dänzer <michel.daenzer@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NMichel Dänzer <michel.daenzer@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

218b5dcd

drm/amdgpu: add new gttsize module parameter v2 · 36d38372

由 Christian König 提交于 7月 07, 2017

This allows setting the gtt size independent of the gart size.

v2: fix copy and paste typo
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

36d38372

drm/amdgpu: consistent name all GART related parts · 6f02a696

由 Christian König 提交于 7月 07, 2017

Rename symbols from gtt_ to gart_ as appropriate.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

6f02a696

drm/amdgpu: stop mapping BOs to GTT · 5e7e8396

由 Christian König 提交于 6月 30, 2017

No need to map BOs to GTT on eviction and intermediate transfers any more.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

5e7e8396

drm/amdgpu: use the GTT windows for BO moves v2 · abca90f1

由 Christian König 提交于 6月 30, 2017

This way we don't need to map the full BO at a time any more.

v2: use fixed windows for src/dst
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

abca90f1

drm: amd: amdgpu: constify ttm_place structures. · 1aaa5602

由 Arvind Yadav 提交于 7月 02, 2017

ttm_place are not supposed to change at runtime. All functions
working with ttm_place provided by <drm/ttm/ttm_placement.h> work
with const ttm_place. So mark the non-const structs as const.
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NArvind Yadav <arvind.yadav.cs@gmail.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

1aaa5602

drm/amdgpu: bind BOs with GTT space allocated directly v2 · 98a7f88c

由 Christian König 提交于 6月 30, 2017

This avoids binding them later on.

v2: fix typo in function name
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Acked-by: NFelix Kuehling <Felix.Kuehling@amd.com>

98a7f88c

drm/amdgpu: bind BOs to TTM only once · 92c60d9c

由 Christian König 提交于 6月 29, 2017

No need to do this on every round.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Acked-by: NFelix Kuehling <Felix.Kuehling@amd.com>

92c60d9c

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功