提交 · 35161bbc135a748dd0a3c822030b3341cdefbd33 · openanolis / cloud-kernel

29 9月, 2017 1 次提交

drm/amdgpu: map compute rings by least recently used pipe · 35161bbc

由 Andres Rodriguez 提交于 9月 26, 2017

This patch provides a guarantee that the first n queues allocated by
an application will be on different pipes. Where n is the number of
pipes available from the hardware.

This helps avoid ring aliasing which can result in work executing in
time-sliced mode instead of truly parallel mode.
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAndres Rodriguez <andresx7@gmail.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

35161bbc

30 8月, 2017 1 次提交

drm/amdgpu: set sched_hw_submission higher for KIQ (v3) · b249e18d

由 Alex Deucher 提交于 8月 22, 2017

KIQ doesn't really use the GPU scheduler.  The base
drivers generally use the KIQ ring directly rather than
submitting IBs.  However, amdgpu_sched_hw_submission
(which defaults to 2) limits the number of outstanding
fences to 2.  KFD uses the KIQ for TLB flushes and the
2 fence limit hurts performance when there are several KFD
processes running.

v2: move some expressions to one line
    change KIQ sched_hw_submission to at least 16
v3: bump to 256
Reviewed-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

b249e18d

16 8月, 2017 3 次提交

drm/amdgpu: don't finish the ring if not initialized · 41cc07cf

由 Trigger Huang 提交于 8月 08, 2017

If a ring is not initialized, it also should not be finished.
For example, in Vega10's SR-IOV environment, UVD's decode ring is not
initialized, but will be finnished in amdgpu_uvd_sw_fini, because UVD
driver put all the uvd decode ring's finish operation into
amdgpu_uvd_sw_fini function, while not uvd_vXXX_0_sw_fini. This will
lead to amdgpu module unloading failure.
Signed-off-by: NTrigger Huang <trigger.huang@amd.com>
Reviewed-by: NMonk Liu <monk.liu@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

41cc07cf

drm/amdgpu: use 256 bit buffers for all wb allocations (v2) · 97407b63

由 Alex Deucher 提交于 7月 28, 2017

May waste a bit of memory, but simplifies the interface
significantly.

v2: convert internal accounting to use 256bit slots
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

97407b63

drm/amdgpu: make wb 256bit function names consistent · eacf3e14

由 Alex Deucher 提交于 7月 27, 2017

Use a lower case b to be consistent with the other wb functions.
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

eacf3e14

26 7月, 2017 1 次提交

drm/amdgpu:fix gfx fence allocate size · 0915fdbc

由 Monk Liu 提交于 6月 19, 2017

1, for sriov, we need 8dw for the gfx fence due to CP
behaviour
2, cleanup wrong logic in wptr/rptr wb alloc and free

Change-Id: Ifbfed17a4621dae57244942ffac7de1743de0294
Signed-off-by: NMonk Liu <Monk.Liu@amd.com>
Signed-off-by: NXiangliang Yu <Xiangliang.Yu@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

0915fdbc

02 6月, 2017 1 次提交

drm/amdgpu: Move compute vm bug logic to amdgpu_vm.c · e59c0205

由 Alex Xie 提交于 6月 01, 2017

  In review, Christian would like to keep the logic
  inside amdgpu_vm.c with a cost of slightly slower.
  The loop is still optimized out with this patch.

v2: remove the if statement. Now it is not slower.
Signed-off-by: NAlex Xie <AlexBin.Xie@amd.com>
Reviewed-by: NChristian König <christian.koeng@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

e59c0205

01 6月, 2017 3 次提交

drm/amdgpu: guarantee bijective mapping of ring ids for LRU v3 · 6065343a

由 Andres Rodriguez 提交于 3月 17, 2017

Depending on usage patterns, the current LRU policy may create a
non-injective mapping between userspace ring ids and kernel rings.

This behaviour is undesired as apps that attempt to fill all HW blocks
would be unable to reach some of them.

This change forces the LRU policy to create bijective mappings only.

v2: compress ring_blacklist
v3: simplify amdgpu_ring_is_blacklisted() logic
Signed-off-by: NAndres Rodriguez <andresx7@gmail.com>
Reviewed-by: NNicolai Hähnle <nicolai.haehnle@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

6065343a

drm/amdgpu: implement lru amdgpu_queue_mgr policy for compute v4 · 795f2813

由 Andres Rodriguez 提交于 3月 06, 2017

Use an LRU policy to map usermode rings to HW compute queues.

Most compute clients use one queue, and usually the first queue
available. This results in poor pipe/queue work distribution when
multiple compute apps are running. In most cases pipe 0 queue 0 is
the only queue that gets used.

In order to better distribute work across multiple HW queues, we adopt
a policy to map the usermode ring ids to the LRU HW queue.

This fixes a large majority of multi-app compute workloads sharing the
same HW queue, even though 7 other queues are available.

v2: use ring->funcs->type instead of ring->hw_ip
v3: remove amdgpu_queue_mapper_funcs
v4: change ring_lru_list_lock to spinlock, grab only once in lru_get()
Signed-off-by: NAndres Rodriguez <andresx7@gmail.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

795f2813

drm/amdgpu: Optimize a function called by every IB sheduling · dd684d31

由 Alex Xie 提交于 5月 30, 2017

  Move several if statements and a loop statment from
  run time to initialization time.
Signed-off-by: NAlex Xie <AlexBin.Xie@amd.com>
Reviewed-by: NChunming Zhou <david1.zhou@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

dd684d31

30 3月, 2017 5 次提交

drm/amd/amdgpu: Correct ring wptr address in debugfs (v2) · ec63982e

由 Tom St Denis 提交于 3月 29, 2017

On gfx9 hardware the value is not wrapped and is a 64-bit value.  So
we reduce it modulo the ring size.
Signed-off-by: NTom St Denis <tom.stdenis@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>

(v2) use buf_mask instead of computing on the fly
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

ec63982e

drm/amdgpu:fix ring init sequence · e09706f4

由 Monk Liu 提交于 3月 21, 2017

ring->buf_mask need be set prior to ring_clear_ring invoke
and fix ring_clear_ring as well which should use buf_mask
instead of ptr_mask
Signed-off-by: NMonk Liu <Monk.Liu@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

e09706f4

drm/amdgpu: add 64bit wb functions · 7014285a

由 Ken Wang 提交于 3月 18, 2016

Newer asics need 64 bit writeback slots.
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NKen Wang <Qingqing.Wang@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

7014285a

drm/amdgpu: change wptr to 64 bits (v2) · 536fbf94

由 Ken Wang 提交于 3月 12, 2016

Newer asics need 64 bit wptrs.  If the wptr is now
smaller than the rptr that doesn't indicate a wrap-around
anymore.

v2: integrate Christian's comments.
Signed-off-by: NKen Wang <Qingqing.Wang@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

536fbf94

drm/amdgpu:use clear_ring to clr RB · f6bd7942

由 Monk Liu 提交于 2月 08, 2017

In resume routine, we need clr RB prior to the
ring test of engine, otherwise some engine hang
duplicated during GPU reset.
Signed-off-by: NMonk Liu <Monk.Liu@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

f6bd7942

28 1月, 2017 1 次提交

drm/amdgpu:set cond_exec polling value to 1 in ring_init · 714fbf80

由 Monk Liu 提交于 1月 18, 2017

no need to set it per ib_schedule(), hw won't override
this polling address.
Signed-off-by: NMonk Liu <Monk.Liu@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

714fbf80

05 12月, 2016 1 次提交
- A
  don't open-code file_inode() · 45063097
  由 Al Viro 提交于 12月 04, 2016
```
Signed-off-by: NAl Viro <viro@zeniv.linux.org.uk>
```
  45063097
26 10月, 2016 2 次提交

drm/amdgpu: move align_mask and nop into ring funcs as well (v2) · 79887142

由 Christian König 提交于 10月 05, 2016

They are constant as well.

v2: update uvd and vce phys ring structures as well
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

79887142

drm/amdgpu: move the ring type into the funcs structure (v2) · 21cd942e

由 Christian König 提交于 10月 05, 2016

It's constant, so it doesn't make to much sense to keep it
with the variable data.

v2: update vce and uvd phys mode ring structures as well
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

21cd942e

14 10月, 2016 1 次提交

drm/amdgpu: potential NULL dereference in debugfs code · eeb2fa0c

由 Dan Carpenter 提交于 10月 12, 2016

debugfs_create_file() returns NULL on error, it only returns error
pointers if debugfs isn't enabled in the config and we checked for that
earlier so it can't happen.

Fixes: 4f4824b5 ('drm/amd/amdgpu: Convert ring debugfs entries to binary')
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NDan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

eeb2fa0c

28 9月, 2016 1 次提交

drm/amdgpu: clear ring pointer in amdgpu_device on teardown · d8907643

由 Grazvydas Ignotas 提交于 9月 25, 2016

This is in symmetry to setup done in amdgpu_ring_init.
Signed-off-by: NGrazvydas Ignotas <notasas@gmail.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

d8907643

15 9月, 2016 1 次提交

drm/amdgpu: free the BO in kernel by helper amdgpu_bo_free_kernel() · 8640faed

由 Junwei Zhang 提交于 9月 07, 2016

Signed-off-by: NJunwei Zhang <Jerry.Zhang@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

8640faed

08 8月, 2016 1 次提交

drm/amdgpu: use amdgpu_bo_create_kernel in amdgpu_ring.c · 37ac235b

由 Christian König 提交于 7月 26, 2016

Saves us quite a bunch of code.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NChunming Zhou <david1.zhou@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

37ac235b

30 7月, 2016 2 次提交

drm/amdgpu: add begin/end_use ring callbacks · f06505b8

由 Christian König 提交于 7月 20, 2016

For manual UVD/VCE power and clock gating.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NEdward O'Callaghan <funfunctor@folklore1984.net>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

f06505b8

drm/amdgpu: remove fence_lock · 7c23ace2

由 Christian König 提交于 7月 19, 2016

Was never used as far as I can see.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NChunming Zhou <david1.zhou@amd.com>
Reviewed-by: NEdward O'Callaghan <funfunctor@folklore1984.net>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

7c23ace2

08 7月, 2016 6 次提交

drm/amdgpu: remove more of the ring backup code · 33b7ed01

由 Alex Deucher 提交于 7月 06, 2016

Not used anymore.
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

33b7ed01

drm/amdgpu: clean up ring_backup code, no need more · 40019dc4

由 Chunming Zhou 提交于 6月 29, 2016

Signed-off-by: NChunming Zhou <David1.Zhou@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

40019dc4

drm/amdgpu: fix ring debugfs bug · a909c6bd

由 Monk Liu 提交于 6月 14, 2016

debugfs file added but not released after driver unloaded
Signed-off-by: NMonk Liu <Monk.Liu@amd.com>
Reviewed-by: NChunming Zhou <david1.zhou@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

a909c6bd

drm/amd/amdgpu: ring debugfs is read in increments of 4 bytes · c71dbd93

由 Tom St Denis 提交于 5月 02, 2016

If a user tries to read a non-multiple of 4 bytes it would have
read until the end of the ring potentially crashing the user
task.
Signed-off-by: NTom St Denis <tom.stdenis@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

c71dbd93

drm/amd/amdgpu: Convert ring debugfs entries to binary · 4f4824b5

由 Tom St Denis 提交于 4月 27, 2016

They now emit ring data in binary which will be read/written by
the userspace tool umr shortly.
Signed-off-by: NTom St Denis <tom.stdenis@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

4f4824b5

drm/amdgpu: clear RB at ring init · cc7d8c79

由 Monk Liu 提交于 6月 01, 2016

This help fix reloading driver hang issue of SDMA
ring.
Signed-off-by: NMonk Liu <Monk.Liu@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

cc7d8c79

09 6月, 2016 1 次提交

drm/amdgpu: fix missing free wb for cond_exec · 67a6a504

由 Monk Liu 提交于 5月 30, 2016

Signed-off-by: NMonk Liu <Monk.Liu@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

67a6a504

05 5月, 2016 4 次提交

drm/amdgpu: fix the coding style in amdgpu_ring.c · eb430969

由 Christian König 提交于 4月 13, 2016

No functional change.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

eb430969

drm/amdgpu: use the ring name for debugfs (v2) · 771c8ec1

由 Christian König 提交于 4月 13, 2016

Instead of hard coding just another name in the ring code.

v2: squash in Tom's rebase fix
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

771c8ec1

drm/amdgpu: use max_dw in ring_init · a3f1cf35

由 Christian König 提交于 4月 12, 2016

Instead of specifying the total ring size calculate that from the maximum
number of dw a submission can have and the number of concurrent submissions.

This fixes UVD with 8 concurrent submissions or more.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

a3f1cf35

drm/amdgpu: Mark all instances of struct drm_info_list as const · 06ab6832

由 Nils Wallménius 提交于 5月 02, 2016

All these are compile time constand and the
drm_debugfs_create/remove_files functions take a const
pointer argument.
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NNils Wallménius <nils.wallmenius@gmail.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

06ab6832

03 5月, 2016 1 次提交

drm/amdgpu: support cond exec · 128cff1a

由 Monk Liu 提交于 1月 14, 2016

This adds the groundwork for conditional execution on
SDMA which is necessary for preemption.
Signed-off-by: NMonk Liu <monk.liu@amd.com>
Reviewed-by: NChristian König <christian.koenig@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

128cff1a

17 3月, 2016 1 次提交

drm/amdgpu: add number of hardware submissions to amdgpu_fence_driver_init_ring · e6151a08

由 Christian König 提交于 3月 15, 2016

Make this a parameter instead of using the global variable directly.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Acked-by: NAlex Deucher <alexander.deucher@amd.com>
Reviewed-by: NChunming Zhou <david1.zhou@amd.com>

e6151a08

15 3月, 2016 1 次提交

drm/amdgpu: remove amdgpu_ring_from_fence · f104fbcb

由 Christian König 提交于 3月 11, 2016

Not used any more.
Signed-off-by: NChristian König <christian.koenig@amd.com>
Acked-by: NAlex Deucher <alexander.deucher@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

f104fbcb

11 2月, 2016 1 次提交

drm/amdgpu: make pad_ib a ring function v3 · 9e5d5309

由 Christian König 提交于 1月 31, 2016

The padding depends on the firmware version and we need that for BO moves as
well, not only for VM updates.

v2: new approach of making pad_ib a ring function
v3: fix typo in macro name
Signed-off-by: NChristian König <christian.koenig@amd.com>
Reviewed-by: NAlex Deucher <alexander.deucer@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

9e5d5309

openanolis / cloud-kernel 1 年多 前同步成功

openanolis / cloud-kernel
1 年多前同步成功