- 08 June 2017, 4 commits
-
-
Submitted by Alex Deucher
Make it consistent. Reviewed-by: Alex Xie <AlexBin.Xie@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Alex Deucher
This was missed when Andres' queue patches were rebased. Fixes: 42794b27 (drm/amdgpu: take ownership of per-pipe configuration v3) Reviewed-by: Alex Xie <AlexBin.Xie@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Alex Deucher
The same function was duplicated in all gfx IP files. Reviewed-by: Alex Xie <AlexBin.Xie@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Alex Deucher
Fixes hangs on single-MEC ASICs. Fixes: 2ed286fb434 (drm/amdgpu: new queue policy, take first 2 queues of each pipe v2) Reviewed-by: Alex Xie <AlexBin.Xie@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
- 07 June 2017, 26 commits
-
-
Submitted by Alex Deucher
The same function was duplicated in all the gfx IPs. Use a single implementation for all. v2: use static inline (Alex Xie) Reviewed-by: Alex Xie <AlexBin.Xie@amd.com> Suggested-by: Andres Rodriguez <andresx7@gmail.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
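Assuming the duplicated helper was a simple bitmask builder (a common pattern for such per-IP copies), a consolidated static inline version might look like the sketch below; the function name and header placement are hypothetical, not the driver's exact code.

```c
/*
 * Sketch of consolidating a per-IP duplicate into one shared static
 * inline helper. Names are hypothetical; the real helper in the
 * driver may differ.
 */
#include <stdint.h>
#include <stdio.h>

/* one shared header replaces identical gfx_v6/v7/v8/v9 copies */
static inline uint32_t gfx_create_bitmask(uint32_t bit_width)
{
	/* contiguous mask of bit_width low bits, e.g. 4 -> 0xf */
	return (uint32_t)((1ULL << bit_width) - 1);
}

int main(void)
{
	printf("mask(10) = 0x%x\n", (unsigned)gfx_create_bitmask(10)); /* 0x3ff */
	return 0;
}
```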
-
Submitted by Alex Deucher
Always use the max for the family rather than the per-SKU limits. This makes sure the mask is always the max size to avoid reporting the wrong number of CUs. Reviewed-by: Alex Xie <AlexBin.Xie@amd.com> Reviewed-by: Andres Rodriguez <andresx7@gmail.com> Cc: stable@vger.kernel.org Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
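A minimal sketch of why the family-wide maximum matters: if the mask is sized from a per-SKU limit that is narrower than the hardware's CU bitmap, valid bits get dropped and the CU count comes out wrong. The numbers and names below are illustrative only.

```c
/*
 * Illustration: masking a CU bitmap with a per-SKU width undercounts,
 * a family-wide maximum width does not. Values are made up.
 */
#include <stdint.h>
#include <stdio.h>

static uint32_t cu_mask(uint32_t num_cu)
{
	return (uint32_t)((1ULL << num_cu) - 1);
}

int main(void)
{
	uint32_t raw_bitmap = 0x2ff; /* hw-reported active-CU bits, bit 9 set */

	/* per-SKU width (say 8) drops the high bit and undercounts */
	printf("per-sku: %d CUs\n", __builtin_popcount(raw_bitmap & cu_mask(8)));
	/* family-wide maximum (say 11) keeps every valid bit */
	printf("family:  %d CUs\n", __builtin_popcount(raw_bitmap & cu_mask(11)));
	return 0;
}
```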
-
Submitted by Alex Deucher
This was missing for gfx6. Acked-by: Huang Rui <ray.huang@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> Cc: stable@vger.kernel.org
-
Submitted by Alex Deucher
Instead of taking the first pipe and giving the rest to kfd, take the first 2 queues of each pipe. Effectively, amdgpu and amdkfd own the same number of queues, but because the queues are spread over multiple pipes the hardware will be able to better handle concurrent compute workloads. amdgpu goes from 1 pipe to 4 pipes, i.e. from 1 compute thread to 4; amdkfd goes from 3 pipes to 4 pipes, i.e. from 3 compute threads to 4. gfx9 was missed when this patch set was rebased to include gfx9. Acked-by: Tom St Denis <tom.stdenis@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Reviewed-by: Andres Rodriguez <andresx7@gmail.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
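A minimal sketch of the ownership policy, assuming a flat queue index ordered mec, then pipe, then queue; the constants and bitmap name are illustrative, not the driver's exact code.

```c
/*
 * "First two queues of each pipe" ownership sketch. With one MEC,
 * 4 pipes and 8 queues per pipe, amdgpu ends up owning bits
 * 0,1, 8,9, 16,17, 24,25 -> 0x03030303; the rest go to amdkfd.
 */
#include <stdint.h>
#include <stdio.h>

#define NUM_MEC         1
#define PIPES_PER_MEC   4
#define QUEUES_PER_PIPE 8

int main(void)
{
	uint64_t amdgpu_queue_bitmap = 0; /* queues owned by amdgpu */

	for (int i = 0; i < NUM_MEC * PIPES_PER_MEC * QUEUES_PER_PIPE; i++) {
		int queue = i % QUEUES_PER_PIPE;

		/* amdgpu takes the first two queues of every pipe */
		if (queue < 2)
			amdgpu_queue_bitmap |= 1ULL << i;
	}

	printf("amdgpu queue bitmap: 0x%08llx\n",
	       (unsigned long long)amdgpu_queue_bitmap);
	return 0;
}
```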
-
Submitted by Alex Deucher
Pipes provide better concurrency than queues, so we want to make sure that apps use queues from different pipes whenever possible. Optimize for the trivial case where an app will consume rings in order, and therefore avoid making adjacent rings belong to the same pipe. gfx9 was missed when these patches were rebased. Reviewed-by: Tom St Denis <tom.stdenis@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Reviewed-by: Andres Rodriguez <andresx7@gmail.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
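A sketch of the intended ordering, assuming the compute ring index walks pipes before queues so that adjacent ring ids never land on the same pipe; the constants are examples only.

```c
/*
 * Ring ordering sketch: pipe changes fastest, queue changes slowest,
 * so consecutive ring ids always sit on different pipes.
 */
#include <stdio.h>

#define NUM_PIPES       4
#define QUEUES_PER_PIPE 2   /* queues amdgpu owns per pipe */

int main(void)
{
	for (int ring = 0; ring < NUM_PIPES * QUEUES_PER_PIPE; ring++) {
		int pipe  = ring % NUM_PIPES;
		int queue = ring / NUM_PIPES;

		printf("ring %d -> pipe %d queue %d\n", ring, pipe, queue);
	}
	return 0;
}
```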
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Something writes over the first 8 MB, so reserve this region on vega10 until we root-cause it. Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Users can follow the IP block number to write ip_block_mask and select which IP blocks they would like to enable. Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
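As a rough illustration of how such a mask gates IP block initialization: block i is only enabled if bit i of the mask is set. The block list and indices below are made up; on real hardware they depend on the device's IP block order.

```c
/*
 * ip_block_mask sketch: clear a bit to skip the corresponding IP
 * block. The list of blocks here is illustrative only.
 */
#include <stdint.h>
#include <stdio.h>

int main(void)
{
	const char *ip_blocks[] = { "common", "gmc", "ih", "psp",
				    "gfx", "sdma", "uvd", "vce" };
	int nblocks = sizeof(ip_blocks) / sizeof(ip_blocks[0]);
	uint32_t ip_block_mask = 0xffffffff & ~(1u << 6); /* disable block 6 */

	for (int i = 0; i < nblocks; i++) {
		int enabled = !!(ip_block_mask & (1u << i));
		printf("ip block %d <%s>: %s\n", i, ip_blocks[i],
		       enabled ? "enabled" : "disabled");
	}
	return 0;
}
```

In the real driver this corresponds to the amdgpu.ip_block_mask module parameter; which bit maps to which block is device-specific, which is why the block numbers are useful to users.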
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
This patch moves invalidation from hw_init into the gart enable function, because we would like to align the calling sequence between init and resume. Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Huang Rui
Signed-off-by: Huang Rui <ray.huang@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
- 02 June 2017, 4 commits
-
-
Submitted by Leo Liu
We are using PSP to resume firmware after suspend, and it resumes from where it got suspended, so we had better save the context. Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Leo Liu
To simplify VCE BO creation. Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Leo Liu
Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Alex Xie
In review, Christian would like to keep the logic inside amdgpu_vm.c, at the cost of being slightly slower. The loop is still optimized out with this patch. v2: remove the if statement. Now it is not slower. Signed-off-by: Alex Xie <AlexBin.Xie@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
- 01 June 2017, 6 commits
-
-
Submitted by Andres Rodriguez
Spreading the load across multiple SDMA engines can increase memory transfer performance. Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
-
Submitted by Andres Rodriguez
Depending on usage patterns, the current LRU policy may create a non-injective mapping between userspace ring ids and kernel rings. This behaviour is undesirable, as apps that attempt to fill all HW blocks would be unable to reach some of them. This change forces the LRU policy to create bijective mappings only. v2: compress ring_blacklist v3: simplify amdgpu_ring_is_blacklisted() logic Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
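A rough sketch of the bijection constraint: rings already handed to the requesting context are skipped ("blacklisted") when the LRU pick is made, so no two user ring ids within a context can land on the same kernel ring. Structures and names are illustrative, and the LRU reordering on use is omitted for brevity.

```c
/* Bijective LRU pick sketch: skip rings this context already holds. */
#include <stdbool.h>
#include <stdio.h>

#define NUM_HW_RINGS 4

static bool is_blacklisted(int ring, const int *blacklist, int n)
{
	for (int i = 0; i < n; i++)
		if (blacklist[i] == ring)
			return true;
	return false;
}

/* lru[] is ordered least- to most-recently used */
static int lru_get(const int *lru, const int *blacklist, int n_blacklist)
{
	for (int i = 0; i < NUM_HW_RINGS; i++)
		if (!is_blacklisted(lru[i], blacklist, n_blacklist))
			return lru[i];
	return -1; /* no unused ring left */
}

int main(void)
{
	int lru[NUM_HW_RINGS] = { 2, 0, 3, 1 };
	int taken[NUM_HW_RINGS];
	int n = 0;

	/* a context asking for 3 user rings gets 3 distinct HW rings */
	for (int user_ring = 0; user_ring < 3; user_ring++) {
		int hw = lru_get(lru, taken, n);
		taken[n++] = hw;
		printf("user ring %d -> hw ring %d\n", user_ring, hw);
	}
	return 0;
}
```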
-
Submitted by Andres Rodriguez
Use an LRU policy to map usermode rings to HW compute queues. Most compute clients use one queue, and usually the first queue available. This results in poor pipe/queue work distribution when multiple compute apps are running; in most cases pipe 0 queue 0 is the only queue that gets used. In order to better distribute work across multiple HW queues, we adopt a policy to map the usermode ring ids to the LRU HW queue. This fixes the large majority of multi-app compute workloads sharing the same HW queue even though 7 other queues are available. v2: use ring->funcs->type instead of ring->hw_ip v3: remove amdgpu_queue_mapper_funcs v4: change ring_lru_list_lock to spinlock, grab only once in lru_get() Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
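A minimal sketch of the LRU selection itself, assuming a simple least-recently-used list of HW queues: each new usermode ring takes the front of the list, and the chosen queue moves to the back. The real driver keeps this list per device under a spinlock; the version below is illustrative only.

```c
/* LRU queue selection sketch: pick the front, rotate it to the back. */
#include <stdio.h>

#define NUM_QUEUES 4

static int lru[NUM_QUEUES] = { 0, 1, 2, 3 }; /* front = least recently used */

static int lru_get(void)
{
	int q = lru[0];

	/* move the chosen queue to the back (most recently used) */
	for (int i = 0; i < NUM_QUEUES - 1; i++)
		lru[i] = lru[i + 1];
	lru[NUM_QUEUES - 1] = q;
	return q;
}

int main(void)
{
	/* three compute apps each mapping "their" ring 0 get queues 0, 1, 2 */
	for (int app = 0; app < 3; app++)
		printf("app %d user ring 0 -> hw queue %d\n", app, lru_get());
	return 0;
}
```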
-
Submitted by Andres Rodriguez
Add amdgpu_queue_mgr, a mechanism that decouples usermode ring ids from the kernel's ring ids. The queue manager maintains a per-file-descriptor map of user ring ids to amdgpu_ring pointers. Once a map is created it is permanent (this is required to maintain FIFO execution guarantees for a context's ring). Different queue map policies can be configured for each HW IP. Currently all HW IPs use the identity mapper, i.e. the kernel ring id is equal to the user ring id. The purpose of this mechanism is to distribute the load across multiple queues more effectively for HW IPs that support multiple rings. Userspace clients are unable to check whether a specific resource is in use by a different client; therefore, it is up to the kernel driver to make the optimal choice. v2: remove amdgpu_queue_mapper_funcs v3: made amdgpu_queue_mgr per context instead of per-fd v4: add context_put on error paths v5: rebase and include new IPs UVD_ENC & VCN_* v6: drop unused amdgpu_ring_is_valid_index (Alex) Reviewed-by: Christian König <christian.koenig@amd.com> Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
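A simplified model of the queue manager's lookup path, assuming the identity mapping policy: the first request for a user ring id resolves and caches a HW ring, and every later request returns the cached ring so the context keeps its FIFO guarantees. Names and structures are illustrative, not the driver's.

```c
/* Per-context queue manager sketch with an identity mapping policy. */
#include <stdio.h>

#define MAX_USER_RINGS 8

struct hw_ring { int idx; };

struct queue_mapper {
	struct hw_ring *map[MAX_USER_RINGS]; /* NULL = not mapped yet */
};

static struct hw_ring hw_rings[MAX_USER_RINGS] =
	{ {0}, {1}, {2}, {3}, {4}, {5}, {6}, {7} };

/* identity policy: kernel ring id == user ring id */
static struct hw_ring *identity_map(int user_ring)
{
	return &hw_rings[user_ring];
}

static struct hw_ring *queue_mgr_get(struct queue_mapper *m, int user_ring)
{
	if (!m->map[user_ring])           /* first use: create the mapping */
		m->map[user_ring] = identity_map(user_ring);
	return m->map[user_ring];         /* permanent afterwards */
}

int main(void)
{
	struct queue_mapper ctx_mapper = { { 0 } };

	printf("user ring 2 -> hw ring %d\n", queue_mgr_get(&ctx_mapper, 2)->idx);
	printf("user ring 2 -> hw ring %d (cached)\n",
	       queue_mgr_get(&ctx_mapper, 2)->idx);
	return 0;
}
```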
-
Submitted by Andres Rodriguez
Tonga-based ASICs may experience hangs when an HQD's EOP parameters are modified. Work around this HW issue by avoiding writes to these registers for Tonga ASICs. Based on the following ROCm commit: 2a0fb8 - drm/amdgpu: Synchronize KFD HQD load protocol with CP scheduler From the ROCm git repository: https://github.com/RadeonOpenCompute/ROCK-Kernel-Driver.git CC: Jay Cornwall <Jay.Cornwall@amd.com> Suggested-by: Felix Kuehling <Felix.Kuehling@amd.com> Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com> Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
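The shape of the workaround, sketched with illustrative register names and a made-up family check: on Tonga-class parts the EOP registers are simply not written during HQD load.

```c
/* Tonga EOP workaround sketch: skip EOP register writes on that family. */
#include <stdio.h>

enum asic_family { FAMILY_TONGA, FAMILY_OTHER };

static void write_reg(const char *name, unsigned int val)
{
	printf("write %s = 0x%x\n", name, val);
}

static void hqd_load(enum asic_family family, unsigned int eop_base,
		     unsigned int eop_control)
{
	if (family != FAMILY_TONGA) {
		/* modifying EOP parameters can hang Tonga, so skip it there */
		write_reg("CP_HQD_EOP_BASE_ADDR", eop_base);
		write_reg("CP_HQD_EOP_CONTROL", eop_control);
	}
	/* ... program the rest of the HQD as usual ... */
}

int main(void)
{
	hqd_load(FAMILY_TONGA, 0x1000, 0x8);  /* EOP writes skipped */
	hqd_load(FAMILY_OTHER, 0x1000, 0x8);  /* EOP writes performed */
	return 0;
}
```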
-
Submitted by Andres Rodriguez
The MQD structure matches the reg layout. Take advantage of this to simplify HQD programming. Note that the ACTIVE field still needs to be programmed last. Suggested-by: Felix Kuehling <Felix.Kuehling@amd.com> Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com> Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
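A sketch of the simplification, assuming the MQD's dwords line up with a contiguous HQD register range: program the HQD with a loop over that range, deferring only the ACTIVE register to the end. Offsets, counts, and names below are illustrative.

```c
/* HQD programming sketch: loop over the MQD dwords, ACTIVE written last. */
#include <stdint.h>
#include <stdio.h>

#define HQD_REG_BASE   0x3000
#define HQD_REG_COUNT  8
#define HQD_ACTIVE_IDX 5       /* position of the ACTIVE register in the range */

static void wreg(uint32_t reg, uint32_t val)
{
	printf("WREG32(0x%04x) = 0x%08x\n", reg, val);
}

static void hqd_program(const uint32_t *mqd_regs)
{
	/* everything except ACTIVE, in register order */
	for (int i = 0; i < HQD_REG_COUNT; i++)
		if (i != HQD_ACTIVE_IDX)
			wreg(HQD_REG_BASE + i, mqd_regs[i]);

	/* ACTIVE last: the CP only starts fetching once the queue is fully set up */
	wreg(HQD_REG_BASE + HQD_ACTIVE_IDX, mqd_regs[HQD_ACTIVE_IDX]);
}

int main(void)
{
	uint32_t mqd_regs[HQD_REG_COUNT] =
		{ 0x10, 0x0, 0xdeadbe00, 0x0, 0x80, 0x1, 0x0, 0x2 };

	hqd_program(mqd_regs);
	return 0;
}
```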
-