Commits · 5217811e74d3b9b6d830476ab8419bbe3d42596e · openeuler / Kernel

Mar 24, 2021 · 40 commits

drm/amdgpu: add gc powerbrake support (v2) · 5217811e

Committed by Kevin Wang on Jan 15, 2021

Add GC power brake feature support for Aldebaran.

v2: squash in fixes (Alex)

Signed-off-by: Kevin Wang <kevin1.wang@amd.com>
Reviewed-by: Kenneth Feng <kenneth.feng@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: update TCP_CHAN_STEER_1 golden value for aldebaran · b3ecf36b

Committed by Hawking Zhang on Nov 12, 2020

The golden setting was changed recently. Update to
the latest one.

Signed-off-by: Hawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: Kevin Wang <kevin1.wang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: add common gc golden settings for aldebaran · 9f55d7ed

Committed by Hawking Zhang on Nov 11, 2020

Golden settings that should be applied.

Signed-off-by: Hawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: Kevin Wang <kevin1.wang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: apply gc v9_4_2 golden settings for aldebaran · 264aef8b

Committed by Hawking Zhang on Oct 19, 2020

These registers should be programmed once, as part of initialization.

Signed-off-by: Hawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: Kevin Wang <kevin1.wang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: restore aldebaran save ttmp and trap config on init (v2) · 16171a25

Committed by Jonathan Kim on Aug 21, 2020

Initialization of TRAP_DATA0/1 is still required for the debugger to detect
new waves on Aldebaran. Also, per-vmid global trap enablement may be
required outside of debugger scope, so move it to the init phase.

v2: just add the gfx 9.4.2 changes (Alex)

Signed-off-by: Jonathan Kim <Jonathan.Kim@amd.com>
Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdkfd: add aldebaran kfd2kgd callbacks to kfd device (v2) · 5073506c

Committed by Jonathan Kim on Sep 05, 2020

Create dedicated Aldebaran kfd2kgd callbacks to prepare
for new per-vmid register instructions for debug trap
setting functions and sending host traps.

v2: rebase (Alex)

Signed-off-by: Jonathan Kim <Jonathan.Kim@amd.com>
Reviewed-by: Oak Zeng <Oak.Zeng@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdkfd: Add kernel parameter to stop queue eviction on vm fault · 6d909c5d

Committed by Oak Zeng on Jun 22, 2020

This keeps the wavefront context for debugging purposes.

Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>
Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: allow use psp to load firmware (v2) · 2f669734

Committed by Hawking Zhang on Apr 13, 2020

Match existing asics.

v2: rebase (Alex)

Signed-off-by: Hawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: Kevin Wang <kevin1.wang@amd.com>
Reviewed-by: Le Ma <Le.Ma@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: use pd addr based on gart level page table · ec8631e0

Committed by Alex Sierra on Feb 03, 2021

With a recent gart page table re-construction, the gart page
table is now 2-level for some ASICs: PDB0->PTB.
In the case of a 2-level gart page table, the page_table_base
of vmid0 should point to PDB0 instead of PTB.

Signed-off-by: Alex Sierra <alex.sierra@amd.com>
Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: Oak Zeng <Oak.Zeng@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
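
The base-address selection described in this commit can be sketched as follows. This is a minimal illustration, not the driver code: the struct `gart_layout`, its fields, and the function `vmid0_page_table_base` are hypothetical names introduced here.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical description of the GART layout; not a driver structure. */
struct gart_layout {
	bool two_level;      /* true when the table is PDB0 -> PTB */
	uint64_t pdb0_addr;  /* address of the top-level directory (PDB0) */
	uint64_t ptb_addr;   /* address of the page table block (PTB) */
};

/*
 * Pick the value for vmid0's page_table_base: PDB0 when the GART table
 * is 2-level, otherwise the PTB itself.
 */
static uint64_t vmid0_page_table_base(const struct gart_layout *g)
{
	assert(g != NULL);
	return g->two_level ? g->pdb0_addr : g->ptb_addr;
}
```

With a 2-level layout the function returns `pdb0_addr`; with the older single-level layout it returns `ptb_addr`.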

drm/amdgpu: Fix the comment in amdgpu_gmc.h · be0478e7

Committed by Oak Zeng on Feb 03, 2021

Use more accurate wording to address
code review feedback.

Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>
Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Fix GART page table s-bit · 79194dac

Committed by Oak Zeng on Jan 22, 2021

For the new 2-level GART table, the last PDE0 points
to PTB. Since PTB is in vram and we are currently
running in s=0 mode (vram is treated as FB carveout),
the s bit of this PDE0 should be set to 0.

Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>
Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: update mmhub client ids for Aldebaran · f4ec3e50

Committed by Alex Sierra on Feb 02, 2021

Update the mmhub client id table for Aldebaran.

Signed-off-by: Alex Sierra <alex.sierra@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: enable sram initialization for aldebaran · abe5ee57

Committed by Dennis Li on Feb 01, 2021

Aldebaran can share the same initializing shader code with
arcturus.

Signed-off-by: Dennis Li <Dennis.Li@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: workaround the TMR MC address issue (v2) · 2f055097

Committed by Oak Zeng on Jan 26, 2021

With the 2-level gart page table, vram is squeezed into the gart aperture
and the FB aperture is disabled. Therefore all VRAM virtual addresses are
in the GART aperture. However, PSP currently requires TMR addresses
in the FB aperture, so a design change is needed at the PSP FW level to
support this 2-level gart table driver change. Right now this PSP FW support
doesn't exist. To work around this issue temporarily, the FB aperture is
added back and the gart aperture address is converted back to the FB aperture
for this PSP TMR address.

Will revert it after we get a fix from PSP FW.

v2: squash in tmr fix for other asics (Kevin)

Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>
Reviewed-by: Felix Kuehling <felix.kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
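
One plausible reading of "converted back to FB aperture" is a simple rebase: keep the offset of the address within vram and add it to the start of the FB aperture. The sketch below shows only that arithmetic. The function name and all three parameters are assumptions made for illustration, and the actual driver change may compute the address differently.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Rebase an address from the GART aperture into the FB aperture.
 * Assumption: vram occupies a contiguous range starting at vram_start
 * inside the GART aperture and maps 1:1 onto the FB aperture starting
 * at fb_start. All names here are hypothetical.
 */
static uint64_t gart_to_fb_addr(uint64_t gart_addr,
				uint64_t vram_start,
				uint64_t fb_start)
{
	assert(gart_addr >= vram_start); /* address must lie inside vram */
	return fb_start + (gart_addr - vram_start);
}
```

For example, an address 1 MiB past `vram_start` maps to 1 MiB past `fb_start`.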

drm/amdgpu: HW setup of 2-level vmid0 page table · 0c19cab5

Committed by Oak Zeng on Sep 17, 2020

Set up HW for the 2-level vmid0 page table:
1. Set up the PAGE_TABLE_START/END registers. Currently the 2-level
page table is only planned for ALDEBARAN, so only gfxhub1.0
and mmhub1.7 are changed.
2. Set the page table base register. For the 2-level page table,
the page table base should point to PDB0.
3. Disable the AGP and FB apertures as they are not used.

Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>
Reviewed-by: Christian Konig <christian.koenig@amd.com>
Reviewed-by: Felix Kuehling <felix.kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Set up vmid0 PDB0 · 522510a6

Committed by Oak Zeng on Sep 17, 2020

If GART is used for FB translation, allocate and fill
PDB0.

Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>
Reviewed-by: Christian Konig <christian.koenig@amd.com>
Reviewed-by: Felix Kuehling <felix.kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Add function to allocate and fill PDB0 · a2902c09

Committed by Oak Zeng on Sep 17, 2020

Add functions to allocate PDB0, map it for CPU access,
and fill it.

Those functions are only used for 2-level vmid0 page
table construction.

Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>
Reviewed-by: Felix Kuehling <felix.kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Use different gart table parameters for 2-level gart table · 7b454b3a

Committed by Oak Zeng on Sep 17, 2020

If GART is used for FB translation, vram is squeezed into
the sysvm aperture. This requires a 2-level gart table. Add
page table depth and page table block size parameters
to gmc. This is preparatory work for 2-level gart table
construction.

Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>
Reviewed-by: Christian Konig <christian.koenig@amd.com>
Reviewed-by: Felix Kuehling <felix.kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Placement of gart and vram in sysvm aperture · f527f310

Committed by Oak Zeng on Sep 15, 2020

If GART is used for FB translation, place both vram and gart in the sysvm
aperture. The AGP aperture is not set up in this case because it
is not used.

Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>
Reviewed-by: Christian Konig <christian.koenig@amd.com>
Reviewed-by: Felix Kuehling <felix.kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Modify comments of vram_start/end · 6e93ef8b

Committed by Oak Zeng on Sep 15, 2020

Modify the comment to reflect the fact that, if
GART is used for vram address translation for vmid0,
[vram_start, vram_end] will be placed inside the SYSVM
aperture, together with GART.

Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>
Reviewed-by: Christian Konig <christian.koenig@amd.com>
Reviewed-by: Felix Kuehling <felix.kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Moved gart_size calculation to mc_init functions · f1dc12ca

Committed by Oak Zeng on Oct 02, 2020

In the amdgpu_gmc_gart_location function, gart_size is adjusted
by smu_prv_buffer_size. This logic doesn't belong in
this function. Move it to the mc_init functions.

Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>
Reviewed-by: Christian Konig <christian.koenig@amd.com>
Reviewed-by: Felix Kuehling <felix.kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Use physical translation mode to access page table · 1f928f51

Committed by Oak Zeng on Jan 23, 2021

On A+A platforms, the CPU writes page directories and page tables in
cached mode, so the page table walker must snoop the CPU cache.
This setting lets the page walker snoop page directory
and page table data out of the CPU cache.

Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>
Acked-by: Christian Konig <christian.koenig@amd.com>
Reviewed-by: Felix Kuehling <felix.kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Don't reserve vram as WC for A+A · 35d5f224

Committed by Oak Zeng on Jan 22, 2021

On A+A platforms, vram can be mapped as WB. It is not necessary
to always map vram as WC on such platforms.

Calling arch_io_reserve_memtype_wc marks the
whole vram region as WC, so don't call it on A+A platforms.

Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>
Suggested-by: Alex Deucher <alexander.deucher@amd.com>
Acked-by: Christian Konig <christian.koenig@amd.com>
Reviewed-by: Felix Kuehling <felix.kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: mask the xgmi number of hops reported from psp to kfd · 4ac5617c

Committed by Jonathan Kim on Jan 27, 2021

The psp supplies the link type in the upper 2 bits of the psp xgmi node
information num_hops field. With a new link type, Aldebaran has these
bits set to a non-zero value (1 = xGMI3), so the KFD topology will report
incorrect IO link weights without proper masking.
The actual number of hops is located in the 3 least significant bits of
this field, so mask it off accordingly before passing it to the KFD.

Signed-off-by: Jonathan Kim <jonathan.kim@amd.com>
Reviewed-by: Amber Lin <amber.lin@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
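
The masking described in this commit can be illustrated with a short sketch. It assumes num_hops is an 8-bit field, so that "upper 2 bits" means bits 7:6. The macro and function names are made up for this example and are not the driver's.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Assumed layout of the 8-bit num_hops field, following the commit text:
 *   bits 7:6  link type (1 = xGMI3)
 *   bits 2:0  number of hops
 * Names are illustrative only.
 */
#define XGMI_HOPS_MASK        0x07u
#define XGMI_LINK_TYPE_SHIFT  6
#define XGMI_LINK_TYPE_MASK   0x03u

/* The two sub-fields must not share any bits. */
static_assert((XGMI_HOPS_MASK &
	       (XGMI_LINK_TYPE_MASK << XGMI_LINK_TYPE_SHIFT)) == 0,
	      "hop count and link type fields overlap");

/* Hop count with the link-type bits masked off. */
static inline uint8_t xgmi_hops(uint8_t raw)
{
	return raw & XGMI_HOPS_MASK;
}

/* Link type taken from the upper two bits. */
static inline uint8_t xgmi_link_type(uint8_t raw)
{
	return (raw >> XGMI_LINK_TYPE_SHIFT) & XGMI_LINK_TYPE_MASK;
}
```

Under these assumptions, a raw value of 0x41 decodes to link type 1 with 1 hop. Without the mask the same value would be read as 65 hops, which is how the link weights would come out wrong.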

drm/amdgpu: enable 48-bit IH timestamp counter · 9a9c59a8

Committed by Alex Sierra on Jan 15, 2021

By default this timestamp is a 32-bit counter. It
overflows in around 10 minutes.

Signed-off-by: Alex Sierra <alex.sierra@amd.com>
Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
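
As a rough worked example of the gain from 48 bits: the commit does not state the tick rate, but a 32-bit counter that wraps in about 10 minutes implies roughly 2^32 / 600 s, or about 7.2 MHz. At that inferred rate a 48-bit counter lasts 65536 times longer, roughly 455 days. The helpers below are hypothetical and only reproduce this arithmetic.

```c
#include <assert.h>
#include <stdint.h>

/* Seconds until a free-running counter of `bits` width wraps at `hz` ticks/s. */
static double counter_wrap_seconds(unsigned bits, double hz)
{
	assert(bits > 0 && bits < 64 && hz > 0.0);
	return (double)(UINT64_C(1) << bits) / hz;
}

/* Tick rate implied by a 32-bit counter that wraps after `seconds`. */
static double implied_tick_hz(double seconds)
{
	assert(seconds > 0.0);
	return (double)(UINT64_C(1) << 32) / seconds;
}
```

Calling `counter_wrap_seconds(48, implied_tick_hz(600.0))` gives about 39.3 million seconds. The 10-minute figure is itself approximate, so the result is an order-of-magnitude estimate.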

drm/amdgpu: enable retry fault wptr overflow · b672cb1e

Committed by Philip Yang on Sep 22, 2020

If xnack is on, VM retry fault interrupts are sent to IH ring1, and ring1
fills up quickly. IH then cannot receive other interrupts, which causes
a deadlock if the driver migrates a buffer using sdma and waits for sdma
to finish while handling a retry fault.

Remove VMC from the IH storm client and enable ring1 write pointer overflow,
so that IH drops retry fault interrupts and can still receive other
interrupts while the driver is handling a retry fault.

The IH ring1 write pointer is not written back to memory by IH, and the ring1
write pointer recorded by self-irq is not updated, so always read
the latest ring1 write pointer from the register.

Signed-off-by: Philip Yang <Philip.Yang@amd.com>
Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Use free system memory size for kfd memory accounting · df23d1bb

Committed by Oak Zeng on Jan 18, 2021

With the current kfd memory accounting scheme, kfd applications
can use up to 15/16 of total system memory. On systems with
little total memory, this leaves very little for the OS.
For example, on a system with 16GB of system
memory, this scheme leaves the OS and non-kfd applications only 1GB.
In many cases, this triggers the OOM killer.

This patch changes the KFD system memory accounting scheme to use
15/16 of the system memory that is free when the kfd driver loads.
This deducts the system memory that the OS already uses.

Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>
Suggested-by: Philip Yang <Philip.Yang@amd.com>
Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
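
The difference between the two accounting schemes can be checked with a small calculation. The helpers `limit_15_16` and `left_for_others` are hypothetical, and the 12 GiB of free memory in the usage note is an invented input chosen only to show the effect.

```c
#include <assert.h>
#include <stdint.h>

/* 15/16 of `bytes`, computed as bytes minus one sixteenth. */
static uint64_t limit_15_16(uint64_t bytes)
{
	return bytes - (bytes >> 4);
}

/* Memory left outside the KFD limit, given total memory and the limit. */
static uint64_t left_for_others(uint64_t total, uint64_t limit)
{
	assert(limit <= total);
	return total - limit;
}
```

With 16 GiB of total memory, the old scheme sets the limit to `limit_15_16(16 GiB)` = 15 GiB, leaving 1 GiB for everything else, which matches the commit message. If 12 GiB happened to be free at driver load, the new scheme would set the limit to 11.25 GiB, leaving 4.75 GiB of the total outside the KFD limit.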

drm/amdgpu: apply new pmfw loading sequence to arcturus and onwards · b335f289

Committed by Hawking Zhang on Jan 20, 2021

Arcturus and later products should follow the same sequence,
which loads the pmfw ahead of tmr setup.

Signed-off-by: Hawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: Kevin Wang <kevin1.wang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Fix aldebaran MMHUB CG/LS logic · 6d905921

Committed by Lijo Lazar on Jan 18, 2021

Aldebaran MMHUB CG/LS logic is controlled by VBIOS. Enable the state
change logic only if the driver is used for control.

Signed-off-by: Lijo Lazar <lijo.lazar@amd.com>
Reviewed-by: Feifei Xu <Feifei.Xu@amd.com>
Reviewed-by: Kevin Wang <kevin1.wang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Enable CP idle interrupts · 8cf3dccb

Committed by Lijo Lazar on Jan 16, 2021

v1: The interrupts need to be enabled to move to DS clocks.
v2: Don't enable GFX IDLE interrupts if there are no GFX rings.

Signed-off-by: Lijo Lazar <lijo.lazar@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Add clock gating support for aldebaran · 48a6379a

Committed by Lijo Lazar on Mar 05, 2021

Aldebaran clock gating support for GFX, SDMA, and IH blocks.
VCN/JPEG blocks are excluded from this patch, to be enabled later.

Signed-off-by: Lijo Lazar <lijo.lazar@amd.com>
Acked-by: Feifei Xu <Feifei.Xu@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: add mmhub client ids for aldebaran · e844cd99

Committed by Alex Deucher on Jan 05, 2021

Add the mmhub client id table for aldebaran.

Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: enable dpg indirect sram mode on aldebaran · 557da413

Committed by James Zhu on Dec 17, 2020

Enable dpg indirect sram mode on aldebaran.

Signed-off-by: James Zhu <James.Zhu@amd.com>
Reviewed-by: Leo Liu <leo.liu@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: enable vcn dpg mode on aldebaran · bd937973

Committed by James Zhu on Dec 17, 2020

Enable vcn dpg mode on aldebaran.

Signed-off-by: James Zhu <James.Zhu@amd.com>
Reviewed-by: Leo Liu <leo.liu@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: enable vcn and jpeg on aldebaran · fdb1fdef

Committed by James Zhu on Dec 17, 2020

Enable vcn and jpeg 2.6 on aldebaran.

Signed-off-by: James Zhu <James.Zhu@amd.com>
Reviewed-by: Leo Liu <leo.liu@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Enable swsmu block on aldebaran · bd7228ab

Committed by Lijo Lazar on Dec 22, 2020

Enable the smu13 block on aldebaran.

Signed-off-by: Lijo Lazar <lijo.lazar@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: switch to cached noretry setting for aldebaran · 84281136

Committed by Hawking Zhang on Dec 31, 2020

The global noretry setting is now cached in gmc.noretry.

Signed-off-by: Hawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: Kevin Wang <kevin1.wang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: bypass hdp read cache invalidation for aldebaran (v2) · d02692ae

Committed by Hawking Zhang on Dec 22, 2020

The hdp read cache is removed in aldebaran. Don't issue
an mmio write or write data packet to hardware.

v2: rebase

Signed-off-by: Hawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: Feifei Xu <Feifei.Xu@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Aldebaran doesn't use semaphore · b7daed1b

Committed by Amber Lin on Dec 14, 2020

Simplify all Aldebaran DIDs into one ASIC type.

Signed-off-by: Amber Lin <Amber.Lin@amd.com>
Reviewed-by: Kevin Wang <kevin1.wang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: UTLC1 RB SDMA timeout on Aldebaran · 07744e90

Committed by Alex Sierra on Dec 14, 2020

[Why]
This causes infinite retries on the UTCL1 RB, preventing
higher priority RB such as paging RB.

[How]
Set the SDMAx_UTLC1_TIMEOUT registers to one for all SDMAs.

Signed-off-by: Alex Sierra <alex.sierra@amd.com>
Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

openeuler / Kernel · synced successfully over 1 year ago
