提交 · 5217811e74d3b9b6d830476ab8419bbe3d42596e · openeuler / Kernel

24 3月, 2021 40 次提交

drm/amdgpu: add gc powerbrake support (v2) · 5217811e

由 Kevin Wang 提交于 1月 15, 2021

add GC power brake feature support for Aldebaran.

v2: squash in fixes (Alex)
Signed-off-by: NKevin Wang <kevin1.wang@amd.com>
Reviewed-by: NKenneth Feng <kenneth.feng@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

5217811e

drm/amdgpu: update TCP_CHAN_STEER_1 golden value for aldebaran · b3ecf36b

由 Hawking Zhang 提交于 11月 12, 2020

The golden setting was changed recently. update to
the latest one
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: NKevin Wang <kevin1.wang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

b3ecf36b

drm/amdgpu: add common gc golden settings for aldebaran · 9f55d7ed

由 Hawking Zhang 提交于 11月 11, 2020

golden settings that should be applied
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: NKevin Wang <kevin1.wang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

9f55d7ed

drm/amdgpu: apply gc v9_4_2 golden settings for aldebaran · 264aef8b

由 Hawking Zhang 提交于 10月 19, 2020

Those registers should be programmed as one-time initialization
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: NKevin Wang <kevin1.wang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

264aef8b

drm/amdgpu: restore aldebaran save ttmp and trap config on init (v2) · 16171a25

由 Jonathan Kim 提交于 8月 21, 2020

Initialization of TRAP_DATA0/1 is still required for the debugger to detect
new waves on Aldebaran.  Also, per-vmid global trap enablement may be
required outside of debugger scope so move to init phase.

v2: just add the gfx 9.4.2 changes (Alex)
Signed-off-by: NJonathan Kim <Jonathan.Kim@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

16171a25

drm/amdkfd: add aldebaran kfd2kgd callbacks to kfd device (v2) · 5073506c

由 Jonathan Kim 提交于 9月 05, 2020

Create dedicated Aldebaran kfd2kgd callbacks to prepare
for new per-vmid register instructions for debug trap
setting functions and sending host traps.

v2: rebase (Alex)
Signed-off-by: NJonathan Kim <Jonathan.Kim@amd.com>
Reviewed-by: NOak Zeng <Oak.Zeng@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

5073506c

drm/amdkfd: Check HIQ's MQD for queue preemption status · 51a0f459

由 Oak Zeng 提交于 7月 07, 2020

MEC firmware can silently fail the queue preemption request
without time out. In this case, HIQ's MQD's queue_doorbell_id
will be set. Check this field to see whether last queue preemption
was successful or not.
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Suggested-by: NJay Cornwall <Jay.Cornwall@amd.com>
Acked-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

51a0f459

drm/amdkfd: Add kernel parameter to stop queue eviction on vm fault · 6d909c5d

由 Oak Zeng 提交于 6月 22, 2020

This is to keep wavefront context for debug purpose
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

6d909c5d

drm/amdgpu: allow use psp to load firmware (v2) · 2f669734

由 Hawking Zhang 提交于 4月 13, 2020

Match existing asics.

v2: rebase (Alex)
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: NKevin Wang <kevin1.wang@amd.com>
Reviewed-by: NLe Ma <Le.Ma@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

2f669734

drm/amd/pm: Enable user min/max gfxclk on aldebaran · 65ec7c08

由 Lijo Lazar 提交于 2月 04, 2021

Aldebaran has fine grained DPM for GFXCLK. Instead of a discrete level,
user can specify a min/max range of GFXCLK for any profiling/tuning
purpose.This option is available only in manual performance level mode.
Select "manual" as power_dpm_force_performance_level and specify the
min/max range using pp_dpm_sclk sysfs node. User cannot specify a min/max
range outside of the default min/max range of the ASIC. If specified
outside the range, values will be bound by the default min/max range.

Ex: To use gfxclk min = 600MHz and max = 900MHz

echo manual > /sys/bus/pci/devices/.../power_dpm_force_performance_level
echo min 600 max 900 > /sys/bus/pci/devices/.../pp_dpm_sclk
Signed-off-by: NLijo Lazar <lijo.lazar@amd.com>
Reviewed-by: NHawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

65ec7c08

drm/amd/pm: remove aldebaran serial number support · 2bb8ac85

由 Kevin Wang 提交于 2月 04, 2021

the following message is not supported.

PPSMC_MSG_ReadSerialNumTop32
PPSMC_MSG_ReadSerialNumBottom32
Signed-off-by: NKevin Wang <kevin1.wang@amd.com>
Reviewed-by: NKenneth Feng <kenneth.feng@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

2bb8ac85

drm/amdgpu: use pd addr based on gart level page table · ec8631e0

由 Alex Sierra 提交于 2月 03, 2021

With a recent gart page table re-construction, the gart page
table is now 2-level for some ASICs: PDB0->PTB.
In the case of 2-level gart page table, the page_table_base
of vmid0 should point to PDB0 instead of PTB.
Signed-off-by: NAlex Sierra <alex.sierra@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: NOak Zeng <Oak.Zeng@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

ec8631e0

drm/amdgpu: Fix the comment in amdgpu_gmc.h · be0478e7

由 Oak Zeng 提交于 2月 03, 2021

More accurate words are used to address a
code review feedback
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

be0478e7

drm/amdgpu: Fix GART page table s-bit · 79194dac

由 Oak Zeng 提交于 1月 22, 2021

For the new 2-level GART table, the last PDE0 points
to PTB. Since PTB is in vram and right now we are
runing under s=0 mode (vram is treated as FB carveout),
so the s bit of this PDE0 should be set to 0.
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

79194dac

drm/amdgpu: update mmhub client ids for Aldebaran · f4ec3e50

由 Alex Sierra 提交于 2月 02, 2021

update mmhub client id table for Aldebaran.
Signed-off-by: NAlex Sierra <alex.sierra@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

f4ec3e50

drm/amdgpu: enable sram initialization for aldebaran · abe5ee57

由 Dennis Li 提交于 2月 01, 2021

Aldebaran can share the same initializing shader code witn
arcturus.
Signed-off-by: NDennis Li <Dennis.Li@amd.com>
Reviewed-by: NHawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

abe5ee57

drm/amdgpu: workaround the TMR MC address issue (v2) · 2f055097

由 Oak Zeng 提交于 1月 26, 2021

With the 2-level gart page table,  vram is squeezed into gart aperture
and FB aperture is disabled. Therefore all VRAM virtual addresses are
 in the GART aperture. However currently PSP requires TMR addresses
in FB aperture. So we need some design change at PSP FW level to support
this 2-level gart table driver change. Right now this PSP FW support
doesn't exist. To workaround this issue temporarily, FB aperture is
added back and the gart aperture address is converted back to FB aperture
for this PSP TMR address.

Will revert it after we get a fix from PSP FW.

v2: squash in tmr fix for other asics (Kevin)
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Reviewed-by: NFelix Kuehling <felix.kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

2f055097

drm/amdgpu: HW setup of 2-level vmid0 page table · 0c19cab5

由 Oak Zeng 提交于 9月 17, 2020

Set up HW for 2-level vmid0 page table: 1. Set up
PAGE_TABLE_START/END registers. Currently only plan
to do 2-level page table for ALDEBARAN, so only gfxhub1.0
and mmhub1.7 is changed. 2. Set page table base register.
For 2-level page table, the page table base should point
to PDB0. 3. Disable AGP and FB aperture as they are not
used.
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Reviewed-by: NChristian Konig <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <felix.kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

0c19cab5

drm/amdgpu: Set up vmid0 PDB0 · 522510a6

由 Oak Zeng 提交于 9月 17, 2020

If use gart for FB translation, allocate and fill
PDB0.
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Reviewed-by: NChristian Konig <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <felix.kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

522510a6

drm/amdgpu: Add function to allocate and fill PDB0 · a2902c09

由 Oak Zeng 提交于 9月 17, 2020

Add functions to allocate PDB0, map it for CPU access,
and fill it.

Those functions are only used for 2-level vmid0 page
table construction
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Reviewed-by: NFelix Kuehling <felix.kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

a2902c09

drm/amdgpu: Use different gart table parameters for 2-level gart table · 7b454b3a

由 Oak Zeng 提交于 9月 17, 2020

If use gart for FB translation, we will squeeze vram into
sysvm aperture. This requires 2 level gart table. Add
page table depth and page table block size parameters
to gmc. This is prepare work to 2-level gart table
construction
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Reviewed-by: NChristian Konig <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <felix.kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

7b454b3a

drm/amdgpu: Placement of gart and vram in sysvm aperture · f527f310

由 Oak Zeng 提交于 9月 15, 2020

If use GART for FB translation, place both vram and gart to sysvm
aperture. AGP aperture is not set up in this case because it
is not used
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Reviewed-by: NChristian Konig <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <felix.kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

f527f310

drm/amdgpu: Modify comments of vram_start/end · 6e93ef8b

由 Oak Zeng 提交于 9月 15, 2020

Modify the comment to reflect the fact that, if
use GART for vram address translation for vmid0,
[vram_start, vram_end] will be placed inside SYSVM
aperture, together with GART.
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Reviewed-by: NChristian Konig <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <felix.kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

6e93ef8b

drm/amdgpu: Moved gart_size calculation to mc_init functions · f1dc12ca

由 Oak Zeng 提交于 10月 02, 2020

In amdgpu_gmc_gart_location function, gart_size is adjusted
by a smu_prv_buffer_size. This logic shouldn't belong to
this function. Move the logic to the mc_init functions
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Reviewed-by: NChristian Konig <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <felix.kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

f1dc12ca

drm/amdgpu: Use physical translation mode to access page table · 1f928f51

由 Oak Zeng 提交于 1月 23, 2021

On A+A platform, CPU write page directory and page table in cached
mode. So it is necessary for page table walker to snoop CPU cache.
This setting is necessary for page walker to snoop page directory
and page table data out of CPU cache.
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Acked-by: NChristian Konig <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <felix.kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

1f928f51

drm/amdgpu: Don't reserve vram as WC for A+A · 35d5f224

由 Oak Zeng 提交于 1月 22, 2021

On A+A platform, vram can be mapped as WB. Not necessarily
to always map vram as WC on such platform.

Calling function arch_io_reserve_memtype_wc will mark the
whole vram region as WC. So don't call it for A+A platform.
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Suggested-by: NAlex Deucher <alexander.deucher@amd.com>
Acked-by: NChristian Konig <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <felix.kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

35d5f224

drm/amd/pm: Correct msg status check for powerlimit · debd629a

由 Lijo Lazar 提交于 1月 29, 2021

Status 0 indicates success, fix the check before using PPTable limit
Signed-off-by: NLijo Lazar <lijo.lazar@amd.com>
Reviewed-by: NEvan Quan <evan.quan@amd.com>
Reviewed-by: Kevin Wang <kevin1.wang@amd.com>`
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

debd629a

drm/amd/pm: Enable performance determinism on aldebaran · 6be64246

由 Lijo Lazar 提交于 3月 05, 2021

Performance Determinism is a new mode in Aldebaran where PMFW tries to
maintain sustained performance level. It can be enabled on a per-die
basis on aldebaran. To guarantee that it remains within the power cap,
a max GFX frequency needs to be specified in this mode. A new
power_dpm_force_performance_level, "perf_determinism", is defined to enable
this mode in amdgpu. The max frequency (in MHz) can be specified through
pp_dpm_sclk. The mode will be disabled once any other performance level
is chosen.

Ex: To enable perf determinism at 900Mhz max gfx clock

echo perf_determinism > /sys/bus/pci/devices/.../power_dpm_force_performance_level
echo max 900 > /sys/bus/pci/devices/.../pp_dpm_sclk
Signed-off-by: NLijo Lazar <lijo.lazar@amd.com>
Reviewed-by: NKenneth Feng <kenneth.feng@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

6be64246

drm/amd/pm: Add DCBTC support for aldebaran · 26256ca8

由 Lijo Lazar 提交于 1月 28, 2021

On aldebaran DCBTC should be run after enabling DPM. DCBTC won't be run
if support is not enabled in PPTable. Without PPTable support the message
is dummy and will return success always.
Signed-off-by: NLijo Lazar <lijo.lazar@amd.com>
Reviewed-by: NHawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

26256ca8

drm/amd/pm: Fix power limit query on aldebaran · d6f19a99

由 Lijo Lazar 提交于 1月 28, 2021

Aldebaran doesn't have AC/DC power limits. Separate the implementation
from SMU13. Max power limit is queried from PPTable.
Signed-off-by: NLijo Lazar <lijo.lazar@amd.com>
Reviewed-by: NKevin Wang <kevin1.wang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

d6f19a99

drm/amdgpu: mask the xgmi number of hops reported from psp to kfd · 4ac5617c

由 Jonathan Kim 提交于 1月 27, 2021

The psp supplies the link type in the upper 2 bits of the psp xgmi node
information num_hops field. With a new link type, Aldebaran has these
bits set to a non-zero value (1 = xGMI3) so the KFD topology will report
the incorrect IO link weights without proper masking.
The actual number of hops is located in the 3 least significant bits of
this field so mask if off accordingly before passing it to the KFD.
Signed-off-by: NJonathan Kim <jonathan.kim@amd.com>
Reviewed-by: NAmber Lin <amber.lin@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

4ac5617c

drm/amdgpu: enable 48-bit IH timestamp counter · 9a9c59a8

由 Alex Sierra 提交于 1月 15, 2021

By default this timestamp is 32 bit counter. It gets
overflowed in around 10 minutes.
Signed-off-by: NAlex Sierra <alex.sierra@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

9a9c59a8

drm/amdgpu: enable retry fault wptr overflow · b672cb1e

由 Philip Yang 提交于 9月 22, 2020

If xnack is on, VM retry fault interrupt send to IH ring1, and ring1
will be full quickly. IH cannot receive other interrupts, this causes
deadlock if migrating buffer using sdma and waiting for sdma done while
handling retry fault.

Remove VMC from IH storm client, enable ring1 write pointer overflow,
then IH will drop retry fault interrupts and be able to receive other
interrupts while driver is handling retry fault.

IH ring1 write pointer doesn't writeback to memory by IH, and ring1
write pointer recorded by self-irq is not updated, so always read
the latest ring1 write pointer from register.
Signed-off-by: NPhilip Yang <Philip.Yang@amd.com>
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

b672cb1e

drm/amdgpu: Use free system memory size for kfd memory accounting · df23d1bb

由 Oak Zeng 提交于 1月 18, 2021

With the current kfd memory accounting scheme, kfd applications
can use up to 15/16 of total system memory. For system which
has small total system memory size it leaves small system memory
for OS. For example, if the system has totally 16GB of system
memory, this scheme leave OS and non-kfd applications only 1GB
of system memory. In many cases, this leads to OOM killer.

This patch changed the KFD system memory accounting scheme.
15/16 of free system memory when kfd driver load. This deduct
the system memory that OS already use.
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Suggested-by: NPhilip Yang <Philip.Yang@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

df23d1bb

drm/amdgpu: apply new pmfw loading sequence to arcturus and onwards · b335f289

由 Hawking Zhang 提交于 1月 20, 2021

Arcturus and onwards products should follow the same sequence
that have pmfw loading ahead of tmr setup
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: NKevin Wang <kevin1.wang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

b335f289

drm/amdgpu: Fix aldebaran MMHUB CG/LS logic · 6d905921

由 Lijo Lazar 提交于 1月 18, 2021

Aldebaran MMHUB CG/LS logic is controlled by VBIOS. Enable the state
change logic only if driver is used for control.
Signed-off-by: NLijo Lazar <lijo.lazar@amd.com>
Reviewed-by: NFeifei Xu <Feifei.Xu@amd.com>
Reviewed-by: NKevin Wang <kevin1.wang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

6d905921

drm/amdgpu: Enable CP idle interrupts · 8cf3dccb

由 Lijo Lazar 提交于 1月 16, 2021

v1: The interrupts need to be enabled to move to DS clocks.
v2: Don't enable GFX IDLE interrupts if there are no GFX rings.
Signed-off-by: NLijo Lazar <lijo.lazar@amd.com>
Reviewed-by: NHawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

8cf3dccb

drm/amdgpu/pm: Remove redundant generic message index · 8a6b6b66

由 Lijo Lazar 提交于 1月 12, 2021

Remove SMU_MSG_GfxDriverReset generic index.
Always use SMU_MSG_GfxDeviceDriverReset as the generic index for reset.
Signed-off-by: NLijo Lazar <lijo.lazar@amd.com>
Reviewed-by: NFeifei Xu <Feifei.Xu@amd.com>
Reviewed-by: NKevin Wang <kevin1.wang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

8a6b6b66

drm/amdgpu/pm: Fix reset message mapping on aldebaran · ced7e082

由 Lijo Lazar 提交于 1月 12, 2021

Use the correct mapping for mode-reset messages on aldebaran
Signed-off-by: NLijo Lazar <lijo.lazar@amd.com>
Reviewed-by: NFeifei Xu <Feifei.Xu@amd.com>
Reviewed-by: NKevin Wang <kevin1.wang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

ced7e082

drm/amdgpu/pm: Remove unsupported MP1 messages from aldebaran · 701db675

由 Lijo Lazar 提交于 1月 11, 2021

PrepareMp1Reset and SoftReset messages are not supported on aldebaran.
Signed-off-by: NLijo Lazar <lijo.lazar@amd.com>
Reviewed-by: NFeifei Xu <Feifei.Xu@amd.com>
Reviewed-by: NKevin Wang <kevin1.wang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

701db675

openeuler / Kernel 1 年多 前同步成功

openeuler / Kernel
1 年多前同步成功