提交 · 7b454b3a34335381cb660864eb1e11077847c5db · openeuler / Kernel

24 3月, 2021 40 次提交

drm/amdgpu: Use different gart table parameters for 2-level gart table · 7b454b3a

由 Oak Zeng 提交于 9月 17, 2020

If use gart for FB translation, we will squeeze vram into
sysvm aperture. This requires 2 level gart table. Add
page table depth and page table block size parameters
to gmc. This is prepare work to 2-level gart table
construction
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Reviewed-by: NChristian Konig <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <felix.kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

7b454b3a

drm/amdgpu: Placement of gart and vram in sysvm aperture · f527f310

由 Oak Zeng 提交于 9月 15, 2020

If use GART for FB translation, place both vram and gart to sysvm
aperture. AGP aperture is not set up in this case because it
is not used
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Reviewed-by: NChristian Konig <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <felix.kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

f527f310

drm/amdgpu: Modify comments of vram_start/end · 6e93ef8b

由 Oak Zeng 提交于 9月 15, 2020

Modify the comment to reflect the fact that, if
use GART for vram address translation for vmid0,
[vram_start, vram_end] will be placed inside SYSVM
aperture, together with GART.
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Reviewed-by: NChristian Konig <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <felix.kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

6e93ef8b

drm/amdgpu: Moved gart_size calculation to mc_init functions · f1dc12ca

由 Oak Zeng 提交于 10月 02, 2020

In amdgpu_gmc_gart_location function, gart_size is adjusted
by a smu_prv_buffer_size. This logic shouldn't belong to
this function. Move the logic to the mc_init functions
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Reviewed-by: NChristian Konig <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <felix.kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

f1dc12ca

drm/amdgpu: Use physical translation mode to access page table · 1f928f51

由 Oak Zeng 提交于 1月 23, 2021

On A+A platform, CPU write page directory and page table in cached
mode. So it is necessary for page table walker to snoop CPU cache.
This setting is necessary for page walker to snoop page directory
and page table data out of CPU cache.
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Acked-by: NChristian Konig <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <felix.kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

1f928f51

drm/amdgpu: Don't reserve vram as WC for A+A · 35d5f224

由 Oak Zeng 提交于 1月 22, 2021

On A+A platform, vram can be mapped as WB. Not necessarily
to always map vram as WC on such platform.

Calling function arch_io_reserve_memtype_wc will mark the
whole vram region as WC. So don't call it for A+A platform.
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Suggested-by: NAlex Deucher <alexander.deucher@amd.com>
Acked-by: NChristian Konig <christian.koenig@amd.com>
Reviewed-by: NFelix Kuehling <felix.kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

35d5f224

drm/amdgpu: mask the xgmi number of hops reported from psp to kfd · 4ac5617c

由 Jonathan Kim 提交于 1月 27, 2021

The psp supplies the link type in the upper 2 bits of the psp xgmi node
information num_hops field. With a new link type, Aldebaran has these
bits set to a non-zero value (1 = xGMI3) so the KFD topology will report
the incorrect IO link weights without proper masking.
The actual number of hops is located in the 3 least significant bits of
this field so mask if off accordingly before passing it to the KFD.
Signed-off-by: NJonathan Kim <jonathan.kim@amd.com>
Reviewed-by: NAmber Lin <amber.lin@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

4ac5617c

drm/amdgpu: enable 48-bit IH timestamp counter · 9a9c59a8

由 Alex Sierra 提交于 1月 15, 2021

By default this timestamp is 32 bit counter. It gets
overflowed in around 10 minutes.
Signed-off-by: NAlex Sierra <alex.sierra@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

9a9c59a8

drm/amdgpu: enable retry fault wptr overflow · b672cb1e

由 Philip Yang 提交于 9月 22, 2020

If xnack is on, VM retry fault interrupt send to IH ring1, and ring1
will be full quickly. IH cannot receive other interrupts, this causes
deadlock if migrating buffer using sdma and waiting for sdma done while
handling retry fault.

Remove VMC from IH storm client, enable ring1 write pointer overflow,
then IH will drop retry fault interrupts and be able to receive other
interrupts while driver is handling retry fault.

IH ring1 write pointer doesn't writeback to memory by IH, and ring1
write pointer recorded by self-irq is not updated, so always read
the latest ring1 write pointer from register.
Signed-off-by: NPhilip Yang <Philip.Yang@amd.com>
Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

b672cb1e

drm/amdgpu: Use free system memory size for kfd memory accounting · df23d1bb

由 Oak Zeng 提交于 1月 18, 2021

With the current kfd memory accounting scheme, kfd applications
can use up to 15/16 of total system memory. For system which
has small total system memory size it leaves small system memory
for OS. For example, if the system has totally 16GB of system
memory, this scheme leave OS and non-kfd applications only 1GB
of system memory. In many cases, this leads to OOM killer.

This patch changed the KFD system memory accounting scheme.
15/16 of free system memory when kfd driver load. This deduct
the system memory that OS already use.
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Suggested-by: NPhilip Yang <Philip.Yang@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

df23d1bb

drm/amdgpu: apply new pmfw loading sequence to arcturus and onwards · b335f289

由 Hawking Zhang 提交于 1月 20, 2021

Arcturus and onwards products should follow the same sequence
that have pmfw loading ahead of tmr setup
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: NKevin Wang <kevin1.wang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

b335f289

drm/amdgpu: Fix aldebaran MMHUB CG/LS logic · 6d905921

由 Lijo Lazar 提交于 1月 18, 2021

Aldebaran MMHUB CG/LS logic is controlled by VBIOS. Enable the state
change logic only if driver is used for control.
Signed-off-by: NLijo Lazar <lijo.lazar@amd.com>
Reviewed-by: NFeifei Xu <Feifei.Xu@amd.com>
Reviewed-by: NKevin Wang <kevin1.wang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

6d905921

drm/amdgpu: Enable CP idle interrupts · 8cf3dccb

由 Lijo Lazar 提交于 1月 16, 2021

v1: The interrupts need to be enabled to move to DS clocks.
v2: Don't enable GFX IDLE interrupts if there are no GFX rings.
Signed-off-by: NLijo Lazar <lijo.lazar@amd.com>
Reviewed-by: NHawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

8cf3dccb

drm/amdgpu: Add clock gating support for aldebaran · 48a6379a

由 Lijo Lazar 提交于 3月 05, 2021

Aldebaran clock gating support for GFX,SDMA,IH blocks
VCN/JPEG blocks are excluded in this patch, to be enabled later
Signed-off-by: NLijo Lazar <lijo.lazar@amd.com>
Acked-by: NFeifei Xu <Feifei.Xu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

48a6379a

drm/amdgpu: add mmhub client ids for aldebaran · e844cd99

由 Alex Deucher 提交于 1月 05, 2021

Add the mmhub client id table for aldebaran.
Reviewed-by: NHawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

e844cd99

drm/amdgpu: enable dpg indirect sram mode on aldebaran · 557da413

由 James Zhu 提交于 12月 17, 2020

Enable dpg indirect sram mode on aldebaran.
Signed-off-by: NJames Zhu <James.Zhu@amd.com>
Reviewed-by: NLeo Liu <leo.liu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

557da413

drm/amdgpu: enable vcn dpg mode on aldebaran · bd937973

由 James Zhu 提交于 12月 17, 2020

Enable vcn dpg mode on aldebaran
Signed-off-by: NJames Zhu <James.Zhu@amd.com>
Reviewed-by: NLeo Liu <leo.liu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

bd937973

drm/amdgpu: enable vcn and jpeg on aldebaran · fdb1fdef

由 James Zhu 提交于 12月 17, 2020

Enable vcn and jpeg 2.6 on aldebaran.
Signed-off-by: NJames Zhu <James.Zhu@amd.com>
Reviewed-by: NLeo Liu <leo.liu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

fdb1fdef

drm/amdgpu: Enable swsmu block on aldebaran · bd7228ab

由 Lijo Lazar 提交于 12月 22, 2020

Enable smu13 block on aldebaran
Signed-off-by: NLijo Lazar <lijo.lazar@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

bd7228ab

drm/amdgpu: switch to cached noretry setting for aldebaran · 84281136

由 Hawking Zhang 提交于 12月 31, 2020

global noretry setting now is cached to gmc.noretry
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: NKevin Wang <kevin1.wang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

84281136

drm/amdgpu: bypass hdp read cache invalidation for aldebaran (v2) · d02692ae

由 Hawking Zhang 提交于 12月 22, 2020

hdp read cache is removed in aldebaran. don't issue
an mmio write or write data packet to hardware.

v2: rebase
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: NFeifei Xu <Feifei.Xu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

d02692ae

drm/amdgpu: Aldebaran doesn't use semaphore · b7daed1b

由 Amber Lin 提交于 12月 14, 2020

Simplify all Aldebaran DIDs into one ASIC type.
Signed-off-by: NAmber Lin <Amber.Lin@amd.com>
Reviewed-by: NKevin Wang <kevin1.wang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

b7daed1b

drm/amdgpu: UTLC1 RB SDMA timeout on Aldebaran · 07744e90

由 Alex Sierra 提交于 12月 14, 2020

[Why]
This causes infinite retries on the UTCL1 RB, preventing
higher priority RB such as paging RB.

[How]
Set to one the SDMAx_UTLC1_TIMEOUT registers for all SDMAs.
Signed-off-by: NAlex Sierra <alex.sierra@amd.com>
Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: NHawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

07744e90

drm/amdpgu: add ATOM_DGPU_VRAM_TYPE_HBM2E vram type · 8081f8fa

由 Feifei Xu 提交于 12月 16, 2020

0x61 is assigned to HBM2E in atom_dgpu_vram_type.
Signed-off-by: NFeifei Xu <Feifei.Xu@amd.com>
Reviewed-by: NHawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

8081f8fa

drm/amdgpu: retire aldebaran gpu_info firmware · 44b3253a

由 Hawking Zhang 提交于 11月 16, 2020

driver should use the gfx_info atomfirmware interface
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: NFeifei Xu <Feifei.Xu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

44b3253a

drm/amdgpu: query aldebaran gfx_config through atomfirmware i/f · 7159a36e

由 Hawking Zhang 提交于 11月 13, 2020

For ASICs that don't support ip discovery feature, query
gfx configuration through atomfirmware interface, rather
than gpu_info firmware.
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: NFeifei Xu <Feifei.Xu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

7159a36e

drm/amd/amdgpu: Add smu_pptable module parameter · 8738a82b

由 Lijo Lazar 提交于 11月 28, 2020

Temporarily add smu_pptable module parameter for aldebaran.This is used
to force soft PPTable use overriding any VBIOS PPTable.
Signed-off-by: NLijo Lazar <Lijo.Lazar@amd.com>
Reviewed-by: NKenneth Feng <kenneth.feng@amd.com>
Reviewed-by: NKevin Wang <kevin1.wang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

8738a82b

drm/amdgpu: Don't do FB resize under A+A config · be566196

由 Oak Zeng 提交于 11月 21, 2020

Disable PCIe BAR resizing on A+A config. It's not needed because we won't use the
PCIe BAR, but it breaks the PCI BAR configuration with the current SBIOS.

Error message of FB BAR resize failure under A+A:

[  154.913731] [drm:amdgpu_device_resize_fb_bar [amdgpu]] *ERROR* Problem resizing BAR0 (-22).
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Reviewed-by: NAmber Lin <Amber.Lin@amd.com>
Reviewed-by: NFelix Kuehling <Felix.kuehling@amd.com>
Reviewed-by: NChristian Koenig <Christian.Koenig@amd.com>
Tested-by: NAmber Lin <Amber.Lin@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

be566196

drm/amdgpu: pre-map device buffer as cached for A+A config · 9d0af8b4

由 Oak Zeng 提交于 11月 20, 2020

For A+A configuration, device memory is supposed to be mapped as
cachable from CPU side. For kernel pre-map gpu device memory using
ioremap_cache
Signed-off-by: NOak Zeng <Oak.Zeng@amd.com>
Reviewed-by: NChristian Koenig <Christian.Koenig@amd.com>
Tested-by: NAmber Lin <Amber.Lin@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

9d0af8b4

drm/amdgpu: disallow use semaphore on aldebaran · d477c5aa

由 Hawking Zhang 提交于 11月 24, 2020

shall revisit the change later
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

d477c5aa

drm/amdgpu: switch to vega20 ih block for aldebaran · 10c71e6c

由 Hawking Zhang 提交于 12月 01, 2020

replace vega10 ih block with vega20 ih block for
aldebaran.
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Acked-by: NChristian König <christian.koenig@amd.com>
Acked-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: NDennis Li <Dennis.Li@amd.com>
Reviewed-by: NFeifei Xu <Feifei.Xu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

10c71e6c

drm/amdgpu: correct IH_CHICKEN programming for aldebaran · eed4bbd3

由 Hawking Zhang 提交于 12月 01, 2020

For aldebaran, psp firmware won't program IH_CHICKEN.
it now depends on driver to program it properly so
either bus address or gpu virtual address is just
working for ih ring.
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Acked-by: NChristian König <christian.koenig@amd.com>
Acked-by: NFelix Kuehling <Felix.Kuehling@amd.com>
Reviewed-by: NDennis Li <Dennis.Li@amd.com>
Reviewed-by: NFeifei Xu <Feifei.Xu@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

eed4bbd3

drm/amdgpu: add mmhub error status query callback for aldebaran · b45589b8

由 Hawking Zhang 提交于 11月 19, 2020

The callback will be invoked to query mmea error
status when needed.
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: Dennis Li<Dennis.Li@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

b45589b8

drm/amdgpu: add mmhub ras error reset callback for aldebaran · 27ad2ca6

由 Hawking Zhang 提交于 11月 19, 2020

The callback will be invoked to reset mmhub ras error
counters when needed.
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: Dennis Li<Dennis.Li@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

27ad2ca6

drm/amdgpu: add mmhub ras error query callback for aldebaran · cbb84e7a

由 Hawking Zhang 提交于 11月 19, 2020

The callback will be invoked to harvest all kinds
of mmhub ras error
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: Dennis Li<Dennis.Li@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

cbb84e7a

drm/amdgpu: add sdma ras error reset callback for aldebaran · f5f0e4a0

由 Hawking Zhang 提交于 11月 19, 2020

The callback will be invoked to reset sdma ras error
counters when needed.
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: Dennis Li<Dennis.Li@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

f5f0e4a0

drm/amdgpu: add sdma ras error query callback for aldebaran · b2459840

由 Hawking Zhang 提交于 11月 18, 2020

The callback will be invoked to harvest all kinds
of sdma ras error
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: Dennis Li<Dennis.Li@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

b2459840

drm/amdgpu: add sdma v4_4 ras function · 2fdb91a2

由 Hawking Zhang 提交于 11月 18, 2020

sdma ras function is the main structure to support
sdma ras on aldebaran. the patch initializes late_init
late_fini callbacks.
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: Dennis Li<Dennis.Li@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

2fdb91a2

drm/amdgpu: apply sdma golden settings for aldebaran · a6d9d6ab

由 Hawking Zhang 提交于 11月 18, 2020

perform one-time initialization for sdma registers
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: NKevin Wang <kevin1.wang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

a6d9d6ab

drm/amdgpu: use physical_node_id to calculate aper_base · 3de60d96

由 Hawking Zhang 提交于 11月 17, 2020

Similar as xgmi connected gpu nodes, physical_node_id
* segment_size should be used to calculate the offset
of aper_base.

The asic type check is redundant. once physical_node_id
and segment_size are initialized, it should be count
on.
Signed-off-by: NHawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: NKevin Wang <kevin1.wang@amd.com>
Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>

3de60d96

openeuler / Kernel 接近 2 年 前同步成功

openeuler / Kernel
接近 2 年前同步成功