Commits · 833fa075b87c36be437a941393d750c36022d902 · openeuler / Kernel

14 Jul, 2017 6 commits

drm/amd/powerplay: added new se_cac_idx r/w APIs v2 · 16abb5d2

Committed by Evan Quan on Jul 04, 2017

  - v2: added missing spinlock init
Signed-off-by: Evan Quan <evan.quan@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: add workaround for S3 issues on some vega10 boards · 47ed4e1c

Committed by Ken Wang on Jul 04, 2017

Certain MC registers need a delay after writing them to properly
update in the init sequence.
Signed-off-by: Ken Wang <Ken.Wang@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: unify some atombios/atomfirmware scratch reg functions · d05da0e2

Committed by Alex Deucher on Jun 30, 2017

Now that we use a pointer to the scratch reg start offset,
most of the functions were duplicated.
Acked-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Support passing amdgpu critical error to host via GPU Mailbox. · 89041940

Committed by Gavin Wan on Jun 23, 2017

This feature works in an SRIOV environment. In a non-SRIOV environment,
the trans_error function does nothing.

The error information includes error_code (16 bit), error_flags (16 bit)
and error_data (64 bit). Since there are not many errors, we keep the
errors in an array and transfer all errors to the host before the amdgpu
initialization function (amdgpu_device_init) exits.
Signed-off-by: Gavin Wan <Gavin.Wan@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
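
Editor's sketch: the record layout described in this message (a 16-bit error_code, a 16-bit error_flags and a 64-bit error_data, queued in a small array and handed over in one go) could look roughly like the user-space C below. The names `err_entry`, `err_queue`, `err_queue_add`, `err_queue_drain` and the capacity `ERR_QUEUE_MAX` are hypothetical and are not taken from the patch; the real driver writes to the GPU mailbox rather than to an output buffer.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define ERR_QUEUE_MAX 16 /* hypothetical capacity, not from the patch */

/* One queued error; the field widths follow the commit message. */
struct err_entry {
    uint16_t error_code;
    uint16_t error_flags;
    uint64_t error_data;
};

struct err_queue {
    struct err_entry entries[ERR_QUEUE_MAX];
    size_t count;
};

/* Queue one error. Returns 0 on success, -1 if the array is full. */
static int err_queue_add(struct err_queue *q, uint16_t code,
                         uint16_t flags, uint64_t data)
{
    if (q->count >= ERR_QUEUE_MAX)
        return -1;
    q->entries[q->count].error_code = code;
    q->entries[q->count].error_flags = flags;
    q->entries[q->count].error_data = data;
    q->count++;
    return 0;
}

/* Copy up to out_cap queued errors into out (a stand-in for the
 * transfer to the host) and remove them from the queue.
 * Returns the number of errors copied. */
static size_t err_queue_drain(struct err_queue *q,
                              struct err_entry *out, size_t out_cap)
{
    size_t n = q->count < out_cap ? q->count : out_cap;

    memcpy(out, q->entries, n * sizeof(q->entries[0]));
    memmove(q->entries, q->entries + n,
            (q->count - n) * sizeof(q->entries[0]));
    q->count -= n;
    return n;
}
```

In this sketch a caller would add errors as they occur during initialization and call `err_queue_drain` once, just before the init function returns.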

drm/amdgpu: remove *_mc_access from display funcs · e4f6b39e

Committed by Alex Deucher on Dec 08, 2016

These are no longer needed now that we use the fb_location
programmed by the vbios.
Acked-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: use kernel is_power_of_2 rather than local version · 76117507

Committed by Alex Deucher on Jun 21, 2017

Use the kernel provided version.
Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
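
Editor's sketch: the kernel helper referred to here is `is_power_of_2()` from `<linux/log2.h>`. A user-space equivalent of the usual bit trick is shown below, under the hypothetical name `is_power_of_2_sketch` so it is not mistaken for the kernel symbol.

```c
#include <stdbool.h>

/* A power of two has exactly one bit set, so clearing the lowest set
 * bit with n & (n - 1) must leave zero. Zero itself is excluded. */
static inline bool is_power_of_2_sketch(unsigned long n)
{
    return n != 0 && (n & (n - 1)) == 0;
}
```

With this definition 1 and 64 are accepted, while 0 and 96 are rejected.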

30 Jun, 2017 1 commit

drm/amdgpu: fix typo in amdgpu_debugfs_test_ib_init · 27bad5b9

Committed by Arnd Bergmann on Jun 21, 2017

The debugfs interface calls a function that was evidently
defined under the wrong name in some configurations:

drivers/gpu/drm/amd/amdgpu/amdgpu_device.c:64:12: error: 'amdgpu_debugfs_test_ib_ring_init' used but never defined [-Werror]
drivers/gpu/drm/amd/amdgpu/amdgpu_device.c:3803:12: error: 'amdgpu_debugfs_test_ib_init' defined but not used [-Werror=unused-function]

This fixes the function name.

Fixes: 4f0955fc ("drm/amdgpu: export test ib debugfs interface")
Signed-off-by: Arnd Bergmann <arnd@arndb.de>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

17 Jun, 2017 1 commit

drm/amdgpu: don't check the default value for vm size · 64dab074

Committed by Alex Deucher on Jun 15, 2017

Avoids printing spurious messages like this:
[    3.102059] amdgpu 0000:01:00.0: VM size (-1) must be a power of 2
Reviewed-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

15 Jun, 2017 3 commits

drm/amdgpu: fix missed gpu info firmware when cache firmware during S3 · ab4fe3e1

Committed by Huang Rui on Jun 05, 2017

The gpu_info firmware is released after its data is used. But when the system
enters suspend, the upper-level driver caches all firmware names, and at that
point gpu_info fails to load. This looks like an upper-level issue; we should
not release the gpu_info firmware until the device is finished.

[  903.236589] cache_firmware: amdgpu/vega10_sdma1.bin
[  903.236590] fw_set_page_data: fw-amdgpu/vega10_sdma1.bin buf=ffff88041eee10c0 data=ffffc90002561000 size=17408
[  903.236591] cache_firmware: amdgpu/vega10_sdma1.bin ret=0
[  903.464160] __allocate_fw_buf: fw-amdgpu/vega10_gpu_info.bin buf=ffff88041eee2c00
[  903.471815] (NULL device *): loading /lib/firmware/updates/4.11.0-custom/amdgpu/vega10_gpu_info.bin failed with error -2
[  903.482870] (NULL device *): loading /lib/firmware/updates/amdgpu/vega10_gpu_info.bin failed with error -2
[  903.492716] (NULL device *): loading /lib/firmware/4.11.0-custom/amdgpu/vega10_gpu_info.bin failed with error -2
[  903.503156] (NULL device *): direct-loading amdgpu/vega10_gpu_info.bin
Signed-off-by: Huang Rui <ray.huang@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: export test ib debugfs interface · 4f0955fc

Committed by Huang Rui on May 10, 2017

As suggested by Christian and David, submit the test IB ring debug interfaces.
They are useful for debugging command submission in the no-VM case.
Signed-off-by: Huang Rui <ray.huang@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: add new member in gpu_info fw · 51fd0370

Committed by Hawking Zhang on Jun 09, 2017

Signed-off-by: Hawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

09 Jun, 2017 1 commit

drm/amdgpu: move comment to the right place · 0fa49558

Committed by Alex Xie on Jun 08, 2017

Signed-off-by: Alex Xie <AlexBin.Xie@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

07 Jun, 2017 4 commits

drm/amdgpu: add ip block number prints · a0bae357

Committed by Huang Rui on May 03, 2017

Users can follow the printed IP block numbers when writing ip_block_mask
to select the blocks they would like to enable.
Signed-off-by: Huang Rui <ray.huang@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Acked-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: add ip name print for selecting ips with ip_block_mask · ed8cf00c

Committed by Huang Rui on May 03, 2017

Signed-off-by: Huang Rui <ray.huang@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Acked-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: remove mmhub ip · 1191d110

Committed by Huang Rui on May 31, 2017

Signed-off-by: Huang Rui <ray.huang@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Acked-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: remove gfxhub ip · 373f5923

Committed by Huang Rui on May 31, 2017

Signed-off-by: Huang Rui <ray.huang@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Acked-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

02 Jun, 2017 1 commit

drm/amdgpu: Move compute vm bug logic to amdgpu_vm.c · e59c0205

Committed by Alex Xie on Jun 01, 2017

  In review, Christian asked to keep the logic
  inside amdgpu_vm.c, at the cost of being slightly slower.
  The loop is still optimized out with this patch.

v2: remove the if statement. Now it is not slower.
Signed-off-by: Alex Xie <AlexBin.Xie@amd.com>
Reviewed-by: Christian König <christian.koeng@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

01 Jun, 2017 2 commits

drm/amdgpu: implement lru amdgpu_queue_mgr policy for compute v4 · 795f2813

Committed by Andres Rodriguez on Mar 06, 2017

Use an LRU policy to map usermode rings to HW compute queues.

Most compute clients use one queue, and usually the first queue
available. This results in poor pipe/queue work distribution when
multiple compute apps are running. In most cases pipe 0 queue 0 is
the only queue that gets used.

In order to better distribute work across multiple HW queues, we adopt
a policy to map the usermode ring ids to the LRU HW queue.

This fixes a large majority of multi-app compute workloads sharing the
same HW queue, even though 7 other queues are available.

v2: use ring->funcs->type instead of ring->hw_ip
v3: remove amdgpu_queue_mapper_funcs
v4: change ring_lru_list_lock to spinlock, grab only once in lru_get()
Signed-off-by: Andres Rodriguez <andresx7@gmail.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
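
Editor's sketch: the LRU idea in this message (hand each new usermode ring the HW queue that was used least recently) can be illustrated in user-space C as below. This is not the driver's implementation: the v4 note mentions a list protected by a spinlock (ring_lru_list_lock), while this sketch uses a timestamp array and no locking. The names `lru_queues`, `lru_get` and `NUM_HW_QUEUES`, and the assumption of 8 queues in total, are hypothetical.

```c
#include <stddef.h>
#include <stdint.h>

#define NUM_HW_QUEUES 8 /* assumption: one queue in use plus 7 others */

struct lru_queues {
    uint64_t last_used[NUM_HW_QUEUES]; /* 0 means never used */
    uint64_t clock;                    /* increases on every pick */
};

/* Return the index of the least recently used HW queue and mark it
 * as the most recently used one. Ties go to the lowest index. */
static size_t lru_get(struct lru_queues *q)
{
    size_t best = 0;

    for (size_t i = 1; i < NUM_HW_QUEUES; i++) {
        if (q->last_used[i] < q->last_used[best])
            best = i;
    }
    q->last_used[best] = ++q->clock;
    return best;
}
```

Starting from a zeroed `struct lru_queues`, eight consecutive calls hand out queues 0 through 7 once each, so eight single-queue clients no longer pile onto queue 0; the ninth call wraps around to queue 0.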

drm/amdgpu: optimize amdgpu driver load & resume time · 2dc80b00

Committed by Shirish S on May 25, 2017

amdgpu_device_resume() and amdgpu_device_init() make a
time-consuming, blocking call to amdgpu_late_init(), which sets
the clock gating state of all IP blocks.
This patch defers only the setting of the clock gating state
until after the amdgpu driver has resumed, ideally before
the UI comes up, or in some cases after the UI as well.

With this change the resume time of amdgpu_device comes down
from 1.299s to 0.199s which further helps in reducing the overall
system resume time.

V1: made the optimization applicable during driver load as well.

TEST:(For ChromiumOS on STONEY only)
* UI comes up
* amdgpu_late_init() call gets called consistently and no errors reported.
Signed-off-by: Shirish S <shirish.s@amd.com>
Reviewed-by: Huang Rui <ray.huang@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

25 May, 2017 15 commits

drm/amdgpu: properly byteswap gpu_info firmware · b5ab16bf

Committed by Alex Deucher on May 11, 2017

It's stored in LE format.
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
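
Editor's sketch: in the kernel, little-endian fields are normally converted with helpers such as `le32_to_cpu()`. The user-space function below, with the hypothetical name `read_le32`, shows the same idea in a way that works on both little-endian and big-endian hosts: the bytes are assembled by position rather than by casting the buffer to an integer.

```c
#include <stdint.h>

/* Assemble a 32-bit value from 4 bytes stored least significant
 * byte first. The result does not depend on host byte order. */
static uint32_t read_le32(const uint8_t *p)
{
    return (uint32_t)p[0] |
           ((uint32_t)p[1] << 8) |
           ((uint32_t)p[2] << 16) |
           ((uint32_t)p[3] << 24);
}
```

For the byte sequence 0x78 0x56 0x34 0x12 this yields 0x12345678 on any host.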

drm/amdgpu: return -ENODEV to user space when vram is lost v2 · f1892138

Committed by Chunming Zhou on May 15, 2017

The following ioctls will return -ENODEV:
amdgpu_cs_ioctl
amdgpu_cs_wait_ioctl
amdgpu_cs_wait_fences_ioctl
amdgpu_gem_va_ioctl
amdgpu_info_ioctl

v2: only for map and replace cases in amdgpu_gem_va_ioctl
Signed-off-by: Chunming Zhou <David1.Zhou@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: check if vram is lost v2 · 0c49e0b8

Committed by Chunming Zhou on May 15, 2017

Back up the first 64 bytes of the GART table as a reset magic, and check
whether the magic is unchanged after a GPU HW reset.
v2: use memcmp instead of a manual comparison.
Signed-off-by: Chunming Zhou <David1.Zhou@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
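
Editor's sketch: the check described in this message (save the first 64 bytes of the GART table, then compare them after a reset with memcmp) can be illustrated in user-space C as follows. The names `reset_magic`, `save_reset_magic`, `vram_lost` and `RESET_MAGIC_NUM` are hypothetical, and a plain byte buffer stands in for the GART table.

```c
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

#define RESET_MAGIC_NUM 64 /* bytes saved, per the commit message */

struct reset_magic {
    uint8_t bytes[RESET_MAGIC_NUM];
};

/* Snapshot the first 64 bytes of the table before a reset. */
static void save_reset_magic(struct reset_magic *m,
                             const uint8_t *gart_table)
{
    memcpy(m->bytes, gart_table, RESET_MAGIC_NUM);
}

/* After a reset: if the first 64 bytes differ from the snapshot,
 * treat the VRAM contents as lost. */
static bool vram_lost(const struct reset_magic *m,
                      const uint8_t *gart_table)
{
    return memcmp(m->bytes, gart_table, RESET_MAGIC_NUM) != 0;
}
```

Only the first 64 bytes take part in the comparison, so a change further into the buffer is not detected by this sketch.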

drm/amdgpu: add raven gpu_info support · 2d2e5e7e

Committed by Alex Deucher on May 09, 2017

Add support for parsing the gpu info table on raven.
This is required to get the gpu config data for raven.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Tom St Denis <tom.stdenis@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: add RAVEN family id definition · 2ca8a5d2

Committed by Chunming Zhou on Dec 07, 2016

RAVEN is a new APU.
Signed-off-by: Chunming Zhou <David1.Zhou@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu:use job's list instead of check fence · 4f059ecd

Committed by Monk Liu on May 11, 2017

If the fence has really signaled, it may already have been
released, which leaves the fence pointer dangling. Using
job->base.node is safe because the job will not be released
until amdgpu_job_timedout has finished.
Signed-off-by: Monk Liu <Monk.Liu@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu/SRIOV:implement guilty job TDR for(V2) · 65781c78

Committed by Monk Liu on May 11, 2017

1. TDR kicks out a guilty job if its hangs exceed the threshold
given by the kernel parameter "job_hang_limit", so that a bad
command stream cannot cause GPU hangs indefinitely.

By default this threshold is 1, so a job is kicked out
after it hangs.

2. If a job times out, the TDR routine does not reset every sched/ring;
instead it resets only the given one, which is indicated
by @job of amdgpu_sriov_gpu_reset. That way we don't need to
reset and recover each sched/ring if we already know which job
caused the GPU hang.

3. Unblock sriov_gpu_reset for the AI family.

V2:
1. Kick out the guilty job after the scheduler is parked.
2. Since parking the scheduler before the kickout already takes a
while, we can do a last check on the job in question before
doing hw_reset.

TODO:
1. When a job is considered guilty, we should set a flag
in its fence status and let the UMD side know that this
fence was signaled because the job hung, not because it completed.

2. If a GPU reset causes all video memory to be lost, we need to
introduce a new policy to implement TDR, such as dropping all jobs
not yet signaled and making all IOCTLs on this device return ERROR
DEVICE_LOST.
This will be implemented later.
Signed-off-by: Monk Liu <Monk.Liu@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
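
Editor's sketch: point 1 of this message (count how often a job hangs and kick it out once a limit is reached) can be modeled in user-space C as below. The struct, the function name `job_record_hang` and the exact comparison are assumptions: the sketch uses "count reaches the limit" because the message says that with the default limit of 1 a job is kicked out after it hangs, and the real driver may compare differently.

```c
#include <stdbool.h>

/* Hypothetical stand-in for the per-job state tracked by TDR. */
struct job {
    unsigned int hang_count; /* how many times this job has hung */
    bool guilty;             /* set once the job should be kicked out */
};

/* Record one hang for the job. Returns true if the job has now
 * reached hang_limit and should be dropped instead of resubmitted. */
static bool job_record_hang(struct job *job, unsigned int hang_limit)
{
    job->hang_count++;
    if (job->hang_count >= hang_limit)
        job->guilty = true;
    return job->guilty;
}
```

With a limit of 1 the first hang marks the job guilty; with a limit of 2 the job gets one more chance before it is dropped.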

drm/amdgpu:use job* to replace voluntary · 7225f873

Committed by Monk Liu on Apr 26, 2017

That way we know which job caused the hang and
can do per-sched reset/recovery instead of resetting
all scheds.
Signed-off-by: Monk Liu <Monk.Liu@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu:don't invoke srio-gpu-reset in gpu-reset (v2) · 4fbf87e2

Committed by Monk Liu on May 05, 2017

We don't want to do sriov-gpu-reset in certain
cases, so split those two functions and don't invoke
the SR-IOV one from the bare-metal one.

V2:
remove debugfs_gpu_reset routine on SRIOV case.
Signed-off-by: Monk Liu <Monk.Liu@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: print when gpu reset successed · 6643be65

Committed by Chunming Zhou on May 05, 2017

Signed-off-by: Chunming Zhou <David1.Zhou@amd.com>
Reviewed-by: Roger.He <Hongbo.He@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: fix ring0 failed on pro card · fcf0649f

Committed by Chunming Zhou on May 05, 2017

The root cause is that VRAM content is lost completely after a PCI reset.
Signed-off-by: Chunming Zhou <David1.Zhou@amd.com>
Reviewed-by: Roger.He <Hongbo.He@amd.com>
Acked-by: Christian König <christian.koenig@amd.com>
Acked-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Fix comments in source code · 455a7bc2

Committed by Alex Xie on May 08, 2017

Signed-off-by: Alex Xie <AlexBin.Xie@amd.com>
Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: fix errors in comments. · ea81a173

Committed by Alex Xie on May 08, 2017

Signed-off-by: Alex Xie <AlexBin.Xie@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu:re-write sriov_reinit_early/late (v2) · 2cb681b6

Committed by Monk Liu on Apr 26, 2017

1. This makes those routines compatible with the sequence
requirements of both Tonga and Vega10.
2. Skip PSP hw init when doing TDR, because for an SR-IOV device
the ucode is not lost after VF FLR, so there is no need to have
PSP reload the ucode again.

v2: squash in ARRAY_SIZE fix
Signed-off-by: Monk Liu <Monk.Liu@amd.com>
Reviewed-by: Xiangliang Yu <Xiangliang.Yu@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: parse the gpu_info firmware (v4) · e2a75f88

Committed by Alex Deucher on Apr 27, 2017

And populate the gfx structures from it.

v2: update the structures updated by the table
v3: rework based on new table structure
v4: simplify things
Reviewed-by: Junwei Zhang <Jerry.Zhang@amd.com>
Tested-by: Junwei Zhang <Jerry.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

06 May, 2017 2 commits

drm/amdgpu: fix mutex list null pointer reference · db2c2a97

Committed by Pixel Ding on Apr 25, 2017

Fix NULL pointer reference.
Signed-off-by: Pixel Ding <Pixel.Ding@amd.com>
Signed-off-by: Xiangliang Yu <Xiangliang.Yu@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu:fix waiting on dirty fence · 236763d3

Committed by Monk Liu on May 01, 2017

If bo->shadow is NULL (a race: the BO shadow was just released
and gpu-reset kicked in, but the BO hasn't been released yet),
recover_vram_from_shadow won't set @next, so the following "fence=next"
will wrongly use a fence pointer which may already be dirty.
Fix it by setting next to NULL prior to recover_vram_from_shadow.
Signed-off-by: Monk Liu <Monk.Liu@amd.com>
Reviewed-by: Chunming Zhou <david1.zhou@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

29 Apr, 2017 4 commits

drm/amdgpu: validate shadow before restoring from it · 82521316

Committed by Roger.He on Apr 21, 2017

Reviewed-by: Chunming Zhou <david1.zhou@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Roger.He <Hongbo.He@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Fix use of interruptible waiting · 1d284797

Committed by Alex Xie on Apr 24, 2017

1. The signal interrupt can affect the expected behaviour.
2. There is no good mechanism to handle the corresponding error.
Signed-off-by: Alex Xie <AlexBin.Xie@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Fix use of interruptible waiting · 7a6901d7

Committed by Alex Xie on Apr 24, 2017

1. The signal interrupt can affect the expected behaviour.
2. There is no mechanism to handle the corresponding error.
Signed-off-by: Alex Xie <AlexBin.Xie@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

drm/amdgpu: Fix use of interruptible waiting · 8ab25b4f

Committed by Alex Xie on Apr 24, 2017

If the amdgpu_bo_reserve function is interrupted by a signal,
the amdgpu_bo_kunmap function is not called.
Signed-off-by: Alex Xie <AlexBin.Xie@amd.com>
Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

openeuler / Kernel: synced successfully more than 1 year ago