1. 14 1月, 2022 3 次提交
  2. 19 10月, 2021 1 次提交
  3. 15 10月, 2021 2 次提交
  4. 15 6月, 2021 1 次提交
    • L
      drm/amd/amdgpu: fix a potential deadlock in gpu reset · b65aa179
      Lang Yu 提交于
      stable inclusion
      from stable-5.10.42
      commit 4951dd498d483fa961c92541b55ffb32db7f2dbf
      bugzilla: 55093
      CVE: NA
      
      --------------------------------
      
      [ Upstream commit 9c2876d5 ]
      
      When amdgpu_ib_ring_tests failed, the reset logic called
      amdgpu_device_ip_suspend twice, then deadlock occurred.
      Deadlock log:
      
      [  805.655192] amdgpu 0000:04:00.0: amdgpu: ib ring test failed (-110).
      [  806.290952] [drm] free PSP TMR buffer
      
      [  806.319406] ============================================
      [  806.320315] WARNING: possible recursive locking detected
      [  806.321225] 5.11.0-custom #1 Tainted: G        W  OEL
      [  806.322135] --------------------------------------------
      [  806.323043] cat/2593 is trying to acquire lock:
      [  806.323825] ffff888136b1cdc8 (&adev->dm.dc_lock){+.+.}-{3:3}, at: dm_suspend+0xb8/0x1d0 [amdgpu]
      [  806.325668]
                     but task is already holding lock:
      [  806.326664] ffff888136b1cdc8 (&adev->dm.dc_lock){+.+.}-{3:3}, at: dm_suspend+0xb8/0x1d0 [amdgpu]
      [  806.328430]
                     other info that might help us debug this:
      [  806.329539]  Possible unsafe locking scenario:
      
      [  806.330549]        CPU0
      [  806.330983]        ----
      [  806.331416]   lock(&adev->dm.dc_lock);
      [  806.332086]   lock(&adev->dm.dc_lock);
      [  806.332738]
                      *** DEADLOCK ***
      
      [  806.333747]  May be due to missing lock nesting notation
      
      [  806.334899] 3 locks held by cat/2593:
      [  806.335537]  #0: ffff888100d3f1b8 (&attr->mutex){+.+.}-{3:3}, at: simple_attr_read+0x4e/0x110
      [  806.337009]  #1: ffff888136b1fd78 (&adev->reset_sem){++++}-{3:3}, at: amdgpu_device_lock_adev+0x42/0x94 [amdgpu]
      [  806.339018]  #2: ffff888136b1cdc8 (&adev->dm.dc_lock){+.+.}-{3:3}, at: dm_suspend+0xb8/0x1d0 [amdgpu]
      [  806.340869]
                     stack backtrace:
      [  806.341621] CPU: 6 PID: 2593 Comm: cat Tainted: G        W  OEL    5.11.0-custom #1
      [  806.342921] Hardware name: AMD Celadon-CZN/Celadon-CZN, BIOS WLD0C23N_Weekly_20_12_2 12/23/2020
      [  806.344413] Call Trace:
      [  806.344849]  dump_stack+0x93/0xbd
      [  806.345435]  __lock_acquire.cold+0x18a/0x2cf
      [  806.346179]  lock_acquire+0xca/0x390
      [  806.346807]  ? dm_suspend+0xb8/0x1d0 [amdgpu]
      [  806.347813]  __mutex_lock+0x9b/0x930
      [  806.348454]  ? dm_suspend+0xb8/0x1d0 [amdgpu]
      [  806.349434]  ? amdgpu_device_indirect_rreg+0x58/0x70 [amdgpu]
      [  806.350581]  ? _raw_spin_unlock_irqrestore+0x47/0x50
      [  806.351437]  ? dm_suspend+0xb8/0x1d0 [amdgpu]
      [  806.352437]  ? rcu_read_lock_sched_held+0x4f/0x80
      [  806.353252]  ? rcu_read_lock_sched_held+0x4f/0x80
      [  806.354064]  mutex_lock_nested+0x1b/0x20
      [  806.354747]  ? mutex_lock_nested+0x1b/0x20
      [  806.355457]  dm_suspend+0xb8/0x1d0 [amdgpu]
      [  806.356427]  ? soc15_common_set_clockgating_state+0x17d/0x19 [amdgpu]
      [  806.357736]  amdgpu_device_ip_suspend_phase1+0x78/0xd0 [amdgpu]
      [  806.360394]  amdgpu_device_ip_suspend+0x21/0x70 [amdgpu]
      [  806.362926]  amdgpu_device_pre_asic_reset+0xb3/0x270 [amdgpu]
      [  806.365560]  amdgpu_device_gpu_recover.cold+0x679/0x8eb [amdgpu]
      Signed-off-by: NLang Yu <Lang.Yu@amd.com>
      Acked-by: NChristian KÃnig <christian.koenig@amd.com>
      Reviewed-by: NAndrey Grodzovsky <andrey.grodzovsky@amd.com>
      Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
      Signed-off-by: NSasha Levin <sashal@kernel.org>
      Signed-off-by: NChen Jun <chenjun102@huawei.com>
      Acked-by: NWeilong Chen <chenweilong@huawei.com>
      Signed-off-by: NZheng Zengkai <zhengzengkai@huawei.com>
      b65aa179
  5. 03 6月, 2021 1 次提交
  6. 08 2月, 2021 1 次提交
  7. 28 1月, 2021 2 次提交
  8. 25 11月, 2020 1 次提交
  9. 04 11月, 2020 1 次提交
  10. 29 10月, 2020 2 次提交
    • M
      amdgpu: fix a few kernel-doc markup issues · b28d70c6
      Mauro Carvalho Chehab 提交于
      A kernel-doc markup can't be mixed with a random comment,
      as it causes parsing problems.
      
      While here, change an invalid kernel-doc markup into
      a common comment.
      Signed-off-by: NMauro Carvalho Chehab <mchehab+huawei@kernel.org>
      Link: https://lore.kernel.org/r/e899f50404e94ac9a7c3267dd34f951c1a44fb2b.1603791716.git.mchehab+huawei@kernel.orgSigned-off-by: NJonathan Corbet <corbet@lwn.net>
      b28d70c6
    • M
      drm: amdgpu: kernel-doc: update some adev parameters · ca766ff0
      Mauro Carvalho Chehab 提交于
      Running "make htmldocs: produce lots of warnings on those files:
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c:177: warning: Excess function parameter 'man' description in 'amdgpu_vram_mgr_init'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c:177: warning: Excess function parameter 'p_size' description in 'amdgpu_vram_mgr_init'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c:211: warning: Excess function parameter 'man' description in 'amdgpu_vram_mgr_fini'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c:177: warning: Excess function parameter 'man' description in 'amdgpu_vram_mgr_init'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c:177: warning: Excess function parameter 'p_size' description in 'amdgpu_vram_mgr_init'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c:211: warning: Excess function parameter 'man' description in 'amdgpu_vram_mgr_fini'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c:177: warning: Excess function parameter 'man' description in 'amdgpu_vram_mgr_init'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c:177: warning: Excess function parameter 'p_size' description in 'amdgpu_vram_mgr_init'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c:211: warning: Excess function parameter 'man' description in 'amdgpu_vram_mgr_fini'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c:177: warning: Excess function parameter 'man' description in 'amdgpu_vram_mgr_init'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c:177: warning: Excess function parameter 'p_size' description in 'amdgpu_vram_mgr_init'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c:211: warning: Excess function parameter 'man' description in 'amdgpu_vram_mgr_fini'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_gtt_mgr.c:90: warning: Excess function parameter 'man' description in 'amdgpu_gtt_mgr_init'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_gtt_mgr.c:90: warning: Excess function parameter 'p_size' description in 'amdgpu_gtt_mgr_init'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_gtt_mgr.c:134: warning: Excess function parameter 'man' description in 'amdgpu_gtt_mgr_fini'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_gtt_mgr.c:90: warning: Excess function parameter 'man' description in 'amdgpu_gtt_mgr_init'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_gtt_mgr.c:90: warning: Excess function parameter 'p_size' description in 'amdgpu_gtt_mgr_init'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_gtt_mgr.c:134: warning: Excess function parameter 'man' description in 'amdgpu_gtt_mgr_fini'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_device.c:675: warning: Excess function parameter 'dev' description in 'amdgpu_device_asic_init'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_device.c:675: warning: Excess function parameter 'dev' description in 'amdgpu_device_asic_init'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_device.c:675: warning: Excess function parameter 'dev' description in 'amdgpu_device_asic_init'
      	./drivers/gpu/drm/amd/amdgpu/amdgpu_device.c:675: warning: Excess function parameter 'dev' description in 'amdgpu_device_asic_init'
      
      They're related to the repacement of some parameters by adev,
      and due to a few renamed parameters.
      
      While here, uniform the name of the parameter for it to be
      the same on all functions using a pointer to struct amdgpu_device.
      
      Update the kernel-doc documentation accordingly.
      Signed-off-by: NMauro Carvalho Chehab <mchehab+huawei@kernel.org>
      Link: https://lore.kernel.org/r/5755c2b361890b8ae5cea0f61dfd70b1c135eefe.1603791716.git.mchehab+huawei@kernel.orgSigned-off-by: NJonathan Corbet <corbet@lwn.net>
      ca766ff0
  11. 22 10月, 2020 1 次提交
  12. 10 10月, 2020 1 次提交
  13. 01 10月, 2020 2 次提交
  14. 30 9月, 2020 1 次提交
  15. 26 9月, 2020 4 次提交
  16. 18 9月, 2020 2 次提交
  17. 16 9月, 2020 8 次提交
  18. 04 9月, 2020 2 次提交
  19. 29 8月, 2020 1 次提交
    • N
      drm/amdgpu: fix compiler warnings · e230ac11
      Nirmoy Das 提交于
      Fixes below compiler warnings:
       CC [M]  drivers/gpu/drm/amd/amdgpu/amdgpu_device.o
      drivers/gpu/drm/amd/amdgpu/amdgpu_device.c:381:1: warning: ‘static’ is not at beginning of declaration [-Wold-style-declaration]
        381 | void static inline amdgpu_mm_wreg_mmio(struct amdgpu_device *adev, uint32_t reg, uint32_t v, uint32_t acc_flags)
            | ^~~~
      drivers/gpu/drm/amd/amdgpu/amdgpu_device.c:381:1: warning: ‘inline’ is not at beginning of declaration [-Wold-style-declaration]
      drivers/gpu/drm/amd/amdgpu/amdgpu_device.c: In function ‘amdgpu_device_fini’:
      drivers/gpu/drm/amd/amdgpu/amdgpu_device.c:3381:6: warning: variable ‘r’ set but not used [-Wunused-but-set-variable]
       3381 |  int r;
            |      ^
      Signed-off-by: NNirmoy Das <nirmoy.das@amd.com>
      Reviewed-by: NAlex Deucher <alexander.deucher@amd.com>
      Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
      e230ac11
  20. 27 8月, 2020 3 次提交