1. 19 Aug, 2020 1 commit
    • drm/amdgpu: fix NULL pointer access issue when unloading driver · 1a68d96f
      Committed by Guchun Chen
      When the driver is unloaded with "modprobe -r amdgpu", a NULL pointer
      dereference occurs while the RAS debugfs entries are released. The
      cause is a duplicated debugfs_remove call: the drm debugfs_root dir
      has already been cleaned up by drm_minor_unregister.
      
      BUG: kernel NULL pointer dereference, address: 00000000000000a0
      PGD 0 P4D 0
      Oops: 0002 [#1] SMP PTI
      CPU: 11 PID: 1526 Comm: modprobe Tainted: G           OE     5.6.0-guchchen #1
      Hardware name: System manufacturer System Product Name/TUF Z370-PLUS GAMING II, BIOS 0411 09/21/2018
      RIP: 0010:down_write+0x15/0x40
      Code: eb de e8 7e 17 72 ff cc cc cc cc cc cc cc cc cc cc cc cc cc cc 0f 1f 44 00 00 53 48 89 fb e8 92
      d8 ff ff 31 c0 ba 01 00 00 00 <f0> 48 0f b1 13 75 0f 65 48 8b 04 25 c0 8b 01 00 48 89 43 08 5b c3
      RSP: 0018:ffffb1590386fcd0 EFLAGS: 00010246
      RAX: 0000000000000000 RBX: 00000000000000a0 RCX: 0000000000000000
      RDX: 0000000000000001 RSI: ffffffff85b2fcc2 RDI: 00000000000000a0
      RBP: ffffb1590386fd30 R08: ffffffff85b2fcc2 R09: 000000000002b3c0
      R10: ffff97a330618c40 R11: 00000000000005f6 R12: ffff97a3481beb40
      R13: 00000000000000a0 R14: ffff97a3481beb40 R15: 0000000000000000
      FS:  00007fb11a717540(0000) GS:ffff97a376cc0000(0000) knlGS:0000000000000000
      CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      CR2: 00000000000000a0 CR3: 00000004066d6006 CR4: 00000000003606e0
      DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
      Call Trace:
       simple_recursive_removal+0x63/0x370
       ? debugfs_remove+0x60/0x60
       debugfs_remove+0x40/0x60
       amdgpu_ras_fini+0x82/0x230 [amdgpu]
       ? __kernfs_remove.part.17+0x101/0x1f0
       ? kernfs_name_hash+0x12/0x80
       amdgpu_device_fini+0x1c0/0x580 [amdgpu]
       amdgpu_driver_unload_kms+0x3e/0x70 [amdgpu]
       amdgpu_pci_remove+0x36/0x60 [amdgpu]
       pci_device_remove+0x3b/0xb0
       device_release_driver_internal+0xe5/0x1c0
       driver_detach+0x46/0x90
       bus_remove_driver+0x58/0xd0
       pci_unregister_driver+0x29/0x90
       amdgpu_exit+0x11/0x25 [amdgpu]
       __x64_sys_delete_module+0x13d/0x210
       do_syscall_64+0x5f/0x250
       entry_SYSCALL_64_after_hwframe+0x44/0xa9
      Signed-off-by: Guchun Chen <guchun.chen@amd.com>
      Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
      Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
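The patch itself is not shown in this listing, so the following is only a minimal userspace sketch of the hazard the commit message describes: a parent directory is removed recursively by one layer, and a second layer later tries to remove a child it still holds a pointer to. All names (`toy_dir`, `toy_dev`, `toy_core_unregister`, `toy_ras_fini`) are hypothetical and are not kernel or amdgpu APIs. The sketch avoids the second removal by clearing the cached pointer when the parent goes away; the real fix may take a different approach.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical stand-in for a debugfs directory with at most one child. */
struct toy_dir {
    struct toy_dir *child;
};

/* Hypothetical device state: the core owns root, the driver caches ras_dir. */
struct toy_dev {
    struct toy_dir *root;     /* removed recursively by the core layer */
    struct toy_dir *ras_dir;  /* cached pointer to a child of root */
};

/* Create root and one child beneath it. Returns 0 on success, -1 on failure. */
static int toy_dev_init(struct toy_dev *d)
{
    d->root = calloc(1, sizeof(*d->root));
    if (!d->root)
        return -1;
    d->ras_dir = calloc(1, sizeof(*d->ras_dir));
    if (!d->ras_dir) {
        free(d->root);
        d->root = NULL;
        return -1;
    }
    d->root->child = d->ras_dir;
    return 0;
}

/* Core teardown: removes root and everything below it. */
static void toy_core_unregister(struct toy_dev *d)
{
    if (!d->root)
        return;
    free(d->root->child);
    free(d->root);
    d->root = NULL;
    /* Drop the stale cached pointer so later teardown cannot reuse it. */
    d->ras_dir = NULL;
}

/* Driver teardown: returns how many entries it removed itself (0 or 1). */
static int toy_ras_fini(struct toy_dev *d)
{
    if (!d->ras_dir)
        return 0;  /* already removed together with root */
    assert(d->root && d->root->child == d->ras_dir);
    free(d->ras_dir);
    d->root->child = NULL;
    d->ras_dir = NULL;
    return 1;
}
```

With this ordering guard, calling `toy_core_unregister` first and `toy_ras_fini` second is harmless, and the reverse order still frees each entry exactly once.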
  2. 07 Aug, 2020 1 commit
  3. 16 Jul, 2020 1 commit
  4. 01 Jul, 2020 1 commit
  5. 03 Jun, 2020 2 commits
  6. 29 May, 2020 1 commit
  7. 15 May, 2020 1 commit
  8. 06 May, 2020 1 commit
  9. 01 May, 2020 1 commit
  10. 23 Apr, 2020 2 commits
  11. 14 Apr, 2020 1 commit
  12. 09 Apr, 2020 1 commit
  13. 08 Apr, 2020 1 commit
  14. 02 Apr, 2020 2 commits
    • drm/amdgpu: fix non-pointer dereference for non-RAS supported · a9d82d2f
      Committed by Evan Quan
      Backtrace from the GPU recovery test on Navi10:
      
      [ 1324.516681] RIP: 0010:amdgpu_ras_set_error_query_ready+0x15/0x20 [amdgpu]
      [ 1324.523778] Code: 4c 89 f7 e8 cd a2 a0 d8 e9 99 fe ff ff 45 31 ff e9 91 fe ff ff 0f 1f 44 00 00 55 48 85 ff 48 89 e5 74 0e 48 8b 87 d8 2b 01 00 <40> 88 b0 38 01 00 00 5d c3 66 90 0f 1f 44 00 00 55 31 c0 48 85 ff
      [ 1324.543452] RSP: 0018:ffffaa1040e4bd28 EFLAGS: 00010286
      [ 1324.549025] RAX: 0000000000000000 RBX: ffff911198b20000 RCX: 0000000000000000
      [ 1324.556217] RDX: 00000000000c0a01 RSI: 0000000000000000 RDI: ffff911198b20000
      [ 1324.563514] RBP: ffffaa1040e4bd28 R08: 0000000000001000 R09: ffff91119d0028c0
      [ 1324.570804] R10: ffffffff9a606b40 R11: 0000000000000000 R12: 0000000000000000
      [ 1324.578413] R13: ffffaa1040e4bd70 R14: ffff911198b20000 R15: 0000000000000000
      [ 1324.586464] FS:  00007f4441cbf540(0000) GS:ffff91119ed80000(0000) knlGS:0000000000000000
      [ 1324.595434] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      [ 1324.601345] CR2: 0000000000000138 CR3: 00000003fcdf8004 CR4: 00000000003606e0
      [ 1324.608694] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      [ 1324.616303] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
      [ 1324.623678] Call Trace:
      [ 1324.626270]  amdgpu_device_gpu_recover+0x6e7/0xc50 [amdgpu]
      [ 1324.632018]  ? seq_printf+0x4e/0x70
      [ 1324.636652]  amdgpu_debugfs_gpu_recover+0x50/0x80 [amdgpu]
      [ 1324.643371]  seq_read+0xda/0x420
      [ 1324.647601]  full_proxy_read+0x5c/0x90
      [ 1324.652426]  __vfs_read+0x1b/0x40
      [ 1324.656734]  vfs_read+0x8e/0x130
      [ 1324.660981]  ksys_read+0xa7/0xe0
      [ 1324.665201]  __x64_sys_read+0x1a/0x20
      [ 1324.669907]  do_syscall_64+0x57/0x1c0
      [ 1324.674517]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
      [ 1324.680654] RIP: 0033:0x7f44417cf081
      Signed-off-by: Evan Quan <evan.quan@amd.com>
      Reviewed-by: John Clements <John.Clements@amd.com>
      Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    • drm/amdgpu: disable ras query and iject during gpu reset · 61380faa
      Committed by John Clements
      Add a flag to the RAS context to indicate whether the RAS query functionality is ready.
      Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
      Signed-off-by: John Clements <john.clements@amd.com>
      Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
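The two commits dated 02 Apr, 2020 fit together: one adds a readiness flag to the RAS context, and the other reports a crash in amdgpu_ras_set_error_query_ready on a device without RAS support. Neither diff appears in this listing, so what follows is only a userspace sketch of the general pattern, assuming the context pointer is NULL on unsupported hardware. The types and functions (`toy_ras`, `toy_adev`, `toy_set_error_query_ready`, `toy_error_query`) are hypothetical and do not mirror the driver's real definitions.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical RAS context holding the readiness flag. */
struct toy_ras {
    bool error_query_ready;
};

/* Hypothetical device; ras is assumed to be NULL when RAS is unsupported. */
struct toy_adev {
    struct toy_ras *ras;
};

/* Set the flag only when a context exists, so unsupported devices are a no-op. */
static void toy_set_error_query_ready(struct toy_adev *adev, bool ready)
{
    if (adev && adev->ras)
        adev->ras->error_query_ready = ready;
}

/* Report readiness; a missing device or context counts as "not ready". */
static bool toy_get_error_query_ready(const struct toy_adev *adev)
{
    return adev && adev->ras && adev->ras->error_query_ready;
}

/*
 * Illustrative query entry point: refuses to run while the flag is clear,
 * for example during a reset. Returns 0 on success, -1 when refused.
 */
static int toy_error_query(const struct toy_adev *adev, unsigned long *count)
{
    assert(count != NULL);
    if (!toy_get_error_query_ready(adev))
        return -1;
    *count = 0;  /* placeholder result; a real query would read hardware state */
    return 0;
}
```

A reset path in this sketch would call `toy_set_error_query_ready(adev, false)` before resetting and `toy_set_error_query_ready(adev, true)` afterwards; both calls are safe on a device whose `ras` pointer is NULL.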
  15. 20 Mar, 2020 1 commit
  16. 13 Mar, 2020 2 commits
  17. 11 Mar, 2020 2 commits
  18. 07 Mar, 2020 1 commit
  19. 27 Feb, 2020 1 commit
  20. 19 Feb, 2020 1 commit
  21. 23 Jan, 2020 1 commit
  22. 17 Jan, 2020 1 commit
  23. 14 Jan, 2020 1 commit
  24. 24 Dec, 2019 1 commit
  25. 19 Dec, 2019 2 commits
  26. 06 Dec, 2019 1 commit
  27. 19 Nov, 2019 1 commit
  28. 07 Nov, 2019 1 commit
  29. 30 Oct, 2019 1 commit
  30. 26 Oct, 2019 2 commits
  31. 16 Oct, 2019 1 commit
  32. 11 Oct, 2019 2 commits