1. 14 10月, 2021 2 次提交
    • Y
      drm/amdkfd: fix KFDSVMRangeTest.PartialUnmapSysMemTest fails · 9c152f54
      Yifan Zhang 提交于
      [ RUN      ] KFDSVMRangeTest.PartialUnmapSysMemTest
      /home/yifan/brahma/libhsakmt/tests/kfdtest/src/KFDTestUtil.cpp:245: Failure
      Value of: (hsaKmtAllocMemory(m_Node, m_Size, m_Flags, &m_pBuf))
        Actual: 1
      Expected: HSAKMT_STATUS_SUCCESS
      Which is: 0
      /home/yifan/brahma/libhsakmt/tests/kfdtest/src/KFDTestUtil.cpp:248: Failure
      Value of: (hsaKmtMapMemoryToGPUNodes(m_pBuf, m_Size, __null, mapFlags, 1, &m_Node))
        Actual: 1
      Expected: HSAKMT_STATUS_SUCCESS
      Which is: 0
      /home/yifan/brahma/libhsakmt/tests/kfdtest/src/KFDTestUtil.cpp:306: Failure
      Expected: ((void *)__null) != (ptr), actual: NULL vs NULL
      Segmentation fault (core dumped)
      [          ] Profile: Full Test
      [          ] HW capabilities: 0x9
      
      kernel log:
      
      [  102.029150]  ret_from_fork+0x22/0x30
      [  102.029158] ---[ end trace 15c34e782714f9a3 ]---
      [ 3613.603598] amdgpu: Address: 0x7f7149ccc000 already allocated by SVM
      [ 3613.610620] show_signal_msg: 27 callbacks suppressed
      
      These is race with deferred actions from previous memory map
      changes (e.g. munmap).Flush pending deffered work to avoid such case.
      Signed-off-by: NYifan Zhang <yifan1.zhang@amd.com>
      Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
      Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
      9c152f54
    • A
      drm/amdkfd: avoid conflicting address mappings · 71cbfeb3
      Alex Sierra 提交于
      [Why]
      Avoid conflict with address ranges mapped by SVM
      mechanism that try to be allocated again through
      ioctl_alloc in the same process. And viceversa.
      
      [How]
      For ioctl_alloc_memory_of_gpu allocations
      Check if the address range passed into ioctl memory
      alloc does not exist already in the kfd_process
      svms->objects interval tree.
      
      For SVM allocations
      Look for the address range into the interval tree VA from
      the VM inside of each pdds used in a kfd_process.
      Signed-off-by: NAlex Sierra <alex.sierra@amd.com>
      Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
      Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
      71cbfeb3
  2. 03 8月, 2021 4 次提交
  3. 29 7月, 2021 3 次提交
  4. 13 7月, 2021 6 次提交
  5. 16 6月, 2021 1 次提交
    • F
      drm/amdkfd: Disable SVM per GPU, not per process · 5a75ea56
      Felix Kuehling 提交于
      When some GPUs don't support SVM, don't disabe it for the entire process.
      That would be inconsistent with the information the process got from the
      topology, which indicates SVM support per GPU.
      
      Instead disable SVM support only for the unsupported GPUs. This is done
      by checking any per-device attributes against the bitmap of supported
      GPUs. Also use the supported GPU bitmap to initialize access bitmaps for
      new SVM address ranges.
      
      Don't handle recoverable page faults from unsupported GPUs. (I don't
      think there will be unsupported GPUs that can generate recoverable page
      faults. But better safe than sorry.)
      Signed-off-by: NFelix Kuehling <Felix.Kuehling@amd.com>
      Reviewed-by: NPhilip Yang <philip.yang@amd.com>
      Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
      5a75ea56
  6. 12 6月, 2021 1 次提交
  7. 05 6月, 2021 3 次提交
  8. 21 4月, 2021 6 次提交
  9. 10 4月, 2021 1 次提交
  10. 06 3月, 2021 1 次提交
  11. 09 12月, 2020 2 次提交
  12. 23 9月, 2020 2 次提交
  13. 27 8月, 2020 1 次提交
    • H
      drm/amdkfd: implement the dGPU fallback path for apu (v6) · 6127896f
      Huang Rui 提交于
      We still have a few iommu issues which need to address, so force raven
      as "dgpu" path for the moment.
      
      This is to add the fallback path to bypass IOMMU if IOMMU v2 is disabled
      or ACPI CRAT table not correct.
      
      v2: Use ignore_crat parameter to decide whether it will go with IOMMUv2.
      v3: Align with existed thunk, don't change the way of raven, only renoir
          will use "dgpu" path by default.
      v4: don't update global ignore_crat in the driver, and revise fallback
          function if CRAT is broken.
      v5: refine acpi crat good but no iommu support case, and rename the
          title.
      v6: fix the issue of dGPU initialized firstly, just modify the report
          value in the node_show().
      Signed-off-by: NHuang Rui <ray.huang@amd.com>
      Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
      Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
      6127896f
  14. 16 7月, 2020 1 次提交
    • A
      drm/amdkfd: Provide SMI events watch · 938a0650
      Amber Lin 提交于
      When the compute is malfunctioning or performance drops, the system admin
      will use SMI (System Management Interface) tool to monitor/diagnostic what
      went wrong. This patch provides an event watch interface for the user
      space to register devices and subscribe events they are interested. After
      registered, the user can use annoymous file descriptor's poll function
      with wait-time specified and wait for events to happen. Once an event
      happens, the user can use read() to retrieve information related to the
      event.
      
      VM fault event is done in this patch.
      
      v2: - remove UNREGISTER and add event ENABLE/DISABLE
          - correct kfifo usage
          - move event message API to kfd_ioctl.h
      v3: send the event msg in text than in binary
      v4: support multiple clients
      v5: move events enablement from ioctl to fd write
      v6: sparse fix
      Signed-off-by: NAmber Lin <Amber.Lin@amd.com>
      Reviewed-by: NFelix Kuehling <Felix.Kuehling@amd.com>
      Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
      938a0650
  15. 01 5月, 2020 1 次提交
  16. 29 4月, 2020 3 次提交
  17. 29 2月, 2020 1 次提交
  18. 13 2月, 2020 1 次提交