1. 20 10月, 2017 6 次提交
  2. 10 10月, 2017 2 次提交
  3. 07 10月, 2017 1 次提交
  4. 27 9月, 2017 1 次提交
  5. 14 9月, 2017 1 次提交
  6. 13 9月, 2017 11 次提交
  7. 01 9月, 2017 1 次提交
  8. 30 8月, 2017 2 次提交
  9. 29 8月, 2017 1 次提交
  10. 25 8月, 2017 1 次提交
  11. 18 8月, 2017 5 次提交
  12. 16 8月, 2017 3 次提交
  13. 14 7月, 2017 1 次提交
    • J
      drm/amdgpu: Throttle visible VRAM moves separately · 00f06b24
      John Brooks 提交于
      The BO move throttling code is designed to allow VRAM to fill quickly if it
      is relatively empty. However, this does not take into account situations
      where the visible VRAM is smaller than total VRAM, and total VRAM may not
      be close to full but the visible VRAM segment is under pressure. In such
      situations, visible VRAM would experience unrestricted swapping and
      performance would drop.
      
      Add a separate counter specifically for moves involving visible VRAM, and
      check it before moving BOs there.
      
      v2: Only perform calculations for separate counter if visible VRAM is
          smaller than total VRAM. (Michel Dänzer)
      v3: [Michel Dänzer]
      * Use BO's location rather than the AMDGPU_GEM_CREATE_CPU_ACCESS_REQUIRED
        flag to determine whether to account a move for visible VRAM in most
        cases.
      * Use a single
      
      	if (adev->mc.visible_vram_size < adev->mc.real_vram_size) {
      
        block in amdgpu_cs_get_threshold_for_moves.
      
      Fixes: 95844d20 (drm/amdgpu: throttle buffer migrations at CS using a fixed MBps limit (v2))
      Signed-off-by: NJohn Brooks <john@fastquake.com>
      Reviewed-by: NChristian König <christian.koenig@amd.com>
      Signed-off-by: NMichel Dänzer <michel.daenzer@amd.com>
      Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
      00f06b24
  14. 06 7月, 2017 1 次提交
  15. 30 6月, 2017 2 次提交
  16. 17 6月, 2017 1 次提交
    • D
      amdgpu: use drm sync objects for shared semaphores (v6) · 660e8558
      Dave Airlie 提交于
      This creates a new command submission chunk for amdgpu
      to add in and out sync objects around the submission.
      
      Sync objects are managed via the drm syncobj ioctls.
      
      The command submission interface is enhanced with two new
      chunks, one for syncobj pre submission dependencies,
      and one for post submission sync obj signalling,
      and just takes a list of handles for each.
      
      This is based on work originally done by David Zhou at AMD,
      with input from Christian Konig on what things should look like.
      
      In theory VkFences could be backed with sync objects and
      just get passed into the cs as syncobj handles as well.
      
      NOTE: this interface addition needs a version bump to expose
      it to userspace.
      
      TODO: update to dep_sync when rebasing onto amdgpu master.
      (with this - r-b from Christian)
      
      v1.1: keep file reference on import.
      v2: move to using syncobjs
      v2.1: change some APIs to just use p pointer.
      v3: make more robust against CS failures, we now add the
      wait sems but only remove them once the CS job has been
      submitted.
      v4: rewrite names of API and base on new syncobj code.
      v5: move post deps earlier, rename some apis
      v6: lookup post deps earlier, and just replace fences
      in post deps stage (Christian)
      Reviewed-by: NChristian König <christian.koenig@amd.com>
      Signed-off-by: NDave Airlie <airlied@redhat.com>
      Signed-off-by: NAlex Deucher <alexander.deucher@amd.com>
      660e8558