1. 26 Jun, 2022 1 commit
  2. 05 Jun, 2022 1 commit
  3. 27 Mar, 2022 1 commit
  4. 03 Mar, 2022 1 commit
  5. 20 Feb, 2022 1 commit
  6. 09 Feb, 2022 1 commit
  7. 05 Jan, 2022 1 commit
  8. 27 Dec, 2021 1 commit
  9. 20 Dec, 2021 1 commit
  10. 17 Dec, 2021 1 commit
  11. 08 Dec, 2021 1 commit
    • Fix CUDAGraphAllocator bug for StreamSafeCUDAAllocator (#37821) · b4a67491
      Committed by From00
      * Fix CUDAGraph bug for StreamSafeCUDAAllocator
      
      * Add CUDAGraphAllocator check in multi-stream interface
      
      * Set FLAGS_use_stream_safe_cuda_allocator to default to false
      
      * Fix environment error for cmake
      
      * Fix cmake error
      
      * Add UT of GetAllocatorInterfaceTest
      
      * Add UT of CUDAGraphExceptionTest
      
      * Enhance CUDAGraphExceptionTest
  12. 03 Dec, 2021 1 commit
  13. 25 Nov, 2021 1 commit
    • Support multi-stream allocation for CUDA place (#37290) · b9c464c3
      Committed by From00
      * Support multi-stream allocation for CUDA place
      
      * Do not notify retries from other streams when freeing a CUDA allocation
      
      * Fix compile error for CPU
      
      * Fix compile error for HIP
      
      * Release memory for StreamSafeCUDAAllocaRetry in malloc_test
      
      * Add FLAGS_use_stream_safe_cuda_allocator
      
      * Fix CI error for 'set_tests_properties'
      
      * Disable the stream-safe CUDA allocator for the naive_best_fit and thread_local strategies
      
      * Performance improvement: insert the allocation pair into outstanding_events_map on free rather than on alloc; replace recursive_mutex with SpinLock
      
      * Change FLAGS priority: FLAGS_use_system_allocator > FLAGS_use_stream_safe_cuda_allocator
      
      * Performance improvement: delete the allocation directly when recorded_streams is empty in FreeImpl of StreamSafeCUDAAllocator
      
      * Add UT for alloc interface
      
      * Change the multi-stream interface; move retry code from AllocatorFacadePrivate to StreamSafeCUDAAllocator