1. 17 Dec, 2021 1 commit
  2. 25 Nov, 2021 1 commit
    • F
      Support multi-stream allocation for CUDA place (#37290) · b9c464c3
      Committed by From00
      * Support multi-stream allocation for CUDA place
      
      * Do not notify retries from other streams when freeing a CUDA allocation
      
      * Fix compile error for CPU
      
      * Fix compile error for HIP
      
      * Release memory for StreamSafeCUDAAllocaRetry in malloc_test
      
      * Add FLAGS_use_stream_safe_cuda_allocator
      
      * Fix CI error for 'set_tests_properties'
      
      * Disable the stream-safe CUDA allocator for the naive_best_fit and thread_local strategies
      
      * Performance improvement: insert the allocation pair into outstanding_events_map on free rather than on alloc; replace recursive_mutex with SpinLock
      
      * FLAGS priority changes: FLAGS_use_system_allocator > FLAGS_use_stream_safe_cuda_allocator
      
      * Performance improvement: delete the allocation directly in FreeImpl of StreamSafeCUDAAllocator when recorded_streams is empty
      
      * Add UT for alloc interface
      
      * Change multi-stream interface; move retry code from AllocatorFacadePrivate to StreamSafeCUDAAllocator
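The deferred-free behaviour described in commit b9c464c3 can be illustrated with a small host-only sketch. This is not Paddle's StreamSafeCUDAAllocator: `FakeStream`, `Allocation`, and `StreamSafeAllocatorSketch` are hypothetical names, a stream is modelled as a counter of pending work instead of real CUDA streams and events, and locking is left out. It only shows the two paths the commit message mentions: an allocation with no recorded streams is deleted immediately, while one that other streams have used is parked until those streams finish.

```cpp
#include <cassert>
#include <cstddef>
#include <list>
#include <set>

// Hypothetical stand-in for a CUDA stream: counts work items still in flight.
struct FakeStream {
  int pending = 0;
  bool Idle() const { return pending == 0; }
};

// An allocation remembers every additional stream that has used it.
struct Allocation {
  explicit Allocation(std::size_t n) : size(n) {}
  std::size_t size;
  std::set<FakeStream*> recorded_streams;
};

// Assumption-laden sketch; single-threaded, so no SpinLock is modelled.
class StreamSafeAllocatorSketch {
 public:
  ~StreamSafeAllocatorSketch() {
    for (Allocation* a : unfreed_) delete a;
  }

  Allocation* Alloc(std::size_t n) {
    ProcessUnfreed();  // opportunistically reclaim deferred allocations
    ++live_;
    return new Allocation(n);
  }

  void RecordStream(Allocation* a, FakeStream* s) {
    a->recorded_streams.insert(s);
  }

  void Free(Allocation* a) {
    if (a->recorded_streams.empty()) {
      Release(a);  // fast path: no other stream ever touched it
    } else {
      unfreed_.push_back(a);  // defer until the recorded streams are idle
    }
  }

  // Release every deferred allocation whose recorded streams have finished.
  void ProcessUnfreed() {
    for (auto it = unfreed_.begin(); it != unfreed_.end();) {
      if (AllIdle(**it)) {
        Release(*it);
        it = unfreed_.erase(it);
      } else {
        ++it;
      }
    }
  }

  int live() const { return live_; }
  std::size_t deferred() const { return unfreed_.size(); }

 private:
  static bool AllIdle(const Allocation& a) {
    for (const FakeStream* s : a.recorded_streams) {
      if (!s->Idle()) return false;
    }
    return true;
  }

  void Release(Allocation* a) {
    assert(live_ > 0);
    delete a;
    --live_;
  }

  std::list<Allocation*> unfreed_;  // freed by the owner, still in use elsewhere
  int live_ = 0;                    // allocations not yet released
};
```

In this sketch, freeing an allocation that was passed to `RecordStream` leaves `live()` unchanged and raises `deferred()`; a later `ProcessUnfreed()` (or the next `Alloc`) releases it once the recorded stream's `pending` count is zero.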
  3. 04 Feb, 2021 1 commit
  4. 06 Nov, 2020 1 commit
  5. 04 Nov, 2020 1 commit
  6. 24 Sep, 2019 1 commit
  7. 11 Sep, 2019 1 commit
    • H
      Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320
      Committed by Huihuang Zheng
      TemporaryAllocator is a singleton used to allocate memory for cuDNN. Because it is a singleton, removing it improves memory performance.
      
      We replace TemporaryAllocator with CUDADeviceContextAllocator and CUDADeviceContextAllocation, which use a stream callback to free the memory allocated for a stream, so no singleton is needed.
      
      Also add data_feed_proto to operator to fix CI for CPU compilation
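The stream-callback idea in commit 12542320 can be sketched on the host without CUDA. Everything here is a hypothetical illustration, not Paddle's code: `CallbackStream` stands in for a CUDA stream as a FIFO of host callbacks, `ContextAllocationSketch` stands in for CUDADeviceContextAllocation, and `std::malloc`/`std::free` replace device memory calls. The point is that the allocation's destructor does not free memory itself; it enqueues the free on the stream, so the memory stays valid until earlier work on that stream has run.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <functional>
#include <queue>
#include <utility>

// Hypothetical stand-in for a CUDA stream: a FIFO of host callbacks.
class CallbackStream {
 public:
  void AddCallback(std::function<void()> cb) { callbacks_.push(std::move(cb)); }

  // Run all queued callbacks in order, like waiting for the stream to drain.
  void Synchronize() {
    while (!callbacks_.empty()) {
      std::function<void()> cb = std::move(callbacks_.front());
      callbacks_.pop();
      cb();
    }
  }

 private:
  std::queue<std::function<void()>> callbacks_;
};

// Scratch memory tied to one stream. The stream must outlive the allocation.
class ContextAllocationSketch {
 public:
  ContextAllocationSketch(std::size_t bytes, CallbackStream* stream,
                          int* live_count)
      : ptr_(std::malloc(bytes)), stream_(stream), live_count_(live_count) {
    assert(stream_ != nullptr && live_count_ != nullptr);
    ++*live_count_;
  }

  ContextAllocationSketch(const ContextAllocationSketch&) = delete;
  ContextAllocationSketch& operator=(const ContextAllocationSketch&) = delete;

  // Defer the actual free to the stream instead of freeing immediately.
  ~ContextAllocationSketch() {
    void* p = ptr_;
    int* live = live_count_;
    stream_->AddCallback([p, live] {
      std::free(p);
      --*live;
    });
  }

  void* ptr() const { return ptr_; }

 private:
  void* ptr_;
  CallbackStream* stream_;
  int* live_count_;  // counts buffers not yet freed, for demonstration only
};
```

With this shape, a buffer that goes out of scope is still counted as live until `Synchronize()` runs the queued callback, and no process-wide singleton has to track temporary buffers.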
  8. 10 Jun, 2019 1 commit
  9. 16 Nov, 2018 1 commit
  10. 14 Nov, 2018 2 commits
  11. 09 Nov, 2018 2 commits
  12. 08 Nov, 2018 1 commit
  13. 19 Oct, 2018 1 commit
  14. 10 Oct, 2018 1 commit
  15. 28 Sep, 2018 2 commits
  16. 08 Aug, 2018 1 commit
  17. 09 Jul, 2018 1 commit
  18. 29 Jun, 2018 1 commit
    • C
      Init allocated memory for unit test (#11657) · d2ad4a5c
      Committed by chengduo
      * memory init
      
      * add env
      
      * refine announce
      
      * Add check for NaN
      
      * Debug
      
      * Add env for cc_test
      
      * Add env for py_test and nv_test
      
      * Remove py_test env
      
      * Add env for py_test
      
      * serial test_recognize_digits
      
      * Test FLAGS_init_allocated_mem function for unit test
      
      * Init allocated mem for op unit test
      
      * Add env for all unit test
  19. 08 Apr, 2018 3 commits
  20. 02 Apr, 2018 1 commit
  21. 28 Mar, 2018 1 commit
  22. 27 Mar, 2018 1 commit
  23. 26 Mar, 2018 3 commits
  24. 20 Mar, 2018 4 commits
  25. 12 Feb, 2018 1 commit
  26. 10 Feb, 2018 2 commits
  27. 09 Jan, 2018 1 commit
  28. 25 Dec, 2017 1 commit
  29. 07 Dec, 2017 1 commit