1. 08 Nov, 2021 1 commit
    • Use cuda virtual memory management and merge blocks (#36189) · a1ec1d5a
      wanghuancoder committed
      * Use cuda virtual memory management and merge blocks, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * window dll, test=develop
      
      * fix cuda error of CUDA_ERROR_NOT_INITIALIZED, test=develop
      
      * use autogrowthv2 for system allocator, test=develop
      
      * remove ~CUDAVirtualMemAllocator(), test=develop
      
      * refine, test=develop
      
      * fix cuda error of CUDA_ERROR_NOT_INITIALIZED, test=develop
      
      * fix cuda error of CUDA_ERROR_NOT_INITIALIZED, test=develop
      
      * fix bug, test=develop
      
      * revert system allocator, test=develop
      
      * revert multiprocessing, test=develop
      
      * fix AutoGrowthBestFitAllocatorV2 mutex, test=develop
      
      * catch cudaErrorInitializationError when creating allocator, test=develop
      
      * fix cuMemSetAccess use, test=develop
      
      * refine cuda api use, test=develop
      
      * refine, test=develop
      
      * for test, test=develop
      
      * for test, test=develop
      
      * switch to v2, test=develop
      
      * refine virtual allocator, test=develop
      
      * Record cuMemCreate and cuMemRelease, test=develop
      
      * refine, test=develop
      
      * avoid out of bounds, test=develop
      
      * rename allocator, test=develop
      
      * refine, test=develop
      
      * use PADDLE_ENFORCE_CUDA_SUCCESS, test=develop
      
      * for test,test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
  2. 28 Sep, 2021 1 commit
    • Add paddle.device.cuda.get_device_properties (#35661) · 4cbed9e5
      Yanxing Shi committed
      * Initial Commit
      
      * add unittest and add error information
      
      * modify doc
      
      * fix some error
      
      * fix some word
      
      * fix bug cudaDeviceProp* and modify error explanation
      
      * fix cudaDeviceProp* error and unittest samples
      
      * fix hip error and PADDLE_WITH_HIP
      
      * update style
      
      * fix error is_compiled_with_cuda
      
      * fix paddle.device.cuda.get_device_properties
      
      * fix error for multi thread safe
      
      * update style
      
      * merge conflict
      
      * modify after mentor review
      
      * update style
      
      * delete word
      
      * fix unittest error for windows
      
      * support string input and modify some code
      
      * modify doc to support string input
      
      * fix error for express information
      
      * fix error for express information
      
      * fix unittest for windows
      
      * fix device.startswith('gpu:')
      
      * format error and doc
      
      * fix after review
      
      * format code
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix py2 error
      
      * fix wrong words and doc
      
      * fix _gpuDeviceProperties
  3. 14 Sep, 2021 1 commit
    • Add api paddle.device.cuda.empty_cache to release idle gpu memory held by the allocator. (#35427) · 83932715
      chenenquan committed
      * Add empty_cache api to release idle gpu memory held by allocator, test=develop
      
      * Add empty_cache api to release idle gpu memory held by allocator, test=develop
      
      * Add empty_cache api to release idle gpu memory held by allocator, test=develop
      
      * Fix test coverage problem for empty_cache
      
      * delete redundant check for empty_cache
      
      * fix the problem of empty_cache's doc
      
      * delete the nvidia-smi comment in doc of empty_cache, test=document_fix
  4. 07 Feb, 2021 1 commit
  5. 16 Aug, 2020 1 commit
  6. 04 Mar, 2020 1 commit
    • Add flags to limit gpu memory (#22793) · d41d802b
      Zeng Jinle committed
      * add recorded cuda memory apis, fix typo, test=develop
      
      * add more ut, test=develop
      
      * follow comments, test=develop
      
      * fix py35 incompatible issues, test=develop
  7. 09 Jan, 2020 1 commit
  8. 08 Jan, 2020 1 commit
  9. 14 Nov, 2019 1 commit
  10. 06 Nov, 2019 1 commit
  11. 31 Jul, 2019 1 commit
  12. 21 Mar, 2019 1 commit
    • add more unittest · 953214ad
      sneaxiy committed
      modify allocator strategy
      remove changes of legacy buddy_allocator
      test=develop
  13. 19 Mar, 2019 1 commit
  14. 04 Dec, 2018 1 commit
  15. 22 Nov, 2018 1 commit
    • Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929) · 00b9e9a1
      chengduo committed
      * refine cublas
      test=develop
      
      * code refine
      
      * refine cublas
      
      * add GEMM_EX
      
      * add enable_cublas_tensor_op_math doc and add cublasCall
      test=develop
      
      * fix CublasCall for cuda version
      test=develop
      
      * fix error
      test=develop
      
      * fix GEMM_EX to be compatible with gcc 4.8
      test=develop
      
      * add GEMM_EX
      test=develop
      
      * to be compatible with gcc4.8
      test=develop
  16. 15 Oct, 2018 1 commit
  17. 27 Sep, 2018 1 commit
  18. 26 Sep, 2018 1 commit
  19. 23 Apr, 2018 1 commit
  20. 08 Apr, 2018 2 commits
  21. 10 Mar, 2018 1 commit
  22. 03 Mar, 2018 1 commit
  23. 12 Feb, 2018 1 commit
  24. 10 Feb, 2018 1 commit
  25. 22 Dec, 2017 1 commit
    • "remove GPU Sync Interface" (#6793) · abde3130
      dzhwinter committed
      * "remove GPU Sync Interface"
      
      * "fix typo"
      
      * "fix type cast error"
      
      * "fix related Copy with stream"
      
      * "fix failed tests with DevicePool"
      
      * "fix stupid removed position error"
  26. 16 Nov, 2017 1 commit
  27. 10 Oct, 2017 1 commit
  28. 05 Oct, 2017 3 commits
  29. 26 Sep, 2017 1 commit
  30. 18 Aug, 2017 2 commits
  31. 19 Jul, 2017 1 commit
  32. 11 Jul, 2017 1 commit
  33. 04 Jul, 2017 1 commit
  34. 29 Jun, 2017 2 commits
  35. 28 Jun, 2017 1 commit