1. 08 11月, 2021 1 次提交
    • W
      Use cuda virtual memory management and merge blocks (#36189) · a1ec1d5a
      wanghuancoder 提交于
      * Use cuda virtual memory management and merge blocks, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * window dll, test=develop
      
      * fix cuda error of CUDA_ERROR_NOT_INITIALIZED, test=develop
      
      * use autogrowthv2 for system allocator, test=develop
      
      * remove ~CUDAVirtualMemAllocator(), test=develop
      
      * refine, test=develop
      
      * fix cuda error of CUDA_ERROR_NOT_INITIALIZED, test=develop
      
      * fix cuda error of CUDA_ERROR_NOT_INITIALIZED, test=develop
      
      * fix bug, test=develop
      
      * revert system allocator, test =develop
      
      * revert multiprocessing, test=develop
      
      * fix AutoGrowthBestFitAllocatorV2 mutxt, test=develop
      
      * catch cudaErrorInitializationError when create allocator, test=develop
      
      * fix cuMemSetAccess use, test=develop
      
      * refine cuda api use, test=develop
      
      * refine, test=develop
      
      * for test, test=develop
      
      * for test, test=develop
      
      * switch to v2, test=develop
      
      * refine virtual allocator, test=develop
      
      * Record cuMemCreate and cuMemRelease, test=develop
      
      * refine, test=develop
      
      * avoid out of bounds, test=develop
      
      * rename allocator, test=develop
      
      * refine, test=develop
      
      * use PADDLE_ENFORCE_CUDA_SUCCESS, test=develop
      
      * for test,test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      a1ec1d5a
  2. 11 10月, 2021 1 次提交
  3. 27 5月, 2021 1 次提交
  4. 20 4月, 2020 1 次提交
    • Z
      Optimize the error messages of paddle CUDA API (#23816) · 78170037
      Zhou Wei 提交于
      * Optimize the error messages of paddle CUDA API, test=develop
      
      * fix the error messages of paddle CUDA API, test=develop
      
      * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop
      
      * remove build_ex_string,test=develop
      
      * merge conflict,test=develop
      78170037
  5. 13 4月, 2020 1 次提交
  6. 28 3月, 2019 1 次提交
    • C
      Fix the interface of Pass::Apply (#16484) · ed61d67c
      chengduo 提交于
      * modify the interface of Pass::Allay
      test=develop
      
      * Polish code
      test=develop
      
      * Fix Travis CI
      test=develop
      
      * fix Pass::Apply interface
      test=develop
      
      * Fix Travis CI
      test=develop
      ed61d67c
  7. 29 1月, 2019 1 次提交
  8. 31 10月, 2018 1 次提交
    • S
      add depthwise conv mkldnn pass · 4e2aaf01
      Sylwester Fraczek 提交于
      added depthwise conv mkldnn pass which for MKLDNN changes depthwise_conv operator to conv operator because for mkldnn this is the same api
      test=develop
      4e2aaf01
  9. 19 10月, 2018 1 次提交
  10. 27 8月, 2018 1 次提交
  11. 25 8月, 2018 1 次提交
  12. 23 8月, 2018 1 次提交
    • W
      Resovle multi gpu async deps (#12828) · b8da70c3
      Wu Yi 提交于
      * dist transpiler add control dependency var between send and recv
      
      * fix async deps
      
      * follow comments and refine
      
      * fix deps connect for rpc ops
      b8da70c3
  13. 22 7月, 2018 1 次提交
  14. 18 7月, 2018 2 次提交
  15. 12 2月, 2018 1 次提交
  16. 10 2月, 2018 2 次提交
  17. 24 11月, 2017 1 次提交
  18. 13 10月, 2017 1 次提交
  19. 15 9月, 2017 1 次提交
  20. 25 5月, 2017 1 次提交
  21. 09 12月, 2016 1 次提交
  22. 29 8月, 2016 1 次提交