1. 05 9月, 2019 1 次提交
    • Y
      Integrate NVRTC to support compiling CUDA kernel at runtime (#19422) · 42b5bec6
      Yiqun Liu 提交于
      * Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
      test=develop
      
      * Call CUDA driver api to launch the kernel compiled by nvrtc.
      test=develop
      
      * Disable for mac and windows.
      test=develop
      
      * Refine the codes to support manually specified num_threads and workload_per_thread.
      test=develop
      
      * Refine the CUDA kernel to support large dims.
      test=develop
      42b5bec6
  2. 12 8月, 2019 1 次提交
  3. 05 8月, 2019 1 次提交
    • L
      fix warpctc.dll not found issue (#18761) · a43a763b
      liuwei1031 提交于
      * fix warpctc.dll not found issue, test=develop
      
      * revert the linux platform change, test=develop
      
      * delete warpctc_lib_path.h.in, test=develop
      
      * add SetPySitePackagePath function
      
      * fix warpctc.dylib not found issue on Mac, test=develop
      
      * improve the paddle lib path setting logic, test=develop
      
      * fix mac ci issue caused by test_warpctc_op unittest, test=develop
      
      * tweak code, test=develop
      a43a763b
  4. 03 6月, 2019 1 次提交
  5. 07 5月, 2019 1 次提交
  6. 03 4月, 2019 1 次提交
  7. 02 4月, 2019 1 次提交
    • C
      Model data cryption link all lib (#16555) · c38c7c56
      Chen Weihang 提交于
      * link the libwbaes.so into paddle
      
      * polish detail, test=develop
      
      * try fix mac_pr_ci error, test=develop
      
      * add compile option, test=develop
      
      * fix ci error, test=develop
      
      * ignore failed to find mac lib, test=develop
      
      * change cdn to bj, cdn can't get the latest version
      
      * trigger ci, test=develop
      
      * temporary delete win32 lib linking, test=develop
      
      * change https to http, test=develop
      
      * turn compile option on to off
      
      * turn compile option off to on, test=develop
      
      * try lib compiled by gcc4.8, test=develop
      
      * update lib version, test=develop
      
      * link other lib, test=develop
      
      * add setup config
      
      * delete false, test=develop
      
      * delete no_soname, test=develop
      
      * recover so name set
      
      * fix, test=develop
      
      * adjust make config, test=develop
      
      * remove link to wbaes, test=develop
      
      * remove useless define, test=develop
      c38c7c56
  8. 26 12月, 2018 1 次提交
  9. 19 12月, 2018 1 次提交
  10. 18 12月, 2018 3 次提交
  11. 26 11月, 2018 1 次提交
  12. 08 11月, 2018 1 次提交
  13. 28 9月, 2018 1 次提交
  14. 05 9月, 2018 1 次提交
  15. 27 8月, 2018 4 次提交
  16. 26 8月, 2018 1 次提交
  17. 24 8月, 2018 1 次提交
  18. 05 7月, 2018 1 次提交
  19. 21 6月, 2018 1 次提交
  20. 20 6月, 2018 2 次提交
  21. 16 4月, 2018 1 次提交
  22. 08 4月, 2018 1 次提交
  23. 28 2月, 2018 1 次提交
  24. 26 2月, 2018 1 次提交
  25. 12 2月, 2018 1 次提交
  26. 10 2月, 2018 2 次提交
  27. 14 1月, 2018 1 次提交
    • D
      "cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0
      dzhwinter 提交于
      * "unified operators"
      
      * "add CUDNN register"
      
      * "add use cudnn attribute"
      
      * "add attribute"
      
      * "test conv tranpose op"
      
      * "remove duplicated attr"
      
      * "fix op test"
      
      * "add attribute to set cudnn"
      
      * "add more log"
      
      * "need layout op register support"
      
      * "add more log"
      
      * "change GetExpectedKernelType "
      
      * "fix Get attr in conv_op"
      
      * "fix CI"
      
      * "fix tests"
      
      * "removed kernel priority fallback"
      
      * "fix CI"
      
      * "fix stack pointer bug"
      
      * "refine buggy interface"
      
      * "add const cast to save life"
      
      * "fix get_output_with_grad"
      
      * "fix op test with dataformat"
      
      * ""fix pooling
      
      * "fix pooling test"
      
      * "fix CI"
      
      * "fix with_gpu error"
      
      * "add transform needed functional check"
      
      * "fix unpack list error"
      
      * "comment out parallel.do temporary"
      
      * "fix CI"
      
      * "fix compile doc error"
      
      * "make threshold larger"
      5ad1aef0
  28. 07 12月, 2017 1 次提交
  29. 24 10月, 2017 1 次提交
    • Y
      Feature/nccl dso (#5001) · 43c6ff21
      Yu Yang 提交于
      * "add nccl enforce"
      
      * Dev
      
      * Update comment
      
      * Add nccl test
      
      * Follow comments
      43c6ff21
  30. 15 10月, 2017 1 次提交
  31. 15 7月, 2017 1 次提交
  32. 04 7月, 2017 2 次提交