1. 20 1月, 2021 1 次提交
    • W
      use nvtx push pop in timeline (#30567) · 90773473
      wanghuancoder 提交于
      * delete empty line of pybing.cc, test=develop
      
      * use nvtx push pop in timeline, test=develop
      
      * change year, test=develop
      
      * add #ifdef PADDLE_WITH_CUDA, test=develop
      
      * add #ifndef WIN32, test=develop
      
      * is_pushed to is_pushed_, test=develop
      90773473
  2. 06 1月, 2021 1 次提交
  3. 16 12月, 2020 1 次提交
    • Y
      添加rocm平台支持代码 (#29342) · 76738504
      Y_Xuan 提交于
      * 添加rocm平台支持代码
      
      * 修改一些问题
      
      * 修改一些歧义并添加备注
      
      * 修改代码格式
      
      * 解决冲突后的代码修改
      
      * 修改operators.cmake
      
      * 修改格式
      
      * 修正错误
      
      * 统一接口
      
      * 修改日期
      76738504
  4. 27 11月, 2020 1 次提交
  5. 21 10月, 2020 1 次提交
  6. 24 9月, 2020 1 次提交
    • W
      use iwyu clean include (#27267) · df43905f
      wanghuancoder 提交于
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
      df43905f
  7. 20 7月, 2020 1 次提交
  8. 07 7月, 2020 1 次提交
  9. 02 7月, 2020 1 次提交
  10. 24 6月, 2020 1 次提交
    • C
      Add default cudnn lib path (#25175) · 353ea9e8
      Chen Weihang 提交于
      * add default cudnn lib path, test=develop
      
      * change default path in func, test=develop
      
      * move to linux branch, test=develop
      
      * fix var error in other plat, test=develop
      353ea9e8
  11. 18 5月, 2020 1 次提交
    • Y
      Add some check for CUDA Driver API and NVRTC (#22719) · 560c8153
      Yiqun Liu 提交于
      * Add the check for whether CUDA Driver and NVRTC is available for the runtime system.
      
      * Call cuInit to initialize the CUDA Driver API before all CUDA callings.
      test=develop
      
      * Change the behavior when libnvrtc.so can not be found, printing a warning instead of exiting.
      test=develop
      
      * Do not initialize CUDA Driver API for windows and macos.
      test=develop
      
      * Remove the call of cuInit when entering paddle and enable the test_code_generator.
      test=develop
      
      * Add some built-in functions for __half.
      test=develop
      
      * Change save_intermediate_out to false in unittest.
      test=develop
      
      * Fix error reference to tempropary variable when seting including path for device_code.
      test=develop
      560c8153
  12. 08 5月, 2020 1 次提交
  13. 30 4月, 2020 1 次提交
    • G
      Fix cusolver loader for Windows (#24157) · 1fc6cc50
      Guo Sheng 提交于
      * Fix cusolver loader for Windows in dynamic_loader.cc. test=develop
      
      * Fix missing CUSOLVER_ROUTINE_EACH_R1.
      test=gpu
      test=develop
      
      * Add unsupprot for cusolver on Windows temporarily. test=develop
      
      * Fix GetCusolverDsoHandle error message. test=develop
      1fc6cc50
  14. 24 4月, 2020 1 次提交
    • G
      Add cholesky_op (#23543) · a8c0fb4e
      Guo Sheng 提交于
      * Add cholesky_op forward part. test=develop
      
      * Complete cholesky_op forward part. test=develop
      
      * Add cholesky_op backward part. test=develop
      
      * Complete cholesky_op backward part. test=develop
      
      * Refine cholesky_op error check and docs. test=develop
      
      * Add grad_check unit test for cholesky_op. test=develop
      
      * Fix sample code in cholesky doc. test=develop
      
      * Refine some error messages of cholesky_op. test=develop
      
      * Refine some error messages of cholesky_op. test=develop
      
      * Remove unused input in cholesky_grad. test=develop
      
      * Remove unused input in cholesky_grad. test=develop
      
      * Fix stream for cusolverDnSetStream. test=develop
      
      * Update PADDLE_ENFORCE_CUDA_SUCCESS from cholesky_op to adapt to latest code.
      test=develop
      
      * Add CUSOLVER ERROR in enforce.h
      test=develop
      
      * Fix the missing return value in cholesky. test=develop
      a8c0fb4e
  15. 28 9月, 2019 1 次提交
    • Q
      Enable users to create custom cpp op outside framework. (#19256) · 1a3eef02
      qingqing01 提交于
      * How to write custom op needs to follow framework OP spec.
      * Package fluid_framework.so and headers into whl.
      * Add paddle.sysconfig.get_include() and paddle.sysconfig.get_lib() to get include dir and lib dir.
      * Export some C-APIs to merge OpInfo between core.so and custom_op.so.
      * Add unit testing.
      * Update API.spec.
      1a3eef02
  16. 05 9月, 2019 1 次提交
    • Y
      Integrate NVRTC to support compiling CUDA kernel at runtime (#19422) · 42b5bec6
      Yiqun Liu 提交于
      * Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
      test=develop
      
      * Call CUDA driver api to launch the kernel compiled by nvrtc.
      test=develop
      
      * Disable for mac and windows.
      test=develop
      
      * Refine the codes to support manually specified num_threads and workload_per_thread.
      test=develop
      
      * Refine the CUDA kernel to support large dims.
      test=develop
      42b5bec6
  17. 12 8月, 2019 1 次提交
  18. 05 8月, 2019 1 次提交
    • L
      fix warpctc.dll not found issue (#18761) · a43a763b
      liuwei1031 提交于
      * fix warpctc.dll not found issue, test=develop
      
      * revert the linux platform change, test=develop
      
      * delete warpctc_lib_path.h.in, test=develop
      
      * add SetPySitePackagePath function
      
      * fix warpctc.dylib not found issue on Mac, test=develop
      
      * improve the paddle lib path setting logic, test=develop
      
      * fix mac ci issue caused by test_warpctc_op unittest, test=develop
      
      * tweak code, test=develop
      a43a763b
  19. 03 6月, 2019 1 次提交
  20. 07 5月, 2019 1 次提交
  21. 03 4月, 2019 1 次提交
  22. 02 4月, 2019 1 次提交
    • C
      Model data cryption link all lib (#16555) · c38c7c56
      Chen Weihang 提交于
      * link the libwbaes.so into paddle
      
      * polish detail, test=develop
      
      * try fix mac_pr_ci error, test=develop
      
      * add compile option, test=develop
      
      * fix ci error, test=develop
      
      * ignore failed to find mac lib, test=develop
      
      * change cdn to bj, cdn can't get the latest version
      
      * trigger ci, test=develop
      
      * temporary delete win32 lib linking, test=develop
      
      * change https to http, test=develop
      
      * turn compile option on to off
      
      * turn compile option off to on, test=develop
      
      * try lib compiled by gcc4.8, test=develop
      
      * update lib version, test=develop
      
      * link other lib, test=develop
      
      * add setup config
      
      * delete false, test=develop
      
      * delete no_soname, test=develop
      
      * recover so name set
      
      * fix, test=develop
      
      * adjust make config, test=develop
      
      * remove link to wbaes, test=develop
      
      * remove useless define, test=develop
      c38c7c56
  23. 26 12月, 2018 1 次提交
  24. 19 12月, 2018 1 次提交
  25. 18 12月, 2018 3 次提交
  26. 26 11月, 2018 1 次提交
  27. 08 11月, 2018 1 次提交
  28. 28 9月, 2018 1 次提交
  29. 05 9月, 2018 1 次提交
  30. 27 8月, 2018 4 次提交
  31. 26 8月, 2018 1 次提交
  32. 24 8月, 2018 1 次提交
  33. 05 7月, 2018 1 次提交
  34. 21 6月, 2018 1 次提交
  35. 20 6月, 2018 1 次提交