1. 03 12月, 2021 1 次提交
  2. 01 12月, 2021 1 次提交
    • L
      HostEventRecorder (#37629) · feda7c1d
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * update HostEventTracer
      
      * update HostEventTracer
      
      * fix c++17
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix bug
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      feda7c1d
  3. 18 9月, 2021 1 次提交
    • H
      Basic PR on Cost Model (#35774) · 5ba9fe6e
      Huihuang Zheng 提交于
      Add basic Cost Model, it uses executor to run program and profile it to get op time.
      
      This is an early basic version, we will add more functions in the future.
      5ba9fe6e
  4. 07 2月, 2021 1 次提交
  5. 20 1月, 2021 1 次提交
    • W
      use nvtx push pop in timeline (#30567) · 90773473
      wanghuancoder 提交于
      * delete empty line of pybing.cc, test=develop
      
      * use nvtx push pop in timeline, test=develop
      
      * change year, test=develop
      
      * add #ifdef PADDLE_WITH_CUDA, test=develop
      
      * add #ifndef WIN32, test=develop
      
      * is_pushed to is_pushed_, test=develop
      90773473
  6. 08 12月, 2020 1 次提交
    • J
      added internal and external reorders to profiler (#29443) · 57a4f16d
      jakpiase 提交于
      * added external reorder to profiler
      
      * added external and internal reorders to profiler
      
      * added internal and external reorder to profiler
      
      * added formatting to int/ext reorder commit
      
      * removed unnecessary comment
      57a4f16d
  7. 15 7月, 2020 1 次提交
  8. 03 6月, 2020 1 次提交
  9. 25 5月, 2020 1 次提交
  10. 25 4月, 2020 1 次提交
  11. 20 4月, 2020 1 次提交
    • Z
      Optimize the error messages of paddle CUDA API (#23816) · 78170037
      Zhou Wei 提交于
      * Optimize the error messages of paddle CUDA API, test=develop
      
      * fix the error messages of paddle CUDA API, test=develop
      
      * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop
      
      * remove build_ex_string,test=develop
      
      * merge conflict,test=develop
      78170037
  12. 31 3月, 2020 1 次提交
  13. 13 3月, 2020 1 次提交
  14. 12 3月, 2020 1 次提交
  15. 07 3月, 2020 2 次提交
  16. 03 3月, 2020 1 次提交
  17. 02 3月, 2020 2 次提交
  18. 25 2月, 2020 1 次提交
  19. 24 2月, 2020 1 次提交
  20. 19 2月, 2020 1 次提交
  21. 18 2月, 2020 1 次提交
  22. 06 2月, 2020 1 次提交
  23. 10 1月, 2020 1 次提交
  24. 09 1月, 2020 1 次提交
  25. 05 12月, 2019 1 次提交
  26. 28 11月, 2019 1 次提交
  27. 13 3月, 2019 1 次提交
  28. 22 2月, 2019 1 次提交
  29. 21 2月, 2019 4 次提交
    • D
      test=develop · 35a90e06
      Dun Liang 提交于
      35a90e06
    • D
      test=develop · c9080f51
      Dun Liang 提交于
      c9080f51
    • D
      test=develop · 1c7bb0e4
      Dun Liang 提交于
      1c7bb0e4
    • D
      Profiler refine and add CUDA runtime api tracer (#15301) · a83e4704
      Dun 提交于
      * refine profiler && add runtime tracer
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * fix bug && test=develop
      
      * add thread id map && test=develop
      
      * test=develop
      
      * testing
      
      * bug fix
      
      * remove cuda event && refine code && test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * fix windows temp file && test=develop
      
      * test=develop
      
      * fix windows bug && test=develop
      
      * fix start up issue && test=develop
      
      * code polish &&  test=develop
      
      * remove unused code && test=develop
      
      * add some cupti cbid && test=develop
      
      * add FLAGS_multiple_of_cupti_buffer_size && test=develop
      
      * fix compile error && test=develop
      
      * add keyword && test=develop
      
      * fix && test=develop
      
      * code polish && test=develop
      a83e4704
  30. 28 12月, 2018 1 次提交
  31. 06 12月, 2018 1 次提交
  32. 22 11月, 2018 1 次提交
    • W
      Windows/online (#14474) · d9a1f3e5
      wopeizl 提交于
      * add recordio support
      
      * disable the openblas multi-thread on windows since no support
      adjust the python script
      
      * code style
      
      * code style
      test=develop
      
      * add create_recordio_file_reader back
      
      * fix code style
      test=develop
      
      * fix the gtest.cmake on windows
      
      * fix cc_test on windows
      
      * fix the win build
      test=develop
      
      * remove fused compile support on windows
      test=develop
      
      * add the jit support
      test=develop
      
      * add the jit support, test=develop
      
      * add the jit support, test=develop
      
      * add the jit back
      fix compile error on windows
      
      * rollback test=develop
      
      * test case fix
      
      * disable DSO by default on windows
      
      * exclude warpctc_op on windows
      
      * exclude the dynload_warpctc out on windows
      test=develop
      
      * fix the scripts error
      test=develop
      
      * disable avx on windows by default
      test=develop
      
      * re-organize the cmake file
      
      * disable mkl on windows by default
      
      * add warp_ctc back
      
      * fix the dependency
      
      * fix the dependency
      
      * fix the build issue on windows
      
      * remove unsupported flag on windows
      
      * code style
      
      * code style
      test=develop
      
      * fix issue
      
      * add profiler, parallel_executor back
      
      * clean up the pre-definitions on windows
      
      * fix build issue
      
      * test=develop
      d9a1f3e5
  33. 21 11月, 2018 2 次提交
  34. 06 11月, 2018 1 次提交