1. 18 9月, 2021 1 次提交
    • H
      Basic PR on Cost Model (#35774) · 5ba9fe6e
      Huihuang Zheng 提交于
      Add basic Cost Model, it uses executor to run program and profile it to get op time.
      
      This is an early basic version, we will add more functions in the future.
      5ba9fe6e
  2. 26 5月, 2021 1 次提交
  3. 07 2月, 2021 1 次提交
  4. 20 1月, 2021 1 次提交
    • W
      use nvtx push pop in timeline (#30567) · 90773473
      wanghuancoder 提交于
      * delete empty line of pybing.cc, test=develop
      
      * use nvtx push pop in timeline, test=develop
      
      * change year, test=develop
      
      * add #ifdef PADDLE_WITH_CUDA, test=develop
      
      * add #ifndef WIN32, test=develop
      
      * is_pushed to is_pushed_, test=develop
      90773473
  5. 24 9月, 2020 1 次提交
    • W
      use iwyu clean include (#27267) · df43905f
      wanghuancoder 提交于
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
      df43905f
  6. 03 6月, 2020 1 次提交
  7. 25 5月, 2020 1 次提交
  8. 24 4月, 2020 1 次提交
  9. 03 3月, 2020 1 次提交
  10. 02 3月, 2020 1 次提交
  11. 19 2月, 2020 1 次提交
  12. 18 2月, 2020 1 次提交
  13. 06 2月, 2020 1 次提交
  14. 13 3月, 2019 1 次提交
  15. 25 2月, 2019 1 次提交
  16. 22 2月, 2019 1 次提交
  17. 21 2月, 2019 1 次提交
    • D
      Profiler refine and add CUDA runtime api tracer (#15301) · a83e4704
      Dun 提交于
      * refine profiler && add runtime tracer
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * fix bug && test=develop
      
      * add thread id map && test=develop
      
      * test=develop
      
      * testing
      
      * bug fix
      
      * remove cuda event && refine code && test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * fix windows temp file && test=develop
      
      * test=develop
      
      * fix windows bug && test=develop
      
      * fix start up issue && test=develop
      
      * code polish &&  test=develop
      
      * remove unused code && test=develop
      
      * add some cupti cbid && test=develop
      
      * add FLAGS_multiple_of_cupti_buffer_size && test=develop
      
      * fix compile error && test=develop
      
      * add keyword && test=develop
      
      * fix && test=develop
      
      * code polish && test=develop
      a83e4704
  18. 22 11月, 2018 1 次提交
    • W
      Windows/online (#14474) · d9a1f3e5
      wopeizl 提交于
      * add recordio support
      
      * disable the openblas multi-thread on windows since no support
      adjust the python script
      
      * code style
      
      * code style
      test=develop
      
      * add create_recordio_file_reader back
      
      * fix code style
      test=develop
      
      * fix the gtest.cmake on windows
      
      * fix cc_test on windows
      
      * fix the win build
      test=develop
      
      * remove fused compile support on windows
      test=develop
      
      * add the jit support
      test=develop
      
      * add the jit support, test=develop
      
      * add the jit support, test=develop
      
      * add the jit back
      fix compile error on windows
      
      * rollback test=develop
      
      * test case fix
      
      * disable DSO by default on windows
      
      * exclude warpctc_op on windows
      
      * exclude the dynload_warpctc out on windows
      test=develop
      
      * fix the scripts error
      test=develop
      
      * disable avx on windows by default
      test=develop
      
      * re-organize the cmake file
      
      * disable mkl on windows by default
      
      * add warp_ctc back
      
      * fix the dependency
      
      * fix the dependency
      
      * fix the build issue on windows
      
      * remove unsupported flag on windows
      
      * code style
      
      * code style
      test=develop
      
      * fix issue
      
      * add profiler, parallel_executor back
      
      * clean up the pre-definitions on windows
      
      * fix build issue
      
      * test=develop
      d9a1f3e5
  19. 21 11月, 2018 1 次提交
  20. 22 10月, 2018 1 次提交
  21. 18 10月, 2018 1 次提交
  22. 27 8月, 2018 3 次提交
  23. 24 8月, 2018 1 次提交
  24. 31 7月, 2018 1 次提交
  25. 23 5月, 2018 1 次提交
  26. 15 5月, 2018 1 次提交
    • X
      Fix a profiler race condition · 94c0a64d
      Xin Pan 提交于
      In multi-thread condition, EnableProfiler can
      be called after RecordEvent is constructed. In this
      case, RecordEvent constructor will not init anything,
      but RecordEvent destructor will do something since EnableProfiler
      was called.
      This PR fixes it.
      94c0a64d
  27. 04 5月, 2018 1 次提交
  28. 03 5月, 2018 2 次提交
  29. 10 4月, 2018 1 次提交
  30. 16 3月, 2018 1 次提交
  31. 14 3月, 2018 1 次提交
  32. 01 3月, 2018 2 次提交
  33. 26 2月, 2018 1 次提交
  34. 12 2月, 2018 1 次提交
  35. 10 2月, 2018 2 次提交