1. 19 2月, 2022 1 次提交
    • C
      fix RecordEvent interface (#39675) · 019a552b
      chenjian 提交于
      * fix RecordEvent interface
      
      * modify default level to 4
      
      * update interface use
      
      * add const default trace level
      
      * update operator.cc
      019a552b
  2. 28 1月, 2022 1 次提交
  3. 10 1月, 2022 1 次提交
    • L
      Profiler skeleton (#38826) · a8afed69
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * profiler skeleton
      
      * update
      
      * update
      
      * update
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      a8afed69
  4. 29 12月, 2021 1 次提交
    • L
      Make profiler better (#38280) · 851637fd
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * update OS info
      
      * split host_event_recorder
      
      * split host_event_recorder
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      851637fd
  5. 16 12月, 2021 1 次提交
  6. 08 12月, 2021 1 次提交
  7. 01 12月, 2021 1 次提交
    • L
      HostEventRecorder (#37629) · feda7c1d
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * update HostEventTracer
      
      * update HostEventTracer
      
      * fix c++17
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix bug
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      feda7c1d
  8. 18 9月, 2021 1 次提交
    • H
      Basic PR on Cost Model (#35774) · 5ba9fe6e
      Huihuang Zheng 提交于
      Add basic Cost Model, it uses executor to run program and profile it to get op time.
      
      This is an early basic version, we will add more functions in the future.
      5ba9fe6e
  9. 17 9月, 2021 1 次提交
    • Z
      Make flag adding easier (#35823) · 2c781455
      Zeng Jinle 提交于
      * make flag setter easier
      
      * update
      
      * rename macro name
      
      * fix bug of public/writable
      
      * update to pass CI
      
      * polish
      
      * fix CPU link error
      2c781455
  10. 26 5月, 2021 1 次提交
  11. 07 2月, 2021 1 次提交
  12. 04 2月, 2021 1 次提交
  13. 20 1月, 2021 1 次提交
    • W
      use nvtx push pop in timeline (#30567) · 90773473
      wanghuancoder 提交于
      * delete empty line of pybing.cc, test=develop
      
      * use nvtx push pop in timeline, test=develop
      
      * change year, test=develop
      
      * add #ifdef PADDLE_WITH_CUDA, test=develop
      
      * add #ifndef WIN32, test=develop
      
      * is_pushed to is_pushed_, test=develop
      90773473
  14. 24 9月, 2020 1 次提交
    • W
      use iwyu clean include (#27267) · df43905f
      wanghuancoder 提交于
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
      df43905f
  15. 15 7月, 2020 1 次提交
  16. 09 6月, 2020 1 次提交
  17. 26 5月, 2020 1 次提交
  18. 25 5月, 2020 1 次提交
  19. 24 4月, 2020 1 次提交
  20. 03 3月, 2020 1 次提交
  21. 02 3月, 2020 1 次提交
  22. 25 2月, 2020 1 次提交
  23. 24 2月, 2020 1 次提交
  24. 19 2月, 2020 1 次提交
  25. 18 2月, 2020 1 次提交
  26. 06 2月, 2020 1 次提交
  27. 10 1月, 2020 1 次提交
  28. 09 1月, 2020 1 次提交
  29. 05 12月, 2019 1 次提交
  30. 28 11月, 2019 1 次提交
  31. 13 3月, 2019 1 次提交
  32. 22 2月, 2019 1 次提交
  33. 21 2月, 2019 4 次提交
    • D
      test=develop · 35a90e06
      Dun Liang 提交于
      35a90e06
    • D
      test=develop · c9080f51
      Dun Liang 提交于
      c9080f51
    • D
      test=develop · 1c7bb0e4
      Dun Liang 提交于
      1c7bb0e4
    • D
      Profiler refine and add CUDA runtime api tracer (#15301) · a83e4704
      Dun 提交于
      * refine profiler && add runtime tracer
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * fix bug && test=develop
      
      * add thread id map && test=develop
      
      * test=develop
      
      * testing
      
      * bug fix
      
      * remove cuda event && refine code && test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * fix windows temp file && test=develop
      
      * test=develop
      
      * fix windows bug && test=develop
      
      * fix start up issue && test=develop
      
      * code polish &&  test=develop
      
      * remove unused code && test=develop
      
      * add some cupti cbid && test=develop
      
      * add FLAGS_multiple_of_cupti_buffer_size && test=develop
      
      * fix compile error && test=develop
      
      * add keyword && test=develop
      
      * fix && test=develop
      
      * code polish && test=develop
      a83e4704
  34. 28 12月, 2018 1 次提交
  35. 06 12月, 2018 1 次提交
  36. 22 11月, 2018 1 次提交
    • W
      Windows/online (#14474) · d9a1f3e5
      wopeizl 提交于
      * add recordio support
      
      * disable the openblas multi-thread on windows since no support
      adjust the python script
      
      * code style
      
      * code style
      test=develop
      
      * add create_recordio_file_reader back
      
      * fix code style
      test=develop
      
      * fix the gtest.cmake on windows
      
      * fix cc_test on windows
      
      * fix the win build
      test=develop
      
      * remove fused compile support on windows
      test=develop
      
      * add the jit support
      test=develop
      
      * add the jit support, test=develop
      
      * add the jit support, test=develop
      
      * add the jit back
      fix compile error on windows
      
      * rollback test=develop
      
      * test case fix
      
      * disable DSO by default on windows
      
      * exclude warpctc_op on windows
      
      * exclude the dynload_warpctc out on windows
      test=develop
      
      * fix the scripts error
      test=develop
      
      * disable avx on windows by default
      test=develop
      
      * re-organize the cmake file
      
      * disable mkl on windows by default
      
      * add warp_ctc back
      
      * fix the dependency
      
      * fix the dependency
      
      * fix the build issue on windows
      
      * remove unsupported flag on windows
      
      * code style
      
      * code style
      test=develop
      
      * fix issue
      
      * add profiler, parallel_executor back
      
      * clean up the pre-definitions on windows
      
      * fix build issue
      
      * test=develop
      d9a1f3e5
  37. 21 11月, 2018 1 次提交