1. 14 7月, 2022 1 次提交
    • L
      refine allocation cmake (#44241) · dc5a0420
      Leo Chen 提交于
      * build into one static library
      
      * move memory/detail to memory/allocation
      
      * fix bug
      
      * fix profiler
      
      * fix framework_proto
      
      * fix deps
      
      * fix inference compilation
      
      * fix rocm compile
      
      * follow comments
      
      * fix buddy_allocator_test
      dc5a0420
  2. 24 6月, 2022 1 次提交
    • C
      record memory and op supplement info (#43550) · 8dd0a3b9
      chenjian 提交于
      * record memory and op supplement info
      
      * update
      
      * update
      
      * fix a bug
      
      * fix memory recording
      
      * fix a bug
      
      * update
      
      * update
      
      * fix a bug
      
      * update
      
      * fix a bug
      
      * fix a bug
      
      * fix a bug
      
      * Revert "fix a bug"
      
      This reverts commit c1d4df52762ba9ae7c7e27cd2ba4fc3a7ed9c7a5.
      
      * fix a bug
      
      * fix format
      
      * fix
      8dd0a3b9
  3. 08 3月, 2022 1 次提交
    • C
      add python profiler package (#40065) · 10325a82
      chenjian 提交于
      * add python profiler package
      
      * update according to review
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * add unit test
      
      * Revert "add unit test"
      
      This reverts commit 4e69ff71b0645e069afe5dd8fea0d07717852c48.
      
      * reduce for pr
      
      * add unit test
      
      * modify for pr
      
      * fix unittest
      
      * update for ci coverage
      
      * modify according to review
      
      * fix bug
      
      * improve coverage
      10325a82
  4. 10 1月, 2022 1 次提交
    • L
      Profiler skeleton (#38826) · a8afed69
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * profiler skeleton
      
      * update
      
      * update
      
      * update
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      a8afed69
  5. 29 12月, 2021 1 次提交
    • L
      Make profiler better (#38280) · 851637fd
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * update OS info
      
      * split host_event_recorder
      
      * split host_event_recorder
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      851637fd
  6. 08 12月, 2021 1 次提交
  7. 03 12月, 2021 1 次提交
  8. 01 12月, 2021 1 次提交
    • L
      HostEventRecorder (#37629) · feda7c1d
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * update HostEventTracer
      
      * update HostEventTracer
      
      * fix c++17
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix bug
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      feda7c1d
  9. 18 9月, 2021 1 次提交
    • H
      Basic PR on Cost Model (#35774) · 5ba9fe6e
      Huihuang Zheng 提交于
      Add basic Cost Model, it uses executor to run program and profile it to get op time.
      
      This is an early basic version, we will add more functions in the future.
      5ba9fe6e
  10. 26 5月, 2021 1 次提交
  11. 07 2月, 2021 1 次提交
  12. 20 1月, 2021 1 次提交
    • W
      use nvtx push pop in timeline (#30567) · 90773473
      wanghuancoder 提交于
      * delete empty line of pybing.cc, test=develop
      
      * use nvtx push pop in timeline, test=develop
      
      * change year, test=develop
      
      * add #ifdef PADDLE_WITH_CUDA, test=develop
      
      * add #ifndef WIN32, test=develop
      
      * is_pushed to is_pushed_, test=develop
      90773473
  13. 24 9月, 2020 1 次提交
    • W
      use iwyu clean include (#27267) · df43905f
      wanghuancoder 提交于
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
      df43905f
  14. 03 6月, 2020 1 次提交
  15. 25 5月, 2020 1 次提交
  16. 24 4月, 2020 1 次提交
  17. 03 3月, 2020 1 次提交
  18. 02 3月, 2020 1 次提交
  19. 19 2月, 2020 1 次提交
  20. 18 2月, 2020 1 次提交
  21. 06 2月, 2020 1 次提交
  22. 13 3月, 2019 1 次提交
  23. 25 2月, 2019 1 次提交
  24. 22 2月, 2019 1 次提交
  25. 21 2月, 2019 1 次提交
    • D
      Profiler refine and add CUDA runtime api tracer (#15301) · a83e4704
      Dun 提交于
      * refine profiler && add runtime tracer
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * fix bug && test=develop
      
      * add thread id map && test=develop
      
      * test=develop
      
      * testing
      
      * bug fix
      
      * remove cuda event && refine code && test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * fix windows temp file && test=develop
      
      * test=develop
      
      * fix windows bug && test=develop
      
      * fix start up issue && test=develop
      
      * code polish &&  test=develop
      
      * remove unused code && test=develop
      
      * add some cupti cbid && test=develop
      
      * add FLAGS_multiple_of_cupti_buffer_size && test=develop
      
      * fix compile error && test=develop
      
      * add keyword && test=develop
      
      * fix && test=develop
      
      * code polish && test=develop
      a83e4704
  26. 22 11月, 2018 1 次提交
    • W
      Windows/online (#14474) · d9a1f3e5
      wopeizl 提交于
      * add recordio support
      
      * disable the openblas multi-thread on windows since no support
      adjust the python script
      
      * code style
      
      * code style
      test=develop
      
      * add create_recordio_file_reader back
      
      * fix code style
      test=develop
      
      * fix the gtest.cmake on windows
      
      * fix cc_test on windows
      
      * fix the win build
      test=develop
      
      * remove fused compile support on windows
      test=develop
      
      * add the jit support
      test=develop
      
      * add the jit support, test=develop
      
      * add the jit support, test=develop
      
      * add the jit back
      fix compile error on windows
      
      * rollback test=develop
      
      * test case fix
      
      * disable DSO by default on windows
      
      * exclude warpctc_op on windows
      
      * exclude the dynload_warpctc out on windows
      test=develop
      
      * fix the scripts error
      test=develop
      
      * disable avx on windows by default
      test=develop
      
      * re-organize the cmake file
      
      * disable mkl on windows by default
      
      * add warp_ctc back
      
      * fix the dependency
      
      * fix the dependency
      
      * fix the build issue on windows
      
      * remove unsupported flag on windows
      
      * code style
      
      * code style
      test=develop
      
      * fix issue
      
      * add profiler, parallel_executor back
      
      * clean up the pre-definitions on windows
      
      * fix build issue
      
      * test=develop
      d9a1f3e5
  27. 21 11月, 2018 1 次提交
  28. 22 10月, 2018 1 次提交
  29. 18 10月, 2018 1 次提交
  30. 27 8月, 2018 3 次提交
  31. 24 8月, 2018 1 次提交
  32. 31 7月, 2018 1 次提交
  33. 23 5月, 2018 1 次提交
  34. 15 5月, 2018 1 次提交
    • X
      Fix a profiler race condition · 94c0a64d
      Xin Pan 提交于
      In multi-thread condition, EnableProfiler can
      be called after RecordEvent is constructed. In this
      case, RecordEvent constructor will not init anything,
      but RecordEvent destructor will do something since EnableProfiler
      was called.
      This PR fixes it.
      94c0a64d
  35. 04 5月, 2018 1 次提交
  36. 03 5月, 2018 2 次提交
  37. 10 4月, 2018 1 次提交