1. 18 Sep, 2021 1 commit
    • H
      Basic PR on Cost Model (#35774) · 5ba9fe6e
      Huihuang Zheng committed
      Add basic Cost Model, it uses executor to run program and profile it to get op time.
      
      This is an early basic version, we will add more functions in the future.
  2. 17 Sep, 2021 1 commit
    • Z
      Make flag adding easier (#35823) · 2c781455
      Zeng Jinle committed
      * make flag setter easier
      
      * update
      
      * rename macro name
      
      * fix bug of public/writable
      
      * update to pass CI
      
      * polish
      
      * fix CPU link error
  3. 26 May, 2021 1 commit
  4. 07 Feb, 2021 1 commit
  5. 04 Feb, 2021 1 commit
  6. 20 Jan, 2021 1 commit
    • W
      use nvtx push pop in timeline (#30567) · 90773473
      wanghuancoder committed
      * delete empty line of pybing.cc, test=develop
      
      * use nvtx push pop in timeline, test=develop
      
      * change year, test=develop
      
      * add #ifdef PADDLE_WITH_CUDA, test=develop
      
      * add #ifndef WIN32, test=develop
      
      * is_pushed to is_pushed_, test=develop
  7. 24 Sep, 2020 1 commit
    • W
      use iwyu clean include (#27267) · df43905f
      wanghuancoder committed
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
  8. 15 Jul, 2020 1 commit
  9. 09 Jun, 2020 1 commit
  10. 26 May, 2020 1 commit
  11. 25 May, 2020 1 commit
  12. 24 Apr, 2020 1 commit
  13. 03 Mar, 2020 1 commit
  14. 02 Mar, 2020 1 commit
  15. 25 Feb, 2020 1 commit
  16. 24 Feb, 2020 1 commit
  17. 19 Feb, 2020 1 commit
  18. 18 Feb, 2020 1 commit
  19. 06 Feb, 2020 1 commit
  20. 10 Jan, 2020 1 commit
  21. 09 Jan, 2020 1 commit
  22. 05 Dec, 2019 1 commit
  23. 28 Nov, 2019 1 commit
  24. 13 Mar, 2019 1 commit
  25. 22 Feb, 2019 1 commit
  26. 21 Feb, 2019 4 commits
    • D
      test=develop · 35a90e06
      Dun Liang committed
    • D
      test=develop · c9080f51
      Dun Liang committed
    • D
      test=develop · 1c7bb0e4
      Dun Liang committed
    • D
      Profiler refine and add CUDA runtime api tracer (#15301) · a83e4704
      Dun committed
      * refine profiler && add runtime tracer
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * fix bug && test=develop
      
      * add thread id map && test=develop
      
      * test=develop
      
      * testing
      
      * bug fix
      
      * remove cuda event && refine code && test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * fix windows temp file && test=develop
      
      * test=develop
      
      * fix windows bug && test=develop
      
      * fix start up issue && test=develop
      
      * code polish &&  test=develop
      
      * remove unused code && test=develop
      
      * add some cupti cbid && test=develop
      
      * add FLAGS_multiple_of_cupti_buffer_size && test=develop
      
      * fix compile error && test=develop
      
      * add keyword && test=develop
      
      * fix && test=develop
      
      * code polish && test=develop
  27. 28 Dec, 2018 1 commit
  28. 06 Dec, 2018 1 commit
  29. 22 Nov, 2018 1 commit
    • W
      Windows/online (#14474) · d9a1f3e5
      wopeizl committed
      * add recordio support
      
      * disable the openblas multi-thread on windows since no support
      adjust the python script
      
      * code style
      
      * code style
      test=develop
      
      * add create_recordio_file_reader back
      
      * fix code style
      test=develop
      
      * fix the gtest.cmake on windows
      
      * fix cc_test on windows
      
      * fix the win build
      test=develop
      
      * remove fused compile support on windows
      test=develop
      
      * add the jit support
      test=develop
      
      * add the jit support, test=develop
      
      * add the jit support, test=develop
      
      * add the jit back
      fix compile error on windows
      
      * rollback test=develop
      
      * test case fix
      
      * disable DSO by default on windows
      
      * exclude warpctc_op on windows
      
      * exclude the dynload_warpctc out on windows
      test=develop
      
      * fix the scripts error
      test=develop
      
      * disable avx on windows by default
      test=develop
      
      * re-organize the cmake file
      
      * disable mkl on windows by default
      
      * add warp_ctc back
      
      * fix the dependency
      
      * fix the dependency
      
      * fix the build issue on windows
      
      * remove unsupported flag on windows
      
      * code style
      
      * code style
      test=develop
      
      * fix issue
      
      * add profiler, parallel_executor back
      
      * clean up the pre-definitions on windows
      
      * fix build issue
      
      * test=develop
  30. 21 Nov, 2018 2 commits
  31. 06 Nov, 2018 1 commit
  32. 22 Oct, 2018 1 commit
  33. 17 Oct, 2018 1 commit
  34. 15 Oct, 2018 1 commit
  35. 12 Oct, 2018 1 commit
    • Q
      Profiler support merge data of all thread (#13811) · 5428cb99
      Qiao Longfei committed
      * profiler infor merge thread statistic information
      
      * update profiler
      
      * fix bug
      
      * add merge thread msg to report
      
      * optimize report
      
      * statistic the time of ops in each thread but not all
      
      * optimize report format
      
      * optimize profile report
      
      * optimize profile report
      test=develop
  36. 15 Aug, 2018 1 commit