- 07 4月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
* Add GPU memory usage information in the print of profiler. * Add ifdef.
-
- 16 12月, 2021 1 次提交
-
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * add os_info * update * update * update * update * update * update for bugfix * update * update * update Co-authored-by: Nliutiexing <liutiexing@google.com>
-
- 03 12月, 2021 1 次提交
-
-
由 ronnywang 提交于
* refine structure for cuda and rocm * update * update * update * update
-
- 01 12月, 2021 1 次提交
-
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * update HostEventTracer * update HostEventTracer * fix c++17 * update * update * update * update * fix bug Co-authored-by: Nliutiexing <liutiexing@google.com>
-
- 18 9月, 2021 1 次提交
-
-
由 Huihuang Zheng 提交于
Add basic Cost Model, it uses executor to run program and profile it to get op time. This is an early basic version, we will add more functions in the future.
-
- 07 2月, 2021 1 次提交
-
-
由 Qi Li 提交于
-
- 20 1月, 2021 1 次提交
-
-
由 wanghuancoder 提交于
* delete empty line of pybing.cc, test=develop * use nvtx push pop in timeline, test=develop * change year, test=develop * add #ifdef PADDLE_WITH_CUDA, test=develop * add #ifndef WIN32, test=develop * is_pushed to is_pushed_, test=develop
-
- 08 12月, 2020 1 次提交
-
-
由 jakpiase 提交于
* added external reorder to profiler * added external and internal reorders to profiler * added internal and external reorder to profiler * added formatting to int/ext reorder commit * removed unnecessary comment
-
- 15 7月, 2020 1 次提交
-
-
由 GaoWei8 提交于
* Refine PADDLE_ENFORCE in paddle/fluid/platform test=develop
-
- 03 6月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
* fix negative framework overhead, test=develop * use overhead summary, test=develop
-
- 25 5月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 25 4月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 20 4月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
* Optimize the error messages of paddle CUDA API, test=develop * fix the error messages of paddle CUDA API, test=develop * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop * remove build_ex_string,test=develop * merge conflict,test=develop
-
- 31 3月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
* refine output of profiler for child event
-
- 13 3月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 12 3月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 07 3月, 2020 2 次提交
-
-
由 Zhang Ting 提交于
-
由 wangchaochaohu 提交于
* refine the profiler print test=develop
-
- 03 3月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
-
- 02 3月, 2020 2 次提交
-
-
由 wangchaochaohu 提交于
-
由 wangchaochaohu 提交于
* add profiler_help.h to refine the code test=develop
-
- 25 2月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
* add framework overhead ratio, test=develop * print GpuMemcpy overhead, test=develop
-
- 24 2月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
* add support for the driver api callback and fix the profiler name show bug
-
- 19 2月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
* fix the profile print error test=develop
-
- 18 2月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
* add python flag to control profile level test=develop
-
- 06 2月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 10 1月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
* fix the bug of profile update test=develop
-
- 09 1月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
* add support for nested profiling event and printing in different level
-
- 05 12月, 2019 1 次提交
-
-
由 wangchaochaohu 提交于
* fix profiler warning message in cpu profile mode test=develop
-
- 28 11月, 2019 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 13 3月, 2019 1 次提交
-
-
由 chengduo 提交于
test=develop
-
- 22 2月, 2019 1 次提交
-
-
由 chengduo 提交于
test=develop
-
- 21 2月, 2019 4 次提交
-
-
由 Dun Liang 提交于
-
由 Dun Liang 提交于
-
由 Dun Liang 提交于
-
由 Dun 提交于
* refine profiler && add runtime tracer * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * fix bug && test=develop * add thread id map && test=develop * test=develop * testing * bug fix * remove cuda event && refine code && test=develop * test=develop * test=develop * test=develop * fix windows temp file && test=develop * test=develop * fix windows bug && test=develop * fix start up issue && test=develop * code polish && test=develop * remove unused code && test=develop * add some cupti cbid && test=develop * add FLAGS_multiple_of_cupti_buffer_size && test=develop * fix compile error && test=develop * add keyword && test=develop * fix && test=develop * code polish && test=develop
-
- 28 12月, 2018 1 次提交
-
-
由 Yancey1989 提交于
-
- 06 12月, 2018 1 次提交
-
-
由 Yancey1989 提交于
-
- 22 11月, 2018 1 次提交
-
-
由 wopeizl 提交于
* add recordio support * disable the openblas multi-thread on windows since no support adjust the python script * code style * code style test=develop * add create_recordio_file_reader back * fix code style test=develop * fix the gtest.cmake on windows * fix cc_test on windows * fix the win build test=develop * remove fused compile support on windows test=develop * add the jit support test=develop * add the jit support, test=develop * add the jit support, test=develop * add the jit back fix compile error on windows * rollback test=develop * test case fix * disable DSO by default on windows * exclude warpctc_op on windows * exclude the dynload_warpctc out on windows test=develop * fix the scripts error test=develop * disable avx on windows by default test=develop * re-organize the cmake file * disable mkl on windows by default * add warp_ctc back * fix the dependency * fix the dependency * fix the build issue on windows * remove unsupported flag on windows * code style * code style test=develop * fix issue * add profiler, parallel_executor back * clean up the pre-definitions on windows * fix build issue * test=develop
-
- 21 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
-