- 18 9月, 2021 1 次提交
-
-
由 Huihuang Zheng 提交于
Add basic Cost Model, it uses executor to run program and profile it to get op time. This is an early basic version, we will add more functions in the future.
-
- 24 9月, 2020 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
- 26 5月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 25 5月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 13 3月, 2019 1 次提交
-
-
由 chengduo 提交于
test=develop
-
- 11 3月, 2019 1 次提交
-
- 04 3月, 2019 3 次提交
-
-
由 chengduo 提交于
Add Event for TensorCopy
- 01 3月, 2019 1 次提交
-
-
由 chengduo 提交于
Add Event for TensorCopy
-
- 25 2月, 2019 1 次提交
-
-
由 chengduo 提交于
* refile profiler test=develop * follow comment test=develop
-
- 21 2月, 2019 1 次提交
-
-
由 Dun 提交于
* refine profiler && add runtime tracer * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * fix bug && test=develop * add thread id map && test=develop * test=develop * testing * bug fix * remove cuda event && refine code && test=develop * test=develop * test=develop * test=develop * fix windows temp file && test=develop * test=develop * fix windows bug && test=develop * fix start up issue && test=develop * code polish && test=develop * remove unused code && test=develop * add some cupti cbid && test=develop * add FLAGS_multiple_of_cupti_buffer_size && test=develop * fix compile error && test=develop * add keyword && test=develop * fix && test=develop * code polish && test=develop
-
- 04 12月, 2018 1 次提交
-
-
由 ZongwuYang 提交于
Fix the bug that profiler cannot trace the nccl allreduce operator
-
- 22 11月, 2018 1 次提交
-
-
由 wopeizl 提交于
* add recordio support * disable the openblas multi-thread on windows since no support adjust the python script * code style * code style test=develop * add create_recordio_file_reader back * fix code style test=develop * fix the gtest.cmake on windows * fix cc_test on windows * fix the win build test=develop * remove fused compile support on windows test=develop * add the jit support test=develop * add the jit support, test=develop * add the jit support, test=develop * add the jit back fix compile error on windows * rollback test=develop * test case fix * disable DSO by default on windows * exclude warpctc_op on windows * exclude the dynload_warpctc out on windows test=develop * fix the scripts error test=develop * disable avx on windows by default test=develop * re-organize the cmake file * disable mkl on windows by default * add warp_ctc back * fix the dependency * fix the dependency * fix the build issue on windows * remove unsupported flag on windows * code style * code style test=develop * fix issue * add profiler, parallel_executor back * clean up the pre-definitions on windows * fix build issue * test=develop
-
- 21 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
-
- 27 8月, 2018 3 次提交
- 24 8月, 2018 2 次提交
- 31 7月, 2018 1 次提交
-
-
由 Xin Pan 提交于
Add a few more RecordEvent. Cleanup
-
- 23 7月, 2018 1 次提交
-
-
由 qiaolongfei 提交于
-
- 10 4月, 2018 1 次提交
-
-
由 Yi Wang 提交于
-
- 14 3月, 2018 1 次提交
-
-
由 Xin Pan 提交于
-
- 02 3月, 2018 1 次提交
-
-
由 Xin Pan 提交于
-
- 01 3月, 2018 2 次提交
- 26 2月, 2018 2 次提交