- 26 12月, 2022 2 次提交
-
-
由 Ruibiao Chen 提交于
-
由 Ruibiao Chen 提交于
* Memory search for stream analyzer * Shrink redundant waiters
-
- 23 12月, 2022 1 次提交
-
-
由 JZ-LIANG 提交于
* memcpy overlap * memcpy newexe
-
- 19 12月, 2022 2 次提交
- 12 12月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Add UT * Support cross-step stream synchronization for standalone executor * Fix typos * Fix typos * Update UTs
-
- 08 12月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-
- 30 11月, 2022 1 次提交
-
-
由 zhangbo9674 提交于
* add fuse act add grad pass * polish code * refine code * add test * refine code
-
- 29 11月, 2022 1 次提交
-
-
由 Sławomir Siwek 提交于
-
- 28 11月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Remove kSyncRun in StreamAnalyzer * Update code
-
- 26 11月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* hot fix * fix compile * merge develop * follow comments
-
- 25 11月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Move stream_anayzer to interpreter * Refactor StreamAnalyzer * Refactor RunNextInstructionList * Remove no_data_transform_index * Fix typos * Fix data_transfer OpFuncType error * Add event for depend_op * Update transfer OpFuncType for heter place
-
- 15 11月, 2022 1 次提交
-
-
由 Sławomir Siwek 提交于
* cleanup unused code * unify is_int8 is_bfloat16 * Simplify matmul_v2 FWD kernel * remove RunKernel methods * remove import namespace * remove headers * clean fluid/phi cross imports * remove fluid axpy_handler * delete fluid methods * activations * OneDNNMemDesc * MKLDNNFormatForSize * MatchShapeToLayout * MKLDNNMemoryFormat * MKLDNNFormat * ReorderMKLDNNHandler * to_void_cast * review suggestions * interpolate * remove fluid depedency
-
- 14 11月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-
- 11 11月, 2022 1 次提交
-
-
由 zhangbo9674 提交于
* refine shape op in new_exe * Revert "refine shape op in new_exe" This reverts commit 0e0336ddc5eede3da019b348a0bcc0ef0f3be64e. * refine shape op in new_exe * refine shape expected_kernel_type * add SelectedRows check for shape op * refine code
-
- 07 11月, 2022 1 次提交
-
-
由 HongyuJia 提交于
* move cudnn hardcode outside GetExpectedKernelType * add header file * debug * update interpreter_util with hardcode * update interpreter_util headerfile * solve activation hardcode * debug with CI * add mkldnn_op_list header file * temporarily uncomment mkldnn * temporarily uncomment mkldnn * delete sequence_softmax cudnn hardcode * add hardcode to data_transfer.cc * update data_transfer headerfile * try fix segment fault * update cudnn&miopen_helper * reset HasAttr of DygraphExctnCtx * debug, this commit should pass all CI * debug should pass CI, temporarily disable activation * debug should pass CI * fix default_attr=nullptr bug * clean debug code * Call SetDnnFallback function in the base class * activation fallback to plain kernel * fix default GetExpectedKernelType find wrong kernel * search cudnn kernel instead of fallback * fix cudnn_handle bug * remove tanh use_cudnn * restore tanh use_cudnn * debug tanh * fix tanh bug * delete activation cudnn kernel * polish code
-
- 03 11月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Dispath computation OPs before communication in standalone executor * Update code * Fix CI errors * Improve performance of coalesce_tensor and depend OP in standalone executor * pre-commit check
-
- 02 11月, 2022 2 次提交
-
-
由 Ruibiao Chen 提交于
* Dispath computation OPs before communication in standalone executor * Update code * Fix CI errors
- 01 11月, 2022 2 次提交
-
-
由 HongyuJia 提交于
* move cudnn hardcode outside GetExpectedKernelType * add header file * debug * update interpreter_util with hardcode * update interpreter_util headerfile * solve activation hardcode * debug with CI * add mkldnn_op_list header file * temporarily uncomment mkldnn * temporarily uncomment mkldnn * delete sequence_softmax cudnn hardcode * add hardcode to data_transfer.cc * update data_transfer headerfile * try fix segment fault * update cudnn&miopen_helper * reset HasAttr of DygraphExctnCtx * debug, this commit should pass all CI * debug should pass CI, temporarily disable activation * debug should pass CI * fix default_attr=nullptr bug * clean debug code
-
由 Ruibiao Chen 提交于
* [Auto Parallel] Improve the c++ dist attr * [Auto Parallel] Modify test_program.py * Support custom stream for standalone executor Co-authored-by: NYulong Ao <aoyulong@baidu.com>
-
- 31 10月, 2022 1 次提交
-
-
由 kangguangli 提交于
* replace executor in conditional_block_op.run with standalone_executor * add block_id as the argument of standalone executor's method run; add print for program * fix scope bug about conditional block op * fix bug: unnecessary return of fetch value * fix typo * fix: quantization will set variable persistable, and these variables must exist in global scope * add interpretercore cache for conditional block op but not activate in default * fix bug: local scope reuse for conditional block op * reset scope when conditional block op runs * fix typo * fix typo and code style * add build scope for conditional block op * add skip for transfer_layout kernel * refind code * fix reset_scope * fix reset_scope * refine code * refine code * refine code 1. remove flag use in conditional_block_op 2. pass execution_config to BuildOpFuncList instead of individual parameter * refine code * remove the use of FLAGS_control_flow_use_new_executor_cache * change FLAGS_control_flow_use_new_executor to false
-
- 26 10月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* remove using lodtensor part2 * resolve code format error * resolve conflict * resolve conflict * replace added frameworrk tensor
-
- 19 10月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Support stream overlap for c_allreduce_sum * Test CI * Add notes * Add SingleStreamGuard for BuildOpFuncList
-
- 10 10月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* reduce time cost on atomic in interpretercore * clear code of PrepareAtomic in interpretercore * refine threadpool cache
-
- 27 9月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-
- 23 9月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Add ExecutionConfig and fix last-live-op bug for standalone executor * Improve code design
-
- 02 9月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-
- 19 8月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Fix random op depenency and lr_shedule bugs for standalone executor * Fix CI errors * Fix CI errors * Fix CI errors
-
- 02 8月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Refactor build_op_downstream_map for standalone executor * Add some comments
-
- 20 7月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Add dependency for read op in standalone executor * Fix CI errors * Add UT * add_dependency -> dependency_utils * Fix CI errors
-