- 01 11月, 2022 2 次提交
-
-
由 HongyuJia 提交于
* move cudnn hardcode outside GetExpectedKernelType * add header file * debug * update interpreter_util with hardcode * update interpreter_util headerfile * solve activation hardcode * debug with CI * add mkldnn_op_list header file * temporarily uncomment mkldnn * temporarily uncomment mkldnn * delete sequence_softmax cudnn hardcode * add hardcode to data_transfer.cc * update data_transfer headerfile * try fix segment fault * update cudnn&miopen_helper * reset HasAttr of DygraphExctnCtx * debug, this commit should pass all CI * debug should pass CI, temporarily disable activation * debug should pass CI * fix default_attr=nullptr bug * clean debug code
-
由 Ruibiao Chen 提交于
* [Auto Parallel] Improve the c++ dist attr * [Auto Parallel] Modify test_program.py * Support custom stream for standalone executor Co-authored-by: NYulong Ao <aoyulong@baidu.com>
-
- 31 10月, 2022 1 次提交
-
-
由 kangguangli 提交于
* replace executor in conditional_block_op.run with standalone_executor * add block_id as the argument of standalone executor's method run; add print for program * fix scope bug about conditional block op * fix bug: unnecessary return of fetch value * fix typo * fix: quantization will set variable persistable, and these variables must exist in global scope * add interpretercore cache for conditional block op but not activate in default * fix bug: local scope reuse for conditional block op * reset scope when conditional block op runs * fix typo * fix typo and code style * add build scope for conditional block op * add skip for transfer_layout kernel * refind code * fix reset_scope * fix reset_scope * refine code * refine code * refine code 1. remove flag use in conditional_block_op 2. pass execution_config to BuildOpFuncList instead of individual parameter * refine code * remove the use of FLAGS_control_flow_use_new_executor_cache * change FLAGS_control_flow_use_new_executor to false
-
- 26 10月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* remove using lodtensor part2 * resolve code format error * resolve conflict * resolve conflict * replace added frameworrk tensor
-
- 19 10月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Support stream overlap for c_allreduce_sum * Test CI * Add notes * Add SingleStreamGuard for BuildOpFuncList
-
- 13 10月, 2022 1 次提交
-
-
由 HongyuJia 提交于
* remove PADDLE_WITH_MKLDNN, test white_list=abs * fix unique_ptr * fix op.Type() * remove TODO in kernel_dispatch.h * remove IndicateVarDataType function, update white_list * remove mkldnn hard code * add comments * fix == * update mkldnn_op_list * delete hard code of OPs * update mkldnn_op_list * update mkldnn_op_list, remove interp * add error check for ExecutionContext * update mkldnn_op_list, remove transpose2_grad * remove interpolate mkldnn * remove fill_constant mkldnn * opt HasAttr in DygraphExecutionContext * deprecated commit, test mkldnn_white_list * deprecated commit, test mkldnn_white_list * deprecated commit, test mkldnn_black_list * update mkldnn_op_list, add assert error op * solve cudnn related op * fix error * add mkldnn fallback in phi_utils.cc * remove mkldnn fallback in phi_utils.cc * opt code implementation * polish Copyright License
-
- 12 10月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* refactor * refine code
-
- 10 10月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* reduce time cost on atomic in interpretercore * clear code of PrepareAtomic in interpretercore * refine threadpool cache
-
- 09 10月, 2022 1 次提交
-
-
由 zhangbo9674 提交于
-
- 30 9月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-
- 27 9月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-
- 23 9月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Add ExecutionConfig and fix last-live-op bug for standalone executor * Improve code design
-
- 20 9月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* add config * add config * follow comments * fix serial run
-
- 13 9月, 2022 1 次提交
-
-
由 pangyoki 提交于
* optimize executor log * delete log in new exe * add log for old executor * use LOG_FIRST_N(INFO, 1)
-
- 09 9月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Allow manaully set py_reader name in standalone executor * Fix CI errors
-
- 30 8月, 2022 1 次提交
-
-
由 zyfncg 提交于
* add runtime config in phi * add runtime attr for op desc and op * fix no proto error * adjust opdesc set_attr impl * try to remove conv_op extra attrs * add init runtime attr map * change extra header path * fix runtime_attr * fix trace_op * fix bug of pass * fix merge conflict * fix dygraph attrs * fix bug of pass * fix dygraph bug * fix unittest module * delete extra attr default * fix dropout kernel * polish code * fix extra output of instance_norm * fix merge confilct * fix op_desc bug * add extra attr in yaml for conv3d_transpose * don't remove extra input and output * fix save_inference_model * fix bug of batch_norm * revert some change * polish log * polish code * add code comment Co-authored-by: NChen Weihang <chenweihang@baidu.com>
-
- 29 8月, 2022 1 次提交
-
-
由 zhangbo9674 提交于
* add interpretercore * refine backward program id * add code * refine program * refine code * create forward/backward_program by prog2graph2prog method * test, do not care * refine code * refine code * refine code * test, do not care * add interpretorcore * add scope * refine scope create method * add jit for new_exe * solve conflict * delete unused code * polish code * polish code * refine scope in inplace * refine for datatransfer * refine _rebuild_from_desc * refine control eager deletion attr * refine used_for_jit * refine jit for infer * op size0 use ori program * polish code * refine jit * refine run_program_op ut * refine inplace * refine control * refine graph helper * refine control * refine inplace * refine buffer_share_inplace_pass * polish code * polish code * refine usage for compilerProgram * refine control * test * test core cache * refine code * refine io.py * increase test_seq2seq timeout * refine convert program * refine interpretercore_cache release * delete buildinplace * refine partial_program && io * refine code for io * test * test * test
-
- 23 8月, 2022 1 次提交
-
-
由 pangyoki 提交于
-
- 02 8月, 2022 2 次提交
-
-
由 Weilong Wu 提交于
* polish and rename, pt* -> phi* * fix code format
-
由 Ruibiao Chen 提交于
* Refactor build_op_downstream_map for standalone executor * Add some comments
-
- 20 7月, 2022 2 次提交
-
-
由 houj04 提交于
* device_guard support xpu. test=kunlun * sum op of xpu support LoDTensorArray. add test for while op of xpu. test=kunlun.
-
由 Ruibiao Chen 提交于
* Add dependency for read op in standalone executor * Fix CI errors * Add UT * add_dependency -> dependency_utils * Fix CI errors
-
- 11 7月, 2022 1 次提交
-
-
由 王明冬 提交于
-
- 29 6月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* separate variable scope and scope * hot fix for lod_tensor_blocking_queue * fix bug that variable exists in global scope
-
- 26 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 24 6月, 2022 1 次提交
-
-
由 王明冬 提交于
-
- 23 6月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Multiplex Workqueue for InterpreterCore * Delete ResetWorkQueueOptions * Update code format
-
- 16 6月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Support disable GC for some vars in standalone executor * Setting skip_gc_vars in interprecore construction
-
- 14 6月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Support sequential run for standalone executor * Add UTs * Fix test_standalone_multiply_write * Remove unnecessary UTs
-
- 09 6月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 12 5月, 2022 1 次提交
-
-
由 Shuangchi He 提交于
-
- 25 4月, 2022 1 次提交
-
-
由 tiancaishaonvjituizi 提交于
* fix bug of std::move and others * fix an compile error in debug mode * fix wrong copy assignment operator Signed-off-by: Ntiancaishaonvjituizi <452565578@qq.com> * reformat Signed-off-by: Ntiancaishaonvjituizi <452565578@qq.com> * reformat Signed-off-by: Ntiancaishaonvjituizi <452565578@qq.com> * fix ArrayRef constructor following llvm * fix format * fix conflict with master
-
- 24 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* add variant and replace any * split attribute
-
- 21 4月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Support cinn_launch OP in standalone executor * Remove some redundant code
-
- 20 4月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* clear the listener after run * only sync variables in program * refine code * fit for lod_tensor_blocking_queue
-
- 18 4月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* shrink downstream map * shrink last live ops of var * add comment * fix bug
-
- 12 4月, 2022 1 次提交
-
-
由 Leo Chen 提交于
-
- 09 4月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* fix bug that no thread is waked up when adding task to threadpool * fix typo
-
- 05 4月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* enable new executor by default * enable stream safe allocator * test=document_fix;test=coverage * do not use scope in op kernel * fit empty program for new executor * fix communication depend * fix test_sync_batch_norm * skip unsupported place * refine datatransfer * fit for dirtributed program * fix dependencpy * fix some ut
-