- 26 Aug, 2022 2 commits
Committed by kangguangli
* remove fluid kernel and activate phi kernel * fix parameter error * transfer mkldnn part * modify header file path * fix compile error * transfer special case * fix lod setting and special case for layout setting * add testcase and refine code
-
Committed by 王明冬
-
- 23 Aug, 2022 1 commit
Committed by pangyoki
-
- 19 Aug, 2022 1 commit
Committed by Ruibiao Chen
* Fix random op dependency and lr_shedule bugs for standalone executor * Fix CI errors * Fix CI errors * Fix CI errors
-
- 18 Aug, 2022 1 commit
Committed by pangyoki
apply buffer_shared_inplace_pass and inplace_addto_op_pass pass to program in Standalone Executor (#45085) * apply inplace addto in python apply_pass * fix * apply inplace pass for program * skip feed and fetch var * fix block_desc.move_from * fix block desc * alltoall remove inplace * fix
-
- 16 Aug, 2022 1 commit
Committed by Chen Weihang
* move check finite and unscale kernel into phi * move infershape into phi * move update_loss_scaling kernel into phi * remove original kernels * move update loss scaling infershape into phi * add header for xpu and npu * solve coverage failed * fix npu test failed * remove mutable data in cu file * fix new executor failed * add valid check for meta tensor output
-
- 13 Aug, 2022 1 commit
Committed by Leo Chen
* add cached_serialize_str_ * support program hash * add sha * add ut * use hash_str only for new_exe * fix attr order
-
- 12 Aug, 2022 1 commit
Committed by kangguangli
* transfer memcpy_h2d from fluid to phi * use UnchangedInferMeta instead * restore test_standalone_executor * add newline to fix codestyle check * rename pt -> phi * simplify logic and add check * make the comment more clear * remove useless comment * refine code
-
- 10 Aug, 2022 1 commit
Committed by Leo Chen
* set cuda device before run * add header file * fix compile
-
- 04 Aug, 2022 1 commit
Committed by 王明冬
-
- 02 Aug, 2022 3 commits
Committed by Weilong Wu
* polish and rename, pt* -> phi* * fix code format
-
Committed by Ruibiao Chen
* Skip inplace for coalesce_tensor_op outputs * Fix typos * Add UTs * Fix typos
-
Committed by Ruibiao Chen
* Refactor build_op_downstream_map for standalone executor * Add some comments
-
- 01 Aug, 2022 1 commit
Committed by Leo Chen
* remove cudaDeviceContext * remove more template * fix rocm compile * remove alias name CUDADeviceContext * fix compile * fix tests * revert changes
-
- 25 Jul, 2022 1 commit
Committed by lyq
-
- 20 Jul, 2022 3 commits
Committed by houj04
* device_guard support xpu. test=kunlun * sum op of xpu support LoDTensorArray. add test for while op of xpu. test=kunlun.
-
Committed by Ruibiao Chen
* Add dependency for read op in standalone executor * Fix CI errors * Add UT * add_dependency -> dependency_utils * Fix CI errors
-
Committed by tianshuo78520a
-
- 19 Jul, 2022 1 commit
Committed by Ruibiao Chen
* Rename BOOST_GET macros * Fix conflicts
-
- 18 Jul, 2022 1 commit
Committed by 王明冬
-
- 14 Jul, 2022 1 commit
Committed by wanghuancoder
* Compilation optimization
-
- 11 Jul, 2022 1 commit
Committed by 王明冬
-
- 06 Jul, 2022 1 commit
Committed by Leo Chen
* not run startup program in constructor of StandaloneExecutor * clear interface of standalone executor * clean debug code
-
- 30 Jun, 2022 1 commit
Committed by Leo Chen
* support scope_guard * fix test
-
- 29 Jun, 2022 1 commit
Committed by Leo Chen
* separate variable scope and scope * hot fix for lod_tensor_blocking_queue * fix bug that variable exists in global scope
-
- 27 Jun, 2022 1 commit
Committed by Leo Chen
-
- 26 Jun, 2022 1 commit
Committed by Sing_chan
-
- 24 Jun, 2022 2 commits
Committed by 王明冬
-
Committed by chenjian
* record memory and op supplement info * update * update * fix a bug * fix memory recording * fix a bug * update * update * fix a bug * update * fix a bug * fix a bug * fix a bug * Revert "fix a bug" This reverts commit c1d4df52762ba9ae7c7e27cd2ba4fc3a7ed9c7a5. * fix a bug * fix format * fix
-
- 23 Jun, 2022 1 commit
Committed by Ruibiao Chen
* Multiplex Workqueue for InterpreterCore * Delete ResetWorkQueueOptions * Update code format
-
- 16 Jun, 2022 2 commits
Committed by Ruibiao Chen
* Support disabling GC for some vars in standalone executor * Setting skip_gc_vars in InterpreterCore construction
-
Committed by Leo Chen
* lazy creating work queue * fix dry_run
-
- 14 Jun, 2022 1 commit
Committed by Ruibiao Chen
* Support sequential run for standalone executor * Add UTs * Fix test_standalone_multiply_write * Remove unnecessary UTs
-
- 10 Jun, 2022 1 commit
Committed by Ruibiao Chen
* Refactor DeviceContextPool * Adjust header file order
-
- 09 Jun, 2022 1 commit
Committed by Ruibiao Chen
-
- 05 Jun, 2022 1 commit
Committed by Sing_chan
-
- 04 Jun, 2022 1 commit
Committed by Sing_chan
-
- 16 May, 2022 1 commit
Committed by liutiexing
* WorkQueue supports always_spinning option * update * update * fix stat
-
- 12 May, 2022 1 commit
Committed by Shuangchi He
-
- 01 May, 2022 1 commit
Committed by Lijunhui
-