- 11 Sep, 2019 1 commit
-
-
Committed by chengduo
* Enable fused_all_reduce_op_handle to support GPU and CPU gradients
-
- 04 Sep, 2019 1 commit
-
-
Committed by baojun
* enable ngraph through build_strategy test=develop
* add unittest test=develop
* put use_ngraph unconditional test=develop
* remove paddle_enforce test=develop
* remove paddle_enforce test=develop
* fix copyright test=develop
* limit for ngraph only test=develop
-
- 02 Aug, 2019 1 commit
-
-
Committed by chengduo
* Disable fuse optimization test=develop
-
- 29 Jul, 2019 1 commit
-
-
Committed by Zeng Jinle
* remove legacy memory optimization code, test=develop
* follow huihuang's comments, test=develop
* follow luotao's comments, test=develop
-
- 27 Jul, 2019 1 commit
-
-
Committed by chengduo
* open fuse optimization ops test=develop
-
- 26 Jul, 2019 1 commit
-
-
Committed by Zeng Jinle
* first version memory optimize pass, test=develop
* remove move_tensor_sharing_pass, test=develop
* refine code comments, add unittests, test=develop
* turn off memory_optimize by default, test=develop
* follow huihuang's comments, test=develop
* follow chengduoZH's comments, test=develop
* fix grammar error, add const qualifier, fix pass_test exception message, test=develop
* follow chengduoZH's comments 2nd, test=develop
-
- 23 Jul, 2019 1 commit
-
-
Committed by chengduo
* support sparse gradients test=develop
-
- 11 Jul, 2019 2 commits
-
-
Committed by gongweibao
-
Committed by Zeng Jinle
* feature/buffer_shared_inplace, test=develop
* refine code, test=develop
* fix elementwise_add op cpu inplace and sum inplace bug, test=develop
* add unittest and debug log, test=develop
* fix parallel_executor scope bug, polish code, test=develop
* fix sum op, activation op, single_in_place_inference bug, test=develop
* remove kLocalExecScopeName, test=develop
* fix unittest, test=develop
* fix out_var first version bug, test=develop
* follow comments, test=develop
-
- 14 Jun, 2019 1 commit
-
-
Committed by gongweibao
-
- 06 Jun, 2019 1 commit
-
-
Committed by gongweibao
-
- 27 May, 2019 1 commit
-
-
Committed by gongweibao
-
- 20 May, 2019 1 commit
-
-
Committed by Tao Luo
test=develop
-
- 14 May, 2019 1 commit
-
-
Committed by Tao Luo
* make parallel_executor support FLAGS_use_mkldnn test=develop
* add warning when mkldnn_enabled_op_types_ is set in a non-mkldnn env test=develop
-
- 08 May, 2019 1 commit
-
-
Committed by chengduo
* move pass to ir
* polish code test=develop
* fix dependency test=develop
-
- 06 May, 2019 1 commit
-
-
Committed by Zeng Jinle
* add use_cuda to inplace pass, test=develop
* add softmax_with_xe_inplace test, test=develop
-
- 23 Apr, 2019 1 commit
-
-
Committed by chengduo
* Add fuse momentum ops
-
- 21 Apr, 2019 1 commit
-
-
Committed by Zeng Jinle
* speed up gc and inplace softmax_with_cross_entropy_grad test=develop
* refine models gpu mem Merge skip vars and warning messages of mem opt remove relu mem opt test=develop
* follow comments test=develop
-
- 12 Apr, 2019 1 commit
-
-
Committed by chengduo
* fix bug of fuse optimize ops
-
- 11 Apr, 2019 1 commit
-
-
Committed by Yiqun Liu
* Add an option to enable the cache of expected kernel in train phase. test=develop
* Change the default value of cache_expected_kernel to true.
-
- 08 Apr, 2019 2 commits
-
-
Committed by gongweibao
-
Committed by Yiqun Liu
* Try to enable the runtime_context_cache pass in train phase.
* Put the append of runtime_context_cache pass ahead of multi_dev passes. test=develop
-
- 03 Apr, 2019 1 commit
-
-
Committed by chengduo
-
- 28 Mar, 2019 2 commits
- 22 Mar, 2019 1 commit
-
-
Committed by chengduo
* refine parallelExecutor test=develop
* Polish op_handle test=develop
* Remove unnecessary op_handle test=develop
* Fix Travis CI test=develop
* Fix fetch bug test=develop
* Remove WaitInputVarGenerated
* Fix OpHandleBase::Run test=develop
* debug test=develop
* use origin fetch_op_handle test=develop
* Revert op_handle_base.cc test=develop
* Polish code test=develop
* Fix OpHandleBase::Run test=develop
* code refine
* test CI and CE test=develop
* fix OpHandle::Run test=develop
* refine AllReduceOpHandle test=develop
* Polish code test=develop
-
- 20 Mar, 2019 1 commit
-
-
Committed by chengduo
* fuse all_reduce test=develop
* add fuse_parameter_groups_size test=develop
* Polish code test=develop
* Fix travis-ci test=develop
* Add SetGroupAccordingToLayers and SetGroupAccordingToGroupSize test=develop
* Add SetGroupAccordingToMemorySize test=develop
* fix multi_devices_graph test=develop
* reset params_grads test=develop
* Polish code test=develop
-
- 15 Mar, 2019 1 commit
-
-
Committed by qingqing01
* Support Sync Batch Norm.
* Note: do not enable it on a single device.

Usage:

    build_strategy = fluid.BuildStrategy()
    build_strategy.sync_batch_norm = True
    binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
        loss_name=loss_mean.name,
        build_strategy=build_strategy)
-
- 07 Mar, 2019 1 commit
-
-
Committed by Qiao Longfei
-
- 05 Mar, 2019 1 commit
-
-
Committed by Qiao Longfei
-
- 23 Feb, 2019 1 commit
-
-
Committed by Qiao Longfei
-
- 22 Feb, 2019 2 commits
- 21 Feb, 2019 2 commits
-
-
Committed by Xin Pan
test=develop
-
Committed by Qiao Longfei
-
- 19 Feb, 2019 1 commit
-
-
Committed by Yancey1989
-
- 18 Feb, 2019 2 commits
-
-
Committed by Yancey1989
-
Committed by dzhwinter
-
- 14 Feb, 2019 2 commits
-
-
Committed by Yancey1989
-
Committed by Yancey1989
-