- 05 2月, 2020 1 次提交
-
-
由 Wilber 提交于
cmake选项中添加了WITH_NCCL,显示指定是否编译NCCL的部分代码,WITH_NCCL默认打开,但如果WITH_GPU为OFF,则关闭WITH_NCCL 添加了PADDLE_WITH_NCCL定义 单机单卡能够关闭NCCL编译,多卡的话需要默认打开NCCL,如果关闭NCCL,则只能使用单卡 Co-authored-by: N石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>
-
- 13 9月, 2019 1 次提交
-
-
由 chengduo 提交于
* Open fuse all reduce op test=develop * Add Fuse optimization op log * Add log in fuse_optimizer op pass and fuse all_reduce op pass * replace with boost::optional<bool> test=develop * Polish code test=develop * fix code coverage test=develop
-
- 11 9月, 2019 1 次提交
-
-
由 chengduo 提交于
* Enable fused_all_reduce_op_handle support GPU and CPU Gradients
-
- 28 8月, 2019 1 次提交
-
-
由 tangwei12 提交于
* fix correctness of the communicator * fix a bug in send thread when sending var context is empty, test=develop * add lookup_table_prefetch_op and prefetch optimize, test=develop * remove remote prefetch GPU supported * word2vec force with CPU, test=develop * test dist remote lookup table force with CPU, test=develop
-
- 14 6月, 2019 1 次提交
-
-
由 gongweibao 提交于
-
- 27 5月, 2019 1 次提交
-
-
由 gongweibao 提交于
-
- 23 5月, 2019 1 次提交
-
-
由 Qiao Longfei 提交于
Async exe support communicator
-
- 08 5月, 2019 1 次提交
-
-
由 chengduo 提交于
* move pass to ir * polish code test=develop * fix dependency test=develop
-
- 18 4月, 2019 1 次提交
-
-
由 gongweibao 提交于
-
- 01 4月, 2019 1 次提交
-
-
由 Qiao Longfei 提交于
-
- 31 3月, 2019 2 次提交
-
-
由 Qiao Longfei 提交于
-
由 Qiao Longfei 提交于
-
- 28 3月, 2019 3 次提交
-
-
由 chengduo 提交于
* fuse optimizer
-
由 chengduo 提交于
* modify the interface of Pass::Allay test=develop * Polish code test=develop * Fix Travis CI test=develop * fix Pass::Apply interface test=develop * Fix Travis CI test=develop
-
由 gongweibao 提交于
-
- 27 3月, 2019 1 次提交
-
-
由 Qiao Longfei 提交于
-
- 20 3月, 2019 2 次提交
-
-
由 chengduo 提交于
* fuse all_reduce test=develop * add fuse_parameter_groups_size test=develop * Polish code test=develop * Fix travis-ci test=develop * Add SetGroupAccordingToLayers and SetGroupAccordingToGroupSize test=develop * Add SetGroupAccordingToMemorySize test=develop * fix multi_devices_graph test=develop * reset params_grads test=develop * Polish code test=develop
-
由 Wu Yi 提交于
* wip allreduce in op * wip * wip * wip * wip adding test * wip for conflict with mp mode * fix tests test=develop * fix cpu build test=develop * fix travis clang format test=develop * fix cpu build test=develop * update api.spec test=develop * delete comment test=develop * fix cpplint test=develop * fix test=develop * follow comment test=develop * add file test=develop * fix build test=develop * update test=develop * to be compatible with sync_bn, and fix mp mode in develop test=develop
-
- 08 3月, 2019 1 次提交
-
-
由 Qiao Longfei 提交于
-
- 05 3月, 2019 1 次提交
-
-
由 Qiao Longfei 提交于
-
- 21 2月, 2019 1 次提交
-
-
由 Qiao Longfei 提交于
-
- 14 2月, 2019 3 次提交
-
-
由 Yancey1989 提交于
-
由 Yancey1989 提交于
-
由 乔龙飞 Qiao Longfei 提交于
-
- 12 2月, 2019 2 次提交
-
-
由 乔龙飞 Qiao Longfei 提交于
-
由 Qiao Longfei 提交于
-
- 10 2月, 2019 1 次提交
-
-
由 Qiao Longfei 提交于
-
- 17 1月, 2019 1 次提交
-
-
由 Qiao Longfei 提交于
-
- 07 1月, 2019 1 次提交
-
-
由 chengduo 提交于
* Refactor ParallelExecutor test=develop * extract Reduce and AllReduce mode from MultiDevSSAGraphBuilder test=develop * Refactor MultiDevSSAGraphBuilder test=developt * Remove enable_data_balance test=develop * code refine test=develop * remove data balance test=develop * refine ScaleLossGradOp test=develop * remove uncessary file test=develop * code refine test=develop * modify function name test=develop * follow comments test=develop * add is_distribution field test=develop * set is_distribution test=develop * fix DistSSAGraphBuilder test=develop
-
- 27 12月, 2018 1 次提交
-
-
由 chengduo 提交于
* refine parallel_exe test=develop * rename shared_var_device * code refine * add test_weight_decay * remove Sort test=develop * Add SortForReduce test=develop * code refine test=develop * follow comment test=develop
-
- 26 12月, 2018 1 次提交
-
-
由 Wu Yi 提交于
* wip * wip * wip * wip for test * add fp16 tests test=develop * fix cpu build test=develop * fix test=develop * fix py3 tests test=develop * fix lr_scheduler dtype test=develop * fix test=dvelop * test fix ci compile test=develop * fix build and merge test=develop * fallback momentumop change to general test=develop * make fp16 lr schedule simple test=develop * fix ut test=develop * fix tests test=develop * remove fp16 learning rate cast test=develop
-
- 20 12月, 2018 3 次提交
-
-
由 typhoonzero 提交于
This reverts commit 3d750f9c.
-
由 chengduo 提交于
* refine parallel_executor * remove uncessary code test=develop
-
由 Wu Yi 提交于
* wip * wip * wip * wip for test * add fp16 tests test=develop * fix cpu build test=develop * fix test=develop * fix py3 tests test=develop * fix lr_scheduler dtype test=develop * fix test=dvelop * test fix ci compile test=develop * fix build and merge test=develop * fallback momentumop change to general test=develop
-
- 22 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
-
- 06 11月, 2018 1 次提交
-
-
由 Xin Pan 提交于
-
- 29 10月, 2018 2 次提交
-
-
由 Wu Yi 提交于
* fix nccl2 lars dist support * put lars in momentum op * add tests lars * fix ci * fix cpu kernel * soft warning * remove lars in test_recognize_digits.py * move to another op * add file * update api.spec test=develop * update test=develop * fix api.spec test=develop * wip * wip, finish grad merge ops * wip, finish graph build * wip test running * work on 1 gpu * workable version * update * fix tests * fuse broadcast op * fix compile failed * refine * add batch merge test mnist * fix CI test=develop * fix build * use independent bn params for batch merge test=develop * update api.spec * follow comments and for test * wip * refine tests test=develop * follow comments test=develop * remove startup bn modify test=develop * follow comments test=develop * fix merge test=develop
-
由 sneaxiy 提交于
test=develop
-
- 12 10月, 2018 1 次提交
-
-
由 sneaxiy 提交于
-
- 21 9月, 2018 1 次提交
-
-
由 Wu Yi 提交于
* wip * clean up * should fix running with memopt * add ut * mark lr schedule op role * hide lr_schedule_guard * use op_role_var instead of ufind * unify dist test name * wip for py3 support * fix var deref * fix python3 mem_opt order * remove comments
-