- 02 3月, 2020 1 次提交
-
-
由 tianshuo78520a 提交于
-
- 24 2月, 2020 1 次提交
-
-
由 tangwei12 提交于
* add sync communicator and implement
-
- 11 2月, 2020 1 次提交
-
-
由 Wilber 提交于
[cherry-pick] #22484 支持不依赖nccl进行编译。 多卡下,如果没有打开WITH_NCCL开关编译,则只能使用单卡
-
- 05 2月, 2020 1 次提交
-
- 20 1月, 2020 1 次提交
-
-
由 tangwei12 提交于
* add half_async in the communicator * fix DistributedStrategy
-
- 14 1月, 2020 1 次提交
-
-
由 123malin 提交于
* test=develop, bug fix for sparse recorder
-
- 06 1月, 2020 1 次提交
-
-
由 123malin 提交于
* add distributed_strategy
-
- 25 12月, 2019 1 次提交
-
-
由 zhouwei25 提交于
-
- 19 12月, 2019 1 次提交
-
-
由 Chengmo 提交于
* test=develop, speed dense calc & communication
-
- 12 12月, 2019 1 次提交
-
-
由 tangwei12 提交于
* add fake init for the trainer, fix large memory hold in the trainer * do not merge recv vars from a remote endpoint, test=develop * add recv and save op, merge slice var in one op, save memory * remove hsigmoid with pull sparse, test=develop
-
- 03 12月, 2019 1 次提交
-
-
由 Tao Luo 提交于
test=develop
-
- 02 12月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* fix -Wno-error=sign-compare warning in gcc8 test=develop * fix warning in distributed codes test=develop
-
- 28 11月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* remove -Wno-error=sign-compare, make warning as error test=develop test=document_fix * fix exist compile warning test=develop
-
- 05 11月, 2019 1 次提交
-
-
由 Chengmo 提交于
* test=develop,fix rpc not wait in geo
-
- 01 11月, 2019 1 次提交
-
-
由 123malin 提交于
* update pserver decay blocks * update distributed notify handler
-
- 31 10月, 2019 1 次提交
-
-
由 Chengmo 提交于
* fix PaddleCloud Role maker & add warning in distribute transpiler & change rpc_retry_times
-
- 20 10月, 2019 1 次提交
-
-
由 123malin 提交于
* test=develop, communicator_is_sgd_optimizer flags
-
- 18 10月, 2019 1 次提交
-
-
由 gongweibao 提交于
-
- 16 10月, 2019 2 次提交
-
-
由 tangwei12 提交于
-
由 gongweibao 提交于
-
- 15 10月, 2019 2 次提交
-
-
由 Chengmo 提交于
* test=develop,Fix communicator slow bug * test=develop, delete if() in stop_worker() * test=develop * fix UT, test=develop * fix bug in fetch handler, test=develop * fix bug in fetch handler, test=develop * test=develop, fix fetch barrier bug * test=develop, bug fix * test=develop, bug fix * test=develop, fix bug
-
由 123malin 提交于
* bug fix: invalid learning rate decay in pserver async mode
-
- 07 10月, 2019 3 次提交
- 30 9月, 2019 1 次提交
-
-
由 Chengmo 提交于
* refector geo sgd & communicator
-
- 27 9月, 2019 1 次提交
-
-
由 tangwei12 提交于
* add a base class for the Communicator * add AsyncCommunicator Impl for async distributed training
-
- 18 9月, 2019 1 次提交
-
-
由 123malin 提交于
* rpc retry for asycsend/get/prefetch * test=develop, change retry vlog level to 3 * test=develop, set default grpc_retry_times is 3
-
- 07 9月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* remove -Wmaybe-uninitialized warning test=develop * remove uninitialized op_handle_ in scale_loss_grad_op_handle.cc test=develop
-
- 05 9月, 2019 2 次提交
- 28 8月, 2019 1 次提交
-
-
由 tangwei12 提交于
* fix correctness of the communicator * fix a bug in send thread when sending var context is empty, test=develop * add lookup_table_prefetch_op and prefetch optimize, test=develop * remove remote prefetch GPU supported * word2vec force with CPU, test=develop * test dist remote lookup table force with CPU, test=develop
-
- 18 8月, 2019 1 次提交
-
-
由 gongweibao 提交于
Unset unittests http_proxy env to avoid timeout.
-
- 16 8月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 12 8月, 2019 1 次提交
-
-
由 gongweibao 提交于
Polish fleet API to support cuda collective mode and nccl2 mode
-
- 27 6月, 2019 1 次提交
-
-
由 tangwei12 提交于
* add is_runnning in communicator, test=develop
-
- 25 6月, 2019 1 次提交
-
-
由 Qiao Longfei 提交于
* optimize communicator merge sparse gradient test=develop * revert multithread selected rows merge add test=develop * follow comment test=develop
-
- 12 6月, 2019 1 次提交
-
-
由 tangwei12 提交于
* fix save/load in Fleet * add UT framework of Fleet
-
- 10 6月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* remove attribute in Allocator::Allocate, test=develop * fix travis ci error, test=develop
-
- 23 5月, 2019 1 次提交
-
-
由 Qiao Longfei 提交于
Async exe support communicator
-