- 17 10月, 2022 1 次提交
-
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * Support allow_partial switch, which can be configure in pipeline_configs. If sent tensor are not the same from different hosts, they shouldn't been sent partially and then concated as a whole tensor. * Change name allow_partial to enable_partial_send_recv. * Add global variable _enable_partial_send_recv
-
- 12 10月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* [CodeStyle][F401] remove unused import in python/paddle/distributed * remove pass * empty commit * Fix ValueError: list.remove(x): x not in list for meta_optimizer_names. Fix ValueError: list.remove(x): x not in list for meta_optimizer_names. * Fix split import. Fix split import. * add noqa after meta_optimizers in factory * restort collective ops * expand `import *` * add noqa after required imports * try to fix APIs without core.ops * Revert "try to fix APIs without core.ops" This reverts commit 6172beaf601e84bf61f2490c12c4739f0edaa5eb. * fix an increment * empty commit * add noqa after required imports * expand `import *`, fix ci error Co-authored-by: NShuangchi He <34329208+Yulv-git@users.noreply.github.com>
-
- 22 9月, 2022 1 次提交
-
-
由 Yuang Liu 提交于
-
- 21 9月, 2022 1 次提交
-
-
由 LiYuRio 提交于
-
- 19 9月, 2022 1 次提交
-
-
由 wuhuachaocoding 提交于
-
- 06 9月, 2022 1 次提交
-
-
由 Yuang Liu 提交于
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
* use yapf to format all python file * yapf exclude two unittests file for they rely on writing and reading file, and format will break them * disable diff_py_file because too many diff files cause command following failed
-
- 19 4月, 2022 1 次提交
-
-
由 ShenLiang 提交于
* fix utest * fix time
-
- 18 10月, 2021 1 次提交
-
-
由 Haohongxiang 提交于
* [HybridParallel]Support fp16 in dygraph hybrid parallel * update * update * update for recompute * add unittest of pp+fp16 * add unittest of recompute+fp16 * update * modify ut
-
- 14 10月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* add no_sync for parameters sync * add pipeline for moe
-
- 24 9月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 13 9月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* support grad group * fix single card condition
-
- 10 9月, 2021 1 次提交
-
-
由 JZ-LIANG 提交于
* sharding support dp * sharding support mp * sharding support pp
-
- 01 9月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* add cache for send_recv * add eval_batch for pipeline * add eval batch for pipelineparallel * add style code
-
- 12 8月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* add recompute for pp * add recompute offload * add recompute partition
-
- 05 8月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* fix bug of p2p for partial * fix error
-
- 02 8月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* support 1f1b for pipeline * add utest * add send_partial/recv_partial * support amp for pp * fix logger
-
- 14 7月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* add scatter-gather opt * fix topo for pp * rename function
-
- 05 7月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* add amp support for pp * add amp untest
-
- 01 7月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 16 6月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* add pplayer * add sharedlayerdesc
-
- 24 5月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* fix pipeline * fix mp pp dp * fix utest of hybrid parallel * add utest for tuple
-
- 07 5月, 2021 1 次提交
-
-
由 lilong12 提交于
-
- 06 5月, 2021 1 次提交
-
-
由 zhiboniu 提交于
-
- 03 5月, 2021 1 次提交
-
-
由 lilong12 提交于
* update, test=develop
-
- 25 4月, 2021 1 次提交
-
-
由 lilong12 提交于
* add pp dygraph, test=develop
-