- 17 Apr, 2023 1 commit
  - Committed by Haohongxiang
- 31 Mar, 2023 1 commit
  - Committed by 张春乔
    * autofix
      Co-authored-by: Liyulingyue <83450930+Liyulingyue@users.noreply.github.com>
    * revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py
    * empty commit, trigger ci
    * fix test_slice
    Co-authored-by: SigureMo <sigure.qaq@gmail.com>
- 29 Mar, 2023 1 commit
  - Committed by zhouweiwei2014
- 25 Mar, 2023 1 commit
  - Committed by 张春乔
- 09 Mar, 2023 1 commit
  - Committed by Ghost Screaming
    * Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong.
    * Remove climits.
    * Fix bug of hybrid parallel strategy with recompute using bf16.
    * Fix bug of recompute_hybrid ctx.amp_dtype
    * Fix bug of amp_dtype.
    * Fix bug of auto_cast.
- 09 Feb, 2023 1 commit
  - Committed by Roc
    Co-authored-by: zhangxiaoci <zhangxiaoci@baidu.com>
- 11 Jan, 2023 1 commit
  - Committed by Wen Sun
- 25 Dec, 2022 1 commit
  - Committed by wanghuancoder
    * delete legacy dygraph code in python/paddle/distributed
    * refine
- 29 Nov, 2022 1 commit
  - Committed by Nyakku Shigure
    * isort all files
    * revert conflicting files
    * revert conflicting files
    * revert conflicting files
- 17 Nov, 2022 1 commit
  - Committed by ShenLiang
- 23 Oct, 2022 1 commit
  - Committed by Nyakku Shigure
    * update config
    * re-blacken python code
    * temporarily disable date and diff_py_file
    * skip a format
- 21 Oct, 2022 1 commit
  - Committed by Yuang Liu
- 19 Oct, 2022 1 commit
  - Committed by Roc
- 17 Oct, 2022 1 commit
  - Committed by Ghost Screaming
    * Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong.
    * Support allow_partial switch, which can be configure in pipeline_configs. If sent tensor are not the same from different hosts, they shouldn't been sent partially and then concated as a whole tensor.
    * Change name allow_partial to enable_partial_send_recv.
    * Add global variable _enable_partial_send_recv
- 12 Oct, 2022 1 commit
  - Committed by Nyakku Shigure
    * [CodeStyle][F401] remove unused import in python/paddle/distributed
    * remove pass
    * empty commit
    * Fix ValueError: list.remove(x): x not in list for meta_optimizer_names.
    * Fix split import.
    * add noqa after meta_optimizers in factory
    * restort collective ops
    * expand `import *`
    * add noqa after required imports
    * try to fix APIs without core.ops
    * Revert "try to fix APIs without core.ops"
      This reverts commit 6172beaf601e84bf61f2490c12c4739f0edaa5eb.
    * fix an increment
    * empty commit
    * add noqa after required imports
    * expand `import *`, fix ci error
    Co-authored-by: Shuangchi He <34329208+Yulv-git@users.noreply.github.com>
- 08 Oct, 2022 1 commit
  - Committed by Haohongxiang
- 28 Sep, 2022 1 commit
  - Committed by Yuang Liu
- 22 Sep, 2022 1 commit
  - Committed by Yuang Liu
- 20 Sep, 2022 1 commit
  - Committed by Yuang Liu
- 19 Sep, 2022 1 commit
  - Committed by wuhuachaocoding
- 14 Sep, 2022 1 commit
  - Committed by Nyakku Shigure
    * trim trailing whitespace
    * fix `.cmake-format.py`
    * revert npu ut changes, avoid npu ci error
- 09 Sep, 2022 1 commit
  - Committed by Yuang Liu
- 06 Sep, 2022 1 commit
  - Committed by Yuang Liu
- 26 Aug, 2022 1 commit
  - Committed by wanghuancoder
- 22 Jul, 2022 1 commit
  - Committed by Haohongxiang
- 13 Jul, 2022 1 commit
  - Committed by ShenLiang
- 27 Jun, 2022 1 commit
  - Committed by wanghuancoder
    * rename eagerpylayer
- 05 Jun, 2022 1 commit
  - Committed by Sing_chan
    * use yapf to format all python file
    * yapf exclude two unittests file for they rely on writing and reading file, and format will break them
    * disable diff_py_file because too many diff files cause command following failed
- 26 May, 2022 1 commit
  - Committed by ShenLiang
- 16 May, 2022 1 commit
  - Committed by ShenLiang
    * fix recompute in mp
    * fix recompute
- 19 Apr, 2022 1 commit
  - Committed by ShenLiang
    * fix utest
    * fix time
- 18 Oct, 2021 1 commit
  - Committed by Haohongxiang
    * [HybridParallel]Support fp16 in dygraph hybrid parallel
    * update
    * update
    * update for recompute
    * add unittest of pp+fp16
    * add unittest of recompute+fp16
    * update
    * modify ut
- 13 Oct, 2021 1 commit
  - Committed by Leo Chen
    * refine amp level
    * fix typo
    * update tracer._amp_level
- 17 Sep, 2021 1 commit
  - Committed by zhangbo9674
    * add pure fp16 major function in auto_cast & tracer
    * support master weight in dygraph for pure fp16
    * check mix dtype of fp16&fp32 for check_finite_and_unscale op
    * change pure fp16 funtion name
    * refine some bug in auto_cast
    * refine auto_cast interface logic
    * add param _casted_by_pure_fp16 for class Layer
    * support state_dict hook for save model by user appointed dtype in pure_fp16_decorator
    * refine pure_fp16_decorator as decorator
    * add unittest
    * add comment
    * add comment
    * support recompute
    * add comment for auto_cast and decorator
    * support to_static_state_dict for paddle.jit.save
    * unlimite models num and optimizers num
    * add lookup_table in black_list
    * fix momentum and layer state_dict
    * fix bug in layer state_dict
    * fix bug in layer state_dict_helper
    * refine unittest
    * refine test_momentun_op
    * refine interface and some code
    * refine amp_decorator interface
    * refine pure fp16 interface
    * refine master weight interface
- 10 Sep, 2021 1 commit
  - Committed by ShenLiang
- 01 Sep, 2021 1 commit
  - Committed by ShenLiang
    * add cache for send_recv
    * add eval_batch for pipeline
    * add eval batch for pipelineparallel
    * add style code
- 13 Aug, 2021 1 commit
  - Committed by ShenLiang
    * fix bug of py36 import
- 12 Aug, 2021 1 commit
  - Committed by ShenLiang
    * add recompute for pp
    * add recompute offload
    * add recompute partition
- 06 Aug, 2021 1 commit
  - Committed by ShenLiang
- 05 Aug, 2021 1 commit
  - Committed by ShenLiang