- 13 6月, 2022 2 次提交
-
-
由 zhaoyingli 提交于
* fix fetch list * fix unittest
-
由 wangguanqun 提交于
* gpups default config and dataset * codestyle * add unittest * code style
-
- 09 6月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* add nproc_per_node for DistributedFusedLamb * fix nproc_per_node communicator bug * fix ring_id = 1 init bug * fix ci * fix test_parallel_executor_mnist.py
-
- 08 6月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* add fetch_list * fix evaluate log * tiny fix
-
- 07 6月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
* fix bugs of reducer * update * update
-
- 06 6月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* fix gradient merge * bug fix * update annotation
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
* use yapf to format all python file * yapf exclude two unittests file for they rely on writing and reading file, and format will break them * disable diff_py_file because too many diff files cause command following failed
-
- 02 6月, 2022 4 次提交
-
-
由 Haohongxiang 提交于
-
由 ziyoujiyi 提交于
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud commm ready * . * . * fl-ps v1.0 * . * support N + N mode * . * . * . * . * delete print * . * . * . * .
-
由 zhaoyingli 提交于
* prepare only once
-
由 zhaoyingli 提交于
-
- 01 6月, 2022 4 次提交
-
-
由 JZ-LIANG 提交于
* adapt for 10 loss * partitioner support optimizer
-
由 caozhou 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Add the parallel tuner * [Auto Parallel] Improve the parallel tuner and fix some bugs * upodate cost model * update import Resharder by dist op * update cost model * fix comp cost bug * update cost model * [Auto Parallel] Amend the dist attr for #processses=1 * update cost model and tuner * update cost model and tuner * update cost model and tuner * update cluster * update reshard * [Auto Parallel] Add the estimation from the cost model * [Auto Parallel] Reimplement the backup and restore functions * [Auto Parallel] Fix the bugs of the parallel tuner * [Auto Parallel] Update the engine api and dist context * [Auto Parallel] Work around the high order grad problem * [Auto Parallel] Add some miscellaneous improvements * [Auto Parallel] Add a unittest for DistributedContext Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
- 31 5月, 2022 2 次提交
-
-
由 yaozhixin 提交于
* [IPU] support paddle.distributed.launch with IPUs * add device_num to env_args_mapping
-
由 Haohongxiang 提交于
-
- 30 5月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* use original id in dist_op_context.grad_op_id_to_op_id * del assert * remove redundant map
-
- 26 5月, 2022 2 次提交
-
-
由 danleifeng 提交于
-
由 ShenLiang 提交于
-
- 25 5月, 2022 2 次提交
-
-
由 danleifeng 提交于
-
由 Baibaifan 提交于
* fix_sharding3_offload * fix_fp16dtype_bug
-
- 24 5月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 23 5月, 2022 1 次提交
-
-
由 Weilong Wu 提交于
-
- 20 5月, 2022 1 次提交
-
-
由 zn 提交于
-
- 19 5月, 2022 3 次提交
-
-
由 danleifeng 提交于
-
由 JZ-LIANG 提交于
* auto parallel support primitive op with data parallel * add primitive change * 5 loss 3D cylinder acc aligned * add unitest
-
由 zhaoyingli 提交于
* slice data in dist_loader & flag to scale grad * bug fix * update unittest * enable static
-
- 18 5月, 2022 2 次提交
- 16 5月, 2022 1 次提交
-
-
由 ShenLiang 提交于
* fix recompute in mp * fix recompute
-
- 13 5月, 2022 3 次提交
-
-
由 lilong12 提交于
* update
-
由 Tao CHANG 提交于
-
由 kuizhiqing 提交于
* add nvsmi * collect gpu info to log * fix unitest * rm ret_type
-
- 12 5月, 2022 2 次提交
-
-
由 Shuangchi He 提交于
-
由 sneaxiy 提交于
* add cinn pass to program * remove build_cinn_pass ut * polish ut, add ut * guard ut with is_compiled_with_cinn * enable ut test_build_cinn_pass_resnet
-
- 10 5月, 2022 3 次提交
- 07 5月, 2022 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Replace the old planner by the new partition tuner * [Auto Parallel] Improve the completion and distributed context * [Auto Parallel] Fix some bugs of the compatible check of some dist ops * [Auto Parallel] Fix some bugs
-