- 24 8月, 2021 1 次提交
-
-
由 gongweibao 提交于
-
- 18 8月, 2021 1 次提交
-
-
由 Leo Chen 提交于
* add retry for HcclGetRootInfo * refine code * reduce retry interval
-
- 16 8月, 2021 1 次提交
-
-
由 Baibaifan 提交于
-
- 13 8月, 2021 1 次提交
-
-
由 Baibaifan 提交于
-
- 06 8月, 2021 1 次提交
-
-
由 gongweibao 提交于
-
- 03 8月, 2021 1 次提交
-
-
由 WangXi 提交于
-
- 02 8月, 2021 2 次提交
-
-
由 gongweibao 提交于
-
由 Leo Chen 提交于
-
- 29 7月, 2021 1 次提交
-
-
由 gongweibao 提交于
-
- 19 7月, 2021 1 次提交
-
-
由 Roc 提交于
-
- 14 7月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* add scatter-gather opt * fix topo for pp * rename function
-
- 13 7月, 2021 2 次提交
- 23 6月, 2021 2 次提交
- 10 6月, 2021 1 次提交
-
-
由 lilong12 提交于
-
- 09 6月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* add parallel_cross_entropy * add grad for crossentropy * fix cross entropy
-
- 07 6月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* fix c_split bug * fix utest * add c_embedding for tensorparallel
-
- 13 5月, 2021 1 次提交
-
-
由 Baibaifan 提交于
-
- 08 5月, 2021 1 次提交
-
-
由 Baibaifan 提交于
* add c_identity_op_npu
-
- 06 5月, 2021 1 次提交
-
-
由 gongweibao 提交于
-
- 30 4月, 2021 1 次提交
-
-
由 Baibaifan 提交于
-
- 27 4月, 2021 2 次提交
- 26 4月, 2021 2 次提交
- 23 4月, 2021 2 次提交
- 21 4月, 2021 2 次提交
-
-
由 zhang wenhui 提交于
* add allreduce and broadcast without test (#31024) add allreduce and broadcast without test * Refactor HCCLCommContext to be compatible with Paddle (#31359) Refactor HCCLCommContext to be compatible with Paddle (#31359) * [NPU] add npu kernel for communication op (#31437) * add allreduce and broadcast without test * add c_broadcast_test case * build c_comm_init and c_create_group operators * make the whole thing compile * add broadcast and init op test case but run failed * make unit test compile * fix broadcast test bug and change into hcom for ccl * change c_comm_init and c_create_group ops accordingly * make tests compile * transfer code to 27 * compiled successfully in 28, but run failed * test broadcast in 28, but failed * make hcom primitives work * change hccl data type for base.h * fix broadcast bug * make attributes work * fix group name bug * add allreduce but test failed * allreduce bug for qiuliang * allreduce finished * add allgather and reducescatter * merge all op code * add allgather test * finish run all ccl op test exclude send/recv * all all op and test exclude send/recv * send_v2_npu.cc recv_v2_npiu.cc compiled * fix ccl core dump bug and test allgather, reducescatter, broadcast op * fix allreduce bug just for test * hcom send&recv test pass, without hcom_destroy * for qiuliang test * Ascend Send&Recv Test Pass * all op (ex send/recv) ok * fix bug * merge all ccl op * style merge to PaddlePaddle * merge style * new merge style * merge style 2 * insert an empty at the end * disable ctest for hcom to pass ci Co-authored-by: Nvoid-main <voidmain1313113@gmail.com> Co-authored-by: Nf2hkop <f2huestc@outlook.com> * Add auto-increasing tag id for Hcom OPs (#31702) * add c_reduce_sum op (#31793) add c_reduce_sum op * update Ascendrc hccl to 20.3 (#32126) update Ascendrc hccl to 20.3 (#32126) * fix merge code * change cmake.txt1 * [NPU] Support npu kernel for c sync stream op (#31386) * sync stream npu op * add with_ascend_acl * update c++ unittest * compile all failed * try to pre commit * after pre commit * merge&compile&test hccl successfully! * fix code style * fix code style * fix bugs about hccl * fix some bugs * fix code style * fix style * fix style * fix * fixed * merge develop Co-authored-by: Nlw921014 <liuwei921014@yeah.net> Co-authored-by: NVoid Main <voidmain1313113@gmail.com> Co-authored-by: Nf2hkop <f2huestc@outlook.com> Co-authored-by: Nxiayanming <41795079@qq.com>
-
由 liuyuhui 提交于
-
- 14 4月, 2021 1 次提交
-
-
由 tianshuo78520a 提交于
* Delete grpc.cmake/distribeted/distributed_ops * reset operators/CMakeLists.txt * rm test_transpiler_ops.py * del test_transpiler_ops.py
-
- 07 4月, 2021 1 次提交
-
-
由 zhang wenhui 提交于
* Ascend rc (#30483) * Fix compilcation on CANN20.1 and older (#30494) Fix compilcation on CANN20.1 and older * Add distribution supported (#30578) Add distribution supported * Build praser for Hcom* operators (#30627) Build praser for Hcom* operators * Pass device_ids info from launch to trainer. (#30632) Pass device_ids info from launch to trainer * Add Hccl program group (#30642) Add Hccl program group * Add startup bash files of test_ascend_group. (#30645) Add startup bash files of test_ascend_group * cleanup (#30646) cleanup test_ascend_group.py * [Feature] Build parser to support distributed training (#30658) [Feature] Build parser to support distributed training * fix compilation on ascend-20.1 (#30722) fix compilation on ascend-20.1 * Dev/fix ascend string (#30749) Dev/fix ascend string * code style (#30781) code style * Merge ascend_optimizer and ascend_parser. (#30776) Merge ascend_optimizer and ascend_parser. * Ascendrc add converted op : [range/equal/range/uniform_random/expand/squeeze], fix cast op bug (#30797) Ascendrc add converted op : [range/equal/range/uniform_random/expand/squeeze], fix cast op bug * Add paddle ascend distribution training supported (#30796) Add paddle ascend distribution training supported * pass cxx_flags to gloo cmake (#30857) * Destroy session first. (#30954) Destroy session first. * merge * fix, test=develop * fix, test=develop * fix style, test=develop * fix, test=develop * fix * fix log fatal, test=develop * fix enforce style, test=develop * fix, test=develop * fix, test=develop * fix rccl, test=develop * fix test, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix node_num, test=develop * fix ids str, test=develop * fix ids str, test=develop * fix ids str, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix style code, test=develop * fix style code, test=develop * fix style code, test=develop * fix style code, test=develop Co-authored-by: Nhutuxian <hutuxian2011@sina.cn> Co-authored-by: Ngongweibao <weibao.gong@gmail.com> Co-authored-by: NVoid Main <voidmain1313113@gmail.com> Co-authored-by: NLeo Chen <chenqiuliang@baidu.com> Co-authored-by: Ndingsiyu <18369187719@163.com> Co-authored-by: NOleNet <olenet@126.com>
-
- 01 4月, 2021 1 次提交
-
-
由 kuizhiqing 提交于
* new group * ci compatible fix * assert nccl
-
- 15 3月, 2021 1 次提交
-
-
由 WangXi 提交于
-
- 10 3月, 2021 2 次提交
- 05 3月, 2021 1 次提交
-
-
由 liuyuhui 提交于
[Kunlun]Multi xpu dygraph performance optimization , add distributed.spawn support for multi xpu and some bug-fixes (#31130)
-
- 24 2月, 2021 1 次提交
-
-
由 Qi Li 提交于
-
- 05 2月, 2021 1 次提交
-
-
由 liuyuhui 提交于
-
- 04 2月, 2021 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include second time, test=develop
-