    [NPU] add npu kernel for communication op (#31437) · 15823bb0
    lw921014 committed
    * add allreduce and broadcast without test
    
    * add c_broadcast_test case
    
    * build c_comm_init and c_create_group operators
    
    * make the whole thing compile
    
    * add broadcast and init op test case but run failed
    
    * make unit test compile
    
    * fix broadcast test bug and change into hcom for ccl
    
    * change c_comm_init and c_create_group ops accordingly
    
    * make tests compile
    
    * transfer code to 27
    
    * compiled successfully in 28, but run failed
    
    * test broadcast in 28, but failed
    
    * make hcom primitives work
    
    * change hccl data type for base.h
    
    * fix broadcast bug
    
    * make attributes work
    
    * fix group name bug
    
    * add allreduce but test failed
    
    * allreduce bug for qiuliang
    
    * allreduce finished
    
    * add allgather and reducescatter
    
    * merge all op code
    
    * add allgather test
    
    * finish running all ccl op tests, excluding send/recv
    
    * add all ops and tests, excluding send/recv
    
    * send_v2_npu.cc and recv_v2_npu.cc compiled
    
    * fix ccl core dump bug and test allgather, reducescatter, broadcast op
    
    * fix allreduce bug just for test
    
    * hcom send&recv test pass, without hcom_destroy
    
    * for qiuliang test
    
    * Ascend Send&Recv Test Pass
    
    * all op (ex send/recv) ok
    
    * fix bug
    
    * merge all ccl op
    
    * style merge to PaddlePaddle
    
    * merge style
    
    * new merge style
    
    * merge style 2
    
    * insert an empty line at the end
    
    * disable ctest for hcom to pass ci
    Co-authored-by: void-main <voidmain1313113@gmail.com>
    Co-authored-by: f2hkop <f2huestc@outlook.com>
collective_helper.h 10.8 KB