support KL2 multi-card training, *test=kunlun (#43889)
* update xccl lib * use separate streams for compute/comm on XPU * add broadcast op to xpu2_op_list
Showing
想要评论请 注册 或 登录
* update xccl lib * use separate streams for compute/comm on XPU * add broadcast op to xpu2_op_list