Optimize initialization time by decreasing the number of pp groups (#53559)
* use global group to pass meta
* use batch isend irecv
* add partial send/recv
* remove communication group
* remove p2p on npu and xpu
* remove virtual pp ut