Improve performance of coalesce_tensor and depend op in standalone executor (#47606)
* Dispatch computation OPs before communication in standalone executor
* Update code
* Fix CI errors
* Improve performance of coalesce_tensor and depend OP in standalone executor
* pre-commit check