feat: refactor reduce struct to control network order, avoiding network contention (#994)
* feat: avoid net contention by adding ctrl edge in ReduceStruct * refine(task_graph.h/cpp): refine AddCtrlEdgeInReduceStruct() * fix(graph/task_graph.cpp): fix the bug of machine order * fix(graph/task_graph.cpp): do not add ctrl edge with reduce scatter * feat: add ReduceGlobalAddCompActor * fix: fix the bug of reduce_global_actor/kernel * chore: remove used vim .swp file * fix(graph/task_graph.cpp): fix the bug of sorting copycomment when build reduce ctrl edge * fix(graph/task_graph.h/cpp): add CtrlEdge for ReduceGather * revert: remove the ReduceGlobalAddCompActor from this PR * feat: add use_ordered_allreduce_in_mdupdt in OtherConf Former-commit-id: 6d9663c7
Showing
想要评论请 注册 或 登录