[cherry-pick] Fix perf issues of mp/pp/fuse in eager mode (#47071)
* [Dygraph] Fix performance of pp+mp by using send/recv_calc_stream instead of send/recv (#46116) * [Dygraph] Fix Perf of FusedFeedForward and FusedAttention with AllReduce (#46780) * update
Showing
想要评论请 注册 或 登录