Add use_hierarchical_allreduce for DistributedFusedLAMB (#44821)
* add use_hierarchical_allreduce * support hierarchical allreduce for more cases
Showing
想要评论请 注册 或 登录
* add use_hierarchical_allreduce * support hierarchical allreduce for more cases