[cherry-pick 2.0] optimize gradient merge (#30185)
* Optimization grad merge performance (#29784)
* [fleet] combine amp and gradient merge, test=develop (#30086)
* fix assign_op_xpu concat_op_xpu warining (#30120)
Co-authored-by: Nliuyuhui <liuyuhui@baidu.com>
Showing
想要评论请 注册 或 登录