[cherry-pick 2.3] Cherry parallel fused transformer api (#43505)
* Rename dropout is test (#43098)
* replace dropout_is_test with is_test.
* improve atol on a100.
* fused_attention fused_feedforward api support Model Tensor Parallel (#42985)
* fix is_test bug in fused_feedforward. (#43508)
Co-authored-by: Li Min <11663212+limin2021@users.noreply.github.com>