[Auto parallel] Transformer MHA & FFN Fused Dist op (#41163)
* adapot dist op * [Auto Parallel] Support the auto completion of while_op * add dist_fill_constant_batch_size_like * align infer accuracy
Showing
想要评论请 注册 或 登录
Fork自 PaddlePaddle / Paddle
* adapot dist op * [Auto Parallel] Support the auto completion of while_op * add dist_fill_constant_batch_size_like * align infer accuracy