Support float16 when using ClipGradByGlobalNorm. (#33565)
This PR supports gradient clip (ClipGradByGlobalNorm) when training with AMP(auto mixed precision).
Showing
想要评论请 注册 或 登录
Fork自 PaddlePaddle / Paddle
This PR supports gradient clip (ClipGradByGlobalNorm) when training with AMP(auto mixed precision).