Forked from PaddlePaddle / Paddle
This PR adds support for gradient clipping (ClipGradByGlobalNorm) when training with AMP (automatic mixed precision).
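To illustrate why clipping needs care under AMP, here is a minimal pure-Python sketch, not the code in this PR. It assumes the usual loss-scaling setup, where gradients arrive multiplied by a loss scale and have to be unscaled before the global norm is compared with the threshold. The helper names (`clip_by_global_norm`, `unscale_then_clip`) and the plain-list gradient representation are hypothetical; the clipping rule itself is the one documented for ClipGradByGlobalNorm, `g * clip_norm / max(global_norm, clip_norm)`.

```python
import math


def clip_by_global_norm(grads, clip_norm):
    """Rescale all gradients so that their joint L2 norm is at most clip_norm."""
    # The global norm is taken over every element of every gradient.
    global_norm = math.sqrt(sum(g * g for grad in grads for g in grad))
    # The factor is 1.0 when the norm is already within the threshold.
    factor = clip_norm / max(global_norm, clip_norm)
    return [[g * factor for g in grad] for grad in grads], global_norm


def unscale_then_clip(scaled_grads, loss_scale, clip_norm):
    """Hypothetical AMP step: undo the loss scale, skip on overflow, then clip."""
    # Assumption: gradients were computed from (loss * loss_scale).
    grads = [[g / loss_scale for g in grad] for grad in scaled_grads]
    # An inf/nan gradient signals overflow; the update would be skipped.
    if not all(math.isfinite(g) for grad in grads for g in grad):
        return None
    clipped, _ = clip_by_global_norm(grads, clip_norm)
    return clipped


if __name__ == "__main__":
    scaled = [[3.0 * 1024, 0.0], [4.0 * 1024]]
    print(unscale_then_clip(scaled, loss_scale=1024.0, clip_norm=1.0))
```

Clipping the still-scaled gradients would compare a norm inflated by the loss scale against `clip_norm`, so nearly every step would be clipped; unscaling first avoids that under the assumptions above.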