Fork自 PaddlePaddle / Paddle
Global Norm need to compulte L2 norm of grads. It will calculate sum{grad^2}. Using float32 is easily overflowed. test=release/1.0.0
拖放文件到此处或点击上传