Optimization for layerNormGrad [Part1] (#51282)
* first commit
* fix code bugs in for_loop
* fix bugs in cuLoadAddStridedInputs
* optimization for LayerNormBackwardComputeGradInput
* add unit test for validating the optimization
* fix Windows CI error