Fix index overflow bug of the CUDA kernel loop increment (#25435)
* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop * replace old macro & for condition, test=develop * polish details, test=develop
Showing
想要评论请 注册 或 登录