Optimization for StackGradCUDAKernel for last dimension stack case. (#48992)
* add stack grad kernel optimization * add basic optimization kernel for stack_grad_kernel * optimization of stack_grad_kernel for last dim stack and change code format with pre-commit
Showing
想要评论请 注册 或 登录