Integrate rmsnorm kernel (#54998)
* add rmsnorm kernel * add static graph test * fix round type * use alignas to avoid msvc compile error * remove redundant headerfile to avoid rocm compile error * fix rocm compile not found cub * Add document
Showing
想要评论请 注册 或 登录