• MarDino's avatar
    Integrate rmsnorm kernel (#54998) · 97d3d6ee
    MarDino 提交于
    * add rmsnorm kernel
    * add static graph test
    * fix round type
    * use alignas to avoid msvc compile error
    * remove redundant headerfile to avoid rocm compile error
    * fix rocm compile not found cub
    * Add document
    97d3d6ee
test_rms_norm_op.py 5.5 KB