[Bfloat16] register bfloat16 datatype for squared l2 norm (#50908)
* register bfloat16 datatype for squared l2 norm
* register bfloat16 datatype for softmax with upper triangular mask
* register bfloat16 for tril triu cuda kernel