[bf16] add bf16 kernel: scale gather sum (#39683)
* add scale gather sum * refine CUDA_ATOMIC_WRAPPER ADD for bf16 * add gather unittest * solve conflict * add scale uinttest * add sum unittest * solve conflict * refine gather unittest * refine unittest
Showing
想要评论请 注册 或 登录