Fix shared memory overuse in embedding grad kernel in deterministic mode (#53247)
* Fix shared memory overuse in embedding grad kernel in deterministic mode
* Use IdT as the integer dtype