[paddle-inference]support setting fully connected in multi-head attention...
[paddle-inference]support setting fully connected in multi-head attention static shape branch to int8 (#39660) * fix inference int * update * add unittest
Showing
想要评论请 注册 或 登录