Optimize the ernie inference performance on xpu backend. (#50357)
* Optimize the ernie inference performance on xpu * fix enable runtime cache logic * when op's input shape has changed, should create a new runtime context * fix * set flag when input shape has changed
Showing
想要评论请 注册 或 登录