Add rank_attention_op attributes for GPU memory in contrib (#23915)
* optimize rank_attention, test=develop * use the paddle memory pool, test=develop * set max size, test=develop * limit the max size, test=develop * fix the head of cu, test=develop * add AsDispensable, test=develop
Showing
想要评论请 注册 或 登录