[cherry-pick] Fix multihead op bug. (#20783) (#21438)
The op should handle k=1024
Fix the error that occurs when seq_len < warp size.
test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
Forked from PaddlePaddle / Paddle