[Paddle-TRT] Implement MHA fp16 order same as training (#32629) (#32785)
* implement MHA order same as training
* fix fp16 compile issue on old architecture
Co-authored-by: Nzlsh80826 <rewang@nvidia.com>