Forked from PaddlePaddle / Paddle
* implement MHA order same as training
* fix fp16 compile issue on old architecture
* fix format
* fix format