- 08 1月, 2020 1 次提交
-
-
由 zhaoyuchen2018 提交于
* Fix softmax cuda bug * Refine multihead log and softmax logic * Align block to 32
-
- 02 12月, 2019 1 次提交
-
-
由 zhaoyuchen2018 提交于
The op should handle k=1024 Fix seq_len < warpsize error. test=develop Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
- 03 10月, 2019 1 次提交
-
-
由 zhaoyuchen2018 提交于
test=release/1.6 * Add multihead op for ernie opt * Refine softmax * Refine kernel. * Refine cuda kernel * Refine cuda version * Refine cmake
-