- 15 Jan, 2021 1 commit
Committed by Yang Zhang
built-in `rsqrt` is shadowed
- 07 Jan, 2021 1 commit
Committed by furnace
* Layer norm fp16 (#29169)
  * add fp16 for layer_norm op
  * revert layernorm api
  * fix forward
  * fix forward
  * fix backward for layernorm with fp16
  * fix unit test for layernorm with fp16
  * fix with_mkldnn compile error for layernorm with fp16
  * 1. revert to PADDLE_ENFORCE_NOT_NULL, 2. change static_cast<float> to static_cast<U>
  * fix with_mkldnn compile error for layernorm with fp16
  * fix with_mkldnn compile error for layernorm with fp16
  Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
* fix layer_norm accuracy (#29434)
* Layernorm opt (#29522)
  * layernorm fw opt
  * layernorm bw opt
  * fix typo, test=develop
  * remove const dim3 for windows CI compatibility
  * merge develop
  Co-authored-by: zlsh80826 <zlsh80826@gmail.com>
* Fix compile problem when cuda_arch < 6000 (#29576)
  * fix compile problem when cuda_arch < 6000
  * refine code
  * refine code
Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
Co-authored-by: zlsh80826 <zlsh80826@gmail.com>
- 14 May, 2020 1 commit
Committed by lijianshe02
* API/OP (group_norm, layer_norm, unpool) error message enhancement test=develop
- 20 Apr, 2020 1 commit
Committed by mapingshuo
restrict block num of layer_norm_grad cuda kernel to 128, test=develop
- 06 Jan, 2020 1 commit
Committed by Pei Yang
* add gelu plugin
* align trt bert with gpu
* add support for fused fc with relu
* add unittest for bert trt
- 05 Sep, 2018 1 commit
Committed by Yu Yang
- 08 Aug, 2018 1 commit
Committed by sneaxiy
- 12 Feb, 2018 1 commit
Committed by qingqing01
- 10 Feb, 2018 2 commits
- 03 Feb, 2018 2 commits
Committed by chengduoZH
Committed by chengduoZH