- 23 Sep, 2020 1 commit
Committed by Pei Yang
* optimize slice TRT plugin
  This patch removes an unnecessary barrier on the data transfer of the required offsets, so the transfer can overlap with GPU kernel execution. It also fixes the incorrect name of the slice plugin by replacing "layernorm" with "slice".
  test=develop
* add serialize/deserialize to slice plugin
* add static shape slice trt plugin
* fix slice trt op converter dynamic shape bug
* fix format by clang-format
* fix pylint format error
* fix problems commented by peiyang

Co-authored-by: Ryan Jeng <rjeng@nvidia.com>
Co-authored-by: Shang Zhizhou <shangzhizhou@baidu.com>
-
- 23 Apr, 2020 1 commit
Committed by Zhaolong Xing
* Cherry-pick: [Ernie TRT]: add slice op and add emb eltwise layernorm fp16 support (#23723)
* refine ernie trt dynamic shape support
  1. add slice op converter
  2. add emb eltwise layernorm fp16 support
  test=develop
* fix dynamic shape test ut
  test=develop
* fix comments.
  test=develop
* fix comments
  test=develop
* cherry-pick [BUG]: Head number can only be > 1 on multihead op (#23974)
* support the head number == 1
  test=develop
* fix slice op error.
  test=develop
* cherry-pick: disable trt test, test=develop (#23984)

test=release/2.0-beta
-