- 05 11月, 2020 1 次提交
-
-
由 Shang Zhizhou 提交于
* Fix TRT plugin registry without TRT lib (#25982) * fix trt plugin registry without trt lib * support trt4 * refine code style * pick ea851796 from develop * cherry-pick develop PR #26273 && #27796 * fix unittest error * fix unittest error * remove const_cast Co-authored-by: NPei Yang <peiyang@baidu.com>
-
- 23 9月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* optimize slice TRT plugin This patch removes unnecessary barrier for data transfer of needed offset, so data transfer can be overlap with GPU kernel execution. This patch also fixes incorrect name of slice plugin. That is, replaces "layernorm" with "slice" test=develop * add serialize/deserialize to slice plugin * add static shape slice trt plugin * fix slice trt op convertor dynamic shape bug * fix format by clang-format * fix pylint format error * fix problems commented by peiyang Co-authored-by: NRyan Jeng <rjeng@nvidia.com> Co-authored-by: NShang Zhizhou <shangzhizhou@baidu.com> Co-authored-by: NRyan Jeng <rjeng@nvidia.com>
-
- 27 4月, 2020 1 次提交
-
-
由 石晓伟 提交于
-
- 25 4月, 2020 1 次提交
-
-
由 arlesniak 提交于
test=release/2.0
-
- 24 4月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* cherry-pick: fix converage ci (#23738) test=release/2.0-beta * modity ut test=release/2.0-beta
-
- 23 4月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* Chery_pick:[Eernie TRT]: add slice op and add emb eltwise layernorm fp16 support (#23723) * refine ernie trt dynamic shape support 1. add slice op converter 2. add emb eltwise layernorm fp16 support test=develop * fix dynamic shape test ut test=develop * fix comments. test=develop * fix comments test=develop * cherry-pick [BUG]: Head number can only be > 1 on multihead op (#23974) * support the head number == 1 test=develop * fix slice op error. test=develop * cherry-pick :disable trt test, test=develop (#23984) test=release/2.0-beta
-
- 12 4月, 2020 1 次提交
-
-
由 mamingjie-China 提交于
* API(argsort, argmax, argmin, cast, diag) error message enhancement, test=develop
-
- 11 4月, 2020 1 次提交
-
-
由 chenhaoze 提交于
* Add three passes and api reference of paddle_pass_builder.h
-
- 08 4月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 01 4月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
* add attr support for fusion group and add support for fill_constant and scale Op
-
- 20 3月, 2020 2 次提交
- 16 3月, 2020 1 次提交
-
-
由 liu zhengxi 提交于
* add transpose_flatten_concat_fuse_pass tests for gpu and trt, test=develop * update test_inference_api.py, test=develop
-
- 13 3月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
* add fusion group test for backward and refine code
-
- 12 3月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
* add support for expression type convert and add cast Op support in fusion group
-
- 11 3月, 2020 2 次提交
-
-
由 Wilber 提交于
* add skip_layernorm pass. test=develop
-
由 Zhaolong Xing 提交于
* 1. add embedding eltwise layernorm fuse 2. add embedding eltwise layernorm op 3. refine inplace_add_relu 4. refine fc_eltwise_layernorm test=develop * 1. refine fc test=develop * fix comments test=develop * fix comments test=develop
-
- 01 3月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
* Add the codegen and auto fusion for sum Op in fusion group
-
- 28 2月, 2020 1 次提交
-
-
由 tianshuo78520a 提交于
-
- 21 2月, 2020 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 20 2月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
* add fuse_group python unintest
-
- 10 2月, 2020 1 次提交
-
-
由 liu zhengxi 提交于
* add InterencePassTest for testing precision of inference passes, test=develop
-
- 17 1月, 2020 1 次提交
-
-
由 Yiqun Liu 提交于
* Implement a common python unittest to test the ir passes. test=develop * Save the results in np.array and support to startup on CPU. test=develop * Fix the unittest. test=develop * Add check_program to check whether the optimized program is different from the origin one. test=develop * Remove the inferface all_ops. test=develop * Add exception test in pass_test. test=develop
-