- 02 9月, 2020 2 次提交
-
-
由 zlsh80826 提交于
-
由 Zhaolong Xing 提交于
test=develop
-
- 01 9月, 2020 1 次提交
-
-
由 zlsh80826 提交于
* add stack_op to CMakeLists * add dim=3 support for scale op * add trt stack op, test=develop * remove debug message * add stack plugin serialize * remove slice, scale op, will add later * enhence error message * revise trt ernie test to conver the stack op CI testi, test=develop * add stack op serialization * fix test shape after adding stack op * remove slice op, will add after implementing serialization * roll back to min_graph=5 to avoid using slice op * fix scale op output layer * implement stack op createPlugin * use workspace and move the defination to .cu * move stack plugin creator definition to .cu, test=develop
-
- 31 8月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* support trt dynamic shape int8 * add unittest * add support for sigmoid; adapt to trt6+ api
-
- 30 8月, 2020 1 次提交
-
-
由 zlsh80826 提交于
-
- 28 8月, 2020 3 次提交
- 26 8月, 2020 1 次提交
-
-
由 zlsh80826 提交于
-
- 25 8月, 2020 2 次提交
- 21 8月, 2020 1 次提交
-
-
由 Pei Yang 提交于
-
- 19 8月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
test=develop
-
- 17 8月, 2020 1 次提交
-
-
由 zlsh80826 提交于
-
- 09 8月, 2020 1 次提交
-
-
由 zlsh80826 提交于
-
- 08 8月, 2020 3 次提交
- 07 8月, 2020 5 次提交
- 06 8月, 2020 1 次提交
-
-
由 zlsh80826 提交于
-
- 05 8月, 2020 2 次提交
-
-
由 zlsh80826 提交于
-
由 Pei Yang 提交于
* develop dynamic shape serilization * add test param for gelu * fix bugs * delete redundant comments * debug * fix conflict. test=develop * fix bug. test=develop * add trt dynamic shape serialized support * fix ernie serialized bug test=develop * fix codestyle test=develop * fix bug test=develop * fix bug.test=develop * modify cmakelist test=develop * fix bug test=develop * fix error message. test=develop * fix trt register plugin based on pr#25003 * add trt dynload * fix deserialization bug of not finding plugin registration * refine code style * recover engine key in tensorrt_subgraph_pass * for ci coverage * add unittest for deserialization Co-authored-by: Nhaozech <chenhaoze94@gmail.com>
-
- 04 8月, 2020 3 次提交
- 03 8月, 2020 1 次提交
-
-
由 Pei Yang 提交于
-
- 28 7月, 2020 2 次提交
- 19 7月, 2020 2 次提交
- 10 7月, 2020 1 次提交
-
-
由 Jeng Bai-Cheng 提交于
Use vector instruction (LDG.128) to improve qkv transpose. It provides 1.4X speedup at same GPU base frequency. test=develop
-
- 07 7月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* fix multhead matmul's instable test=develop * fix multihead matmul bug test=develop * fix converage problem test=develop
-
- 28 6月, 2020 1 次提交
-
-
由 ReeseWang 提交于
-
- 23 6月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* Paddle-TensorRT support slim QAT. test=develop * add comments. test=develop * use RenameInput instead of ResetInputs. test=develop
-
- 18 6月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
test=develop
-
- 17 6月, 2020 1 次提交
-
-
由 Leo Chen 提交于
* fix bug of prelu when rank not equal 4, test=develop * fix prelu inference, test=develop * fix api, test=develop * fix shape when mode is chennel, test=develop * remove debug code, test=develop * add unittest, test=develop
-