- 21 Apr, 2023 1 commit
-
-
Committed by tianshuo78520a
* mv inference/api infer_ut
* mv test
* merge develop fix error
* fix
* fix build error
* fix build error
* fix bug
* fix tester_helper.h
* fix analyzer_transformer_profile_tester.cc
* fix
* fix mac
* fix mac
* fix error
* fix
* fix
-
- 19 Dec, 2022 1 commit
-
-
Committed by Wangzheee
* General optimization for no_varlen embedding layernorm
-
- 13 Dec, 2022 1 commit
-
-
Committed by Wangzheee
* fix some transformer unittest
-
- 08 Dec, 2022 1 commit
-
-
Committed by Wangzheee
* general optimization no_varlen embedding layernorm
-
- 13 Oct, 2022 1 commit
-
-
Committed by Wangzheee
* test=infer-coverage
-
- 12 Oct, 2022 1 commit
-
-
Committed by Wangzheee
-
- 15 Aug, 2022 1 commit
-
-
Committed by zlsh80826
* Reduce pool2d test configuration
* Reduce depthwise_conv2d test configuration
* Reduce trt_convert_conv2d_fusion test configuration
* Reduce trt_convert_conv2d test configuration
* Reduce trt_convert_conv2d_transpose test configuration
* Reduce trt_convert_hard_swish test configuration
* Enhance trt auto scan test error message and mechanism
* Increase FP16 trt ut tolerance
-
- 26 Jun, 2022 1 commit
-
-
Committed by Sing_chan
-
- 05 Jun, 2022 1 commit
-
-
Committed by Sing_chan
-
- 02 Jun, 2022 1 commit
-
-
Committed by Wangzheee
* new general transformer inference support
-
- 13 Dec, 2021 1 commit
-
-
Committed by zlsh80826
* add restriction on plugin supportsFormat to eliminate errors from TensorRT8
* ernie-varlen is only supported on architecture >= sm75
-
- 25 Jun, 2021 1 commit
-
-
Committed by wenbin
* qkv
* ci_test
-
- 21 Jun, 2021 1 commit
-
-
Committed by Pei Yang
-
- 17 Jun, 2021 1 commit
-
-
Committed by Wilber
[Inference Tensorrt] Add attr for trt engine and handle the input seq problem for ernie var len. (#33575)
-
- 11 Jan, 2021 1 commit
-
-
Committed by 石晓伟
-
- 29 Dec, 2020 1 commit
-
-
Committed by Pei Yang
fix ut: trt_resnext_test, trt_quant_int8_yolov3_r50_test, test_trt_dynamic_shape_ernie, test_trt_dynamic_shape_ernie_fp16_ser_deser, trt_cascade_rcnn_test (#29977)
-
- 27 Nov, 2020 1 commit
-
-
Committed by Shang Zhizhou
* remove -DSUPPORTS_CUDA_FP16 in cuda.cmake
* compile with cuda9
* add some unittest
* notest;test=coverage
* add unittest for trt plugin swish && split
* update ernie unittest
* fix some error message
* remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter
* fix compile error when CUDA_ARCH_NAME < Pascal
* fix compile error
* update unittest timeout
* compile with cuda9
* update error msg
* fix code style
* add some comments
* add define IF_CUDA_ARCH_SUPPORT_FP16
* rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED
-
- 03 Nov, 2020 1 commit
-
-
Committed by Shang Zhizhou
* fp16 result ok
* change -DWITH_NVINFER_PLUGIN to config.EnableTensorRtOSS
* auto detect special slice op converter for ernie with trt oss
* ernie oss only support fp16
* fix special_slice_plugin serialize bug
* matmul in tensorrt ok
* ernie unittest ok
* add matmul tensorrt unittest
* remove demo code
-
- 01 Sep, 2020 1 commit
-
-
Committed by zlsh80826
* add stack_op to CMakeLists
* add dim=3 support for scale op
* add trt stack op, test=develop
* remove debug message
* add stack plugin serialize
* remove slice, scale op, will add later
* enhance error message
* revise trt ernie test to cover the stack op CI test, test=develop
* add stack op serialization
* fix test shape after adding stack op
* remove slice op, will add after implementing serialization
* roll back to min_graph=5 to avoid using slice op
* fix scale op output layer
* implement stack op createPlugin
* use workspace and move the definition to .cu
* move stack plugin creator definition to .cu, test=develop
-
- 19 Aug, 2020 1 commit
-
-
Committed by Pei Yang
-
- 05 Aug, 2020 1 commit
-
-
Committed by Pei Yang
* develop dynamic shape serialization
* add test param for gelu
* fix bugs
* delete redundant comments
* debug
* fix conflict. test=develop
* fix bug. test=develop
* add trt dynamic shape serialized support
* fix ernie serialized bug test=develop
* fix codestyle test=develop
* fix bug test=develop
* fix bug. test=develop
* modify cmakelist test=develop
* fix bug test=develop
* fix error message. test=develop
* fix trt register plugin based on pr#25003
* add trt dynload
* fix deserialization bug of not finding plugin registration
* refine code style
* recover engine key in tensorrt_subgraph_pass
* for ci coverage
* add unittest for deserialization
Co-authored-by: Nhaozech <chenhaoze94@gmail.com>
-
- 15 May, 2020 1 commit
-
-
Committed by Zhaolong Xing
test=develop
-
- 19 Apr, 2020 1 commit
-
-
Committed by Zhaolong Xing
* refine ernie trt dynamic shape support
  1. add slice op converter
  2. add emb eltwise layernorm fp16 support
  test=develop
* fix dynamic shape test ut test=develop
* fix comments. test=develop
* fix comments test=develop
-
- 26 Mar, 2020 1 commit
-
-
Committed by Zhaolong Xing
* add dynamic plugin support. test=develop
* change emb eltwise layernorm to math function test=develop
* add emb eltwise layernorm test=develop
* can run dynamic shape ernie test=develop
* fix ci test=develop
* add ut for trt ernie dynamic test=develop
* refine dynamic shape c++ interface. test=develop
* fix comments test=develop
* fix comments test=develop
-