- 14 9月, 2022 2 次提交
- 09 9月, 2022 1 次提交
-
-
由 Wangzheee 提交于
-
- 08 9月, 2022 5 次提交
-
-
由 Wilber 提交于
-
由 wenbin 提交于
-
由 Aurelius84 提交于
-
由 Aurelius84 提交于
* [OpAttr]Refine Teller logic if encounter OpDesc with Variable type Attribute * fix iterator * fix typo * fix lambda expr * fix ptr
-
由 wenbin 提交于
-
- 07 9月, 2022 2 次提交
-
-
由 Wilber 提交于
* enable memory optimize when fp16. * optimiza params sync between cpu and gpu.
-
由 wenbin 提交于
* first commit * conver done * correct format * layernorm_shift_partition * correct convert * redefine plugin * runable * bug fix * modify ShiftPartitionPattern * correct * add UT * modify ut * compile * modify enforce * modify UT
-
- 06 9月, 2022 3 次提交
-
-
由 Wilber 提交于
-
由 LielinJiang 提交于
* add silu converter
-
由 Wangzheee 提交于
* remove int8 fallback
-
- 05 9月, 2022 2 次提交
-
-
由 yeliang2258 提交于
* support onnx format quantized model * update code * add test * add test * fix * fix test * fix cmake * update code * change scale file path to calibration file path * update code * update code * fix build bug * fix build bugs * fix * fix
-
由 denglin-github 提交于
* add config param for enable_dlnne and support calibration mode * remove useless file * refine code and add annotation * refine code of Warnning tips
-
- 02 9月, 2022 2 次提交
-
-
由 Sylwester Fraczek 提交于
-
由 feng_shuai 提交于
* vit_384_opt * just support trt8 * padding + unpadding * fix:unit test * refactor:padding * fix: change the position of round_up * refactor: delete workspace
-
- 30 8月, 2022 1 次提交
-
-
由 zhoutianzi666 提交于
add constant folding pass, for some model,it will get less latency;
-
- 29 8月, 2022 1 次提交
-
-
由 Yuanle Liu 提交于
-
- 26 8月, 2022 2 次提交
-
-
由 Wang Bojun 提交于
* fix bug fix * add shape size check * polish code * multi -1 shape fix * code style improve * bug fix * code style fix
-
由 Wangzheee 提交于
-
- 25 8月, 2022 2 次提交
-
-
由 Wilber 提交于
-
由 zhoutianzi666 提交于
-
- 24 8月, 2022 3 次提交
-
-
由 Wang Bojun 提交于
* fix bug fix
-
由 Yuanle Liu 提交于
-
由 Wilber 提交于
-
- 22 8月, 2022 4 次提交
-
-
由 joanna.wozna.intel 提交于
* Add int8 support for matmul+elementwiae_add fuse * Corrections after review and ernie test fix
-
由 Sławomir Siwek 提交于
* merge conv_concat_relu to conv_act * fix typo * extend unit test * reuse existing gpd * codestyle * enforce mkldnn conv
-
由 zhoutianzi666 提交于
-
由 Yuanle Liu 提交于
-
- 19 8月, 2022 2 次提交
-
-
由 Wang Bojun 提交于
* fix layernormTrt meanVar alloc bug
-
由 Wang Bojun 提交于
* add group_norm dyanmic plugin
-
- 18 8月, 2022 2 次提交
-
-
由 heliqi 提交于
* predictor add GetInputType interface * predictor change GetInputType to GetInputTypes * predictor add tester * predictor add tester * predictor change GetInputType to GetInputTypes * predictor change GetInputType to GetInputTypes * predictor add tester
-
由 JingZhuangzhuang 提交于
* fix infer tans scop * fix infer trans scope * fic infer trans scope * fic infer trans scope Co-authored-by: Ndingjiawei <327396238@qq.com>
-
- 16 8月, 2022 2 次提交
-
-
由 feng_shuai 提交于
* convert multihead to oss * fix:bug * fix:delete const cast * fix:don't support bias_qk * add vit pass * fix:convert bug and add preln_residual_bias * support length=-1 * add UT for convert * add no_bias_qk support for gpu_multihead_op * delete infer_shape depends on bias_qk * oss just can be used in T4 and A* * fix:change api for ROCM CI
-
由 Wilber 提交于
-
- 15 8月, 2022 3 次提交
-
-
由 Yuanle Liu 提交于
-
由 zlsh80826 提交于
* Reduce pool2d test configuration * Reduce depthwise_conv2d test configuration * Reduce trt_convert_conv2d_fusion test configuration * Reduce trt_convert_conv2d test configuration * Reduce trt_convert_conv2d_transpose test configuration * Reduce trt_convert_hard_swish test configuration * Enhance trt auto scan test error message and mechanism * Increase FP16 trt ut tolerance
-
由 Wilber 提交于
* convert_fp16 support multi block * update * update
-
- 14 8月, 2022 1 次提交
-
-
由 xiaoxiaohehe001 提交于
This reverts commit 84bf5c31.
-