- 02 3月, 2023 1 次提交
-
-
由 gaoziyuan 提交于
-
- 28 2月, 2023 1 次提交
-
-
由 wenbin 提交于
* fix concat bug * recommit for ci
-
- 27 2月, 2023 1 次提交
-
-
由 gaoziyuan 提交于
-
- 24 2月, 2023 2 次提交
-
-
由 zhoutianzi666 提交于
* allow fall back to fp16 when int8 * refine code * refine code * refine code
-
由 zhoutianzi666 提交于
* fix multihead * fix multihead
-
- 22 2月, 2023 1 次提交
-
-
由 Shuangchi He 提交于
* Fix some typos. Signed-off-by: Yulv-git <yulvchi@qq.com> * pre-commit Signed-off-by: Yulv-git <yulvchi@qq.com> --------- Signed-off-by: Yulv-git <yulvchi@qq.com>
-
- 20 2月, 2023 1 次提交
-
-
由 Wang Bojun 提交于
-
- 16 2月, 2023 1 次提交
-
-
由 Huang Jiyi 提交于
* move layer_norm_kernel.cu.h to phi * fix bugs * fix namespace * fix bugs * fix CI-Windwos * replace mutable_data * fix bugs * fix bugs
-
- 14 2月, 2023 1 次提交
-
-
由 xjmxyt 提交于
* add cast setvalue op * add set_value to op teller * renew test and add description * add setAxis and add complex test * change test
-
- 11 2月, 2023 1 次提交
-
-
由 Wang Bojun 提交于
* eleadd_trans first version log fix * refine code for linear format, add pass check * linear format refine and ut fix * fix ut * windows ut * windows ut 2 * move tensorMeta and alloc to configure
-
- 09 2月, 2023 2 次提交
-
-
由 zhoutianzi666 提交于
* add fmha_flashattention oss plugin * add fmhca * add oss fmhca * code reconstruct and add ut * code style refine * fix ut and enforce check * refine trt version check refine compile fix compile * fix cross ut * code refine * use runtime trt version check * bug fix and code refine * compile fix * merge develop * add GN QDQ kernel * support GN int8 fake kernel * add with_int8 * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 UT * add verison > 8000 in GN int8 UT * add some check in .cu * add stdlib.h in UT * little change in .cu * remove rand_r use rand * remove use rand * setAxis(1) * when int8 is on allow fall back to fp16 --------- Co-authored-by: Nwwbitejotunn <wang_bojun@outlook.com>
-
由 Wang Bojun 提交于
* trans_layernorm
-
- 08 2月, 2023 2 次提交
-
-
由 Zhang Jun 提交于
* update * update * format code * update * Update test_trt_convert_nearest_interp_v2.py
-
由 gaoziyuan 提交于
* remove_engine_info * remove_engine_info * remove_engine_info * change trtlayerinformation line to json --------- Co-authored-by: Ngaoziyuan <gaoziyuan@baidu.com>
-
- 07 2月, 2023 1 次提交
-
-
由 xiaoxiaohehe001 提交于
-
- 06 2月, 2023 2 次提交
-
-
由 wenbin 提交于
-
由 xiaoxiaohehe001 提交于
* add_hasattri_check * add_hasattri_check
-
- 01 2月, 2023 1 次提交
-
-
由 Wang Bojun 提交于
* preln_residual 2 fused_bias_residual * skip layernorm fix and ut * code refine * code style refine * fix ut * fix output * add trt layer fall back info * refine op teller and ut * DropoutMaskOut output fix
-
- 31 1月, 2023 3 次提交
- 18 1月, 2023 1 次提交
-
-
由 wenbin 提交于
* fix cast issue * add ut
-
- 17 1月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* change feed_op to phi kernel * fix ci bugs * fix build bugs * fix ci bugs * fix compile bugs * fix ci bugs * perfect code * perfect comment code * fix install bugs * modify code according comment * remove visitor in feed_op * modify according comment * perfect code according comment * add infershape * fix py3 bugs * fix getexpected kernel type * fix getexpected kernel type * fix ci bugs * add registry for custom device * fix py3 bugs * fix floating point error * fix py3 test bugs
-
- 16 1月, 2023 1 次提交
-
-
由 Zhang Jun 提交于
* add outvar name for nvtx mark * nly network created with kEXPLICIT_BATCH can setsetMaxBatchSize
-
- 13 1月, 2023 2 次提交
-
-
由 Wang Bojun 提交于
* add fmha_flashattention oss plugin
-
由 Zhang Jun 提交于
* update trt engine to set in/out data type * update * Update engine.cc * Update engine.cc * update * set engine output type before freeze the network * update * update trt autoscan ut * update * update ut * fix equal bug, update ut * fix cast and equal ut * update cast ut using TRT < 8.4 * set datatype from scope * check output var is nullptr * Update op_converter.h * update tensorrt_engine_op_test ut * update
-
- 12 1月, 2023 2 次提交
-
-
由 xiaoxiaohehe001 提交于
-
由 wenbin 提交于
* compile fix * fix compile * compile fix * add more preln
-
- 11 1月, 2023 2 次提交
- 10 1月, 2023 5 次提交
-
-
由 xiaoxiaohehe001 提交于
-
由 xiaoxiaohehe001 提交于
-
由 Sanbu 提交于
-
由 wenbin 提交于
* gn bug fix * bug fix * gn bug fix
-
由 MarDino 提交于
-
- 09 1月, 2023 2 次提交
-
-
由 Yuanle Liu 提交于
-
由 wenbin 提交于
* skip_groupnorm * init * preln * add ut * more assert * set timeout * fix windows ci issue
-
- 05 1月, 2023 2 次提交
-
-
由 xiaoxiaohehe001 提交于
-
由 Zhang Jun 提交于
* update nearest_interp, expand_v2, cast for stable diffusion * update nearest_interp, expand_v2, cast for stable diffusion * correct shape rank * Update expand_v2_op.cc
-
- 04 1月, 2023 1 次提交
-
-
由 Yuanle Liu 提交于
-