- 19 8月, 2022 1 次提交
-
-
由 Wang Bojun 提交于
* add group_norm dyanmic plugin
-
- 18 8月, 2022 2 次提交
-
-
由 heliqi 提交于
* predictor add GetInputType interface * predictor change GetInputType to GetInputTypes * predictor add tester * predictor add tester * predictor change GetInputType to GetInputTypes * predictor change GetInputType to GetInputTypes * predictor add tester
-
由 JingZhuangzhuang 提交于
* fix infer tans scop * fix infer trans scope * fic infer trans scope * fic infer trans scope Co-authored-by: Ndingjiawei <327396238@qq.com>
-
- 16 8月, 2022 2 次提交
-
-
由 feng_shuai 提交于
* convert multihead to oss * fix:bug * fix:delete const cast * fix:don't support bias_qk * add vit pass * fix:convert bug and add preln_residual_bias * support length=-1 * add UT for convert * add no_bias_qk support for gpu_multihead_op * delete infer_shape depends on bias_qk * oss just can be used in T4 and A* * fix:change api for ROCM CI
-
由 Wilber 提交于
-
- 15 8月, 2022 3 次提交
-
-
由 Yuanle Liu 提交于
-
由 zlsh80826 提交于
* Reduce pool2d test configuration * Reduce depthwise_conv2d test configuration * Reduce trt_convert_conv2d_fusion test configuration * Reduce trt_convert_conv2d test configuration * Reduce trt_convert_conv2d_transpose test configuration * Reduce trt_convert_hard_swish test configuration * Enhance trt auto scan test error message and mechanism * Increase FP16 trt ut tolerance
-
由 Wilber 提交于
* convert_fp16 support multi block * update * update
-
- 14 8月, 2022 1 次提交
-
-
由 xiaoxiaohehe001 提交于
This reverts commit 84bf5c31.
-
- 12 8月, 2022 1 次提交
-
-
由 Yuanle Liu 提交于
* trt engine input data type should be consistent with trt input bindings type * fix some bugs * fix some bugs * fix some bugs
-
- 11 8月, 2022 1 次提交
-
-
由 whs 提交于
-
- 10 8月, 2022 2 次提交
-
-
由 Wangzheee 提交于
* disable_skip_layernorm_fp16
-
由 xiaoxiaohehe001 提交于
* cuda_graph * cuda_graph_ * cuda_graph_ * cuda_graph_
-
- 09 8月, 2022 1 次提交
-
-
由 Allen Guo 提交于
-
- 08 8月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* clean tensor.h * fix gather_nd
-
- 05 8月, 2022 2 次提交
-
-
由 Sławomir Siwek 提交于
* remove v2_transpose_reshape * matmul_transpose_reshape * reshape_transpose_matmul * restore ut * adjust old ut * restore parallel UT ruels * feedback from review
-
由 Zhang Jun 提交于
* update trt workspace size param * update * update * update * use int64_t * use int64_t * upate * update
-
- 04 8月, 2022 4 次提交
-
-
由 Sławomir Siwek 提交于
* Add unit tests * matmul_v2 + activation * matmuls + elementwise_add * matmul_v2 postops * transform matmul to v2 * opcompat * fix fusing matmul with multipe outs * add shape constraints * remove unused vars * change pass order * - Unit tests to be debugged - fix - refactor - diagnostic - more diagnostic - fix - Fix number two - fix - fix - fix - alpha added - more fixes - compilation fix - removed diagnostic code - cosmetic fixes * lint * add alpha constraint * merge matmul refactor * trigger CI * - fix * - another fix * code style * add support for matmul+elementwise_add+activation * code style * fix bfloat16 bugs * change append_binary to append_sum Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>
-
由 zhoutianzi666 提交于
* add rnn
-
由 ccrrong 提交于
-
由 Wilber 提交于
* convert support multi block. * update
-
- 03 8月, 2022 1 次提交
-
-
由 zhoutianzi666 提交于
-
- 02 8月, 2022 1 次提交
-
-
由 Wilber 提交于
* multihead matmul add fp16 * fix windows error * fix rocm error * fix rocm error
-
- 01 8月, 2022 4 次提交
-
-
由 Leo Chen 提交于
* remove cudaDeviceContext * remove more template * fix rocm compile * remove alias name CUDADeviceContext * fix compile * fix tests * revert changes
-
由 Wilber 提交于
* infer context fix place error. * update * update
-
由 Wangzheee 提交于
* add varlen_token_prune plugin, pass, convert
-
由 heliqi 提交于
-
- 29 7月, 2022 2 次提交
- 28 7月, 2022 1 次提交
-
-
由 heliqi 提交于
-
- 26 7月, 2022 1 次提交
-
-
由 Wilber 提交于
* multi stream support handle lazy init. * support eigen lazy init * update * fix ci problem
-
- 25 7月, 2022 1 次提交
-
-
由 Zhang Jun 提交于
* update * empty commit * update * update * update
-
- 22 7月, 2022 3 次提交
-
-
由 zhoutianzi666 提交于
-
由 xiaoxiaohehe001 提交于
-
由 Wilber 提交于
-
- 21 7月, 2022 4 次提交
-
-
由 ming1753 提交于
* fc support fp16 * add a ‘,’ on paddle_pass_builder.cc * fc support fp16 on non-cuda.
-
由 zhoutianzi666 提交于
* fix_fill_constant * fix_fill_constant * fix_ernie
-
由 Wilber 提交于
* fix some error found in tipc. * update
-
由 xiaoxiaohehe001 提交于
* convfusionfp16 * convfusionfp16 * convfusionfp16
-
- 20 7月, 2022 1 次提交
-
-
由 tianshuo78520a 提交于
-