- 03 11月, 2022 2 次提交
-
-
由 Sławomir Siwek 提交于
-
由 yeliang2258 提交于
* add constant_folding_pass pass for mkldnn int8 * update UpdateScaleOpInOutScales
-
- 29 10月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* [JITLayer]Enable OneDNN on CPU and Fix zero shape
-
- 28 10月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* [JIT] Add Predictor for JITLayer (#47379) * add predictor_engine * add predictor_engine * fix zero shape * fix lodTensor * fix unittest * fix code style * update CmakeList * fix new executor
-
- 20 10月, 2022 2 次提交
-
-
由 Kaipeng Deng 提交于
* add fused_attention_pass. test=develop * support fp16. test=develop * fix format. test=develop
-
由 yeliang2258 提交于
* Fix quantize model deploy bugs when using MKLDNN (#45920) * fix immutable op quantize bugs * fix * fix build bug * fix test * notest,test=inference * fix ppyoloe acc drop bugs * fix test * fix test * add test * fix * fix * fix test * fix refined name bug * fix test * bias fix * fix matmul weight dequant bug * re-ci * fix tester * fix test * fix tester * update weight dequantize func * update code * update test for converage * update test * update cmake * update cmakelist * update code * rerun ci * remove useless code * re-ci * update code * update code * fix header * update code for log
-
- 18 10月, 2022 1 次提交
-
-
由 zhoutianzi666 提交于
-
- 17 10月, 2022 1 次提交
-
-
由 Allen Guo 提交于
* paddle-inference support custom-ops Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> * fix tolower Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
-
- 14 10月, 2022 4 次提交
-
-
由 Wilber 提交于
-
由 xiaoxiaohehe001 提交于
-
由 Zhang Jun 提交于
* fix reshape2 opteller; add elementwise min/max register for tensorrt
-
由 zhoutianzi666 提交于
-
- 11 10月, 2022 1 次提交
-
-
由 Yuanle Liu 提交于
-
- 28 9月, 2022 1 次提交
-
-
由 zhoutianzi666 提交于
-
- 20 9月, 2022 2 次提交
-
-
由 zhoutianzi666 提交于
* Support matmul_v2 in Paddle-TensorRT converter.
-
由 zyfncg 提交于
* fix wrong eigen header include * fix complie bug * fix nan_inf_utils_detail * fix resource_manager * fix conv_miopen_helper
-
- 15 9月, 2022 1 次提交
-
-
由 weishengying 提交于
-
- 07 9月, 2022 1 次提交
-
-
由 wenbin 提交于
* first commit * conver done * correct format * layernorm_shift_partition * correct convert * redefine plugin * runable * bug fix * modify ShiftPartitionPattern * correct * add UT * modify ut * compile * modify enforce * modify UT
-
- 06 9月, 2022 2 次提交
-
-
由 Wilber 提交于
-
由 LielinJiang 提交于
* add silu converter
-
- 05 9月, 2022 2 次提交
-
-
由 yeliang2258 提交于
* support onnx format quantized model * update code * add test * add test * fix * fix test * fix cmake * update code * change scale file path to calibration file path * update code * update code * fix build bug * fix build bugs * fix * fix
-
由 denglin-github 提交于
* add config param for enable_dlnne and support calibration mode * remove useless file * refine code and add annotation * refine code of Warnning tips
-
- 02 9月, 2022 1 次提交
-
-
由 Sylwester Fraczek 提交于
-
- 30 8月, 2022 1 次提交
-
-
由 zhoutianzi666 提交于
add constant folding pass, for some model,it will get less latency;
-
- 29 8月, 2022 1 次提交
-
-
由 Yuanle Liu 提交于
-
- 22 8月, 2022 3 次提交
-
-
由 joanna.wozna.intel 提交于
* Add int8 support for matmul+elementwiae_add fuse * Corrections after review and ernie test fix
-
由 Sławomir Siwek 提交于
* merge conv_concat_relu to conv_act * fix typo * extend unit test * reuse existing gpd * codestyle * enforce mkldnn conv
-
由 Yuanle Liu 提交于
-
- 18 8月, 2022 2 次提交
-
-
由 heliqi 提交于
* predictor add GetInputType interface * predictor change GetInputType to GetInputTypes * predictor add tester * predictor add tester * predictor change GetInputType to GetInputTypes * predictor change GetInputType to GetInputTypes * predictor add tester
-
由 JingZhuangzhuang 提交于
* fix infer tans scop * fix infer trans scope * fic infer trans scope * fic infer trans scope Co-authored-by: Ndingjiawei <327396238@qq.com>
-
- 16 8月, 2022 2 次提交
-
-
由 feng_shuai 提交于
* convert multihead to oss * fix:bug * fix:delete const cast * fix:don't support bias_qk * add vit pass * fix:convert bug and add preln_residual_bias * support length=-1 * add UT for convert * add no_bias_qk support for gpu_multihead_op * delete infer_shape depends on bias_qk * oss just can be used in T4 and A* * fix:change api for ROCM CI
-
由 Wilber 提交于
-
- 15 8月, 2022 1 次提交
-
-
由 Yuanle Liu 提交于
-
- 14 8月, 2022 1 次提交
-
-
由 xiaoxiaohehe001 提交于
This reverts commit 84bf5c31.
-
- 10 8月, 2022 1 次提交
-
-
由 xiaoxiaohehe001 提交于
* cuda_graph * cuda_graph_ * cuda_graph_ * cuda_graph_
-
- 05 8月, 2022 2 次提交
-
-
由 Sławomir Siwek 提交于
* remove v2_transpose_reshape * matmul_transpose_reshape * reshape_transpose_matmul * restore ut * adjust old ut * restore parallel UT ruels * feedback from review
-
由 Zhang Jun 提交于
* update trt workspace size param * update * update * update * use int64_t * use int64_t * upate * update
-
- 04 8月, 2022 3 次提交
-
-
由 Sławomir Siwek 提交于
* Add unit tests * matmul_v2 + activation * matmuls + elementwise_add * matmul_v2 postops * transform matmul to v2 * opcompat * fix fusing matmul with multipe outs * add shape constraints * remove unused vars * change pass order * - Unit tests to be debugged - fix - refactor - diagnostic - more diagnostic - fix - Fix number two - fix - fix - fix - alpha added - more fixes - compilation fix - removed diagnostic code - cosmetic fixes * lint * add alpha constraint * merge matmul refactor * trigger CI * - fix * - another fix * code style * add support for matmul+elementwise_add+activation * code style * fix bfloat16 bugs * change append_binary to append_sum Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>
-
由 zhoutianzi666 提交于
* add rnn
-
由 Wilber 提交于
* convert support multi block. * update
-