- 02 3月, 2023 2 次提交
-
-
由 gaoziyuan 提交于
-
由 Yuanle Liu 提交于
-
- 01 3月, 2023 1 次提交
-
-
由 zhupengyang 提交于
-
- 28 2月, 2023 4 次提交
-
-
由 joanna.wozna.intel 提交于
* Add gru qat int8 test * Change place of model downloading * Update paddle/fluid/inference/tests/api/CMakeLists.txt Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> * Correct flags names and add description --------- Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
-
由 zhupengyang 提交于
-
由 Yuanle Liu 提交于
-
由 wenbin 提交于
* fix concat bug * recommit for ci
-
- 27 2月, 2023 2 次提交
- 24 2月, 2023 3 次提交
-
-
由 zhoutianzi666 提交于
* allow fall back to fp16 when int8 * refine code * refine code * refine code
-
由 Yuanle Liu 提交于
-
由 zhoutianzi666 提交于
* fix multihead * fix multihead
-
- 23 2月, 2023 3 次提交
-
-
由 csy0225 提交于
-
由 Huang Jiyi 提交于
* move fluid generator to phi * move fluid generator to phi * update .gitignore * fix bugs * fix cannot find "glog/logging.h" in "generator.h" * fix bugs
-
由 zhupengyang 提交于
-
- 22 2月, 2023 2 次提交
-
-
由 Shuangchi He 提交于
* Fix some typos. Signed-off-by: Yulv-git <yulvchi@qq.com> * pre-commit Signed-off-by: Yulv-git <yulvchi@qq.com> --------- Signed-off-by: Yulv-git <yulvchi@qq.com>
-
由 zhupengyang 提交于
-
- 21 2月, 2023 2 次提交
-
-
由 YuanRisheng 提交于
* decouple_memory * perfect memory utils * fix ci bugs * fix inference bugs * fix custom test bugs * fix converage bugs * modify code according comment * modify namespace * deal with compile bugs
-
由 csy0225 提交于
* Optimize the ernie inference performance on xpu * fix enable runtime cache logic * when op's input shape has changed, should create a new runtime context * fix * set flag when input shape has changed
-
- 20 2月, 2023 1 次提交
-
-
由 Wang Bojun 提交于
-
- 17 2月, 2023 1 次提交
-
-
由 zhupengyang 提交于
[XPU] add multi_encoder_xpu_slice_fuse_pass, generate_sequence_xpu_fuse_pass, generate_sequence_xpu kernel (#50570)
-
- 16 2月, 2023 5 次提交
-
-
由 joanna.wozna.intel 提交于
* Add matmul_v2 to the quantization process and adjust Ernie model test * Correct cpu_quantize_pass test * Move op to fuse transformation to placement pass * Correct test
-
由 Tomasz Socha 提交于
-
由 shentanyue 提交于
* support xpu multi-card infer * add ut * clean code * clean code * fix * fix * fix * fix
-
由 Huang Jiyi 提交于
* move layer_norm_kernel.cu.h to phi * fix bugs * fix namespace * fix bugs * fix CI-Windwos * replace mutable_data * fix bugs * fix bugs
-
由 zhupengyang 提交于
-
- 14 2月, 2023 2 次提交
-
-
由 duanyanhui 提交于
* expand mix_precision to custom_device * fix bug * fix bug * fix comment * fix DEFINE bug
-
由 xjmxyt 提交于
* add cast setvalue op * add set_value to op teller * renew test and add description * add setAxis and add complex test * change test
-
- 11 2月, 2023 1 次提交
-
-
由 Wang Bojun 提交于
* eleadd_trans first version log fix * refine code for linear format, add pass check * linear format refine and ut fix * fix ut * windows ut * windows ut 2 * move tensorMeta and alloc to configure
-
- 10 2月, 2023 1 次提交
-
-
由 zhupengyang 提交于
-
- 09 2月, 2023 4 次提交
-
-
由 Zhang Jun 提交于
* update * support int64 shape tensor as engine input * add inference_predictor ut
-
由 joanna.wozna.intel 提交于
* Adjust mkldnn_placement_pass to check library type and data type * Check if var has inputs * Remove unrelated test * Refactor
-
由 zhoutianzi666 提交于
* add fmha_flashattention oss plugin * add fmhca * add oss fmhca * code reconstruct and add ut * code style refine * fix ut and enforce check * refine trt version check refine compile fix compile * fix cross ut * code refine * use runtime trt version check * bug fix and code refine * compile fix * merge develop * add GN QDQ kernel * support GN int8 fake kernel * add with_int8 * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 fake kernel * add GN int8 UT * add verison > 8000 in GN int8 UT * add some check in .cu * add stdlib.h in UT * little change in .cu * remove rand_r use rand * remove use rand * setAxis(1) * when int8 is on allow fall back to fp16 --------- Co-authored-by: Nwwbitejotunn <wang_bojun@outlook.com>
-
由 Wang Bojun 提交于
* trans_layernorm
-
- 08 2月, 2023 4 次提交
-
-
由 Paulina Gacek 提交于
* QuantTranpose pattern is being found by pass * quant + transpose fuse * code style changes * UT written, reorder fixed * Dequantize + transpose2 fuse added * pass name changed * UT added & shift corrected * got rid of redundancy * review changes * AsIntermediate corrected * compat added
-
由 Zhang Jun 提交于
* update * update * format code * update * Update test_trt_convert_nearest_interp_v2.py
-
由 weishengying 提交于
-
由 gaoziyuan 提交于
* remove_engine_info * remove_engine_info * remove_engine_info * change trtlayerinformation line to json --------- Co-authored-by: Ngaoziyuan <gaoziyuan@baidu.com>
-
- 07 2月, 2023 1 次提交
-
-
由 xiaoxiaohehe001 提交于
-
- 06 2月, 2023 1 次提交
-
-
由 Yuanle Liu 提交于
* disable conv2d_fusion_layout_transfer_pass temporarily * disable conv2d_fusion_layout_transfer_pass temporarily
-