- 21 2月, 2023 2 次提交
-
-
由 YuanRisheng 提交于
* decouple_memory * perfect memory utils * fix ci bugs * fix inference bugs * fix custom test bugs * fix converage bugs * modify code according comment * modify namespace * deal with compile bugs
-
由 csy0225 提交于
* Optimize the ernie inference performance on xpu * fix enable runtime cache logic * when op's input shape has changed, should create a new runtime context * fix * set flag when input shape has changed
-
- 17 2月, 2023 1 次提交
-
-
由 zhupengyang 提交于
[XPU] add multi_encoder_xpu_slice_fuse_pass, generate_sequence_xpu_fuse_pass, generate_sequence_xpu kernel (#50570)
-
- 16 2月, 2023 3 次提交
-
-
由 joanna.wozna.intel 提交于
* Add matmul_v2 to the quantization process and adjust Ernie model test * Correct cpu_quantize_pass test * Move op to fuse transformation to placement pass * Correct test
-
由 shentanyue 提交于
* support xpu multi-card infer * add ut * clean code * clean code * fix * fix * fix * fix
-
由 zhupengyang 提交于
-
- 14 2月, 2023 2 次提交
-
-
由 duanyanhui 提交于
* expand mix_precision to custom_device * fix bug * fix bug * fix comment * fix DEFINE bug
-
由 xjmxyt 提交于
* add cast setvalue op * add set_value to op teller * renew test and add description * add setAxis and add complex test * change test
-
- 11 2月, 2023 1 次提交
-
-
由 Wang Bojun 提交于
* eleadd_trans first version log fix * refine code for linear format, add pass check * linear format refine and ut fix * fix ut * windows ut * windows ut 2 * move tensorMeta and alloc to configure
-
- 10 2月, 2023 1 次提交
-
-
由 zhupengyang 提交于
-
- 09 2月, 2023 3 次提交
-
-
由 Zhang Jun 提交于
* update * support int64 shape tensor as engine input * add inference_predictor ut
-
由 joanna.wozna.intel 提交于
* Adjust mkldnn_placement_pass to check library type and data type * Check if var has inputs * Remove unrelated test * Refactor
-
由 Wang Bojun 提交于
* trans_layernorm
-
- 08 2月, 2023 1 次提交
-
-
由 Paulina Gacek 提交于
* QuantTranpose pattern is being found by pass * quant + transpose fuse * code style changes * UT written, reorder fixed * Dequantize + transpose2 fuse added * pass name changed * UT added & shift corrected * got rid of redundancy * review changes * AsIntermediate corrected * compat added
-
- 06 2月, 2023 1 次提交
-
-
由 Yuanle Liu 提交于
* disable conv2d_fusion_layout_transfer_pass temporarily * disable conv2d_fusion_layout_transfer_pass temporarily
-
- 01 2月, 2023 2 次提交
-
-
由 Wang Bojun 提交于
* preln_residual 2 fused_bias_residual * skip layernorm fix and ut * code refine * code style refine * fix ut * fix output * add trt layer fall back info * refine op teller and ut * DropoutMaskOut output fix
-
由 Hui Zhang 提交于
* jit layer support multi thread * fix bug * clone prediector not do graph optimizer * format * fix comment and format * fix override and fromat * fix * fix
-
- 31 1月, 2023 2 次提交
-
-
由 wenbin 提交于
* gn_silu * add ut * set TIMEOUT * correct comments * comments * disable windows ut * rename parameter
-
由 Yuanle Liu 提交于
-
- 19 1月, 2023 1 次提交
-
-
由 heliqi 提交于
* support PaddlePaddle Backend on Triton * fix test cases * fix Codestyle * add test case * add test case
-
- 17 1月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* change feed_op to phi kernel * fix ci bugs * fix build bugs * fix ci bugs * fix compile bugs * fix ci bugs * perfect code * perfect comment code * fix install bugs * modify code according comment * remove visitor in feed_op * modify according comment * perfect code according comment * add infershape * fix py3 bugs * fix getexpected kernel type * fix getexpected kernel type * fix ci bugs * add registry for custom device * fix py3 bugs * fix floating point error * fix py3 test bugs
-
- 16 1月, 2023 2 次提交
-
-
由 Yuanle Liu 提交于
* add trt_support_nhwc_pass
-
由 Yuanle Liu 提交于
* add gpu_cpu_map_matmul_to_mul_pass to kGpuLowerPrecisionPasses * disable fc_elementwise_layernorm_fuse_pass in mixed precision
-
- 13 1月, 2023 1 次提交
-
-
由 Wang Bojun 提交于
* add fmha_flashattention oss plugin
-
- 11 1月, 2023 1 次提交
-
-
由 zhangxin81 提交于
* fix paddle_infer_contrib include
-
- 10 1月, 2023 2 次提交
-
-
由 xiaoxiaohehe001 提交于
-
由 Sanbu 提交于
-
- 09 1月, 2023 2 次提交
- 06 1月, 2023 2 次提交
-
-
由 Yuanle Liu 提交于
-
由 Yuanle Liu 提交于
-
- 05 1月, 2023 2 次提交
-
-
由 Wilber 提交于
-
由 Yuanle Liu 提交于
-
- 04 1月, 2023 1 次提交
-
-
由 lzy 提交于
-
- 03 1月, 2023 3 次提交
-
-
由 Yuanle Liu 提交于
-
由 zhoutianzi666 提交于
* Implement conv2d_fusion NHWC format using CUTLASS * Add unit testing for CUTLASS Conv in inference * Add experimental API for CUTLASS.
-
由 Sanbu 提交于
-
- 28 12月, 2022 1 次提交
-
-
由 Yuanle Liu 提交于
-
- 22 12月, 2022 1 次提交
-
-
由 gem5 提交于
-
- 21 12月, 2022 1 次提交
-
-
由 zyfncg 提交于
* refactor conv_activation_mkldnn_fuse_pass * refactor conv_affine_channel_mkldnn_fuse_pass * fix conv_activation_mkldnn_fuse_pass * fix mkldnn unittest * refactor int8_scale_calculation_mkldnn_pass and params_quantization_mkldnn_pass * refactor conv_elementwise_add_mkldnn_fuse_pass * fix quant * refactor conv_bn_fuse_pass * fix conv_bn_fuse_pass * refactor depthwise_conv_bn_fuse_pass * fix unittest * fix conv_bn_fuse_pass * remove redundant conv2d in params_quantization_mkldnn_pass * fix params_quantization_mkldnn_pass_tester
-