- 21 12月, 2022 1 次提交
-
-
由 zyfncg 提交于
* refactor conv_activation_mkldnn_fuse_pass * refactor conv_affine_channel_mkldnn_fuse_pass * fix conv_activation_mkldnn_fuse_pass * fix mkldnn unittest * refactor int8_scale_calculation_mkldnn_pass and params_quantization_mkldnn_pass * refactor conv_elementwise_add_mkldnn_fuse_pass * fix quant * refactor conv_bn_fuse_pass * fix conv_bn_fuse_pass * refactor depthwise_conv_bn_fuse_pass * fix unittest * fix conv_bn_fuse_pass * remove redundant conv2d in params_quantization_mkldnn_pass * fix params_quantization_mkldnn_pass_tester
-
- 06 12月, 2022 1 次提交
-
-
由 zyfncg 提交于
* delete Bias and ResidualData in OpMaker of conv2d * delete extra input of conv3d * refactor pass of conv_bias_fusion * fix mkldnn dependency * fix mkldnn compile * fix test_conv_bias_mkldnn_fuse_pass * police some code * remove useless log * fix analyzer_vit_ocr_tester * fix conv_activation_mkldnn_fuse_pass * fix test_analyzer_ocr * add fused_conv_sig * fix performence regression * fix performance regression
-
- 05 12月, 2022 1 次提交
-
-
由 Wang Bojun 提交于
* pass * pass * draft version * share mem opt * remove sharemem * add pattern for the case with circle_shift=0 * add UT * pass opt * test_fix * code-commit * code-style * code style * code-style * ut-fix * op teller refine * resolve conflict * adjust position op_teller list and pass order for swin * ut code style update * adjust paddle pass order * refine pass order * refine pass order * refine pass order
-
- 01 12月, 2022 1 次提交
-
-
由 zhoutianzi666 提交于
* remove conv_act_set from graph_pattern_detector.cc
-
- 30 11月, 2022 2 次提交
-
-
由 zhangbo9674 提交于
* add fuse act add grad pass * polish code * refine code * add test * refine code
-
由 RichardWooSJTU 提交于
* delete unnecessary shape and slice op Co-authored-by: NYour Name <you@example.com>
-
- 21 11月, 2022 1 次提交
-
-
由 Sylwester Fraczek 提交于
* add fc-residual quantization * revert removal of check for use_mkldnn * fix bug * add disable_logs * review fix call twice AreScalesPresntForNodes instead of if-else * rewrite residual input to output * revert fc mkldnn taking residual data * format fix * fix LoDTensor->DenseTensor * LoDTensor->DenseTensor * output->input * revert changes to unsupported script revert changes to unsupported script * remove fc residualdata from output blocklist in cpu_bfloat16_pass.cc
-
- 16 11月, 2022 1 次提交
-
-
由 Piotr Paturej 提交于
* Enable bf16 in oneDNN bilinear_interp kernel * Fix bilinear_interp_v2 not enabled in models * Remove unnecessary checks
-
- 15 11月, 2022 1 次提交
-
-
由 jakpiase 提交于
* optimization for ln * fix * added output to gpd * added formatting * fix
-
- 14 11月, 2022 1 次提交
-
-
由 yeliang2258 提交于
-
- 07 11月, 2022 1 次提交
-
-
由 Hui Zhang 提交于
* suqeeze2 transpose2 fuse onednn * format * fix output shape * fix conflict * format * format * remove useless * remove log * simply pass * fix comment * fix * fix msg * fix error msg * format
-
- 04 11月, 2022 1 次提交
-
-
由 jakpiase 提交于
* tmp save * minor chnage * CI fix * added FC optimizations * latest update * CI fix * fixed bug with fusing fc
-
- 20 10月, 2022 1 次提交
-
-
由 Kaipeng Deng 提交于
* add fused_multi_transformer_encoder/decoder pass, run GPT-3 success
-
- 18 10月, 2022 1 次提交
-
-
由 Wang Bojun 提交于
* first version, accuracy corrected * disable debug print * use blockReduceSum in phi * add UT * add opCompat * code style * code refine * bug fix * code refine * test fix * bugfix * codesytle fix * code style * code-style * code-style * code-style
-
- 17 10月, 2022 2 次提交
-
-
由 Wang Bojun 提交于
* first version of ln_s_p with s>0 * refine and UT * pass opt draft * pass opt * code refine * code-style * bug fix * fix ci test * code style
-
由 jakpiase 提交于
-
- 10 10月, 2022 1 次提交
-
-
由 Sylwester Fraczek 提交于
* fix fc pattern remove use_bias add residual input switch fix references to pattern * review fixes
-
- 07 9月, 2022 1 次提交
-
-
由 wenbin 提交于
* first commit * conver done * correct format * layernorm_shift_partition * correct convert * redefine plugin * runable * bug fix * modify ShiftPartitionPattern * correct * add UT * modify ut * compile * modify enforce * modify UT
-
- 22 8月, 2022 2 次提交
-
-
由 joanna.wozna.intel 提交于
* Add int8 support for matmul+elementwiae_add fuse * Corrections after review and ernie test fix
-
由 Sławomir Siwek 提交于
* merge conv_concat_relu to conv_act * fix typo * extend unit test * reuse existing gpd * codestyle * enforce mkldnn conv
-
- 16 8月, 2022 2 次提交
-
-
由 feng_shuai 提交于
* convert multihead to oss * fix:bug * fix:delete const cast * fix:don't support bias_qk * add vit pass * fix:convert bug and add preln_residual_bias * support length=-1 * add UT for convert * add no_bias_qk support for gpu_multihead_op * delete infer_shape depends on bias_qk * oss just can be used in T4 and A* * fix:change api for ROCM CI
-
由 Wangzheee 提交于
-
- 04 8月, 2022 1 次提交
-
-
由 Sławomir Siwek 提交于
* Add unit tests * matmul_v2 + activation * matmuls + elementwise_add * matmul_v2 postops * transform matmul to v2 * opcompat * fix fusing matmul with multipe outs * add shape constraints * remove unused vars * change pass order * - Unit tests to be debugged - fix - refactor - diagnostic - more diagnostic - fix - Fix number two - fix - fix - fix - alpha added - more fixes - compilation fix - removed diagnostic code - cosmetic fixes * lint * add alpha constraint * merge matmul refactor * trigger CI * - fix * - another fix * code style * add support for matmul+elementwise_add+activation * code style * fix bfloat16 bugs * change append_binary to append_sum Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>
-
- 27 7月, 2022 1 次提交
-
-
由 pangyoki 提交于
* fix RemoveNode in fuse_elewise_add_act_pass * fix * change pointer to share_ptr * fix * fix * fix format * fix * fix graph_safe_remove_nodes
-
- 19 7月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Rename BOOST_GET macros * Fix conflicts
-
- 12 7月, 2022 1 次提交
-
-
由 xiaoxiaohehe001 提交于
-
- 11 7月, 2022 1 次提交
-
-
由 Sławomir Siwek 提交于
* reduce redundancy * python code style * fix int8 ut
-
- 06 7月, 2022 1 次提交
-
-
由 xiaoxiaohehe001 提交于
* conv_fusion
-
- 05 7月, 2022 1 次提交
-
-
由 Zuza Gawrysiak 提交于
* Refactor quantization of immutable ops * Fix code formatting * Fix formatting * Specify input names * Fix formatting * Change string to reference * Formatting
-
- 30 6月, 2022 1 次提交
-
-
由 JingZhuangzhuang 提交于
* modify graph_pattern to thread_local * modify graph_pattern to thread_local
-
- 26 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 23 6月, 2022 1 次提交
-
-
由 Sylwester Fraczek 提交于
* sylwek prototype params to int8 pass * trying to make warmup work * wip * wip * change test to cpp test * review fixes, refactoring * more refactoring * add erasevars * change test to fixture * rename pass and reorder erasevars and graphsaferemovenodes * fix * more refactoring and fixed bug * formatting * remove scale count * enfroce message too short * remove erasevars erasevars couldbe cauuse of memory issues some other fixes * add count of successfull fuses to name of new nodes * FindVar -> GetVar and use ConvResidual pattern * use tensor->clear() instead of new variable * Update paddle/fluid/framework/ir/mkldnn/params_quantization_mkldnn_pass_tester.cc Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> * Update paddle/fluid/framework/ir/mkldnn/params_quantization_mkldnn_pass_tester.cc Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> * Update paddle/fluid/inference/tests/api/analyzer_lexical_analysis_gru_tester.cc Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> * add log (review fix)c * review fix (2 functions to one) * code review: Conv->QuantizeConv * revert * fix formatting * remove unused functions * add paddle enforce Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
-
- 21 6月, 2022 1 次提交
-
-
由 joanna.wozna.intel 提交于
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 02 6月, 2022 1 次提交
-
-
由 Tomasz Socha 提交于
* Fix bfloat16 placement pass * Make it nicer * Fix leftovers * Style
-
- 19 5月, 2022 1 次提交
-
-
由 shentanyue 提交于
* support yolov5s static/int8 * fix eltwise_sub and div weight compute * fix delete_fill_constant_pass
-
- 12 5月, 2022 1 次提交
-
-
由 Shuangchi He 提交于
-
- 28 4月, 2022 1 次提交
-
-
由 Tomasz Socha 提交于
* Refactor Quantization * Refactor Dequantization * Classy solution * Style I * Style II * Style III * Use VLOG(4) for debug info * Style IV
-
- 04 4月, 2022 1 次提交
-
-
由 Sławomir Siwek 提交于
* DRY * change nodes names * add const prefix * change asX to as_x in all files
-
- 02 4月, 2022 1 次提交
-
-
由 Wangzheee 提交于
* paddle inference support new quant_model
-