- 26 Oct, 2022 1 commit
-
-
Authored by Sławomir Siwek
* fc/matmuls + scale fuse pass * remove double-extension * add unit tests * comments from review * codestyle * add pass to int8 list * new codestyle * attr name typo
-
- 20 Oct, 2022 1 commit
-
-
Authored by Kaipeng Deng
* add fused_multi_transformer_encoder/decoder pass, run GPT-3 success
-
- 18 Oct, 2022 1 commit
-
-
Authored by Wang Bojun
* first version, accuracy corrected * disable debug print * use blockReduceSum in phi * add UT * add opCompat * code style * code refine * bug fix * code refine * test fix * bugfix * codestyle fix * code style * code-style * code-style * code-style
-
- 17 Oct, 2022 1 commit
-
- 16 Oct, 2022 1 commit
-
-
Authored by ZeKai Zhou
-
- 27 Sep, 2022 1 commit
-
-
Authored by Wangzheee
* [Paddle Inference]support n lookup_tables fuse to embeddinglayernorm(3)
-
- 22 Sep, 2022 1 commit
-
-
Authored by Leo Chen
-
- 21 Sep, 2022 1 commit
-
-
Authored by Zhen Wang
* use cinn in the paddle inference * fix some cmake errors * Avoid division by zero in the arange_kernel. * Avoid dynamic ops. * Remove some useless codes. * Use OpTransInfo to encapsulate some codes used in the build_cinn_pass.
-
- 07 Sep, 2022 1 commit
-
-
Authored by wenbin
* first commit * convert done * correct format * layernorm_shift_partition * correct convert * redefine plugin * runnable * bug fix * modify ShiftPartitionPattern * correct * add UT * modify ut * compile * modify enforce * modify UT
-
- 31 Aug, 2022 1 commit
-
-
Authored by Hui Zhang
* add del dropout op pass to jit pe engine * add delete dropout test
-
- 30 Aug, 2022 1 commit
-
-
Authored by zhoutianzi666
Add constant folding pass; for some models, it reduces latency.
-
- 22 Aug, 2022 2 commits
-
-
Authored by Sławomir Siwek
* merge conv_concat_relu to conv_act * fix typo * extend unit test * reuse existing gpd * codestyle * enforce mkldnn conv
-
Authored by Yuanle Liu
-
- 16 Aug, 2022 1 commit
-
-
Authored by feng_shuai
* convert multihead to oss * fix:bug * fix:delete const cast * fix:don't support bias_qk * add vit pass * fix:convert bug and add preln_residual_bias * support length=-1 * add UT for convert * add no_bias_qk support for gpu_multihead_op * delete infer_shape depends on bias_qk * oss just can be used in T4 and A* * fix:change api for ROCM CI
-
- 15 Aug, 2022 1 commit
-
-
Authored by Yuanle Liu
-
- 05 Aug, 2022 1 commit
-
-
Authored by Sławomir Siwek
* remove v2_transpose_reshape * matmul_transpose_reshape * reshape_transpose_matmul * restore ut * adjust old ut * restore parallel UT rules * feedback from review
-
- 04 Aug, 2022 1 commit
-
-
Authored by Sławomir Siwek
* Add unit tests * matmul_v2 + activation * matmuls + elementwise_add * matmul_v2 postops * transform matmul to v2 * opcompat * fix fusing matmul with multiple outs * add shape constraints * remove unused vars * change pass order * - Unit tests to be debugged - fix - refactor - diagnostic - more diagnostic - fix - Fix number two - fix - fix - fix - alpha added - more fixes - compilation fix - removed diagnostic code - cosmetic fixes * lint * add alpha constraint * merge matmul refactor * trigger CI * - fix * - another fix * code style * add support for matmul+elementwise_add+activation * code style * fix bfloat16 bugs * change append_binary to append_sum Co-authored-by: Jacek Czaja <jacek.czaja@intel.com>
-
- 26 Jul, 2022 2 commits
-
-
Authored by Ruibiao Chen
-
Authored by Ruibiao Chen
* Set more attrs in ReplaceScaleLossGradOp * Fix typos * Fix CI errors * Add UT
-
- 12 Jul, 2022 1 commit
-
-
Authored by Sławomir Siwek
* add method for post ops * format code * gpd * format style * add matmul+act test * implement matmul+activation * whitespaces * code style * python code format * Increase UT timeout * code format * update style * generalize activation fuse passes * change order * Unify activation GPD * Revert changes with op_act * remove softmax mkldnn attrs * set common name for act attributes * whitespace * append postops by helper function * ut style * revert changes related to quantization * Reduce redundancy * reduce number of parameters * trigger CI * validate attribute * trim unit test
-
- 07 Jul, 2022 1 commit
-
-
Authored by Sing_chan
* copy onnxruntime.dll to c++ test folder in windows * remove ut that failed due to onnxruntime.dll * test_api_impl failed of diff * use TARGET to check that the test exists; use POST_BUILD to add copy command
-
- 04 Jul, 2022 1 commit
-
-
Authored by yaozhixin
-
- 24 Jun, 2022 1 commit
-
-
Authored by Wilber
* revert 40531 * update
-
- 23 Jun, 2022 1 commit
-
-
Authored by Sylwester Fraczek
* sylwek prototype params to int8 pass * trying to make warmup work * wip * wip * change test to cpp test * review fixes, refactoring * more refactoring * add erasevars * change test to fixture * rename pass and reorder erasevars and graphsaferemovenodes * fix * more refactoring and fixed bug * formatting * remove scale count * enforce message too short * remove erasevars; erasevars could be cause of memory issues; some other fixes * add count of successful fuses to name of new nodes * FindVar -> GetVar and use ConvResidual pattern * use tensor->clear() instead of new variable * Update paddle/fluid/framework/ir/mkldnn/params_quantization_mkldnn_pass_tester.cc Co-authored-by: Tomasz Socha <tomasz.socha@intel.com> * Update paddle/fluid/framework/ir/mkldnn/params_quantization_mkldnn_pass_tester.cc Co-authored-by: Tomasz Socha <tomasz.socha@intel.com> * Update paddle/fluid/inference/tests/api/analyzer_lexical_analysis_gru_tester.cc Co-authored-by: Tomasz Socha <tomasz.socha@intel.com> * add log (review fix) * review fix (2 functions to one) * code review: Conv->QuantizeConv * revert * fix formatting * remove unused functions * add paddle enforce Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>
-
- 20 Jun, 2022 1 commit
-
-
Authored by whs
-
- 13 Jun, 2022 1 commit
-
-
Authored by Ruibiao Chen
-
- 09 Jun, 2022 1 commit
-
-
Authored by minghaoBD
-
- 04 Jun, 2022 1 commit
-
-
Authored by Sing_chan
-
- 02 Jun, 2022 1 commit
-
-
Authored by Wangzheee
* new general transformer inference support
-
- 19 May, 2022 1 commit
-
-
Authored by shentanyue
* support yolov5s static/int8 * fix eltwise_sub and div weight compute * fix delete_fill_constant_pass
-
- 17 May, 2022 1 commit
-
-
Authored by zhupengyang
-
- 13 May, 2022 1 commit
-
-
Authored by Qi Li
* [IPU] fix ipu and add python infer api, test=develop * [IPU] add paddlepaddle-ipu package name, test=develop
-
- 12 May, 2022 1 commit
-
-
Authored by Wangzheee
* [Paddle-Inference] support transformer generation: some passes
-
- 11 May, 2022 1 commit
-
-
Authored by Zuza Gawrysiak
* Add int8 scales gathering pass for convolution * Fix typo * Add unittest * Add corrected unit test * Change test name * Remove enabling mkldnn in test * Speed up test * Change max examples * Add functional test * Change test name * Add new test case * Rename pass
-
- 10 May, 2022 1 commit
-
-
Authored by piotrekobi
* Readd conv_affine_channel fuse pass as mkldnn pass * Fix formatting * Add new test to parallel_UT_rule.py * Fix Coverage and Windows CI issues * Revert "Fix Coverage and Windows CI issues" This reverts commit f33459846385c9fd51c07f9f44e7ff283a652637. * Fix CI errors * Remove unnecessary conv_eltwise_add_affine_channel fuse pass * Remove test from parallel_UT_rule.py
-
- 06 May, 2022 1 commit
-
-
Authored by Allen Guo
* rm transfer_cast_op_pass * rm header
-
- 27 Apr, 2022 1 commit
-
-
Authored by jakpiase
* added test for shuffle_channel_mkldnn_detect_pass * added UT using new framework * CI fix
-
- 14 Apr, 2022 3 commits
-
-
Authored by Sławomir Siwek
* Change tensor name to match activation * declare fc_eltwise_add pass * merge conv_eltwise refactor PR * first compilable draft * unittest feedback tools * Fuse pass tester * Move IsReachable() to shared file * 100% coverage of fuse_pass_tester.cc * register pass * Add bias node * Improve unit tests / remove bias node from pattern * improve fc_eltwiseadd_unittest * cancel eltwise_add fuse if act is already fused * Add elementwise_input scale * Residual MVP * Add new FC attrs * Add more test cases * Add missing op attrs * Adapt code to new Elementwise pattern * reuse existing fcpattern * improve code style * remove unused arguments * fix typo * remove whitespace * remove int8 related code * Remove attributes from base ops * style * style check * Remove input from base op * Set attribute during fuse * ut timeout * download and test model * DRY * apply feedback from review * Style check * fix typo * cosmetic changes * explicitly set residual as output * VIT-OCR accuracy check * trigger CI * remove whitespaces * fix missing data file
-
Authored by baoachun
* add mkldnn int8 pass [step3] * Add test for compute_propagate_scales_mkldnn_pass * update pass * update api comment and python api Co-authored-by: wozna <joanna.wozna@intel.com>
-
Authored by jakpiase
* added shuffle_channel bf16/fp32 fwd kernel * added missing files * CI fix * changed from pten to phi * tmp save * added reviewers suggestions * fix for test
-