1. 16 2月, 2023 2 次提交
  2. 08 2月, 2023 2 次提交
  3. 21 12月, 2022 1 次提交
    • Z
      Refactor Pass for fused_conv (#48848) · 7f0eb2e3
      zyfncg 提交于
      * refactor conv_activation_mkldnn_fuse_pass
      
      * refactor conv_affine_channel_mkldnn_fuse_pass
      
      * fix conv_activation_mkldnn_fuse_pass
      
      * fix mkldnn unittest
      
      * refactor int8_scale_calculation_mkldnn_pass and params_quantization_mkldnn_pass
      
      * refactor conv_elementwise_add_mkldnn_fuse_pass
      
      * fix quant
      
      * refactor conv_bn_fuse_pass
      
      * fix conv_bn_fuse_pass
      
      * refactor depthwise_conv_bn_fuse_pass
      
      * fix unittest
      
      * fix conv_bn_fuse_pass
      
      * remove redundant conv2d in params_quantization_mkldnn_pass
      
      * fix params_quantization_mkldnn_pass_tester
      7f0eb2e3
  4. 06 12月, 2022 1 次提交
    • Z
      Clear extra input (Bias, ResidualData) in OpMaker of conv2d (#47579) · 0a2dfa38
      zyfncg 提交于
      * delete Bias and ResidualData in OpMaker of conv2d
      
      * delete extra input of conv3d
      
      * refactor pass of conv_bias_fusion
      
      * fix mkldnn dependency
      
      * fix mkldnn compile
      
      * fix test_conv_bias_mkldnn_fuse_pass
      
      * police some code
      
      * remove useless log
      
      * fix analyzer_vit_ocr_tester
      
      * fix conv_activation_mkldnn_fuse_pass
      
      * fix test_analyzer_ocr
      
      * add fused_conv_sig
      
      * fix performence regression
      
      * fix performance regression
      0a2dfa38
  5. 05 12月, 2022 1 次提交
    • W
      Reverse roll fuse (#46914) · feb68dd1
      Wang Bojun 提交于
      * pass
      
      * pass
      
      * draft version
      
      * share mem opt
      
      * remove sharemem
      
      * add pattern for the case with circle_shift=0
      
      * add UT
      
      * pass opt
      
      * test_fix
      
      * code-commit
      
      * code-style
      
      * code style
      
      * code-style
      
      * ut-fix
      
      * op teller refine
      
      * resolve conflict
      
      * adjust position op_teller list and pass order for swin
      
      * ut code style update
      
      * adjust paddle pass order
      
      * refine pass order
      
      * refine pass order
      
      * refine pass order
      feb68dd1
  6. 01 12月, 2022 1 次提交
  7. 30 11月, 2022 2 次提交
  8. 21 11月, 2022 1 次提交
    • S
      add fc-residual quantization (#46917) · fed0ed34
      Sylwester Fraczek 提交于
      * add fc-residual quantization
      
      * revert removal of check for use_mkldnn
      
      * fix bug
      
      * add disable_logs
      
      * review fix
      
      call twice AreScalesPresntForNodes instead of if-else
      
      * rewrite residual input to output
      
      * revert fc mkldnn taking residual data
      
      * format fix
      
      * fix LoDTensor->DenseTensor
      
      * LoDTensor->DenseTensor
      
      * output->input
      
      * revert changes to unsupported script
      
      revert changes to unsupported script
      
      * remove fc residualdata from output blocklist in cpu_bfloat16_pass.cc
      fed0ed34
  9. 16 11月, 2022 1 次提交
  10. 15 11月, 2022 1 次提交
  11. 14 11月, 2022 1 次提交
  12. 07 11月, 2022 1 次提交
    • H
      suqeeze2 + transpose2 fuse onednn (#47592) · fa874a46
      Hui Zhang 提交于
      * suqeeze2 transpose2 fuse onednn
      
      * format
      
      * fix output shape
      
      * fix conflict
      
      * format
      
      * format
      
      * remove useless
      
      * remove log
      
      * simply pass
      
      * fix comment
      
      * fix
      
      * fix msg
      
      * fix error msg
      
      * format
      fa874a46
  13. 04 11月, 2022 1 次提交
  14. 20 10月, 2022 1 次提交
  15. 18 10月, 2022 1 次提交
    • W
      Merge layernorm trt fuse (#46320) · 5e9f491e
      Wang Bojun 提交于
      * first version, accuracy corrected
      
      * disable debug print
      
      * use blockReduceSum in phi
      
      * add UT
      
      * add opCompat
      
      * code style
      
      * code refine
      
      * bug fix
      
      * code refine
      
      * test fix
      
      * bugfix
      
      * codesytle fix
      
      * code style
      
      * code-style
      
      * code-style
      
      * code-style
      5e9f491e
  16. 17 10月, 2022 2 次提交
  17. 10 10月, 2022 1 次提交
  18. 07 9月, 2022 1 次提交
    • W
      Layernorm shift partition (#45736) · 960109af
      wenbin 提交于
      * first commit
      
      * conver done
      
      * correct format
      
      * layernorm_shift_partition
      
      * correct convert
      
      * redefine plugin
      
      * runable
      
      * bug fix
      
      * modify ShiftPartitionPattern
      
      * correct
      
      * add UT
      
      * modify ut
      
      * compile
      
      * modify enforce
      
      * modify UT
      960109af
  19. 22 8月, 2022 2 次提交
  20. 16 8月, 2022 2 次提交
    • F
      convert multihead to oss (#45019) · f706d95d
      feng_shuai 提交于
      * convert multihead to oss
      
      * fix:bug
      
      * fix:delete const cast
      
      * fix:don't support bias_qk
      
      * add vit pass
      
      * fix:convert bug and add preln_residual_bias
      
      * support length=-1
      
      * add UT for convert
      
      * add no_bias_qk support for gpu_multihead_op
      
      * delete infer_shape depends on bias_qk
      
      * oss just can be used in T4 and A*
      
      * fix:change api for ROCM CI
      f706d95d
    • W
      fix new quant (#45155) · 2fb65e44
      Wangzheee 提交于
      2fb65e44
  21. 04 8月, 2022 1 次提交
    • S
      Matmuls with activation and elementwise_add fuses (#44655) · 0420d514
      Sławomir Siwek 提交于
      * Add unit tests
      
      * matmul_v2 + activation
      
      * matmuls + elementwise_add
      
      * matmul_v2 postops
      
      * transform matmul to v2
      
      * opcompat
      
      * fix fusing matmul with multipe outs
      
      * add shape constraints
      
      * remove unused vars
      
      * change pass order
      
      * - Unit tests to be debugged
      
      - fix
      
      - refactor
      
      - diagnostic
      
      - more diagnostic
      
      - fix
      
      - Fix number two
      
      - fix
      
      - fix
      
      - fix
      
      - alpha added
      
      - more fixes
      
      - compilation fix
      
      - removed diagnostic code
      
      - cosmetic fixes
      
      * lint
      
      * add alpha constraint
      
      * merge matmul refactor
      
      * trigger CI
      
      * - fix
      
      * - another fix
      
      * code style
      
      * add support for matmul+elementwise_add+activation
      
      * code style
      
      * fix bfloat16 bugs
      
      * change append_binary to append_sum
      Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>
      0420d514
  22. 27 7月, 2022 1 次提交
  23. 19 7月, 2022 1 次提交
  24. 12 7月, 2022 1 次提交
  25. 11 7月, 2022 1 次提交
  26. 06 7月, 2022 1 次提交
  27. 05 7月, 2022 1 次提交
  28. 30 6月, 2022 1 次提交
  29. 26 6月, 2022 1 次提交
  30. 23 6月, 2022 1 次提交
    • S
      [external reviewing] Params to int8 pass (#42625) · b8b2d6a9
      Sylwester Fraczek 提交于
      * sylwek
      
      prototype params to int8 pass
      
      * trying to make warmup work
      
      * wip
      
      * wip
      
      * change test to cpp test
      
      * review fixes, refactoring
      
      * more refactoring
      
      * add erasevars
      
      * change test to fixture
      
      * rename pass
      
      and reorder erasevars and graphsaferemovenodes
      
      * fix
      
      * more refactoring and fixed bug
      
      * formatting
      
      * remove scale count
      
      * enfroce message too short
      
      * remove erasevars
      
      erasevars couldbe cauuse of memory issues
      
      some other fixes
      
      * add count of successfull fuses to name of new nodes
      
      * FindVar -> GetVar and use ConvResidual pattern
      
      * use tensor->clear() instead of new variable
      
      * Update paddle/fluid/framework/ir/mkldnn/params_quantization_mkldnn_pass_tester.cc
      Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
      
      * Update paddle/fluid/framework/ir/mkldnn/params_quantization_mkldnn_pass_tester.cc
      Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
      
      * Update paddle/fluid/inference/tests/api/analyzer_lexical_analysis_gru_tester.cc
      Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
      
      * add log (review fix)c
      
      * review fix (2 functions to one)
      
      * code review: Conv->QuantizeConv
      
      * revert
      
      * fix formatting
      
      * remove unused functions
      
      * add paddle enforce
      Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
      b8b2d6a9
  31. 21 6月, 2022 1 次提交
  32. 05 6月, 2022 1 次提交
  33. 02 6月, 2022 1 次提交
  34. 19 5月, 2022 1 次提交