1. 17 Feb, 2023 1 commit
  2. 16 Feb, 2023 1 commit
  3. 11 Feb, 2023 1 commit
    • [TRT] elementwise_add+transpose fusion (#50081) · fd0d4fa4
      Wang Bojun committed
      * eleadd_trans first version
      
      log fix
      
      * refine code for linear format, add pass check
      
      * linear format refine and ut fix
      
      * fix ut
      
      * windows ut
      
      * windows ut 2
      
      * move tensorMeta and alloc to configure
  4. 10 Feb, 2023 1 commit
  5. 09 Feb, 2023 1 commit
  6. 08 Feb, 2023 1 commit
    • fuse quantize+transpose and transpose+dequantize (#49509) · 197a4ffe
      Paulina Gacek committed
      * QuantTranpose pattern is being found by pass
      
      * quant + transpose fuse
      
      * code style changes
      
      * UT written, reorder fixed
      
      * Dequantize + transpose2 fuse added
      
      * pass name changed
      
      * UT added & shift corrected
      
      * got rid of redundancy
      
      * review changes
      
      * AsIntermediate corrected
      
      * compat added
  7. 06 Feb, 2023 1 commit
  8. 31 Jan, 2023 1 commit
    • gn_silu (#49928) · 111075a3
      wenbin committed
      * gn_silu
      
      * add ut
      
      * set TIMEOUT
      
      * correct comments
      
      * comments
      
      * disable windows ut
      
      * rename parameter
  9. 16 Jan, 2023 2 commits
  10. 13 Jan, 2023 1 commit
  11. 09 Jan, 2023 2 commits
  12. 06 Jan, 2023 1 commit
  13. 05 Jan, 2023 1 commit
  14. 04 Jan, 2023 1 commit
  15. 03 Jan, 2023 1 commit
  16. 22 Dec, 2022 1 commit
  17. 19 Dec, 2022 1 commit
  18. 14 Dec, 2022 2 commits
  19. 12 Dec, 2022 1 commit
  20. 08 Dec, 2022 4 commits
  21. 06 Dec, 2022 1 commit
  22. 05 Dec, 2022 1 commit
    • Reverse roll fuse (#46914) · feb68dd1
      Wang Bojun committed
      * pass
      
      * pass
      
      * draft version
      
      * share mem opt
      
      * remove sharemem
      
      * add pattern for the case with circle_shift=0
      
      * add UT
      
      * pass opt
      
      * test_fix
      
      * code-commit
      
      * code-style
      
      * code style
      
      * code-style
      
      * ut-fix
      
      * op teller refine
      
      * resolve conflict
      
      * adjust position op_teller list and pass order for swin
      
      * ut code style update
      
      * adjust paddle pass order
      
      * refine pass order
      
      * refine pass order
      
      * refine pass order
  23. 30 Nov, 2022 2 commits
  24. 23 Nov, 2022 1 commit
  25. 21 Nov, 2022 2 commits
    • add fc-residual quantization (#46917) · fed0ed34
      Sylwester Fraczek committed
      * add fc-residual quantization
      
      * revert removal of check for use_mkldnn
      
      * fix bug
      
      * add disable_logs
      
      * review fix
      
      call twice AreScalesPresntForNodes instead of if-else
      
      * rewrite residual input to output
      
      * revert fc mkldnn taking residual data
      
      * format fix
      
      * fix LoDTensor->DenseTensor
      
      * LoDTensor->DenseTensor
      
      * output->input
      
      * revert changes to unsupported script
      
      * remove fc residualdata from output blocklist in cpu_bfloat16_pass.cc
    • delete unnecessary shape and slice op (#48112) · 41483383
      RichardWooSJTU committed
  26. 16 Nov, 2022 1 commit
  27. 15 Nov, 2022 1 commit
  28. 10 Nov, 2022 2 commits
  29. 09 Nov, 2022 2 commits
    • Fix U2++ perf (#47780) · b1fb2360
      joanna.wozna.intel committed
    • Enable fc passes (#45704) · 7e914386
      Paulina Gacek committed
      * Analysis API interface for disabling fc passes
      
      * Unit tests corrected
      
      * Python API added
      
      * test runs only when PADDLE_WITH_MKLDNN
      
      * Fc op changed to relu in matmul_op_test
      
      * Disable fc passes in tests where acc drops
      
      * code formatting
      
      * Unit test for analysisConf added
      
      * Unit test gpu added
      
      * fc passes disabled when iterations=0 in gru test
      
      * style
      
      * passes disabled when fp32 in gru test
      
      * fc passes disabled in lstm test
      
      * Import from inference, not fluid in doc
  30. 08 Nov, 2022 1 commit