1. 09 11月, 2022 1 次提交
    • P
      Enable fc passes (#45704) · 7e914386
      Paulina Gacek 提交于
      * Analysis API interface for disabling fc passes
      
      * Unit tests corrected
      
      * Python API added
      
      * test runs only when PADDLE_WITH_MKLDNN
      
      * Fc op changed to relu in matmul_op_test
      
      * Disable fc passes in tests where acc drops
      
      * code formating
      
      * Unit test for analysisConf added
      
      * Unit test gpu added
      
      * fc passes disabled when iterations=0 in gru test
      
      * style
      
      * passes disabled when fp32 in gru test
      
      * fc passes disabled in lstm test
      
      * Import from inference, not fluid in doc
      7e914386
  2. 08 11月, 2022 1 次提交
  3. 07 11月, 2022 1 次提交
    • H
      suqeeze2 + transpose2 fuse onednn (#47592) · fa874a46
      Hui Zhang 提交于
      * suqeeze2 transpose2 fuse onednn
      
      * format
      
      * fix output shape
      
      * fix conflict
      
      * format
      
      * format
      
      * remove useless
      
      * remove log
      
      * simply pass
      
      * fix comment
      
      * fix
      
      * fix msg
      
      * fix error msg
      
      * format
      fa874a46
  4. 04 11月, 2022 1 次提交
  5. 03 11月, 2022 1 次提交
  6. 26 10月, 2022 2 次提交
  7. 20 10月, 2022 2 次提交
  8. 18 10月, 2022 1 次提交
    • W
      Merge layernorm trt fuse (#46320) · 5e9f491e
      Wang Bojun 提交于
      * first version, accuracy corrected
      
      * disable debug print
      
      * use blockReduceSum in phi
      
      * add UT
      
      * add opCompat
      
      * code style
      
      * code refine
      
      * bug fix
      
      * code refine
      
      * test fix
      
      * bugfix
      
      * codesytle fix
      
      * code style
      
      * code-style
      
      * code-style
      
      * code-style
      5e9f491e
  9. 17 10月, 2022 2 次提交
  10. 16 10月, 2022 1 次提交
  11. 10 10月, 2022 1 次提交
  12. 27 9月, 2022 1 次提交
  13. 21 9月, 2022 1 次提交
  14. 07 9月, 2022 1 次提交
    • W
      Layernorm shift partition (#45736) · 960109af
      wenbin 提交于
      * first commit
      
      * conver done
      
      * correct format
      
      * layernorm_shift_partition
      
      * correct convert
      
      * redefine plugin
      
      * runable
      
      * bug fix
      
      * modify ShiftPartitionPattern
      
      * correct
      
      * add UT
      
      * modify ut
      
      * compile
      
      * modify enforce
      
      * modify UT
      960109af
  15. 02 9月, 2022 1 次提交
  16. 30 8月, 2022 1 次提交
  17. 22 8月, 2022 3 次提交
  18. 16 8月, 2022 1 次提交
    • F
      convert multihead to oss (#45019) · f706d95d
      feng_shuai 提交于
      * convert multihead to oss
      
      * fix:bug
      
      * fix:delete const cast
      
      * fix:don't support bias_qk
      
      * add vit pass
      
      * fix:convert bug and add preln_residual_bias
      
      * support length=-1
      
      * add UT for convert
      
      * add no_bias_qk support for gpu_multihead_op
      
      * delete infer_shape depends on bias_qk
      
      * oss just can be used in T4 and A*
      
      * fix:change api for ROCM CI
      f706d95d
  19. 15 8月, 2022 1 次提交
  20. 14 8月, 2022 1 次提交
  21. 10 8月, 2022 1 次提交
  22. 05 8月, 2022 1 次提交
  23. 04 8月, 2022 1 次提交
    • S
      Matmuls with activation and elementwise_add fuses (#44655) · 0420d514
      Sławomir Siwek 提交于
      * Add unit tests
      
      * matmul_v2 + activation
      
      * matmuls + elementwise_add
      
      * matmul_v2 postops
      
      * transform matmul to v2
      
      * opcompat
      
      * fix fusing matmul with multipe outs
      
      * add shape constraints
      
      * remove unused vars
      
      * change pass order
      
      * - Unit tests to be debugged
      
      - fix
      
      - refactor
      
      - diagnostic
      
      - more diagnostic
      
      - fix
      
      - Fix number two
      
      - fix
      
      - fix
      
      - fix
      
      - alpha added
      
      - more fixes
      
      - compilation fix
      
      - removed diagnostic code
      
      - cosmetic fixes
      
      * lint
      
      * add alpha constraint
      
      * merge matmul refactor
      
      * trigger CI
      
      * - fix
      
      * - another fix
      
      * code style
      
      * add support for matmul+elementwise_add+activation
      
      * code style
      
      * fix bfloat16 bugs
      
      * change append_binary to append_sum
      Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>
      0420d514
  24. 02 8月, 2022 1 次提交
  25. 29 7月, 2022 1 次提交
  26. 21 7月, 2022 2 次提交
  27. 12 7月, 2022 1 次提交
    • S
      matmul+activation fuse pass (#43519) · 3333a439
      Sławomir Siwek 提交于
      * add method for post ops
      
      * format code
      
      * gpd
      
      * format style
      
      * add matmul+act test
      
      * implement matmul+activation
      
      * whitespaces
      
      * code style
      
      * python code format
      
      * Increase UT timeout
      
      * code format
      
      * update style
      
      * generalize activation fuse passes
      
      * change order
      
      * Unify activation GPD
      
      * Revert changes with op_act
      
      * remove softmax mkldnn attrs
      
      * set common name for act attributes
      
      * whitespace
      
      * append postops by helper function
      
      * ut style
      
      * revert changes related to quantization
      
      * Reduce redundancy
      
      * reduce number of parameters
      
      * trigger CI
      
      * validate attribute
      
      * trim unit test
      3333a439
  28. 08 7月, 2022 1 次提交
  29. 29 6月, 2022 1 次提交
  30. 24 6月, 2022 1 次提交
  31. 22 6月, 2022 1 次提交
  32. 21 6月, 2022 1 次提交
    • S
      Generalize conv+activation fuse pass (#43382) · 347e4b2e
      Sławomir Siwek 提交于
      * consolidate conv act passes
      
      * generalize conv_activation
      
      * integrate conv+act tests
      
      * code style format
      
      * whitespaces
      
      * remove timeout from old tests
      
      * implement comments from review
      
      * restore ut
      
      * whitespace
      
      * code style
      
      * transpose
      
      * fixes after review
      
      * method for gettin act
      
      * Change Paddle_enforce error type
      
      * code format
      
      * add missing opcompats
      347e4b2e
  33. 20 6月, 2022 1 次提交
  34. 09 6月, 2022 1 次提交