1. 22 8月, 2022 2 次提交
  2. 16 8月, 2022 1 次提交
    • F
      convert multihead to oss (#45019) · f706d95d
      feng_shuai 提交于
      * convert multihead to oss
      
      * fix:bug
      
      * fix:delete const cast
      
      * fix:don't support bias_qk
      
      * add vit pass
      
      * fix:convert bug and add preln_residual_bias
      
      * support length=-1
      
      * add UT for convert
      
      * add no_bias_qk support for gpu_multihead_op
      
      * delete infer_shape depends on bias_qk
      
      * oss just can be used in T4 and A*
      
      * fix:change api for ROCM CI
      f706d95d
  3. 15 8月, 2022 1 次提交
  4. 14 8月, 2022 1 次提交
  5. 10 8月, 2022 1 次提交
  6. 05 8月, 2022 1 次提交
  7. 04 8月, 2022 1 次提交
    • S
      Matmuls with activation and elementwise_add fuses (#44655) · 0420d514
      Sławomir Siwek 提交于
      * Add unit tests
      
      * matmul_v2 + activation
      
      * matmuls + elementwise_add
      
      * matmul_v2 postops
      
      * transform matmul to v2
      
      * opcompat
      
      * fix fusing matmul with multipe outs
      
      * add shape constraints
      
      * remove unused vars
      
      * change pass order
      
      * - Unit tests to be debugged
      
      - fix
      
      - refactor
      
      - diagnostic
      
      - more diagnostic
      
      - fix
      
      - Fix number two
      
      - fix
      
      - fix
      
      - fix
      
      - alpha added
      
      - more fixes
      
      - compilation fix
      
      - removed diagnostic code
      
      - cosmetic fixes
      
      * lint
      
      * add alpha constraint
      
      * merge matmul refactor
      
      * trigger CI
      
      * - fix
      
      * - another fix
      
      * code style
      
      * add support for matmul+elementwise_add+activation
      
      * code style
      
      * fix bfloat16 bugs
      
      * change append_binary to append_sum
      Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>
      0420d514
  8. 02 8月, 2022 1 次提交
  9. 29 7月, 2022 1 次提交
  10. 21 7月, 2022 2 次提交
  11. 12 7月, 2022 1 次提交
    • S
      matmul+activation fuse pass (#43519) · 3333a439
      Sławomir Siwek 提交于
      * add method for post ops
      
      * format code
      
      * gpd
      
      * format style
      
      * add matmul+act test
      
      * implement matmul+activation
      
      * whitespaces
      
      * code style
      
      * python code format
      
      * Increase UT timeout
      
      * code format
      
      * update style
      
      * generalize activation fuse passes
      
      * change order
      
      * Unify activation GPD
      
      * Revert changes with op_act
      
      * remove softmax mkldnn attrs
      
      * set common name for act attributes
      
      * whitespace
      
      * append postops by helper function
      
      * ut style
      
      * revert changes related to quantization
      
      * Reduce redundancy
      
      * reduce number of parameters
      
      * trigger CI
      
      * validate attribute
      
      * trim unit test
      3333a439
  12. 08 7月, 2022 1 次提交
  13. 29 6月, 2022 1 次提交
  14. 24 6月, 2022 1 次提交
  15. 22 6月, 2022 1 次提交
  16. 21 6月, 2022 1 次提交
    • S
      Generalize conv+activation fuse pass (#43382) · 347e4b2e
      Sławomir Siwek 提交于
      * consolidate conv act passes
      
      * generalize conv_activation
      
      * integrate conv+act tests
      
      * code style format
      
      * whitespaces
      
      * remove timeout from old tests
      
      * implement comments from review
      
      * restore ut
      
      * whitespace
      
      * code style
      
      * transpose
      
      * fixes after review
      
      * method for gettin act
      
      * Change Paddle_enforce error type
      
      * code format
      
      * add missing opcompats
      347e4b2e
  17. 20 6月, 2022 1 次提交
  18. 09 6月, 2022 1 次提交
  19. 05 6月, 2022 1 次提交
  20. 02 6月, 2022 2 次提交
  21. 20 5月, 2022 1 次提交
  22. 19 5月, 2022 1 次提交
  23. 17 5月, 2022 1 次提交
  24. 12 5月, 2022 1 次提交
  25. 10 5月, 2022 1 次提交
    • P
      Rea-dd conv_affine_channel fuse pass as oneDNN only pass (#41998) · 3540d33b
      piotrekobi 提交于
      * Readd conv_affine_channel fuse pass as mkldnn pass
      
      * Fix formatting
      
      * Add new test to parallel_UT_rule.py
      
      * Fix Coverage and Windows CI issues
      
      * Revert "Fix Coverage and Windows CI issues"
      
      This reverts commit f33459846385c9fd51c07f9f44e7ff283a652637.
      
      * Fix CI errors
      
      * Remove unnecessary conv_eltwise_add_affine_channel fuse pass
      
      * Remove test from parallel_UT_rule.py
      3540d33b
  26. 14 4月, 2022 3 次提交
    • S
      FC+elementwise_add (residual connection) (#41776) · 92d8d0bc
      Sławomir Siwek 提交于
      * Change tensor name to match activation
      
      * declare fc_eltwise_add pass
      
      * merge conv_eltwise refactor PR
      
      * first compilable draft
      
      * unittest feedback tools
      
      * Fuse pass tester
      
      * Move IsReachable() to shared file
      
      * 100% coverage of fuse_pass_tester.cc
      
      * register pass
      
      * Add bias node
      
      * Improve unit tests / remove bias node from pattern
      
      * improve fc_eltwiseadd_unittest
      
      * cancel eltwise_add fuse if act is already fused
      
      * Add elementwise_input scale
      
      * Residual MVP
      
      * Add new FC attrs
      
      * Add more test cases
      
      * Add missing op attrs
      
      * Adapt code to new Elementwise pattern
      
      * reuse existing fcpattern
      
      * improve code style
      
      * remove unused arguments
      
      * fix typo
      
      * remove whitespace
      
      * remove int8 related code
      
      * Remove attributes from base ops
      
      * style
      
      * style check
      
      * Remove input from base op
      
      * Set attribute during fuse
      
      * ut timeout
      
      * download and test model
      
      * DRY
      
      * apply feedback from review
      
      * Style check
      
      * fix typo
      
      * cosmetic changes
      
      * explicitly set residual as output
      
      * VIT-OCR accuracy check
      
      * trigger CI
      
      * remove whitespaces
      
      * fix missing data file
      92d8d0bc
    • B
      add mkldnn int8 pass [step3] (#41599) · 8e2d4d30
      baoachun 提交于
      * add mkldnn int8 pass [step3]
      
      * Add test for compute_propagate_scales_mkldnn_pass
      
      * update pass
      
      * update api comment and python api
      Co-authored-by: Nwozna <joanna.wozna@intel.com>
      8e2d4d30
    • J
      Added shuffle_channel BF16/FP32 FWD oneDNN kernel (#39756) · c7623d72
      jakpiase 提交于
      * added shuffle_channel bf16/fp32 fwd kernel
      
      * added missing files
      
      * CI fix
      
      * changed from pten to phi
      
      * tmp save
      
      * added reviewers suggestions
      
      * fix for test
      c7623d72
  27. 02 4月, 2022 1 次提交
  28. 17 3月, 2022 1 次提交
  29. 14 3月, 2022 1 次提交
    • T
      Add an elementwise + activation fusion pass. (#36541) · 3f219160
      Tomasz Socha 提交于
      * Add elementwise add and activation fuse pass
      
      * Fix copy ellision
      
      * More flexible pattern detector
      
      * More flexible fusion pass
      
      * Update lists for pass
      
      * Add support for Pow operator
      
      * Add support for more activation types
      
      * Style
      
      * Rename fusion pass
      
      * First version of tests
      
      * Dirty version of pass
      
      * Polished version
      
      * Update pbtxt
      
      * Style
      
      * Update names
      
      * Style
      
      * Use PADDLE_ENFORCE_EQ
      
      * Save error message to variable
      
      * WO for error checks
      
      * CR
      
      * Static style check
      
      * Add missing 'activation_scale' attribute
      
      * Add relu6 and sigmoid activations
      
      * Style
      
      * Fix fuse list formating
      
      * Sync filenames for fuse pass files
      
      * Fix cmake after move
      
      * Fix registration
      
      * Fix pass name in tests
      
      * Add missing activations to checker
      
      * WIPS
      
      * Working mul op
      
      * Working sub
      
      * Working Add
      
      * Remove pten includes
      
      * Remove some forward declarations
      
      * Remove Includes
      
      * Fixes
      
      * Remove default kernels
      
      * Add check if post_ops attributes are avaliable
      
      * Style
      
      * Code adjustment
      
      * Register default kernels
      
      * We have year 2022 not 2021...
      Co-authored-by: Njakpiase <jakpia21@gmail.com>
      Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com>
      
      * Fast review fixes
      Co-authored-by: Njakpiase <jakpia21@gmail.com>
      Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com>
      
      * Review Fix
      
      * Rename one_dnn -> onednn
      
      * Style after review
      
      * Fast and dirty fix for quantization
      
      * Update tests
      
      * Style
      
      * Fix mkldnn_quantizer config
      
      * Add Joanna's suggestion.
      
      * Check if operator is explicitly disables on OneDNN
      
      * Try to use unregistered attributes
      
      * Style
      
      * Test new framework
      
      * FXI
      
      * FXII
      
      * Update test
      
      * Style
      Co-authored-by: Njakpiase <jakpia21@gmail.com>
      Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com>
      3f219160
  30. 01 3月, 2022 1 次提交
  31. 15 2月, 2022 1 次提交
    • W
      [Paddle-Inference] support preln_ernie: add... · 2bc91cc5
      Wangzheee 提交于
      [Paddle-Inference] support preln_ernie: add preln_embedding_eltwise_layernorm_fuse_pass, preln_skip_layernorm_fuse_pass (#39508)
      
      * support preln_ernie
      
      * support preln_ernie
      2bc91cc5
  32. 09 2月, 2022 1 次提交
  33. 18 1月, 2022 1 次提交
    • S
      Mish FP32/BF16 kernel, conv and fc fuse passes (#38623) · 1d18bc2c
      Sławomir Siwek 提交于
      * Mish
      
      * Change exp() library
      
      * mish fuse pass
      
      * mish attrs
      
      * fixes
      
      * mishop maker
      
      * remove attrs
      
      * mish kernal for bf16
      
      * fc+mish fuse
      
      * fix code format error
      
      * Resolve merge conflicts
      
      * Update mish operator version
      
      * update mish variable to new naming convention
      1d18bc2c
  34. 20 12月, 2021 2 次提交