1. 11 Jul, 2022 1 commit
  2. 06 Jul, 2022 1 commit
    • Performance fix for recommender model (#43803) · 48abaec6
      jakpiase committed
      * fix for binary kernels
      
      * fixed performance for elementwise, reduce and concat
      
      * added comment
      
      * CI fix
      
      * CI fix
      
      * added formatting
      
      * reverted one file
      
      * Revert "reverted one file"
      
      This reverts commit 54725e1c62318d3a18913821200e973816751019.
      
      * Revert "added formatting"
      
      This reverts commit b9795dd253d755a329376d7ab0542860aa7815c6.
      
      * added enforcing oneDNN BF16 reduce kernel
      
      * fix for eltwise and reenabled reshape kernels
      
      * fix for binary handler
      
      * added formatting
      
      * reverted changes for flatten, squeeze and reshape ops
  3. 26 Jun, 2022 1 commit
  4. 21 Jun, 2022 1 commit
    • Generalize conv+activation fuse pass (#43382) · 347e4b2e
      Sławomir Siwek committed
      * consolidate conv act passes
      
      * generalize conv_activation
      
      * integrate conv+act tests
      
      * code style format
      
      * whitespaces
      
      * remove timeout from old tests
      
      * implement comments from review
      
      * restore ut
      
      * whitespace
      
      * code style
      
      * transpose
      
      * fixes after review
      
      * method for getting act
      
      * Change Paddle_enforce error type
      
      * code format
      
      * add missing opcompats
  5. 31 May, 2022 1 commit
  6. 18 Mar, 2022 1 commit
  7. 16 Mar, 2022 1 commit
  8. 14 Mar, 2022 1 commit
    • Add an elementwise + activation fusion pass. (#36541) · 3f219160
      Tomasz Socha committed
      * Add elementwise add and activation fuse pass
      
      * Fix copy elision
      
      * More flexible pattern detector
      
      * More flexible fusion pass
      
      * Update lists for pass
      
      * Add support for Pow operator
      
      * Add support for more activation types
      
      * Style
      
      * Rename fusion pass
      
      * First version of tests
      
      * Dirty version of pass
      
      * Polished version
      
      * Update pbtxt
      
      * Style
      
      * Update names
      
      * Style
      
      * Use PADDLE_ENFORCE_EQ
      
      * Save error message to variable
      
      * WO for error checks
      
      * CR
      
      * Static style check
      
      * Add missing 'activation_scale' attribute
      
      * Add relu6 and sigmoid activations
      
      * Style
      
      * Fix fuse list formatting
      
      * Sync filenames for fuse pass files
      
      * Fix cmake after move
      
      * Fix registration
      
      * Fix pass name in tests
      
      * Add missing activations to checker
      
      * WIPS
      
      * Working mul op
      
      * Working sub
      
      * Working Add
      
      * Remove pten includes
      
      * Remove some forward declarations
      
      * Remove Includes
      
      * Fixes
      
      * Remove default kernels
      
      * Add check if post_ops attributes are available
      
      * Style
      
      * Code adjustment
      
      * Register default kernels
      
      * We have year 2022 not 2021...
      Co-authored-by: jakpiase <jakpia21@gmail.com>
      Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>
      
      * Fast review fixes
      Co-authored-by: jakpiase <jakpia21@gmail.com>
      Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>
      
      * Review Fix
      
      * Rename one_dnn -> onednn
      
      * Style after review
      
      * Fast and dirty fix for quantization
      
      * Update tests
      
      * Style
      
      * Fix mkldnn_quantizer config
      
      * Add Joanna's suggestion.
      
      * Check if operator is explicitly disabled on OneDNN
      
      * Try to use unregistered attributes
      
      * Style
      
      * Test new framework
      
      * FXI
      
      * FXII
      
      * Update test
      
      * Style
      Co-authored-by: jakpiase <jakpia21@gmail.com>
      Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>
  9. 20 Feb, 2022 1 commit
  10. 19 Feb, 2022 1 commit
    • [Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264
      Aurelius84 committed
      * Unify paddle/pten::framework::ddim into pten::ddim
      
      * fix paddle namespace
      
      * compile successfully
      
      * fix npu src file
      
      * fix conflict
      
      * fix conflict
      
      * fix tensorrt compiler error
      
      * fix conflict
      
      * fix conflict
      
      * fix test file conflict
      
      * fix conflict
      
      * fix mlu file conflict
      
      * fix mlu file conflict
      
      * fix cinn header file conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
  11. 22 Nov, 2021 1 commit
  12. 17 Nov, 2021 1 commit
  13. 16 Aug, 2021 1 commit
    • [oneDNN] Fix to 34554 (same as previous PR but should build with GPU) (#34859) · 9cb65653
      Jacek Czaja committed
      * - Added softmax without caching
      
      * - Binary is no longer manually cached
      
      * - Activation onednn caching removed
      
      * - Removed manual caching of activation
      
      * - modified UT
      
      * - fix
      
      * - fix
      
      * - fixes to building
      
      * - fix
      
      * - fix
      
      * - fix to UT
      
      * - Faulty UT workaround
      
      * - approval workaround
      
      * - Fixes after review
      
      * - compilation fixes
      
      * - more lint fixes
      
      * - more fixes after review
      
      * - fixes after another round of review
      
      * - hopefully compilation fix
      
      - compilation fix
  14. 12 Aug, 2021 1 commit
  15. 11 Aug, 2021 1 commit
    • [oneDNN] Fix to issue #34554 (#34623) · 0a5c99e8
      Jacek Czaja committed
      * - Added softmax without caching
      
      * - Binary is no longer manually cached
      
      * - Activation onednn caching removed
      
      * - Removed manual caching of activation
      
      * - modified UT
      
      * - fix
      
      * - fix
      
      * - fixes to building
      
      * - fix
      
      * - fix
      
      * - fix to UT
      
      * - Faulty UT workaround
      
      * - approval workaround
      
      * - Fixes after review
      
      * - compilation fixes
      
      * - more lint fixes
      
      * - more fixes after review
      
      * - fixes after another round of review
  16. 24 Jun, 2021 1 commit
  17. 14 Apr, 2021 1 commit
  18. 19 Mar, 2021 1 commit
  19. 25 Jan, 2021 1 commit
  20. 15 Jan, 2021 1 commit
  21. 18 Jun, 2020 1 commit