1. 15 3月, 2022 33 次提交
  2. 14 3月, 2022 7 次提交
    • S
      [Phi]Add diag_v2 grad kernel (#40447) · e157f2af
      Siming Dai 提交于
      * Add diag grad kernel
      
      * fix unittest case
      
      * add float16, remove const &
      
      * delete diag_grad in op_utils.h
      e157f2af
    • Z
      [PHI] Move set_value_grad kernel form fluid to phi (#40478) · 3149e399
      zyfncg 提交于
      * move set_value_grad kernel form fluid to phi
      
      * add unittest for passing coverage ci
      3149e399
    • T
      Add an elementwise + activation fusion pass. (#36541) · 3f219160
      Tomasz Socha 提交于
      * Add elementwise add and activation fuse pass
      
      * Fix copy ellision
      
      * More flexible pattern detector
      
      * More flexible fusion pass
      
      * Update lists for pass
      
      * Add support for Pow operator
      
      * Add support for more activation types
      
      * Style
      
      * Rename fusion pass
      
      * First version of tests
      
      * Dirty version of pass
      
      * Polished version
      
      * Update pbtxt
      
      * Style
      
      * Update names
      
      * Style
      
      * Use PADDLE_ENFORCE_EQ
      
      * Save error message to variable
      
      * WO for error checks
      
      * CR
      
      * Static style check
      
      * Add missing 'activation_scale' attribute
      
      * Add relu6 and sigmoid activations
      
      * Style
      
      * Fix fuse list formating
      
      * Sync filenames for fuse pass files
      
      * Fix cmake after move
      
      * Fix registration
      
      * Fix pass name in tests
      
      * Add missing activations to checker
      
      * WIPS
      
      * Working mul op
      
      * Working sub
      
      * Working Add
      
      * Remove pten includes
      
      * Remove some forward declarations
      
      * Remove Includes
      
      * Fixes
      
      * Remove default kernels
      
      * Add check if post_ops attributes are avaliable
      
      * Style
      
      * Code adjustment
      
      * Register default kernels
      
      * We have year 2022 not 2021...
      Co-authored-by: Njakpiase <jakpia21@gmail.com>
      Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com>
      
      * Fast review fixes
      Co-authored-by: Njakpiase <jakpia21@gmail.com>
      Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com>
      
      * Review Fix
      
      * Rename one_dnn -> onednn
      
      * Style after review
      
      * Fast and dirty fix for quantization
      
      * Update tests
      
      * Style
      
      * Fix mkldnn_quantizer config
      
      * Add Joanna's suggestion.
      
      * Check if operator is explicitly disables on OneDNN
      
      * Try to use unregistered attributes
      
      * Style
      
      * Test new framework
      
      * FXI
      
      * FXII
      
      * Update test
      
      * Style
      Co-authored-by: Njakpiase <jakpia21@gmail.com>
      Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com>
      3f219160
    • F
      [MLU] add merged_momentum mlu kernel (#40406) · 1f7b2516
      fwenguang 提交于
      1f7b2516
    • C
      optimize group_norm op backward (#39944) · 5720537e
      crystal 提交于
      * optimize backwad
      
      * optimize group_norm backward
      
      * Add vectorized code
      
      * move assignment code
      
      * merge function
      
      * move code
      
      * optimize code
      
      * Modify function name
      5720537e
    • L
      Optimize bilinear_interp backward (#39423) · 9e1f762c
      Lijunhui 提交于
      * bilinear_bw init
      
      * optimize code
      
      * optimize
      
      * optimize 2
      
      * optimize functions
      
      * modify func name
      9e1f762c
    • L
      fix gpu callback (#40445) · 2c21d240
      Leo Chen 提交于
      * fix gpu conetxt callback
      
      * fix gpu callback
      
      * fix callback early destruct problem
      2c21d240