1. 02 12月, 2022 5 次提交
    • R
      fix phi capi kernel registration macro error (#48616) · 0f3b1ad6
      ronnywang 提交于
      * fix capi kernel registration macro error
      
      * update
      0f3b1ad6
    • W
      [Eager, Performance Optimization] modify AllocateFrom to reduce deconstruction... · 708c4f88
      Weilong Wu 提交于
      [Eager, Performance Optimization] modify AllocateFrom to reduce deconstruction of shared_ptr (#48548)
      
      708c4f88
    • J
      [Eager] Optimize Grad by prune useless branch (#47827) · d1e93be1
      Jiabin Yang 提交于
      * [Eager] Fix paddle.grad interface
      
      * [Eager] Support minimum SubGraph for GeneralGrad
      
      * Add needed_nodes to prune grad graph more thoroughly
      
      * [Eager] Add grad_node_trans_mapping_ to record which grad_node has been transformed to AccumulationNode
      
      * [Eager] Fix paddle.grad interface
      
      * Polish code
      
      * remove potential_stop_node
      
      * Add endding_nodes to enhance genSugraph logic
      
      * clear endding_nodes_
      
      * polish code
      
      * rename endding_nodes to endding_nades_
      
      * Refactor grad interface
      
      * Add register_hook case to fix coverage-ci
      
      * Fix code format
      
      * Refactor general_grad
      
      * Add more code comments
      
      * call clear directly to release GradSlotMeta
      
      * fix a mistake
      
      * fix matmul/ multiply kernel logic and optional input in yaml, fill zeros logic and so on.
      
      * fix batch_norm_double_grad yaml optional config
      
      * fix tanh_triple_grad yaml and kernels
      
      * fix MultiplyTripleGradKernel optional logic
      
      * fix merge mistake
      
      * fix compile error
      
      * remove legacy attr for bn
      
      * polish code
      
      * fix some kernel
      
      * merge develop
      
      * fix error
      
      * remote log
      
      * fix kernel with full like
      
      * hide value log behind
      
      * hide value log behind
      
      * fix matmul_triple grad
      Co-authored-by: NWeilong Wu <veyron_wu@163.com>
      d1e93be1
    • Y
      add silu, silu_grad, unfold and unfold_grad xpu kernels (#48325) · f71de378
      ykkk2333 提交于
      * add stat tool
      
      * add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun
      
      * add silu, unfold and their grads,test=kunlun
      f71de378
    • C
      polish fusion kernel naming (#48609) · 61486bf2
      Chen Weihang 提交于
      61486bf2
  2. 01 12月, 2022 2 次提交
  3. 30 11月, 2022 7 次提交
  4. 29 11月, 2022 12 次提交
  5. 28 11月, 2022 12 次提交
  6. 25 11月, 2022 2 次提交