1. 30 12月, 2022 2 次提交
    • H
      [Custom device] Add custom_cpu testcase of custom_relu (#49300) · 69c7edcf
      HongyuJia 提交于
      * add custom_cpu testcase
      
      * update test_custom_device_setup
      
      * update path to custom_runtime
      
      * fix cmd wait
      
      * test Linux only
      
      * setup once
      
      * integrate to one run_cmd
      
      * add pip install
      
      * change timeout
      
      * add debug string
      
      * add debug string
      
      * add debug string
      
      * use os.system and change module name
      
      * add runtime
      
      * add more debug message
      
      * continue debug
      
      * timestamp
      
      * fix testcase import bug
      
      * remove error message
      
      * set TIMEOUT property
      69c7edcf
    • Z
      Support static graph code-gen for squeeze and unsqueeze op (#49430) · 23c1ac2c
      zyfncg 提交于
      * support static graph code-gen for squeeze op
      
      * generate static graph code of unsqueeze
      
      * refine op name
      
      * add extra output in op_compat
      
      * remove debug log
      23c1ac2c
  2. 28 12月, 2022 1 次提交
  3. 23 12月, 2022 1 次提交
  4. 20 12月, 2022 1 次提交
    • HappyHeavyRain's avatar
      Generate static graph code of some ops (#49092) · 11d7026b
      HappyHeavyRain 提交于
      * generate static graph code of some ops
      
      * change the default value of 'num' of 'unstack'
      
      * revert the pow
      
      * fix the 'real' 'imag' op error because of 'complex'
      
      * fix the code according to review
      11d7026b
  5. 15 12月, 2022 1 次提交
  6. 13 12月, 2022 2 次提交
  7. 12 12月, 2022 2 次提交
  8. 09 12月, 2022 4 次提交
  9. 05 12月, 2022 2 次提交
  10. 02 12月, 2022 1 次提交
    • J
      [Eager] Optimize Grad by prune useless branch (#47827) · d1e93be1
      Jiabin Yang 提交于
      * [Eager] Fix paddle.grad interface
      
      * [Eager] Support minimum SubGraph for GeneralGrad
      
      * Add needed_nodes to prune grad graph more thoroughly
      
      * [Eager] Add grad_node_trans_mapping_ to record which grad_node has been transformed to AccumulationNode
      
      * [Eager] Fix paddle.grad interface
      
      * Polish code
      
      * remove potential_stop_node
      
      * Add endding_nodes to enhance genSugraph logic
      
      * clear endding_nodes_
      
      * polish code
      
      * rename endding_nodes to endding_nades_
      
      * Refactor grad interface
      
      * Add register_hook case to fix coverage-ci
      
      * Fix code format
      
      * Refactor general_grad
      
      * Add more code comments
      
      * call clear directly to release GradSlotMeta
      
      * fix a mistake
      
      * fix matmul/ multiply kernel logic and optional input in yaml, fill zeros logic and so on.
      
      * fix batch_norm_double_grad yaml optional config
      
      * fix tanh_triple_grad yaml and kernels
      
      * fix MultiplyTripleGradKernel optional logic
      
      * fix merge mistake
      
      * fix compile error
      
      * remove legacy attr for bn
      
      * polish code
      
      * fix some kernel
      
      * merge develop
      
      * fix error
      
      * remote log
      
      * fix kernel with full like
      
      * hide value log behind
      
      * hide value log behind
      
      * fix matmul_triple grad
      Co-authored-by: NWeilong Wu <veyron_wu@163.com>
      d1e93be1
  11. 01 12月, 2022 1 次提交
  12. 30 11月, 2022 2 次提交
  13. 29 11月, 2022 4 次提交
  14. 28 11月, 2022 3 次提交
  15. 25 11月, 2022 3 次提交
  16. 24 11月, 2022 1 次提交
    • H
      [Phi Support CuDNN] Support ALL CuDNN (#47865) · 1623f1b4
      HongyuJia 提交于
      * support default use_gpudnn=True
      
      * fully support cudnn in phi
      
      * add header file
      
      * add white_list, verify accuracy
      
      * phi support all cudnn
      
      * opt affine_grad
      
      * try different arches of pretrained_model
      
      * try different arches of pretrained_model
      
      * add debug string
      
      * debug eager_method
      
      * add debug string, pass all local ctest
      
      * polish all debug code
      
      * delete use_cudnn relevant code autogen
      
      * fix depthwise_conv2d
      
      * Share all other members of Tensor except use_cudnn
      
      * polish codes according to review opinion
      
      * polish codes according to review opinion, fix bug
      
      * polish codes according to review opinion, opt performance
      
      * polish codes according to review opinion, fix pooling.py
      1623f1b4
  17. 22 11月, 2022 1 次提交
  18. 17 11月, 2022 3 次提交
  19. 16 11月, 2022 1 次提交
  20. 14 11月, 2022 1 次提交
  21. 11 11月, 2022 2 次提交
  22. 10 11月, 2022 1 次提交