1. 13 4月, 2023 1 次提交
    • H
      [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h (#52651) · 5664ea26
      HongyuJia 提交于
      * [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h
      
      * Add logging.h for profiler.cc
      
      * Add logging.h for gloo_utils.h
      
      * Add logging.h for addmm_kernel_impl.h
      
      * Add logging.h for addmm_grad_kernel_impl.h
      
      * Add logging.h for p_send_kernel.cu
      
      * Add logging.h for determinant_grad_kernel_impl.h
      
      * Add logging.h for p_recv_kernel.cu
      
      * Add logging.h for elementwise_grad_base.h
      
      * Add logging.h for transfer_layout_kernel.cc
      
      * Add logging.h for eigvals_kernel.cc and index_select_impl.h
      
      * Add logging.h for all files in kernel directory
      
      * Add logging.h for xpu_info.cc
      
      * Add logging.h for xpu
      5664ea26
  2. 12 4月, 2023 1 次提交
    • G
      [AMP OP&Test] support bf16 for batch norm (#52407) · 523f8a26
      Guoxia Wang 提交于
      * [AMP OP&Test] support bf16 for batchnorm
      
      * codestyle
      
      * Update batch_norm_grad_kernel.cu
      
      * Update batch_norm_kernel.cu
      
      * fix codestyle
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * Update batch_norm_kernel.cc
      523f8a26
  3. 03 4月, 2023 1 次提交
  4. 09 2月, 2023 1 次提交
  5. 08 2月, 2023 1 次提交
  6. 31 1月, 2023 1 次提交
  7. 13 1月, 2023 1 次提交
  8. 22 12月, 2022 1 次提交
  9. 12 12月, 2022 1 次提交
  10. 09 12月, 2022 2 次提交
  11. 02 12月, 2022 1 次提交
    • J
      [Eager] Optimize Grad by prune useless branch (#47827) · d1e93be1
      Jiabin Yang 提交于
      * [Eager] Fix paddle.grad interface
      
      * [Eager] Support minimum SubGraph for GeneralGrad
      
      * Add needed_nodes to prune grad graph more thoroughly
      
      * [Eager] Add grad_node_trans_mapping_ to record which grad_node has been transformed to AccumulationNode
      
      * [Eager] Fix paddle.grad interface
      
      * Polish code
      
      * remove potential_stop_node
      
      * Add endding_nodes to enhance genSugraph logic
      
      * clear endding_nodes_
      
      * polish code
      
      * rename endding_nodes to endding_nades_
      
      * Refactor grad interface
      
      * Add register_hook case to fix coverage-ci
      
      * Fix code format
      
      * Refactor general_grad
      
      * Add more code comments
      
      * call clear directly to release GradSlotMeta
      
      * fix a mistake
      
      * fix matmul/ multiply kernel logic and optional input in yaml, fill zeros logic and so on.
      
      * fix batch_norm_double_grad yaml optional config
      
      * fix tanh_triple_grad yaml and kernels
      
      * fix MultiplyTripleGradKernel optional logic
      
      * fix merge mistake
      
      * fix compile error
      
      * remove legacy attr for bn
      
      * polish code
      
      * fix some kernel
      
      * merge develop
      
      * fix error
      
      * remote log
      
      * fix kernel with full like
      
      * hide value log behind
      
      * hide value log behind
      
      * fix matmul_triple grad
      Co-authored-by: NWeilong Wu <veyron_wu@163.com>
      d1e93be1
  12. 28 11月, 2022 1 次提交
    • H
      [PHI decoupling] move several header files from fluid to phi (#48415) · fd9c91c3
      huangjiyi 提交于
      * decouple cudnn_desc.h from fluid
      
      * move cudnn_desc.h from fluid to phi
      
      * fix bugs
      
      * decouple cudnn_helper.h from fluid
      
      * fix bugs
      
      * move cudnn_helper.h from fluid to phi
      
      * add fluid cudnn_helper.h
      
      * move miopen_desc.h from fluid to phi
      
      * move miopen_helper.h from fluid to phi
      
      * fix bugs
      
      * move gpu_dnn.h from fluid to phi
      
      * fix bugs
      
      * update copyright year
      
      * simplify gpu_dnn.h in fluid
      
      * fix bugs
      
      * fix xpu build bug
      
      * fix compile bug
      
      * fix bug
      fd9c91c3
  13. 24 11月, 2022 1 次提交
  14. 17 11月, 2022 1 次提交
  15. 09 11月, 2022 1 次提交
  16. 03 11月, 2022 1 次提交
  17. 02 11月, 2022 1 次提交
  18. 17 10月, 2022 1 次提交
  19. 03 8月, 2022 1 次提交
  20. 14 7月, 2022 1 次提交
  21. 21 6月, 2022 1 次提交
  22. 05 6月, 2022 1 次提交
  23. 27 5月, 2022 1 次提交
  24. 26 5月, 2022 1 次提交
  25. 04 5月, 2022 1 次提交
  26. 29 4月, 2022 1 次提交
  27. 15 4月, 2022 1 次提交
  28. 04 4月, 2022 1 次提交
  29. 22 3月, 2022 1 次提交
  30. 03 3月, 2022 1 次提交
    • H
      Move bn to pten (#39347) · ebd0f512
      hong 提交于
      * add bn cpu version; test=develop
      
      * move batch norm to pten
      
      * move batch norm to pten; test=develop
      
      * fix bug; test=develop
      
      * fix func::tranpose depend bug; test=develop
      
      * fix compile bugs; test=develop
      
      * fix use_op batch_norm bug; test=develop
      
      * fix cudnn bn add relu test; test=develop
      
      * fix pten context build and double grad bug; test= develop
      
      * remve useless code; test=develop
      
      * add batch norm gpu fp16 support; test=develop
      
      * fix test bn op bug; test=develop
      
      * remove output dtype set; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      
      * fix applay pass to program bug; test=develop
      
      * revert to develop; test=develop
      
      * fix rocm bug; test=develop
      
      * revert operator to develop; test=develop
      
      * fix pre_commit; test=develop
      
      * fix statci check error; test=develop
      
      * resolve conflict; test=develop
      
      * ana batch norm bug;
      
      * revert batch norm op
      
      * resolve conlict
      
      * fix nan inf and speed bug; test=develop
      
      * fix bug; test=develop
      
      * fix error; test=develop
      
      * test expand op; test=develop
      
      * fix bug; test=develop
      
      * resolve confilct
      
      * resolve confilct; test=develop
      
      * polish code; test=develop
      
      * polish code; test=develop
      
      * change mutable data to ctx alloc; test=develop
      
      * make format same with ci; test=develop
      
      * fix format error with ci; test=develop
      ebd0f512