1. 28 Dec, 2022 1 commit
    • [new-exec] Ahead-Of-Time choosing kernel (#48789) · 63d2d722
      Leo Chen committed
      * add skip run
      
      * alloc minimum memory
      
      * skip check_size in Alloc
      
      * skip check_size in Alloc
      
      * skip check_size in Alloc
      
      * fix cases when tensor is initialized or empty
      
      * alloc empty output for place info
      
      * add test
      
      * increase timeout
      
      * format code
      
      * skip cpu
      
      * add cudnn_deterministic
      
      * fit for hostAlloc
      
      * follow comments
      
      * change check_size to fake_alloc
  2. 23 Dec, 2022 1 commit
  3. 21 Dec, 2022 1 commit
  4. 13 Dec, 2022 1 commit
  5. 12 Dec, 2022 1 commit
    • Enhance check_nan_inf implementation for CPU. (#48591) · 69e695b7
      Yiqun Liu committed
      * Enable to print device info.
      
      * Enhance the nan and inf checking for cpu.
      
      * Implement a common print function.
      
      * Unify the check of complex numbers.
      
      * Rewrite the omp method.
      
      * Count and print the number of nan and inf.
      
      * Change the print content.
      
      * Add unittest.
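
The counting step described in this commit ("Count and print the number of nan and inf", with a unified check for complex numbers) can be illustrated with a minimal sketch. This is not Paddle's implementation; `count_nan_inf` and `format_report` are hypothetical names, and the report format is an assumption made for illustration only.

```python
import cmath


def count_nan_inf(values):
    """Count NaN and Inf entries in an iterable of real or complex numbers.

    cmath.isnan / cmath.isinf accept both real and complex inputs, so one
    code path covers both (a complex value counts if either part matches).
    """
    num_nan = sum(1 for v in values if cmath.isnan(v))
    num_inf = sum(1 for v in values if cmath.isinf(v))
    return num_nan, num_inf


def format_report(name, values):
    """Build a one-line summary; the wording here is hypothetical."""
    num_nan, num_inf = count_nan_inf(values)
    return f"[{name}] numel={len(values)}, num_nan={num_nan}, num_inf={num_inf}"


sample = [1.0, float("nan"), float("inf"), complex(0.0, float("-inf"))]
print(format_report("sample", sample))
```

Counting instead of stopping at the first bad value makes the summary more useful when diagnosing where a tensor went wrong.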
  6. 09 Dec, 2022 3 commits
  7. 08 Dec, 2022 1 commit
  8. 05 Dec, 2022 1 commit
  9. 02 Dec, 2022 1 commit
    • [Eager] Optimize Grad by prune useless branch (#47827) · d1e93be1
      Jiabin Yang committed
      * [Eager] Fix paddle.grad interface
      
      * [Eager] Support minimum SubGraph for GeneralGrad
      
      * Add needed_nodes to prune grad graph more thoroughly
      
      * [Eager] Add grad_node_trans_mapping_ to record which grad_node has been transformed to AccumulationNode
      
      * [Eager] Fix paddle.grad interface
      
      * Polish code
      
      * remove potential_stop_node
      
      * Add endding_nodes to enhance genSugraph logic
      
      * clear endding_nodes_
      
      * polish code
      
      * rename endding_nodes to endding_nades_
      
      * Refactor grad interface
      
      * Add register_hook case to fix coverage-ci
      
      * Fix code format
      
      * Refactor general_grad
      
      * Add more code comments
      
      * call clear directly to release GradSlotMeta
      
      * fix a mistake
      
      * fix matmul/ multiply kernel logic and optional input in yaml, fill zeros logic and so on.
      
      * fix batch_norm_double_grad yaml optional config
      
      * fix tanh_triple_grad yaml and kernels
      
      * fix MultiplyTripleGradKernel optional logic
      
      * fix merge mistake
      
      * fix compile error
      
      * remove legacy attr for bn
      
      * polish code
      
      * fix some kernel
      
      * merge develop
      
      * fix error
      
      * remove log
      
      * fix kernel with full like
      
      * hide value log behind
      
      * hide value log behind
      
      * fix matmul_triple grad
      Co-authored-by: Weilong Wu <veyron_wu@163.com>
  10. 01 Dec, 2022 2 commits
  11. 29 Nov, 2022 1 commit
  12. 26 Nov, 2022 1 commit
  13. 24 Nov, 2022 1 commit
    • [PHI decoupling] simplify "convert_utils.h" in fluid (#48168) · de4310e6
      huangjiyi committed
      * rm dependence to "convert_utils.h" in some files
      
      * fix bugs
      
      * replace DataType2String with DataTypeToString
      
      * replace framework::DataTypeSize with phi::SizeOf
      
      * mv convert_function from fluid to phi and rm old map
      
      * recommit with pre-commit
      
      * replace ProtoVarType with ProtoDataType and update comment.
      
      * fix error about include "dnnl.hpp"
      
      * revert add dep mkldnn to convert_utils in phi
      
      * add mkldnn deps in convert_utils.h in phi
      
      * move deps to convert_utils.h in phi
  14. 19 Nov, 2022 1 commit
  15. 18 Nov, 2022 2 commits
    • refine save hook (#48124) · 04709310
      wanghuancoder committed
    • fix device id issue for xpu eager mode (#48076) · 3b18d96b
      james committed
      * fix device id issue for xpu eager
      
      The xpu device id is not correctly set in eager mode, so vars stay on dev0 unless
      XPUDeviceGuard is called, leading to this error message for every node with rank != 0:
      "NotImplementedError: (Unimplemented) Place Place(xpu:0) is not supported."
      
      * fix typo
      
      * fix pybind error
  16. 16 Nov, 2022 1 commit
  17. 14 Nov, 2022 1 commit
  18. 10 Nov, 2022 2 commits
  19. 09 Nov, 2022 3 commits
  20. 07 Nov, 2022 2 commits
  21. 01 Nov, 2022 2 commits
    • [CodeStyle][E712] use `if cond`/`if cond is True` for comparison with `True` (#47464) · 5a2ab683
      Nyakku Shigure committed
      * [CodeStyle][E712] use `if cond`/`if cond is True` for comparison with `True`
      
      * revert changes in fluid
      
      * revert unrelated file
      
      * revert changes in norm
      
      * revert changes in auto_parallel_amp
      
      * fix norm and auto_parallel_amp
      
      * revert a typo fix due to fixed at #47477
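
The E712 rule named in this commit flags `== True` comparisons. The sketch below, which uses hypothetical helper names and is not taken from the Paddle change itself, shows that the two suggested replacements are not interchangeable with the original comparison for every value. That difference is one reason each rewrite has to be checked case by case.

```python
def by_equality(value):
    """The pattern E712 flags: comparison to True with ==."""
    return value == True  # noqa: E712


def by_truthiness(value):
    """What `if cond:` tests: the truth value of the object."""
    return bool(value)


def by_identity(value):
    """What `if cond is True:` tests: the object is the True singleton."""
    return value is True


# Print how each form treats a few representative values.
for value in (True, 1, 2, "yes", []):
    print(
        repr(value),
        by_equality(value),
        by_truthiness(value),
        by_identity(value),
    )
```

For a plain `bool` all three forms agree. They diverge for values such as `1` (equal to `True` but not identical to it) and `2` (truthy but not equal to `True`).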
    • Adapting device-specific Extra Attributes for the PHI kernel (#46342) · c923e6c9
      Chen Weihang committed
      * add extra attr property set
      
      * add type_info for all context
      
      * add onednn context to all context
      
      * fix context compile error
      
      * simplify conv kernel args
      
      * pass runtime attr into dev_ctx
      
      * fix macro error
      
      * clear conv_grad_kernel extra args
      
      * merge conv_grad_grad into conv_grad
      
      * clear conv2d_grad_grad extra attrs
      
      * clear yaml and eager extra attr
      
      * fix conv1d error
      
      * change to thread local
      
      * fix npu compile failed
      
      * try to fix windows compile failed
      
      * add conv2d onednn phi kernel
      
      * fix ci bugs (#36)
      
      * fix compile bugs (#38)
      
      * fix extra input transform bug (#39)
      
      * support dynamic created attr (#40)
      
      * reset extra info gen code
      
      * rm conv_grad_grad kernel
      
      * reimpl pass attr adapting
      
      * add int attr support
      
      * remove vector inputnames creating
      
      * fix map at error
      
      * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc
      Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com>
      
      * remove useless extra attrs
      
      * replace mkldnn_engine by onednn_engine
      Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
      Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com>
  22. 27 Oct, 2022 2 commits
  23. 26 Oct, 2022 4 commits
  24. 25 Oct, 2022 1 commit
  25. 24 Oct, 2022 1 commit
  26. 23 Oct, 2022 1 commit
  27. 20 Oct, 2022 1 commit