1. 01 11月, 2022 2 次提交
    • Z
      Generate static graph code for some activation ops by Yaml (part2) (#47440) · c5d99138
      zyfncg 提交于
      * gene static graph code for ceil, expm1 op
      
      * gene static graph code for some activation op
      
      * fix bug
      
      * revert doc of silu and logsigmoid
      c5d99138
    • C
      Adapting device-specific Extra Attributes for the PHI kernel (#46342) · c923e6c9
      Chen Weihang 提交于
      * add extra attr property set
      
      * add type_info for all context
      
      * add onednn context to all context
      
      * fix context compile error
      
      * simplify conv kernel args
      
      * pass runtime attr into dev_ctx
      
      * fix marco error
      
      * clear conv_grad_kernel extra args
      
      * merge conv_grad_grad into conv_grad
      
      * clear conv2d_grad_grad extra attrs
      
      * clear yaml and eager extra attr
      
      * fix conv1d error
      
      * change to thread local
      
      * fix npu compile failed
      
      * try to fix windows compile failed
      
      * add conv2d onednn phi kernel
      
      * fix ci bugs (#36)
      
      * fix compile bugs (#38)
      
      * fix extra input transform bug (#39)
      
      * support dynamic created attr (#40)
      
      * reset extra info gen code
      
      * rm conv_grad_grad kernel
      
      * reimpl pass attr adapting
      
      * add int attr support
      
      * remove vector inputnames creating
      
      * fix map at error
      
      * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc
      Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
      
      * remove useless extra attrs
      
      * replace mkldnn_engine by onednn_engine
      Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
      Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
      c923e6c9
  2. 31 10月, 2022 1 次提交
  3. 28 10月, 2022 2 次提交
  4. 25 10月, 2022 1 次提交
  5. 24 10月, 2022 2 次提交
  6. 18 10月, 2022 1 次提交
    • Z
      [code-gen] Support code-gen for opmaker of sparse op (#46993) · bdd3dde3
      zyfncg 提交于
      * support generating code of opmaker for backward op invoke forward op
      
      * gsupport code-gen of opmaker for sparse op
      
      * refind logic of choose phi kernrel
      
      * fix complie budg
      
      * fix code_gen bug
      
      * fix bug
      
      * fix kernel signature code-gen
      
      * fix complie bug of VarType
      
      * fix complie bug of VarType
      
      * fix test_sparse_conv_op
      
      * fix test_sparse_norm_op
      bdd3dde3
  7. 11 10月, 2022 1 次提交
  8. 09 10月, 2022 1 次提交
  9. 27 9月, 2022 1 次提交
  10. 19 9月, 2022 1 次提交
    • Y
      [PHI]Move sum op to PHI (#45860) · 4b3f2af1
      YuanRisheng 提交于
      * move sum
      
      * fix ci bugs
      
      * fix ci bugs
      
      * fix set_lod bugs
      
      * fix infershape bugs
      
      * fix ci bugs
      
      * fix ci unittest bug
      
      * fix ci bugs
      
      * perfect code
      
      * update code according comment
      
      * add unittest
      
      * fix ci bugs
      4b3f2af1
  11. 18 9月, 2022 1 次提交
  12. 13 9月, 2022 1 次提交
  13. 09 9月, 2022 2 次提交
  14. 07 9月, 2022 1 次提交
    • C
      [Phi] Migrate save kernel (#45665) · fc66fdb7
      Chen Weihang 提交于
      * add save kernel
      
      * add save_sr_kernel
      
      * remove original save_op
      
      * add save gpu kernel
      
      * remove combine kernel
      
      * add port.h include
      
      * add save selected rows test
      
      * remove useless kernel.h
      fc66fdb7
  15. 31 8月, 2022 2 次提交
    • C
      Fix split api bug (#45396) · 4a25b60d
      Charles-hit 提交于
      * fix split bug
      
      * solve function redefine
      
      * fix fluid.layers.split and add unit test
      
      * delete splitInferMeta register in unary.cc
      
      * modify test_split_op GPU unit test
      
      * modify test_split_op GPU unit test place param
      
      * refactor split op and fix infershape bugs
      
      * add () in && and ||
      
      * fix split C++ unit test
      
      * fix split infershape
      4a25b60d
    • L
      Add index add API (#45176) · 45171911
      Li Min 提交于
      45171911
  16. 30 8月, 2022 2 次提交
    • H
      [phi] Transfer coalesce_tensor to phi (#45478) · cf9d651b
      HongyuJia 提交于
      * add coalesce_tensor kernel
      
      * polist coalesce_tensor kernel
      
      * add sig and InferMeta
      
      * add testcase
      
      * add legacy_api.yaml
      
      * fix infermeta
      
      * fix yaml
      
      * fix kernel implementation
      
      * add compile dependency of phi/kernels
      
      * fix MetaConfig
      
      * add python api
      
      * add and fix testcase
      
      * rnn.py add import
      
      * change _C_ops.coalesce_tensor
      
      * remove useless comments
      
      * add SetBackend
      
      * restore XPU kernel temporarily
      
      * fix code according to PR comments
      cf9d651b
    • C
      rename mod c api name (#45476) · ad96fe2c
      Chen Weihang 提交于
      ad96fe2c
  17. 25 8月, 2022 2 次提交
    • K
      Transfer memcpy d2h from fluid to phi (#45150) · 0d14e74a
      kangguangli 提交于
      * transfer memcpy_d2h from fluid to phi
      
      * refine arg check and add comment
      
      * fix cannot fallback to phi kernel
      
      * fix gpu_context host alloc when tensor size = 0
      
      * add kernel for std::vector<DenseTensor> args
      
      * fix bugs in MemcpyD2HMultiIOKernel
      
      * remove useless header file
      
      * polish format
      
      * fix typo
      
      * add testcase for cudapinned place
      
      * refine check condition in test
      
      * polish error message
      
      * polish error message
      
      * remove header in fluid  directory
      
      * merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation
      
      * fix code style check
      0d14e74a
    • W
      [Eager] sync_batch_norm_grad delete mean and variance (#45411) · 5df464fe
      wanghuancoder 提交于
      * sync_batch_norm_grad delete mean and variance
      5df464fe
  18. 23 8月, 2022 1 次提交
  19. 18 8月, 2022 2 次提交
  20. 17 8月, 2022 1 次提交
  21. 16 8月, 2022 2 次提交
  22. 15 8月, 2022 1 次提交
  23. 12 8月, 2022 1 次提交
    • S
      [geometric]Add paddle.geometric.send_ue_recv API (#43174) · 615b15a3
      Siming Dai 提交于
      * add init file
      
      * add op definition and infermeta
      
      * add kernel definition funcs
      
      * add broadcast infer shape
      
      * add gpu forward kernel
      
      * delete SUB and DIV
      
      * add x_grad
      
      * add template
      
      * add e_grad for min and max
      
      * fix small bug
      
      * temp commit
      
      * temp commit
      
      * add e_grad for sum and mean
      
      * fix some compile bug
      
      * fix compile bugs
      
      * fix compile problem
      
      * add sum forward unittest
      
      * fix broadcast error, add kernel sig, register e_grad, change unit test
      
      * fix grad
      
      * add temp grad fix
      
      * temp commit
      
      * add min max unittest
      
      * add max, min unittest, fix mul bug
      
      * add cpu forward sum and mean
      
      * add forward min max, fix mean unittest
      
      * add cpu backward min max
      
      * fix code-style
      
      * add backward sum mean
      
      * fix rocm ci
      
      * set uniitest timeout
      
      * fix bug of x broadcast to e, gpu grad
      
      * fix bug of x broadcast to e, cpu grad
      
      * rename BOOST_GET_CONST macro
      
      * fix rocm ci
      
      * mv graph_send_e_recv to graph_send_ue_recv
      
      * move out_size to IntArray
      
      * add eager op test
      
      * fix max pool type bug, add unittest for api
      
      * revise api doc
      
      * add fp16 for atomic min and max, add unittest
      
      * add unittest
      
      * add fp16 support for graph_send_recv
      
      * fix unittest fp16 bug
      
      * change OutSizeTensor to Out_size
      
      * move E to Y
      
      * add copyright, fix comment
      
      * review code
      
      * fix thread block size
      
      * fix thread block size
      
      * change api attribute name: pool_type to reduce_op, compute_type to message_op
      
      * change api attribute name, move pool_type to reduce_op, move compute_type to message_op
      615b15a3
  24. 09 8月, 2022 3 次提交
  25. 08 8月, 2022 2 次提交
  26. 05 8月, 2022 2 次提交
  27. 04 8月, 2022 1 次提交