1. 09 8月, 2023 1 次提交
  2. 07 8月, 2023 1 次提交
    • N
      cherry-pick fused_rope from develop (#55931) · 8d3a9882
      niuliling123 提交于
      * Add fused_rope forward op (#54351)
      
      * style
      
      * more
      
      * update ctest
      
      * Update legacy_backward.yaml
      
      * Update legacy_ops.yaml
      
      * Update legacy_ops.yaml
      
      * update
      
      * update
      
      * update for move
      
      * Update the rope op according to the comments (#54985)
      
      * Update multiary.cc
      
      * Update __init__.py
      
      * for int64_t and assert
      
      * more
      
      * remove useless assert first
      
      ---------
      Co-authored-by: Nsneaxiy <sneaxiy@126.com>
      8d3a9882
  3. 11 4月, 2023 2 次提交
  4. 04 4月, 2023 1 次提交
  5. 27 3月, 2023 1 次提交
  6. 24 3月, 2023 1 次提交
    • Z
      Memory Efficient Attention (#51867) · e5ad3859
      ZhangDY-6483 提交于
      * first version, notest
      
      * return final rst, notest
      
      * use infinity() instead of max
      
      * ut structure
      
      * start up of ut
      
      * generate lse
      
      * update
      
      * add depense
      
      * reconstruct cmake
      
      * move file
      
      * add memory efficient attention and fix blasimpl
      
      * update
      
      * update cmake
      
      * add namespace
      
      * update cmake
      
      * use .cu
      
      * update for pad3d
      
      * bug fix
      
      * bug fix
      
      * update
      
      * bug fix
      
      * update enforce
      
      * add test case
      
      * merge the lse pad
      
      * fix kernel_fn of backward
      
      * fix PADDLE_ENFORCE_EQ and phi_api
      
      * fix PADDLE_ENFORCE
      
      * fix PADDLE_ENFORCE
      
      * rerun coverage
      
      * fix memory efficient attention test
      
      * rerun ci
      
      * add cuda version condition
      
      * add cuda version condition
      
      * delete WIP test
      
      * replace PADDLE_ENFORCE
      
      * edit the namespace of datatype in multiple.cc
      
      * rerun
      
      * rerun
      
      ---------
      Co-authored-by: Nliuyuang <liuyuang@baidu.com>
      e5ad3859
  7. 22 3月, 2023 1 次提交
    • S
      Add fused_linear_param_grad_add_kernel (#51805) · f59c5d8b
      sneaxiy 提交于
      * add fused_linear_param_grad_add_kernel
      
      * fix compile error
      
      * remove flag
      
      * fix ci compile error
      
      * fix ci compile error
      
      * revert pylayer revision
      
      * fix ci ut
      
      * improve performance
      f59c5d8b
  8. 08 3月, 2023 1 次提交
  9. 06 3月, 2023 1 次提交
  10. 03 3月, 2023 1 次提交
  11. 01 3月, 2023 1 次提交
  12. 17 2月, 2023 1 次提交
    • Y
      Rename MultiTensorAdam To FusedAdam (#50449) · e6af9bd2
      yuehuayingxueluo 提交于
      * rename multi_tensor_adam to fused_adam
      
      * fix some bugs
      
      * fix CI coverage
      
      * rename test_fused_adam.py
      
      * fix some bug
      
      * add test_fused_adam_op.py
      
      * fix some bugs
      
      * fix fused_adam_op.cc
      
      * fix CI bugs
      
      * fix CI bug
      
      * fix CI bug
      e6af9bd2
  13. 16 2月, 2023 1 次提交
  14. 09 2月, 2023 1 次提交
    • Y
      Add MultiTenosrAdam OP (#49220) · 10654c77
      yuehuayingxueluo 提交于
      * add multi_tenosr_adam
      
      * update multi_tensor_base.py, test_multi_tensor_adam.py, adamw.py
      
      * fix adam.py optimizer.py
      
      * fix adamw.py
      
      * fix test_multi_tensor_adam.py
      
      * fix CI bug
      
      * fix CI coverage
      
      * fix ci bug
      
      * fix betapow
      
      * fix some bugs
      
      * fix test_adamw_op.py
      
      * fix CI coverage
      
      * fix multi_tensor_adam_kernel.cc
      
      * fix CI bug
      
      * fix multi_tensor_adam_op.cc and test_multi_tensor_adam.py
      
      * fix code style
      
      * update C++ parts
      
      * remove python parts modification temporarily
      
      * add C++ ut
      
      * update betapow copy code logic
      
      * fix ci ut
      
      * fix windows ci
      
      * fix coverage ci
      
      * improve coverage rate
      
      ---------
      Co-authored-by: Nsneaxiy <sneaxiy@126.com>
      10654c77
  15. 23 12月, 2022 1 次提交
  16. 22 12月, 2022 1 次提交
  17. 09 12月, 2022 1 次提交
  18. 17 11月, 2022 1 次提交
  19. 02 11月, 2022 1 次提交
  20. 01 11月, 2022 1 次提交
  21. 31 10月, 2022 1 次提交
  22. 12 10月, 2022 1 次提交
  23. 19 9月, 2022 1 次提交
    • Y
      [PHI]Move sum op to PHI (#45860) · 4b3f2af1
      YuanRisheng 提交于
      * move sum
      
      * fix ci bugs
      
      * fix ci bugs
      
      * fix set_lod bugs
      
      * fix infershape bugs
      
      * fix ci bugs
      
      * fix ci unittest bug
      
      * fix ci bugs
      
      * perfect code
      
      * update code according comment
      
      * add unittest
      
      * fix ci bugs
      4b3f2af1
  24. 09 9月, 2022 1 次提交
  25. 07 9月, 2022 1 次提交
  26. 30 8月, 2022 1 次提交
    • H
      [phi] Transfer coalesce_tensor to phi (#45478) · cf9d651b
      HongyuJia 提交于
      * add coalesce_tensor kernel
      
      * polist coalesce_tensor kernel
      
      * add sig and InferMeta
      
      * add testcase
      
      * add legacy_api.yaml
      
      * fix infermeta
      
      * fix yaml
      
      * fix kernel implementation
      
      * add compile dependency of phi/kernels
      
      * fix MetaConfig
      
      * add python api
      
      * add and fix testcase
      
      * rnn.py add import
      
      * change _C_ops.coalesce_tensor
      
      * remove useless comments
      
      * add SetBackend
      
      * restore XPU kernel temporarily
      
      * fix code according to PR comments
      cf9d651b
  27. 16 8月, 2022 2 次提交
    • C
      [Phi] Move amp ops into phi (#45079) · b4f67757
      Chen Weihang 提交于
      * move check finite and unscale kernel into phi
      
      * move infershape into phi
      
      * move update_loss_scaling kernel into phi
      
      * remove original kernels
      
      * move update loss scaling infershape into phi
      
      * add header for xpu and npu
      
      * solve coverage failed
      
      * fix npu test failed
      
      * remove mutable data in cu file
      
      * fix new executor failed
      
      * add valid check for meta tensor output
      b4f67757
    • S
      [geometric]Add paddle.geometric.send_uv API (#44848) · 88724a53
      Siming Dai 提交于
      * initial commit
      
      * fix op maker bug
      
      * fix mul grad bug
      
      * add unittest
      
      * fix add grad bug, add cpu kernel
      
      * add paddle.geometric.message_passing
      
      * add paddle.geometric.send_uv api, add unittest
      
      * add fp16 judgement
      
      * fix file typo, move compute_type to message_op
      
      * add impl file
      
      * fix unittest timeout time
      
      * add review revise
      88724a53
  28. 12 8月, 2022 1 次提交
    • S
      [geometric]Add paddle.geometric.send_ue_recv API (#43174) · 615b15a3
      Siming Dai 提交于
      * add init file
      
      * add op definition and infermeta
      
      * add kernel definition funcs
      
      * add broadcast infer shape
      
      * add gpu forward kernel
      
      * delete SUB and DIV
      
      * add x_grad
      
      * add template
      
      * add e_grad for min and max
      
      * fix small bug
      
      * temp commit
      
      * temp commit
      
      * add e_grad for sum and mean
      
      * fix some compile bug
      
      * fix compile bugs
      
      * fix compile problem
      
      * add sum forward unittest
      
      * fix broadcast error, add kernel sig, register e_grad, change unit test
      
      * fix grad
      
      * add temp grad fix
      
      * temp commit
      
      * add min max unittest
      
      * add max, min unittest, fix mul bug
      
      * add cpu forward sum and mean
      
      * add forward min max, fix mean unittest
      
      * add cpu backward min max
      
      * fix code-style
      
      * add backward sum mean
      
      * fix rocm ci
      
      * set uniitest timeout
      
      * fix bug of x broadcast to e, gpu grad
      
      * fix bug of x broadcast to e, cpu grad
      
      * rename BOOST_GET_CONST macro
      
      * fix rocm ci
      
      * mv graph_send_e_recv to graph_send_ue_recv
      
      * move out_size to IntArray
      
      * add eager op test
      
      * fix max pool type bug, add unittest for api
      
      * revise api doc
      
      * add fp16 for atomic min and max, add unittest
      
      * add unittest
      
      * add fp16 support for graph_send_recv
      
      * fix unittest fp16 bug
      
      * change OutSizeTensor to Out_size
      
      * move E to Y
      
      * add copyright, fix comment
      
      * review code
      
      * fix thread block size
      
      * fix thread block size
      
      * change api attribute name: pool_type to reduce_op, compute_type to message_op
      
      * change api attribute name, move pool_type to reduce_op, move compute_type to message_op
      615b15a3
  29. 08 8月, 2022 1 次提交
  30. 04 8月, 2022 1 次提交
    • Z
      Phi generate_proposals_v2 (#44436) · 566c80ff
      zhiboniu 提交于
      * phi_generate_proposals_v2
      
      * remove old kernels
      
      * optest add eager_check
      
      * del lod
      
      * update
      
      * update
      
      * update test_detection with_lod
      
      * update nms_util
      
      * remove old nms_util.h
      566c80ff
  31. 03 8月, 2022 2 次提交
    • W
      [BUG FIX]fix auc calculation error under multi instag scene (#41289) · ef945c81
      wangzhen38 提交于
      * fix_auc in instag task
      
      * fix_auc in instag task
      
      * fix_auc in instag task
      
      * fix opmaker error
      
      * optmize default value of ins_tag
      
      * fix multiary
      
      * fix multiary
      
      * reback ctr_metric_bundle
      
      * reback ctr_metric_bundle
      
      * for ci auc_test
      
      * code style
      
      * code style fix
      
      * suit paddle
      
      * suit paddle
      
      * update code example
      
      * complete Auc unittest
      
      * set asdispensable
      
      * set optional
      
      * compatable and code style
      
      * fix cu
      
      * fix cu
      
      * for ci win
      
      * update unittests of auc
      
      * update code doc by reviews
      
      * change code doc by reviews
      ef945c81
    • Z
      Phi edit distance (#44447) · 5ad3228c
      zhiboniu 提交于
      * phi_edit_distance
      
      * fix
      5ad3228c
  32. 27 7月, 2022 1 次提交
  33. 26 7月, 2022 1 次提交
  34. 14 7月, 2022 1 次提交
  35. 12 7月, 2022 1 次提交
  36. 27 5月, 2022 1 次提交
  37. 20 4月, 2022 1 次提交
    • B
      【PaddlePaddle Hackathon 2】9、为 Paddle 新增 logspace API (#41261) · a3c50c42
      BrilliantYuKaimin 提交于
      * 增加logspace的算子描述
      
      * 增加logspace的形状推断
      
      * 增加logspace核函数实现
      
      * 在python中增加logspace接口
      
      * 增加logspace单测
      
      * 增加logspace
      
      * Update logspace_kernel.cu
      
      * Update logspace_op.cc
      
      * 调整代码格式
      
      * Update doc of logspace
      
      * Update tensor.py
      
      * Update logspace_op.cc
      
      * Update logspace_kernel.cc
      
      * Update logspace_kernel.cu
      
      * Update test_logspace.py
      
      * 调整 logspace 的位置
      
      * 调整代码格式
      a3c50c42