1. 12 5月, 2023 3 次提交
    • P
      【Hackathon 4 No.20】Add i0 / i0e to paddle (#52058) · ce256f75
      PommesPeter 提交于
      * added base code for i0 and i0e
      
      * added grad base code for i0 and i0e
      
      * added i0 and i0e python code
      
      * added ops and backward yaml config
      
      * added i0 and i0e cpu kernel, but not test.
      
      * added i0 and i0e code and unitest files
      
      * added test files
      
      * added i0/i0e gpu implementation code
      
      * updated code style
      
      * updated code style
      
      * fixed unitests code
      
      * updated i0 with eigen3
      
      * fixed bug and added more test cases
      
      * refactor: fixed static graph bug
      
      * refactor: removed i0 and i0e from op_compat
      
      * refactor: updated code style
      
      * refactor: updated op_compat.yaml
      
      * refactor: updated op_compat.yaml
      
      * refactor: fixed op name mapping and optimize unittest case
      
      * refactor: manually implement i0 / i0e
      
      * refactor: added grad kernel for i0 / i0e,didn't finish
      
      * Update math.py
      
      * refactor: added equation to doc in English and added comments for computing i0 / i0e gradient
      
      * refactor: removed eigen implementation
      
      * refactor: finished i0 / i0e cpu and gpu op
      
      * refactor: updated code style
      
      * fix: find  a bug but not fix
      
      * fix: incorrect unittest cases
      
      * update: updated code style and remove my file
      
      * update: updated unittest case
      
      * fix: fixed sign error
      
      * fix: fixed mistakes when merging
      
      * refactor: updated code style
      
      * refactor: remove unused code
      
      * refactor: updated code style
      ce256f75
    • W
      sequence_mask functionalization (#53478) · d2b1e3c2
      Wang Xin 提交于
      * sequence_mask functionalization
      
      * fix sequence_mask test
      d2b1e3c2
    • H
      move pow2_decay_with_linear_warmup kernel to phi (#53741) · 348565b0
      huangjiyi 提交于
      * update
      
      * update
      348565b0
  2. 11 5月, 2023 1 次提交
  3. 09 5月, 2023 2 次提交
  4. 05 5月, 2023 2 次提交
  5. 28 4月, 2023 2 次提交
    • B
      Dropout optimize & clean broadcast inT and ElementwiseType (#52969) · d611e48c
      Bo Zhang 提交于
      * change judgement for DropoutGradGPUKernelDriver
      
      * add UnrollerWithoutVecSize and after this Loaddata to be refined
      
      * pass unittest
      
      * use same unroller with XPU
      
      * BroadcastWithInt64Index
      
      * BroadcastDataLoader template partial specialization
      
      * fix compile errs in ROCms
      
      * clean ElementwiseT and InT for BroadcastKernel
      
      * default axis and clean inT
      
      * remove redundant fast divmod computation
      
      * optimize drop_nd & drop_nd_grad
      
      * optimize BroadcastDataLoader bf16 fp16
      
      * rm InT etc. after merge develop
      
      * delete constexpr for windows ci
      
      * fix conflict
      
      * fix conflic with develop
      
      * fix conflic
      
      * new clean
      
      * clean
      d611e48c
    • G
      【0D output】add_0D_output_support (#52857) · ef6e8d09
      GGBond8488 提交于
      * add 0d support for dist, trace, paddle.linalg.cond test=allcase
      
      * add_0d_output_support_for_det
      
      * test=allcase
      
      * support_0d_output_for_linalg.norm
      
      * support linalg.norm 0d output, test=allcase
      
      * fix 0D test
      
      * fix zero dim test, test=allcase
      
      * fix 0D test
      
      * fix tets,test=allcase
      
      * fix error,test=allcase
      
      * fix errors ,test=allcase
      
      * add static backward , test=allcase
      
      * add static backwward test, test=allcase
      
      * fix pr-ci-build error;test=document_fix (#53060)
      
      * [Cherry-Pick] Unique support float16&bfloat16 (#53023)
      
      unique支持float16和bfloat16数据类型,并完善相关单测。
      
      * slogdet_support_0D_output
      
      * add new case
      
      * fix tests, test=allcase
      
      * fix p_norm related test, test=allcase
      
      * fix some err, test=allcase
      
      * test=allcase
      
      * move out trace
      
      * open some case, test=allcase
      
      * fix norm all case, test=allcase
      
      * fix some test error, test=allcase
      
      * fix typro,test=allcase
      
      * fix test err, test=allcase
      
      * test=allcase
      
      * test
      
      * fix test error, test=allcase
      
      * fix test error, test=allcase
      
      * fallback norm, test=allcase
      
      ---------
      Co-authored-by: Ntianshuo78520a <707759223@qq.com>
      Co-authored-by: NZhang Zheng <32410583+ZzSean@users.noreply.github.com>
      ef6e8d09
  6. 27 4月, 2023 2 次提交
    • G
      [phi] Move sequence_pool to phi - Step 3 :sequence_pool_grad_op (#52680) · fe053396
      gouzil 提交于
      * [phi] move sequence_pool kernel to phi
      
      * mv kernels impl
      
      * fix parameter error
      
      * clean include
      
      * fix compat filename
      
      * [phi] move fluid sequence_pool_grad to phi
      
      * [phi][compat] sig rm GradVarName
      
      * [phi] fix sequence_pool out type
      
      * [phi] rm impl, add const string
      
      * [phi] fix const str
      
      * fix sequence_pooling cmake
      
      * [phi] mv sequence_pooling_test
      
      * [phi] fix grad sig
      
      * [phi] fix sequence_pool is_test error
      
      * [phi] fix sequence_pooling gpu include
      
      * [phi] mv to impl
      
      * [phi] fix SequencePoolFunctor cu include
      
      * [phi] modify out max_index int32_t
      
      * [phi] add pooltype mapping determine
      
      * [phi] fix sequence_pool_sig
      
      * [phi] fix sequence_pool_sig sum
      
      * [phi] try ci
      
      * [phi] fix max_index optional
      fe053396
    • C
      【Hackathon4】No5 nextafter (#52544) · 82ac3913
      cyberslack_lee 提交于
      82ac3913
  7. 26 4月, 2023 2 次提交
  8. 24 4月, 2023 2 次提交
    • G
      add 0D support for trace (#53208) · 9d90738c
      GGBond8488 提交于
      * add 0D support for trace, test=allcase
      
      * fix trace gpu kernel 0d error, test=allcase
      
      * fix windows error, test=allcase
      9d90738c
    • G
      remove some [-Wunused-parameter] (#53185) · 834eb2ba
      Galaxy1458 提交于
      * test,test=develop
      
      * test,test=develop
      
      * test,test=develop
      
      * test,test=develop
      
      * test,test=develop
      
      * test,test=develop
      
      * test,test=develop
      
      * test ,test=develop
      834eb2ba
  9. 23 4月, 2023 1 次提交
  10. 22 4月, 2023 1 次提交
    • W
      [Zero-Dim] support output 0D for... · b406a7db
      wangfengsheng1999 提交于
      [Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase (#52850)
      
      * [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase
      
      * [Zero-Dim] support output 0D for is_empty/as_complex/, test=allcase
      
      * add test case
      
      * modify dot/metric.accuracy/static.accuracy/static.auc
      
      * modfiy inner/tensordot bug
      
      * test 9 api
      
      * [Zero-Dim] support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy, test=allcase
      
      * fix bug
      
      * support output 0D for is_empty/as_complex/inner/dot/rank/tensordot/squeeze_/static.accuracy/static.auc/metric.accuracy
      
      * code style
      
      * fix bug
      
      * fix test_dot_op bug
      
      * fix accuracy bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * codestyle
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * code style
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * fix dot bug
      
      * modify code
      b406a7db
  11. 18 4月, 2023 2 次提交
  12. 17 4月, 2023 2 次提交
  13. 14 4月, 2023 2 次提交
  14. 13 4月, 2023 2 次提交
    • H
      [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h (#52651) · 5664ea26
      HongyuJia 提交于
      * [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h
      
      * Add logging.h for profiler.cc
      
      * Add logging.h for gloo_utils.h
      
      * Add logging.h for addmm_kernel_impl.h
      
      * Add logging.h for addmm_grad_kernel_impl.h
      
      * Add logging.h for p_send_kernel.cu
      
      * Add logging.h for determinant_grad_kernel_impl.h
      
      * Add logging.h for p_recv_kernel.cu
      
      * Add logging.h for elementwise_grad_base.h
      
      * Add logging.h for transfer_layout_kernel.cc
      
      * Add logging.h for eigvals_kernel.cc and index_select_impl.h
      
      * Add logging.h for all files in kernel directory
      
      * Add logging.h for xpu_info.cc
      
      * Add logging.h for xpu
      5664ea26
    • Z
      rename_bilinear_tensor_op (#52745) · eb93b5c9
      zhangyuqin1998 提交于
      eb93b5c9
  15. 12 4月, 2023 1 次提交
    • W
      Patch del (#52754) · 189e0d44
      wangzhen38 提交于
      * [DO NOT MERGE] adadelta lr support
      
      * [DO NOT MERGE] gpu support
      
      * [test] follow torch
      
      * fix acc update order
      
      * for ci
      
      * [bug fix] update master para
      
      * [bug fix] update test
      
      * [bug fix] for ci test
      
      * for ci
      
      * fix xpu
      
      * [adadelta fix] del fluid head file
      
      * for ci
      
      * del notes
      189e0d44
  16. 11 4月, 2023 1 次提交
  17. 10 4月, 2023 1 次提交
  18. 04 4月, 2023 2 次提交
  19. 03 4月, 2023 1 次提交
  20. 30 3月, 2023 1 次提交
  21. 24 3月, 2023 1 次提交
    • Y
      [PHI Decoupling]Remove memory header (Part3) (#51288) · 3d78e759
      YuanRisheng 提交于
      * decouple memory copy
      
      * fix ci bugs
      
      * fix ci compile bugs
      
      * fix rocm compile
      
      * fix ci bugs
      
      * decouple memory
      
      * deal with conflict
      
      * fix xpu compile bugs
      
      * fix xpu bugs
      
      * deal with xpu bugs
      
      * fix cmake bugs
      
      * fix windows bugs
      
      * fix ci bugs
      
      * fix ci bugs
      
      * delete redundance code
      
      * add code for pybind
      
      * fix py3 bugs
      
      * fix ci bugs
      3d78e759
  22. 23 3月, 2023 1 次提交
  23. 21 3月, 2023 1 次提交
    • iSerendipity's avatar
      [PHI decoupling] Move DataType* from paddle:experimental to phi namespace (#51716) · 4638a62e
      iSerendipity 提交于
      * move DataType from paddle::experimental to phi
      
      * convert namespace
      
      * convert namespace
      
      * convert namespace
      
      * clarify namespace
      
      * convert more datatype
      
      * Revert "convert more datatype"
      
      This reverts commit 083b462959e6a22d4d8767707b628b95b396642e.
      
      * convert more in auto_code_generator
      
      * fix conflicts for XPU
      
      * fix namespace conflicts
      
      * fix errors
      
      * Revert "fix errors"
      
      This reverts commit f9d9958b54ee32141112274c8a5c3c381ab0f876.
      
      * fix errors
      
      * fix formatting
      4638a62e
  24. 20 3月, 2023 1 次提交
    • L
      Support Linear operation in cuBlaslt and plug into attn_gemm and fusedLinear forward op (#51124) · 2dfc3fa8
      limingshu 提交于
      * optimization for fused linear op
      
      * fix code format
      
      * optimization for linear fused forward
      
      * merge with develop
      
      * fix bugs for gemm_ephilog
      
      * package of cublaslt ephilogue type with enmu
      
      * final fix before code reviewing
      
      * fix missed fusedType typo
      
      * fix code according to review suggestions
      
      * fix windows ci error
      
      * change location of MatmulPlanner
      
      * add some changes for compiler error fix
      
      ---------
      2dfc3fa8
  25. 17 3月, 2023 1 次提交
  26. 15 3月, 2023 1 次提交
    • iSerendipity's avatar
      [PHI] remove operator.h in blas.h (rebase to latest codebase) (#51472) · 427712df
      iSerendipity 提交于
      * Revert "Revert "【Hackathon No.67】remove operator.h in blas.h (#50989)" (#51467)"
      
      This reverts commit b9d91531.
      
      * remove cout
      
      * add header
      
      * fix missing header
      
      * fix refer fluid error
      
      * fix missing header
      
      * 更新 repeat_interleave_grad_kernel_impl.h
      
      Change to phi style datatype.
      
      * 更新 repeat_interleave_grad_kernel_impl.h
      
      Fix missing header
      
      * datatype fluid -> phi
      
      * paddle::experimental -> phi
      
      * fix reference error
      
      * fix reference error
      
      * fix reference error
      
      * fix errors
      
      * fix missing FLAGS
      
      * fix missing headers
      
      * fix missing headers
      
      * fix missing headers
      
      * fix missing headers
      
      * fix missing header
      
      * fix missing header
      
      * fix errors
      427712df
  27. 14 3月, 2023 1 次提交