1. 08 8月, 2023 1 次提交
  2. 07 8月, 2023 1 次提交
    • N
      cherry-pick fused_rope from develop (#55931) · 8d3a9882
      niuliling123 提交于
      * Add fused_rope forward op (#54351)
      
      * style
      
      * more
      
      * update ctest
      
      * Update legacy_backward.yaml
      
      * Update legacy_ops.yaml
      
      * Update legacy_ops.yaml
      
      * update
      
      * update
      
      * update for move
      
      * Update the rope op according to the comments (#54985)
      
      * Update multiary.cc
      
      * Update __init__.py
      
      * for int64_t and assert
      
      * more
      
      * remove useless assert first
      
      ---------
      Co-authored-by: Nsneaxiy <sneaxiy@126.com>
      8d3a9882
  3. 13 7月, 2023 1 次提交
  4. 13 4月, 2023 1 次提交
  5. 10 4月, 2023 1 次提交
    • W
      add autogen code support for affine_grid op (#52560) · 90280542
      Wang Xin 提交于
      * add autogen code support for affine_grid op
      
      * update op_compat.yaml for affine_grid
      
      * update op_compat.yaml for affine_grid
      
      * fix AffineGridGradInferMeta
      
      * fix CI error
      
      * update AffineGridInferMeta
      90280542
  6. 09 4月, 2023 1 次提交
  7. 04 4月, 2023 1 次提交
  8. 30 3月, 2023 1 次提交
  9. 27 3月, 2023 1 次提交
  10. 24 3月, 2023 1 次提交
    • Z
      Memory Efficient Attention (#51867) · e5ad3859
      ZhangDY-6483 提交于
      * first version, notest
      
      * return final rst, notest
      
      * use infinity() instead of max
      
      * ut structure
      
      * start up of ut
      
      * generate lse
      
      * update
      
      * add depense
      
      * reconstruct cmake
      
      * move file
      
      * add memory efficient attention and fix blasimpl
      
      * update
      
      * update cmake
      
      * add namespace
      
      * update cmake
      
      * use .cu
      
      * update for pad3d
      
      * bug fix
      
      * bug fix
      
      * update
      
      * bug fix
      
      * update enforce
      
      * add test case
      
      * merge the lse pad
      
      * fix kernel_fn of backward
      
      * fix PADDLE_ENFORCE_EQ and phi_api
      
      * fix PADDLE_ENFORCE
      
      * fix PADDLE_ENFORCE
      
      * rerun coverage
      
      * fix memory efficient attention test
      
      * rerun ci
      
      * add cuda version condition
      
      * add cuda version condition
      
      * delete WIP test
      
      * replace PADDLE_ENFORCE
      
      * edit the namespace of datatype in multiple.cc
      
      * rerun
      
      * rerun
      
      ---------
      Co-authored-by: Nliuyuang <liuyuang@baidu.com>
      e5ad3859
  11. 01 3月, 2023 1 次提交
    • C
      Integration flash attention (#49869) · 61611786
      Chitsing KUI 提交于
      * flash attn
      
      * seed
      
      * almost
      
      * softmax
      
      * fix workspace
      
      * add unitest; linux only
      
      * fix setup
      
      * fix datatype include
      
      * fix setup typo
      
      * fix def scope
      
      * new error api
      
      * use paddle fork
      
      * fix attr bug; complete ut
      
      * update flash hash
      
      * fix rng reset
      
      * fix offset
      
      * fix comments
      61611786
  12. 09 1月, 2023 1 次提交
  13. 01 11月, 2022 1 次提交
  14. 31 10月, 2022 1 次提交
  15. 10 10月, 2022 1 次提交
    • Y
      [PHI]Add RNN yaml (#46812) · ab60fd8b
      YuanRisheng 提交于
      * add yaml entry for rnn and rrnn_grad, move infershape function for rnn_grad to phi infer meta
      
      * WIP: move rnn kernrl to phi
      
      * Change the code generation to avoid converting from intializer list to tuple of heterogeneous types.
      This is only triggered when an api has intermediate outputs, and the result of the outputs are of heterogeneous types.
      
      * fix the bug that when none in a vector of tensors requires gradient, the conversion to InferShapeContext to InferMetaContext (a.k.a. BuildInferMetaContext) produces errorous results.
      
      * fix ci bugs
      
      * fix ci bugs
      
      * fix ci bugs
      
      * modify code according comment
      Co-authored-by: Nchenfeiyu <chenfeiyu@baidu.com>
      ab60fd8b
  16. 07 9月, 2022 2 次提交
  17. 05 9月, 2022 1 次提交
  18. 31 8月, 2022 1 次提交
  19. 09 8月, 2022 1 次提交
  20. 05 8月, 2022 1 次提交
    • F
      move fft kernels to phi (#44714) · 153f1138
      Feiyu Chan 提交于
      * move fft kernels to phi, done with cufft, pocketfft, mkl_cdft, hipfft
      * make stft_op use fft from phi/kernels/funcs, clean code
      153f1138
  21. 04 8月, 2022 1 次提交
  22. 03 8月, 2022 4 次提交
  23. 02 8月, 2022 1 次提交
  24. 01 8月, 2022 3 次提交
    • T
      [operator migration] Migrate unstack_op and nms_op (#44424) · 9d2e0ecb
      Thomas Young 提交于
      * update unstack_op
      
      * update unstack_op
      
      * update unstack_op
      
      * fix unstack test
      
      * update unstack
      
      * update with remote
      
      * fix unstack_test.py
      
      * temp_save_change_nms_op
      
      * add nms test
      
      * update nms fix
      
      * update unstack_op
      
      * temp save change
      
      * finish fix nms_op
      
      * pass nms test
      
      * fix CI
      
      * fix ops test
      
      * save change
      
      * fix code style
      
      * fix code style
      
      * fix ci and codestyle
      
      * fix ci
      Co-authored-by: NShiningZhang <zhang_liang1991@126.com>
      9d2e0ecb
    • L
      migrate overlap_add and overlap_add_grad op (#44739) · 2a8219c1
      levi131 提交于
      * update code format
      
      * add ymal and test
      
      * update for comments
      2a8219c1
    • L
      [PHI] Move lu_unpack to phi (#44674) · c905a9e9
      Lin Manhui 提交于
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer implementation code
      
      * Register new kernels
      
      * Remove old kernels
      
      * Fix code style
      
      * Fix bugs
      
      * mutable_data->HostAlloc
      
      * Transfer infermeta
      
      * Add yaml and update python api
      
      * Add PADDLE_WITH_HIP check
      
      * Update unittests
      
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer kernel implementation code
      
      * Register new kernels
      
      * Remove old kernels
      
      * Add lu_unpack_sig
      
      * Fix bugs
      
      * Fix bugs
      
      * Fix bugs
      
      * Optimize directory structure
      
      * Add output checks
      
      * Update include files
      
      * lu_impl.h->lu_kernel_impl.h
      
      * Transfer infermeta
      
      * Add yaml and update python api
      
      * Add check_eager
      Co-authored-by: NBobholamovic <linmanhui@baidu.com>
      c905a9e9
  25. 29 7月, 2022 1 次提交
    • L
      [PHI] Move lu to phi (#44605) · 3d88816e
      Lin Manhui 提交于
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer implementation code
      
      * Register new kernels
      
      * Remove old kernels
      
      * Fix code style
      
      * Fix bugs
      
      * mutable_data->HostAlloc
      
      * Transfer infermeta
      
      * Add yaml and update python api
      
      * Add PADDLE_WITH_HIP check
      
      * Update unittests
      
      * Fix bugs
      
      * Fix bugs
      
      * Optimize directory structure
      
      * Add output checks
      
      * lu_impl.h->lu_kernel_impl.h
      Co-authored-by: NBobholamovic <linmanhui@baidu.com>
      3d88816e
  26. 28 7月, 2022 1 次提交
    • L
      [PHI] Move spectral_norm to phi (#44577) · 768e50c9
      Lin Manhui 提交于
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer implementation code
      
      * Fix: Move out_grad to first
      
      * Register new kernels
      
      * Remove old kernels
      
      * Move out_grad to last
      
      * Fix bugs
      
      * Transfer infermeta
      
      * Add yaml files
      
      * Add blank line
      
      * Fix code style
      
      * Optimize directory structure
      Co-authored-by: NBobholamovic <linmanhui@baidu.com>
      768e50c9
  27. 27 7月, 2022 2 次提交
  28. 26 7月, 2022 1 次提交
  29. 22 7月, 2022 1 次提交
  30. 19 7月, 2022 2 次提交
    • W
      [Phi]Move angle op to phi (#44393) · 547075e9
      WangZhen 提交于
      * Move angle op to phi
      
      * Replace mutable_data using Alloc
      
      * Remove some include
      
      * Try to fix windows ci error
      
      * include math.h to fix windows ci error
      
      * Fix kernel name
      
      * Move angle_grad infershape
      547075e9
    • F
      move eig operator from fluid to phi (#44398) · 3788f5e5
      freeliuzc 提交于
      * move eig operator from fluid to phi
      
      * add eig_grad unitest, upgrade IsComplexType() from fluid to phi
      3788f5e5
  31. 13 7月, 2022 1 次提交
  32. 01 6月, 2022 1 次提交