1. 04 Sep, 2023 1 commit
  2. 28 Aug, 2023 1 commit
  3. 03 Aug, 2023 1 commit
  4. 26 Jul, 2023 1 commit
  5. 24 Jul, 2023 1 commit
  6. 19 Jul, 2023 1 commit
  7. 12 Jul, 2023 1 commit
  8. 03 Jul, 2023 1 commit
  9. 29 Jun, 2023 1 commit
    • Add fused_rope forward op (#54351) · a215c46a
      niuliling123 committed
      * style
      
      * more
      
      * update ctest
      
      * Update legacy_backward.yaml
      
      * Update legacy_ops.yaml
      
      * Update legacy_ops.yaml
      
      * update
      
      * update
      
      * update for move
  10. 25 May, 2023 1 commit
  11. 15 May, 2023 1 commit
    • Tranpose layout (#53351) · 3dce9f0a
      niuliling123 committed
      * update
      
      * Update backward.h
      
      * Update composite_backward_api.h
      
      * Update tensor_utils.cc
      
      * Update backward.cc
      
      * update
      
      * style
      
      * update
      
      * add ctest
      
      * code style
  12. 13 Apr, 2023 1 commit
  13. 10 Apr, 2023 1 commit
    • add autogen code support for affine_grid op (#52560) · 90280542
      Wang Xin committed
      * add autogen code support for affine_grid op
      
      * update op_compat.yaml for affine_grid
      
      * update op_compat.yaml for affine_grid
      
      * fix AffineGridGradInferMeta
      
      * fix CI error
      
      * update AffineGridInferMeta
  14. 09 Apr, 2023 1 commit
  15. 04 Apr, 2023 1 commit
  16. 30 Mar, 2023 1 commit
  17. 27 Mar, 2023 1 commit
  18. 24 Mar, 2023 1 commit
    • Memory Efficient Attention (#51867) · e5ad3859
      ZhangDY-6483 committed
      * first version, notest
      
      * return final rst, notest
      
      * use infinity() instead of max
      
      * ut structure
      
      * start up of ut
      
      * generate lse
      
      * update
      
      * add depense
      
      * reconstruct cmake
      
      * move file
      
      * add memory efficient attention and fix blasimpl
      
      * update
      
      * update cmake
      
      * add namespace
      
      * update cmake
      
      * use .cu
      
      * update for pad3d
      
      * bug fix
      
      * bug fix
      
      * update
      
      * bug fix
      
      * update enforce
      
      * add test case
      
      * merge the lse pad
      
      * fix kernel_fn of backward
      
      * fix PADDLE_ENFORCE_EQ and phi_api
      
      * fix PADDLE_ENFORCE
      
      * fix PADDLE_ENFORCE
      
      * rerun coverage
      
      * fix memory efficient attention test
      
      * rerun ci
      
      * add cuda version condition
      
      * add cuda version condition
      
      * delete WIP test
      
      * replace PADDLE_ENFORCE
      
      * edit the namespace of datatype in multiple.cc
      
      * rerun
      
      * rerun
      
      ---------
      Co-authored-by: liuyuang <liuyuang@baidu.com>
  19. 01 Mar, 2023 1 commit
    • Integration flash attention (#49869) · 61611786
      Chitsing KUI committed
      * flash attn
      
      * seed
      
      * almost
      
      * softmax
      
      * fix workspace
      
      * add unit test; linux only
      
      * fix setup
      
      * fix datatype include
      
      * fix setup typo
      
      * fix def scope
      
      * new error api
      
      * use paddle fork
      
      * fix attr bug; complete ut
      
      * update flash hash
      
      * fix rng reset
      
      * fix offset
      
      * fix comments
  20. 09 Jan, 2023 1 commit
  21. 01 Nov, 2022 1 commit
  22. 31 Oct, 2022 1 commit
  23. 10 Oct, 2022 1 commit
    • [PHI]Add RNN yaml (#46812) · ab60fd8b
      YuanRisheng committed
      * add yaml entry for rnn and rrnn_grad, move infershape function for rnn_grad to phi infer meta
      
      * WIP: move rnn kernel to phi
      
      * Change the code generation to avoid converting from initializer list to tuple of heterogeneous types.
      This is only triggered when an api has intermediate outputs and those outputs are of heterogeneous types.
      
      * fix the bug that when none in a vector of tensors requires gradient, the conversion from InferShapeContext to InferMetaContext (a.k.a. BuildInferMetaContext) produces erroneous results.
      
      * fix ci bugs
      
      * fix ci bugs
      
      * fix ci bugs
      
      * modify code according to comments
      Co-authored-by: chenfeiyu <chenfeiyu@baidu.com>
  24. 07 Sep, 2022 2 commits
  25. 05 Sep, 2022 1 commit
  26. 31 Aug, 2022 1 commit
  27. 09 Aug, 2022 1 commit
  28. 05 Aug, 2022 1 commit
    • move fft kernels to phi (#44714) · 153f1138
      Feiyu Chan committed
      * move fft kernels to phi, done with cufft, pocketfft, mkl_cdft, hipfft
      * make stft_op use fft from phi/kernels/funcs, clean code
  29. 04 Aug, 2022 1 commit
  30. 03 Aug, 2022 4 commits
  31. 02 Aug, 2022 1 commit
  32. 01 Aug, 2022 3 commits
    • [operator migration] Migrate unstack_op and nms_op (#44424) · 9d2e0ecb
      Thomas Young committed
      * update unstack_op
      
      * update unstack_op
      
      * update unstack_op
      
      * fix unstack test
      
      * update unstack
      
      * update with remote
      
      * fix unstack_test.py
      
      * temp_save_change_nms_op
      
      * add nms test
      
      * update nms fix
      
      * update unstack_op
      
      * temp save change
      
      * finish fix nms_op
      
      * pass nms test
      
      * fix CI
      
      * fix ops test
      
      * save change
      
      * fix code style
      
      * fix code style
      
      * fix ci and codestyle
      
      * fix ci
      Co-authored-by: ShiningZhang <zhang_liang1991@126.com>
    • migrate overlap_add and overlap_add_grad op (#44739) · 2a8219c1
      levi131 committed
      * update code format
      
      * add yaml and test
      
      * update for comments
    • [PHI] Move lu_unpack to phi (#44674) · c905a9e9
      Lin Manhui committed
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer implementation code
      
      * Register new kernels
      
      * Remove old kernels
      
      * Fix code style
      
      * Fix bugs
      
      * mutable_data->HostAlloc
      
      * Transfer infermeta
      
      * Add yaml and update python api
      
      * Add PADDLE_WITH_HIP check
      
      * Update unittests
      
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer kernel implementation code
      
      * Register new kernels
      
      * Remove old kernels
      
      * Add lu_unpack_sig
      
      * Fix bugs
      
      * Fix bugs
      
      * Fix bugs
      
      * Optimize directory structure
      
      * Add output checks
      
      * Update include files
      
      * lu_impl.h->lu_kernel_impl.h
      
      * Transfer infermeta
      
      * Add yaml and update python api
      
      * Add check_eager
      Co-authored-by: Bobholamovic <linmanhui@baidu.com>
  33. 29 Jul, 2022 1 commit
    • [PHI] Move lu to phi (#44605) · 3d88816e
      Lin Manhui committed
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer implementation code
      
      * Register new kernels
      
      * Remove old kernels
      
      * Fix code style
      
      * Fix bugs
      
      * mutable_data->HostAlloc
      
      * Transfer infermeta
      
      * Add yaml and update python api
      
      * Add PADDLE_WITH_HIP check
      
      * Update unittests
      
      * Fix bugs
      
      * Fix bugs
      
      * Optimize directory structure
      
      * Add output checks
      
      * lu_impl.h->lu_kernel_impl.h
      Co-authored-by: Bobholamovic <linmanhui@baidu.com>
  34. 28 Jul, 2022 1 commit
    • [PHI] Move spectral_norm to phi (#44577) · 768e50c9
      Lin Manhui committed
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer implementation code
      
      * Fix: Move out_grad to first
      
      * Register new kernels
      
      * Remove old kernels
      
      * Move out_grad to last
      
      * Fix bugs
      
      * Transfer infermeta
      
      * Add yaml files
      
      * Add blank line
      
      * Fix code style
      
      * Optimize directory structure
      Co-authored-by: Bobholamovic <linmanhui@baidu.com>