1. 30 3月, 2023 1 次提交
  2. 27 3月, 2023 1 次提交
  3. 24 3月, 2023 1 次提交
    • Z
      Memory Efficient Attention (#51867) · e5ad3859
      ZhangDY-6483 提交于
      * first version, notest
      
      * return final rst, notest
      
      * use infinity() instead of max
      
      * ut structure
      
      * start up of ut
      
      * generate lse
      
      * update
      
      * add depense
      
      * reconstruct cmake
      
      * move file
      
      * add memory efficient attention and fix blasimpl
      
      * update
      
      * update cmake
      
      * add namespace
      
      * update cmake
      
      * use .cu
      
      * update for pad3d
      
      * bug fix
      
      * bug fix
      
      * update
      
      * bug fix
      
      * update enforce
      
      * add test case
      
      * merge the lse pad
      
      * fix kernel_fn of backward
      
      * fix PADDLE_ENFORCE_EQ and phi_api
      
      * fix PADDLE_ENFORCE
      
      * fix PADDLE_ENFORCE
      
      * rerun coverage
      
      * fix memory efficient attention test
      
      * rerun ci
      
      * add cuda version condition
      
      * add cuda version condition
      
      * delete WIP test
      
      * replace PADDLE_ENFORCE
      
      * edit the namespace of datatype in multiple.cc
      
      * rerun
      
      * rerun
      
      ---------
      Co-authored-by: Nliuyuang <liuyuang@baidu.com>
      e5ad3859
  4. 01 3月, 2023 1 次提交
    • C
      Integration flash attention (#49869) · 61611786
      Chitsing KUI 提交于
      * flash attn
      
      * seed
      
      * almost
      
      * softmax
      
      * fix workspace
      
      * add unitest; linux only
      
      * fix setup
      
      * fix datatype include
      
      * fix setup typo
      
      * fix def scope
      
      * new error api
      
      * use paddle fork
      
      * fix attr bug; complete ut
      
      * update flash hash
      
      * fix rng reset
      
      * fix offset
      
      * fix comments
      61611786
  5. 09 1月, 2023 1 次提交
  6. 01 11月, 2022 1 次提交
  7. 31 10月, 2022 1 次提交
  8. 10 10月, 2022 1 次提交
    • Y
      [PHI]Add RNN yaml (#46812) · ab60fd8b
      YuanRisheng 提交于
      * add yaml entry for rnn and rrnn_grad, move infershape function for rnn_grad to phi infer meta
      
      * WIP: move rnn kernrl to phi
      
      * Change the code generation to avoid converting from intializer list to tuple of heterogeneous types.
      This is only triggered when an api has intermediate outputs, and the result of the outputs are of heterogeneous types.
      
      * fix the bug that when none in a vector of tensors requires gradient, the conversion to InferShapeContext to InferMetaContext (a.k.a. BuildInferMetaContext) produces errorous results.
      
      * fix ci bugs
      
      * fix ci bugs
      
      * fix ci bugs
      
      * modify code according comment
      Co-authored-by: Nchenfeiyu <chenfeiyu@baidu.com>
      ab60fd8b
  9. 07 9月, 2022 2 次提交
  10. 05 9月, 2022 1 次提交
  11. 31 8月, 2022 1 次提交
  12. 09 8月, 2022 1 次提交
  13. 05 8月, 2022 1 次提交
    • F
      move fft kernels to phi (#44714) · 153f1138
      Feiyu Chan 提交于
      * move fft kernels to phi, done with cufft, pocketfft, mkl_cdft, hipfft
      * make stft_op use fft from phi/kernels/funcs, clean code
      153f1138
  14. 04 8月, 2022 1 次提交
  15. 03 8月, 2022 4 次提交
  16. 02 8月, 2022 1 次提交
  17. 01 8月, 2022 3 次提交
    • T
      [operator migration] Migrate unstack_op and nms_op (#44424) · 9d2e0ecb
      Thomas Young 提交于
      * update unstack_op
      
      * update unstack_op
      
      * update unstack_op
      
      * fix unstack test
      
      * update unstack
      
      * update with remote
      
      * fix unstack_test.py
      
      * temp_save_change_nms_op
      
      * add nms test
      
      * update nms fix
      
      * update unstack_op
      
      * temp save change
      
      * finish fix nms_op
      
      * pass nms test
      
      * fix CI
      
      * fix ops test
      
      * save change
      
      * fix code style
      
      * fix code style
      
      * fix ci and codestyle
      
      * fix ci
      Co-authored-by: NShiningZhang <zhang_liang1991@126.com>
      9d2e0ecb
    • L
      migrate overlap_add and overlap_add_grad op (#44739) · 2a8219c1
      levi131 提交于
      * update code format
      
      * add ymal and test
      
      * update for comments
      2a8219c1
    • L
      [PHI] Move lu_unpack to phi (#44674) · c905a9e9
      Lin Manhui 提交于
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer implementation code
      
      * Register new kernels
      
      * Remove old kernels
      
      * Fix code style
      
      * Fix bugs
      
      * mutable_data->HostAlloc
      
      * Transfer infermeta
      
      * Add yaml and update python api
      
      * Add PADDLE_WITH_HIP check
      
      * Update unittests
      
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer kernel implementation code
      
      * Register new kernels
      
      * Remove old kernels
      
      * Add lu_unpack_sig
      
      * Fix bugs
      
      * Fix bugs
      
      * Fix bugs
      
      * Optimize directory structure
      
      * Add output checks
      
      * Update include files
      
      * lu_impl.h->lu_kernel_impl.h
      
      * Transfer infermeta
      
      * Add yaml and update python api
      
      * Add check_eager
      Co-authored-by: NBobholamovic <linmanhui@baidu.com>
      c905a9e9
  18. 29 7月, 2022 1 次提交
    • L
      [PHI] Move lu to phi (#44605) · 3d88816e
      Lin Manhui 提交于
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer implementation code
      
      * Register new kernels
      
      * Remove old kernels
      
      * Fix code style
      
      * Fix bugs
      
      * mutable_data->HostAlloc
      
      * Transfer infermeta
      
      * Add yaml and update python api
      
      * Add PADDLE_WITH_HIP check
      
      * Update unittests
      
      * Fix bugs
      
      * Fix bugs
      
      * Optimize directory structure
      
      * Add output checks
      
      * lu_impl.h->lu_kernel_impl.h
      Co-authored-by: NBobholamovic <linmanhui@baidu.com>
      3d88816e
  19. 28 7月, 2022 1 次提交
    • L
      [PHI] Move spectral_norm to phi (#44577) · 768e50c9
      Lin Manhui 提交于
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer implementation code
      
      * Fix: Move out_grad to first
      
      * Register new kernels
      
      * Remove old kernels
      
      * Move out_grad to last
      
      * Fix bugs
      
      * Transfer infermeta
      
      * Add yaml files
      
      * Add blank line
      
      * Fix code style
      
      * Optimize directory structure
      Co-authored-by: NBobholamovic <linmanhui@baidu.com>
      768e50c9
  20. 27 7月, 2022 2 次提交
  21. 26 7月, 2022 1 次提交
  22. 22 7月, 2022 1 次提交
  23. 19 7月, 2022 2 次提交
    • W
      [Phi]Move angle op to phi (#44393) · 547075e9
      WangZhen 提交于
      * Move angle op to phi
      
      * Replace mutable_data using Alloc
      
      * Remove some include
      
      * Try to fix windows ci error
      
      * include math.h to fix windows ci error
      
      * Fix kernel name
      
      * Move angle_grad infershape
      547075e9
    • F
      move eig operator from fluid to phi (#44398) · 3788f5e5
      freeliuzc 提交于
      * move eig operator from fluid to phi
      
      * add eig_grad unitest, upgrade IsComplexType() from fluid to phi
      3788f5e5
  24. 13 7月, 2022 1 次提交
  25. 01 6月, 2022 1 次提交
  26. 30 5月, 2022 1 次提交
    • T
      【PaddlePaddle Hackathon 2】15 新增 API Nanmedian (#42385) · f87fa3c0
      thunder95 提交于
      * nanmedian op
      
      * 修改cuda kernel的bug
      
      * 修复count_if在其他硬件平台不兼容
      
      * 修复某些cpu硬件不兼容
      
      * 修复某些cpu硬件不兼容
      
      * 修复isnan判断
      
      * 兼容numpy低版本不支持全部nan的情况
      
      * 兼容numpy低版本不支持全部nan的情况
      
      * fix code example
      
      * fix api comment error
      
      * 修改反向传播逻辑以及c++处理逻辑
      
      * 完成修改建议
      
      * typo pre_dim
      
      * update en docs, test=document_fix
      
      * remove numpy in en doc, test=document_fix
      
      * add r,test=document_fix
      
      * 添加api到all
      
      * follow advice from chenwhql
      f87fa3c0
  27. 27 5月, 2022 1 次提交
  28. 26 5月, 2022 1 次提交
  29. 26 4月, 2022 1 次提交
    • B
      【PaddlePaddle Hackathon 2】29、为 Paddle 新增 PixelUnshuffle 组网 API (#40728) · 5be9b824
      BrilliantYuKaimin 提交于
      * 增加PixelUnshuffle的形状推断
      
      * 增加PixelUnshuffle的算子注册
      
      * 增加PixelUnshuffle及其梯度的核函数
      
      * 增加PixelUnshuffle算子的描述
      
      * 增加PixelUnshuffle算子的签名
      
      * 在Python层面增加PixelUnshuffle
      
      * 增加PixelUnshuffle的单测
      
      * Update test_pixel_unshuffle.py
      
      * test=document_fix
      
      * Update test_pixel_unshuffle.py
      
      增加对extra_repr的测试
      
      * 修正代码格式
      
      * Update test_pixel_unshuffle.py
      
      修正对extra_repr的测试
      
      * 修改pixel_unshuffle核函数的实现位置
      
      * 修正代码格式
      
      * 完善对输入的检查
      
      * Update test_pixel_unshuffle.py
      
      * 完善pixel_unshuffle的输入检查
      
      * Update pixel_unshuffle_op.cc
      
      * Update unary.cc
      
      * add pixel_unshuffle
      
      * Update test_pixel_unshuffle.py
      
      * Update vision.py
      
      * 调整代码格式
      
      * Update vision.py
      
      * Delete extra spaces
      
      * Update pixel_unshuffle_sig.cc
      
      * Update vision.py
      
      * Update vision.py
      
      * add PixelUnshuffleGradInferMeta
      
      * remove PixelUnshuffleOpArgumentMapping
      
      * Update pixel_unshuffle_op.cc
      
      * 调整pixel_unshuffle及其梯度的核函数的实现位置
      
      * Update pixel_unshuffle_op.cc
      5be9b824
  30. 25 4月, 2022 1 次提交
    • B
      【PaddlePaddle Hackathon 2】24、为 Paddle 新增 nn.ChannelShuffle 组网 API (#40743) · bbaaf217
      BrilliantYuKaimin 提交于
      * Add infermeta for ChannelShuffle
      
      * Create channel_shuffle_grad_kernel.h
      
      * Create channel_shuffle_kernel.h
      
      * Create channel_shuffle_sig.cc
      
      * Create channel_shuffle_op.cc
      
      ChannelShuffle算子的描述
      
      * Create channel_shuffle_kernel_impl.h
      
      ChannelShuffle核函数的实现
      
      * Create channel_shuffle_grad_kernel_impl.h
      
      ChannelShuffle反向核函数的实现
      
      * Add kernel register of channel shuffle and grad
      
      注册ChannelShuffle及其反向的核函数
      
      * add nn.functional.channel_shuffle
      
      * add nn.ChannelShuffle
      
      * Create test_channel_shuffle.py
      
      * Update example of ChannelShuffle in vision.py
      
      * Update test_channel_shuffle.py
      
      * 修改channel_shuffle核函数的实现位置
      
      * 修正代码格式
      
      * 删除多余空格
      
      * 完善channel_shuffle的错误检查
      
      * Update unary.cc
      
      * Update channel_shuffle_op.cc
      
      * Update test_channel_shuffle.py
      
      * Update unary.cc
      
      * add channel_shuffle
      
      * Update test_channel_shuffle.py
      
      * Update vision.py
      
      * 调整代码格式
      
      * Update channel_shuffle_sig.cc
      
      * 更新ChannelShuffle的文档
      
      * 更新channel_shuffle的文档
      
      * remove ChannelShuffleOpArgumentMapping
      
      * add ChannelShuffleGradInferMeta
      
      * Update channel_shuffle_op.cc
      
      * 调整channel_shuffle及其梯度的核函数的位置
      bbaaf217
  31. 17 4月, 2022 1 次提交
    • C
      [Perf] Optimize dygraph scheduling performance (#41696) · 7ee31a96
      Chen Weihang 提交于
      * split phi and fluid infermeta context
      
      * resolve conflict
      
      * fix type error
      
      * optimize scheduling perf
      
      * spec small vector size
      
      * replace all grad var name
      
      * fix test failed
      
      * move init defalut signature
      
      * polish details
      
      * polish details
      
      * fix no init bug
      
      * init sig for tests
      
      * add init sig for infer
      
      * fix infrt error
      
      * fix infrt failed
      
      * fix kunlun error
      
      * fix infrt failed
      7ee31a96
  32. 13 4月, 2022 1 次提交