1. 24 3月, 2023 1 次提交
    • Z
      Memory Efficient Attention (#51867) · e5ad3859
      ZhangDY-6483 提交于
      * first version, notest
      
      * return final rst, notest
      
      * use infinity() instead of max
      
      * ut structure
      
      * start up of ut
      
      * generate lse
      
      * update
      
      * add depense
      
      * reconstruct cmake
      
      * move file
      
      * add memory efficient attention and fix blasimpl
      
      * update
      
      * update cmake
      
      * add namespace
      
      * update cmake
      
      * use .cu
      
      * update for pad3d
      
      * bug fix
      
      * bug fix
      
      * update
      
      * bug fix
      
      * update enforce
      
      * add test case
      
      * merge the lse pad
      
      * fix kernel_fn of backward
      
      * fix PADDLE_ENFORCE_EQ and phi_api
      
      * fix PADDLE_ENFORCE
      
      * fix PADDLE_ENFORCE
      
      * rerun coverage
      
      * fix memory efficient attention test
      
      * rerun ci
      
      * add cuda version condition
      
      * add cuda version condition
      
      * delete WIP test
      
      * replace PADDLE_ENFORCE
      
      * edit the namespace of datatype in multiple.cc
      
      * rerun
      
      * rerun
      
      ---------
      Co-authored-by: Nliuyuang <liuyuang@baidu.com>
      e5ad3859
  2. 23 3月, 2023 2 次提交
  3. 22 3月, 2023 6 次提交
  4. 20 3月, 2023 1 次提交
  5. 16 3月, 2023 1 次提交
  6. 15 3月, 2023 2 次提交
  7. 13 3月, 2023 1 次提交
  8. 10 3月, 2023 2 次提交
    • HappyHeavyRain's avatar
      [New features]Add function node in phi_kernel for MKLDNN (#51073) · a0a6dc6a
      HappyHeavyRain 提交于
      * Add function node in phi_kernel for MKLDNN
      
      * fix the bug in 'BuildInferVarKernelContext'
      
      * add infer_varkernel_utils.cc
      
      * fix the bug:the first two parametes of 'BuildInferVarKernelContext' can't be template variable
      
      * change the code according to first review
      
      * change the code according to first review
      
      * change the mode of paddle_build.sh
      
      * change 'infer_var_kernel_fn_' to 'get_kerneltype_forvar_fn_'
      
      * add the error information
      
      * fix NotFound infomation warning
      
      * fix NotFound infomation warning
      
      * fix NotFound infomation warning
      a0a6dc6a
    • C
      add flashattn raw kernel (#51383) · f951832d
      Chitsing KUI 提交于
      f951832d
  9. 09 3月, 2023 1 次提交
    • TaoTao Li's avatar
      Add comm context manager, add phi broadcast op (#51072) · c191b707
      TaoTao Li 提交于
      * * add comm context for device context
      
      * add broadcast phi operator kernel and api
      
      * add broadcast support dtype, update ut
      
      * fix broadcast bfloat16 type
      
      * fix ut
      
      * update test_collective_broadcast_api timeout to 300
      c191b707
  10. 01 3月, 2023 1 次提交
    • C
      Integration flash attention (#49869) · 61611786
      Chitsing KUI 提交于
      * flash attn
      
      * seed
      
      * almost
      
      * softmax
      
      * fix workspace
      
      * add unitest; linux only
      
      * fix setup
      
      * fix datatype include
      
      * fix setup typo
      
      * fix def scope
      
      * new error api
      
      * use paddle fork
      
      * fix attr bug; complete ut
      
      * update flash hash
      
      * fix rng reset
      
      * fix offset
      
      * fix comments
      61611786
  11. 24 2月, 2023 1 次提交
    • HappyHeavyRain's avatar
      support 'backend' in static ops (#50671) · 363825df
      HappyHeavyRain 提交于
      * support 'backend' in static ops
      
      * change bitwise_xx comment in python
      
      * change bitwise_xxx comment in python
      
      * change 'backend' and 'data_type' in GetExpectedKernelType
      363825df
  12. 23 2月, 2023 1 次提交
    • HappyHeavyRain's avatar
      Support 'complex promote' in yaml (#50611) · 91a3d159
      HappyHeavyRain 提交于
      * support 'complex promote' in yaml
      
      * change the compplex_promote
      
      * change 'kron' in math.py
      
      * change 'kron' comment in python
      
      * change kron comment in python
      
      * change kron comment in python
      91a3d159
  13. 21 2月, 2023 1 次提交
    • HappyHeavyRain's avatar
      Support bw invoke fw (#50260) · d8845735
      HappyHeavyRain 提交于
      * support bw invoke fw
      
      * fix scale in static_backward.yaml
      
      * fix the bug in tensorrt/convert
      
      * move 'scale','sign' into ops.yaml
      
      * add scale_grad of scale in op_compat.yaml
      
      * change generated_static_op in CMakeLists.txt
      d8845735
  14. 20 2月, 2023 1 次提交
  15. 03 2月, 2023 1 次提交
  16. 16 1月, 2023 1 次提交
  17. 13 1月, 2023 1 次提交
  18. 12 1月, 2023 1 次提交
  19. 09 1月, 2023 1 次提交
  20. 05 1月, 2023 2 次提交
  21. 30 12月, 2022 1 次提交
  22. 28 12月, 2022 1 次提交
  23. 23 12月, 2022 1 次提交
  24. 20 12月, 2022 1 次提交
    • HappyHeavyRain's avatar
      Generate static graph code of some ops (#49092) · 11d7026b
      HappyHeavyRain 提交于
      * generate static graph code of some ops
      
      * change the default value of 'num' of 'unstack'
      
      * revert the pow
      
      * fix the 'real' 'imag' op error because of 'complex'
      
      * fix the code according to review
      11d7026b
  25. 13 12月, 2022 1 次提交
  26. 12 12月, 2022 2 次提交
  27. 09 12月, 2022 2 次提交
  28. 05 12月, 2022 1 次提交
  29. 29 11月, 2022 1 次提交