1. 28 Mar, 2023 4 commits
  2. 27 Mar, 2023 1 commit
    • Add fuse_ops.yaml and fused_backward.yaml (#52010) · 10145cb6
      Committed by HappyHeavyRain
      * add fused_yaml fused_backward
      
      * fix eager_function bug
      
      * add some comment of fused yaml file
      
      * add 'support_dygraph_mode' configuration in fused yaml
      
      * delete some 'fused_api.h' in include file
      
      * add fused flag in api_gen
  3. 24 Mar, 2023 2 commits
    • add phi operator allreduce/reduce (#51857) · 47f87ad3
      Committed by TaoTao Li
      * add all_reduce, reduce kernel and api
      
      * fix all_reduce reduce ut
      
      fix reduce op maker conflict
      
      fix merge conflicts
      
      * fix conflicts, rename ReduceOp->ReduceBaseOp in reduce_ops
      
      rename allreduce op, to remove
      
      * fix code format
      
      fix comments
      
      * modify test_collective_reduce_api ut timeout
      
      * fix PR-CI-Build
      
      fix comments: format phi operator
    • Memory Efficient Attention (#51867) · e5ad3859
      Committed by ZhangDY-6483
      * first version, notest
      
      * return final rst, notest
      
      * use infinity() instead of max
      
      * ut structure
      
      * start up of ut
      
      * generate lse
      
      * update
      
      * add dependency
      
      * reconstruct cmake
      
      * move file
      
      * add memory efficient attention and fix blasimpl
      
      * update
      
      * update cmake
      
      * add namespace
      
      * update cmake
      
      * use .cu
      
      * update for pad3d
      
      * bug fix
      
      * bug fix
      
      * update
      
      * bug fix
      
      * update enforce
      
      * add test case
      
      * merge the lse pad
      
      * fix kernel_fn of backward
      
      * fix PADDLE_ENFORCE_EQ and phi_api
      
      * fix PADDLE_ENFORCE
      
      * fix PADDLE_ENFORCE
      
      * rerun coverage
      
      * fix memory efficient attention test
      
      * rerun ci
      
      * add cuda version condition
      
      * add cuda version condition
      
      * delete WIP test
      
      * replace PADDLE_ENFORCE
      
      * edit the namespace of datatype in multiple.cc
      
      * rerun
      
      * rerun
      
      ---------
      Co-authored-by: Nliuyuang <liuyuang@baidu.com>
  4. 23 Mar, 2023 2 commits
  5. 22 Mar, 2023 6 commits
  6. 20 Mar, 2023 1 commit
  7. 16 Mar, 2023 1 commit
  8. 15 Mar, 2023 2 commits
  9. 13 Mar, 2023 1 commit
  10. 10 Mar, 2023 2 commits
    • [New features]Add function node in phi_kernel for MKLDNN (#51073) · a0a6dc6a
      Committed by HappyHeavyRain
      * Add function node in phi_kernel for MKLDNN
      
      * fix the bug in 'BuildInferVarKernelContext'
      
      * add infer_varkernel_utils.cc
      
      * fix the bug: the first two parameters of 'BuildInferVarKernelContext' can't be template variables
      
      * change the code according to first review
      
      * change the code according to first review
      
      * change the mode of paddle_build.sh
      
      * change 'infer_var_kernel_fn_' to 'get_kerneltype_forvar_fn_'
      
      * add the error information
      
      * fix NotFound information warning
      
      * fix NotFound information warning
      
      * fix NotFound information warning
    • add flashattn raw kernel (#51383) · f951832d
      Committed by Chitsing KUI
  11. 09 Mar, 2023 1 commit
    • Add comm context manager, add phi broadcast op (#51072) · c191b707
      Committed by TaoTao Li
      * add comm context for device context
      
      * add broadcast phi operator kernel and api
      
      * add broadcast support dtype, update ut
      
      * fix broadcast bfloat16 type
      
      * fix ut
      
      * update test_collective_broadcast_api timeout to 300
  12. 01 Mar, 2023 1 commit
    • Integration flash attention (#49869) · 61611786
      Committed by Chitsing KUI
      * flash attn
      
      * seed
      
      * almost
      
      * softmax
      
      * fix workspace
      
      * add unit test; Linux only
      
      * fix setup
      
      * fix datatype include
      
      * fix setup typo
      
      * fix def scope
      
      * new error api
      
      * use paddle fork
      
      * fix attr bug; complete ut
      
      * update flash hash
      
      * fix rng reset
      
      * fix offset
      
      * fix comments
  13. 24 Feb, 2023 1 commit
    • support 'backend' in static ops (#50671) · 363825df
      Committed by HappyHeavyRain
      * support 'backend' in static ops
      
      * change bitwise_xx comment in python
      
      * change bitwise_xxx comment in python
      
      * change 'backend' and 'data_type' in GetExpectedKernelType
  14. 23 Feb, 2023 1 commit
    • Support 'complex promote' in yaml (#50611) · 91a3d159
      Committed by HappyHeavyRain
      * support 'complex promote' in yaml
      
      * change the complex_promote
      
      * change 'kron' in math.py
      
      * change 'kron' comment in python
      
      * change kron comment in python
      
      * change kron comment in python
  15. 21 Feb, 2023 1 commit
    • Support bw invoke fw (#50260) · d8845735
      Committed by HappyHeavyRain
      * support bw invoke fw
      
      * fix scale in static_backward.yaml
      
      * fix the bug in tensorrt/convert
      
      * move 'scale','sign' into ops.yaml
      
      * add scale_grad of scale in op_compat.yaml
      
      * change generated_static_op in CMakeLists.txt
  16. 20 Feb, 2023 1 commit
  17. 03 Feb, 2023 1 commit
  18. 16 Jan, 2023 1 commit
  19. 13 Jan, 2023 1 commit
  20. 12 Jan, 2023 1 commit
  21. 09 Jan, 2023 1 commit
  22. 05 Jan, 2023 2 commits
  23. 30 Dec, 2022 1 commit
  24. 28 Dec, 2022 1 commit
  25. 23 Dec, 2022 1 commit
  26. 20 Dec, 2022 1 commit
    • Generate static graph code of some ops (#49092) · 11d7026b
      Committed by HappyHeavyRain
      * generate static graph code of some ops
      
      * change the default value of 'num' of 'unstack'
      
      * revert the pow
      
      * fix the 'real' 'imag' op error because of 'complex'
      
      * fix the code according to review
  27. 13 Dec, 2022 1 commit