1. 28 3月, 2023 4 次提交
  2. 27 3月, 2023 1 次提交
    • HappyHeavyRain's avatar
      Add fuse_ops.yaml and fused_backward.yaml (#52010) · 10145cb6
      HappyHeavyRain 提交于
      * add fused_yaml fused_backward
      
      * fix eager_funciton bug
      
      * add some comment of fused yaml file
      
      * add 'support_dygraph_mode' configuration in fused yaml
      
      * delete some 'fused_api.h' in include file
      
      * add fused flag in api_gen
      10145cb6
  3. 24 3月, 2023 1 次提交
    • Z
      Memory Efficient Attention (#51867) · e5ad3859
      ZhangDY-6483 提交于
      * first version, notest
      
      * return final rst, notest
      
      * use infinity() instead of max
      
      * ut structure
      
      * start up of ut
      
      * generate lse
      
      * update
      
      * add depense
      
      * reconstruct cmake
      
      * move file
      
      * add memory efficient attention and fix blasimpl
      
      * update
      
      * update cmake
      
      * add namespace
      
      * update cmake
      
      * use .cu
      
      * update for pad3d
      
      * bug fix
      
      * bug fix
      
      * update
      
      * bug fix
      
      * update enforce
      
      * add test case
      
      * merge the lse pad
      
      * fix kernel_fn of backward
      
      * fix PADDLE_ENFORCE_EQ and phi_api
      
      * fix PADDLE_ENFORCE
      
      * fix PADDLE_ENFORCE
      
      * rerun coverage
      
      * fix memory efficient attention test
      
      * rerun ci
      
      * add cuda version condition
      
      * add cuda version condition
      
      * delete WIP test
      
      * replace PADDLE_ENFORCE
      
      * edit the namespace of datatype in multiple.cc
      
      * rerun
      
      * rerun
      
      ---------
      Co-authored-by: Nliuyuang <liuyuang@baidu.com>
      e5ad3859
  4. 23 3月, 2023 1 次提交
  5. 22 3月, 2023 3 次提交
  6. 21 3月, 2023 1 次提交
  7. 20 3月, 2023 3 次提交
  8. 16 3月, 2023 1 次提交
  9. 10 3月, 2023 2 次提交
    • HappyHeavyRain's avatar
      [New features]Add function node in phi_kernel for MKLDNN (#51073) · a0a6dc6a
      HappyHeavyRain 提交于
      * Add function node in phi_kernel for MKLDNN
      
      * fix the bug in 'BuildInferVarKernelContext'
      
      * add infer_varkernel_utils.cc
      
      * fix the bug:the first two parametes of 'BuildInferVarKernelContext' can't be template variable
      
      * change the code according to first review
      
      * change the code according to first review
      
      * change the mode of paddle_build.sh
      
      * change 'infer_var_kernel_fn_' to 'get_kerneltype_forvar_fn_'
      
      * add the error information
      
      * fix NotFound infomation warning
      
      * fix NotFound infomation warning
      
      * fix NotFound infomation warning
      a0a6dc6a
    • C
      add flashattn raw kernel (#51383) · f951832d
      Chitsing KUI 提交于
      f951832d
  10. 09 3月, 2023 2 次提交
    • G
      add prim erf grad (#50436) · b7e4d974
      GGBond8488 提交于
      * add prim erf grad
      
      * add yaml config for prim erf grad
      
      * add math.h
      
      * add cmath
      
      * add math  defines
      
      * use define math
      
      * use define math
      
      * define M_2_SQRTPI
      
      * M_2_SQRTPI math
      
      * try math.h
      
      * fix typro
      
      * remove pow in erf grad
      
      * use new optest
      
      * add fp16 fp32 test
      
      * remove fp16 test
      b7e4d974
    • W
      Add softplus double grad (#50261) · 542844b4
      will-jl944 提交于
      * add softplus double grad
      
      * use constant method
      542844b4
  11. 08 3月, 2023 1 次提交
  12. 06 3月, 2023 1 次提交
  13. 03 3月, 2023 1 次提交
  14. 01 3月, 2023 2 次提交
    • C
      Integration flash attention (#49869) · 61611786
      Chitsing KUI 提交于
      * flash attn
      
      * seed
      
      * almost
      
      * softmax
      
      * fix workspace
      
      * add unitest; linux only
      
      * fix setup
      
      * fix datatype include
      
      * fix setup typo
      
      * fix def scope
      
      * new error api
      
      * use paddle fork
      
      * fix attr bug; complete ut
      
      * update flash hash
      
      * fix rng reset
      
      * fix offset
      
      * fix comments
      61611786
    • Z
      add topk prim backward (#50679) · 296b3ff0
      zqw_1997 提交于
      * tmp gather vjp
      
      * support gather
      
      * remove useless code
      
      * fix compiling error
      
      * fix ut
      
      * add eager test
      
      * add eager test
      
      * add seed
      
      * small change
      
      * fix cpu error
      
      * fix transpose op compat
      
      * remove tensor index case
      
      * fix prim_cinn
      
      * small commit
      
      * add cumsum prim backward
      
      * small commit
      
      * skip aixs=None test case
      
      * fix op generante eror
      
      * fix static test error
      
      * remove unused code
      
      * fix static test error
      
      * small commit
      
      * skip cpu float16 test case
      
      * skip eager cpu cumsum float16 test case
      
      * add eager and static UT
      
      * fix ut
      
      * add composite backward rule
      
      * fix error
      
      * fix type error and format error
      
      * add try cpu+float16 test
      
      * fix test bugs
      
      * remove test for cpu+float16 and make y[0] be the grad arg
      
      * add cinn test
      
      * fix UT
      
      * fix the wrong dim of v in test cases
      
      * change y[0] to y[1] for grad in UT
      
      * reshape flatten out
      
      * Disable cinn single test
      
      * use scatter_nd_add
      
      * modify the reshape part of topk_grad
      
      * delete useless build file
      
      * to make the syntax right
      
      * modify bug
      
      * try use of put_along_axis
      
      * remove cinn test
      
      * reformat todo
      
      * add silu composite rule
      
      * fix code style.
      
      * add cinn test
      
      * fix composite grad maker code gen
      
      * add prim in cumsum op test
      
      * remove old test
      
      * fix typro
      
      * pass the static test
      
      * fix typro
      
      * modify optest and delete old test files
      
      * remove normal test_top_k_op test
      
      * fix typro
      
      * pass axis=None test case
      
      * buffer comment
      
      * for debug
      
      * add silu fp16 unit test.
      
      * add static guard
      
      * remove forward prim test
      
      * remove same name axis
      
      * modify the test_top_v2_op.py to pass all local tests
      
      * delete the useless testcase
      
      * fix mistake
      
      * add more testcases to test dtype16 and dtype32
      
      ---------
      Co-authored-by: NJiabinYang <360788950@qq.com>
      Co-authored-by: NGGBond8488 <857631483@qq.com>
      Co-authored-by: Nzxcd <228587199@qq.com>
      Co-authored-by: NCharles-hit <wanghao107@baidu.com>
      296b3ff0
  15. 23 2月, 2023 1 次提交
    • HappyHeavyRain's avatar
      Support 'complex promote' in yaml (#50611) · 91a3d159
      HappyHeavyRain 提交于
      * support 'complex promote' in yaml
      
      * change the compplex_promote
      
      * change 'kron' in math.py
      
      * change 'kron' comment in python
      
      * change kron comment in python
      
      * change kron comment in python
      91a3d159
  16. 21 2月, 2023 1 次提交
    • HappyHeavyRain's avatar
      Support bw invoke fw (#50260) · d8845735
      HappyHeavyRain 提交于
      * support bw invoke fw
      
      * fix scale in static_backward.yaml
      
      * fix the bug in tensorrt/convert
      
      * move 'scale','sign' into ops.yaml
      
      * add scale_grad of scale in op_compat.yaml
      
      * change generated_static_op in CMakeLists.txt
      d8845735
  17. 03 2月, 2023 1 次提交
  18. 17 1月, 2023 1 次提交
    • X
      【Prim】Add multiply,expand,div vjp rules (#49831) · 39c6765a
      Xiaoxu Chen 提交于
      * support elementwise base func
      
      * fix compiling error and add test
      
      * support vjp for div using comp
      
      * remove additional change
      
      * fix dy2st error with magic num
      
      * fix dy magic num
      
      * another magic
      
      * another magic
      
      * another magic
      
      * add skip rename strategy
      
      * support add vjp
      
      * support add with new axis cal
      
      * support sub vjp
      
      * [prim] add multiply vjp rules
      
      * [prim] add multiply vjp rules
      
      * [prim] fix no infershape with composite in _append_backward_ops
      
      * [prim] add expand vjp rule
      
      * [prim] add exp vjp rule
      
      * uncomment infer shape for reshape/sum static prim api
      
      * [prim] fix tanh nullptr error
      
      * remove some print message
      
      * fix magic number in run_program relative tests @JiaBinYang
      
      * [prim] add expand,multiply,exp vjp rules
      
      * fix only support single direction reduce error
      
      * infer reduce dims using out dims
      Co-authored-by: NJiabinYang <360788950@qq.com>
      39c6765a
  19. 16 1月, 2023 2 次提交
  20. 13 1月, 2023 1 次提交
  21. 09 1月, 2023 2 次提交
    • HappyHeavyRain's avatar
      Support the 'drop_empty_grad' in of output of backward_ops (#49588) · 36c6c589
      HappyHeavyRain 提交于
      * support the drop_empty_grad in backward
      
      * change code according to yunfei's review suggestion
      36c6c589
    • J
      Prim paddle Basic (#49272) · 2f601282
      Jiabin Yang 提交于
      * proto type of composite grad in paddle
      
      * proto type of composite grad in paddle
      
      * refactor composite api with phi
      
      * fix compile error
      
      * support static graph code-gen for squeeze op
      
      * generate static graph code of unsqueeze
      
      * refine op name
      
      * fix compile error
      
      * add extra output in op_compat
      
      * remove debug log
      
      * fix clang compile error
      
      * support prim switch flag
      
      * support prim switch flag
      
      * fix dygraph error
      
      * merge develop
      
      * add code_gen
      
      * add necessary files without codegen
      
      * fix code_gen bug
      
      * add deps
      
      * modify igmnore
      
      * add ignore
      
      * delete std cout
      
      * add composite logic for backward.py
      
      * add tanh first order grad composite
      
      * support enable_prim flag for static graph
      
      * throw expection when both GrapOpMaker and GradCompOpMaker not been registered
      
      * reorganize the directory of prim api tests
      
      * fix windows error
      
      * add eager_utils
      
      * add eager_utils
      
      * modify code gen
      
      * add composite parse
      
      * add unittest for get_grad_op_desc
      
      * code optimize
      
      * fix static test on windows
      
      * support generate static graph code for imag and real op
      
      * fix windows compile error in test_static_prim
      
      * merge develop
      
      * disable test eager in inference
      
      * prim code gen
      
      * disable eager compile in inference
      
      * rm other file
      
      * rm gitignore file
      
      * code_style
      
      * add eager test
      
      * code_style
      
      * merge develop
      
      * remove useless files
      
      * modify static test
      
      * support bool flag from singlton
      
      * merge develop
      
      * recover git ignore
      
      * fix conflict
      
      * recover git ignore for generated op
      
      * fix test compile error
      
      * remove some tests
      
      * add python test
      
      * fix some name issue
      
      * add composite code gen
      
      * modify backward yaml
      
      * fix static composite grad maker code gen
      
      * remove addtional files
      
      * add some static funcs unit test
      
      * fix some bugs
      
      * fix composite grad maker register code gen
      
      * optimize some functions
      Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com>
      Co-authored-by: Nwangruting <wangruting@baidu.com>
      Co-authored-by: Ncxxly <chenxx_id@163.com>
      Co-authored-by: Ncharles-hit <wanghao107@baidu.com>
      Co-authored-by: Nxiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
      2f601282
  22. 05 1月, 2023 2 次提交
  23. 30 12月, 2022 1 次提交
  24. 28 12月, 2022 1 次提交
  25. 23 12月, 2022 1 次提交
  26. 20 12月, 2022 1 次提交
    • HappyHeavyRain's avatar
      Generate static graph code of some ops (#49092) · 11d7026b
      HappyHeavyRain 提交于
      * generate static graph code of some ops
      
      * change the default value of 'num' of 'unstack'
      
      * revert the pow
      
      * fix the 'real' 'imag' op error because of 'complex'
      
      * fix the code according to review
      11d7026b
  27. 13 12月, 2022 1 次提交