1. 30 3月, 2023 4 次提交
    • W
      support prim & cinn test for layer_norm (#51272) · 84504f35
      Weilong Wu 提交于
      * support layer_norm prim and cinn test
      
      * enable cinn test
      
      * fix merge conflict
      
      * polish input for check_output_with_place
      
      * fix merge conflict
      
      * add more test case
      
      * fix merge conflict
      
      * polish test case
      
      * polish op_test
      
      * change ln_g rules
      
      * modify scale is none case
      
      * modify scale is none case
      
      * add public_python_api for check prim
      
      * modify setoutputgrad and fp64bug
      
      * add todo & delete log
      
      * recover
      
      * fix some errors
      
      * recover
      
      * recover
      
      * recover
      
      * recover
      
      * fix merge conflicts
      
      ---------
      Co-authored-by: Nwangruting <wangruting@baidu.com>
      84504f35
    • Z
      [Move Test] Move prim (#52167) · 3e2d0195
      Zheng-Bicheng 提交于
      * update
      
      * update
      3e2d0195
    • Z
      add scatter composite rule. (#52005) · e16eb22c
      zxcd 提交于
      * add scatter composite rule.
      
      * add public_python_api
      
      * add python unit16 support.
      
      * fix code style.
      
      * add cinn to makelist
      
      * cinn unsupport uint16, forbidden cinn when dtype==uint16.
      e16eb22c
    • G
      [Test Mv] ipu_test (#52143) · 38a477e2
      gouzil 提交于
      * [Test Mv] ipu_test
      
      * [Test Mv] cmake add py_test_modules
      
      * [Move Test] rm py_test_modules
      
      * rm asp
      38a477e2
  2. 29 3月, 2023 4 次提交
  3. 28 3月, 2023 2 次提交
  4. 27 3月, 2023 1 次提交
    • C
      add prim test for some ops (#51749) · e1674e8b
      Charles-hit 提交于
      * add tanh and cast prim test
      
      * fix tanh test
      
      * fix 0-d test
      
      * add sqrt fp16 prim test
      
      * add public_python_api in prim test
      
      * fix test_squeeze2_op
      
      * add tanh prim test
      
      * add dropout prim test
      
      * [Dy2St]Fix clone for test state problem
      
      * clean code
      
      * modify test_cumsum_op
      
      * modify test_cumsum_op
      
      * fix dropout test
      
      * add dropout in cmake
      
      * fix dropout test
      
      ---------
      Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
      e1674e8b
  5. 25 3月, 2023 1 次提交
  6. 24 3月, 2023 1 次提交
    • Z
      Memory Efficient Attention (#51867) · e5ad3859
      ZhangDY-6483 提交于
      * first version, notest
      
      * return final rst, notest
      
      * use infinity() instead of max
      
      * ut structure
      
      * start up of ut
      
      * generate lse
      
      * update
      
      * add depense
      
      * reconstruct cmake
      
      * move file
      
      * add memory efficient attention and fix blasimpl
      
      * update
      
      * update cmake
      
      * add namespace
      
      * update cmake
      
      * use .cu
      
      * update for pad3d
      
      * bug fix
      
      * bug fix
      
      * update
      
      * bug fix
      
      * update enforce
      
      * add test case
      
      * merge the lse pad
      
      * fix kernel_fn of backward
      
      * fix PADDLE_ENFORCE_EQ and phi_api
      
      * fix PADDLE_ENFORCE
      
      * fix PADDLE_ENFORCE
      
      * rerun coverage
      
      * fix memory efficient attention test
      
      * rerun ci
      
      * add cuda version condition
      
      * add cuda version condition
      
      * delete WIP test
      
      * replace PADDLE_ENFORCE
      
      * edit the namespace of datatype in multiple.cc
      
      * rerun
      
      * rerun
      
      ---------
      Co-authored-by: Nliuyuang <liuyuang@baidu.com>
      e5ad3859
  7. 23 3月, 2023 1 次提交
    • C
      [Prim] add meshgrid composite rule (#51061) · 53bb883d
      chenjian 提交于
      * add meshgrid composite rule
      
      * add meshgrid composite rule
      
      * update
      
      * add into CMakeLists
      
      * fix
      
      * update
      
      * update
      
      * optimize code
      
      * fix meshgrid op
      
      * update test
      53bb883d
  8. 22 3月, 2023 1 次提交
    • G
      Add fused_feed_forward pass (#50423) · 5dda0ef6
      Ghost Screaming 提交于
      * Add fused_feed_forward pass for semi-automatic static graph training.
      
      * Add fused_feedforward property in parallel_executor.cc
      
      * Polish code.
      
      * Polish fused feed_forward pass code. Support use_dropout1 and
      use_dropout2 option.
      
      * Support model parallel in fused_feedforward pass.
      5dda0ef6
  9. 20 3月, 2023 1 次提交
    • warrentdrew's avatar
      add composite rules for squeeze op (#51539) · 89ff0d59
      warrentdrew 提交于
      * add composite rule for squeeze
      
      * fix pre commit
      
      * fix pre commit
      
      * simplify rules
      
      * arrange code
      
      * fix int axis
      
      * simplify squeeze axis rules
      
      * bugfix
      
      * fix pre commit
      89ff0d59
  10. 17 3月, 2023 1 次提交
  11. 16 3月, 2023 3 次提交
  12. 15 3月, 2023 3 次提交
  13. 13 3月, 2023 3 次提交
  14. 10 3月, 2023 1 次提交
  15. 09 3月, 2023 2 次提交
  16. 08 3月, 2023 1 次提交
  17. 07 3月, 2023 1 次提交
  18. 03 3月, 2023 1 次提交
  19. 02 3月, 2023 2 次提交
    • C
      Add prim test for elementwise ops (#50807) · b8713309
      Charles-hit 提交于
      * fix prim_op_test when python api outs is different with kernel sig
      
      * add elementwise op prim test
      
      * fix unit test
      
      * add bfloat16 for full in static  prim api
      
      * empty-commit
      
      * close bf16 test
      
      * polish elementwise tests
      b8713309
    • W
      Add concat grad cinn (#50972) · a4689c90
      wangzhen38 提交于
      * [cinn] concat_grad
      
      * [cinn] concat_grad
      
      * [cinn] concat_grad build success
      
      * [Add PGLBOX] fix unnitest
      
      * [Add PGLBOX] fix unnitest
      
      * [Add PGLBOX] fix codestyle
      
      * [cinn] update by comments
      
      * [cinn] update by comment
      
      * [cinn] add axis check
      a4689c90
  20. 01 3月, 2023 3 次提交
    • C
      Integration flash attention (#49869) · 61611786
      Chitsing KUI 提交于
      * flash attn
      
      * seed
      
      * almost
      
      * softmax
      
      * fix workspace
      
      * add unitest; linux only
      
      * fix setup
      
      * fix datatype include
      
      * fix setup typo
      
      * fix def scope
      
      * new error api
      
      * use paddle fork
      
      * fix attr bug; complete ut
      
      * update flash hash
      
      * fix rng reset
      
      * fix offset
      
      * fix comments
      61611786
    • Z
      add topk prim backward (#50679) · 296b3ff0
      zqw_1997 提交于
      * tmp gather vjp
      
      * support gather
      
      * remove useless code
      
      * fix compiling error
      
      * fix ut
      
      * add eager test
      
      * add eager test
      
      * add seed
      
      * small change
      
      * fix cpu error
      
      * fix transpose op compat
      
      * remove tensor index case
      
      * fix prim_cinn
      
      * small commit
      
      * add cumsum prim backward
      
      * small commit
      
      * skip aixs=None test case
      
      * fix op generante eror
      
      * fix static test error
      
      * remove unused code
      
      * fix static test error
      
      * small commit
      
      * skip cpu float16 test case
      
      * skip eager cpu cumsum float16 test case
      
      * add eager and static UT
      
      * fix ut
      
      * add composite backward rule
      
      * fix error
      
      * fix type error and format error
      
      * add try cpu+float16 test
      
      * fix test bugs
      
      * remove test for cpu+float16 and make y[0] be the grad arg
      
      * add cinn test
      
      * fix UT
      
      * fix the wrong dim of v in test cases
      
      * change y[0] to y[1] for grad in UT
      
      * reshape flatten out
      
      * Disable cinn single test
      
      * use scatter_nd_add
      
      * modify the reshape part of topk_grad
      
      * delete useless build file
      
      * to make the syntax right
      
      * modify bug
      
      * try use of put_along_axis
      
      * remove cinn test
      
      * reformat todo
      
      * add silu composite rule
      
      * fix code style.
      
      * add cinn test
      
      * fix composite grad maker code gen
      
      * add prim in cumsum op test
      
      * remove old test
      
      * fix typro
      
      * pass the static test
      
      * fix typro
      
      * modify optest and delete old test files
      
      * remove normal test_top_k_op test
      
      * fix typro
      
      * pass axis=None test case
      
      * buffer comment
      
      * for debug
      
      * add silu fp16 unit test.
      
      * add static guard
      
      * remove forward prim test
      
      * remove same name axis
      
      * modify the test_top_v2_op.py to pass all local tests
      
      * delete the useless testcase
      
      * fix mistake
      
      * add more testcases to test dtype16 and dtype32
      
      ---------
      Co-authored-by: NJiabinYang <360788950@qq.com>
      Co-authored-by: NGGBond8488 <857631483@qq.com>
      Co-authored-by: Nzxcd <228587199@qq.com>
      Co-authored-by: NCharles-hit <wanghao107@baidu.com>
      296b3ff0
    • Y
      Add full_like composite rule (#50794) · 7468bab4
      Yichen Zhang 提交于
      * implement composite full_like and simple unit test
      
      * implement op tests for composite full_like op
      
      * some modification as reviewers suggested
      add cinn op test to CMakeLists.txt
      fix code style
      
      * fix code style
      
      * modify input args of prim fill_any_like op
      
      * resolve conflicts
      
      * resolve conflicts
      
      * modify python api and unit tests as suggested
      
      * resolve conflicts
      
      * resolve conflicts
      
      * use framework.dtype to convert dtype in Op test
      7468bab4
  21. 27 2月, 2023 1 次提交
  22. 24 2月, 2023 1 次提交
    • X
      【prim】Slice grad (#50771) · f6dea800
      xiaoguoguo626807 提交于
      * support prim test in OpTest
      
      * fix cmake
      
      * fix op test
      
      * fix test_input_spec
      
      * disable cinn in reduce_sum unit test
      
      * add bfloat16 dtype for sum
      
      * add approve rules
      
      * polish code
      
      * add clear jit program function
      
      * convert grad out from tensor to numpy
      
      * remove unnecessary code
      
      * add only_prim flag
      
      * fix flag
      
      * fix op test
      
      * add attr
      
      * fix optest comp inplace error
      
      * fix op test
      
      * fix op test with guard
      
      * add initialization of check_comp flag
      
      * fix comp inplace error in op test
      
      * rename check_comp with check_prim and add bfloat16 dtype convert
      
      * rename comp_op_type to prim_op_type
      
      * rename comp to prim
      
      * remove useless code
      
      * skip ci check for only prim
      
      * add no_grad_vars and grad_outputs in prim test
      
      * fix var_dict
      
      * fix op test for only_prim
      
      * fix dy2static bugs
      
      * polish some code
      
      * temp
      
      * modify op test
      
      * except cinn test
      
      * modify bfp16
      
      * modify pad grad
      
      * add pad_grad dtype
      
      * start cinn part
      
      ---------
      Co-authored-by: NCharles-hit <wanghao107@baidu.com>
      f6dea800
  23. 21 2月, 2023 1 次提交