1. 07 4月, 2023 2 次提交
  2. 06 4月, 2023 1 次提交
    • K
      feat: add composite rule of roll grad (#52532) · 348a36b5
      Kang Zhao 提交于
      * feat: add relu composite rule
      
      * feat: add relu composite rule, maximum op
      
      * feat: add relu composite rule, maximum op
      
      * feat: add relu composite rule, polish comments
      
      * feat: add relu composite rule, polish comments
      
      * feat: add relu composite rule, add python api of relu
      
      * feat: add relu composite rule, commit hook
      
      * fix: maximum type error & ban cinn test
      
      * fix: maximum input sequence bugs
      
      * resolve conflicts
      
      * fix: code style bugs
      
      * add: relu fp16 test
      
      * feat: add rsqrt composite rule
      
      * feat: add rsqrt composite rule
      
      * resolve conflicts of composite rule
      
      * fix: delete check eager
      
      * feat: add roll grad composite rule
      
      * fix minus shift
      
      * fix test roll op
      348a36b5
  3. 04 4月, 2023 1 次提交
    • R
      Improve new executor static build (#51149) · 5bac67d4
      Ruibiao Chen 提交于
      * Improve new executor static build
      
      * Skip GC for static build
      
      * Skip infershape for static build
      
      * Handle read_op
      
      * Add fused_attention to OpsWithFluidKernelNeedMoveToPhi
      
      * Fix argsort typos
      
      * Add sequence_pool to OpsWithFluidKernelNeedMoveToPhi
      
      * Fix skip share lod errors
      
      * Fix errors for adam
      
      * Fix errors for eigvals, memcpy and fake_quantize
      
      * Add static_build.cc
      
      * Add black list
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Fix TensorArray
      
      * Fix TensorArray
      
      * Add update_loss_scaling to OpsNeedSetOutputDtypeWhenRegisterPhiKernel
      
      * Fix copy
      
      * Fix errors
      
      * Fix momentum
      
      * Skip mkldnn
      
      * Fix CI errors
      
      * Fix c_sync_calc_stream_op
      
      * Fix CINN
      
      * Fix while op
      
      * All CI pass, disable FLAGS to merge code, enable it after more tests in future
      
      * Add UTs
      
      * Fix typos
      
      * Fix typos
      
      * Add mkldnn UT
      
      * Remove mkldnn test
      
      * Fix typos
      
      * Fix dist test
      
      * Fix typos
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Add UTs
      
      * Fix typos
      
      * Fix typos
      
      * Add sparse tests
      
      * ToComplexType -> ToComplex
      
      * Add test_matmul_op_static_build to disable_win_inference_test
      5bac67d4
  4. 31 3月, 2023 2 次提交
  5. 30 3月, 2023 4 次提交
    • W
      support prim & cinn test for layer_norm (#51272) · 84504f35
      Weilong Wu 提交于
      * support layer_norm prim and cinn test
      
      * enable cinn test
      
      * fix merge conflict
      
      * polish input for check_output_with_place
      
      * fix merge conflict
      
      * add more test case
      
      * fix merge conflict
      
      * polish test case
      
      * polish op_test
      
      * change ln_g rules
      
      * modify scale is none case
      
      * modify scale is none case
      
      * add public_python_api for check prim
      
      * modify setoutputgrad and fp64bug
      
      * add todo & delete log
      
      * recover
      
      * fix some errors
      
      * recover
      
      * recover
      
      * recover
      
      * recover
      
      * fix merge conflicts
      
      ---------
      Co-authored-by: Nwangruting <wangruting@baidu.com>
      84504f35
    • Z
      [Move Test] Move prim (#52167) · 3e2d0195
      Zheng-Bicheng 提交于
      * update
      
      * update
      3e2d0195
    • Z
      add scatter composite rule. (#52005) · e16eb22c
      zxcd 提交于
      * add scatter composite rule.
      
      * add public_python_api
      
      * add python unit16 support.
      
      * fix code style.
      
      * add cinn to makelist
      
      * cinn unsupport uint16, forbidden cinn when dtype==uint16.
      e16eb22c
    • G
      [Test Mv] ipu_test (#52143) · 38a477e2
      gouzil 提交于
      * [Test Mv] ipu_test
      
      * [Test Mv] cmake add py_test_modules
      
      * [Move Test] rm py_test_modules
      
      * rm asp
      38a477e2
  6. 29 3月, 2023 4 次提交
  7. 28 3月, 2023 2 次提交
  8. 27 3月, 2023 1 次提交
    • C
      add prim test for some ops (#51749) · e1674e8b
      Charles-hit 提交于
      * add tanh and cast prim test
      
      * fix tanh test
      
      * fix 0-d test
      
      * add sqrt fp16 prim test
      
      * add public_python_api in prim test
      
      * fix test_squeeze2_op
      
      * add tanh prim test
      
      * add dropout prim test
      
      * [Dy2St]Fix clone for test state problem
      
      * clean code
      
      * modify test_cumsum_op
      
      * modify test_cumsum_op
      
      * fix dropout test
      
      * add dropout in cmake
      
      * fix dropout test
      
      ---------
      Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
      e1674e8b
  9. 25 3月, 2023 1 次提交
  10. 24 3月, 2023 1 次提交
    • Z
      Memory Efficient Attention (#51867) · e5ad3859
      ZhangDY-6483 提交于
      * first version, notest
      
      * return final rst, notest
      
      * use infinity() instead of max
      
      * ut structure
      
      * start up of ut
      
      * generate lse
      
      * update
      
      * add depense
      
      * reconstruct cmake
      
      * move file
      
      * add memory efficient attention and fix blasimpl
      
      * update
      
      * update cmake
      
      * add namespace
      
      * update cmake
      
      * use .cu
      
      * update for pad3d
      
      * bug fix
      
      * bug fix
      
      * update
      
      * bug fix
      
      * update enforce
      
      * add test case
      
      * merge the lse pad
      
      * fix kernel_fn of backward
      
      * fix PADDLE_ENFORCE_EQ and phi_api
      
      * fix PADDLE_ENFORCE
      
      * fix PADDLE_ENFORCE
      
      * rerun coverage
      
      * fix memory efficient attention test
      
      * rerun ci
      
      * add cuda version condition
      
      * add cuda version condition
      
      * delete WIP test
      
      * replace PADDLE_ENFORCE
      
      * edit the namespace of datatype in multiple.cc
      
      * rerun
      
      * rerun
      
      ---------
      Co-authored-by: Nliuyuang <liuyuang@baidu.com>
      e5ad3859
  11. 23 3月, 2023 1 次提交
    • C
      [Prim] add meshgrid composite rule (#51061) · 53bb883d
      chenjian 提交于
      * add meshgrid composite rule
      
      * add meshgrid composite rule
      
      * update
      
      * add into CMakeLists
      
      * fix
      
      * update
      
      * update
      
      * optimize code
      
      * fix meshgrid op
      
      * update test
      53bb883d
  12. 22 3月, 2023 1 次提交
    • G
      Add fused_feed_forward pass (#50423) · 5dda0ef6
      Ghost Screaming 提交于
      * Add fused_feed_forward pass for semi-automatic static graph training.
      
      * Add fused_feedforward property in parallel_executor.cc
      
      * Polish code.
      
      * Polish fused feed_forward pass code. Support use_dropout1 and
      use_dropout2 option.
      
      * Support model parallel in fused_feedforward pass.
      5dda0ef6
  13. 20 3月, 2023 1 次提交
    • warrentdrew's avatar
      add composite rules for squeeze op (#51539) · 89ff0d59
      warrentdrew 提交于
      * add composite rule for squeeze
      
      * fix pre commit
      
      * fix pre commit
      
      * simplify rules
      
      * arrange code
      
      * fix int axis
      
      * simplify squeeze axis rules
      
      * bugfix
      
      * fix pre commit
      89ff0d59
  14. 17 3月, 2023 1 次提交
  15. 16 3月, 2023 3 次提交
  16. 15 3月, 2023 3 次提交
  17. 13 3月, 2023 3 次提交
  18. 10 3月, 2023 1 次提交
  19. 09 3月, 2023 2 次提交
  20. 08 3月, 2023 1 次提交
  21. 07 3月, 2023 1 次提交
  22. 03 3月, 2023 1 次提交
  23. 02 3月, 2023 2 次提交
    • C
      Add prim test for elementwise ops (#50807) · b8713309
      Charles-hit 提交于
      * fix prim_op_test when python api outs is different with kernel sig
      
      * add elementwise op prim test
      
      * fix unit test
      
      * add bfloat16 for full in static  prim api
      
      * empty-commit
      
      * close bf16 test
      
      * polish elementwise tests
      b8713309
    • W
      Add concat grad cinn (#50972) · a4689c90
      wangzhen38 提交于
      * [cinn] concat_grad
      
      * [cinn] concat_grad
      
      * [cinn] concat_grad build success
      
      * [Add PGLBOX] fix unnitest
      
      * [Add PGLBOX] fix unnitest
      
      * [Add PGLBOX] fix codestyle
      
      * [cinn] update by comments
      
      * [cinn] update by comment
      
      * [cinn] add axis check
      a4689c90