1. 17 5月, 2023 1 次提交
  2. 16 5月, 2023 1 次提交
    • S
      Move fused batchnorm to Phi (#53476) · 5e5481d8
      Sonder 提交于
      * trans fused batch norm Compute function
      
      * trans batch norm register info to phi
      
      * trans fused batch norm grad Compute
      
      * trans batch norm grad register info
      
      * add sig file
      
      * update sig file
      
      * Update fused_bn_activation_kernel.cu
      
      * Update fused_bn_activation_grad_kernel.cu
      
      * fix
      
      * Rename fused_bn_activation_kernel_grad.cu to fused_bn_activation_kernel.cu
      
      * fix
      
      * fix
      
      * fix CudnnDataType error
      
      * fix
      
      * fix include
      
      * update
      
      * add #if
      
      * add fused bn act to cmakelist.txt
      
      * update  cmakelist
      
      * fix #ifdef error
      
      * add timeout set
      
      * add env set
      
      * fix
      
      * fix
      
      * Update fused_bn_activation_sig.cc
      5e5481d8
  3. 10 5月, 2023 1 次提交
    • add index_put api (#52886) · f3393f49
      傅剑寒 提交于
      * add index_put api
      
      * fix value broadcast in backward and add test case in static
      
      * add timeout=120s for index_put
      
      * add op_compat for index_put
      
      * add inplace index_put test
      
      * add test case when index tensor in indices is int32 when indices.size less than x.dims
      
      * add index_put api backward in cpu place
      
      * add backward test case
      
      * refactor code to delete some duplicated code
      
      * replace reshape with resize for decrease extra memcpy
      
      * add datatype flag in backward yaml
      
      * fix bug in documentation
      
      * Update python/paddle/tensor/manipulation.py
      
      ---------
      Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
      f3393f49
  4. 27 4月, 2023 2 次提交
    • S
      Move fused feedforward (#53166) · 25b4ba7f
      Sonder 提交于
      * trans fused_feedward Compute function to phi
      
      * add register info
      
      * remove maxfunctor
      
      * move fused feedward to phi
      
      * remove sig file
      
      * remove fliud include
      
      * add include
      
      * add include
      
      * add sig file
      
      * add output register info
      
      * fix sig file
      
      * Update fused_feedforward_sig.cc
      
      * fix grad kernel
      
      * update output register info
      
      * fix
      
      * open fused_feedforward static build
      
      * add optional and fix code style
      
      * fix output info for fused attention
      
      * add optional param
      
      * merge
      25b4ba7f
    • X
      【prim】Concat bug (#53350) · 6768c6ec
      xiaoguoguo626807 提交于
      * modify concat_grad add sum comp rule
      
      * modify opcompat
      6768c6ec
  5. 24 4月, 2023 1 次提交
  6. 22 4月, 2023 1 次提交
  7. 19 4月, 2023 2 次提交
  8. 17 4月, 2023 4 次提交
    • T
      mv ps distributed dir (#52885) · 1765d5d1
      tianshuo78520a 提交于
      * mv ps distributed dir
      
      * fix
      
      * add del auto_parallel
      
      * add auto_parallel
      
      * fix ps
      
      * fix bug
      
      * fix test bug
      
      * fix test bug
      
      * merge develop fix error
      
      * merge develop fix error
      
      * merge develop fix error
      1765d5d1
    • L
      [Test Mv] remove rnn (#52967) · 5e29f30c
      liulinduo 提交于
      * [Test Mv] remove rnn
      
      * Update test_rnn_cell_api.py
      5e29f30c
    • T
      mv ir test (#52834) · b8a848bb
      tianshuo78520a 提交于
      b8a848bb
    • S
      Add output defs for some kernelsPhi register (#52941) · 23f87442
      Sonder 提交于
      * add register info for eigh and eig_gard
      
      * add sync_batch_norm_op.cu register info
      
      * add lamb output register info
      
      * add unique register info
      
      * change type name
      
      * change type name
      
      * add output register info for check_finite_and_unscale
      
      * update cmake and config file
      
      * add register info for adagrad
      
      * fix build error
      
      * add sync to run_unittests.sh
      
      * add register info for unique_consecutive
      
      * fix build error
      
      * add eigh to STATIC_BUILD_TESTS
      
      * update eig_kernel.cc
      
      * update eig_kernel.cc
      
      * fix infer mate error
      
      * fix unique register error
      
      * fix lamb register info error
      
      * fix lamb register info
      
      * update lamb register info
      
      * fix lamb
      
      * remove one Output Register
      
      * update static build file
      
      * add eigh op to disable_wingpu_test
      
      * update run_unittests
      23f87442
  9. 13 4月, 2023 1 次提交
  10. 12 4月, 2023 1 次提交
    • R
      [Move Test] xpu (#52661) · 9a7c83bd
      RedContritio 提交于
      * move python/paddle/fluid/tests/unittests/xpu to test/xpu
      
      * update CMakeLists.txt
      
      * remove xpu in fluid/tests/unittests/
      
      * add path to op_test_xpu
      
      * fix incorrect path
      
      * update test script
      
      * fix test_adadelta_op_xpu error
      9a7c83bd
  11. 11 4月, 2023 3 次提交
  12. 10 4月, 2023 2 次提交
  13. 08 4月, 2023 1 次提交
  14. 07 4月, 2023 2 次提交
  15. 06 4月, 2023 1 次提交
    • K
      feat: add composite rule of roll grad (#52532) · 348a36b5
      Kang Zhao 提交于
      * feat: add relu composite rule
      
      * feat: add relu composite rule, maximum op
      
      * feat: add relu composite rule, maximum op
      
      * feat: add relu composite rule, polish comments
      
      * feat: add relu composite rule, polish comments
      
      * feat: add relu composite rule, add python api of relu
      
      * feat: add relu composite rule, commit hook
      
      * fix: maximum type error & ban cinn test
      
      * fix: maximum input sequence bugs
      
      * resolve conflicts
      
      * fix: code style bugs
      
      * add: relu fp16 test
      
      * feat: add rsqrt composite rule
      
      * feat: add rsqrt composite rule
      
      * resolve conflicts of composite rule
      
      * fix: delete check eager
      
      * feat: add roll grad composite rule
      
      * fix minus shift
      
      * fix test roll op
      348a36b5
  16. 04 4月, 2023 1 次提交
    • R
      Improve new executor static build (#51149) · 5bac67d4
      Ruibiao Chen 提交于
      * Improve new executor static build
      
      * Skip GC for static build
      
      * Skip infershape for static build
      
      * Handle read_op
      
      * Add fused_attention to OpsWithFluidKernelNeedMoveToPhi
      
      * Fix argsort typos
      
      * Add sequence_pool to OpsWithFluidKernelNeedMoveToPhi
      
      * Fix skip share lod errors
      
      * Fix errors for adam
      
      * Fix errors for eigvals, memcpy and fake_quantize
      
      * Add static_build.cc
      
      * Add black list
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Fix TensorArray
      
      * Fix TensorArray
      
      * Add update_loss_scaling to OpsNeedSetOutputDtypeWhenRegisterPhiKernel
      
      * Fix copy
      
      * Fix errors
      
      * Fix momentum
      
      * Skip mkldnn
      
      * Fix CI errors
      
      * Fix c_sync_calc_stream_op
      
      * Fix CINN
      
      * Fix while op
      
      * All CI pass, disable FLAGS to merge code, enable it after more tests in future
      
      * Add UTs
      
      * Fix typos
      
      * Fix typos
      
      * Add mkldnn UT
      
      * Remove mkldnn test
      
      * Fix typos
      
      * Fix dist test
      
      * Fix typos
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Add UTs
      
      * Fix typos
      
      * Fix typos
      
      * Add sparse tests
      
      * ToComplexType -> ToComplex
      
      * Add test_matmul_op_static_build to disable_win_inference_test
      5bac67d4
  17. 31 3月, 2023 2 次提交
  18. 30 3月, 2023 4 次提交
    • W
      support prim & cinn test for layer_norm (#51272) · 84504f35
      Weilong Wu 提交于
      * support layer_norm prim and cinn test
      
      * enable cinn test
      
      * fix merge conflict
      
      * polish input for check_output_with_place
      
      * fix merge conflict
      
      * add more test case
      
      * fix merge conflict
      
      * polish test case
      
      * polish op_test
      
      * change ln_g rules
      
      * modify scale is none case
      
      * modify scale is none case
      
      * add public_python_api for check prim
      
      * modify setoutputgrad and fp64bug
      
      * add todo & delete log
      
      * recover
      
      * fix some errors
      
      * recover
      
      * recover
      
      * recover
      
      * recover
      
      * fix merge conflicts
      
      ---------
      Co-authored-by: Nwangruting <wangruting@baidu.com>
      84504f35
    • Z
      [Move Test] Move prim (#52167) · 3e2d0195
      Zheng-Bicheng 提交于
      * update
      
      * update
      3e2d0195
    • Z
      add scatter composite rule. (#52005) · e16eb22c
      zxcd 提交于
      * add scatter composite rule.
      
      * add public_python_api
      
      * add python unit16 support.
      
      * fix code style.
      
      * add cinn to makelist
      
      * cinn unsupport uint16, forbidden cinn when dtype==uint16.
      e16eb22c
    • G
      [Test Mv] ipu_test (#52143) · 38a477e2
      gouzil 提交于
      * [Test Mv] ipu_test
      
      * [Test Mv] cmake add py_test_modules
      
      * [Move Test] rm py_test_modules
      
      * rm asp
      38a477e2
  19. 29 3月, 2023 4 次提交
  20. 28 3月, 2023 2 次提交
  21. 27 3月, 2023 1 次提交
    • C
      add prim test for some ops (#51749) · e1674e8b
      Charles-hit 提交于
      * add tanh and cast prim test
      
      * fix tanh test
      
      * fix 0-d test
      
      * add sqrt fp16 prim test
      
      * add public_python_api in prim test
      
      * fix test_squeeze2_op
      
      * add tanh prim test
      
      * add dropout prim test
      
      * [Dy2St]Fix clone for test state problem
      
      * clean code
      
      * modify test_cumsum_op
      
      * modify test_cumsum_op
      
      * fix dropout test
      
      * add dropout in cmake
      
      * fix dropout test
      
      ---------
      Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
      e1674e8b
  22. 25 3月, 2023 1 次提交
  23. 24 3月, 2023 1 次提交
    • Z
      Memory Efficient Attention (#51867) · e5ad3859
      ZhangDY-6483 提交于
      * first version, notest
      
      * return final rst, notest
      
      * use infinity() instead of max
      
      * ut structure
      
      * start up of ut
      
      * generate lse
      
      * update
      
      * add depense
      
      * reconstruct cmake
      
      * move file
      
      * add memory efficient attention and fix blasimpl
      
      * update
      
      * update cmake
      
      * add namespace
      
      * update cmake
      
      * use .cu
      
      * update for pad3d
      
      * bug fix
      
      * bug fix
      
      * update
      
      * bug fix
      
      * update enforce
      
      * add test case
      
      * merge the lse pad
      
      * fix kernel_fn of backward
      
      * fix PADDLE_ENFORCE_EQ and phi_api
      
      * fix PADDLE_ENFORCE
      
      * fix PADDLE_ENFORCE
      
      * rerun coverage
      
      * fix memory efficient attention test
      
      * rerun ci
      
      * add cuda version condition
      
      * add cuda version condition
      
      * delete WIP test
      
      * replace PADDLE_ENFORCE
      
      * edit the namespace of datatype in multiple.cc
      
      * rerun
      
      * rerun
      
      ---------
      Co-authored-by: Nliuyuang <liuyuang@baidu.com>
      e5ad3859