1. 10 May, 2023 (2 commits)
    • add index_put api (#52886) · f3393f49
      Committed by 傅剑寒
      * add index_put api
      
      * fix value broadcast in backward and add test case in static
      
      * add timeout=120s for index_put
      
      * add op_compat for index_put
      
      * add inplace index_put test
      
      * add test case when index tensor in indices is int32 when indices.size less than x.dims
      
      * add index_put api backward in cpu place
      
      * add backward test case
      
      * refactor code to delete some duplicated code
      
      * replace reshape with resize for decrease extra memcpy
      
      * add datatype flag in backward yaml
      
      * fix bug in documentation
      
      * Update python/paddle/tensor/manipulation.py
      
      ---------
      Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
    • remove some [-Wunused-parameter] warning and WITH_DISTRIBUT flags (#53650) · 65e57a7d
      Committed by Galaxy1458
      * test,test=develop
      
      * test,test=develop
      
      * test,test=develop
      
      * test,test=develop
      
      * test,test=develop
  2. 09 May, 2023 (3 commits)
  3. 08 May, 2023 (1 commit)
    • add complex support for optest (#53356) · e522ceb7
      Committed by GGBond8488
      * add complex support for  optest
      
      * add complex grad test
      
      * append one
      
      * move some debug info
      
      * move some debug info
      
      * move some debug info
      
      * move some debug info
      
      * add more complex test
      
      * Fix naming ambiguity
      
      * Revert "add more complex test"
      
      This reverts commit dbcb0516b8e53ba42e2d6089878a39b395345969.
      
      * change backward gradient, add TODO
  4. 06 May, 2023 (2 commits)
    • Rename randint_raw and move it to legacy (#53157) · 3e7be9c9
      Committed by zhangyuqin1998
      * Rename randint_raw and move it to legacy
      
      * Update fetch_v2_op.cc
      
      * Update randint_kernel.cc
      
      * Update randint_kernel.cu
      
      * Empty Commit to setup deployments
    • move UniformRawKernel to legacy (#53158) · 13e2e10c
      Committed by zhangyuqin1998
      * move UniformRawKernel to legacy
      
      * Update uniform_kernel.cc
      
      * Update uniform_kernel.cu
      
      * Update uniform_kernel.cc
      
      * Update uniform_kernel.cu
      
      * Update uniform_kernel.h
      
      * Update uniform_kernel.cc
      
      * Empty Commit to setup deployments
  5. 28 April, 2023 (1 commit)
    • Dropout optimize & clean broadcast inT and ElementwiseType (#52969) · d611e48c
      Committed by Bo Zhang
      * change judgement for DropoutGradGPUKernelDriver
      
      * add UnrollerWithoutVecSize and after this Loaddata to be refined
      
      * pass unittest
      
      * use same unroller with XPU
      
      * BroadcastWithInt64Index
      
      * BroadcastDataLoader template partial specialization
      
      * fix compile errs in ROCms
      
      * clean ElementwiseT and InT for BroadcastKernel
      
      * default axis and clean inT
      
      * remove redundant fast divmod computation
      
      * optimize drop_nd & drop_nd_grad
      
      * optimize BroadcastDataLoader bf16 fp16
      
      * rm InT etc. after merge develop
      
      * delete constexpr for windows ci
      
      * fix conflict
      
      * fix conflict with develop
      
      * fix conflict
      
      * new clean
      
      * clean
  6. 27 April, 2023 (2 commits)
    • [phi] Move sequence_pool to phi - Step 3 :sequence_pool_grad_op (#52680) · fe053396
      Committed by gouzil
      * [phi] move sequence_pool kernel to phi
      
      * mv kernels impl
      
      * fix parameter error
      
      * clean include
      
      * fix compat filename
      
      * [phi] move fluid sequence_pool_grad to phi
      
      * [phi][compat] sig rm GradVarName
      
      * [phi] fix sequence_pool out type
      
      * [phi] rm impl, add const string
      
      * [phi] fix const str
      
      * fix sequence_pooling cmake
      
      * [phi] mv sequence_pooling_test
      
      * [phi] fix grad sig
      
      * [phi] fix sequence_pool is_test error
      
      * [phi] fix sequence_pooling gpu include
      
      * [phi] mv to impl
      
      * [phi] fix SequencePoolFunctor cu include
      
      * [phi] modify out max_index int32_t
      
      * [phi] add pooltype mapping determine
      
      * [phi] fix sequence_pool_sig
      
      * [phi] fix sequence_pool_sig sum
      
      * [phi] try ci
      
      * [phi] fix max_index optional
    • 【Hackathon4】No5 nextafter (#52544) · 82ac3913
      Committed by cyberslack_lee
  7. 24 April, 2023 (2 commits)
  8. 23 April, 2023 (2 commits)
  9. 20 April, 2023 (2 commits)
  10. 19 April, 2023 (1 commit)
  11. 18 April, 2023 (2 commits)
  12. 17 April, 2023 (2 commits)
    • rename_SliceKernel (#52863) · d2b0d63f
      Committed by zhangyuqin1998
    • Add output defs for some kernelsPhi register (#52941) · 23f87442
      Committed by Sonder
      * add register info for eigh and eig_gard
      
      * add sync_batch_norm_op.cu register info
      
      * add lamb output register info
      
      * add unique register info
      
      * change type name
      
      * change type name
      
      * add output register info for check_finite_and_unscale
      
      * update cmake and config file
      
      * add register info for adagrad
      
      * fix build error
      
      * add sync to run_unittests.sh
      
      * add register info for unique_consecutive
      
      * fix build error
      
      * add eigh to STATIC_BUILD_TESTS
      
      * update eig_kernel.cc
      
      * update eig_kernel.cc
      
      * fix infer mate error
      
      * fix unique register error
      
      * fix lamb register info error
      
      * fix lamb register info
      
      * update lamb register info
      
      * fix lamb
      
      * remove one Output Register
      
      * update static build file
      
      * add eigh op to disable_wingpu_test
      
      * update run_unittests
  13. 14 April, 2023 (2 commits)
  14. 13 April, 2023 (2 commits)
    • [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h (#52651) · 5664ea26
      Committed by HongyuJia
      * [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h
      
      * Add logging.h for profiler.cc
      
      * Add logging.h for gloo_utils.h
      
      * Add logging.h for addmm_kernel_impl.h
      
      * Add logging.h for addmm_grad_kernel_impl.h
      
      * Add logging.h for p_send_kernel.cu
      
      * Add logging.h for determinant_grad_kernel_impl.h
      
      * Add logging.h for p_recv_kernel.cu
      
      * Add logging.h for elementwise_grad_base.h
      
      * Add logging.h for transfer_layout_kernel.cc
      
      * Add logging.h for eigvals_kernel.cc and index_select_impl.h
      
      * Add logging.h for all files in kernel directory
      
      * Add logging.h for xpu_info.cc
      
      * Add logging.h for xpu
    • rename_bilinear_tensor_op (#52745) · eb93b5c9
      Committed by zhangyuqin1998
  15. 11 April, 2023 (3 commits)
  16. 10 April, 2023 (2 commits)
  17. 07 April, 2023 (1 commit)
  18. 04 April, 2023 (2 commits)
    • Improve new executor static build (#51149) · 5bac67d4
      Committed by Ruibiao Chen
      * Improve new executor static build
      
      * Skip GC for static build
      
      * Skip infershape for static build
      
      * Handle read_op
      
      * Add fused_attention to OpsWithFluidKernelNeedMoveToPhi
      
      * Fix argsort typos
      
      * Add sequence_pool to OpsWithFluidKernelNeedMoveToPhi
      
      * Fix skip share lod errors
      
      * Fix errors for adam
      
      * Fix errors for eigvals, memcpy and fake_quantize
      
      * Add static_build.cc
      
      * Add black list
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Fix TensorArray
      
      * Fix TensorArray
      
      * Add update_loss_scaling to OpsNeedSetOutputDtypeWhenRegisterPhiKernel
      
      * Fix copy
      
      * Fix errors
      
      * Fix momentum
      
      * Skip mkldnn
      
      * Fix CI errors
      
      * Fix c_sync_calc_stream_op
      
      * Fix CINN
      
      * Fix while op
      
      * All CI pass, disable FLAGS to merge code, enable it after more tests in future
      
      * Add UTs
      
      * Fix typos
      
      * Fix typos
      
      * Add mkldnn UT
      
      * Remove mkldnn test
      
      * Fix typos
      
      * Fix dist test
      
      * Fix typos
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Add UTs
      
      * Fix typos
      
      * Fix typos
      
      * Add sparse tests
      
      * ToComplexType -> ToComplex
      
      * Add test_matmul_op_static_build to disable_win_inference_test
    • rename_bilinear_tensor_product (#52375) · 34069c46
      Committed by zhangyuqin1998
      * rename_bilinear_tensor_product
      
      * fix
  19. 03 April, 2023 (1 commit)
  20. 31 March, 2023 (1 commit)
  21. 30 March, 2023 (1 commit)
  22. 29 March, 2023 (2 commits)
  23. 28 March, 2023 (1 commit)