1. 28 4月, 2023 1 次提交
    • B
      Dropout optimize & clean broadcast inT and ElementwiseType (#52969) · d611e48c
      Bo Zhang 提交于
      * change judgement for DropoutGradGPUKernelDriver
      
      * add UnrollerWithoutVecSize and after this Loaddata to be refined
      
      * pass unittest
      
      * use same unroller with XPU
      
      * BroadcastWithInt64Index
      
      * BroadcastDataLoader template partial specialization
      
      * fix compile errs in ROCms
      
      * clean ElementwiseT and InT for BroadcastKernel
      
      * default axis and clean inT
      
      * remove redundant fast divmod computation
      
      * optimize drop_nd & drop_nd_grad
      
      * optimize BroadcastDataLoader bf16 fp16
      
      * rm InT etc. after merge develop
      
      * delete constexpr for windows ci
      
      * fix conflict
      
      * fix conflic with develop
      
      * fix conflic
      
      * new clean
      
      * clean
      d611e48c
  2. 27 4月, 2023 2 次提交
    • G
      [phi] Move sequence_pool to phi - Step 3 :sequence_pool_grad_op (#52680) · fe053396
      gouzil 提交于
      * [phi] move sequence_pool kernel to phi
      
      * mv kernels impl
      
      * fix parameter error
      
      * clean include
      
      * fix compat filename
      
      * [phi] move fluid sequence_pool_grad to phi
      
      * [phi][compat] sig rm GradVarName
      
      * [phi] fix sequence_pool out type
      
      * [phi] rm impl, add const string
      
      * [phi] fix const str
      
      * fix sequence_pooling cmake
      
      * [phi] mv sequence_pooling_test
      
      * [phi] fix grad sig
      
      * [phi] fix sequence_pool is_test error
      
      * [phi] fix sequence_pooling gpu include
      
      * [phi] mv to impl
      
      * [phi] fix SequencePoolFunctor cu include
      
      * [phi] modify out max_index int32_t
      
      * [phi] add pooltype mapping determine
      
      * [phi] fix sequence_pool_sig
      
      * [phi] fix sequence_pool_sig sum
      
      * [phi] try ci
      
      * [phi] fix max_index optional
      fe053396
    • C
      【Hackathon4】No5 nextafter (#52544) · 82ac3913
      cyberslack_lee 提交于
      82ac3913
  3. 24 4月, 2023 2 次提交
  4. 23 4月, 2023 2 次提交
  5. 20 4月, 2023 2 次提交
  6. 19 4月, 2023 1 次提交
  7. 18 4月, 2023 2 次提交
  8. 17 4月, 2023 2 次提交
    • Z
      rename_SliceKernel (#52863) · d2b0d63f
      zhangyuqin1998 提交于
      d2b0d63f
    • S
      Add output defs for some kernelsPhi register (#52941) · 23f87442
      Sonder 提交于
      * add register info for eigh and eig_gard
      
      * add sync_batch_norm_op.cu register info
      
      * add lamb output register info
      
      * add unique register info
      
      * change type name
      
      * change type name
      
      * add output register info for check_finite_and_unscale
      
      * update cmake and config file
      
      * add register info for adagrad
      
      * fix build error
      
      * add sync to run_unittests.sh
      
      * add register info for unique_consecutive
      
      * fix build error
      
      * add eigh to STATIC_BUILD_TESTS
      
      * update eig_kernel.cc
      
      * update eig_kernel.cc
      
      * fix infer mate error
      
      * fix unique register error
      
      * fix lamb register info error
      
      * fix lamb register info
      
      * update lamb register info
      
      * fix lamb
      
      * remove one Output Register
      
      * update static build file
      
      * add eigh op to disable_wingpu_test
      
      * update run_unittests
      23f87442
  9. 14 4月, 2023 2 次提交
  10. 13 4月, 2023 2 次提交
    • H
      [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h (#52651) · 5664ea26
      HongyuJia 提交于
      * [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h
      
      * Add logging.h for profiler.cc
      
      * Add logging.h for gloo_utils.h
      
      * Add logging.h for addmm_kernel_impl.h
      
      * Add logging.h for addmm_grad_kernel_impl.h
      
      * Add logging.h for p_send_kernel.cu
      
      * Add logging.h for determinant_grad_kernel_impl.h
      
      * Add logging.h for p_recv_kernel.cu
      
      * Add logging.h for elementwise_grad_base.h
      
      * Add logging.h for transfer_layout_kernel.cc
      
      * Add logging.h for eigvals_kernel.cc and index_select_impl.h
      
      * Add logging.h for all files in kernel directory
      
      * Add logging.h for xpu_info.cc
      
      * Add logging.h for xpu
      5664ea26
    • Z
      rename_bilinear_tensor_op (#52745) · eb93b5c9
      zhangyuqin1998 提交于
      eb93b5c9
  11. 11 4月, 2023 3 次提交
  12. 10 4月, 2023 2 次提交
  13. 07 4月, 2023 1 次提交
  14. 04 4月, 2023 2 次提交
    • R
      Improve new executor static build (#51149) · 5bac67d4
      Ruibiao Chen 提交于
      * Improve new executor static build
      
      * Skip GC for static build
      
      * Skip infershape for static build
      
      * Handle read_op
      
      * Add fused_attention to OpsWithFluidKernelNeedMoveToPhi
      
      * Fix argsort typos
      
      * Add sequence_pool to OpsWithFluidKernelNeedMoveToPhi
      
      * Fix skip share lod errors
      
      * Fix errors for adam
      
      * Fix errors for eigvals, memcpy and fake_quantize
      
      * Add static_build.cc
      
      * Add black list
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Fix TensorArray
      
      * Fix TensorArray
      
      * Add update_loss_scaling to OpsNeedSetOutputDtypeWhenRegisterPhiKernel
      
      * Fix copy
      
      * Fix errors
      
      * Fix momentum
      
      * Skip mkldnn
      
      * Fix CI errors
      
      * Fix c_sync_calc_stream_op
      
      * Fix CINN
      
      * Fix while op
      
      * All CI pass, disable FLAGS to merge code, enable it after more tests in future
      
      * Add UTs
      
      * Fix typos
      
      * Fix typos
      
      * Add mkldnn UT
      
      * Remove mkldnn test
      
      * Fix typos
      
      * Fix dist test
      
      * Fix typos
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Add UTs
      
      * Fix typos
      
      * Fix typos
      
      * Add sparse tests
      
      * ToComplexType -> ToComplex
      
      * Add test_matmul_op_static_build to disable_win_inference_test
      5bac67d4
    • Z
      rename_bilinear_tensor_product (#52375) · 34069c46
      zhangyuqin1998 提交于
      * rename_bilinear_tensor_product
      
      * fix
      34069c46
  15. 03 4月, 2023 1 次提交
  16. 31 3月, 2023 1 次提交
  17. 30 3月, 2023 1 次提交
  18. 29 3月, 2023 2 次提交
  19. 28 3月, 2023 1 次提交
  20. 27 3月, 2023 3 次提交
  21. 24 3月, 2023 1 次提交
    • TaoTao Li's avatar
      add phi operator allreduce/reduce (#51857) · 47f87ad3
      TaoTao Li 提交于
      * add all_reduce, reduce kernel and api
      
      * fix all_reduce reduce ut
      
      fix reduce op maker conflict
      
      fix merge conflicts
      
      * fix conflicts, rename ReduceOp->ReduceBaseOp in reduce_ops
      
      rename allreduce op, to remove
      
      * fix code format
      
      fix comments
      
      * modify test_collective_reduce_api ut timeout
      
      * fix PR-CI-Build
      
      fix comments: format phi operator
      47f87ad3
  22. 23 3月, 2023 3 次提交
  23. 22 3月, 2023 1 次提交