1. 24 May, 2023 1 commit
  2. 19 May, 2023 1 commit
  3. 18 May, 2023 1 commit
  4. 16 May, 2023 3 commits
  5. 15 May, 2023 1 commit
  6. 13 May, 2023 1 commit
  7. 12 May, 2023 3 commits
  8. 11 May, 2023 5 commits
  9. 10 May, 2023 9 commits
    • [cherry pick] add index_put api (#53652) · 4d16cd63
      傅剑寒 committed
      This PR adds the index_put API for Paddle
      4d16cd63
    • [cherry-pick] Fix the index calculation in cross_entroy_kernel. (#53659) (#53666) · 1ab562ca
      Yiqun Liu committed
      cherry-pick #53659
      1ab562ca
    • [Cherry-Pick] Fix bug in log_softmax kernel when lastdim is larger than 100000 (#53657) · a7cad386
      Zhang Zheng committed
      Fix bug in log_softmax kernel when lastdim is larger than 100000
      
      There is an unexpected log in the calculation
      
      Cherry-Pick: #53654
      a7cad386
    • b0c55c28
    • revert argsort to fix OOM bug (#53647) · 6707142a
      Qi Shao committed
      Revert argsort to the version without full sort algorithm implemented
      6707142a
    • [cherry-pick 2.5] Broadcast && Dropout_nd Performance Optimization into Release/2.5 (#53623) · f9ea2301
      Bo Zhang committed
      * Support different dtypes of inputs for broadcast for dropout optimization  (#52093)
      
      * change judgement for DropoutGradGPUKernelDriver
      
      * add UnrollerWithoutVecSize and after this Loaddata to be refined
      
      * pass unittest
      
      * use same unroller with XPU
      
      * BroadcastWithInt64Index
      
      * BroadcastDataLoader template partial specialization
      
      * fix compile errs in ROCms
      
      * PR comment
      
      * dropout_nd_optimization (#51479)
      
      * with printf
      
      * add DropOutNdForwardKernel
      
      * PR comment
      
      * Dropout optimize & clean broadcast inT and ElementwiseType (#52969)
      
      * change judgement for DropoutGradGPUKernelDriver
      
      * add UnrollerWithoutVecSize and after this Loaddata to be refined
      
      * pass unittest
      
      * use same unroller with XPU
      
      * BroadcastWithInt64Index
      
      * BroadcastDataLoader template partial specialization
      
      * fix compile errs in ROCms
      
      * clean ElementwiseT and InT for BroadcastKernel
      
      * default axis and clean inT
      
      * remove redundant fast divmod computation
      
      * optimize drop_nd & drop_nd_grad
      
      * optimize BroadcastDataLoader bf16 fp16
      
      * rm InT etc. after merge develop
      
      * delete constexpr for windows ci
      
      * fix conflict
      
      * fix conflict with develop
      
      * fix conflict
      
      * new clean
      
      * clean
      
      * Fix xpu2 kp compile error (#53548)
      
      * fix conflict
      
      * conflict
      f9ea2301
    • [Cherry-pick 2.5][Zero-Dim] paddle.static.data, squeeze, unbind, unstack,... · fecea4c5
      zqw_1997 committed
      [Cherry-pick 2.5][Zero-Dim]  paddle.static.data, squeeze, unbind, unstack, gather_nd and einsum support 0D (#53602)
      
      * add test cases, test=allcase
      
      * fix test cases, test=allcase
      
      * fix test cases, test=allcase
      
      * assert_allclose, test=allcase
      
      * 1e-5 to 1e-4, test=allcase
      
      * change rtol from 1e-4 to 1e-3, test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * fix test cases, test=allcase
      
      * fix test cases, test=allcase
      
      * modify the test_squeeze to not use Tensor type axis, test=allcase
      
      * add grad check for unbind and unstack, test=allcase
      
      * check for squeeze axis tensor type, test=allcase
      
      * fix bug, test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * test=allcase
      
      * test=allcase
      fecea4c5
    • [Zero-Dim] add 0D Tensor UT case for XPU (#53611) · 3a247cba
      zhouweiwei2014 committed
      3a247cba
    • add and open 0D test pnorm and cond (#53616) · 7edcd05c
      GGBond8488 committed
      7edcd05c
  10. 09 May, 2023 11 commits
  11. 08 May, 2023 4 commits