1. 28 4月, 2023 1 次提交
    • B
      Dropout optimize & clean broadcast inT and ElementwiseType (#52969) · d611e48c
      Bo Zhang 提交于
      * change judgement for DropoutGradGPUKernelDriver
      
      * add UnrollerWithoutVecSize and after this Loaddata to be refined
      
      * pass unittest
      
      * use same unroller with XPU
      
      * BroadcastWithInt64Index
      
      * BroadcastDataLoader template partial specialization
      
      * fix compile errs in ROCms
      
      * clean ElementwiseT and InT for BroadcastKernel
      
      * default axis and clean inT
      
      * remove redundant fast divmod computation
      
      * optimize drop_nd & drop_nd_grad
      
      * optimize BroadcastDataLoader bf16 fp16
      
      * rm InT etc. after merge develop
      
      * delete constexpr for windows ci
      
      * fix conflict
      
      * fix conflic with develop
      
      * fix conflic
      
      * new clean
      
      * clean
      d611e48c
  2. 13 3月, 2023 1 次提交
  3. 02 3月, 2023 1 次提交
    • R
      New executor static build for fluid kernel (#50670) · bf50784c
      Ruibiao Chen 提交于
      * Check structed kernel for new executor static build
      
      * Update code
      
      * Ready for resnet50
      
      * Move transfer_dtype to phi
      
      * Ready for transformer
      
      * Fix CI errors
      
      * Fix layer_norm InferMeta
      
      * Remove layer_norm infermeta fix
      bf50784c
  4. 10 11月, 2022 1 次提交
  5. 23 9月, 2022 1 次提交
  6. 04 8月, 2022 1 次提交
  7. 05 6月, 2022 1 次提交
  8. 12 4月, 2022 1 次提交
    • L
      [KP] Add Logical/compare/bitwise registry & UT (#40802) · 3749198e
      Lijunhui 提交于
      * init commit no push
      
      * collect comile errors
      
      * bitwise UT
      
      * fix compile problem
      
      * cancel comments
      
      * restore miss deletion
      
      * fix compilation
      
      * fix UT
      
      * NO stash in multiple branch at the same times
      
      * fix error
      
      * combine .cu from gpu and kps
      
      * replace gpu by kps
      
      * fix by Chen-weihang
      
      * Revert "Fix kps compile error in Junhui logic compare bitwise"
      
      * fix backend test
      
      * rm comments
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      3749198e
  9. 08 3月, 2022 1 次提交
  10. 04 3月, 2022 1 次提交
    • C
      [phi]move reduce gpu impl funcs into pten/kernels/funcs (#39990) · e2e2d531
      chentianyu03 提交于
      * move reduce gpu impl funcs into pten/kernels/funcs
      
      * change reduce header name and namespace
      
      * fix spell word error
      
      * change mutable_data to dev_ctx.Alloc
      
      * modify place to devcontex
      
      * format code style
      
      * fix build error
      
      * fix build error
      
      * fix conflict
      e2e2d531
  11. 03 3月, 2022 1 次提交