1. 28 8月, 2023 1 次提交
    • G
      【inplace api】Batch add inplace api gt_, ge_, lt_, le_, eq_, not_equal_,... · c5fc413a
      GGBond8488 提交于
      【inplace api】Batch add inplace api gt_, ge_, lt_, le_, eq_, not_equal_, logical_and_, logical_or_, logical_xor_, logical_not_, divide_, floor_divide_, bitwise_and_ , bitwise_or_, bitwise_xor_, bitwise_not_ (#55509)
      
      * tmp commit
      
      * add atan2
      
      * add inplace api
      
      * fix error
      
      * add inpalce divide
      
      * add inplace api
      
      * add more inplace
      
      * add more inpalce
      
      * fix logical_not error
      
      * support sinh and cosh in cpu
      
      * support asin, acos, atan, asinh, acosh, atanh in cpu
      
      * fix typro
      
      * fix typro
      
      * mv out atan2 ldexp
      
      * mv out atan2 ldexp
      
      * support sinh and cosh in gpu
      
      * support asin, acos, atan, asinh, acosh, atanh in gpu
      
      * fix ge error
      
      * fix dygraph commpare error
      
      * fix dygraph commpare error
      
      * check complex in python
      
      * fix cast inpalce error
      
      * open inplace test
      
      * fix ops.yaml error
      
      * mv cast inpalce to python
      
      * fix coverage ci
      
      * add last inplace
      
      * fix inplace error
      
      * fix cast error
      
      * fix error
      
      * add nan_to_num_
      
      * fix typro
      
      * fix sparse cast error
      
      * remove gpu 4
      
      * fix static cast error
      
      * tmp commit
      
      * add atan2
      
      * add inplace api
      
      * fix error
      
      * add inpalce divide
      
      * add inplace api
      
      * add more inplace
      
      * add more inpalce
      
      * fix logical_not error
      
      * fix typro
      
      * fix typro
      
      * mv out atan2 ldexp
      
      * mv out atan2 ldexp
      
      * fix ge error
      
      * fix dygraph commpare error
      
      * fix dygraph commpare error
      
      * fix cast inpalce error
      
      * open inplace test
      
      * fix ops.yaml error
      
      * mv cast inpalce to python
      
      * fix coverage ci
      
      * add last inplace
      
      * fix inplace error
      
      * fix cast error
      
      * fix error
      
      * add nan_to_num_
      
      * fix typro
      
      * fix sparse cast error
      
      * remove gpu 4
      
      * fix static cast error
      
      * fix cast error
      
      * fix
      
      * Revert "check complex in python"
      
      This reverts commit c822064261d774dd58ad46a4f90ba8b467700a05.
      
      * add renorm , fix error
      
      * add coverage
      
      * fix cumsum inpalce version error
      
      * add cast inpalce impl
      
      * rm test.log
      
      * fix multiply_dyfunction and add multiply_backward test
      
      * add and use is_same_tensor
      
      * fix typro
      
      * fix sone error
      
      * fix typro
      
      ---------
      Co-authored-by: NScotty <jmhgchn@gmail.com>
      Co-authored-by: NScotty <527407973@qq.com>
      c5fc413a
  2. 17 7月, 2023 1 次提交
  3. 13 7月, 2023 1 次提交
  4. 26 6月, 2023 1 次提交
  5. 09 6月, 2023 1 次提交
  6. 02 6月, 2023 1 次提交
  7. 24 5月, 2023 2 次提交
  8. 19 5月, 2023 1 次提交
  9. 18 5月, 2023 1 次提交
  10. 16 5月, 2023 1 次提交
  11. 10 5月, 2023 1 次提交
  12. 08 5月, 2023 1 次提交
    • G
      add complex support for optest (#53356) · e522ceb7
      GGBond8488 提交于
      * add complex support for  optest
      
      * add complex grad test
      
      * append one
      
      * move some debug info
      
      * move some debug info
      
      * move some debug info
      
      * move some debug info
      
      * add more complex test
      
      * Fix naming ambiguity
      
      * Revert "add more complex test"
      
      This reverts commit dbcb0516b8e53ba42e2d6089878a39b395345969.
      
      * change backward gradient, add TODO
      e522ceb7
  13. 28 4月, 2023 2 次提交
    • B
      Dropout optimize & clean broadcast inT and ElementwiseType (#52969) · d611e48c
      Bo Zhang 提交于
      * change judgement for DropoutGradGPUKernelDriver
      
      * add UnrollerWithoutVecSize and after this Loaddata to be refined
      
      * pass unittest
      
      * use same unroller with XPU
      
      * BroadcastWithInt64Index
      
      * BroadcastDataLoader template partial specialization
      
      * fix compile errs in ROCms
      
      * clean ElementwiseT and InT for BroadcastKernel
      
      * default axis and clean inT
      
      * remove redundant fast divmod computation
      
      * optimize drop_nd & drop_nd_grad
      
      * optimize BroadcastDataLoader bf16 fp16
      
      * rm InT etc. after merge develop
      
      * delete constexpr for windows ci
      
      * fix conflict
      
      * fix conflic with develop
      
      * fix conflic
      
      * new clean
      
      * clean
      d611e48c
    • S
      【Hackathon No.55】add fmin BF16 test (#53100) · 8163faaa
      superwinner1 提交于
      * 'fmin'
      
      * 'fix'
      
      * 'fix'
      8163faaa
  14. 27 4月, 2023 1 次提交
  15. 25 4月, 2023 1 次提交
  16. 20 4月, 2023 1 次提交
    • Z
      move_elementwise_raw (#53010) · 7a72f7a2
      zhangyuqin1998 提交于
      * setup
      
      * Update elementwise_kernel.cc
      
      * Update elementwise_kernel.cc
      
      * fix
      
      * fix
      
      * Update elementwise_kernel.cu
      
      * fix
      
      * Update elementwise_kernel.cc
      
      * Update elementwise_kernel.cc
      
      * Update elementwise_kernel.cc
      
      * Update elementwise_kernel.cc
      
      * Update elementwise_kernel.cc
      
      * Update elementwise_kernel.cc
      7a72f7a2
  17. 13 4月, 2023 1 次提交
  18. 06 4月, 2023 1 次提交
    • W
      [AMP OP&Test]Add fp16/bf16 support logical op (#52112) · b10e4577
      WJJ1995 提交于
      * fixed glog
      
      * add
      
      * add bfloat16 test for logical op
      
      * rm useless code
      
      * add uint16
      
      * deal with comments
      
      * fixed code style
      
      * fixed code style
      
      * fixed for ci
      
      * deal with comments
      
      * fixed for ci
      b10e4577
  19. 23 3月, 2023 2 次提交
  20. 13 3月, 2023 1 次提交
  21. 12 3月, 2023 1 次提交
  22. 09 3月, 2023 1 次提交
  23. 08 3月, 2023 2 次提交
  24. 02 3月, 2023 1 次提交
    • R
      New executor static build for fluid kernel (#50670) · bf50784c
      Ruibiao Chen 提交于
      * Check structed kernel for new executor static build
      
      * Update code
      
      * Ready for resnet50
      
      * Move transfer_dtype to phi
      
      * Ready for transformer
      
      * Fix CI errors
      
      * Fix layer_norm InferMeta
      
      * Remove layer_norm infermeta fix
      bf50784c
  25. 25 2月, 2023 1 次提交
  26. 13 2月, 2023 1 次提交
  27. 12 1月, 2023 1 次提交
  28. 21 11月, 2022 1 次提交
  29. 10 11月, 2022 1 次提交
  30. 01 11月, 2022 1 次提交
  31. 23 9月, 2022 1 次提交
  32. 20 9月, 2022 1 次提交
  33. 17 9月, 2022 1 次提交
    • G
      Fix bug of reduce_sum op. (#46045) · 28b4240b
      Ghost Screaming 提交于
      * Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
      is wrong.
      
      * Fix some problems.
      1. Change fluid head files to phi files.
      2. Delete useless code.
      3. Fix code style problems.
      
      * Fix some code style problems.
      
      * Fix some code style problems.
      28b4240b
  34. 06 9月, 2022 1 次提交
  35. 31 8月, 2022 1 次提交
  36. 30 8月, 2022 1 次提交