1. 29 6月, 2023 1 次提交
    • N
      Add fused_rope forward op (#54351) · a215c46a
      niuliling123 提交于
      * style
      
      * more
      
      * update ctest
      
      * Update legacy_backward.yaml
      
      * Update legacy_ops.yaml
      
      * Update legacy_ops.yaml
      
      * update
      
      * update
      
      * update for move
      a215c46a
  2. 28 6月, 2023 3 次提交
  3. 27 6月, 2023 2 次提交
  4. 26 6月, 2023 1 次提交
    • S
      remove ops from OpsWithFluidKernelNeedMoveToPhi set (#54007) · 733eca85
      Sonder 提交于
      * remove ops from OpsWithFluidKernelNeedMoveToPhi set
      
      * open static build flag
      
      * OpsWithFluidKernelNeedMoveToPhi
      
      * open new_executor_static_build
      
      * add infermate for cudnn_lstm
      
      * fix
      
      * update
      
      * fix
      
      * update
      
      * update
      
      * update
      
      * fix pow2 decay
      
      * fix pow2 decay
      
      * recover analysis_predictor.cc
      
      * fix pow2 decay
      
      * fix cudnn lstm
      
      * add output register info for svd
      
      * fix pow2_decay_with_linear_warmup_kernel
      
      * recover test lstm cudnn
      
      * recover svg register codes
      
      * fix register info
      
      * fix reduce sum register info
      
      * add output info for adadelta
      
      * add output info for adadelta
      
      * add output info for adamax
      
      * fix complex abs register info
      
      * add register info for cudnn_lstm_grad
      
      * recover
      
      * fix lstm cudnn
      
      * fix
      
      * fix xpu output registe info
      
      * remove std::cout
      
      * add backend
      
      * remove output info in pow2_decay_with_linear_warmup_kernel
      
      * add judgment in TensorShouldBeFakeInitialized
      
      * recover power_
      
      * close new_executor_static_build
      
      * fix set_value_xpu
      733eca85
  5. 20 6月, 2023 2 次提交
  6. 19 6月, 2023 1 次提交
  7. 16 6月, 2023 2 次提交
  8. 15 6月, 2023 1 次提交
    • H
      exp/expm1 support int32/int64/float16 forward (#54556) · 58ae8c7c
      Hui Zhang 提交于
      * fix for log xxx
      
      * add int32/int64 for cpu/gpu; add float16/bfloat16 for cpu forward
      
      * fix docstring
      
      * fix bug
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bug
      
      * using cast
      
      * fix test
      
      * fix api
      
      * fix other bugs
      
      * fix ci bug for not using dygraph guard
      
      * add bfloat16 test
      
      * fix ut
      
      * bf16
      
      * exp/expm1 support int32/int64
      
      * fix ut
      
      * fix ut
      
      * fix ut
      58ae8c7c
  9. 14 6月, 2023 3 次提交
  10. 13 6月, 2023 1 次提交
  11. 12 6月, 2023 1 次提交
    • H
      log/Log10/log2/log1p support int32/int64/float16/bfloat16 forward (#54089) · 2ddd0473
      Hui Zhang 提交于
      * fix for log xxx
      
      * add int32/int64 for cpu/gpu; add float16/bfloat16 for cpu forward
      
      * fix docstring
      
      * fix bug
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bug
      
      * using cast
      
      * fix test
      
      * fix api
      
      * fix other bugs
      
      * fix ci bug for not using dygraph guard
      
      * add bfloat16 test
      
      * fix ut
      
      * bf16
      2ddd0473
  12. 08 6月, 2023 1 次提交
    • Y
      [AMP] Add check_numerics API. (#54301) · a5444592
      Yiqun Liu 提交于
      * Add outputs to check_numerics_kernel.
      
      * Add check_numerics to yaml.
      
      * Add API and unittest.
      
      * Add check_nan_inf_level as argument of check_numerics_kernel.
      
      * Add more unittests.
      
      * Fix static API implementation and unittest.
      
      * Move the implementation of check_numerics to paddle.amp.
      
      * Fix import error.
      a5444592
  13. 05 6月, 2023 3 次提交
  14. 02 6月, 2023 2 次提交
  15. 30 5月, 2023 1 次提交
    • Y
      [AMP] Reimplement check_nan_inf as check_numerics_kernel. (#52245) · 44bd5927
      Yiqun Liu 提交于
      * Reimplement the check_nan_inf function as check_numerics kernel.
      
      * Remove the cpu implemention to phi.
      
      * Add ifdef for the including of omp.h.
      
      * Move the use of FLAGS_check_nan_inf_level out of header file.
      
      * Implement a common PrintAndThrowError function.
      
      * Fix the error using of __NVCC__, which should be instead with __CUDA_ARCH__.
      
      * Add dependency of phi.
      
      * Polish codes and unittest.
      44bd5927
  16. 26 5月, 2023 1 次提交
    • Y
      [PHI Decoupling]Create PHI shared lib (#53735) · da50a009
      YuanRisheng 提交于
      * create phi so
      
      * fix ci bugs
      
      * fix py3 bugs
      
      * add file
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * perfect so
      
      * fix py3 bugs
      
      * delete all static target in phi
      
      * fix windows bugs
      
      * fix py3 bugs
      
      * fix ci bugs
      
      * fix windows bugs
      
      * fix bugs: gflags can't be linked by dynamic and static lib
      
      * fix bugs that can not load 3rd party
      
      * fix ci bugs
      
      * fix compile bugs
      
      * fix py3 bugs
      
      * fix conflict
      
      * fix xpu bugs
      
      * fix mac compile bugs
      
      * fix psgpu bugs
      
      * fix inference failed
      
      * deal with conflict
      
      * fix LIBRARY_PATH bug
      
      * fix windows bugs
      
      * fix onednn error
      
      * fix windows compile bugs
      
      * fix windows compile bugs
      
      * fix test_cuda_graph_static_mode_error aborted
      
      * fix windows bugs
      
      * fix mac-python3 error
      
      * fix hip compile bugs
      
      * change mode to static
      
      * change to static mode
      
      * fix ci bugs
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * fix bugs
      
      * add static flag
      
      * add PADDLE_API
      
      * change position of PADDLE_API
      
      * fix windows bugs
      
      * change mode to dynamic lib
      
      * fix windows static bugs
      
      * deal with conflict
      
      * fix windows unit bug
      
      * fix coverage
      
      * deal with conflict
      
      * fix windows-inference
      
      * fix py3 bugs
      
      * fix bugs when compile type_info
      
      * fix compile bugs
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * fix windows openblas
      
      * fix xpu bugs
      
      * fix enforce_test in windows
      
      * update code according comment
      
      * fix windows cmake bug
      
      * fix windows bugs
      
      * fix windows bugs
      
      * delete cinn unittest
      
      * fix cinn bugs
      
      ---------
      Co-authored-by: HappyHeavyRain's avatarlzydev <1528794076@qq.com>
      da50a009
  17. 25 5月, 2023 1 次提交
  18. 24 5月, 2023 1 次提交
  19. 23 5月, 2023 2 次提交
  20. 22 5月, 2023 1 次提交
    • T
      Add multiclass_nms3 GPU kernel (#52401) · f71c805e
      Tian Zheng 提交于
      * Add GPU kernel for multiclass_nms3 op
      
      * Make multiclass_nms3 gpu kernel output consistent with cpu kernel
      
      * Fix API incompatibility
      
      * Fix unittests on builds without CUDA
      
      * Fix ROCM build
      
      * Remove fluid headers; Use default atol for unittest
      
      * Change function and variable naming
      
      * Add comments; Reduce redundant code
      
      * Use paddle test framework
      f71c805e
  21. 19 5月, 2023 3 次提交
  22. 18 5月, 2023 3 次提交
  23. 17 5月, 2023 1 次提交
  24. 16 5月, 2023 2 次提交