1. 08 6月, 2023 1 次提交
    • Y
      [AMP] Add check_numerics API. (#54301) · a5444592
      Yiqun Liu 提交于
      * Add outputs to check_numerics_kernel.
      
      * Add check_numerics to yaml.
      
      * Add API and unittest.
      
      * Add check_nan_inf_level as argument of check_numerics_kernel.
      
      * Add more unittests.
      
      * Fix static API implementation and unittest.
      
      * Move the implementation of check_numerics to paddle.amp.
      
      * Fix import error.
      a5444592
  2. 30 5月, 2023 1 次提交
    • Y
      [AMP] Reimplement check_nan_inf as check_numerics_kernel. (#52245) · 44bd5927
      Yiqun Liu 提交于
      * Reimplement the check_nan_inf function as check_numerics kernel.
      
      * Remove the cpu implemention to phi.
      
      * Add ifdef for the including of omp.h.
      
      * Move the use of FLAGS_check_nan_inf_level out of header file.
      
      * Implement a common PrintAndThrowError function.
      
      * Fix the error using of __NVCC__, which should be instead with __CUDA_ARCH__.
      
      * Add dependency of phi.
      
      * Polish codes and unittest.
      44bd5927
  3. 25 4月, 2023 1 次提交
    • Y
      [PHI]Add flags macro for PHI (#52991) · 22e96bde
      YuanRisheng 提交于
      * add flags for phi
      
      * fix compile bugs
      
      * fix ci bugs
      
      * fix inference bugs
      
      * fix cinn' bugs
      
      * fix cinn bugs
      
      * perfect code according comment
      
      * fix ci bugs
      
      * fix ci bugs
      22e96bde
  4. 24 4月, 2023 1 次提交
  5. 03 4月, 2023 1 次提交
  6. 01 4月, 2023 1 次提交
  7. 31 1月, 2023 1 次提交
  8. 12 12月, 2022 1 次提交
    • Y
      Enhance check_nan_inf implementation for CPU. (#48591) · 69e695b7
      Yiqun Liu 提交于
      * Enable to print device info.
      
      * Enhance the nan and inf checking for cpu.
      
      * Implement a common print function.
      
      * Unify the check of complex numbers.
      
      * Rewrite the omp method.
      
      * Count and print the number of nan and inf.
      
      * Change the print content.
      
      * Add unittest.
      69e695b7
  9. 19 10月, 2022 1 次提交
  10. 13 10月, 2022 1 次提交
  11. 11 10月, 2022 1 次提交
  12. 28 9月, 2022 1 次提交
    • C
      Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e
      Chen Weihang 提交于
      * remove needless using tensor
      
      * remove needless using tensor
      
      * resolve conflict
      
      * replace tensor using
      
      * fix format error
      
      * revert needless changing
      
      * fix rocm and npu compile error
      
      * fix cinn compile error
      
      * fix format error
      
      * fix mkldnn format error
      
      * fix mkldnn format error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * resolve conflict
      e12a905e
  13. 19 9月, 2022 1 次提交
  14. 01 8月, 2022 1 次提交
    • L
      unify gpu context (#44740) · 86763023
      Leo Chen 提交于
      * remove cudaDeviceContext
      
      * remove more template
      
      * fix rocm compile
      
      * remove alias name CUDADeviceContext
      
      * fix compile
      
      * fix tests
      
      * revert changes
      86763023
  15. 02 7月, 2022 1 次提交
    • L
      unify cpu context, part2 (#44012) · 755438a7
      Leo Chen 提交于
      * fix init()
      
      * delete test_device_context
      
      * replace CPUDeviceContext with CPUContext
      
      * fix test_scalar
      
      * remove dot_op.cc
      
      * fix compile
      755438a7
  16. 26 6月, 2022 1 次提交
  17. 05 6月, 2022 1 次提交
  18. 17 5月, 2022 1 次提交
  19. 30 3月, 2022 1 次提交
  20. 20 2月, 2022 1 次提交
  21. 15 2月, 2022 1 次提交
    • A
      [PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404
      Aurelius84 提交于
      * #1 migrate dist-related type()-> dtype()
      
      * move datatype function from pten -> fluid/framework
      
      * change type() in imperative into convert(dtype())
      
      * modify xx_tensor->type into xx_tensor->dtype
      
      * change the set_type interface and the caller
      
      * modify xx_tensor.type into xx_tensor.dtype
      
      * fix mutable_data(place, dtype())
      
      * change caller of mutable_data in pten and distributed
      
      * change the caller of mutable_data in fluid/framework
      
      * change the caller of mutable_data in imperative directory
      
      * mutable_data: inference
      
      * update the call of mutable_data
      
      * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType
      
      * pass the compile. the next step is remove VarType in Pten
      
      * fix all and remove VarType from pten. success in linux. Next task is other platform
      
      * fix conflict with develop
      
      * fix compiled error
      
      * Fix reset conversion
      
      * fix conflict
      
      * fix compiled problem
      
      * fix typo
      
      * Fix << in tensor_utils.cc
      
      * fix type->dtype
      
      * fix unittest
      
      * fix tensor init constructor
      
      * fix DataTypeSize for BFloat16
      
      * fix code style
      
      * fix npu compiled error
      
      * fix npu
      
      * compile npu sucessfully
      
      * fix conflict
      
      * fix conflict
      Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
      7e7e9404
  22. 25 1月, 2022 1 次提交
    • W
      [Move selected_rows PR #3] Change the relationship of [include/Cmake]. (#39128) · 2bafd338
      Weilong Wu 提交于
      * Added selected_rows and rw_lock to pten
      
      * Renamed the unit test target to fix CI
      
      * Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid
      
      * Remove rw_lock.h,rw_lock_test.cc in fluid
      
      * Use pten::RWLock and pten::AutoRDLock, fix CI
      
      * Use pten::SelectedRows
      
      * Use pten::SelectedRows
      
      * Fix to pass NPU CI
      
      * Use pten::SelectedRows, to pass NPU CI
      
      * To fix NPU CI
      
      * To fix NPU CI again
      2bafd338
  23. 17 1月, 2022 1 次提交
    • W
      [Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5
      Wilber 提交于
      * add pten::Place data structure.
      
      * update ci problem
      
      * fix ci problem
      
      * update
      
      * using platform::Place=pten::Place
      
      * remove BOOST_GET_CONST for CPUPlace and GPUPlace
      
      * compile pass 25%.
      
      * compile pass 45%
      
      * compile pass 60%
      
      * remove boost_get for xpu npu mlu and ipu
      
      * compile pass on cpu and gpu.
      
      * fix compile problem
      
      * fix compile error.
      
      * update
      
      * fix ci problem
      
      * update
      
      * ci approve
      
      * fix ci problem
      
      * fix ci eager test problem
      
      * remove BOOST_GET_CONST
      
      * fix npu compile
      c48a9ad5
  24. 13 12月, 2021 1 次提交
  25. 27 11月, 2021 1 次提交
    • A
      [NPU] reorganization for device API abstraction (#37110) · 72241a6a
      Aganlengzi 提交于
      * [NPU] reorganization for device API abstraction
      
      * [NPU] delete old files
      
      * [NPU] fix npu_collective_helper
      
      * [NPU] fix collective_helper
      
      * [NPU] fix ut
      
      * [NPU] mod memory allocation and hccl_helper
      
      * [NPU] fix place_type
      
      * [NPU] split enfoce.h
      
      * move acl* call into npu_info
      
      * merge conflict
      
      * fix merge
      
      * merge conflict
      
      * merge conflict
      72241a6a
  26. 29 10月, 2021 1 次提交
  27. 08 9月, 2021 1 次提交
  28. 10 6月, 2021 1 次提交
  29. 01 6月, 2021 1 次提交
  30. 20 5月, 2021 1 次提交
    • C
      Add complex template type (#32857) · 738bf20e
      chentianyu03 提交于
      * add complex template file
      
      * add numtraits for complex template
      
      * add complex template type register
      
      * modify specify template of complex
      
      * modify specify template of complex
      
      * modify specify template of complex
      
      * modify specify template of complex
      
      * make TensorCheckerVisitor support complex type
      
      * fix operator= error
      
      * add complex template
      
      * add complex template type
      
      * add complex template type to pyarray transform
      
      * add complex template type to pyarray transform
      
      * remove complex type for dlpack register
      
      * set dlpack supprot complex type
      
      * set dlpack supprot complex type
      
      * set dlpack supprot complex type
      
      * remove explict for complex constructor
      
      * add complex unit test file
      738bf20e
  31. 19 4月, 2021 1 次提交
  32. 22 2月, 2021 1 次提交
  33. 04 2月, 2021 1 次提交
  34. 25 12月, 2020 1 次提交
  35. 01 12月, 2020 2 次提交
  36. 03 9月, 2020 2 次提交
  37. 12 12月, 2019 1 次提交