1. 09 2月, 2022 1 次提交
  2. 08 2月, 2022 1 次提交
    • F
      Support allocate CUDA managed memory (#39075) · 42910361
      From00 提交于
      * Rough implementation for experiment
      
      * Support allocate cuda managed memory
      
      * Fix CI error
      
      * Modify UT
      
      * Check whether support memory oversubscription
      
      * Fix ROCM Compile error
      
      * Fix ROCM Compile error
      
      * Fix UT cuda_managed_memory_test
      
      * Set UT timeout to 40
      
      * Add UT OOMExceptionTest
      
      * Set UT timeout to 50
      42910361
  3. 07 2月, 2022 1 次提交
  4. 06 2月, 2022 1 次提交
  5. 30 1月, 2022 1 次提交
  6. 29 1月, 2022 2 次提交
    • L
      Add xpu2 compiler (#37254) · 92da5055
      Liu-xiandong 提交于
      * Add XPU compiler for paddle, test=develop
      
      * clean code
      
      * clean useless code
      
      * clean useless code
      
      * clean useless code
      
      * test
      
      * add include path
      
      * use clang compiler
      
      * xpu2.cmake
      
      * XPU2 compiler passed
      
      * update
      
      * update after pten
      
      * combination the WITH_XPU and WITH_XPU2
      
      * update the fuse operation in WITH_XPU and WITH_XPU2
      
      * update
      
      * update
      
      * update
      
      * fix the merge error
      
      * update
      
      * update the code
      
      * update the code
      
      * add run_kp_kernel flag
      
      * update
      
      * update
      
      * fix prepared type_ bug
      
      * clean and update the code
      
      * reset the kernel_primitives
      
      * update
      
      * clean the code
      
      * delete useless comment
      
      * fix the bug in WITH_XPU
      
      * update
      
      * update
      
      * modify the abi
      
      * delete some useless code
      
      * Parameter automation in xpu compilation
      
      * Parameter automation in xpu compilation
      
      * delete kps in cmake
      
      * delete useless comment
      
      * clean the code
      
      * clean the code
      92da5055
    • Q
      fix kunlun2 softmax unitest bug (#39274) · 23bb2836
      QingshuChen 提交于
      * fix kunlun2 softmax unitest bug
      *test=kunlun
      
      * minor
      23bb2836
  7. 28 1月, 2022 1 次提交
  8. 27 1月, 2022 2 次提交
  9. 26 1月, 2022 3 次提交
  10. 25 1月, 2022 3 次提交
  11. 24 1月, 2022 1 次提交
  12. 21 1月, 2022 3 次提交
  13. 20 1月, 2022 1 次提交
  14. 19 1月, 2022 1 次提交
  15. 18 1月, 2022 1 次提交
  16. 17 1月, 2022 3 次提交
  17. 14 1月, 2022 1 次提交
  18. 13 1月, 2022 1 次提交
  19. 12 1月, 2022 2 次提交
  20. 10 1月, 2022 2 次提交
    • T
    • Z
      [Unify Tensors PR #5] framework::Tensor inherits from DenseTensor,test=allcases (#38632) · 5c73a6ea
      Zhanlue Yang 提交于
      * Added shared_ptr<Allocation> member & corresponding interfaces to Storage
      
      * Removed original pten::Allocation from Storage and adjusted the interfaces accordingly
      
      * Fixed issues with storage offset
      
      * Used place to malloc allocation for TensorStorage
      
      * [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor
      
      * Fixed issues with place
      
      * Added comments
      
      * Moved mutable_data with stream argument to DenseTensor
      
      * Added set_offset interface
      
      * Fixed CI issues,test=allcases
      
      * [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor
      
      * Removed friend class EigenTensor/EigenMatrix/EigenVector from Tensor
      
      * Modified framework::Tensor to inherit from DenseTensor
      
      * Reverted changes too pten_layout() interface
      
      * Removed friend classes
      
      * Rearranged cfunction calls from tensor.data<void>() to tensor.data()
      
      * Fixed CI issues
      
      * Fixed lite issues
      
      * Fixed data() interface issues,test=allcases
      
      * Resolved IsInitialized() issues
      
      * Fixed ResetHolder() issues
      
      * Fixed MKLDNN & Storage issues
      
      * Resolved ShareBufferWith() issues
      
      * Fixed LoD issues
      5c73a6ea
  21. 05 1月, 2022 1 次提交
    • T
      add huber_loss for kunlun (#38589) · a268c7ce
      TTerror 提交于
      * add huber_loss for kunlun
      
      * update xpu.cmake
      
      * update unitests
      
      * update unitests
      
      * update elementwise_add
      
      * update elementwise_add
      
      * update elementwise_add
      a268c7ce
  22. 04 1月, 2022 2 次提交
  23. 31 12月, 2021 1 次提交
  24. 30 12月, 2021 4 次提交