1. 17 1月, 2022 1 次提交
    • W
      [Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5
      Wilber 提交于
      * add pten::Place data structure.
      
      * update ci problem
      
      * fix ci problem
      
      * update
      
      * using platform::Place=pten::Place
      
      * remove BOOST_GET_CONST for CPUPlace and GPUPlace
      
      * compile pass 25%.
      
      * compile pass 45%
      
      * compile pass 60%
      
      * remove boost_get for xpu npu mlu and ipu
      
      * compile pass on cpu and gpu.
      
      * fix compile problem
      
      * fix compile error.
      
      * update
      
      * fix ci problem
      
      * update
      
      * ci approve
      
      * fix ci problem
      
      * fix ci eager test problem
      
      * remove BOOST_GET_CONST
      
      * fix npu compile
      c48a9ad5
  2. 15 1月, 2022 1 次提交
  3. 14 1月, 2022 1 次提交
  4. 13 1月, 2022 2 次提交
  5. 12 1月, 2022 3 次提交
  6. 11 1月, 2022 1 次提交
  7. 10 1月, 2022 4 次提交
    • H
      Add gpu kernel for new api : linalg.lstsq (#38621) · 405103d8
      Haohongxiang 提交于
      * add lstsq gpu kernel
      
      * update
      
      * add docs_en
      
      * modify ut
      
      * fix bugs
      
      * modify example in docs_en
      
      * remove lstsq_op.cu from ROCM cmake
      
      * modify docs_en
      
      * modify docs_en
      
      * modify docs_en
      
      * remove unneccessary TensorCopy
      405103d8
    • L
      Profiler skeleton (#38826) · a8afed69
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * profiler skeleton
      
      * update
      
      * update
      
      * update
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      a8afed69
    • T
    • Z
      [Unify Tensors PR #5] framework::Tensor inherits from DenseTensor,test=allcases (#38632) · 5c73a6ea
      Zhanlue Yang 提交于
      * Added shared_ptr<Allocation> member & corresponding interfaces to Storage
      
      * Removed original pten::Allocation from Storage and adjusted the interfaces accordingly
      
      * Fixed issues with storage offset
      
      * Used place to malloc allocation for TensorStorage
      
      * [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor
      
      * Fixed issues with place
      
      * Added comments
      
      * Moved mutable_data with stream argument to DenseTensor
      
      * Added set_offset interface
      
      * Fixed CI issues,test=allcases
      
      * [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor
      
      * Removed friend class EigenTensor/EigenMatrix/EigenVector from Tensor
      
      * Modified framework::Tensor to inherit from DenseTensor
      
      * Reverted changes too pten_layout() interface
      
      * Removed friend classes
      
      * Rearranged cfunction calls from tensor.data<void>() to tensor.data()
      
      * Fixed CI issues
      
      * Fixed lite issues
      
      * Fixed data() interface issues,test=allcases
      
      * Resolved IsInitialized() issues
      
      * Fixed ResetHolder() issues
      
      * Fixed MKLDNN & Storage issues
      
      * Resolved ShareBufferWith() issues
      
      * Fixed LoD issues
      5c73a6ea
  8. 05 1月, 2022 1 次提交
    • T
      add huber_loss for kunlun (#38589) · a268c7ce
      TTerror 提交于
      * add huber_loss for kunlun
      
      * update xpu.cmake
      
      * update unitests
      
      * update unitests
      
      * update elementwise_add
      
      * update elementwise_add
      
      * update elementwise_add
      a268c7ce
  9. 04 1月, 2022 3 次提交
  10. 31 12月, 2021 3 次提交
  11. 30 12月, 2021 7 次提交
  12. 29 12月, 2021 4 次提交
  13. 28 12月, 2021 1 次提交
  14. 27 12月, 2021 2 次提交
  15. 24 12月, 2021 1 次提交
  16. 23 12月, 2021 3 次提交
  17. 20 12月, 2021 1 次提交
  18. 17 12月, 2021 1 次提交
    • F
      Get base pointer from Allocation (#37978) · 431a2d6a
      From00 提交于
      * Get GPU BasePtr from CUDA allocation
      
      * Fix compile error for ROCm
      
      * Add BasePtr function for IPUPlace in naive_best_fit_allocator.cc
      
      * Add alignment for BuddyAllocator
      
      * Set address alignment of BuddyAllocator to 32 bytes
      
      * Fix CI error
      
      * Remove code for naive_best_fit strategy
      431a2d6a