1. 04 1月, 2022 2 次提交
    • Z
      [Unify Tensors PR #3]Port framework::Tensor members & interfaces to... · dfdc9960
      Zhanlue Yang 提交于
      [Unify Tensors PR #3]Port framework::Tensor members & interfaces to pten::DenseTensor, test=allcases (#38473)
      
      * Added shared_ptr<Allocation> member & corresponding interfaces to Storage
      
      * Removed original pten::Allocation from Storage and adjusted the interfaces accordingly
      
      * Fixed issues with storage offset
      
      * Used place to malloc allocation for TensorStorage
      
      * [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor
      
      * Fixed issues with place
      
      * Added comments
      
      * Moved mutable_data with stream argument to DenseTensor
      
      * Added set_offset interface
      
      * Fixed CI issues,test=allcases
      
      * [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor
      
      * Reverted changes too pten_layout() interface
      
      * Removed friend classes
      dfdc9960
    • Y
      heter context support dynamic mf dim (#38487) · 59888bba
      yaoxuefeng 提交于
      heter context support dynamic mf dim
      59888bba
  2. 31 12月, 2021 5 次提交
  3. 30 12月, 2021 6 次提交
  4. 29 12月, 2021 3 次提交
  5. 28 12月, 2021 3 次提交
    • F
      Utilize StreamSafeCUDAAllocator to support fast GC in new executor (#37642) · 0c7153a4
      From00 提交于
      * fix reshape move storage error
      
      * remove needless set type
      
      * alloc tensor by shared storage
      
      * Utilize StreamSafeCUDAAllocator to support fast GC in new executor
      
      * Fix compile error for Windows and ROCm
      
      * Fix compile error for Windows
      
      * Modify UT stream_safe_cuda_alloc_test
      
      * Modify UT stream_safe_cuda_alloc_test
      
      * Rewrite fast GC
      
      * Rewrite fast GC
      
      * Fix compile error for BOOST_GET_CONST
      
      * Fix compile error for BOOST_GET_CONST
      
      * Changes default stream for StreamSafeCUDAAllocator
      
      * Fix a small CI error
      
      * Remove some redundant code
      
      * Fix conflict
      
      * Fix compile error for ROCm
      
      * Fix Windoes CI error
      
      * Fix CI error
      
      * Remove some unnecessary code
      
      * Fix CI error
      
      * Add UT for fast GC
      
      * Fix CI error
      
      * add device-agnostic stream class
      
      * add stream.h
      
      * fix ut
      
      * fix cpu compile
      
      * Use RWLock in GetAllocator
      
      * Fix CI error
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>
      0c7153a4
    • L
      [new-exec] add completion_nofifier (#38447) · 404a4a6a
      Leo Chen 提交于
      * add completion_nofifier
      
      * fix bug
      
      * unregist event waiter
      404a4a6a
    • B
      add mul_lstm_fuse_pass ut (#37795) · 1db61c3e
      baoachun 提交于
      * add mul_lstm_fuse_pass ut
      
      * update mul_lstm_fuse_pass ut
      
      * update ut
      
      * update ut
      
      * update ut
      
      * add CPU ut cmake setting
      
      * update ut
      1db61c3e
  6. 27 12月, 2021 4 次提交
  7. 26 12月, 2021 1 次提交
  8. 24 12月, 2021 1 次提交
  9. 23 12月, 2021 6 次提交
  10. 22 12月, 2021 4 次提交
  11. 21 12月, 2021 3 次提交
  12. 20 12月, 2021 2 次提交
    • B
      add mkldnn conv_transpose_bias fuse pass ut (#37508) · ac696941
      baoachun 提交于
      * add mkldnn conv_transpose_bias fuse pass ut
      
      * update conv_transpose_bias_mkldnn_fuse_pass ut
      
      * update conv_transpose_bias_mkldnn_fuse_pass ut
      
      * update conv_transpose_bias_mkldnn_fuse_pass ut
      
      * restrict conv2d data_format in conv_transpose_bias_mkldnn_fuse_pass
      
      * update ut timeout setting
      
      * update ut
      ac696941
    • F
      [MLU]add mlu backend (#38207) · 76514a1f
      fwenguang 提交于
      76514a1f