    Utilize StreamSafeCUDAAllocator to support fast GC in new executor (#37642) · 0c7153a4
    From00 committed
    * fix reshape move storage error
    
    * remove needless set type
    
    * alloc tensor by shared storage
    
    * Utilize StreamSafeCUDAAllocator to support fast GC in new executor
    
    * Fix compile error for Windows and ROCm
    
    * Fix compile error for Windows
    
    * Modify UT stream_safe_cuda_alloc_test
    
    * Modify UT stream_safe_cuda_alloc_test
    
    * Rewrite fast GC
    
    * Rewrite fast GC
    
    * Fix compile error for BOOST_GET_CONST
    
    * Fix compile error for BOOST_GET_CONST
    
    * Change default stream for StreamSafeCUDAAllocator
    
    * Fix a small CI error
    
    * Remove some redundant code
    
    * Fix conflict
    
    * Fix compile error for ROCm
    
    * Fix Windows CI error
    
    * Fix CI error
    
    * Remove some unnecessary code
    
    * Fix CI error
    
    * Add UT for fast GC
    
    * Fix CI error
    
    * add device-agnostic stream class
    
    * add stream.h
    
    * fix ut
    
    * fix cpu compile
    
    * Use RWLock in GetAllocator
    
    * Fix CI error
    Co-authored-by: Chen Weihang <chenweihang@baidu.com>
    Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
malloc.h 2.1 KB