- 17 Dec, 2021 1 commit
Committed by From00
- 25 Nov, 2021 1 commit
Committed by From00
* Support multi-stream allocation for CUDA place
* Do not notify the retrying from other streams when free CUDA allocation
* Fix compile error for CPU
* Fix compile error for HIP
* Release memory for StreamSafeCUDAAllocaRetry in malloc_test
* Add FLAGS_use_stream_safe_cuda_allocator
* Fix CI error for 'set_tests_properties'
* Invalidate stream safe CUDA allocator for naive_best_fit and thread_local strategy
* Performance improvement: insert allocation pair to outstanding_events_map when free but not alloc; replace recursive_mutex with SpinLock
* FLAGS priority changes: FLAGS_use_system_allocator > FLAGS_use_stream_safe_cuda_allocator
* Performance improvement: directly delete allocation when the recorded_streams is empty in FreeImpl of StreamSafeCUDAAllocator
* Add UT for alloc interface
* Changes multi-stream interface; move retry code from AllocatorFacadePrivate to StreamSafeCUDAAllocator
- 06 Nov, 2020 1 commit
Committed by Wilber
- 04 Nov, 2020 1 commit
Committed by Wilber
- 24 Sep, 2020 1 commit
Committed by wanghuancoder
* use iwyu clean include, test=develop, test=win
* compilation error, test=develop
* fix compilation error2, test=develop
* fix compilation error3, test=develop
* fix compilation error4, test=develop
* fix compilation error5, test=develop
* fix compilation error6, test=develop
* fix compilation error7, test=develop
* fix compilation error8, test=develop
* fix compilation error8, test=develop
* fix compilation error10, test=develop
* fix compilation error11, test=develop
- 11 Sep, 2019 1 commit
Committed by Huihuang Zheng
TemporaryAllocator is a singleton used to allocate memory for cuDNN. Because it is a singleton, removing it improves memory performance. We replace TemporaryAllocator with CUDADeviceContextAllocator and CUDADeviceContextAllocation, which use a stream callback to free the memory allocated for the stream and so avoid the singleton. Also added data_feed_proto to operator to fix CI in CPU compilation.
- 10 Jun, 2019 1 commit
Committed by Zeng Jinle
* remove attribute in Allocator::Allocate, test=develop
* fix travis ci error, test=develop
- 16 Nov, 2018 1 commit
Committed by Yu Yang
test=develop
- 14 Nov, 2018 1 commit
Committed by Yu Yang
- 10 Oct, 2018 1 commit
Committed by sneaxiy
- 28 Sep, 2018 1 commit
Committed by Yu Yang
Use OO style to rewrite memory allocation.
- 08 Apr, 2018 1 commit
Committed by Yi Wang
- 26 Mar, 2018 3 commits
Committed by chengduoZH
Committed by chengduoZH
Committed by chengduoZH
- 20 Mar, 2018 1 commit
Committed by chengduoZH
- 12 Feb, 2018 1 commit
Committed by qingqing01
- 10 Feb, 2018 2 commits
- 05 Feb, 2018 1 commit
Committed by dzhwinter
- 09 Jan, 2018 1 commit
Committed by QI JUN
- 18 Aug, 2017 1 commit
Committed by liaogang
- 04 Aug, 2017 1 commit
Committed by liaogang
- 28 Jul, 2017 1 commit
Committed by liaogang
- 25 Jul, 2017 1 commit
Committed by liaogang
- 22 Jul, 2017 1 commit
Committed by Yi Wang
- 21 Jul, 2017 2 commits
- 19 Jul, 2017 3 commits
Committed by liaogang
Committed by fengjiayi
ATTENTION: some interfaces changed:
1. void Tensor::set_dims(const DDim& dims) ==> void Tensor::Resize(const DDim& dims)
2. void Tensor::ShareDataFrom(const Tensor& src) ==> void Tensor::ShareDataWith(const Tensor& src)
3. DDim Tensor::dims() const ==> const DDim& Tensor::dims() const
Committed by liaogang
- 06 Jul, 2017 1 commit
Committed by liaogang
- 28 Jun, 2017 1 commit
Committed by liaogang
- 27 Jun, 2017 1 commit
Committed by Yi Wang
- 26 Jun, 2017 2 commits
- 25 May, 2017 1 commit
Committed by Yu Yang
- 09 Dec, 2016 1 commit
Committed by Yi Wang
- 29 Aug, 2016 1 commit
Committed by zhangjinchao01
ISSUE=4586495 git-svn-id: https://svn.baidu.com/idl/trunk/paddle@1408 1ad973e4-5ce8-4261-8a94-b56d1f490c56