提交 · 277cf900fb49a28e7d7818addbb863f2b62d3ef5 · 机器未来 / Paddle

13 1月, 2022 1 次提交
- 石
  
  splits allocation for pten, test=develop (#38853) · 277cf900
  由石晓伟提交于 1月 13, 2022
  
  277cf900
28 12月, 2021 1 次提交

Utilize StreamSafeCUDAAllocator to support fast GC in new executor (#37642) · 0c7153a4

由 From00 提交于 12月 28, 2021

* fix reshape move storage error

* remove needless set type

* alloc tensor by shared storage

* Utilize StreamSafeCUDAAllocator to support fast GC in new executor

* Fix compile error for Windows and ROCm

* Fix compile error for Windows

* Modify UT stream_safe_cuda_alloc_test

* Modify UT stream_safe_cuda_alloc_test

* Rewrite fast GC

* Rewrite fast GC

* Fix compile error for BOOST_GET_CONST

* Fix compile error for BOOST_GET_CONST

* Changes default stream for StreamSafeCUDAAllocator

* Fix a small CI error

* Remove some redundant code

* Fix conflict

* Fix compile error for ROCm

* Fix Windoes CI error

* Fix CI error

* Remove some unnecessary code

* Fix CI error

* Add UT for fast GC

* Fix CI error

* add device-agnostic stream class

* add stream.h

* fix ut

* fix cpu compile

* Use RWLock in GetAllocator

* Fix CI error
Co-authored-by: NChen Weihang <chenweihang@baidu.com>
Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>

0c7153a4

27 12月, 2021 1 次提交
- L
  add device-agnostic stream class (#38391) · 6b5e33b4
  由 Leo Chen 提交于 12月 27, 2021
```
* add device-agnostic stream class

* add stream.h

* fix ut

* fix cpu compile
```
  6b5e33b4
17 12月, 2021 1 次提交
- F
  
  Add GetStream Interface for StreamSafeCUDAAllocator (#38195) · b0d12d99
  由 From00 提交于 12月 17, 2021
  
  b0d12d99
25 11月, 2021 1 次提交

Support multi-stream allocation for CUDA place (#37290) · b9c464c3

由 From00 提交于 11月 25, 2021

* Support multi-stream allocation for CUDA place

* Do not notify the retrying from other streams when free CUDA allocation

* Fix compile error for CPU

* Fix compile error for HIP

* Release memory for StreamSafeCUDAAllocaRetry in malloc_test

* Add FLAGS_use_stream_safe_cuda_allocator

* Fix CI error for 'set_tests_properties'

* Invalidate stream safe CUDA allocator for naive_best_fit and thread_local strategy

* Performance improvement: insert allocation pair to outstanding_events_map when free but not alloc; replace recursive_mutex with SpinLock

* FLAGS priority changes: FLAGS_use_system_allocator > FLAGS_use_stream_safe_cuda_allocator

* Performance improvement: directly delete allocation when the recorded_streams is empty in FreeImpl of StreamSafeCUDAAllocator

* Add UT for alloc interface

* Changes multi-stream interface; move retry code from AllocatorFacadePrivate to StreamSafeCUDAAllocator

b9c464c3

06 11月, 2020 1 次提交
- W
  
  Update memory release interface. (#28456) · ced5c40c
  由 Wilber 提交于 11月 05, 2020
  
  ced5c40c
04 11月, 2020 1 次提交
- W
  
  [Inference] Memory modification for ShrinkMemory. (#28355) · 05114693
  由 Wilber 提交于 11月 04, 2020
  
  05114693
24 9月, 2020 1 次提交

use iwyu clean include (#27267) · df43905f

由 wanghuancoder 提交于 9月 24, 2020

* use iwyu clean include, test=develop, test=win

* compilation error, test=develop

* fix compilation error2, test=develop

* fix compilation error3, test=develop

* fix compilation error4, test=develop

* fix compilation error5, test=develop

* fix compilation error6, test=develop

* fix compilation error7, test=develop

* fix compilation error8, test=develop

* fix compilation error8, test=develop

* fix compilation error10, test=develop

* fix compilation error11, test=develop

df43905f

11 9月, 2019 1 次提交

Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320

由 Huihuang Zheng 提交于 9月 11, 2019

TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory.

We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton.

Also added data_feed_proto to operator to fix CI in CPU compilation

12542320

10 6月, 2019 1 次提交
- Z
  Remove attribute in Allocator::Allocate (#17878) · 3ece61f7
  由 Zeng Jinle 提交于 6月 10, 2019
```
* remove attribute in Allocator::Allocate, test=develop

* fix travis ci error, test=develop
```
  3ece61f7
16 11月, 2018 1 次提交
- Y
  Add legacy_allocator · 19e669a9
  由 Yu Yang 提交于 11月 16, 2018
```
test=develop
```
  19e669a9
14 11月, 2018 1 次提交
- Y
  
  Refine code · d93b2d03
  由 Yu Yang 提交于 11月 14, 2018
  
  d93b2d03
10 10月, 2018 1 次提交
- S
  
  add support to old allocator · e2780623
  由 sneaxiy 提交于 10月 10, 2018
  
  e2780623
28 9月, 2018 1 次提交
- Y
  refactor(memory): rewrite memory allocation and make it extentable · 58ed412f
  由 Yu Yang 提交于 9月 28, 2018
```
Use OO style to rewrite memory allocation.
```
  58ed412f
08 4月, 2018 1 次提交
- Y
  
  Update CMakeLists · 67ba884d
  由 Yi Wang 提交于 4月 07, 2018
  
  67ba884d
26 3月, 2018 3 次提交
- C
  
  add unit test · 158d6c4d
  由 chengduoZH 提交于 3月 26, 2018
  
  158d6c4d
- C
  
  add CUDAPinnedPlace · 18eb7730
  由 chengduoZH 提交于 3月 26, 2018
  
  18eb7730
- C
  
  replace use_pinned with is_pinned · 39004080
  由 chengduoZH 提交于 3月 26, 2018
  
  39004080
20 3月, 2018 1 次提交
- C
  
  add pinned memory · 236b7dd2
  由 chengduoZH 提交于 3月 20, 2018
  
  236b7dd2
12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
05 2月, 2018 1 次提交
- D
  
  "add some interfaces" · 63320f72
  由 dzhwinter 提交于 2月 05, 2018
  
  63320f72
09 1月, 2018 1 次提交
- Q
  
  add general memory usage interface for both CPU/CUDA (#7352) · 45e77154
  由 QI JUN 提交于 1月 09, 2018
  
  45e77154
18 8月, 2017 1 次提交
- L
  
  Add ENVIRONMENT interface interface · 55437b58
  由 liaogang 提交于 8月 18, 2017
  
  55437b58
04 8月, 2017 1 次提交
- L
  
  Add cpplint for *.h and cuda *.cu · b58725bd
  由 liaogang 提交于 8月 04, 2017
  
  b58725bd
28 7月, 2017 1 次提交
- L
  
  ENH: Add comments for memory and memcpy · 201e7157
  由 liaogang 提交于 7月 28, 2017
  
  201e7157
25 7月, 2017 1 次提交
- L
  
  FIX: restricting c++ template usage to POD types · 4e94cd75
  由 liaogang 提交于 7月 25, 2017
  
  4e94cd75
22 7月, 2017 1 次提交
- Y
  
  Move memory::Copy out from memory.h into memcpy.h · 858dea88
  由 Yi Wang 提交于 7月 21, 2017
  
  858dea88
21 7月, 2017 2 次提交
- F
  
  Retrigger CI · bf3b8f04
  由 fengjiayi 提交于 7月 21, 2017
  
  bf3b8f04
- F
  Update Tensor and PODDeleter's template parameter · da07ec18
  由 fengjiayi 提交于 7月 21, 2017
```
1. Change PODDeleter's template parameter 'PlaceType' to 'Place'.

2. Limit PODDeleter and Tensor::mutable_data()'s `T` to POD type.
```
  da07ec18
19 7月, 2017 3 次提交

L

Add memcpy · e53a48b4
由 liaogang 提交于 7月 19, 2017

e53a48b4

Simplify Tensor implimentation · 55d30172

由 fengjiayi 提交于 7月 19, 2017

ATTENTION: some interfaces changed:
1. void Tensor::set_dims(const DDim& dims) ==> void Tensor::Resize(const DDim& dims).
2. void Tensor::ShareDataFrom(const Tensor& src) ==> void Tensor::ShareDataWith(const Tensor& src)
3. DDim Tensor::dims() const ==> const DDim& Tensor::dims() const

55d30172

L

Add memcpy · 028f3dc4
由 liaogang 提交于 7月 19, 2017

028f3dc4

06 7月, 2017 1 次提交
- L
  
  FIX: remove boost from memory folder · ddfa6cf0
  由 liaogang 提交于 7月 06, 2017
  
  ddfa6cf0
28 6月, 2017 1 次提交
- L
  
  FIX: fix memory.h/cc · 29c7512b
  由 liaogang 提交于 6月 28, 2017
  
  29c7512b
27 6月, 2017 1 次提交
- Y
  
  Replace {cpu,gpu}_allocator.h and {cpu,gpu}_allocator_test.cc by system_allocator{.h,_test.cc} · e02859c0
  由 Yi Wang 提交于 6月 26, 2017
  
  e02859c0
26 6月, 2017 2 次提交
- Y
  
  Pass cpu_allocator_test · db128c45
  由 Yi Wang 提交于 6月 25, 2017
  
  db128c45
- Y
  
  add paddle/memory/detail/cpu_allocator* · 84d1c734
  由 Yi Wang 提交于 6月 25, 2017
  
  84d1c734
25 5月, 2017 1 次提交
- Y
  
  Remove not necessary functionalities in Parameter · 273e3f44
  由 Yu Yang 提交于 5月 25, 2017
  
  273e3f44

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致