Commits · 3047b69f9427b6aeae782042b3c4e68ab0a69b0b · BaiXuePrincess / Paddle

28 Apr, 2020 · 1 commit
- Make Allocation Smaller on BuddyAllocatorTest to Decrease CI Failure (#24205) · 3047b69f
  Committed by Huihuang Zheng on Apr 28, 2020
```
test=develop
```

21 Apr, 2020 · 1 commit

New feature: thread local allocator, test=develop (#23989) · d2584a70

Committed by 石晓伟 on Apr 21, 2020

* add the thread_local_allocator, test=develop

* refactor the thread_local_allocator, test=develop

* provides option setting strategy, test=develop

20 Apr, 2020 · 1 commit

Optimize the error messages of paddle CUDA API (#23816) · 78170037

Committed by Zhou Wei on Apr 20, 2020

* Optimize the error messages of paddle CUDA API, test=develop

* fix the error messages of paddle CUDA API, test=develop

* Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop

* remove build_ex_string,test=develop

* merge conflict,test=develop

04 Mar, 2020 · 1 commit

Add flags to limit gpu memory (#22793) · d41d802b

Committed by Zeng Jinle on Mar 04, 2020

* add recorded cuda memory apis, fix typo, test=develop

* add more ut, test=develop

* follow comments, test=develop

* fix py35 incompatible issues, test=develop

02 Mar, 2020 · 1 commit

Speed up dygraph DataLoader based on shared memory and LoDTensor serialization (#22541) · 7d8d5734

Committed by Chen Weihang on Mar 02, 2020

* add lodtensor share memory & serialization, test=develop

* fix windows compile error, test=develop

* deal vartype pickle & fix unittest matching error message, test=develop

* update timeout variable name, test=develop

* refactor memory map implement, test=develop

* clear mmap file descriptor when exiting unexpectedly, test=develop

* remove the child process fd in advance, test=develop

* remove mmap fds after Queue.put in child process, test=develop

* add hard unittests for register exit func, test=develop

* fix python2 compatibility problem in unittest, test=develop

* fix exception unittest error, test=develop

* polish code based review comment, test=develop

06 Feb, 2020 · 1 commit

Correct the use of DeviceContext in unittest sequence_pooling_test and... · 44b45b9f

Committed by Yiqun Liu on Feb 06, 2020

Correct the use of DeviceContext in unittest sequence_pooling_test and sequence_padding_test (#22456)

* Add log in memory::Copy for debug purpose.

* Change to use context in DeviceContextPool directly in sequence_pooling_test, instead of creating a new one.

* Change to use context in DeviceContextPool directly in sequence_padding_test, instead of creating a new one.
test=develop

* Change the type of second_dim from size_t to int64_t.
test=develop

14 Jan, 2020 · 1 commit
- faster build by reduce by-product, reduce linking library and fix compile... · 549e6de7
  Committed by zhouwei25 on Jan 14, 2020
```
faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11 (#22164)
```

13 Jan, 2020 · 1 commit
- remove cuda allocator ctor, test=develop (#22212) · 1b76e789
  Committed by Zeng Jinle on Jan 13, 2020

08 Jan, 2020 · 1 commit
- fix dygraph non zero gpu bug, test=develop (#22165) · c3bcd3c1
  Committed by Zeng Jinle on Jan 08, 2020

06 Jan, 2020 · 1 commit
- ag allocator by default, test=develop (#21837) · d9f5d1eb
  Committed by Zeng Jinle on Jan 06, 2020

19 Dec, 2019 · 1 commit
- Add some debug flags to auto growth allocator (#21766) · aa4d6a5d
  Committed by Zeng Jinle on Dec 18, 2019
```
* add some debug flags to auto growth allocator, test=develop

* add comments about auto growth, test=develop
```

02 Dec, 2019 · 1 commit

fix -Wno-error=sign-compare warning in gcc8 (#21434) · 01fa4ead

Committed by Tao Luo on Dec 02, 2019

* fix -Wno-error=sign-compare warning in gcc8

test=develop

* fix warning in distributed codes

test=develop

28 Nov, 2019 · 1 commit

Use system allocator in OpTest (#21335) · 09696d5d

Committed by Zeng Jinle on Nov 28, 2019

* use system allocator in unittests, test=develop

* fix op bugs, test=develop

* fix tensor copy bug when src and dst are the same, test=develop

14 Nov, 2019 · 1 commit
- change cuda enforce & add example (#21142) · b3a3e6f6
  Committed by Chen Weihang on Nov 14, 2019

13 Nov, 2019 · 1 commit
- add examples for resource exhausted error, test=develop (#21140) · 27fa9c10
  Committed by Chen Weihang on Nov 13, 2019

06 Nov, 2019 · 1 commit
- refine error message of allocator again, test=develop (#21023) · a710ccc0
  Committed by Zeng Jinle on Nov 06, 2019

05 Nov, 2019 · 1 commit
- refine error message of gpu allocator, test=develop (#21008) · f56967c4
  Committed by Zeng Jinle on Nov 05, 2019

30 Oct, 2019 · 1 commit
- refine err msg of allocator, test=develop (#20879) · c51722c8
  Committed by Zeng Jinle on Oct 30, 2019

29 Oct, 2019 · 1 commit
- lazy init of allocators, test=develop (#20854) · bb8d7783
  Committed by Zeng Jinle on Oct 29, 2019

24 Oct, 2019 · 1 commit
- refine err msg of allocator, test=develop (#20804) · cd1c4043
  Committed by Zeng Jinle on Oct 24, 2019

17 Oct, 2019 · 1 commit

improve the efficiency of BuddyAllocator (#19888) · 569951c4

Committed by liuwei1031 on Oct 17, 2019

* improve save and load behaviour, test=develop

* code cleaning, test=develop

* disable check_guards and update_guards in release version, test=develop

* fix compilation issue, test=develop

* add buddy_allocator speed test data, test=develop

* fix compilation issue, test=develop

* fix comment, test=develop

* update function names according to the google C++ style guide, test=develop

* tweak the test data format, test=develop

* move buddy_allocator_test_data to paddle/fluid/testdata, test=develop

* add accessor and mutator for Desc, test=develop

25 Sep, 2019 · 1 commit
- fix buddy_allocator_test, test=develop (#19967) · b8aff5e5
  Committed by Zeng Jinle on Sep 25, 2019

24 Sep, 2019 · 2 commits
- fix cuda dev_ctx allocator cmake deps, test=develop (#19953) · 37f76407
  Committed by Zeng Jinle on Sep 24, 2019
- fix allocator ut,test=develop (#19945) · 80e0f547
  Committed by Zeng Jinle on Sep 24, 2019

20 Sep, 2019 · 2 commits
- Refine err msg of out of gpu memory (#19779) · 747d4498
  Committed by Zeng Jinle on Sep 20, 2019
```
* refine err msg of out of gpu memory, test=develop

* refine err msg again, test=develop

* refine error message again, test=develop

* follow reviewer's comments, test=develop
```
- add free chunks to auto growth allocator, test=develop (#19890) · 8359b415
  Committed by Zeng Jinle on Sep 20, 2019

18 Sep, 2019 · 1 commit
- remove some flags and add comments to some flags, test=develop (#19813) · 13ca364c
  Committed by Zeng Jinle on Sep 18, 2019

17 Sep, 2019 · 1 commit
- Add comments for CUDA Device Context Allocator related stuff (#19809) · a0d80754
  Committed by Huihuang Zheng on Sep 17, 2019

16 Sep, 2019 · 1 commit
- fix retry allocator bug, test=develop (#19794) · b34933d9
  Committed by Zeng Jinle on Sep 16, 2019

11 Sep, 2019 · 1 commit

Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320

Committed by Huihuang Zheng on Sep 11, 2019

TemporaryAllocator is a singleton used to allocate memory for cuDNN. Since it is a singleton, we can remove it for better memory performance.

We replace TemporaryAllocator with CUDADeviceContextAllocator and CUDADeviceContextAllocation, which use a stream callback to free the memory allocated for the stream, so no singleton is needed.

Also added data_feed_proto to operator to fix CI in CPU compilation
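
The stream-callback idea in this commit message can be illustrated with a small pure-Python simulation. This is only a sketch under stated assumptions: `FakeStream` and `StreamAllocator` are hypothetical names invented for the illustration, buffers are modeled as integer ids, and nothing here is Paddle's actual C++ implementation or CUDA API. It shows why queuing the free behind earlier stream work keeps a buffer valid until that work has run.

```python
from collections import deque


class FakeStream:
    """Hypothetical stand-in for a CUDA stream: queued work runs in FIFO order."""

    def __init__(self):
        self._queue = deque()

    def enqueue(self, fn):
        # Schedule work (a "kernel" or a callback) behind everything queued so far.
        self._queue.append(fn)

    def synchronize(self):
        # Run all queued work in submission order.
        while self._queue:
            self._queue.popleft()()


class StreamAllocator:
    """Hypothetical per-stream allocator; buffers are modeled as integer ids."""

    def __init__(self, stream):
        self.stream = stream
        self.live = set()
        self._next_id = 0

    def allocate(self):
        buf_id = self._next_id
        self._next_id += 1
        self.live.add(buf_id)
        return buf_id

    def release(self, buf_id):
        # Do not free immediately: earlier work on the stream may still use the
        # buffer. Queue the free as a callback so it runs after that work.
        self.stream.enqueue(lambda: self.live.discard(buf_id))


stream = FakeStream()
alloc = StreamAllocator(stream)
log = []

buf = alloc.allocate()
stream.enqueue(lambda: log.append(("kernel saw live buffer", buf in alloc.live)))
alloc.release(buf)                          # the owner drops the buffer right away
still_live_before_sync = buf in alloc.live  # the free has only been queued so far
stream.synchronize()                        # kernel runs first, then the deferred free
```

In this sketch the allocator is tied to one stream rather than being a process-wide singleton, which mirrors the motivation stated in the commit message.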

09 Sep, 2019 · 1 commit
- add gpu_allocator_try_time config, test=develop (#19675) · a7691603
  Committed by Zeng Jinle on Sep 09, 2019

06 Sep, 2019 · 1 commit
- reduce thread num of retry_allocator_test,test=develop (#19638) · 2db40d9f
  Committed by Zeng Jinle on Sep 06, 2019

03 Sep, 2019 · 3 commits
- fix retry_allocator_test by removing glog envs, test=develop (#19596) · e045aadf
  Committed by Zeng Jinle on Sep 03, 2019
- fix parallel compilation error of allocator (#19581) · 578cccd4
  Committed by Zeng Jinle on Sep 03, 2019
- fix typo of allocator, test=develop (#19578) · f4562c34
  Committed by Zeng Jinle on Sep 03, 2019

01 Sep, 2019 · 1 commit

Add retry_allocator for gpu (#19409) · 0a73f720

Committed by Zeng Jinle on Sep 01, 2019

* add retry_allocator for gpu, test=develop

* follow chengduoZH's comments, test=develop

* follow huihuang's comments,test=develop

* change f,l in enforce.h to be file,line, test=develop

* increase code coverage by adding unittests, test=develop

* fix CMakeLists.txt, test=develop

30 Aug, 2019 · 1 commit
- remove unused assert.h (#19529) · 02270b3e
  Committed by Tao Luo on Aug 30, 2019
```
test=develop
```

22 Aug, 2019 · 1 commit
- add memory profiler (#19320) · a8a9823d
  Committed by chengduo on Aug 22, 2019
```
test=develop
```

20 Aug, 2019 · 1 commit

replace part of PADDLE_ASSERT to PADDLE_ENFORCE (#19285) · 6527a7df

Committed by Tao Luo on Aug 20, 2019

* replace part of PADDLE_ASSERT to PADDLE_ENFORCE

test=develop

* remove unused fallback_alloc_size_

* add unit-test of CUDAPinnedAllocator

test=develop

16 Aug, 2019 · 1 commit
- move_flags_to_unified_files_for_management, test=develop (#19224) · 708bd979
  Committed by Zeng Jinle on Aug 16, 2019

BaiXuePrincess / Paddle · In sync with the fork source project
