- 28 4月, 2020 1 次提交
-
-
由 Huihuang Zheng 提交于
test=develop
-
- 21 4月, 2020 1 次提交
-
-
由 石晓伟 提交于
* add the thread_local_allocator, test=develop * refactor the thread_local_allocator, test=develop * provides option setting strategy, test=develop
-
- 20 4月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
* Optimize the error messages of paddle CUDA API, test=develop * fix the error messages of paddle CUDA API, test=develop * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop * remove build_ex_string,test=develop * merge conflict,test=develop
-
- 04 3月, 2020 1 次提交
-
-
由 Zeng Jinle 提交于
* add recorded cuda memory apis, fix typo, test=develop * add more ut, test=develop * follow comments, test=develop * fix py35 incompatible issues, test=develop
-
- 02 3月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add lodtensor share memory & serialization, test=develop * fix windows compile error, test=develop * deal vartype pickle & fix unittest matching error message, test=develop * update timeout variable name, test=develop * refactor memory map implement, test=develop * clear mmap file discripter when exit unexpectedly, test=develop * remove the child process fd in advance, test=develop * remove mmap fds after Queue.put in child process, test=develop * add hard unittests for register exit func, test=develop * fix python2 compatibility problem in unittest, test=develop * fix exception unittest error, test=develop * polish code based review comment, test=develop
-
- 06 2月, 2020 1 次提交
-
-
由 Yiqun Liu 提交于
Correct the use of DeviceContext in unittest sequence_pooling_test and sequence_padding_test (#22456) * Add log in memory::Copy for debug purpose. * Change to use context in DeviceContextPool directly in sequence_pooling_test, instead to new one. * Change to use context in DeviceContextPool directly in sequence_padding_test, instead to new one. test=develop * Change the type of second_dim from size_t to int64_t. test=develop
-
- 14 1月, 2020 1 次提交
-
-
由 zhouwei25 提交于
faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11 (#22164)
-
- 13 1月, 2020 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 08 1月, 2020 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 06 1月, 2020 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 19 12月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* add some debug flags to auto growth allocator, test=develop * add comments about auto growth, test=develop
-
- 02 12月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* fix -Wno-error=sign-compare warning in gcc8 test=develop * fix warning in distributed codes test=develop
-
- 28 11月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* use system allocator in unittests, test=develop * fix op bugs, test=develop * fix tensor copy bug when src and dst are the same, test=develop
-
- 14 11月, 2019 1 次提交
-
-
由 Chen Weihang 提交于
-
- 13 11月, 2019 1 次提交
-
-
由 Chen Weihang 提交于
-
- 06 11月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 05 11月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 30 10月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 29 10月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 24 10月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 17 10月, 2019 1 次提交
-
-
由 liuwei1031 提交于
* improve save and load behaviour, test=develop * code cleaning, test=develop * disable check_guards and update_guards in release version, test=develop * fix compilation issue, test=develop * add buddy_allocator speed test data, test=develop * fix compilation issue, test=develop * fix comment, test=develop * update function names according to the google C++ style guide, test=develop * tweak the test data format, test=develop * move buddy_allocator_test_data to paddle/fluid/testdata, test=develop * add accessor and mutator for Desc, test=develop
-
- 25 9月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 24 9月, 2019 2 次提交
-
-
由 Zeng Jinle 提交于
-
由 Zeng Jinle 提交于
-
- 20 9月, 2019 2 次提交
-
-
由 Zeng Jinle 提交于
* refine err msg of out of gpu memory, test=develop * refine err msg again, test=develop * refine errog message again, test=develop * follow reviewer's comments, test=develop
-
由 Zeng Jinle 提交于
-
- 18 9月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 17 9月, 2019 1 次提交
-
-
由 Huihuang Zheng 提交于
-
- 16 9月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 11 9月, 2019 1 次提交
-
-
由 Huihuang Zheng 提交于
TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory. We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton. Also added data_feed_proto to operator to fix CI in CPU compilation
-
- 09 9月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 06 9月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 03 9月, 2019 3 次提交
-
-
由 Zeng Jinle 提交于
-
由 Zeng Jinle 提交于
-
由 Zeng Jinle 提交于
-
- 01 9月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* add retry_allocator for gpu, test=develop * follow chengduoZH's comments, test=develop * follow huihuang's comments,test=develop * change f,l in enforce.h to be file,line, test=develop * increase code coverage by adding unittests, test=develop * fix CMakeLists.txt, test=develop
-
- 30 8月, 2019 1 次提交
-
-
由 Tao Luo 提交于
test=develop
-
- 22 8月, 2019 1 次提交
-
-
由 chengduo 提交于
test=develop
-
- 20 8月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* replace part of PADDLE_ASSERT to PADDLE_ENFORCE test=develop * remove unused fallback_alloc_size_ * add unit-test of CUDAPinnedAllocator test=develop
-
- 16 8月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-