- 08 Nov, 2021 1 commit
-
-
Committed by wanghuancoder
* Use cuda virtual memory management and merge blocks, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* window dll, test=develop
* fix cuda error of CUDA_ERROR_NOT_INITIALIZED, test=develop
* use autogrowthv2 for system allocator, test=develop
* remove ~CUDAVirtualMemAllocator(), test=develop
* refine, test=develop
* fix cuda error of CUDA_ERROR_NOT_INITIALIZED, test=develop
* fix cuda error of CUDA_ERROR_NOT_INITIALIZED, test=develop
* fix bug, test=develop
* revert system allocator, test=develop
* revert multiprocessing, test=develop
* fix AutoGrowthBestFitAllocatorV2 mutex, test=develop
* catch cudaErrorInitializationError when creating allocator, test=develop
* fix cuMemSetAccess use, test=develop
* refine cuda api use, test=develop
* refine, test=develop
* for test, test=develop
* for test, test=develop
* switch to v2, test=develop
* refine virtual allocator, test=develop
* Record cuMemCreate and cuMemRelease, test=develop
* refine, test=develop
* avoid out of bounds, test=develop
* rename allocator, test=develop
* refine, test=develop
* use PADDLE_ENFORCE_CUDA_SUCCESS, test=develop
* for test, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
* refine, test=develop
-
- 28 Sep, 2021 1 commit
-
-
Committed by Yanxing Shi
* Initial Commit
* add unittest and add error information
* modify doc
* fix some error
* fix some word
* fix bug cudaDeviceProp* and modify error explanation
* fix cudaDeviceProp* error and unittest samples
* fix hip error and PADDLE_WITH_HIP
* update style
* fix error is_compiled_with_cuda
* fix paddle.device.cuda.get_device_properties
* fix error for multi thread safe
* update style
* merge conflict
* modify after mentor review
* update style
* delete word
* fix unittest error for windows
* support string input and modify some code
* modify doc to support string input
* fix error for express information
* fix error for express information
* fix unittest for windows
* fix device.startswith('gpu:')
* format error and doc
* fix after review
* format code
* fix error for doc compile
* fix error for doc compile
* fix error for doc compile
* fix error for doc compile
* fix error for doc compile
* fix py2 error
* fix wrong words and doc
* fix _gpuDeviceProperties
-
- 14 Sep, 2021 1 commit
-
-
Committed by chenenquan
* Add empty_cache api to release idle gpu memory held by allocator, test=develop
* Add empty_cache api to release idle gpu memory held by allocator, test=develop
* Add empty_cache api to release idle gpu memory held by allocator, test=develop
* Fix test coverage problem for empty_cache
* delete redundant check for empty_cache
* fix the problem of empty_cache's doc
* delete the nvidia-smi comment in doc of empty_cache, test=document_fix
-
- 07 Feb, 2021 1 commit
-
-
Committed by Qi Li
-
- 16 Aug, 2020 1 commit
-
-
Committed by wangchaochaohu
-
- 04 Mar, 2020 1 commit
-
-
Committed by Zeng Jinle
* add recorded cuda memory apis, fix typo, test=develop
* add more ut, test=develop
* follow comments, test=develop
* fix py35 incompatible issues, test=develop
-
- 09 Jan, 2020 1 commit
-
-
Committed by 石晓伟
-
- 08 Jan, 2020 1 commit
-
-
Committed by zhaoyuchen2018
stack's wait costs a lot of CPU time; using a CUDA kernel to do the memory copy reduces CPU time.
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
-
- 14 Nov, 2019 1 commit
-
-
Committed by zhaoyuchen2018
* Improve topk performance. Computing topk over 200000 data items: before optimization 1s, after optimization 0.0028s.
* Refine return value.
* Add cuda util functions.
* Fix ComputeBlockSize bug & refine comments.
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
-
- 06 Nov, 2019 1 commit
-
-
Committed by Zeng Jinle
-
- 31 Jul, 2019 1 commit
-
-
Committed by Huihuang Zheng
GPU allocation uses a fraction of available memory; also fix GetUsed running without a lock
-
- 21 Mar, 2019 1 commit
-
-
Committed by sneaxiy
modify allocator strategy; remove changes of legacy buddy_allocator; test=develop
-
- 19 Mar, 2019 1 commit
-
-
Committed by zhhsplendid
test=develop
-
- 04 Dec, 2018 1 commit
-
-
Committed by Wu Yi
* wip multi process multi gpu dist training
* workable for p2p
* update test=develop
* change back env name test=develop
* fix alloc init
* fix cpu build test=develop
* fix mac tests test=develop
* refine code
* refine test=develop
-
- 22 Nov, 2018 1 commit
-
-
Committed by chengduo
* refine cublas test=develop
* code refine
* refine cublas
* add GEMM_EX
* add enable_cublas_tensor_op_math doc and add cublasCall test=develop
* fix CublasCall for cuda version test=develop
* fix error test=develop
* fix GEMM_EX to be compatible with gcc 4.8 test=develop
* add GEMM_EX test=develop
* to be compatible with gcc4.8 test=develop
-
- 15 Oct, 2018 1 commit
-
-
Committed by chengduo
test=develop
-
- 27 Sep, 2018 1 commit
-
-
Committed by typhoonzero
This reverts commit 1d91a49d.
-
- 26 Sep, 2018 1 commit
-
-
Committed by chengduo
* some trivial opt
* remove the fix of lod_tensor and shrink_rnn_memory_op
* refine ShrinkRNNMemoryOp test=develop
-
- 23 Apr, 2018 1 commit
-
-
Committed by fengjiayi
-
- 08 Apr, 2018 2 commits
- 10 Mar, 2018 1 commit
-
-
Committed by Kexin Zhao
-
- 03 Mar, 2018 1 commit
-
-
Committed by chengduoZH
-
- 12 Feb, 2018 1 commit
-
-
Committed by qingqing01
-
- 10 Feb, 2018 1 commit
-
-
Committed by Yi Wang
-
- 22 Dec, 2017 1 commit
-
-
Committed by dzhwinter
* "remove GPU Sync Interface"
* "fix typo"
* "fix type cast error"
* "fix related Copy with stream"
* "fix failed tests with DevicePool"
* "fix stupid removed position error"
-
- 16 Nov, 2017 1 commit
-
-
Committed by dzhwinter
* "fix accuracy kernel bug"
* "relaunch ci"
-
- 10 Oct, 2017 1 commit
-
-
Committed by Luo Tao
-
- 05 Oct, 2017 3 commits
-
-
Committed by Yi Wang
-
Committed by Yi Wang
-
Committed by Yu Yang
By shell command

```bash
sed -i 's#ifdef PADDLE_ONLY_CPU#ifndef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
sed -i 's#ifndef PADDLE_ONLY_CPU#ifdef PADDLE_WITH_GPU#g' `find ./paddle/ -name '*.h' -o -name '*.cc' -o -name '*.cpp' -o -name '*.c' -o -name '*.cu'`
```
-
- 26 Sep, 2017 1 commit
-
-
Committed by Qiao Longfei
* fix nv_library
* fix symbol in gpu_info.h
-
- 18 Aug, 2017 2 commits
- 19 Jul, 2017 1 commit
-
-
Committed by liaogang
-
- 11 Jul, 2017 1 commit
-
-
Committed by liaogang
-
- 04 Jul, 2017 1 commit
-
-
Committed by liaogang
-
- 29 Jun, 2017 2 commits
- 28 Jun, 2017 1 commit
-
-
Committed by liaogang
-