1. 28 Apr, 2020 1 commit
  2. 21 Apr, 2020 1 commit
  3. 20 Apr, 2020 1 commit
    • Optimize the error messages of paddle CUDA API (#23816) · 78170037
      Zhou Wei committed
      * Optimize the error messages of paddle CUDA API, test=develop
      
      * fix the error messages of paddle CUDA API, test=develop
      
      * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop
      
      * remove build_ex_string,test=develop
      
      * merge conflict,test=develop
  4. 04 Mar, 2020 1 commit
    • Add flags to limit gpu memory (#22793) · d41d802b
      Zeng Jinle committed
      * add recorded cuda memory apis, fix typo, test=develop
      
      * add more ut, test=develop
      
      * follow comments, test=develop
      
      * fix py35 incompatible issues, test=develop
  5. 02 Mar, 2020 1 commit
    • Speed up dygraph DataLoader based on shared memory and LoDTensor serialization (#22541) · 7d8d5734
      Chen Weihang committed
      * add lodtensor share memory & serialization, test=develop
      
      * fix windows compile error, test=develop
      
      * deal vartype pickle & fix unittest matching error message, test=develop
      
      * update timeout variable name, test=develop
      
      * refactor memory map implement, test=develop
      
      * clear mmap file descriptor when exit unexpectedly, test=develop
      
      * remove the child process fd in advance, test=develop
      
      * remove mmap fds after Queue.put in child process, test=develop
      
      * add hard unittests for register exit func, test=develop
      
      * fix python2 compatibility problem in unittest, test=develop
      
      * fix exception unittest error, test=develop
      
      * polish code based review comment, test=develop
  6. 06 Feb, 2020 1 commit
    • Correct the use of DeviceContext in unittest sequence_pooling_test and... · 44b45b9f
      Yiqun Liu committed
      Correct the use of DeviceContext in unittest sequence_pooling_test and sequence_padding_test (#22456)
      
      * Add log in memory::Copy for debug purpose.
      
      * Change to use the context in DeviceContextPool directly in sequence_pooling_test, instead of creating a new one.
      
      * Change to use the context in DeviceContextPool directly in sequence_padding_test, instead of creating a new one.
      test=develop
      
      * Change the type of second_dim from size_t to int64_t.
      test=develop
  7. 14 Jan, 2020 1 commit
  8. 13 Jan, 2020 1 commit
  9. 08 Jan, 2020 1 commit
  10. 06 Jan, 2020 1 commit
  11. 19 Dec, 2019 1 commit
  12. 02 Dec, 2019 1 commit
  13. 28 Nov, 2019 1 commit
  14. 14 Nov, 2019 1 commit
  15. 13 Nov, 2019 1 commit
  16. 06 Nov, 2019 1 commit
  17. 05 Nov, 2019 1 commit
  18. 30 Oct, 2019 1 commit
  19. 29 Oct, 2019 1 commit
  20. 24 Oct, 2019 1 commit
  21. 17 Oct, 2019 1 commit
    • improve the efficiency of BuddyAllocator (#19888) · 569951c4
      liuwei1031 committed
      * improve save and load behaviour, test=develop
      
      * code cleaning, test=develop
      
      * disable check_guards and update_guards in release version, test=develop
      
      * fix compilation issue, test=develop
      
      * add buddy_allocator speed test data, test=develop
      
      * fix compilation issue, test=develop
      
      * fix comment, test=develop
      
      * update function names according to the google C++ style guide, test=develop
      
      * tweak the test data format, test=develop
      
      * move buddy_allocator_test_data to paddle/fluid/testdata, test=develop
      
      * add accessor and mutator for Desc, test=develop
  22. 25 Sep, 2019 1 commit
  23. 24 Sep, 2019 2 commits
  24. 20 Sep, 2019 2 commits
  25. 18 Sep, 2019 1 commit
  26. 17 Sep, 2019 1 commit
  27. 16 Sep, 2019 1 commit
  28. 11 Sep, 2019 1 commit
    • Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320
      Huihuang Zheng committed
      TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory.
      
      We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton.
      
      Also added data_feed_proto to operator to fix CI in CPU compilation
  29. 09 Sep, 2019 1 commit
  30. 06 Sep, 2019 1 commit
  31. 03 Sep, 2019 3 commits
  32. 01 Sep, 2019 1 commit
    • Add retry_allocator for gpu (#19409) · 0a73f720
      Zeng Jinle committed
      * add retry_allocator for gpu, test=develop
      
      * follow chengduoZH's comments, test=develop
      
      * follow huihuang's comments,test=develop
      
      * change f,l in enforce.h to be file,line, test=develop
      
      * increase code coverage by adding unittests, test=develop
      
      * fix CMakeLists.txt, test=develop
  33. 30 Aug, 2019 1 commit
  34. 22 Aug, 2019 1 commit
  35. 20 Aug, 2019 1 commit
  36. 16 Aug, 2019 1 commit