1. 09 Jun, 2020 1 commit
  2. 08 Jun, 2020 1 commit
  3. 03 Jun, 2020 1 commit
  4. 20 May, 2020 1 commit
    • H
      Make buddy_allocator_test Exclusive (#24636) · f831f612
      Huihuang Zheng committed
      buddy_allocator_test depends on several large GPU allocations that the current machine may not be able to provide, which can cause random failures on cloud machines. After discussion with Shuo Tian, we decided to make buddy_allocator_test exclusive.
  5. 11 May, 2020 1 commit
    • C
      Add macro BOOST_GET to enrich the error information of boost::get (#24175) · aa0f254f
      Chen Weihang committed
      * add new macro BOOST_GET_SAFELY & unittests, test=develop
      
      * add different macro type, test=develop
      
      * fix get macro type in executor, test=develop
      
      * four macro part change backup
      
      * using one macro for all case, test=develop
      
      * revert attribute change, test=develop
      
      * change to three func to solve gcc4.8 bug, test=develop
      
      * polish some details, test=develop
  6. 28 Apr, 2020 1 commit
  7. 21 Apr, 2020 1 commit
  8. 20 Apr, 2020 1 commit
    • Z
      Optimize the error messages of paddle CUDA API (#23816) · 78170037
      Zhou Wei committed
      * Optimize the error messages of paddle CUDA API, test=develop
      
      * fix the error messages of paddle CUDA API, test=develop
      
      * Refactor PADDLE_ENFORCE_CUDA_SUCCESS and apply it to curand/cudnn/cublas/NCCL, test=develop

      * remove build_ex_string, test=develop

      * merge conflict, test=develop
  9. 04 Mar, 2020 1 commit
    • Z
      Add flags to limit gpu memory (#22793) · d41d802b
      Zeng Jinle committed
      * add recorded cuda memory apis, fix typo, test=develop
      
      * add more ut, test=develop
      
      * follow comments, test=develop
      
      * fix py35 incompatibility issues, test=develop
  10. 02 Mar, 2020 1 commit
    • C
      Speed up dygraph DataLoader based on shared memory and LoDTensor serialization (#22541) · 7d8d5734
      Chen Weihang committed
      * add lodtensor shared memory & serialization, test=develop
      
      * fix windows compile error, test=develop
      
      * handle vartype pickle & fix unittest matching error message, test=develop
      
      * update timeout variable name, test=develop
      
      * refactor memory map implementation, test=develop

      * clear mmap file descriptor on unexpected exit, test=develop
      
      * remove the child process fd in advance, test=develop
      
      * remove mmap fds after Queue.put in child process, test=develop
      
      * add hard unittests for register exit func, test=develop
      
      * fix python2 compatibility problem in unittest, test=develop
      
      * fix exception unittest error, test=develop
      
      * polish code based on review comments, test=develop
  11. 06 Feb, 2020 1 commit
    • Y
      Correct the use of DeviceContext in unittest sequence_pooling_test and... · 44b45b9f
      Yiqun Liu committed
      Correct the use of DeviceContext in unittest sequence_pooling_test and sequence_padding_test (#22456)
      
      * Add log in memory::Copy for debug purpose.
      
      * Use the context from DeviceContextPool directly in sequence_pooling_test instead of creating a new one.

      * Use the context from DeviceContextPool directly in sequence_padding_test instead of creating a new one.
      test=develop
      
      * Change the type of second_dim from size_t to int64_t.
      test=develop
  12. 14 Jan, 2020 1 commit
  13. 13 Jan, 2020 1 commit
  14. 08 Jan, 2020 1 commit
  15. 06 Jan, 2020 1 commit
  16. 19 Dec, 2019 1 commit
  17. 02 Dec, 2019 1 commit
  18. 28 Nov, 2019 1 commit
  19. 14 Nov, 2019 1 commit
  20. 13 Nov, 2019 1 commit
  21. 06 Nov, 2019 1 commit
  22. 05 Nov, 2019 1 commit
  23. 30 Oct, 2019 1 commit
  24. 29 Oct, 2019 1 commit
  25. 24 Oct, 2019 1 commit
  26. 17 Oct, 2019 1 commit
    • L
      improve the efficiency of BuddyAllocator (#19888) · 569951c4
      liuwei1031 committed
      * improve save and load behaviour, test=develop
      
      * code cleaning, test=develop
      
      * disable check_guards and update_guards in release version, test=develop
      
      * fix compilation issue, test=develop
      
      * add buddy_allocator speed test data, test=develop
      
      * fix compilation issue, test=develop
      
      * fix comment, test=develop
      
      * update function names according to the google C++ style guide, test=develop
      
      * tweak the test data format, test=develop
      
      * move buddy_allocator_test_data to paddle/fluid/testdata, test=develop
      
      * add accessor and mutator for Desc, test=develop
  27. 25 Sep, 2019 1 commit
  28. 24 Sep, 2019 2 commits
  29. 20 Sep, 2019 2 commits
  30. 18 Sep, 2019 1 commit
  31. 17 Sep, 2019 1 commit
  32. 16 Sep, 2019 1 commit
  33. 11 Sep, 2019 1 commit
    • H
      Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320
      Huihuang Zheng committed
      TemporaryAllocator is a singleton used to allocate memory for cuDNN. Because it is a singleton, we can remove it to improve memory performance.

      We replace TemporaryAllocator with CUDADeviceContextAllocator and CUDADeviceContextAllocation, which use a stream callback to free the memory allocated for a stream, so no singleton is needed.

      Also add data_feed_proto to operator to fix CI for CPU compilation.
  34. 09 Sep, 2019 1 commit
  35. 06 Sep, 2019 1 commit
  36. 03 Sep, 2019 3 commits