1. 24 Apr, 2020 1 commit
    • Add cholesky_op (#23543) · a8c0fb4e
      Guo Sheng committed
      * Add cholesky_op forward part. test=develop
      
      * Complete cholesky_op forward part. test=develop
      
      * Add cholesky_op backward part. test=develop
      
      * Complete cholesky_op backward part. test=develop
      
      * Refine cholesky_op error check and docs. test=develop
      
      * Add grad_check unit test for cholesky_op. test=develop
      
      * Fix sample code in cholesky doc. test=develop
      
      * Refine some error messages of cholesky_op. test=develop
      
      * Refine some error messages of cholesky_op. test=develop
      
      * Remove unused input in cholesky_grad. test=develop
      
      * Remove unused input in cholesky_grad. test=develop
      
      * Fix stream for cusolverDnSetStream. test=develop
      
      * Update PADDLE_ENFORCE_CUDA_SUCCESS from cholesky_op to adapt to latest code.
      test=develop
      
      * Add CUSOLVER ERROR in enforce.h
      test=develop
      
      * Fix the missing return value in cholesky. test=develop
  2. 23 Apr, 2020 1 commit
  3. 20 Apr, 2020 1 commit
    • Optimize the error messages of paddle CUDA API (#23816) · 78170037
      Zhou Wei committed
      * Optimize the error messages of paddle CUDA API, test=develop
      
      * fix the error messages of paddle CUDA API, test=develop
      
      * Refactor PADDLE_ENFORCE_CUDA_SUCCESS and apply it to curand/cudnn/cublas/NCCL, test=develop
      
      * remove build_ex_string, test=develop
      
      * merge conflict, test=develop
  4. 17 Apr, 2020 1 commit
  5. 01 Apr, 2020 1 commit
  6. 30 Mar, 2020 1 commit
  7. 05 Feb, 2020 1 commit
  8. 08 Jan, 2020 1 commit
  9. 29 Nov, 2019 1 commit
  10. 18 Nov, 2019 1 commit
    • Fix warn of gcc8 (#21205) · cdb3d279
      Zeng Jinle committed
      * fix warnings of gcc 8 compilation, test=develop
      
      * fix boost::bad_get, test=develop
      
      * refine PADDLE_ENFORCE, test=develop
  11. 14 Nov, 2019 1 commit
  12. 28 Sep, 2019 1 commit
    • Enable users to create custom cpp op outside framework. (#19256) · 1a3eef02
      qingqing01 committed
      * Custom ops must be written according to the framework OP spec.
      * Package fluid_framework.so and headers into whl.
      * Add paddle.sysconfig.get_include() and paddle.sysconfig.get_lib() to get include dir and lib dir.
      * Export some C-APIs to merge OpInfo between core.so and custom_op.so.
      * Add unit testing.
      * Update API.spec.
  13. 24 Sep, 2019 1 commit
  14. 22 Sep, 2019 1 commit
  15. 18 Sep, 2019 1 commit
  16. 11 Sep, 2019 1 commit
    • Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320
      Huihuang Zheng committed
      TemporaryAllocator is a singleton used to allocate memory for cuDNN. Because it is a singleton, we can delete it to get better memory performance.
      
      We replace TemporaryAllocator with CUDADeviceContextAllocator and CUDADeviceContextAllocation, which use a stream callback to free the memory allocated for the stream, avoiding the singleton.
      
      Also add data_feed_proto to operator to fix CI in CPU compilation.
  17. 03 Sep, 2019 1 commit
  18. 08 Jul, 2019 1 commit
    • add mkldnn shapeblob cache clear strategy (#18513) · fe32879d
      Tao Luo committed
      * add mkldnn shapeblob cache clear strategy
      
      test=develop
      
      * refine with comments
      
      test=develop
      
      * make cache clear strategy safer
      
      test=develop
      
      * add lock for GetShapeBlobSize
      
      test=develop
  19. 03 Jul, 2019 1 commit
  20. 02 Jul, 2019 1 commit
  21. 27 Jun, 2019 1 commit
  22. 28 Apr, 2019 1 commit
  23. 21 Apr, 2019 1 commit
    • Refine model gpu memory (#16993) · 1202d3fc
      Zeng Jinle committed
      * speedup gc and inplace softmax_with_cross_entropy_grad
      test=develop
      
      * refine models gpu mem
      Merge skip vars and warning messages of mem opt
      remove relu mem opt
      test=develop
      
      * follow comments
      test=develop
  24. 25 Mar, 2019 1 commit
  25. 21 Mar, 2019 1 commit
  26. 20 Mar, 2019 2 commits
    • 07dcf285
    • Collective ops (#15572) · 6382b62f
      Wu Yi committed
      * wip allreduce in op
      
      * wip
      
      * wip
      
      * wip
      
      * wip adding test
      
      * wip for conflict with mp mode
      
      * fix tests test=develop
      
      * fix cpu build test=develop
      
      * fix travis clang format test=develop
      
      * fix cpu build test=develop
      
      * update api.spec test=develop
      
      * delete comment test=develop
      
      * fix cpplint test=develop
      
      * fix test=develop
      
      * follow comment test=develop
      
      * add file test=develop
      
      * fix build test=develop
      
      * update test=develop
      
      * to be compatible with sync_bn, and fix mp mode in develop test=develop
  27. 19 Mar, 2019 1 commit
  28. 16 Mar, 2019 1 commit
  29. 15 Mar, 2019 1 commit
    • Support sync batch norm. (#16121) · 8ad672a2
      qingqing01 committed
      * Support Sync Batch Norm.
      * Note: do not enable it when running on a single device.
      
      Usage:
      
      build_strategy = fluid.BuildStrategy()
      build_strategy.sync_batch_norm = True
      binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
              loss_name=loss_mean.name,
              build_strategy=build_strategy)
  30. 14 Jan, 2019 1 commit
  31. 11 Jan, 2019 2 commits
    • Revert "Remove workspace_handle in conv_cudnn (#15186)" · 358e657f
      chengduozh committed
      test=develop
      This reverts commit 064512aa.
    • Remove workspace_handle in conv_cudnn (#15186) · 064512aa
      chengduo committed
      * remove workspace_handle in conv2d_cudnn
      test=develop
      
      * remove workspace_handle
      test=develop
      
      * fix bug
      test=develop
      
      * make test_conv2d_op SERIAL
      test=develop
      
      * save memory in conv_cudnn
      test=develop
      
      * enhance thread safety
      test=develop
      
      * enhance temporary allocator
      test=develop
      
      * Add excess fraction
      test=develop
      
      * follow comments
      test=develop
      
      * fix bug and code refine
      test=develop
      
      * fix memory size check
      test=develop
      
      * rename reuse_tmp_allocation_excess_fraction
      test=develop
  32. 08 Jan, 2019 2 commits
  33. 02 Jan, 2019 1 commit
  34. 29 Dec, 2018 1 commit
  35. 25 Dec, 2018 1 commit
  36. 21 Dec, 2018 1 commit
    • [Feature] Add Temporary Allocator (#14875) · 79bd6dfa
      chengduo committed
      * Add Temporary Allocator
      
      * add Temporary Allocator to DeviceContext
      test=develop
      
      * code refine
      test=develop
      
      * fix mean_iou
      test=develop
      
      * Add DeviceTemporaryAllocator
      test=develop
      
      * fix conv_op bug
      test=develop
      
      * small fix
      test=develop
      
      * code refine
      test=develop
      
      * log refine
      test=develop
      
      * fix unit test
      test=develop
      
      * move double check
      
      * refine concat_and_split
      test=develop
      
      * add limit_of_temporary_allocation
      test=develop
      
      * fix name
      test=develop
  37. 11 Dec, 2018 1 commit