1. 26 Mar, 2021 1 commit
  2. 02 Mar, 2021 1 commit
  3. 09 Feb, 2021 3 commits
    • [feature] support npu allocator, part 2 (#30972) · 1201cd2e
      Leo Chen committed
      * support npu allocator
      
      * add npu device context
      
      * fix some compile problem
      
      * fix some compile problem
      
      * add npu info
      
      * compile ok
      
      * fix include dir
      
      * support naive_best_fit_allocator
      
      * run ut ok, but failed to exit
      
      * call aclrtResetDevice before exit
      
      * fix aclFinalize
      
      * add system allocator test
      
      * add selected_gpus in gtest
      
      * add tensor_test for npu
      
      * support npu op, initial commit
      
      * add npu stream
      
      * add elementwise_add_op
      
      * compile ok
      
      * fix typo
      
      * fix elementwise_add_op_npu_test
      
      * support op run
      
      * test can run but failed
      
      * change aclopExecuteV2 to aclopCompileAndExecute
    • [feature] support npu operator (#30951) · 7e049108
      Leo Chen committed
      [feature] support npu operator
    • [feature] support npu allocator (#30840) · 81138239
      Leo Chen committed
      [feature] support npu allocator
  4. 11 Jan, 2021 1 commit
  5. 28 Dec, 2020 1 commit
  6. 26 Dec, 2020 1 commit
  7. 23 Dec, 2020 1 commit
  8. 15 Dec, 2020 1 commit
  9. 14 Dec, 2020 2 commits
  10. 25 Nov, 2020 1 commit
  11. 23 Nov, 2020 1 commit
  12. 02 Nov, 2020 1 commit
  13. 24 Sep, 2020 1 commit
    • use iwyu clean include (#27267) · df43905f
      wanghuancoder committed
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
  14. 17 Sep, 2020 1 commit
  15. 21 Aug, 2020 2 commits
  16. 03 Jul, 2020 1 commit
  17. 14 May, 2020 1 commit
  18. 24 Apr, 2020 1 commit
    • Add cholesky_op (#23543) · a8c0fb4e
      Guo Sheng committed
      * Add cholesky_op forward part. test=develop
      
      * Complete cholesky_op forward part. test=develop
      
      * Add cholesky_op backward part. test=develop
      
      * Complete cholesky_op backward part. test=develop
      
      * Refine cholesky_op error check and docs. test=develop
      
      * Add grad_check unit test for cholesky_op. test=develop
      
      * Fix sample code in cholesky doc. test=develop
      
      * Refine some error messages of cholesky_op. test=develop
      
      * Refine some error messages of cholesky_op. test=develop
      
      * Remove unused input in cholesky_grad. test=develop
      
      * Remove unused input in cholesky_grad. test=develop
      
      * Fix stream for cusolverDnSetStream. test=develop
      
      * Update PADDLE_ENFORCE_CUDA_SUCCESS from cholesky_op to adapt to latest code.
      test=develop
      
      * Add CUSOLVER ERROR in enforce.h
      test=develop
      
      * Fix the missing return value in cholesky. test=develop
  19. 23 Apr, 2020 1 commit
  20. 20 Apr, 2020 1 commit
    • Optimize the error messages of paddle CUDA API (#23816) · 78170037
      Zhou Wei committed
      * Optimize the error messages of paddle CUDA API, test=develop
      
      * fix the error messages of paddle CUDA API, test=develop
      
      * Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop
      
      * remove build_ex_string,test=develop
      
      * merge conflict,test=develop
  21. 17 Apr, 2020 1 commit
  22. 01 Apr, 2020 1 commit
  23. 30 Mar, 2020 1 commit
  24. 05 Feb, 2020 1 commit
  25. 08 Jan, 2020 1 commit
  26. 29 Nov, 2019 1 commit
  27. 18 Nov, 2019 1 commit
    • Fix warn of gcc8 (#21205) · cdb3d279
      Zeng Jinle committed
      * fix warnings of gcc 8 compilation, test=develop
      
      * fix boost::bad_get, test=develop
      
      * refine PADDLE_ENFORCE, test=develop
  28. 14 Nov, 2019 1 commit
  29. 28 Sep, 2019 1 commit
    • Enable users to create custom cpp op outside framework. (#19256) · 1a3eef02
      qingqing01 committed
      * Custom ops must be written according to the framework OP spec.
      * Package fluid_framework.so and headers into whl.
      * Add paddle.sysconfig.get_include() and paddle.sysconfig.get_lib() to get include dir and lib dir.
      * Export some C-APIs to merge OpInfo between core.so and custom_op.so.
      * Add unit testing.
      * Update API.spec.
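
The commit above exposes `paddle.sysconfig.get_include()` and `paddle.sysconfig.get_lib()` so that an op can be compiled outside the framework. As a rough sketch of how those two directories might feed a build, the helper below assembles a compiler command line. The helper name `custom_op_build_command`, the file names, the compiler flags, and the library name passed as `lib_name` are all illustrative assumptions, not part of the commit.

```python
def custom_op_build_command(source, output, include_dir, lib_dir, lib_name,
                            compiler="g++"):
    """Assemble a shared-library build command for a custom C++ op.

    Hypothetical helper: the flags are a common choice for building a
    shared object, not something the commit prescribes.
    """
    return [
        compiler, "-std=c++11", "-shared", "-fPIC",
        source, "-o", output,
        "-I" + include_dir,   # headers packaged into the wheel
        "-L" + lib_dir,       # directory holding the framework library
        "-l" + lib_name,      # library name is an assumption of this sketch
    ]


if __name__ == "__main__":
    try:
        import paddle.sysconfig as paddle_sysconfig
    except ImportError:
        print("paddle is not installed; skipping the example")
    else:
        cmd = custom_op_build_command(
            "relu_op.cc", "relu_op.so",           # hypothetical file names
            paddle_sysconfig.get_include(),
            paddle_sysconfig.get_lib(),
            "paddle_framework",                   # assumed library name
        )
        print(" ".join(cmd))
```

Passing the directories in as arguments keeps the helper usable without Paddle installed; only the guarded block at the bottom queries `paddle.sysconfig`.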
  30. 24 Sep, 2019 1 commit
  31. 22 Sep, 2019 1 commit
  32. 18 Sep, 2019 1 commit
  33. 11 Sep, 2019 1 commit
    • Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320
      Huihuang Zheng committed
      TemporaryAllocator is a singleton used to allocate memory for cuDNN. Removing this singleton gives better memory performance.
      
      We replace TemporaryAllocator with CUDADeviceContextAllocator and CUDADeviceContextAllocation, which use a stream callback to free the memory allocated for the stream, so no singleton is needed.
      
      Also added data_feed_proto to operator to fix CI in CPU compilation
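
The stream-callback idea in the commit above can be illustrated with a small simulation: releasing an allocation does not free it immediately, it queues a callback on the stream so the memory is returned only after earlier work on that stream has run. This is a conceptual sketch in Python with no CUDA involved; `FakeStream`, `StreamScopedAllocator`, and `StreamAllocation` are hypothetical names and do not mirror the actual C++ classes.

```python
from collections import deque


class FakeStream:
    """Stand-in for a device stream: items run in FIFO order on synchronize()."""

    def __init__(self):
        self._queue = deque()

    def enqueue(self, fn):
        self._queue.append(fn)

    def add_callback(self, fn):
        # A callback is queued like any other item, so it runs only after
        # all work enqueued before it.
        self._queue.append(fn)

    def synchronize(self):
        while self._queue:
            self._queue.popleft()()


class StreamAllocation:
    """Handle for memory whose release is deferred via a stream callback."""

    def __init__(self, allocator, nbytes):
        self._allocator = allocator
        self.nbytes = nbytes
        self._released = False

    def release(self):
        if self._released:
            return
        self._released = True
        allocator = self._allocator
        nbytes = self.nbytes
        allocator.stream.add_callback(lambda: allocator.free(nbytes))


class StreamScopedAllocator:
    """Per-stream allocator; tracks only byte counts in this sketch."""

    def __init__(self, stream):
        self.stream = stream
        self.live_bytes = 0

    def allocate(self, nbytes):
        self.live_bytes += nbytes
        return StreamAllocation(self, nbytes)

    def free(self, nbytes):
        self.live_bytes -= nbytes
```

In this sketch, a kernel enqueued before `release()` still sees the buffer as live, because the free is ordered behind it on the same stream; each stream owns its allocator, so no global singleton is involved.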
  34. 03 Sep, 2019 1 commit
  35. 08 Jul, 2019 1 commit
    • add mkldnn shapeblob cache clear strategy (#18513) · fe32879d
      Tao Luo committed
      * add mkldnn shapeblob cache clear strategy
      
      test=develop
      
      * refine with comments
      
      test=develop
      
      * make cache clear strategy safer
      
      test=develop
      
      * add lock for GetShapeBlobSize
      
      test=develop
  36. 03 Jul, 2019 1 commit