1. 27 5月, 2022 1 次提交
  2. 21 5月, 2022 1 次提交
  3. 04 3月, 2022 1 次提交
  4. 03 3月, 2022 1 次提交
  5. 15 2月, 2022 1 次提交
    • R
      [PluggableDevice] Add custom runtime support (#38740) · 3e7825f3
      ronnywang 提交于
      * [CustomRuntime] Add DeviceManager
      
      * [CustomRuntime] Add DeviceInterface
      
      * [CustomRuntime] Add Stream, Event, DeviceGuard, CallbackManager
      
      * [CustomRuntime] Add plug-in device
      
      * [CustomRuntime] Memory module support PluggableDevice
      
      * [CustomRuntime] Add WITH_PLUGGABLE_DEVICE cmake option
      
      * update
      
      * [API] update API doc based on comments, test=develop
      Co-authored-by: Nqili93 <qili93@qq.com>
      3e7825f3
  6. 14 1月, 2022 1 次提交
    • Q
      [MLU]Add mean and reduce_mean op (#38872) · 7f8d5bc8
      qipengh 提交于
      * [MLU]: add mean and reduce mean op
      
      * [MLU]add mlu pytest dir in CMakeLists.txt
      
      * [MLU]fix tensor data
      
      * [MLU]fix TensorToPyArray and license
      7f8d5bc8
  7. 04 1月, 2022 1 次提交
  8. 29 12月, 2021 1 次提交
  9. 22 12月, 2021 1 次提交
  10. 20 12月, 2021 1 次提交
  11. 03 12月, 2021 1 次提交
  12. 29 11月, 2021 1 次提交
  13. 27 11月, 2021 1 次提交
    • A
      [NPU] reorganization for device API abstraction (#37110) · 72241a6a
      Aganlengzi 提交于
      * [NPU] reorganization for device API abstraction
      
      * [NPU] delete old files
      
      * [NPU] fix npu_collective_helper
      
      * [NPU] fix collective_helper
      
      * [NPU] fix ut
      
      * [NPU] mod memory allocation and hccl_helper
      
      * [NPU] fix place_type
      
      * [NPU] split enfoce.h
      
      * move acl* call into npu_info
      
      * merge conflict
      
      * fix merge
      
      * merge conflict
      
      * merge conflict
      72241a6a
  14. 03 9月, 2021 1 次提交
  15. 28 6月, 2021 1 次提交
  16. 12 5月, 2021 1 次提交
  17. 26 4月, 2021 1 次提交
  18. 09 4月, 2021 1 次提交
    • L
      [NPU] cherry-pick basic NPU components/allocator/operator/executor supports from ascendrc (#32144) · ccf5709d
      Leo Chen 提交于
      * [feature] support npu allocator (#30840)
      
      [feature] support npu allocator
      
      * [feature] support npu operator (#30951)
      
      [feature] support npu operator
      
      * [feature] support npu allocator, part 2 (#30972)
      
      * support npu allocator
      
      * add npu device context
      
      * fix some compile problem
      
      * fix some compile problem
      
      * add npu info
      
      * compile ok
      
      * fix include dir
      
      * support naive_best_fit_allocator
      
      * run ut ok, bug failed to exit
      
      * call aclrtResetDevice before exit
      
      * fix aclFinilize
      
      * add system allocatot test
      
      * add selected_gpus in gtest
      
      * add tensor_test for npu
      
      * support npu op, initial commit
      
      * add npu stream
      
      * add elementwise_add_op
      
      * compile ok
      
      * fix typo
      
      * fix elementwise_add_op_npu_test
      
      * support op run
      
      * test can run but failed
      
      * change aclopExecuteV2 to aclopCompileAndExecute
      
      * support parsing ascend rank table file (#31000)
      
      support parsing ascend rank table file
      
      * Fix reshape on GE graph. (#31084)
      
      Fix reshape on GE graph
      
      * add npu kernel for elementwise_sub and elementwise_sub_grad (#30973)
      
      * add npu sub op
      
      * fix typo
      
      * rename test
      
      * fix bug
      
      * fix bug
      
      * add fp16 kernel
      
      * fix typo
      
      * support sub grad op
      
      * support elementwise_sub_grad op
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      
      * Fix compilation problem (#31100)
      
      Fix compilation problem (#31100)
      
      * fix compile
      
      * fix code stype
      
      * remove const_cast
      
      * support adding correct npu op in pybind.h (#31143)
      
      * support adding correct npu op in pybind.h
      
      * refine code
      
      * [NPU] Support executor with NPU (#31057)
      
      * [NPU] Support executor with NPU
      
      * Fix code according to reviews
      
      * Fix code
      
      * Add unittest for sub op npu
      
      * refactor npu device manager (#31154)
      
      refactor npu device manager (#31154)
      
      * fix selected npus
      
      * fix compile
      
      * fix reading flags from env
      
      * format
      Co-authored-by: Nxiayanming <41795079@qq.com>
      Co-authored-by: Ngongweibao <weibao.gong@gmail.com>
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      Co-authored-by: Nliym27 <33742067+liym27@users.noreply.github.com>
      ccf5709d
  19. 04 2月, 2021 1 次提交
  20. 01 2月, 2021 1 次提交
  21. 15 1月, 2021 1 次提交
  22. 11 12月, 2020 1 次提交
    • L
      Add the strategy of skipping cc/cu test compilation and execution in CI (#29499) · b5d4a1f3
      LoveAn 提交于
      * Add the strategy of skipping cc/cu test compilation and execution in CI, test=develop
      
      * fix if error with CI_SKIP_TEST, test=develop
      
      * fix add properties to test error on Linux/MAC, test=develop
      
      * fix set test properties of test_code_generator error, test=develop
      
      * remove test codes and advance judgment of file modification on Linux, test=develop
      
      * rename CI_SKIP_TEST to CI_SKIP_CPP_TEST, test=document_fix
      
      * Add branch judgement on Linux, test=develop
      b5d4a1f3
  23. 06 11月, 2020 1 次提交
  24. 04 11月, 2020 1 次提交
  25. 24 9月, 2020 1 次提交
    • W
      use iwyu clean include (#27267) · df43905f
      wanghuancoder 提交于
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
      df43905f
  26. 23 7月, 2020 1 次提交
  27. 24 6月, 2020 1 次提交
    • L
      Refine error message in memory folder (#25095) · ff5be2fb
      Leo Chen 提交于
      * refine PADDLE_THROW, test=develop
      
      * refine error msg, test=develop
      
      * refine cuda error, test=develop
      
      * follow comments, test=develop
      
      * fix compile problem, test=develop
      
      * fix bug, test=develop
      ff5be2fb
  28. 19 6月, 2020 1 次提交
  29. 20 5月, 2020 1 次提交
    • H
      Make buddy_allocator_test Exclusive (#24636) · f831f612
      Huihuang Zheng 提交于
      The background is that buddy_allocator_test depends on some big GPU allocations and current machine may not have it. It can cause random failure on cloud machine. After discussion with Shuo Tian, we decided to make buddy_allocator_test exclusive.
      f831f612
  30. 28 4月, 2020 1 次提交
  31. 04 3月, 2020 1 次提交
    • Z
      Add flags to limit gpu memory (#22793) · d41d802b
      Zeng Jinle 提交于
      * add recorded cuda memory apis, fix typo, test=develop
      
      * add more ut, test=develop
      
      * follow comments, test=develop
      
      * fix py35 incompatible issues, test=develop
      d41d802b
  32. 13 11月, 2019 1 次提交
  33. 06 11月, 2019 1 次提交
  34. 05 11月, 2019 1 次提交
  35. 30 10月, 2019 1 次提交
  36. 24 10月, 2019 1 次提交
  37. 17 10月, 2019 1 次提交
    • L
      improve the efficiency of BuddyAllocator (#19888) · 569951c4
      liuwei1031 提交于
      * improve save and load behaviour, test=develop
      
      * code cleaning, test=develop
      
      * disable check_guards and update_guards in release version, test=develop
      
      * fix compilation issue, test=develop
      
      * add buddy_allocator speed test data, test=develop
      
      * fix compilation issue, test=develop
      
      * fix comment, test=develop
      
      * update function names according to the google C++ style guide, test=develop
      
      * tweak the test data format, test=develop
      
      * move buddy_allocator_test_data to paddle/fluid/testdata, test=develop
      
      * add accessor and mutator for Desc, test=develop
      569951c4
  38. 25 9月, 2019 1 次提交
  39. 24 9月, 2019 1 次提交
  40. 20 9月, 2019 1 次提交
    • Z
      Refine err msg of out of gpu memory (#19779) · 747d4498
      Zeng Jinle 提交于
      * refine err msg of out of gpu memory, test=develop
      
      * refine err msg again, test=develop
      
      * refine errog message again, test=develop
      
      * follow reviewer's comments, test=develop
      747d4498