1. 09 4月, 2021 1 次提交
    • L
      [NPU] cherry-pick basic NPU components/allocator/operator/executor supports from ascendrc (#32144) · ccf5709d
      Leo Chen 提交于
      * [feature] support npu allocator (#30840)
      
      [feature] support npu allocator
      
      * [feature] support npu operator (#30951)
      
      [feature] support npu operator
      
      * [feature] support npu allocator, part 2 (#30972)
      
      * support npu allocator
      
      * add npu device context
      
      * fix some compile problem
      
      * fix some compile problem
      
      * add npu info
      
      * compile ok
      
      * fix include dir
      
      * support naive_best_fit_allocator
      
      * run ut ok, bug failed to exit
      
      * call aclrtResetDevice before exit
      
      * fix aclFinilize
      
      * add system allocatot test
      
      * add selected_gpus in gtest
      
      * add tensor_test for npu
      
      * support npu op, initial commit
      
      * add npu stream
      
      * add elementwise_add_op
      
      * compile ok
      
      * fix typo
      
      * fix elementwise_add_op_npu_test
      
      * support op run
      
      * test can run but failed
      
      * change aclopExecuteV2 to aclopCompileAndExecute
      
      * support parsing ascend rank table file (#31000)
      
      support parsing ascend rank table file
      
      * Fix reshape on GE graph. (#31084)
      
      Fix reshape on GE graph
      
      * add npu kernel for elementwise_sub and elementwise_sub_grad (#30973)
      
      * add npu sub op
      
      * fix typo
      
      * rename test
      
      * fix bug
      
      * fix bug
      
      * add fp16 kernel
      
      * fix typo
      
      * support sub grad op
      
      * support elementwise_sub_grad op
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      
      * Fix compilation problem (#31100)
      
      Fix compilation problem (#31100)
      
      * fix compile
      
      * fix code stype
      
      * remove const_cast
      
      * support adding correct npu op in pybind.h (#31143)
      
      * support adding correct npu op in pybind.h
      
      * refine code
      
      * [NPU] Support executor with NPU (#31057)
      
      * [NPU] Support executor with NPU
      
      * Fix code according to reviews
      
      * Fix code
      
      * Add unittest for sub op npu
      
      * refactor npu device manager (#31154)
      
      refactor npu device manager (#31154)
      
      * fix selected npus
      
      * fix compile
      
      * fix reading flags from env
      
      * format
      Co-authored-by: Nxiayanming <41795079@qq.com>
      Co-authored-by: Ngongweibao <weibao.gong@gmail.com>
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      Co-authored-by: Nliym27 <33742067+liym27@users.noreply.github.com>
      ccf5709d
  2. 01 2月, 2021 1 次提交
  3. 11 12月, 2020 1 次提交
    • L
      Add the strategy of skipping cc/cu test compilation and execution in CI (#29499) · b5d4a1f3
      LoveAn 提交于
      * Add the strategy of skipping cc/cu test compilation and execution in CI, test=develop
      
      * fix if error with CI_SKIP_TEST, test=develop
      
      * fix add properties to test error on Linux/MAC, test=develop
      
      * fix set test properties of test_code_generator error, test=develop
      
      * remove test codes and advance judgment of file modification on Linux, test=develop
      
      * rename CI_SKIP_TEST to CI_SKIP_CPP_TEST, test=document_fix
      
      * Add branch judgement on Linux, test=develop
      b5d4a1f3
  4. 19 6月, 2020 1 次提交
  5. 20 5月, 2020 1 次提交
    • H
      Make buddy_allocator_test Exclusive (#24636) · f831f612
      Huihuang Zheng 提交于
      The background is that buddy_allocator_test depends on some big GPU allocations and current machine may not have it. It can cause random failure on cloud machine. After discussion with Shuo Tian, we decided to make buddy_allocator_test exclusive.
      f831f612
  6. 17 10月, 2019 1 次提交
    • L
      improve the efficiency of BuddyAllocator (#19888) · 569951c4
      liuwei1031 提交于
      * improve save and load behaviour, test=develop
      
      * code cleaning, test=develop
      
      * disable check_guards and update_guards in release version, test=develop
      
      * fix compilation issue, test=develop
      
      * add buddy_allocator speed test data, test=develop
      
      * fix compilation issue, test=develop
      
      * fix comment, test=develop
      
      * update function names according to the google C++ style guide, test=develop
      
      * tweak the test data format, test=develop
      
      * move buddy_allocator_test_data to paddle/fluid/testdata, test=develop
      
      * add accessor and mutator for Desc, test=develop
      569951c4
  7. 03 9月, 2019 1 次提交
  8. 08 4月, 2018 2 次提交
  9. 10 2月, 2018 1 次提交
  10. 06 7月, 2017 1 次提交
  11. 05 7月, 2017 1 次提交
  12. 04 7月, 2017 3 次提交
  13. 29 6月, 2017 1 次提交
  14. 28 6月, 2017 2 次提交
  15. 27 6月, 2017 3 次提交
  16. 26 6月, 2017 3 次提交