1. 09 4月, 2021 1 次提交
    • L
      [NPU] cherry-pick basic NPU components/allocator/operator/executor supports from ascendrc (#32144) · ccf5709d
      Leo Chen 提交于
      * [feature] support npu allocator (#30840)
      
      [feature] support npu allocator
      
      * [feature] support npu operator (#30951)
      
      [feature] support npu operator
      
      * [feature] support npu allocator, part 2 (#30972)
      
      * support npu allocator
      
      * add npu device context
      
      * fix some compile problem
      
      * fix some compile problem
      
      * add npu info
      
      * compile ok
      
      * fix include dir
      
      * support naive_best_fit_allocator
      
      * run ut ok, bug failed to exit
      
      * call aclrtResetDevice before exit
      
      * fix aclFinilize
      
      * add system allocatot test
      
      * add selected_gpus in gtest
      
      * add tensor_test for npu
      
      * support npu op, initial commit
      
      * add npu stream
      
      * add elementwise_add_op
      
      * compile ok
      
      * fix typo
      
      * fix elementwise_add_op_npu_test
      
      * support op run
      
      * test can run but failed
      
      * change aclopExecuteV2 to aclopCompileAndExecute
      
      * support parsing ascend rank table file (#31000)
      
      support parsing ascend rank table file
      
      * Fix reshape on GE graph. (#31084)
      
      Fix reshape on GE graph
      
      * add npu kernel for elementwise_sub and elementwise_sub_grad (#30973)
      
      * add npu sub op
      
      * fix typo
      
      * rename test
      
      * fix bug
      
      * fix bug
      
      * add fp16 kernel
      
      * fix typo
      
      * support sub grad op
      
      * support elementwise_sub_grad op
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      
      * Fix compilation problem (#31100)
      
      Fix compilation problem (#31100)
      
      * fix compile
      
      * fix code stype
      
      * remove const_cast
      
      * support adding correct npu op in pybind.h (#31143)
      
      * support adding correct npu op in pybind.h
      
      * refine code
      
      * [NPU] Support executor with NPU (#31057)
      
      * [NPU] Support executor with NPU
      
      * Fix code according to reviews
      
      * Fix code
      
      * Add unittest for sub op npu
      
      * refactor npu device manager (#31154)
      
      refactor npu device manager (#31154)
      
      * fix selected npus
      
      * fix compile
      
      * fix reading flags from env
      
      * format
      Co-authored-by: Nxiayanming <41795079@qq.com>
      Co-authored-by: Ngongweibao <weibao.gong@gmail.com>
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      Co-authored-by: Nliym27 <33742067+liym27@users.noreply.github.com>
      ccf5709d
  2. 29 3月, 2021 1 次提交
  3. 05 3月, 2021 1 次提交
  4. 26 2月, 2021 1 次提交
  5. 28 9月, 2020 1 次提交
  6. 21 9月, 2020 1 次提交
  7. 31 8月, 2020 1 次提交
    • A
      Add use of global flag 'use_mkldnn' to layer_helper (#26497) · 885c61f0
      arlesniak 提交于
      * get use of global 'use_mkldnn' in layer_helper
      
      * update for CI
      
      * update for CI, relu test
      
      * update for CI, relu test added, make FLAGS_use_mkldnn a public flag
      
      * added more strict tests, fixes after review
      
      * fixes after review
      
      * fixes after review, CI stuff
      885c61f0
  8. 28 8月, 2020 1 次提交
  9. 07 8月, 2020 1 次提交
  10. 09 4月, 2020 1 次提交
    • M
      Remove: NGraph engine from PDPD repository (#23545) · 3baaee9a
      mozga-intel 提交于
      * Remove the NGraph engine from PDPD repository
      1. Each operator was removed from the operator's directory
      2. Each test was removed from the unittest directory
      3. The parallel executor support was removed from the PDPD
      4. The CMake file was removed from the PDPD
      5. The NG flags were removed from the repository
      test=develop
      
      * Remove ngraph from:
      1. Cmake file
      2. Python file
      test=develop
      3baaee9a
  11. 08 4月, 2020 1 次提交
    • G
      Add a function to update FLAGS (#22851) · ca7bd2be
      guofei 提交于
      * Add a function to update FLAGS
      
      test=develop
      
      * Add a function to update FLAGS
      
      test=develop
      
      * expr flags
      
      * Add a function to update FLAGS
      
      test=develop
      
      * distinguish public/private vars, test=develop
      
      * fix windows issues, test=develop
      
      * expr flag
      
      * Add functions to get and set FLAGS
      
      test=develop
      
      * Add functions to get and set FLAGS
      
      test=develop
      
      * Add functions to get and set FLAGS
      
      test=develop
      
      * Add functions to get and set flags
      
      test=develop
      
      * Add functions to get and set FLAGS
      
      test=develop
      
      * Add a function to update FLAGS
      
      test=develop
      
      * Add a function to update FLAGS
      
      test=develop
      
      * Add functions to get and set flags in Paddle
      
      test=develop
      Co-authored-by: Nsneaxiy <sneaxiy@126.com>
      ca7bd2be
  12. 29 3月, 2020 1 次提交
  13. 20 3月, 2020 1 次提交
    • Z
      Add dygraph double grad implementation (#22939) · a31d7328
      Zeng Jinle 提交于
      * add double grad implementation for dygraph, test=develop
      
      * polish code, add uts, test=develop
      
      * fix place bug, test=develop
      
      * polish codes, add more uts for coverages, test=develop
      
      * add no_grad_set, test=develop
      
      * add star gan ut, test=develop
      
      * follow comments, test=develop
      a31d7328
  14. 04 3月, 2020 1 次提交
    • Z
      Add flags to limit gpu memory (#22793) · d41d802b
      Zeng Jinle 提交于
      * add recorded cuda memory apis, fix typo, test=develop
      
      * add more ut, test=develop
      
      * follow comments, test=develop
      
      * fix py35 incompatible issues, test=develop
      d41d802b
  15. 02 3月, 2020 1 次提交
    • Z
      Unmerged fetch list (#22635) · 89cfa491
      Zhen Wang 提交于
      * update ScopeBufferedSSAGraphExecutor&AsyncSSAGraphExecutor&ThreadedSSAGraphExecutor&FastThreadedSSAGraphExecutor&ParallelSSAGraphExecutor&ParallelExecutor for fetching unmerged results.
      
      * add the unit test for fetch_unmerged.
      
      * update ut for multi-card and multi-cpu.
      
      * add the error message and the user suggestion in FetchOpHandle. test=develop
      89cfa491
  16. 19 12月, 2019 1 次提交
  17. 28 11月, 2019 1 次提交
  18. 26 11月, 2019 1 次提交
  19. 25 11月, 2019 1 次提交