1. 08 9月, 2021 1 次提交
    • Z
      Enable program passes on Fleet APIs (#34955) · 5f369881
      Zeng Jinle 提交于
      * add fleet api for program pass
      
      * turn on apply pass for CI test
      
      * fix disable fuse_all_optimizer bug
      
      * try to test ci
      
      * fix CI
      
      * fill unspecified op role
      
      * fix fuse_allreduce
      
      * add ut to improve coverage
      
      * remove useless change
      
      * improve c++ coverage
      
      * follow some comments
      
      * test ir pass pipeline
      
      * update doc
      
      * reduce ut time again
      5f369881
  2. 06 9月, 2021 1 次提交
  3. 02 9月, 2021 1 次提交
  4. 24 8月, 2021 1 次提交
  5. 13 8月, 2021 1 次提交
  6. 30 7月, 2021 1 次提交
  7. 30 4月, 2021 1 次提交
  8. 09 4月, 2021 1 次提交
    • L
      [NPU] cherry-pick basic NPU components/allocator/operator/executor supports from ascendrc (#32144) · ccf5709d
      Leo Chen 提交于
      * [feature] support npu allocator (#30840)
      
      [feature] support npu allocator
      
      * [feature] support npu operator (#30951)
      
      [feature] support npu operator
      
      * [feature] support npu allocator, part 2 (#30972)
      
      * support npu allocator
      
      * add npu device context
      
      * fix some compile problem
      
      * fix some compile problem
      
      * add npu info
      
      * compile ok
      
      * fix include dir
      
      * support naive_best_fit_allocator
      
      * run ut ok, bug failed to exit
      
      * call aclrtResetDevice before exit
      
      * fix aclFinilize
      
      * add system allocatot test
      
      * add selected_gpus in gtest
      
      * add tensor_test for npu
      
      * support npu op, initial commit
      
      * add npu stream
      
      * add elementwise_add_op
      
      * compile ok
      
      * fix typo
      
      * fix elementwise_add_op_npu_test
      
      * support op run
      
      * test can run but failed
      
      * change aclopExecuteV2 to aclopCompileAndExecute
      
      * support parsing ascend rank table file (#31000)
      
      support parsing ascend rank table file
      
      * Fix reshape on GE graph. (#31084)
      
      Fix reshape on GE graph
      
      * add npu kernel for elementwise_sub and elementwise_sub_grad (#30973)
      
      * add npu sub op
      
      * fix typo
      
      * rename test
      
      * fix bug
      
      * fix bug
      
      * add fp16 kernel
      
      * fix typo
      
      * support sub grad op
      
      * support elementwise_sub_grad op
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      
      * Fix compilation problem (#31100)
      
      Fix compilation problem (#31100)
      
      * fix compile
      
      * fix code stype
      
      * remove const_cast
      
      * support adding correct npu op in pybind.h (#31143)
      
      * support adding correct npu op in pybind.h
      
      * refine code
      
      * [NPU] Support executor with NPU (#31057)
      
      * [NPU] Support executor with NPU
      
      * Fix code according to reviews
      
      * Fix code
      
      * Add unittest for sub op npu
      
      * refactor npu device manager (#31154)
      
      refactor npu device manager (#31154)
      
      * fix selected npus
      
      * fix compile
      
      * fix reading flags from env
      
      * format
      Co-authored-by: Nxiayanming <41795079@qq.com>
      Co-authored-by: Ngongweibao <weibao.gong@gmail.com>
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      Co-authored-by: Nliym27 <33742067+liym27@users.noreply.github.com>
      ccf5709d
  9. 29 3月, 2021 1 次提交
  10. 22 2月, 2021 1 次提交
  11. 24 12月, 2020 1 次提交
  12. 25 11月, 2020 1 次提交
  13. 28 9月, 2020 1 次提交
  14. 21 9月, 2020 1 次提交
  15. 28 8月, 2020 1 次提交
  16. 07 8月, 2020 1 次提交
  17. 28 7月, 2020 1 次提交
    • A
      Added DNNL cache management for DyGraph (#25624) · e52df3b1
      arlesniak 提交于
      * Added DNNL cache management for DyGraph
      
      * move FLAGS_use_mkldnn to more general CMakeLists, getu use of the flag in ClearGradients
      
      * missing file
      
      * Fixes after review
      
      * Bringing back original idea of place for 'use_mkldnn' flag to be accessible from platform nad imperative.
      
      * Removed duplicate and added docs
      
      * Fixes for CI
      e52df3b1
  18. 21 4月, 2020 1 次提交
  19. 15 4月, 2020 1 次提交
  20. 04 3月, 2020 1 次提交
    • Z
      Add flags to limit gpu memory (#22793) · d41d802b
      Zeng Jinle 提交于
      * add recorded cuda memory apis, fix typo, test=develop
      
      * add more ut, test=develop
      
      * follow comments, test=develop
      
      * fix py35 incompatible issues, test=develop
      d41d802b
  21. 08 1月, 2020 1 次提交
  22. 06 1月, 2020 2 次提交
  23. 20 10月, 2019 1 次提交
  24. 11 10月, 2019 1 次提交
  25. 23 9月, 2019 1 次提交
  26. 18 9月, 2019 1 次提交
  27. 10 9月, 2019 1 次提交
  28. 16 8月, 2019 1 次提交