1. 09 4月, 2021 1 次提交
    • L
      [NPU] cherry-pick basic NPU components/allocator/operator/executor supports from ascendrc (#32144) · ccf5709d
      Leo Chen 提交于
      * [feature] support npu allocator (#30840)
      
      [feature] support npu allocator
      
      * [feature] support npu operator (#30951)
      
      [feature] support npu operator
      
      * [feature] support npu allocator, part 2 (#30972)
      
      * support npu allocator
      
      * add npu device context
      
      * fix some compile problem
      
      * fix some compile problem
      
      * add npu info
      
      * compile ok
      
      * fix include dir
      
      * support naive_best_fit_allocator
      
      * run ut ok, bug failed to exit
      
      * call aclrtResetDevice before exit
      
      * fix aclFinilize
      
      * add system allocatot test
      
      * add selected_gpus in gtest
      
      * add tensor_test for npu
      
      * support npu op, initial commit
      
      * add npu stream
      
      * add elementwise_add_op
      
      * compile ok
      
      * fix typo
      
      * fix elementwise_add_op_npu_test
      
      * support op run
      
      * test can run but failed
      
      * change aclopExecuteV2 to aclopCompileAndExecute
      
      * support parsing ascend rank table file (#31000)
      
      support parsing ascend rank table file
      
      * Fix reshape on GE graph. (#31084)
      
      Fix reshape on GE graph
      
      * add npu kernel for elementwise_sub and elementwise_sub_grad (#30973)
      
      * add npu sub op
      
      * fix typo
      
      * rename test
      
      * fix bug
      
      * fix bug
      
      * add fp16 kernel
      
      * fix typo
      
      * support sub grad op
      
      * support elementwise_sub_grad op
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      
      * Fix compilation problem (#31100)
      
      Fix compilation problem (#31100)
      
      * fix compile
      
      * fix code stype
      
      * remove const_cast
      
      * support adding correct npu op in pybind.h (#31143)
      
      * support adding correct npu op in pybind.h
      
      * refine code
      
      * [NPU] Support executor with NPU (#31057)
      
      * [NPU] Support executor with NPU
      
      * Fix code according to reviews
      
      * Fix code
      
      * Add unittest for sub op npu
      
      * refactor npu device manager (#31154)
      
      refactor npu device manager (#31154)
      
      * fix selected npus
      
      * fix compile
      
      * fix reading flags from env
      
      * format
      Co-authored-by: Nxiayanming <41795079@qq.com>
      Co-authored-by: Ngongweibao <weibao.gong@gmail.com>
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      Co-authored-by: Nliym27 <33742067+liym27@users.noreply.github.com>
      ccf5709d
  2. 26 3月, 2021 1 次提交
  3. 07 1月, 2021 1 次提交
  4. 23 12月, 2020 1 次提交
  5. 23 11月, 2020 2 次提交
  6. 30 10月, 2020 1 次提交
  7. 14 10月, 2020 1 次提交
  8. 13 10月, 2020 1 次提交
  9. 10 10月, 2020 1 次提交
  10. 30 9月, 2020 1 次提交
  11. 25 9月, 2020 1 次提交
    • T
      add xpu in heter mode (#27000) · 6f69a4cb
      Thunderbrook 提交于
      * add xpu in heter mode
      test=develop
      
      * BOOST_CONST_GET; PADDLE_THROW
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * refine
      test=develop
      
      * refine
      test=develop
      
      * refine
      test=develop
      
      * refine code
      test=develop
      6f69a4cb
  12. 25 8月, 2020 1 次提交
    • W
      optimized transformation form tensor to numpy (#26447) · c1f5df52
      wanghuancoder 提交于
      * optimized transformation form tensor to numpy, test=develop
      
      * optimized transformation form tensor to numpy, pass pre-commit, test=develop
      
      * modify fetchophandle zerocopy to deepcopy in PE&CUP, test=develop
      
      * modify py:array construct, test=develop
      
      * fix _fetch_var to use deep copy, test=develop
      c1f5df52
  13. 24 8月, 2020 1 次提交
  14. 19 8月, 2020 1 次提交
  15. 16 8月, 2020 1 次提交
  16. 08 8月, 2020 1 次提交
  17. 06 8月, 2020 1 次提交
    • T
      add heter ps mode (#25682) · 0cb60c70
      Thunderbrook 提交于
      * add heter ps mode
      
      * code style
      test=develop
      
      * add with_pslib
      test=develop
      
      * unitest
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * test monitor
      test=develop
      
      * prepare trainer
      test=develop
      
      * code style
      test=develop
      0cb60c70
  18. 04 8月, 2020 1 次提交
  19. 03 8月, 2020 1 次提交
  20. 31 7月, 2020 1 次提交
  21. 24 7月, 2020 1 次提交
  22. 08 7月, 2020 1 次提交
    • L
      refine as_lodtensor, test=develop (#25286) · 91f7b4e0
      Leo Chen 提交于
      * refine as_lodtensor, test=develop
      
      * fix test, test=develop
      
      * add unittest, test=develop
      
      * handle nested_list, test=develop
      
      * handle nested_list, test=develop
      91f7b4e0
  23. 14 5月, 2020 1 次提交
  24. 29 4月, 2020 1 次提交
  25. 19 4月, 2020 1 次提交
    • G
      Support LoDTensorArray in fetch (#23645) · 2b896c1f
      guofei 提交于
      * Support LoDTEnsorArray in fetch op
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      
      * Support LoDTensorArray in fetch
      
      test=develop
      2b896c1f
  26. 17 4月, 2020 1 次提交
  27. 09 4月, 2020 1 次提交
  28. 08 4月, 2020 1 次提交
    • G
      Make the place parameter default (#23094) · cb1a2512
      guofei 提交于
      * Make the place parameter default
      
      test=develop
      
      * Make the place parameter default
      
      test=develop
      
      * Make the place parameter default
      
      test=develop
      
      * Make the place parameter default
      
      test=develop
      
      * Make the place parameter default
      
      test=develop
      
      * Make the place parameter default
      
      test=develop
      
      * Make the place parameter default
      
      test=develop
      
      * Make the place parameter default
      
      test=develop
      cb1a2512
  29. 06 4月, 2020 1 次提交
    • C
      Implement StaticModelRunner to support dygraph fine-tune static graph pre-training model (#23171) · 75bd3507
      Chen Weihang 提交于
      * static model runner basic implement, test=develop
      
      * add run program op to execute loaded program, test=develop
      
      * refactor static model runner & run program op, test=develop
      
      * reset engine.cc to resolve conflict
      
      * adapt the change of dygraph double grad, test=develop
      
      * refactor impl to solve control flow error, test=develop
      
      * clear debug code, test=develop
      
      * fix ci str compatible error & checkout dygraph grad maker & add example, test=develop
      
      * hide api & add op test, test=develop
      
      * fix run program op test places error, test=develop
      
      * fix program by review comment, test=develop
      
      * delete change var desc name, test=develop
      
      * fix other program by review comment, test=develop
      
      * remove _static_graph_guard, test=develop
      
      * add selectedrows test, test=develop
      
      * remove desc parser, test=develop
      
      * fix detail program, test=develop
      
      * change socpe create & add test, test=develop
      75bd3507
  30. 03 4月, 2020 1 次提交
    • L
      [feature] prune program by feed and fetch_list automatically (#22474) · a62599a8
      Leo Chen 提交于
      * prune train program by fetch_list, test=develop
      
      * add unittest for prune, test=develop
      
      * fix pruned feed, test=develop
      
      * support ParallelExecutor and feed prune, test=develop
      
      * add comments, test=develop
      
      * update unittest, test=develop
      
      * update unittests, test=develop
      
      * remove debug code, test=develop
      
      * support cond in clone, test=develop
      
      * support cond in prune, test=develop
      
      * support multiple minimize, test=develop
      
      * support cache, test=develop
      
      * fix _copy_param_info_from, test=develop
      
      * support python2 str, test=develop
      
      * remove debug code, test=develop
      
      * fix bug of caching CompiledProgram, test=develop
      
      * fix multi_device issue, test=develop
      
      * tmp
      
      * support tuple in fetch_list and overriding use_prune, test=develop
      
      * dont use nonlocal in python2, test=develop
      
      * remove nonlocal, test=develop
      
      * code clean, test=develop
      
      * code clean, test=develop
      
      * feed list, test=develop
      
      * test adam, test=develop
      
      * follow comments, test=develop
      
      * reduce duplicate code, test=develop
      
      * update comments, test=develop
      a62599a8
  31. 26 3月, 2020 1 次提交
  32. 20 3月, 2020 1 次提交
    • Z
      Reader sequential and inference partial feed (#22699) · acfc9b8a
      Zeng Jinle 提交于
      * sequential reader stage 1, test=develop
      
      * fix ut, test=develop
      
      * fix iterable=False reset bug, add some logs and polish code, test=develop
      
      * inference feed partial data, test=develop
      
      * Turn on keep_order=True for test, test=develop
      
      * enhance ut to test more cases, test=develop
      
      * test commit for reverting
      
      * Revert "test commit for reverting", test=develop
      
      This reverts commit 80aef42e.
      
      * add ut of merged and unmerged results, test=develop
      
      * add more uts for coverages and add en doc of api, test=develop
      
      * follow comments, test=develop
      
      * change note style, test=develop
      acfc9b8a
  33. 02 3月, 2020 1 次提交
    • Z
      Unmerged fetch list (#22635) · 89cfa491
      Zhen Wang 提交于
      * update ScopeBufferedSSAGraphExecutor&AsyncSSAGraphExecutor&ThreadedSSAGraphExecutor&FastThreadedSSAGraphExecutor&ParallelSSAGraphExecutor&ParallelExecutor for fetching unmerged results.
      
      * add the unit test for fetch_unmerged.
      
      * update ut for multi-card and multi-cpu.
      
      * add the error message and the user suggestion in FetchOpHandle. test=develop
      89cfa491
  34. 28 2月, 2020 1 次提交
  35. 23 2月, 2020 1 次提交
  36. 17 2月, 2020 1 次提交
  37. 17 1月, 2020 1 次提交
  38. 10 1月, 2020 1 次提交
  39. 06 1月, 2020 1 次提交