1. 18 11月, 2021 1 次提交
  2. 17 11月, 2021 3 次提交
    • L
      [new-exec] Refine standalone executor (#37278) · 6d6642c8
      Leo Chen 提交于
      * init
      
      * add feed ops in python side
      
      * import LRScheduler
      
      * update_feed
      
      * refine code format
      6d6642c8
    • Z
      [heterps]Refactor heterogenous worker (#37244) · 54d2626a
      zmx 提交于
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * refactor heter trainer. test=develop
      
      * fix. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      
      * fix ut. test=develop
      54d2626a
    • L
      [Fleet Executor] Construct runtime graph (#37158) · 0daa69d4
      LiYuRio 提交于
      0daa69d4
  3. 15 11月, 2021 1 次提交
  4. 11 11月, 2021 3 次提交
    • Z
      [Heterps]Refactor Heter Pipeline Parameter Server (#36845) · a2da1efa
      zmx 提交于
      * change username
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * update
      
      * update
      
      * update unittests
      
      * fix
      
      * update
      
      * fix
      
      * update
      
      * fix
      
      * fix
      
      * fix
      
      * update
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update send_and_recv op. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * update. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix. test=develop
      
      * fix ut. test=develop
      
      * fix unit. notest,test=coverage
      
      * fix ut. notest, test=coverage
      
      * update. notest,test=coverage
      
      * fix ut. notest, test=coverage
      
      * fix ut. notest, test=coverage
      
      * fix. notest, test=coverage
      
      * fix. notest, test=coverage
      
      * fix ut. notest, test=coverage
      
      * fix ut. notest, test=coverage
      
      * fix ut. notest, test=coverage
      
      * fix ut. notest, test=coverage
      
      * add func. notest, test=coverage
      
      * fix ut. notest, test=coverage
      
      * fix. test=develop
      
      * fix. test=develop
      a2da1efa
    • L
      Get global cluster information (#37084) · 31673a92
      LiYuRio 提交于
      31673a92
    • W
      fix 2 bug: 1.skip lodtensorarray; 2.delete feed op (#37090) · d5df6bdf
      wanghuancoder 提交于
      * fix 2 bug: 1.skip lodtensorarray; 2.delete feed op, test=develop
      
      * program clone, test=develop
      d5df6bdf
  5. 10 11月, 2021 1 次提交
  6. 09 11月, 2021 1 次提交
  7. 03 11月, 2021 1 次提交
  8. 29 10月, 2021 1 次提交
  9. 20 10月, 2021 1 次提交
    • S
      Add FasterTokenizer Operator (#34491) · 3f2d6a3f
      Steffy-zxf 提交于
      Add Tokenizer related functionalities for Transformer model in order that the process of training and predicting is consistent.
      
      * support the text string as an input Tensor
      * support the "VOCAB"unordered_map<wstring, int> as an input Tensor to lookup tokens
      * Tokenizer used for BERT. This tokenizer applies an end-to-end, text string to wordpiece tokenization.
      * It first applies basic tokenization, followed by wordpiece tokenization.
      3f2d6a3f
  10. 13 10月, 2021 1 次提交
  11. 11 10月, 2021 1 次提交
  12. 08 10月, 2021 1 次提交
  13. 22 9月, 2021 1 次提交
  14. 15 9月, 2021 1 次提交
  15. 14 9月, 2021 1 次提交
    • A
      Intergrate StandaloneExecutor in Static.Executor Interface with... · 4bc08530
      Aurelius84 提交于
      Intergrate StandaloneExecutor in Static.Executor Interface with FLAGS_USE_STANDALONE_EXECUTOR (#35628)
      
      * Intergrate StandaloneExecutor in Static.Executor Interface with FLAGS_USE_STANDALONE_EXECUTOR
      
      * Enhance unittest and clean code in StandaloneExecutor
      
      * polish unittest
      4bc08530
  16. 28 7月, 2021 1 次提交
  17. 27 7月, 2021 1 次提交
  18. 09 7月, 2021 1 次提交
  19. 06 7月, 2021 1 次提交
  20. 05 7月, 2021 1 次提交
  21. 08 5月, 2021 1 次提交
  22. 23 4月, 2021 1 次提交
  23. 15 4月, 2021 1 次提交
    • T
      heterps support pscore (#32093) · 9f8c8f96
      Thunderbrook 提交于
      * pscore support heterps
      
      * fleet cmake
      
      * fleet wrapper
      
      * macro
      
      * solve conflict
      
      * solve conflict
      
      * add unitest
      
      * paddle enforce
      
      * unitest
      
      * unitest
      
      * unitest
      9f8c8f96
  24. 09 4月, 2021 1 次提交
    • L
      [NPU] cherry-pick basic NPU components/allocator/operator/executor supports from ascendrc (#32144) · ccf5709d
      Leo Chen 提交于
      * [feature] support npu allocator (#30840)
      
      [feature] support npu allocator
      
      * [feature] support npu operator (#30951)
      
      [feature] support npu operator
      
      * [feature] support npu allocator, part 2 (#30972)
      
      * support npu allocator
      
      * add npu device context
      
      * fix some compile problem
      
      * fix some compile problem
      
      * add npu info
      
      * compile ok
      
      * fix include dir
      
      * support naive_best_fit_allocator
      
      * run ut ok, bug failed to exit
      
      * call aclrtResetDevice before exit
      
      * fix aclFinilize
      
      * add system allocatot test
      
      * add selected_gpus in gtest
      
      * add tensor_test for npu
      
      * support npu op, initial commit
      
      * add npu stream
      
      * add elementwise_add_op
      
      * compile ok
      
      * fix typo
      
      * fix elementwise_add_op_npu_test
      
      * support op run
      
      * test can run but failed
      
      * change aclopExecuteV2 to aclopCompileAndExecute
      
      * support parsing ascend rank table file (#31000)
      
      support parsing ascend rank table file
      
      * Fix reshape on GE graph. (#31084)
      
      Fix reshape on GE graph
      
      * add npu kernel for elementwise_sub and elementwise_sub_grad (#30973)
      
      * add npu sub op
      
      * fix typo
      
      * rename test
      
      * fix bug
      
      * fix bug
      
      * add fp16 kernel
      
      * fix typo
      
      * support sub grad op
      
      * support elementwise_sub_grad op
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      
      * Fix compilation problem (#31100)
      
      Fix compilation problem (#31100)
      
      * fix compile
      
      * fix code stype
      
      * remove const_cast
      
      * support adding correct npu op in pybind.h (#31143)
      
      * support adding correct npu op in pybind.h
      
      * refine code
      
      * [NPU] Support executor with NPU (#31057)
      
      * [NPU] Support executor with NPU
      
      * Fix code according to reviews
      
      * Fix code
      
      * Add unittest for sub op npu
      
      * refactor npu device manager (#31154)
      
      refactor npu device manager (#31154)
      
      * fix selected npus
      
      * fix compile
      
      * fix reading flags from env
      
      * format
      Co-authored-by: Nxiayanming <41795079@qq.com>
      Co-authored-by: Ngongweibao <weibao.gong@gmail.com>
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      Co-authored-by: Nliym27 <33742067+liym27@users.noreply.github.com>
      ccf5709d
  25. 26 3月, 2021 1 次提交
  26. 07 1月, 2021 1 次提交
  27. 23 12月, 2020 1 次提交
  28. 23 11月, 2020 2 次提交
  29. 30 10月, 2020 1 次提交
  30. 14 10月, 2020 1 次提交
  31. 13 10月, 2020 1 次提交
  32. 10 10月, 2020 1 次提交
  33. 30 9月, 2020 1 次提交
  34. 25 9月, 2020 1 次提交
    • T
      add xpu in heter mode (#27000) · 6f69a4cb
      Thunderbrook 提交于
      * add xpu in heter mode
      test=develop
      
      * BOOST_CONST_GET; PADDLE_THROW
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * refine
      test=develop
      
      * refine
      test=develop
      
      * refine
      test=develop
      
      * refine code
      test=develop
      6f69a4cb
  35. 25 8月, 2020 1 次提交
    • W
      optimized transformation form tensor to numpy (#26447) · c1f5df52
      wanghuancoder 提交于
      * optimized transformation form tensor to numpy, test=develop
      
      * optimized transformation form tensor to numpy, pass pre-commit, test=develop
      
      * modify fetchophandle zerocopy to deepcopy in PE&CUP, test=develop
      
      * modify py:array construct, test=develop
      
      * fix _fetch_var to use deep copy, test=develop
      c1f5df52