1. 18 9月, 2021 1 次提交
    • Z
      Change __init__.py to adapt the new FLAGS coding style and update CI to... · 74f38d63
      Zeng Jinle 提交于
      Change __init__.py to adapt the new FLAGS coding style and update CI to monitor FLAGS changing (#35849)
      
      * change __init__.py to adapt new FLAGS
      
      * test ci check, ready for revert
      
      * split __init__.py and FLAGS approval
      
      * Revert "test ci check, ready for revert"
      
      This reverts commit bbbd2442fe3e948fef790ec634085a2431474326.
      74f38d63
  2. 15 9月, 2021 1 次提交
  3. 08 9月, 2021 1 次提交
    • Z
      Enable program passes on Fleet APIs (#34955) · 5f369881
      Zeng Jinle 提交于
      * add fleet api for program pass
      
      * turn on apply pass for CI test
      
      * fix disable fuse_all_optimizer bug
      
      * try to test ci
      
      * fix CI
      
      * fill unspecified op role
      
      * fix fuse_allreduce
      
      * add ut to improve coverage
      
      * remove useless change
      
      * improve c++ coverage
      
      * follow some comments
      
      * test ir pass pipeline
      
      * update doc
      
      * reduce ut time again
      5f369881
  4. 02 9月, 2021 1 次提交
  5. 24 8月, 2021 1 次提交
  6. 13 8月, 2021 1 次提交
  7. 02 8月, 2021 1 次提交
  8. 30 7月, 2021 1 次提交
  9. 28 7月, 2021 1 次提交
  10. 15 7月, 2021 1 次提交
  11. 29 6月, 2021 1 次提交
  12. 16 6月, 2021 1 次提交
  13. 22 4月, 2021 1 次提交
  14. 21 4月, 2021 1 次提交
  15. 09 4月, 2021 1 次提交
    • L
      [NPU] cherry-pick basic NPU components/allocator/operator/executor supports from ascendrc (#32144) · ccf5709d
      Leo Chen 提交于
      * [feature] support npu allocator (#30840)
      
      [feature] support npu allocator
      
      * [feature] support npu operator (#30951)
      
      [feature] support npu operator
      
      * [feature] support npu allocator, part 2 (#30972)
      
      * support npu allocator
      
      * add npu device context
      
      * fix some compile problem
      
      * fix some compile problem
      
      * add npu info
      
      * compile ok
      
      * fix include dir
      
      * support naive_best_fit_allocator
      
      * run ut ok, bug failed to exit
      
      * call aclrtResetDevice before exit
      
      * fix aclFinilize
      
      * add system allocatot test
      
      * add selected_gpus in gtest
      
      * add tensor_test for npu
      
      * support npu op, initial commit
      
      * add npu stream
      
      * add elementwise_add_op
      
      * compile ok
      
      * fix typo
      
      * fix elementwise_add_op_npu_test
      
      * support op run
      
      * test can run but failed
      
      * change aclopExecuteV2 to aclopCompileAndExecute
      
      * support parsing ascend rank table file (#31000)
      
      support parsing ascend rank table file
      
      * Fix reshape on GE graph. (#31084)
      
      Fix reshape on GE graph
      
      * add npu kernel for elementwise_sub and elementwise_sub_grad (#30973)
      
      * add npu sub op
      
      * fix typo
      
      * rename test
      
      * fix bug
      
      * fix bug
      
      * add fp16 kernel
      
      * fix typo
      
      * support sub grad op
      
      * support elementwise_sub_grad op
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      
      * Fix compilation problem (#31100)
      
      Fix compilation problem (#31100)
      
      * fix compile
      
      * fix code stype
      
      * remove const_cast
      
      * support adding correct npu op in pybind.h (#31143)
      
      * support adding correct npu op in pybind.h
      
      * refine code
      
      * [NPU] Support executor with NPU (#31057)
      
      * [NPU] Support executor with NPU
      
      * Fix code according to reviews
      
      * Fix code
      
      * Add unittest for sub op npu
      
      * refactor npu device manager (#31154)
      
      refactor npu device manager (#31154)
      
      * fix selected npus
      
      * fix compile
      
      * fix reading flags from env
      
      * format
      Co-authored-by: Nxiayanming <41795079@qq.com>
      Co-authored-by: Ngongweibao <weibao.gong@gmail.com>
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      Co-authored-by: Nliym27 <33742067+liym27@users.noreply.github.com>
      ccf5709d
  16. 29 3月, 2021 1 次提交
  17. 23 3月, 2021 1 次提交
  18. 24 12月, 2020 1 次提交
  19. 20 11月, 2020 1 次提交
  20. 30 10月, 2020 1 次提交
  21. 28 9月, 2020 2 次提交
  22. 21 9月, 2020 1 次提交
  23. 28 8月, 2020 1 次提交
  24. 21 8月, 2020 1 次提交
    • Q
      support Baidu Kunlun AI Accelerator (#25959) · 138ecf24
      QingshuChen 提交于
      * support Baidu AI Accelerator
        * test=kunlun
      
      * minor
       * test=kunlun
      
      * support xpu op in separate file
       * test=kunlun
      
      * update XPU error message and remove duplicated code
      
       * test=kunlun
      
      * minor
       * test=kunlun
      
      * minor
       * test=kunlun
      138ecf24
  25. 18 8月, 2020 1 次提交
  26. 07 8月, 2020 1 次提交
  27. 18 6月, 2020 1 次提交
    • Z
      add new API: set_global_initializer (#24378) · 542a226c
      Zhou Wei 提交于
      * add new api (set_global_initializer/reset_global_initializer),test=develop
      
      * add new api (set_global_initializer/reset_global_initializer),test=develop
      
      * fix doc and example code of set_global_initializer,test=develop
      542a226c
  28. 13 5月, 2020 1 次提交
  29. 20 4月, 2020 1 次提交
  30. 09 4月, 2020 1 次提交
    • M
      Remove: NGraph engine from PDPD repository (#23545) · 3baaee9a
      mozga-intel 提交于
      * Remove the NGraph engine from PDPD repository
      1. Each operator was removed from the operator's directory
      2. Each test was removed from the unittest directory
      3. The parallel executor support was removed from the PDPD
      4. The CMake file was removed from the PDPD
      5. The NG flags were removed from the repository
      test=develop
      
      * Remove ngraph from:
      1. Cmake file
      2. Python file
      test=develop
      3baaee9a
  31. 04 4月, 2020 1 次提交
    • L
      Dev/fix init flags (#23465) · f297a332
      Leo Chen 提交于
      * fix init_gflags with 'python -c', test=develop
      
      * add test, test=develop
      
      * use sys.executable instead of python, test=develop
      
      * keep dummy, test=develop
      f297a332
  32. 02 4月, 2020 1 次提交
  33. 04 3月, 2020 1 次提交
    • Z
      Add flags to limit gpu memory (#22793) · d41d802b
      Zeng Jinle 提交于
      * add recorded cuda memory apis, fix typo, test=develop
      
      * add more ut, test=develop
      
      * follow comments, test=develop
      
      * fix py35 incompatible issues, test=develop
      d41d802b
  34. 03 3月, 2020 1 次提交
    • S
      Add functional dygraph mode api (#22745) · df87e79f
      songyouwei 提交于
      * functional dygraph enable/disable
      test=develop
      
      * use context manager instead
      test=develop
      
      * refine sample code
      test=develop
      
      * rename api & expose to fluid
      test=develop
      
      * fix sample code
      test=develop
      df87e79f
  35. 17 1月, 2020 1 次提交
  36. 19 12月, 2019 1 次提交
  37. 05 12月, 2019 1 次提交
    • L
      Split VarBase from Python Variable for Dygraph (#21359) · cdd46d7e
      Leo Chen 提交于
      * test=develop, fix docker with paddle nccl problem
      
      * don't expose numerous Tensor.set(), test=develop
      
      * fix condition, test=develop
      
      * fix float16 bug, test=develop
      
      * feed should be Tensor or np.array, not Variable or number, test=develop
      
      * use forcecast to copy numpy slice to new array, test=develop
      
      * remove float16-uint16 hacking, test=develop
      
      * add variable method to varbase and refactor to_variable to support return varbase
      
      * support kwargs in varbase constructor
      
      * add VarBase constructor to support default python args
      
      * refine varbase initial method
      
      * reset branch
      
      * fix ut for change VarBase error info to PaddleEnforce
      
      * cherry is parameter change before
      
      * overload isinstance to replace too many change of is_variable
      
      * rm useless files
      
      * rm useless code merged by git
      
      * test=develop, fix some ut failed error
      
      * test=develop, fix test_graph_wrapper
      
      * add some tests, test=develop
      
      * refine __getitem__, test=develop
      
      * add tests, test=develop
      
      * fix err_msg, test=develop
      cdd46d7e
  38. 02 12月, 2019 1 次提交
  39. 29 11月, 2019 1 次提交
    • L
      add unused input vars check for OpWithKernel, test=develop (#21169) · e0c9d856
      Leo Chen 提交于
      * add unused input vars check for OpWithKernel, test=develop
      
      * remove unused vars in some ops, test=develop
      
      * fix batch_norm, test=develop
      
      * add white list, test=develop
      
      * add CI check for white list, test=develop
      
      * :ove white list to c++, test=develop
      
      * solve failure of CI, test=develop
      
      * add unittest for unused_var_check, test=develop
      
      * refine code, enable check in operator_test, test=develop
      
      * skip mkldnn, test=develop
      
      * extend white list, test=develop
      
      * refine condition of mkldnn, test=develop
      
      * fix paddle_build, test=develop
      
      * follow comments, test=develop
      
      * fix GetExpectedKernelType
      
      * add wiki ref to err_msg, test=develop
      
      * follow comment, test=develop
      e0c9d856