1. 21 4月, 2021 1 次提交
  2. 09 4月, 2021 1 次提交
    • L
      [NPU] cherry-pick basic NPU components/allocator/operator/executor supports from ascendrc (#32144) · ccf5709d
      Leo Chen 提交于
      * [feature] support npu allocator (#30840)
      
      [feature] support npu allocator
      
      * [feature] support npu operator (#30951)
      
      [feature] support npu operator
      
      * [feature] support npu allocator, part 2 (#30972)
      
      * support npu allocator
      
      * add npu device context
      
      * fix some compile problem
      
      * fix some compile problem
      
      * add npu info
      
      * compile ok
      
      * fix include dir
      
      * support naive_best_fit_allocator
      
      * run ut ok, bug failed to exit
      
      * call aclrtResetDevice before exit
      
      * fix aclFinilize
      
      * add system allocatot test
      
      * add selected_gpus in gtest
      
      * add tensor_test for npu
      
      * support npu op, initial commit
      
      * add npu stream
      
      * add elementwise_add_op
      
      * compile ok
      
      * fix typo
      
      * fix elementwise_add_op_npu_test
      
      * support op run
      
      * test can run but failed
      
      * change aclopExecuteV2 to aclopCompileAndExecute
      
      * support parsing ascend rank table file (#31000)
      
      support parsing ascend rank table file
      
      * Fix reshape on GE graph. (#31084)
      
      Fix reshape on GE graph
      
      * add npu kernel for elementwise_sub and elementwise_sub_grad (#30973)
      
      * add npu sub op
      
      * fix typo
      
      * rename test
      
      * fix bug
      
      * fix bug
      
      * add fp16 kernel
      
      * fix typo
      
      * support sub grad op
      
      * support elementwise_sub_grad op
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      
      * Fix compilation problem (#31100)
      
      Fix compilation problem (#31100)
      
      * fix compile
      
      * fix code stype
      
      * remove const_cast
      
      * support adding correct npu op in pybind.h (#31143)
      
      * support adding correct npu op in pybind.h
      
      * refine code
      
      * [NPU] Support executor with NPU (#31057)
      
      * [NPU] Support executor with NPU
      
      * Fix code according to reviews
      
      * Fix code
      
      * Add unittest for sub op npu
      
      * refactor npu device manager (#31154)
      
      refactor npu device manager (#31154)
      
      * fix selected npus
      
      * fix compile
      
      * fix reading flags from env
      
      * format
      Co-authored-by: Nxiayanming <41795079@qq.com>
      Co-authored-by: Ngongweibao <weibao.gong@gmail.com>
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      Co-authored-by: Nliym27 <33742067+liym27@users.noreply.github.com>
      ccf5709d
  3. 29 3月, 2021 1 次提交
  4. 23 3月, 2021 1 次提交
  5. 24 12月, 2020 1 次提交
  6. 20 11月, 2020 1 次提交
  7. 30 10月, 2020 1 次提交
  8. 28 9月, 2020 2 次提交
  9. 21 9月, 2020 1 次提交
  10. 28 8月, 2020 1 次提交
  11. 21 8月, 2020 1 次提交
    • Q
      support Baidu Kunlun AI Accelerator (#25959) · 138ecf24
      QingshuChen 提交于
      * support Baidu AI Accelerator
        * test=kunlun
      
      * minor
       * test=kunlun
      
      * support xpu op in separate file
       * test=kunlun
      
      * update XPU error message and remove duplicated code
      
       * test=kunlun
      
      * minor
       * test=kunlun
      
      * minor
       * test=kunlun
      138ecf24
  12. 18 8月, 2020 1 次提交
  13. 07 8月, 2020 1 次提交
  14. 18 6月, 2020 1 次提交
    • Z
      add new API: set_global_initializer (#24378) · 542a226c
      Zhou Wei 提交于
      * add new api (set_global_initializer/reset_global_initializer),test=develop
      
      * add new api (set_global_initializer/reset_global_initializer),test=develop
      
      * fix doc and example code of set_global_initializer,test=develop
      542a226c
  15. 13 5月, 2020 1 次提交
  16. 20 4月, 2020 1 次提交
  17. 09 4月, 2020 1 次提交
    • M
      Remove: NGraph engine from PDPD repository (#23545) · 3baaee9a
      mozga-intel 提交于
      * Remove the NGraph engine from PDPD repository
      1. Each operator was removed from the operator's directory
      2. Each test was removed from the unittest directory
      3. The parallel executor support was removed from the PDPD
      4. The CMake file was removed from the PDPD
      5. The NG flags were removed from the repository
      test=develop
      
      * Remove ngraph from:
      1. Cmake file
      2. Python file
      test=develop
      3baaee9a
  18. 04 4月, 2020 1 次提交
    • L
      Dev/fix init flags (#23465) · f297a332
      Leo Chen 提交于
      * fix init_gflags with 'python -c', test=develop
      
      * add test, test=develop
      
      * use sys.executable instead of python, test=develop
      
      * keep dummy, test=develop
      f297a332
  19. 02 4月, 2020 1 次提交
  20. 04 3月, 2020 1 次提交
    • Z
      Add flags to limit gpu memory (#22793) · d41d802b
      Zeng Jinle 提交于
      * add recorded cuda memory apis, fix typo, test=develop
      
      * add more ut, test=develop
      
      * follow comments, test=develop
      
      * fix py35 incompatible issues, test=develop
      d41d802b
  21. 03 3月, 2020 1 次提交
    • S
      Add functional dygraph mode api (#22745) · df87e79f
      songyouwei 提交于
      * functional dygraph enable/disable
      test=develop
      
      * use context manager instead
      test=develop
      
      * refine sample code
      test=develop
      
      * rename api & expose to fluid
      test=develop
      
      * fix sample code
      test=develop
      df87e79f
  22. 17 1月, 2020 1 次提交
  23. 19 12月, 2019 1 次提交
  24. 05 12月, 2019 1 次提交
    • L
      Split VarBase from Python Variable for Dygraph (#21359) · cdd46d7e
      Leo Chen 提交于
      * test=develop, fix docker with paddle nccl problem
      
      * don't expose numerous Tensor.set(), test=develop
      
      * fix condition, test=develop
      
      * fix float16 bug, test=develop
      
      * feed should be Tensor or np.array, not Variable or number, test=develop
      
      * use forcecast to copy numpy slice to new array, test=develop
      
      * remove float16-uint16 hacking, test=develop
      
      * add variable method to varbase and refactor to_variable to support return varbase
      
      * support kwargs in varbase constructor
      
      * add VarBase constructor to support default python args
      
      * refine varbase initial method
      
      * reset branch
      
      * fix ut for change VarBase error info to PaddleEnforce
      
      * cherry is parameter change before
      
      * overload isinstance to replace too many change of is_variable
      
      * rm useless files
      
      * rm useless code merged by git
      
      * test=develop, fix some ut failed error
      
      * test=develop, fix test_graph_wrapper
      
      * add some tests, test=develop
      
      * refine __getitem__, test=develop
      
      * add tests, test=develop
      
      * fix err_msg, test=develop
      cdd46d7e
  25. 02 12月, 2019 1 次提交
  26. 29 11月, 2019 1 次提交
    • L
      add unused input vars check for OpWithKernel, test=develop (#21169) · e0c9d856
      Leo Chen 提交于
      * add unused input vars check for OpWithKernel, test=develop
      
      * remove unused vars in some ops, test=develop
      
      * fix batch_norm, test=develop
      
      * add white list, test=develop
      
      * add CI check for white list, test=develop
      
      * :ove white list to c++, test=develop
      
      * solve failure of CI, test=develop
      
      * add unittest for unused_var_check, test=develop
      
      * refine code, enable check in operator_test, test=develop
      
      * skip mkldnn, test=develop
      
      * extend white list, test=develop
      
      * refine condition of mkldnn, test=develop
      
      * fix paddle_build, test=develop
      
      * follow comments, test=develop
      
      * fix GetExpectedKernelType
      
      * add wiki ref to err_msg, test=develop
      
      * follow comment, test=develop
      e0c9d856
  27. 28 11月, 2019 1 次提交
  28. 29 10月, 2019 1 次提交
  29. 20 10月, 2019 1 次提交
  30. 16 10月, 2019 1 次提交
  31. 10 10月, 2019 1 次提交
    • H
      New save load interface (#20148) · fa43e80e
      hong 提交于
      * add new save load interface; test=develop
      
      * add new save interface; test=develop
      
      * add save load interface ;
      
      * fix save load error;
      
      * fix dygraph set dict bug;
      
      * add save load unit test; test=develop
      
      * fix test_imperative_optimizer bug; test=develop
      
      * fix unitest optimizer bug; test=develop
      
      * fix code coverage; test=develop
      
      * fix converage; test=develop
      
      * add document for apis; test=develop
      
      * fix unitest error; test=develop
      
      * fix save load unit test error; test=develop
      
      * fix error message; test=develop
      
      * change set_parameter set_optimizer to save_dygraph; test=develop
      
      * add load_graph check; test=develop
      
      * fix api spec; test=develop
      fa43e80e
  32. 07 10月, 2019 1 次提交
  33. 30 9月, 2019 1 次提交
  34. 26 9月, 2019 1 次提交
    • H
      Add new data layer (#19916) · 88af4ab6
      Huihuang Zheng 提交于
      The new "fluid.data" changes old "fluid.layers.data":
      
      1. Add shape and dtype check.
      2. Remove "append_batch_size" parameter. We won't offer this in the new data layer because other deep learning platforms don't have this kind of data layer pre-processing. It may confuse users.
      3. Remove "stop gradient" parameter because the data layer doesn't do back-propagation
      
      TODO:
      Now data layer feeded by executor is checked, will we want to check the feed data of readers in the future?
      88af4ab6
  35. 24 9月, 2019 1 次提交
    • A
      Remove constraint that last dimension is forced to be 1 by adding lookup_table_v2 (#19735) · 039b9710
      Aurelius84 提交于
      * Remove constraint that last dimension is forced to be 1 by add
      lookup_table_v2 test=develop
      
      * modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop
      
      * Revert "modify into PADDLE_ENFORCE_CUDA_SUCCESS test=develop"
      
      This reverts commit 8a960bfc61e51aa27c3c529df8fb90b93ebd19f9.
      
      * move api into fluid.embedding test=develop
      
      * fix example code test=develop
      
      * move one_hot into fluid.one_hot
      
      * modify api.spec test=develop
      
      * fix loss shape test=develop
      039b9710
  36. 23 9月, 2019 1 次提交
  37. 18 9月, 2019 2 次提交
  38. 12 9月, 2019 1 次提交