1. 25 May, 2022 1 commit
    • Dynamic graph support to Automatic SParsity. (#41177) · e5fc68b2
      Ming-Xu Huang committed
      * Dynamic graph support to Automatic SParsity.
      
      1. Added dynamic support to ASP module (paddle.fluid.contrib.sparsity).
      2. Added ASP-related unit tests for the above changes.
      3. Put ASP module under paddle.static for now, waiting for APIs confirmation from Paddle.
      
      * Modified documents of functions to have correct examples.
      
      * Update in_dygraph_mode to paddle.in_dynamic_mode()
      
      * Modified documents of functions and added comments
      
      * Minor changes.
      
      * Fix example errors in asp API.
      
      * Code Change for Review
      
      1. Added more examples in documents.
      2. Changed test_asp_pruning_static.
      
      * Minor changes
      
      * Update ASP function documents.
      
      * Update ASP function documents.
      
      * Reduce test case size of asp pruning due to CI time limit.
      
      * Update time limitation to some asp UTs.
      
      * Fix sample code errors.
      
      * Fix sample code errors.
      
      * Fix sample code errors.
      
      * Update time limitation to parts of ASP UTs.
      
      * Update UTs to fit with CI.
      
      * Reduce problem size in python/paddle/fluid/tests/unittests/asp/test_fleet_with_asp_dynamic.py
      
      * Added paddle.asp
      
      * Fixed type casting error of OpRole.Optimize in new dygraph mode.
      
      * Made set_excluded_layers be compatible with 2.2
      
      * Fix example code of calculate_density.
      
      * Update code examples.
      
      * Move paddle.asp to paddle.incubate.asp
      
      * Fixed an example error of calculate_density
  2. 21 February, 2022 1 commit
  3. 20 January, 2022 1 commit
  4. 14 July, 2021 1 commit
  5. 08 July, 2021 1 commit
  6. 10 June, 2021 1 commit
  7. 09 April, 2021 1 commit
    • [NPU] cherry-pick basic NPU components/allocator/operator/executor supports from ascendrc (#32144) · ccf5709d
      Leo Chen committed
      * [feature] support npu allocator (#30840)
      
      [feature] support npu allocator
      
      * [feature] support npu operator (#30951)
      
      [feature] support npu operator
      
      * [feature] support npu allocator, part 2 (#30972)
      
      * support npu allocator
      
      * add npu device context
      
      * fix some compile problem
      
      * fix some compile problem
      
      * add npu info
      
      * compile ok
      
      * fix include dir
      
      * support naive_best_fit_allocator
      
      * run ut ok, but failed to exit
      
      * call aclrtResetDevice before exit
      
      * fix aclFinalize
      
      * add system allocator test
      
      * add selected_gpus in gtest
      
      * add tensor_test for npu
      
      * support npu op, initial commit
      
      * add npu stream
      
      * add elementwise_add_op
      
      * compile ok
      
      * fix typo
      
      * fix elementwise_add_op_npu_test
      
      * support op run
      
      * test can run but failed
      
      * change aclopExecuteV2 to aclopCompileAndExecute
      
      * support parsing ascend rank table file (#31000)
      
      support parsing ascend rank table file
      
      * Fix reshape on GE graph. (#31084)
      
      Fix reshape on GE graph
      
      * add npu kernel for elementwise_sub and elementwise_sub_grad (#30973)
      
      * add npu sub op
      
      * fix typo
      
      * rename test
      
      * fix bug
      
      * fix bug
      
      * add fp16 kernel
      
      * fix typo
      
      * support sub grad op
      
      * support elementwise_sub_grad op
      Co-authored-by: frankwhzhang <frankwhzhang@126.com>
      
      * Fix compilation problem (#31100)
      
      Fix compilation problem (#31100)
      
      * fix compile
      
      * fix code style
      
      * remove const_cast
      
      * support adding correct npu op in pybind.h (#31143)
      
      * support adding correct npu op in pybind.h
      
      * refine code
      
      * [NPU] Support executor with NPU (#31057)
      
      * [NPU] Support executor with NPU
      
      * Fix code according to reviews
      
      * Fix code
      
      * Add unittest for sub op npu
      
      * refactor npu device manager (#31154)
      
      refactor npu device manager (#31154)
      
      * fix selected npus
      
      * fix compile
      
      * fix reading flags from env
      
      * format
      Co-authored-by: xiayanming <41795079@qq.com>
      Co-authored-by: gongweibao <weibao.gong@gmail.com>
      Co-authored-by: frankwhzhang <frankwhzhang@126.com>
      Co-authored-by: liym27 <33742067+liym27@users.noreply.github.com>
  8. 16 November, 2020 1 commit
  9. 08 November, 2020 1 commit
    • exec ut no more than 15s 1 (#28439) · ba075632
      YUNSHEN XIE committed
      * disable ut test_parallel_executor_fetch_isolated_var,test=document_fix
      
      * test for limiting ut exec time as 15S
      
      * fix an error caused by a ut that cannot be found
      
      * fix some error
      
      * can not find test_transformer
      
      * fix error caused by ut not run in windows
      
      * fix error caused by Compiler Options
      
      * fix error caused by setting timeout value as 15 in python/paddle/tests/CMakeLists.txt
      
      * setting timeout value to 120s for old ut
      
      * add the timeout value setting
      
      * fix error caused by ut only run in coverage_ci
      
      * add analyzer_transformer_profile_tester
      
      * fix some error
      
      * fix some error
      
      * fix error with inference option
      
      * fix error with inference option setting as ON_INFER
      
      * add some ut to set timeout
      
      * modified some option
      
      * fix error
      
      * fix some timeout error
      
      * fix error
      
      * fix error
      
      * fix timeout for test_analyzer_bfloat16_resnet50
      
      * fix error
      
      * setting timeout property for some ut
      
      * first pr for new ut timeout as 15S
  10. 27 August, 2020 1 commit
    • Add unified RNN APIs (#26588) · f4083010
      Feiyu Chan committed
      * Add RNN related apis in paddle.nn
      test=develop
      
      * new rnn api, cell almost done
      
      * add new progresses in rnn APIs for 2.0
      
      * refine rnn APIs and docstrings.
      
      * add unittests
      
      * disable gpu tests when paddle is not compiled with cuda support
      
      * remove unnecessary imports
      
      * fix docstring
      
      * add to no_sample wlist
      
      * backport to python2 to avoid yield from
      
      * add **kwargs, fix typos
      
      * update docstrings for birnn
      
      * rename argument for SimpleRNN and SimpleRNNCell, fix sample code
      
      * add default value for initial_states in fluid.layers.birnn
      Co-authored-by: guosheng <guosheng@baidu.com>
  11. 29 January, 2019 1 commit
  12. 14 December, 2018 1 commit