1. 30 8月, 2019 1 次提交
    • H
      [Cherry-pick] Support memory eager deletion on recurrent OP (#19411) · cb74dac3
      Huihuang Zheng 提交于
      * Support memory eager deletion on recurrent OP (#17710)
      
      Test PaddingRNN on V100 GPU device.
      
      Test configuration: large model, padding mode (which is the mode using recurrentOp), one GPU.
                         
      GPU memory (MiB):   6414 (this PR)     vs   6837 (without this PR)
      Speed (steps/s):         10.28 (this PR)    vs    9.89 (without this PR)
      
      * Fix random test_recurrent_op failure (#18718)
      
      The change includes 3 things:
      
      1. Set CPU_NUM to 1 in the tests because the ParallelExecutor will print warning that CPU_NUM is not set and use default 1.
      
      2. Old tests compare two RNNs, hand written simple RNN and same RNN built by Paddle, but initialized RNN weights in numpy random and Paddle random separately. Fixed it by setting weights and bias values.
      
      3. Also set numpy random seed in the tests. Now the two RNNs diff can be smaller (rtol from 0.1, 0.2 to. 0.01) in the tests.
      cb74dac3
  2. 17 6月, 2019 1 次提交
  3. 05 6月, 2019 1 次提交
    • B
      [NGraph] some ngraph updates to enable bert (#17739) · a4c528a3
      baojun 提交于
      * delay infershape test=develop
      
      * fall back subblock to paddle test=develop
      
      * fix edge cases test=develop
      
      * remove output duplicates test=develop
      
      * handle reshape2_grad infershape test=develop
      a4c528a3
  4. 31 5月, 2019 1 次提交
  5. 24 5月, 2019 1 次提交
  6. 29 3月, 2019 8 次提交
  7. 24 3月, 2019 1 次提交
  8. 13 3月, 2019 1 次提交
  9. 11 3月, 2019 1 次提交
  10. 06 3月, 2019 1 次提交
  11. 05 3月, 2019 1 次提交
  12. 04 3月, 2019 1 次提交
    • D
      polish cudnn related code and fix bug. (#15164) · 4449e855
      dzhwinter 提交于
      * staged.
      
      * polish code
      
      * polish code. test=develop
      
      * polish code. test=develop
      
      * api change. test=develop
      
      * fix default value. test=develop
      
      * fix default value. test=develop
      4449e855
  13. 27 2月, 2019 1 次提交
    • D
      polish cudnn related code and fix bug. (#15164) · 225c11a9
      dzhwinter 提交于
      * staged.
      
      * polish code
      
      * polish code. test=develop
      
      * polish code. test=develop
      
      * api change. test=develop
      
      * fix default value. test=develop
      
      * fix default value. test=develop
      225c11a9
  14. 25 1月, 2019 1 次提交
    • B
      Adding ngraph_engine_op (#14948) · efce2567
      baojun 提交于
      * enable ngraph_engine_op
      test=develop
      
      * merge develop test=develop
      
      * avoid const_cast test=develop
      
      * rm ngraph_operator test=develop
      
      * Added TODO to move EnableNgraph test=develop
      
      * Add TODO to remove const_cast test=develop
      efce2567
  15. 28 12月, 2018 1 次提交
    • W
      Refactor distributed RPC (#15075) · a8bc05b5
      Wu Yi 提交于
      * wip
      
      * wip
      
      * refactor no.1 dir structure test=develop
      
      * fix linking test=develop
      
      * fix includes test=develop
      
      * fix build test=develop
      
      * fix build test=develop
      a8bc05b5
  16. 19 12月, 2018 1 次提交
  17. 18 12月, 2018 1 次提交
  18. 14 12月, 2018 1 次提交
  19. 13 12月, 2018 1 次提交
  20. 12 12月, 2018 1 次提交
  21. 08 12月, 2018 2 次提交
  22. 07 12月, 2018 2 次提交
    • S
      polish code · eb825246
      sneaxiy 提交于
      add unittest model containing while_op
      remove unnecessary codes
      test=develop
      eb825246
    • S
      refine code · 387bac46
      sneaxiy 提交于
      test=develop
      387bac46
  23. 04 12月, 2018 2 次提交
  24. 03 12月, 2018 1 次提交
  25. 30 11月, 2018 1 次提交
    • W
      AsyncExecutor (#14627) · 41e19eb4
      Wang Guibao 提交于
      * AsyncExecutor: C++ side
      
      * Google naming conventions
      
      * Rename MultiExecutor to AsyncExecutor
      
      * pybind with async_executor
      
      * Naming convention
      
      * remove some flags and unused code
      
      * add refactored file of async_executor and data_feed
      
      * clear async executor interface and add data feed factory
      
      * split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
      
      * Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
      
      * refine async_executor_refactor.cc
      
      * add some files about datafeed
      
      * Revert "add some files about datafeed"
      
      This reverts commit 8ee8133a.
      
      * Interface rework
      
      * add MultiSlotDataFeed
      
      * Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
      
      * update data_feed for add MultiSlotDataFeed
      
      * update datafeed and async_executor to run bow_net demo
      
      * fix bug that finish_set_filelist failed in multithread
      
      * delete finish_binding_memory_(flag), because it can not be marked under the current interface
      
      * Fix bug
      
      * update async_executor.py for support set_use_slots
      
      * update async_executor.py for support set_use_slots and set set_dense_slots
      
      * fix bug that when the number of files is less than the number of threads, it will fetch nan
      
      * remove redundant code, and make executor exit when set a illegal queue size
      
      * add batch_size check
      
      * add MultiSlotDesc
      
      * Revert "add MultiSlotDesc"
      
      This reverts commit 2e72ebfa.
      
      * add some checkpoint in DataFeedDesc
      
      * add CheckFile function in MultiSlotDataFeed
      
      * update something error info
      
      * fix deaded lock bug
      
      * Fix fetch variable
      
      * Merge error
      
      * fix code style in async_executor
      
      * using one lock blocking queue replace two lock blocking queue because of some bugs
      
      * update code style
      
      * add utest for data_feed
      
      * Fix fetch var
      
      * update utest for data_feed for multithread
      
      * update SetFileList info
      
      * fix bug in utest of data_feed
      
      * Add comments for python
      
      * Add comments for python code
      
      * Fix pybind.cc with new pybind11 version
      
      * add note for DataFeedDesc's set_use_slots function
      
      * Add save_model
      
      * update data_feed_test for multi-type
      
      * add comment for executor_thread_worker
      
      * Remove unused code
      
      * update data_feed_test for generate test data file
      
      * removed unnecessary interfaces and add comments
      
      * c++ style check
      
      * update data_feed.cc
      
      * AsyncExecutor: C++ side
      
      Google naming conventions
      
      Rename MultiExecutor to AsyncExecutor
      
      pybind with async_executor
      
      Naming convention
      
      remove some flags and unused code
      
      add refactored file of async_executor and data_feed
      
      clear async executor interface and add data feed factory
      
      split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
      
      Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
      
      refine async_executor_refactor.cc
      
      add some files about datafeed
      
      Revert "add some files about datafeed"
      
      This reverts commit 8ee8133a.
      
      add MultiSlotDataFeed
      
      Interface rework
      
      Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
      
      update datafeed and async_executor to run bow_net demo
      
      update async_executor.py for support set_use_slots
      
      Fix bug
      
      update async_executor.py for support set_use_slots and set set_dense_slots
      
      fix bug that when the number of files is less than the number of threads, it will fetch nan
      
      remove redundant code, and make executor exit when set a illegal queue size
      
      add MultiSlotDesc
      
      Revert "add MultiSlotDesc"
      
      This reverts commit 2e72ebfa.
      
      add some checkpoint in DataFeedDesc
      
      Fix fetch variable
      
      fix code style in async_executor
      
      Fix fetch var
      
      add utest for data_feed
      
      Add comments for python
      
      update utest for data_feed for multithread
      
      fix bug in utest of data_feed
      
      Add comments for python code
      
      Fix pybind.cc with new pybind11 version
      
      add note for DataFeedDesc's set_use_slots function
      
      update data_feed_test for multi-type
      
      Add save_model
      
      update data_feed_test for generate test data file
      
      removed unnecessary interfaces and add comments
      
      add comment for executor_thread_worker
      
      Remove unused code
      
      update data_feed.cc
      
      c++ style check
      
      * commit for code style
      
      * commit for code style
      
      * commit for code style
      
      * commit for code style
      
      * Comment away __init__ in async_executor.py
      
      * clang-format fix test=develop
      
      * use PADDLE_THROW instead of exit(-1); use unique_ptr to manage scope var in data_feed_test.cc
      
      * commit for update code style
      
      * commit for update code style
      
      * Add async_executor demo; Remove some methods
      test=develop
      
      * commit for update code style
      
      * commit for update code style
      
      * commit for update code style
      
      * update API.spec
      
      * AsyncExecutor
      test=develop
      
      * AsyncExecutor
      test=develop
      
      * AsyncExecutor
      test=develop
      
      * AsyncExecutor
      test=develop
      
      * Fix API.spec
      test=develop
      
      * Fix API.spec
      test=develop
      
      * Fix windows build error
      test=develop
      
      * FIx windows build error
      test=develop
      
      * FIx windows build error
      test=develop
      
      * FIx windows build error
      test=develop
      
      * Fix Windows Build
      test=develop
      
      * Fix Windows Build
      test=develop
      
      * Fix Windows Build
      test=develop
      
      * Fix code style
      test=develop
      
      * Fix code style
      test=develop
      
      * update datafeed
      
      * Fix code style
      test=develop
      
      * update data_feed_test for test Tensor test=develop
      
      * Fix code style
      test=develop
      
      * Fix windows build failure
      test=develop
      
      * Fix code style and windows build failure
      test=develop
      
      * Fix PYTHON3.5 build failure
      test=develop
      
      * AsyncExecutor API
      test=develop
      41e19eb4
  26. 26 11月, 2018 2 次提交
  27. 25 11月, 2018 1 次提交
  28. 14 11月, 2018 1 次提交
  29. 13 11月, 2018 1 次提交