1. 30 11月, 2018 1 次提交
    • W
      AsyncExecutor (#14627) · 41e19eb4
      Wang Guibao 提交于
      * AsyncExecutor: C++ side
      
      * Google naming conventions
      
      * Rename MultiExecutor to AsyncExecutor
      
      * pybind with async_executor
      
      * Naming convention
      
      * remove some flags and unused code
      
      * add refactored file of async_executor and data_feed
      
      * clear async executor interface and add data feed factory
      
      * split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
      
      * Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
      
      * refine async_executor_refactor.cc
      
      * add some files about datafeed
      
      * Revert "add some files about datafeed"
      
      This reverts commit 8ee8133a.
      
      * Interface rework
      
      * add MultiSlotDataFeed
      
      * Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
      
      * update data_feed for add MultiSlotDataFeed
      
      * update datafeed and async_executor to run bow_net demo
      
      * fix bug that finish_set_filelist failed in multithread
      
      * delete finish_binding_memory_(flag), because it can not be marked under the current interface
      
      * Fix bug
      
      * update async_executor.py for support set_use_slots
      
      * update async_executor.py for support set_use_slots and set set_dense_slots
      
      * fix bug that when the number of files is less than the number of threads, it will fetch nan
      
      * remove redundant code, and make executor exit when set a illegal queue size
      
      * add batch_size check
      
      * add MultiSlotDesc
      
      * Revert "add MultiSlotDesc"
      
      This reverts commit 2e72ebfa.
      
      * add some checkpoint in DataFeedDesc
      
      * add CheckFile function in MultiSlotDataFeed
      
      * update something error info
      
      * fix deaded lock bug
      
      * Fix fetch variable
      
      * Merge error
      
      * fix code style in async_executor
      
      * using one lock blocking queue replace two lock blocking queue because of some bugs
      
      * update code style
      
      * add utest for data_feed
      
      * Fix fetch var
      
      * update utest for data_feed for multithread
      
      * update SetFileList info
      
      * fix bug in utest of data_feed
      
      * Add comments for python
      
      * Add comments for python code
      
      * Fix pybind.cc with new pybind11 version
      
      * add note for DataFeedDesc's set_use_slots function
      
      * Add save_model
      
      * update data_feed_test for multi-type
      
      * add comment for executor_thread_worker
      
      * Remove unused code
      
      * update data_feed_test for generate test data file
      
      * removed unnecessary interfaces and add comments
      
      * c++ style check
      
      * update data_feed.cc
      
      * AsyncExecutor: C++ side
      
      Google naming conventions
      
      Rename MultiExecutor to AsyncExecutor
      
      pybind with async_executor
      
      Naming convention
      
      remove some flags and unused code
      
      add refactored file of async_executor and data_feed
      
      clear async executor interface and add data feed factory
      
      split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
      
      Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
      
      refine async_executor_refactor.cc
      
      add some files about datafeed
      
      Revert "add some files about datafeed"
      
      This reverts commit 8ee8133a.
      
      add MultiSlotDataFeed
      
      Interface rework
      
      Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
      
      update datafeed and async_executor to run bow_net demo
      
      update async_executor.py for support set_use_slots
      
      Fix bug
      
      update async_executor.py for support set_use_slots and set set_dense_slots
      
      fix bug that when the number of files is less than the number of threads, it will fetch nan
      
      remove redundant code, and make executor exit when set a illegal queue size
      
      add MultiSlotDesc
      
      Revert "add MultiSlotDesc"
      
      This reverts commit 2e72ebfa.
      
      add some checkpoint in DataFeedDesc
      
      Fix fetch variable
      
      fix code style in async_executor
      
      Fix fetch var
      
      add utest for data_feed
      
      Add comments for python
      
      update utest for data_feed for multithread
      
      fix bug in utest of data_feed
      
      Add comments for python code
      
      Fix pybind.cc with new pybind11 version
      
      add note for DataFeedDesc's set_use_slots function
      
      update data_feed_test for multi-type
      
      Add save_model
      
      update data_feed_test for generate test data file
      
      removed unnecessary interfaces and add comments
      
      add comment for executor_thread_worker
      
      Remove unused code
      
      update data_feed.cc
      
      c++ style check
      
      * commit for code style
      
      * commit for code style
      
      * commit for code style
      
      * commit for code style
      
      * Comment away __init__ in async_executor.py
      
      * clang-format fix test=develop
      
      * use PADDLE_THROW instead of exit(-1); use unique_ptr to manage scope var in data_feed_test.cc
      
      * commit for update code style
      
      * commit for update code style
      
      * Add async_executor demo; Remove some methods
      test=develop
      
      * commit for update code style
      
      * commit for update code style
      
      * commit for update code style
      
      * update API.spec
      
      * AsyncExecutor
      test=develop
      
      * AsyncExecutor
      test=develop
      
      * AsyncExecutor
      test=develop
      
      * AsyncExecutor
      test=develop
      
      * Fix API.spec
      test=develop
      
      * Fix API.spec
      test=develop
      
      * Fix windows build error
      test=develop
      
      * FIx windows build error
      test=develop
      
      * FIx windows build error
      test=develop
      
      * FIx windows build error
      test=develop
      
      * Fix Windows Build
      test=develop
      
      * Fix Windows Build
      test=develop
      
      * Fix Windows Build
      test=develop
      
      * Fix code style
      test=develop
      
      * Fix code style
      test=develop
      
      * update datafeed
      
      * Fix code style
      test=develop
      
      * update data_feed_test for test Tensor test=develop
      
      * Fix code style
      test=develop
      
      * Fix windows build failure
      test=develop
      
      * Fix code style and windows build failure
      test=develop
      
      * Fix PYTHON3.5 build failure
      test=develop
      
      * AsyncExecutor API
      test=develop
      41e19eb4
  2. 29 11月, 2018 2 次提交
  3. 27 11月, 2018 4 次提交
    • J
      remove arg 'non_leaf_num', test=develop · a08dc83e
      JiabinYang 提交于
      a08dc83e
    • Q
      update embedding api · 18fd2d01
      Qiao Longfei 提交于
      18fd2d01
    • T
      Make NCE_OP more efficient and support SelectedRows (#14469) · 56a4912b
      tangwei12 提交于
      * Fix truncated normal.
      
      * Fix.
      
      * Make nce support more distribution.
      
      * Fix API.spec.
      
      * Fix python API.
      
      * Fix.
      test=develop
      
      * Fix API.spec
      test=develop
      
      * Fix sampler.
      
      * Fix order of arguments in python API.
      test=develop
      
      * NCE add selectedrows support
      
      * NCE update weighted sampling
      
      * fix bugs in nce_op, and assign_value_op optimized
      
      * fix bugs in nce_op, revert assign_value_op
      
      * nce_op optimize
      
      * nce_op optimize
      
      * nce_op optimize
      
      * add selectedRows test later
      
      test=develop
      
      * add selectedRows supported
      
      * add selectedRows supported
      
      test=develop
      
      * add selectedRows supported
      
      * add nce selectedRows supported, test=develop
      
      * add nce selectedRows supported
      
      * add nce selectedRows supported, test=develop
      
      * fix height in nce, test=develop
      
      * add ut
      
      * add ut, test=develop
      
      * make AutoGrownIndex inline
      test=develop
      
      * fix tinny error, test=develop
      56a4912b
    • J
      Update API.spec, test=develop · 7389597c
      JiabinYang 提交于
      7389597c
  4. 25 11月, 2018 1 次提交
  5. 23 11月, 2018 2 次提交
  6. 22 11月, 2018 1 次提交
  7. 16 11月, 2018 2 次提交
    • W
      Make nce support more distribution. (#13549) · 17226782
      whs 提交于
      * Fix truncated normal.
      
      * Fix.
      
      * Make nce support more distribution.
      
      * Fix API.spec.
      
      * Fix python API.
      
      * Fix.
      test=develop
      
      * Fix API.spec
      test=develop
      
      * Fix sampler.
      
      * Fix order of arguments in python API.
      test=develop
      17226782
    • W
      Add cudnn ctc loss (#12366) · b32c13dc
      Wu Yi 提交于
      * add cudnn ctc loss
      
      * wip add test test=develop
      
      * wip
      
      * wip
      
      * done test=develop
      
      * move include cudnn test=develop
      
      * test test=develop
      
      * fix build test=develop
      
      * fix build test=develop
      
      * fix build on cudnn5 test=develop
      
      * fix cudnn5 build test=develop
      
      * fix cudnn5 build test=develop
      
      * merge develop softmax functor change test=develop
      b32c13dc
  8. 15 11月, 2018 1 次提交
    • C
      Add selu (#14415) · 82773477
      chengduo 提交于
      * add selu
      
      * use for range
      test=develop
      
      * add API
      test=develop
      
      * follow comment
      test=develop
      
      * update API.spec
      test=develop
      82773477
  9. 13 11月, 2018 3 次提交
  10. 09 11月, 2018 2 次提交
    • L
      Add lod tensor array to tensor op (#13990) · 688ed601
      li099 提交于
      * add lod tensor array concat
      
      * add lod tensor array concat
      
      * test=develop
      
      * add lod tensor array concat
      test=develop
      
      * Fix API.spec
      test=develop
      
      * add lod tensor array concat
      test=develop
      
      * revise some bug of lod tensor array concat
      test=develop
      
      * add unittest for tensor array concat
      test=develop
      
      * change to tensor array to tensor
      test=develop
      
      * revise bug
      test=develop
      
      * revise a bug
      test=develop
      
      * revise a bug
      test=develop
      
      * revise a bug of python3
      test=develop
      688ed601
    • Q
      update API.spec · 3f91e0f0
      Qiao Longfei 提交于
      test=develop
      3f91e0f0
  11. 07 11月, 2018 1 次提交
  12. 02 11月, 2018 2 次提交
  13. 01 11月, 2018 2 次提交
  14. 30 10月, 2018 2 次提交
  15. 29 10月, 2018 6 次提交
    • J
      Rebase of seqpool-max optimization · 458b16f4
      Jacek Czaja 提交于
      test=develop
      
      - Added rough profiling
      
      - Profiled maxpool itself
      
      - First draft of max seqpool optimization (is_test added)
      
      - Added unit tests to seqpool
      
      - Cosmetic fixes
      
      - Fix to UT of Seq pool
      
      Disabled grad checking for sequence max pool when is_test is set to True
      
      -Cosmetic fix to comment
      
      test=develop
      
      - Fix to GPU build
      
      test=develop
      
      - yet another GPU fix for sequence max pool
      
      - Fix to comment
      
      test=develop
      
      - Change to API of sequence_pool
      
      test=develop
      
      - Yet another API spec change
      
      test=develop
      458b16f4
    • B
      update paddle/fluid/API.spec · 73671379
      barrierye 提交于
      test=develop
      73671379
    • D
    • D
      add Grid Sampler Operator for STN. · 0bb0e0c1
      dengkaipeng 提交于
      0bb0e0c1
    • J
      test=develop · 0e303868
      JiabinYang 提交于
      0e303868
    • W
      [1.1] [project] train imagenet using large batch size (#13766) · 26200f2e
      Wu Yi 提交于
      * fix nccl2 lars dist support
      
      * put lars in momentum op
      
      * add tests lars
      
      * fix ci
      
      * fix cpu kernel
      
      * soft warning
      
      * remove lars in test_recognize_digits.py
      
      * move to another op
      
      * add file
      
      * update api.spec test=develop
      
      * update test=develop
      
      * fix api.spec test=develop
      
      * wip
      
      * wip, finish grad merge ops
      
      * wip, finish graph build
      
      * wip test running
      
      * work on 1 gpu
      
      * workable version
      
      * update
      
      * fix tests
      
      * fuse broadcast op
      
      * fix compile failed
      
      * refine
      
      * add batch merge test mnist
      
      * fix CI test=develop
      
      * fix build
      
      * use independent bn params for batch merge test=develop
      
      * update api.spec
      
      * follow comments and for test
      
      * wip
      
      * refine tests test=develop
      
      * follow comments test=develop
      
      * remove startup bn modify test=develop
      
      * follow comments test=develop
      
      * fix merge test=develop
      26200f2e
  16. 25 10月, 2018 2 次提交
  17. 24 10月, 2018 2 次提交
  18. 23 10月, 2018 1 次提交
  19. 22 10月, 2018 1 次提交
  20. 19 10月, 2018 2 次提交
    • G
      Make reshape_op reuse input. · dfb841ad
      guosheng 提交于
      test=develop
      dfb841ad
    • C
      Fix add doc for bias_attr (#13937) · 9775e50c
      chengduo 提交于
      * fix conv doc
      test=develop
      
      * fix seq_conv doc
      test=develop
      
      * fix simple_img_conv_pool
      test=develop
      
      * update API.spec
      
      * update parameter doc
      test=develop
      
      * follow comment
      test=develop
      
      * fix other layer
      test=develop
      
      * fix lstm bias_attr doc
      test=develop
      9775e50c