1. 10 Jan, 2019 1 commit
    • [Feature] support mix precision training for resnet (#14899) · fd854183
      Committed by Wu Yi
      * clip softmax for fp16
      
      * updates
      
      * fuse xent support fp16 test=develop
      
      * wip
      
      * wip
      
      * add simple row reduce
      
      * wip fp16 accurate softmax
      
      * add accurate softmax kernel for fp16 test=develop
      
      * update test=develop
      
      * fix cpu build test=develop
      
      * update api.spec test=develop
      
      * follow comments test=develop
      
      * fix build test=develop
      
      * fix trt build test=develop
      
      * fix inference build test=develop
      
      * fix merge test=develop
      
      * update test=develop
      
      * try fix build test=develop
      
      * fix build test=develop
      
      * rename real_exp test=develop
      
      * fortest
      
      * remove hacky kernels test=develop
      
      * clean up test=develop
  2. 26 Dec, 2018 1 commit
  3. 24 Dec, 2018 1 commit
    • Init paddle slim (#14834) · 93870574
      Committed by whs
      * Init slim.
      
      * Remove distillation demo.
      
      * Fix import errors.
      test=develop
      
      * Fix some issues.
      test=develop
      
      * Fix configs.
      test=develop
      
      * Modify API.spec.
      test=develop
      
      * Fix format.
      test=develop
      
      * Fix format.
      test=develop
      
      * Add some comments.
  4. 20 Dec, 2018 1 commit
    • lookup table utils fix (#14730) · 45dd3491
      Committed by tangwei12
      * fix utils config, test=develop
      
      * rewrite tools in lookup table utils, test=develop
      
      * merge develop, test=develop
      
      * add copy in setup, test=develop
      
      * update api spec, test=develop
      
      * add doc, test=develop
      
      * prettifying doc, test=develop
      
      * Update API.spec
      
      update api.spec, test=develop
      
      * Update lookup_table_utils.py
      
      test=develop
      
      * Update lookup_table_utils.py
      
      test=develop
      
      * merge develop, test=develop
      
      * merge develop, test=develop
      
      * init fix, test=develop
      
      * fix in downloads, test=develop
      
      * fix in setup.in, test=develop
  5. 17 Dec, 2018 4 commits
  6. 14 Dec, 2018 4 commits
  7. 13 Dec, 2018 3 commits
  8. 12 Dec, 2018 2 commits
  9. 11 Dec, 2018 1 commit
  10. 10 Dec, 2018 2 commits
  11. 09 Dec, 2018 1 commit
  12. 07 Dec, 2018 1 commit
  13. 06 Dec, 2018 1 commit
  14. 05 Dec, 2018 1 commit
    • Fix clip.py (#14718) · 04539d4c
      Committed by chengduo
      * expose square
      test=develop
      
      * fix activation
      test=develop
      
      * Add square API
      test=develop
      
      * add necessary op
      
      * code refine
      
      * fix API.spec
      test=develop
      
      * fix unit test
      test=develop
      
      * add unit test sparse_grad_clip
      test=develop
      
      * fix API.spec
      test=develop
      
      * remove mac test for test_gradient_clip
      test=develop
      
      * remove selectedrows_mul_tensor
      test=develop
  15. 01 Dec, 2018 1 commit
  16. 30 Nov, 2018 1 commit
    • AsyncExecutor (#14627) · 41e19eb4
      Committed by Wang Guibao
      * AsyncExecutor: C++ side
      
      * Google naming conventions
      
      * Rename MultiExecutor to AsyncExecutor
      
      * pybind with async_executor
      
      * Naming convention
      
      * remove some flags and unused code
      
      * add refactored file of async_executor and data_feed
      
      * clear async executor interface and add data feed factory
      
      * split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
      
      * Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
      
      * refine async_executor_refactor.cc
      
      * add some files about datafeed
      
      * Revert "add some files about datafeed"
      
      This reverts commit 8ee8133ab841196925a2812b76f18d2812a6701d.
      
      * Interface rework
      
      * add MultiSlotDataFeed
      
      * Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
      
      * update data_feed for add MultiSlotDataFeed
      
      * update datafeed and async_executor to run bow_net demo
      
      * fix bug that finish_set_filelist failed in multithread
      
      * delete finish_binding_memory_(flag), because it can not be marked under the current interface
      
      * Fix bug
      
      * update async_executor.py to support set_use_slots
      
      * update async_executor.py to support set_use_slots and set_dense_slots
      
      * fix bug that when the number of files is less than the number of threads, it will fetch nan
      
      * remove redundant code, and make executor exit when an illegal queue size is set
      
      * add batch_size check
      
      * add MultiSlotDesc
      
      * Revert "add MultiSlotDesc"
      
      This reverts commit 2e72ebfad364ed6b5dcc75f38ffb2a1fdec83d8e.
      
      * add some checkpoint in DataFeedDesc
      
      * add CheckFile function in MultiSlotDataFeed
      
      * update some error info
      
      * fix deadlock bug
      
      * Fix fetch variable
      
      * Merge error
      
      * fix code style in async_executor
      
      * replace the two-lock blocking queue with a one-lock blocking queue because of some bugs
      
      * update code style
      
      * add utest for data_feed
      
      * Fix fetch var
      
      * update utest for data_feed for multithread
      
      * update SetFileList info
      
      * fix bug in utest of data_feed
      
      * Add comments for python
      
      * Add comments for python code
      
      * Fix pybind.cc with new pybind11 version
      
      * add note for DataFeedDesc's set_use_slots function
      
      * Add save_model
      
      * update data_feed_test for multi-type
      
      * add comment for executor_thread_worker
      
      * Remove unused code
      
      * update data_feed_test for generate test data file
      
      * removed unnecessary interfaces and add comments
      
      * c++ style check
      
      * update data_feed.cc
      
      * commit for code style
      
      * commit for code style
      
      * commit for code style
      
      * commit for code style
      
      * Comment away __init__ in async_executor.py
      
      * clang-format fix test=develop
      
      * use PADDLE_THROW instead of exit(-1); use unique_ptr to manage scope var in data_feed_test.cc
      
      * commit for update code style
      
      * commit for update code style
      
      * Add async_executor demo; Remove some methods
      test=develop
      
      * commit for update code style
      
      * commit for update code style
      
      * commit for update code style
      
      * update API.spec
      
      * AsyncExecutor
      test=develop
      
      * AsyncExecutor
      test=develop
      
      * AsyncExecutor
      test=develop
      
      * AsyncExecutor
      test=develop
      
      * Fix API.spec
      test=develop
      
      * Fix API.spec
      test=develop
      
      * Fix windows build error
      test=develop
      
      * Fix windows build error
      test=develop
      
      * Fix windows build error
      test=develop
      
      * Fix windows build error
      test=develop
      
      * Fix Windows Build
      test=develop
      
      * Fix Windows Build
      test=develop
      
      * Fix Windows Build
      test=develop
      
      * Fix code style
      test=develop
      
      * Fix code style
      test=develop
      
      * update datafeed
      
      * Fix code style
      test=develop
      
      * update data_feed_test for test Tensor test=develop
      
      * Fix code style
      test=develop
      
      * Fix windows build failure
      test=develop
      
      * Fix code style and windows build failure
      test=develop
      
      * Fix PYTHON3.5 build failure
      test=develop
      
      * AsyncExecutor API
      test=develop
  17. 29 Nov, 2018 6 commits
  18. 27 Nov, 2018 5 commits
    • remove arg 'non_leaf_num', test=develop · a08dc83e
      Committed by JiabinYang
    • update embedding api · 18fd2d01
      Committed by Qiao Longfei
    • Make NCE_OP more efficient and support SelectedRows (#14469) · 56a4912b
      Committed by tangwei12
      * Fix truncated normal.
      
      * Fix.
      
      * Make nce support more distribution.
      
      * Fix API.spec.
      
      * Fix python API.
      
      * Fix.
      test=develop
      
      * Fix API.spec
      test=develop
      
      * Fix sampler.
      
      * Fix order of arguments in python API.
      test=develop
      
      * NCE add selectedrows support
      
      * NCE update weighted sampling
      
      * fix bugs in nce_op, and assign_value_op optimized
      
      * fix bugs in nce_op, revert assign_value_op
      
      * nce_op optimize
      
      * nce_op optimize
      
      * nce_op optimize
      
      * add selectedRows test later
      
      test=develop
      
      * add selectedRows supported
      
      * add selectedRows supported
      
      test=develop
      
      * add selectedRows supported
      
      * add nce selectedRows supported, test=develop
      
      * add nce selectedRows supported
      
      * add nce selectedRows supported, test=develop
      
      * fix height in nce, test=develop
      
      * add ut
      
      * add ut, test=develop
      
      * make AutoGrownIndex inline
      test=develop
      
      * fix tiny error, test=develop
    • add cudnn lstm; test=develop · 05917c3c
      Committed by liuhongyu
    • Update API.spec, test=develop · 7389597c
      Committed by JiabinYang
  19. 25 Nov, 2018 1 commit
  20. 23 Nov, 2018 2 commits