1. 12 2月, 2019 1 次提交
  2. 21 1月, 2019 1 次提交
  3. 15 1月, 2019 1 次提交
  4. 14 1月, 2019 1 次提交
  5. 11 1月, 2019 2 次提交
    • C
      Revert "Remove workspace_handle in conv_cudnn (#15186)" · 358e657f
      chengduozh 提交于
      test=develop
      This reverts commit 064512aa.
      358e657f
    • C
      Remove workspace_handle in conv_cudnn (#15186) · 064512aa
      chengduo 提交于
      * remove workspace_handle in conv2d_cudnn
      test=develop
      
      * remove workspace_handle
      test=develop
      
      * fix bug
      test=develop
      
      * make test_conv2d_op SERIAL
      test=develop
      
      * save memory in conv_cudnn
      test=develop
      
      * enhance thread safety
      test=develop
      
      * enhance temporary allocator
      test=develop
      
      * Add excess fraction
      test=develop
      
      * follow comments
      test=develop
      
      * fix bug and code refine
      test=develop
      
      * fix memory size check
      test=develop
      
      * rename reuse_tmp_allocation_excess_fraction
      test=develop
      064512aa
  6. 09 1月, 2019 1 次提交
  7. 07 1月, 2019 2 次提交
  8. 02 1月, 2019 2 次提交
  9. 28 12月, 2018 3 次提交
  10. 26 12月, 2018 1 次提交
  11. 22 12月, 2018 1 次提交
  12. 18 12月, 2018 4 次提交
    • P
      add ctc support for windows · 19ebd8b4
      peizhilin 提交于
      19ebd8b4
    • P
      fix code style · 001891ae
      peizhilin 提交于
      test=develop
      001891ae
    • D
      add ir memory optimize. (#14530) · 7cd24b13
      dzhwinter 提交于
      * follow comments. test=develop
      
      * Fix typo
      
      * fix compile error. test=develop
      
      * merge develop branch. test=develop
      
      * Remove set_equal
      
      * Polish code
      
      * Delete unused functions
      
      test=develop
      
      * polish code. test=develop
      
      * follow comment
      
      * polish code.
      
      * fix windows compile error. test=develop
      
      * fix op handle.
      
      * rerun ci. test=develop
      
      * rerun ci. test=develop
      
      * rerun macci. test=develop
      
      * polish code. test=develop
      
      * rewrite sort code. test=develop
      
      * remove unused code. test=develop
      
      * fix tests. test=develop
      
      * fix conflict. test=develop
      
      * follow comment. test=develop
      
      * merge develop branch. test=develop
      
      * fix tests. test=develop
      
      * remove ToTypeIndex. test=develop
      
      * rerun ci. test=develop
      7cd24b13
    • P
      add mkl,ctc support for windows · 5a6d7fe2
      peizhilin 提交于
      5a6d7fe2
  13. 14 12月, 2018 1 次提交
  14. 13 12月, 2018 1 次提交
  15. 12 12月, 2018 1 次提交
  16. 10 12月, 2018 1 次提交
  17. 09 12月, 2018 1 次提交
  18. 07 12月, 2018 2 次提交
  19. 06 12月, 2018 1 次提交
  20. 04 12月, 2018 1 次提交
  21. 03 12月, 2018 2 次提交
  22. 30 11月, 2018 1 次提交
    • W
      AsyncExecutor (#14627) · 41e19eb4
      Wang Guibao 提交于
      * AsyncExecutor: C++ side
      
      * Google naming conventions
      
      * Rename MultiExecutor to AsyncExecutor
      
      * pybind with async_executor
      
      * Naming convention
      
      * remove some flags and unused code
      
      * add refactored file of async_executor and data_feed
      
      * clear async executor interface and add data feed factory
      
      * split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
      
      * Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
      
      * refine async_executor_refactor.cc
      
      * add some files about datafeed
      
      * Revert "add some files about datafeed"
      
      This reverts commit 8ee8133a.
      
      * Interface rework
      
      * add MultiSlotDataFeed
      
      * Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
      
      * update data_feed for add MultiSlotDataFeed
      
      * update datafeed and async_executor to run bow_net demo
      
      * fix bug that finish_set_filelist failed in multithread
      
      * delete finish_binding_memory_(flag), because it can not be marked under the current interface
      
      * Fix bug
      
      * update async_executor.py for support set_use_slots
      
      * update async_executor.py for support set_use_slots and set set_dense_slots
      
      * fix bug that when the number of files is less than the number of threads, it will fetch nan
      
      * remove redundant code, and make executor exit when set a illegal queue size
      
      * add batch_size check
      
      * add MultiSlotDesc
      
      * Revert "add MultiSlotDesc"
      
      This reverts commit 2e72ebfa.
      
      * add some checkpoint in DataFeedDesc
      
      * add CheckFile function in MultiSlotDataFeed
      
      * update something error info
      
      * fix deaded lock bug
      
      * Fix fetch variable
      
      * Merge error
      
      * fix code style in async_executor
      
      * using one lock blocking queue replace two lock blocking queue because of some bugs
      
      * update code style
      
      * add utest for data_feed
      
      * Fix fetch var
      
      * update utest for data_feed for multithread
      
      * update SetFileList info
      
      * fix bug in utest of data_feed
      
      * Add comments for python
      
      * Add comments for python code
      
      * Fix pybind.cc with new pybind11 version
      
      * add note for DataFeedDesc's set_use_slots function
      
      * Add save_model
      
      * update data_feed_test for multi-type
      
      * add comment for executor_thread_worker
      
      * Remove unused code
      
      * update data_feed_test for generate test data file
      
      * removed unnecessary interfaces and add comments
      
      * c++ style check
      
      * update data_feed.cc
      
      * AsyncExecutor: C++ side
      
      Google naming conventions
      
      Rename MultiExecutor to AsyncExecutor
      
      pybind with async_executor
      
      Naming convention
      
      remove some flags and unused code
      
      add refactored file of async_executor and data_feed
      
      clear async executor interface and add data feed factory
      
      split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
      
      Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
      
      refine async_executor_refactor.cc
      
      add some files about datafeed
      
      Revert "add some files about datafeed"
      
      This reverts commit 8ee8133a.
      
      add MultiSlotDataFeed
      
      Interface rework
      
      Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
      
      update datafeed and async_executor to run bow_net demo
      
      update async_executor.py for support set_use_slots
      
      Fix bug
      
      update async_executor.py for support set_use_slots and set set_dense_slots
      
      fix bug that when the number of files is less than the number of threads, it will fetch nan
      
      remove redundant code, and make executor exit when set a illegal queue size
      
      add MultiSlotDesc
      
      Revert "add MultiSlotDesc"
      
      This reverts commit 2e72ebfa.
      
      add some checkpoint in DataFeedDesc
      
      Fix fetch variable
      
      fix code style in async_executor
      
      Fix fetch var
      
      add utest for data_feed
      
      Add comments for python
      
      update utest for data_feed for multithread
      
      fix bug in utest of data_feed
      
      Add comments for python code
      
      Fix pybind.cc with new pybind11 version
      
      add note for DataFeedDesc's set_use_slots function
      
      update data_feed_test for multi-type
      
      Add save_model
      
      update data_feed_test for generate test data file
      
      removed unnecessary interfaces and add comments
      
      add comment for executor_thread_worker
      
      Remove unused code
      
      update data_feed.cc
      
      c++ style check
      
      * commit for code style
      
      * commit for code style
      
      * commit for code style
      
      * commit for code style
      
      * Comment away __init__ in async_executor.py
      
      * clang-format fix test=develop
      
      * use PADDLE_THROW instead of exit(-1); use unique_ptr to manage scope var in data_feed_test.cc
      
      * commit for update code style
      
      * commit for update code style
      
      * Add async_executor demo; Remove some methods
      test=develop
      
      * commit for update code style
      
      * commit for update code style
      
      * commit for update code style
      
      * update API.spec
      
      * AsyncExecutor
      test=develop
      
      * AsyncExecutor
      test=develop
      
      * AsyncExecutor
      test=develop
      
      * AsyncExecutor
      test=develop
      
      * Fix API.spec
      test=develop
      
      * Fix API.spec
      test=develop
      
      * Fix windows build error
      test=develop
      
      * FIx windows build error
      test=develop
      
      * FIx windows build error
      test=develop
      
      * FIx windows build error
      test=develop
      
      * Fix Windows Build
      test=develop
      
      * Fix Windows Build
      test=develop
      
      * Fix Windows Build
      test=develop
      
      * Fix code style
      test=develop
      
      * Fix code style
      test=develop
      
      * update datafeed
      
      * Fix code style
      test=develop
      
      * update data_feed_test for test Tensor test=develop
      
      * Fix code style
      test=develop
      
      * Fix windows build failure
      test=develop
      
      * Fix code style and windows build failure
      test=develop
      
      * Fix PYTHON3.5 build failure
      test=develop
      
      * AsyncExecutor API
      test=develop
      41e19eb4
  23. 26 11月, 2018 1 次提交
  24. 25 11月, 2018 1 次提交
  25. 24 11月, 2018 1 次提交
  26. 23 11月, 2018 2 次提交
  27. 22 11月, 2018 2 次提交
    • C
      Refine cublas to support CUBLAS_TENSOR_OP_MATH (#13929) · 00b9e9a1
      chengduo 提交于
      * refine cublase
      test=develop
      
      * code refine
      
      * refine cublas
      
      * add GEMME_EX
      
      * add enable_cublas_tensor_op_math doc and add cublasCall
      test=develop
      
      * fix CublasCall for cuda version
      test=develop
      
      * fix error
      test=develop
      
      * fix GEMM_EX to be compatible with gcc 4.8
      test=develop
      
      * add GEMM_EX
      test=develop
      
      * to compatiable with gcc4.8
      test=develop
      00b9e9a1
    • W
      Windows/online (#14474) · d9a1f3e5
      wopeizl 提交于
      * add recordio support
      
      * disable the openblas multi-thread on windows since no support
      adjust the python script
      
      * code style
      
      * code style
      test=develop
      
      * add create_recordio_file_reader back
      
      * fix code style
      test=develop
      
      * fix the gtest.cmake on windows
      
      * fix cc_test on windows
      
      * fix the win build
      test=develop
      
      * remove fused compile support on windows
      test=develop
      
      * add the jit support
      test=develop
      
      * add the jit support, test=develop
      
      * add the jit support, test=develop
      
      * add the jit back
      fix compile error on windows
      
      * rollback test=develop
      
      * test case fix
      
      * disable DSO by default on windows
      
      * exclude warpctc_op on windows
      
      * exclude the dynload_warpctc out on windows
      test=develop
      
      * fix the scripts error
      test=develop
      
      * disable avx on windows by default
      test=develop
      
      * re-organize the cmake file
      
      * disable mkl on windows by default
      
      * add warp_ctc back
      
      * fix the dependency
      
      * fix the dependency
      
      * fix the build issue on windows
      
      * remove unsupported flag on windows
      
      * code style
      
      * code style
      test=develop
      
      * fix issue
      
      * add profiler, parallel_executor back
      
      * clean up the pre-definitions on windows
      
      * fix build issue
      
      * test=develop
      d9a1f3e5
  28. 21 11月, 2018 1 次提交