1. 26 11月, 2018 1 次提交
    • T
      Fix save and load lookup table/optimizer vars (#14301) · 3639d99f
      tangwei12 提交于
      *  fix mkdir conflict
      
      *  fix load/save lookup tables
      
       test=develop
      
      * add lookup_table_utils
      
      * fix load optimize vars on pserver
      
      * delete lookup table utils
      
      * fix save and load lookup tables
      
      * fix load optimizer var
      
      * fix load optimizer var, test=develop
      
      * fix python 3 style, test=develop
      
      * move lookup_table_utils to contrib utils
      3639d99f
  2. 11 11月, 2018 1 次提交
  3. 08 11月, 2018 2 次提交
  4. 07 11月, 2018 1 次提交
  5. 06 11月, 2018 2 次提交
  6. 02 11月, 2018 1 次提交
    • T
      [1.1] Load vars on PSERVER (#14037) · d325e668
      tangwei12 提交于
      * fix dim0 in _load_slice_up_vars
      
      * fix dim0 in _load_slice_up_vars, fix innershape in delete_var_op
      
      * Revert "fix lookuptable in reduce strategy"
      
      This reverts commit 0e722c5e
      
      * add unit test for dist
      
      * add unit test for dist, test=develop
      
      * cancel revert, test=develop
      d325e668
  7. 01 11月, 2018 4 次提交
  8. 31 10月, 2018 2 次提交
  9. 30 10月, 2018 1 次提交
  10. 29 10月, 2018 1 次提交
    • W
      [1.1] [project] train imagenet using large batch size (#13766) · 26200f2e
      Wu Yi 提交于
      * fix nccl2 lars dist support
      
      * put lars in momentum op
      
      * add tests lars
      
      * fix ci
      
      * fix cpu kernel
      
      * soft warning
      
      * remove lars in test_recognize_digits.py
      
      * move to another op
      
      * add file
      
      * update api.spec test=develop
      
      * update test=develop
      
      * fix api.spec test=develop
      
      * wip
      
      * wip, finish grad merge ops
      
      * wip, finish graph build
      
      * wip test running
      
      * work on 1 gpu
      
      * workable version
      
      * update
      
      * fix tests
      
      * fuse broadcast op
      
      * fix compile failed
      
      * refine
      
      * add batch merge test mnist
      
      * fix CI test=develop
      
      * fix build
      
      * use independent bn params for batch merge test=develop
      
      * update api.spec
      
      * follow comments and for test
      
      * wip
      
      * refine tests test=develop
      
      * follow comments test=develop
      
      * remove startup bn modify test=develop
      
      * follow comments test=develop
      
      * fix merge test=develop
      26200f2e
  11. 28 10月, 2018 1 次提交
  12. 27 10月, 2018 1 次提交
  13. 26 10月, 2018 4 次提交
  14. 25 10月, 2018 1 次提交
  15. 18 10月, 2018 1 次提交
  16. 16 10月, 2018 1 次提交
  17. 14 10月, 2018 1 次提交
  18. 27 9月, 2018 3 次提交
    • T
      fix graph num · 5b152b1f
      typhoonzero 提交于
      5b152b1f
    • T
      Add distributed unit tests about text_classification/simnet-bow/ctr (#12812) · 97cf1eb6
      tangwei12 提交于
      * add dist ut for text_classification
      
      * add dist ut for text_classification
      
      * add simnet bow unittest
      
      * add dist ut for simnet bow
      
      * add trainning data url for simnet bow
      
      * add trainning data url for simnet bow
      
      * modify simnet test_reader to train reader
      
      * add test_dist_ctr
      
      * test_dist_ctr can run now
      
      * dense update is good
      
      * add unit test for selected rows
      
      * debug unit test
      
      * fix dist sparse update problem
      
      * Constant args at init
      
      * optimize code
      
      * simnet optimize
      
      * fix DebugStringEx
      
      * optimize sum_op.h
      
      * add ScaleOpVarTypeInference
      
      * clean code
      
      * fix test_dist_transpiler.py
      
      * code optimize
      
      * modify delta
      
      * fix sparse update bug
      
      * dist test use one cpu
      
      * update some data
      
      * remove unused code
      
      * add use cuda config
      
      * unit test fix
      
      * unit test fix
      
      * unit test fix
      
      * unit test fix
      
      * dist_word2vec use CPU
      
      * unit test fix
      
      * unit test fix
      
      * code clean
      
      * code clean
      
      * merge develop
      
      * api spec update
      
      * Revert: api spec update
      
      * replace simnet data with fake
      
      * replace simnet data with fake
      
      * update dim
      
      * add batch auc
      
      * code clean
      
      * code clean
      
      * modify print to stderr
      
      * update simnet delta -> 1e-5
      
      * update RUN_STEP
      
      * add use_reader_alloc
      
      * add use_reader_alloc
      
      * add use_reader_alloc
      
      * modify delta
      
      * add use_reader_alloc
      
      * fix stderr write
      
      * python3 compatibility
      
      test=develop
      
      * python3 compatibility, test=develop
      
      * Update dist_text_classification.py
      
      * test=develop
      97cf1eb6
    • T
      Batch AUC (#13567) · 85362e98
      tangwei12 提交于
      * add distributed auc
      
      * add attr "is distributed" and config it
      
      * add distributed auc
      
      * add batch auc and code format
      
      * code format
      
      * auc optimize
      
      * metric_op optimize
      
      * code clean
      
      * bug fix and code clean
      
      * bug fix and code clean
      
      * code optimize
      
      * code optimize
      
      * api spec update
      
      * Comments optimized
      
      * add mutex
      
      * Revert: add mutex
      
      * remove distribute metric
      
      * remove distribute metric
      
      * spec modifyed
      
      * add annotation, test=develop
      
      * keep API compatibility
      test=develop
      85362e98
  19. 26 9月, 2018 2 次提交
  20. 25 9月, 2018 1 次提交
    • W
      Nccl2 dist API (#13506) · aeb2dc2b
      Wu Yi 提交于
      * add nccl2 dist api
      
      * update apispec
      
      * update
      
      * update api spec
      aeb2dc2b
  21. 23 9月, 2018 1 次提交
  22. 21 9月, 2018 1 次提交
  23. 18 9月, 2018 2 次提交
  24. 13 9月, 2018 1 次提交
    • W
      Trainer auto wait pserver ports (#13341) · 3ab3a7f3
      Wu Yi 提交于
      * trainer auto wait pserver port ready
      
      * add file
      
      * fix docstring
      
      * add option to not wait
      
      * update api spec
      
      * clean
      
      * fix test hang
      3ab3a7f3
  25. 04 9月, 2018 1 次提交
  26. 03 9月, 2018 1 次提交
  27. 31 8月, 2018 1 次提交