1. 29 10月, 2018 1 次提交
    • W
      [1.1] [project] train imagenet using large batch size (#13766) · 26200f2e
      Wu Yi 提交于
      * fix nccl2 lars dist support
      
      * put lars in momentum op
      
      * add tests lars
      
      * fix ci
      
      * fix cpu kernel
      
      * soft warning
      
      * remove lars in test_recognize_digits.py
      
      * move to another op
      
      * add file
      
      * update api.spec test=develop
      
      * update test=develop
      
      * fix api.spec test=develop
      
      * wip
      
      * wip, finish grad merge ops
      
      * wip, finish graph build
      
      * wip test running
      
      * work on 1 gpu
      
      * workable version
      
      * update
      
      * fix tests
      
      * fuse broadcast op
      
      * fix compile failed
      
      * refine
      
      * add batch merge test mnist
      
      * fix CI test=develop
      
      * fix build
      
      * use independent bn params for batch merge test=develop
      
      * update api.spec
      
      * follow comments and for test
      
      * wip
      
      * refine tests test=develop
      
      * follow comments test=develop
      
      * remove startup bn modify test=develop
      
      * follow comments test=develop
      
      * fix merge test=develop
      26200f2e
  2. 18 10月, 2018 1 次提交
  3. 12 9月, 2018 1 次提交
  4. 10 9月, 2018 1 次提交
  5. 08 9月, 2018 1 次提交
    • W
      Benchmark tool for imgnet (#12305) · f90c7865
      Wu Yi 提交于
      * support test using executor without reader
      
      * run imgnet
      
      * update fluid benchmark
      
      * wip
      
      * update
      
      * update all models
      
      * support pyreader
      
      * update
      
      * clean up
      
      * make profile batches contollable
      
      * update API.spec
      
      * update scripts
      
      * clean dockerfile
      
      * update
      
      * clean comments
      
      * add scope argument docstring
      
      * use num_trainers to determine nccl init comms
      f90c7865
  6. 30 8月, 2018 1 次提交
  7. 27 7月, 2018 1 次提交
  8. 17 7月, 2018 1 次提交
    • W
      Remove block api (#12107) · db67d60e
      Wu Yi 提交于
      * remove block api
      
      * remove clone_variable
      
      * hide block inner apis
      
      * update
      
      * fix tests
      db67d60e
  9. 30 6月, 2018 1 次提交
  10. 27 6月, 2018 1 次提交
    • P
      bnorm+relu fuse for mkldnn (inference) (#11434) · 9a15c923
      pzelazko-intel 提交于
      * bnorm+relu fuse for mkldnn
      
      * separate fuse_relu function
      
      * bug fix
      
      * proper while range in inference_transpiler
      
      * description fix
      
      * review fix
      
      * review fix
      
      * unit test for fwd batch norm+relu MKLDNN fuse
      9a15c923
  11. 25 6月, 2018 2 次提交
  12. 20 6月, 2018 4 次提交
  13. 14 6月, 2018 1 次提交
  14. 11 6月, 2018 2 次提交
    • G
      fix bugs in fluid_benchmark · a95cf55e
      guochaorong 提交于
      a95cf55e
    • Q
      Add some dist-training robust cases into fluid benchmark test (#11207) · 1cfd3cb1
      Qiyang Min 提交于
      * 1. add weight decay feature into fluid benchmark test
      2. add learning rate decay feature into fluid benchmark test
      3. add L1&L2 regularization feature into fluid benchmark test
      4. add error clipping feature into fluid benchmark test
      5. add gradient clipping feature into fluid benchmark test
      
      * Add some document to README.md under benchmark/fluid/ repo
      
      * Add model_base.py
      
      * Fix bugs in test_listen_and_serv_op
      
      * 1. remove args out of fluid_benchmark.py
      2. remove lr_decay, regularization, clipping out of fluid_benchmark.py
      
      * add async_mode description to doc and remove the clipping description out
      
      * for restart build
      
      * to restart build
      
      * remove optimization args from args.py
      
      * 1. remove optimization from models
      2. fix bug in test_listen_and_serv_op
      
      * change the name retry_times to left_time
      
      * change retry_times to the pserver start left time
      1cfd3cb1
  15. 07 6月, 2018 1 次提交
  16. 06 6月, 2018 3 次提交
  17. 05 6月, 2018 3 次提交
  18. 04 6月, 2018 1 次提交
  19. 01 6月, 2018 1 次提交
  20. 30 5月, 2018 1 次提交
  21. 25 5月, 2018 1 次提交
  22. 23 5月, 2018 1 次提交
    • W
      Benchmark/Integrate benchmark scripts (#10707) · 55d3951b
      Wu Yi 提交于
      * wip integrate benchmark scripts
      
      * testing nlp models
      
      * k8s script to start dist benchmark job
      
      * update script
      
      * done support all models
      
      * add README.md
      
      * update by comment
      
      * clean up
      
      * follow comments
      55d3951b