1. 29 10月, 2018 1 次提交
    • W
      [1.1] [project] train imagenet using large batch size (#13766) test=release/1.1 · cb274159
      Wu Yi 提交于
      * fix nccl2 lars dist support
      
      * put lars in momentum op
      
      * add tests lars
      
      * fix ci
      
      * fix cpu kernel
      
      * soft warning
      
      * remove lars in test_recognize_digits.py
      
      * move to another op
      
      * add file
      
      * update api.spec test=develop
      
      * update test=develop
      
      * fix api.spec test=develop
      
      * wip
      
      * wip, finish grad merge ops
      
      * wip, finish graph build
      
      * wip test running
      
      * work on 1 gpu
      
      * workable version
      
      * update
      
      * fix tests
      
      * fuse broadcast op
      
      * fix compile failed
      
      * refine
      
      * add batch merge test mnist
      
      * fix CI test=develop
      
      * fix build
      
      * use independent bn params for batch merge test=develop
      
      * update api.spec
      
      * follow comments and for test
      
      * wip
      
      * refine tests test=develop
      
      * follow comments test=develop
      
      * remove startup bn modify test=develop
      
      * follow comments test=develop
      
      * fix merge test=develop
      cb274159
  2. 20 9月, 2018 1 次提交
  3. 12 9月, 2018 1 次提交
  4. 08 9月, 2018 1 次提交
    • W
      Benchmark tool for imgnet (#12305) · f90c7865
      Wu Yi 提交于
      * support test using executor without reader
      
      * run imgnet
      
      * update fluid benchmark
      
      * wip
      
      * update
      
      * update all models
      
      * support pyreader
      
      * update
      
      * clean up
      
      * make profile batches contollable
      
      * update API.spec
      
      * update scripts
      
      * clean dockerfile
      
      * update
      
      * clean comments
      
      * add scope argument docstring
      
      * use num_trainers to determine nccl init comms
      f90c7865
  5. 30 6月, 2018 1 次提交
  6. 27 6月, 2018 1 次提交
    • P
      bnorm+relu fuse for mkldnn (inference) (#11434) · 9a15c923
      pzelazko-intel 提交于
      * bnorm+relu fuse for mkldnn
      
      * separate fuse_relu function
      
      * bug fix
      
      * proper while range in inference_transpiler
      
      * description fix
      
      * review fix
      
      * review fix
      
      * unit test for fwd batch norm+relu MKLDNN fuse
      9a15c923
  7. 25 6月, 2018 2 次提交
  8. 11 6月, 2018 1 次提交
    • Q
      Add some dist-training robust cases into fluid benchmark test (#11207) · 1cfd3cb1
      Qiyang Min 提交于
      * 1. add weight decay feature into fluid benchmark test
      2. add learning rate decay feature into fluid benchmark test
      3. add L1&L2 regularization feature into fluid benchmark test
      4. add error clipping feature into fluid benchmark test
      5. add gradient clipping feature into fluid benchmark test
      
      * Add some document to README.md under benchmark/fluid/ repo
      
      * Add model_base.py
      
      * Fix bugs in test_listen_and_serv_op
      
      * 1. remove args out of fluid_benchmark.py
      2. remove lr_decay, regularization, clipping out of fluid_benchmark.py
      
      * add async_mode description to doc and remove the clipping description out
      
      * for restart build
      
      * to restart build
      
      * remove optimization args from args.py
      
      * 1. remove optimization from models
      2. fix bug in test_listen_and_serv_op
      
      * change the name retry_times to left_time
      
      * change retry_times to the pserver start left time
      1cfd3cb1