1. 11 Jul, 2019 1 commit
    • Feature/buffer_shared_inplace (#17911) · d3003a16
      Zeng Jinle committed
      * feature/buffer_shared_inplace, test=develop
      
      * refine code, test=develop
      
      * fix elementwise_add op cpu inplace and sum inplace bug, test=develop
      
      * add unittest and debug log, test=develop
      
      * fix parallel_executor scope bug, polish code, test=develop
      
      * fix sum op, activation op, single_in_place_inference bug, test=develop
      
      * remove kLocalExecScopeName, test=develop
      
      * fix unittest,test=develop
      
      * fix out_var first version bug, test=develop
      
      * follow comments,test=develop
  2. 14 Jun, 2019 1 commit
  3. 06 Jun, 2019 1 commit
  4. 27 May, 2019 1 commit
  5. 20 May, 2019 1 commit
  6. 14 May, 2019 1 commit
  7. 08 May, 2019 1 commit
  8. 06 May, 2019 1 commit
  9. 23 Apr, 2019 1 commit
  10. 21 Apr, 2019 1 commit
    • Refine model gpu memory (#16993) · 1202d3fc
      Zeng Jinle committed
      * speedup gc and inplace softmax_with_cross_entropy_grad
      test=develop
      
      * refine models gpu mem
      Merge skip vars and warning messages of mem opt
      remove relu mem opt
      test=develop
      
      * follow comments
      test=develop
  11. 12 Apr, 2019 1 commit
  12. 11 Apr, 2019 1 commit
  13. 08 Apr, 2019 2 commits
  14. 03 Apr, 2019 1 commit
  15. 28 Mar, 2019 2 commits
  16. 22 Mar, 2019 1 commit
    • [Speed]Refine ParallelExecutor (#16190) · a6a3b2fb
      chengduo committed
      * refine parallelExecutor
      test=develop
      
      * Polish op_handle
      test=develop
      
      * Remove unnecessary op_handle
      test=develop
      
      * Fix Travis CI
      test=develop
      
      * Fix fetch bug
      test=develop
      
      * Remove WaitInputVarGenerated
      
      * Fix OpHandleBase::Run
      test=develop
      
      * debug
      test=develop
      
      * use origin fetch_op_handle
      test=develop
      
      * Revert op_handle_base.cc
      test=develop
      
      * Polish code
      test=develop
      
      * Fix OpHandleBase::Run
      test=develop
      
      * code refine
      
      * test CI and CE
      test=develop
      
      * fix OpHandle::Run
      test=develop
      
      * refine AllReduceOpHandle
      test=develop
      
      * Polish code
      test=develop
  17. 20 Mar, 2019 1 commit
    • Fuse AllReduce (#15921) · f26ba5bd
      chengduo committed
      * fuse all_reduce
      test=develop
      
      * add fuse_parameter_groups_size
      test=develop
      
      * Polish code
      test=develop
      
      * Fix travis-ci
      test=develop
      
      * Add SetGroupAccordingToLayers and SetGroupAccordingToGroupSize
      test=develop
      
      * Add SetGroupAccordingToMemorySize
      test=develop
      
      * fix multi_devices_graph
      test=develop
      
      * reset params_grads
      test=develop
      
      * Polish code
      test=develop
  18. 15 Mar, 2019 1 commit
    • Support sync batch norm. (#16121) · 8ad672a2
      qingqing01 committed
      * Support Sync Batch Norm.
      * Note: do not enable it when running on a single device.
      
      Usage:
      
      build_strategy = fluid.BuildStrategy()
      build_strategy.sync_batch_norm = True
      binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
              loss_name=loss_mean.name,
              build_strategy=build_strategy)
  19. 07 Mar, 2019 1 commit
  20. 05 Mar, 2019 1 commit
  21. 23 Feb, 2019 1 commit
  22. 22 Feb, 2019 2 commits
  23. 21 Feb, 2019 2 commits
  24. 19 Feb, 2019 1 commit
  25. 18 Feb, 2019 2 commits
  26. 14 Feb, 2019 3 commits
  27. 12 Feb, 2019 2 commits
  28. 11 Feb, 2019 2 commits
  29. 08 Feb, 2019 1 commit
  30. 07 Feb, 2019 1 commit
  31. 31 Jan, 2019 1 commit