1. 28 3月, 2019 2 次提交
  2. 20 3月, 2019 1 次提交
    • C
      Fuse AllReduce (#15921) · f26ba5bd
      chengduo 提交于
      * fuse all_reduce
      test=develop
      
      * add fuse_parameter_groups_size
      test=develop
      
      * Polish code
      test=develop
      
      * Fix travis-ci
      test=develop
      
      * Add SetGroupAccordingToLayers and SetGroupAccordingToGroupSize
      test=develop
      
      * Add SetGroupAccordingToMemorySize
      test=develop
      
      * fix multi_devices_graph
      test=develop
      
      * reset params_grads
      test=develop
      
      * Polish code
      test=develop
      f26ba5bd
  3. 15 3月, 2019 1 次提交
    • Q
      Support sync batch norm. (#16121) · 8ad672a2
      qingqing01 提交于
      * Support Sync Batch Norm.
      * Note, do not enable it in one device.
      
      Usage:
      
      build_strategy = fluid.BuildStrategy()
      build_strategy.sync_batch_norm = True
      binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
              loss_name=loss_mean.name,
              build_strategy=build_strategy)
      8ad672a2
  4. 06 3月, 2019 1 次提交
    • L
      add IfElse test case for ir memory optimize (#15998) · 9cc6f400
      liuwei1031 提交于
      * add ir memory optimize test case for IfElse op, test=develop
      
      * fix some unitttest failure by force using the python memory_optimize, test=develop
      
      * tweak comments, test=develop
      
      * fix unittest, test=develop
      
      * fix unittest, test=develop
      9cc6f400
  5. 05 3月, 2019 1 次提交
    • L
      add IfElse test case for ir memory optimize (#15998) · caadd058
      liuwei1031 提交于
      * add ir memory optimize test case for IfElse op, test=develop
      
      * fix some unitttest failure by force using the python memory_optimize, test=develop
      
      * tweak comments, test=develop
      
      * fix unittest, test=develop
      
      * fix unittest, test=develop
      caadd058
  6. 21 2月, 2019 1 次提交
  7. 11 2月, 2019 1 次提交
  8. 31 1月, 2019 1 次提交
  9. 22 1月, 2019 1 次提交
  10. 21 1月, 2019 2 次提交
    • D
      squash commits. test=develop · 8f3b2523
      dzhwinter 提交于
      8f3b2523
    • D
      Memory optimization of depthwise conv op and group norm op (#15313) · 9f8f0fc2
      Dun 提交于
      * mem opt
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * refine code  test=develop
      
      * refine code  test=develop
      
      * refine code  test=develop
      
      * refine code  test=develop
      
      * refine with cub test=develop
      
      * fix mkldnn test && remove comments && test=develop
      
      * polish code && test=develop
      
      * add only_forward test && test=develop
      9f8f0fc2
  11. 07 1月, 2019 1 次提交
    • C
      Refactor MultiDevSSAGraphBuilder (#15090) · eabb2105
      chengduo 提交于
      * Refactor ParallelExecutor
      test=develop
      
      * extract Reduce and AllReduce mode from MultiDevSSAGraphBuilder
      test=develop
      
      * Refactor MultiDevSSAGraphBuilder
      test=developt
      
      * Remove enable_data_balance
      test=develop
      
      * code refine
      test=develop
      
      * remove data balance
      test=develop
      
      * refine ScaleLossGradOp
      test=develop
      
      * remove uncessary file
      test=develop
      
      * code refine
      test=develop
      
      * modify  function name
      test=develop
      
      * follow comments
      test=develop
      
      * add is_distribution field
      test=develop
      
      * set is_distribution
      test=develop
      
      * fix DistSSAGraphBuilder
      test=develop
      eabb2105
  12. 28 12月, 2018 1 次提交
  13. 26 12月, 2018 1 次提交
  14. 20 12月, 2018 1 次提交
  15. 18 12月, 2018 1 次提交
    • D
      add ir memory optimize. (#14530) · 7cd24b13
      dzhwinter 提交于
      * follow comments. test=develop
      
      * Fix typo
      
      * fix compile error. test=develop
      
      * merge develop branch. test=develop
      
      * Remove set_equal
      
      * Polish code
      
      * Delete unused functions
      
      test=develop
      
      * polish code. test=develop
      
      * follow comment
      
      * polish code.
      
      * fix windows compile error. test=develop
      
      * fix op handle.
      
      * rerun ci. test=develop
      
      * rerun ci. test=develop
      
      * rerun macci. test=develop
      
      * polish code. test=develop
      
      * rewrite sort code. test=develop
      
      * remove unused code. test=develop
      
      * fix tests. test=develop
      
      * fix conflict. test=develop
      
      * follow comment. test=develop
      
      * merge develop branch. test=develop
      
      * fix tests. test=develop
      
      * remove ToTypeIndex. test=develop
      
      * rerun ci. test=develop
      7cd24b13
  16. 17 12月, 2018 1 次提交
  17. 07 12月, 2018 1 次提交
  18. 27 11月, 2018 1 次提交
  19. 22 11月, 2018 1 次提交
  20. 13 11月, 2018 2 次提交
  21. 06 11月, 2018 1 次提交
  22. 02 11月, 2018 1 次提交
  23. 29 10月, 2018 2 次提交
    • W
      [1.1] [project] train imagenet using large batch size (#13766) · 26200f2e
      Wu Yi 提交于
      * fix nccl2 lars dist support
      
      * put lars in momentum op
      
      * add tests lars
      
      * fix ci
      
      * fix cpu kernel
      
      * soft warning
      
      * remove lars in test_recognize_digits.py
      
      * move to another op
      
      * add file
      
      * update api.spec test=develop
      
      * update test=develop
      
      * fix api.spec test=develop
      
      * wip
      
      * wip, finish grad merge ops
      
      * wip, finish graph build
      
      * wip test running
      
      * work on 1 gpu
      
      * workable version
      
      * update
      
      * fix tests
      
      * fuse broadcast op
      
      * fix compile failed
      
      * refine
      
      * add batch merge test mnist
      
      * fix CI test=develop
      
      * fix build
      
      * use independent bn params for batch merge test=develop
      
      * update api.spec
      
      * follow comments and for test
      
      * wip
      
      * refine tests test=develop
      
      * follow comments test=develop
      
      * remove startup bn modify test=develop
      
      * follow comments test=develop
      
      * fix merge test=develop
      26200f2e
    • S
      test=develop · 2414f92f
      sneaxiy 提交于
      2414f92f
  24. 12 10月, 2018 1 次提交
  25. 25 9月, 2018 4 次提交
  26. 20 9月, 2018 1 次提交
    • C
      Feature/op_fuse_pass (#12440) · d402234b
      chengduo 提交于
      * Add Preface
      
      * Add demo code
      
      * Save file
      
      * Refine code
      
      * seems can work
      
      * use elementwise strategy
      
      * Use ElementwiseComputeEx
      
      * Add comments
      
      * extract functions from operator
      
      * Refine code
      
      * Follow comment
      
      * code refine
      
      * add op_fuse  pass
      
      * add backward
      
      * code refine
      
      * use TopologySortOperations
      
      * follow comments
      
      * refine IsFusible
      
      * code enhance
      
      * fix op_fusion_pass
      
      * refine code
      
      * refine fuse_elemwise_act_op
      
      * adjust the input and output
      
      * refine logic
      
      * add intermediate_edge
      
      * disable inplace
      
      * follow comments
      
      * refine logic
      
      * follow comments
      
      * Remove the removable IntermediateOut
      
      * change strategy
      
      * code refine
      
      * enable fuse backward
      
      * code refine
      
      * code refine
      
      * rename unit test
      
      * follow comments
      d402234b
  27. 07 8月, 2018 1 次提交
  28. 06 7月, 2018 1 次提交
  29. 02 7月, 2018 1 次提交
  30. 06 6月, 2018 1 次提交
    • Y
      SSA Graph Builder Factory · d9af1532
      yuyang18 提交于
      * Use Builder Chain to decorate new builders. It is easy to extend
        builders.
      * Make graphviz path as a build strategy, not a FLAGS.
      d9af1532
  31. 15 5月, 2018 1 次提交
  32. 14 5月, 2018 2 次提交