1. 16 Aug, 2023 · 2 commits
    • G
      Add mp_all_reduce asynchronous overlap. (#55662) · 6b1dfb5f
      Ghost Screaming committed
      * [WIP] Add mp_all_reduce asynchronous overlap.
      
      * Fix some problems.
      
      * Fix dw compute bug, and use a temporary solution to achieve overlap.
      
      * Use fused_linear_param_grad_add to compute dw.
      
      * Reformat ColumnParallel _overlap_linear. Use environment flags to
      control the following behaviors:
      1. export Flags_mp_aysnc_allreduce=True to turn on mp async all_reduce
      2. export Flags_skip_mp_c_identity=True to skip two c_identity operators
         in dygraph mode.
      3. export Flags_fused_linear_param_grad_add to enable fused_linear_param_grad_add
         in ColumnParallel backward with mp async all_reduce.
      
      * Polish code.
      
      * Remove useless communication API.
      
      * Fix some problems in mp_async_all_reduce and skip_c_identity.
      
      * Add test cases.
      
      * Remove environment variable Flags_fused_linear_param_grad_add in test case.
      
      * Reset error threshold.
      
      * Reset threshold in test case.
      
      * Add useful log. Remove useless test cases.
    • Z
      make params_grads order same between dynamic and auto_parallel (#56126) · 496422e9
      zhaoyingli committed
      * make params_grads order same between dynamic and static mode
      
      * revert inplace clip
      
      * use sorted attribute to control
      
      * tiny fix
      
      * fix find loss_grad_op
  2. 15 Aug, 2023 · 1 commit
  3. 14 Aug, 2023 · 5 commits
  4. 11 Aug, 2023 · 4 commits
    • L
      remove the optimizer base and learning rate base (#56099) · 6eaed2da
      LoneRanger committed
      * remove the optimizer base and learning rate base
      
      * fix bug
      
      * fix bug
    • K
      28f74a0e
    • D
      replace fluid.io.load_inference_model, fluid.io.save_inference_model in fluid... · bfc64801
      Difer committed
      replace fluid.io.load_inference_model, fluid.io.save_inference_model in fluid with 2.0 version (#55345)
      
      * replace fluid.io.load_inference_model
      
      * replace fluid.io.save_inference_model
      
      * fix some bug
      
      * fix some bugs of load & save model
      
      * fix some bug
      
      * fix test_inference_model_io bug
      
      * fix word2vec_inference_model bug
      
      * fix some bug
      
      * fix valueError bug
      
      * fix some bug
      
      * fix a warning error
      
      * for debug
      
      * for debug
      
      * fix io error
      
      * fix test_wordvec_book error
      
      * remove debug print
      
      * fix load_var bug
      
      * for debug cinn test
      
      * revert cinn & fix inference_pass_test in windows
      
      * fix some bugs
      
      * revert cinn & fix inference_pass_test in windows
      
      * for debug vars
      
      * for debug
      
      * fix quant_dequant_test
      
      * fix some path errors
      
      * remove fluid save/load
      
      * fix incubate-fleet save
      
      * move some from fluid.io to static.io
    • D
      move some fluid apis (#55986) · eafc9889
      Difer committed
      * move fluid apis
      
      * fix type error
      
      * remove static exponential_decay
      
      * fix some import error
      
      * remove nn.py
      
      * fix some error
      
      * fix type error
  5. 09 Aug, 2023 · 2 commits
    • L
      remove the... · 723c6f77
      LoneRanger committed
      remove the AdamOptimizer, SGDOptimizer, MomentumOptimizer, ModelAverage, LookaheadOptimizer, FtrlOptimizer, DecayedAdagradOptimizer, DpsgdOptimizer in fluid and relocate the ExponentialMovingAverage, PipelineOptimizer, GradientMergeOptimizer and change optimizer base for LarsMomentumOptimizer and RecomputeOptimizer (#55970)
      
      * change the optimizer base for SGDOptimizer
      
      * change the optimizer base for SGDOptimizer
      
      * replace the SGDOptimizer with SGD
      
      * fix bug of sgd
      
      * change the optimizer base for MomentumOptimizer
      
      * fix the remaining tests
      
      * remove the Momentum in fluid/optimizer.py
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * Update test_resnet_cinn.py
      
      * Update test_resnet_prim_cinn.py
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * remove the ModelAverage in fluid
      
      * remove the LookaheadOptimizer in fluid
      
      * fix bug
      
      * remove AdamOptimizer in fluid
      
      * Update test_image_classification_fp16.py
      
      * fix bug
      
      * relocate the ExponentialMovingAverage in fluid
      
      * restore the static api
      
      * remove the FtrlOptimizer in fluid
      
      * remove the DecayedAdagradOptimizer in fluid
      
      * remove the DpsgdOptimizer in fluid
      
      * fix bug
      
      * fix codestyle
      
      * fix bug
      
      * fix bug
      
      * relocate the PipelineOptimizer
      
      * relocate the GradientMergeOptimizer
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix doc
      
      * Update __init__.py
      
      * Update test_fleet_qat_meta_optimizer.py
      
      * change optimizer base for LarsMomentumOptimizer
      
      * fix bug
      
      * fix conflict
      
      * fix code-style
      
      * fix sample codes
      
      * fix bug
      
      * fix bug
      
      * fix cinn bug
      
      * fix bug
      
      * fix bug
      
      * Update qat_optimizer.py
      
      * Update __init__.py
      
      * fix bug
      
      * change optimizer base for RecomputeOptimizer
      
      * fix bug
      
      * fix bug
      
      * Update test_imperative_optimizer_v2.py
    • Y
      cherry pick #55651 and #55890 (#56063) · fa878846
      Yuang Liu committed
  6. 08 Aug, 2023 · 3 commits
  7. 07 Aug, 2023 · 1 commit
  8. 02 Aug, 2023 · 1 commit
  9. 01 Aug, 2023 · 3 commits
  10. 31 Jul, 2023 · 1 commit
  11. 27 Jul, 2023 · 1 commit
  12. 25 Jul, 2023 · 1 commit
  13. 24 Jul, 2023 · 4 commits
  14. 22 Jul, 2023 · 2 commits
  15. 20 Jul, 2023 · 4 commits
  16. 19 Jul, 2023 · 4 commits
  17. 14 Jul, 2023 · 1 commit
    • C
      [AutoTuner] Distribute best cfg (#54834) · 7f6d222f
      caozhou committed
      * distribute best cfg
      
      * adapt to multi args transmission
      
      * update metric extracting
      
      * fix bugs of prune and reading log
      
      * fix time default value
      
      * remove time record
      
      * adjust the order of searching dim
      
      * fix prune bugs
      
      * fix adding cfg bug
      
      * fix multi nodes bug
      
      * reset status
      
      * remove alarm and set logdir
      
      * deepcopy ctx
      
      * change alarm
      
      * fix restart bug
      
      * add exit
      
      * best no need alarm
      
      * add warmup time