1. 30 Aug, 2023 2 commits
    • [Auto Parallel] Compatible new comm library upgrade (#56604) · ade51aa5
      Ghost Screaming committed
      * for verify
      
      fluid operator support new comm library
      
      * u
      
      * u
      
      * u
      
      * Compatible new comm library upgrade for c_allgather, c_reduce, c_reduce_scatter and c_scatter.
      
      * Remove useless comments in process_group.py
      
      * Polish code style.
      
      * Fix some problems.
      
      * Remove use fluid api in phi comm_context_manager.
      
      * Add PADDLE_WITH_CUDA and PADDLE_WITH_NCCL macro checks.
      
      * Fix bug of HIP architecture.
      
      * Fix some problems.
      1. Remove useless logging.
      2. Fix conditional compilation for HIP.
      3. Fix problems in test_pass_generation_pipeline.py. It first calls paddle.distributed.init_parallel_env(),
      then auto.Engine calls _init_comm(), which calls process_group.instantiate(). However, init_parallel_env() calls
      paddle.distributed.barrier(), which calls CreateNCCLEnvCache and creates the corresponding NCCLCommContext. Because dev_id is not
      set at that point, NCCLCommContext's dev_ctx is not initialized.
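
The ordering problem described in item 3 can be illustrated with a toy model. This is only a sketch of one possible reading of the message, not Paddle code: `ToyCommContext`, `ToyCommCache`, `get_or_create` and `ring_id` are hypothetical names, and the assumption that a later call reuses the context created earlier is mine, not stated in the commit.

```python
class ToyCommContext:
    """Hypothetical stand-in for a communication context."""

    def __init__(self, dev_id=None):
        # The device context exists only if a device id was known at creation time.
        self.dev_ctx = f"dev_ctx:{dev_id}" if dev_id is not None else None


class ToyCommCache:
    """Hypothetical cache that creates one context per ring on first use."""

    def __init__(self):
        self._contexts = {}

    def get_or_create(self, ring_id, dev_id=None):
        # Assumption: an existing context is returned unchanged, even when a
        # later caller supplies a device id.
        if ring_id not in self._contexts:
            self._contexts[ring_id] = ToyCommContext(dev_id)
        return self._contexts[ring_id]


cache = ToyCommCache()
early = cache.get_or_create(ring_id=0)           # early call, no dev_id yet
late = cache.get_or_create(ring_id=0, dev_id=3)  # later call gets the same object
```

In this toy model the first caller decides whether `dev_ctx` is set, so whichever code path runs first has to know the device id.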
      
      * Fix some problems.
      
      * Polish code.
      
      * Polish code.
      
      * Revert compatible upgrade for communication operators. Their upgrades
      will be submitted in another PR.
      
      * Remove StaticTCPStore.
      
      * Remove useless modification.
      
      * Remove useless set_cuda_device_id.
      
      * Polish code.
      
      * Remove fluid header files in phi files.
      
      * Remove useless comments.
      
      * Fix problems of hip arch.
      
      * Fix some problems.
      
      * Polish code.
      
      * Polish code style.
      
      ---------
      Co-authored-by: hitywt <yuwentao126@126.com>
    • [xdoctest] reformat example code with google style in No.307 (#56595) · 34eecb0e
      张春乔 committed
      * weight_norm_hook
      
      * Update weight_norm_hook.py
      
      * Update weight_norm_hook.py
      
      * Update python/paddle/nn/utils/weight_norm_hook.py
      
      * Update python/paddle/nn/utils/weight_norm_hook.py
      
      * Update python/paddle/nn/utils/weight_norm_hook.py
      Co-authored-by: Nyakku Shigure <sigure.qaq@gmail.com>
      
      * xdoc
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      
      ---------
      Co-authored-by: Nyakku Shigure <sigure.qaq@gmail.com>
  2. 29 Aug, 2023 2 commits
  3. 28 Aug, 2023 2 commits
  4. 25 Aug, 2023 4 commits
  5. 24 Aug, 2023 1 commit
  6. 23 Aug, 2023 1 commit
  7. 22 Aug, 2023 5 commits
  8. 21 Aug, 2023 2 commits
  9. 19 Aug, 2023 1 commit
  10. 18 Aug, 2023 1 commit
  11. 17 Aug, 2023 1 commit
  12. 16 Aug, 2023 2 commits
    • Add mp_all_reduce asynchronous overlap. (#55662) · 6b1dfb5f
      Ghost Screaming committed
      * [WIP] Add mp_all_reduce asynchronous overlap.
      
      * Fix some problems.
      
      * Fix dw compute bug, and use a temporary solution to achieve overlap.
      
      * Use fused_linear_param_grad_add to compute dw.
      
      * Reformat ColumnParallel _overlap_linear. Use environment flags to
      control the following behaviors:
      1. export Flags_mp_aysnc_allreduce=True to turn on mp async all_reduce.
      2. export Flags_skip_mp_c_identity=True to skip two c_identity operators
         in dygraph mode.
      3. export Flags_fused_linear_param_grad_add to enable fused_linear_param_grad_add
         in ColumnParallel backward with mp async all_reduce.
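
As a rough illustration of how such environment flags might be read, here is a minimal sketch. The three flag names are copied verbatim from the message above, including their spelling. The helper names `read_bool_flag` and `read_mp_flags`, and the rule that only "1" or "true" (case-insensitive) counts as enabled, are assumptions for illustration and not Paddle's actual parsing logic.

```python
import os

# Flag names as written in the commit message.
MP_FLAGS = (
    "Flags_mp_aysnc_allreduce",
    "Flags_skip_mp_c_identity",
    "Flags_fused_linear_param_grad_add",
)


def read_bool_flag(name, default=False, environ=None):
    """Read one boolean flag; the accepted truthy values are an assumption."""
    environ = os.environ if environ is None else environ
    raw = environ.get(name)
    if raw is None:
        return default
    return raw.strip().lower() in ("1", "true")


def read_mp_flags(environ=None):
    """Return a dict mapping each flag name to its parsed boolean value."""
    return {name: read_bool_flag(name, environ=environ) for name in MP_FLAGS}
```

Passing a plain dict as `environ` keeps the helper easy to test without touching the real process environment; with no argument it falls back to `os.environ`.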
      
      * Polish code.
      
      * Remove useless communication API.
      
      * Fix some problems in mp_async_all_reduce and skip_c_identity.
      
      * Add test cases.
      
      * Remove environment variable Flags_fused_linear_param_grad_add in test case.
      
      * Reset error threshold.
      
      * Reset threshold in test case.
      
      * Add useful log. Remove useless test cases.
    • make params_grads order same between dynamic and auto_parallel (#56126) · 496422e9
      zhaoyingli committed
      * make params_grads order same between dynamic and static mode
      
      * revert inplace clip
      
      * use sorted attribute to control
      
      * tiny fix
      
      * fix find loss_grad_op
  13. 15 Aug, 2023 1 commit
  14. 14 Aug, 2023 5 commits
  15. 11 Aug, 2023 4 commits
    • remove the optimizer base and learning rate base (#56099) · 6eaed2da
      LoneRanger committed
      * remove the optimizer base and learning rate base
      
      * fix bug
      
      * fix bug
    • 28f74a0e
    • replace fluid.io.load_inference_model, fluid.io.save_inference_model in fluid... · bfc64801
      Difer committed
      replace fluid.io.load_inference_model, fluid.io.save_inference_model in fluid with 2.0 version (#55345)
      
      * replace fluid.io.load_inference_model
      
      * replace fluid.io.save_inference_model
      
      * fix some bug
      
      * fix some bugs of load & save model
      
      * fix some bug
      
      * fix test_inference_model_io bug
      
      * fix word2vec_inference_model bug
      
      * fix some bug
      
      * fix valueError bug
      
      * fix some bug
      
      * fix a warning error
      
      * for debug
      
      * for debug
      
      * fix io error
      
      * fix test_wordvec_book error
      
      * remove debug print
      
      * fix load_var bug
      
      * for debug cinn test
      
      * revert cinn & fix inference_pass_test in windows
      
      * fix some bugs
      
      * revert cinn & fix inference_pass_test in windows
      
      * for debug vars
      
      * for debug
      
      * fix quant_dequant_test
      
      * fix some path errors
      
      * remove fluid save/load
      
      * fix incubate-fleet save
      
      * move some from fluid.io to static.io
    • move some fluid apis (#55986) · eafc9889
      Difer committed
      * move fluid apis
      
      * fix type error
      
      * remove static exponential_decay
      
      * fix some import error
      
      * remove nn.py
      
      * fix some error
      
      * fix type error
  16. 09 Aug, 2023 2 commits
    • remove the... · 723c6f77
      LoneRanger committed
      remove the AdamOptimizer, SGDOptimizer, MomentumOptimizer, ModelAverage, LookaheadOptimizer, FtrlOptimizer, DecayedAdagradOptimizer, DpsgdOptimizer in fluid and relocate the ExponentialMovingAverage, PipelineOptimizer, GradientMergeOptimizer and change optimizer base for LarsMomentumOptimizer and RecomputeOptimizer (#55970)
      
      * change the optimizer base for SGDOptimizer
      
      * change the optimizer base for SGDOptimizer
      
      * replace the SGDOptimizer with SGD
      
      * fix bug of sgd
      
      * change the optimizer base for MomentumOptimizer
      
      * fix the remaining tests
      
      * remove the Momentum in fluid/optimizer.py
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * Update test_resnet_cinn.py
      
      * Update test_resnet_prim_cinn.py
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * remove the ModelAverage in fluid
      
      * remove the LookaheadOptimizer in fluid
      
      * fix bug
      
      * remove AdamOptimizer in fluid
      
      * Update test_image_classification_fp16.py
      
      * fix bug
      
      * relocate the ExponentialMovingAverage in fluid
      
      * restore the static api
      
      * remove the FtrlOptimizer in fluid
      
      * remove the DecayedAdagradOptimizer in fluid
      
      * remove the DpsgdOptimizer in fluid
      
      * fix bug
      
      * fix codestyle
      
      * fix bug
      
      * fix bug
      
      * relocate the PipelineOptimizer
      
      * relocate the GradientMergeOptimizer
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix doc
      
      * Update __init__.py
      
      * Update test_fleet_qat_meta_optimizer.py
      
      * change optimizer base for LarsMomentumOptimizer
      
      * fix bug
      
      * fix conflict
      
      * fix code-style
      
      * fix sample codes
      
      * fix bug
      
      * fix bug
      
      * fix cinn bug
      
      * fix bug
      
      * fix bug
      
      * Update qat_optimizer.py
      
      * Update __init__.py
      
      * fix bug
      
      * change optimizer base for RecomputeOptimizer
      
      * fix bug
      
      * fix bug
      
      * Update test_imperative_optimizer_v2.py
    • cherry pick #55651 and #55890 (#56063) · fa878846
      Yuang Liu committed
  17. 08 Aug, 2023 3 commits
  18. 07 Aug, 2023 1 commit