- 11 8月, 2023 1 次提交
-
-
由 Difer 提交于
* move fluid apis * fix type error * remove static exponential_decay * fix some import error * remove nn.py * fix some error * fix type error
-
- 09 8月, 2023 2 次提交
-
-
由 LoneRanger 提交于
remove the AdamOptimizer、SGDOptimizer、MomentumOptimizer、ModelAverage、LookaheadOptimizer、FtrlOptimizer、DecayedAdagradOptimizer、DpsgdOptimizer in fluid and relocate the ExponentialMovingAverage、PipelineOptimizer、GradientMergeOptimizer and change optimizer base for LarsMomentumOptimizer and RecomputeOptimizer (#55970) * change the optimizer base for SGDOptimizer * change the optimizer base for SGDOptimizer * replace the SGDOptimizer with SGD * fix bug of sgd * change the optimizer base for MomentumOptimizer * fix the remaining tests * remove the Momentum in fluid/optimizer.py * fix bug * fix bug * fix bug * fix bug * Update test_resnet_cinn.py * Update test_resnet_prim_cinn.py * fix bug * fix bug * fix bug * remove the ModelAverage in fluid * remove the LookaheadOptimizer in fluid * fix bug * remove AdamOptimizer in fluid * Update test_image_classification_fp16.py * fix bug * relocate the ExponentialMovingAverage in fluid * restore the static api * remove the FtrlOptimizer in fluid * remove the DecayedAdagradOptimizer in fluid * remove the DpsgdOptimizer in fluid * fix bug * fix codestyle * fix bug * fix bug * relocate the PipelineOptimizer * relocate the GradientMergeOptimizer * fix bug * fix bug * fix bug * fix doc * Update __init__.py * Update test_fleet_qat_meta_optimizer.py * change optimizer base for LarsMomentumOptimizer * fix bug * fix conflict * fix code-style * fix sample codes * fix bug * fix bug * fix cinn bug * fix bug * fix bug * Update qat_optimizer.py * Update __init__.py * fix bug * change optimizer base for RecomputeOptimizer * fix bug * fix bug * Update test_imperative_optimizer_v2.py
-
由 Yuang Liu 提交于
-
- 08 8月, 2023 3 次提交
-
-
由 Ruibiao Chen 提交于
* Improve GC for pipeline parallel * Delete print
-
由 Sonder 提交于
* open * update
-
由 Yuang Liu 提交于
-
- 07 8月, 2023 1 次提交
-
-
由 LiYuRio 提交于
* make tcp store a global instance * fix windows compile error
-
- 02 8月, 2023 1 次提交
-
-
由 zhaoyingli 提交于
* Update autoparallel DistributedDataLoader * add places for engine.dataloder()
-
- 01 8月, 2023 3 次提交
-
-
由 Yuang Liu 提交于
-
由 LiYuRio 提交于
* use string as key for comm_context_manager * remove device_id from comm_context
-
由 pangengzheng 提交于
-
- 31 7月, 2023 1 次提交
-
-
由 Difer 提交于
* simple reaplce * for debug * fix bugs * fix some bugs * del fill_constant_batch_size_like
-
- 27 7月, 2023 1 次提交
-
-
由 sneaxiy 提交于
-
- 25 7月, 2023 1 次提交
-
-
由 Yiqun Liu 提交于
* Call multiply_ instead of scale_ to avoid multiple DtoH copy. * Call _squared_l2_norm to calculate grad_clip. * Fix import error.
-
- 24 7月, 2023 4 次提交
-
-
由 jjyaoao 提交于
Signed-off-by: Njjyaoao <jjyaoao@126.com>
-
由 Windfarer 提交于
-
由 Yuang Liu 提交于
-
由 Chen Weihang 提交于
* add shard tensor api * add DistAttr api * add unittest for coverage * fix process mesh sample code * fix checking error
-
- 22 7月, 2023 2 次提交
-
-
由 zhenhailiu 提交于
-
由 sneaxiy 提交于
* fix new launch * fix ps uit
-
- 20 7月, 2023 4 次提交
-
-
由 Leo Chen 提交于
-
由 XiaociZhang 提交于
* [Kunlun] Mofify some legacy code on distributed training There were limitations on XPUs before, such as concat/split is not supported, and c_broadcast only support fp32. These limitations are lifted recently. Multi-device profiling on XPU will also be supported by this PR. Without this PR, a hanging broadcast will be issued by devices that enables profiling, eventually lead to kernel timeout error. * fix typo
-
由 zhenhailiu 提交于
-
由 Yuang Liu 提交于
-
- 19 7月, 2023 4 次提交
-
-
由 zhaoyingli 提交于
-
由 caozhou 提交于
-
由 陶泽伟 提交于
-
由 Yuang Liu 提交于
-
- 14 7月, 2023 1 次提交
-
-
由 caozhou 提交于
* distribute best cfg * adapt to multi args transmission * update metric extracting * fix bugs of prune and reading log * fix time default value * remove time record * adjust the order of searching dim * fix prune bugs * fix adding cfg bug * fix multi nodes bug * reset status * remove alarm and set logdir * deepcopy ctx * change alarm * fix restart bug * add exit * best no need alarm * add warmup time
-
- 13 7月, 2023 5 次提交
-
-
由 Ruibiao Chen 提交于
* Support nvprof for auto parallel * Fix CI errors * Fix CI errors
-
由 lil-Xing 提交于
* add phi operator c_concat and ut * update create_var use * update copyright
-
由 Leo Chen 提交于
* Support AMP program for onnx QAT API * Integrate QAT into distributed optimizer * Reduce the size of test data and increase time limit * Use logger and reduce time limit of unittests * Rename and move unittest into fleet test * Test qat_init API
-
由 risemeup1 提交于
* fix protobuf problem * fix protobuf problem
-
由 Yuang Liu 提交于
-
- 11 7月, 2023 3 次提交
-
-
由 pangengzheng 提交于
* support sharding parallel * fix name * fix * update * test amp for sharding --------- Co-authored-by: pangengzheng <pangengzheng.baidu.com>
-
由 Wennie396 提交于
* format correction * variable names adjustment * variable names adjustment, name-->type, value-->sub_program
-
由 LoneRanger 提交于
replace the AdagradOptimizer 、adamaxOptimizer、AdadeltaOptimizer、RMSPropOptimizer、LambOptimizer and Momentum (#54152) * replace the AdadeltaOptimizer with Adadelta * replace the RMSPropOptimizer with RMSProp * replace the LambOptimizer with lamb * replace the momentum in contrib/optimizer.py with Momentum in python/paddle/optimizer/momentum.py * fix bug * fix bug * fix bug * fix bug of Lamp * fix bug of Lamp * fix bug of import * replace the AdamaxOptimizer with Admax and change the optimizer base for AdagradOptimizer * fix bug * fix bug * Update optimizer.py * fix bug * fix bug
-
- 06 7月, 2023 1 次提交
-
-
由 zhaoyingli 提交于
* remove allreduce before c_allgather * update reshard insert_fill_constant_op func * insert_fill_constant_op add shape arg
-
- 03 7月, 2023 1 次提交
-
-
由 zhenhailiu 提交于
-
- 30 6月, 2023 1 次提交
-
-
由 sneaxiy 提交于
-