- 20 Jul, 2023 3 commits
-
-
Committed by XiaociZhang
* [Kunlun] Modify some legacy code on distributed training

  There were limitations on XPUs before, such as concat/split not being supported and c_broadcast supporting only fp32. These limitations were lifted recently. This PR also adds support for multi-device profiling on XPU. Without this PR, devices that enable profiling issue a hanging broadcast, which eventually leads to a kernel timeout error.
* fix typo
-
Committed by zhenhailiu
-
Committed by Yuang Liu
-
- 19 Jul, 2023 4 commits
-
-
Committed by zhaoyingli
-
Committed by caozhou
-
Committed by 陶泽伟
-
Committed by Yuang Liu
-
- 14 Jul, 2023 1 commit
-
-
Committed by caozhou
* distribute best cfg
* adapt to multi args transmission
* update metric extracting
* fix bugs of prune and reading log
* fix time default value
* remove time record
* adjust the order of searching dim
* fix prune bugs
* fix adding cfg bug
* fix multi nodes bug
* reset status
* remove alarm and set logdir
* deepcopy ctx
* change alarm
* fix restart bug
* add exit
* best no need alarm
* add warmup time
-
- 13 Jul, 2023 5 commits
-
-
Committed by Ruibiao Chen
* Support nvprof for auto parallel
* Fix CI errors
* Fix CI errors
-
Committed by lil-Xing
* add phi operator c_concat and ut
* update create_var use
* update copyright
-
Committed by Leo Chen
* Support AMP program for onnx QAT API
* Integrate QAT into distributed optimizer
* Reduce the size of test data and increase time limit
* Use logger and reduce time limit of unittests
* Rename and move unittest into fleet test
* Test qat_init API
-
Committed by risemeup1
* fix protobuf problem
* fix protobuf problem
-
Committed by Yuang Liu
-
- 11 Jul, 2023 3 commits
-
-
Committed by pangengzheng
* support sharding parallel
* fix name
* fix
* update
* test amp for sharding

---------

Co-authored-by: pangengzheng <pangengzheng.baidu.com>
-
Committed by Wennie396
* format correction
* variable names adjustment
* variable names adjustment, name-->type, value-->sub_program
-
Committed by LoneRanger
Replace the AdagradOptimizer, AdamaxOptimizer, AdadeltaOptimizer, RMSPropOptimizer, LambOptimizer and Momentum (#54152)
* replace the AdadeltaOptimizer with Adadelta
* replace the RMSPropOptimizer with RMSProp
* replace the LambOptimizer with Lamb
* replace the momentum in contrib/optimizer.py with Momentum in python/paddle/optimizer/momentum.py
* fix bug
* fix bug
* fix bug
* fix bug of Lamb
* fix bug of Lamb
* fix bug of import
* replace the AdamaxOptimizer with Adamax and change the optimizer base for AdagradOptimizer
* fix bug
* fix bug
* Update optimizer.py
* fix bug
* fix bug
-
- 06 Jul, 2023 1 commit
-
-
Committed by zhaoyingli
* remove allreduce before c_allgather
* update reshard insert_fill_constant_op func
* insert_fill_constant_op add shape arg
-
- 03 Jul, 2023 1 commit
-
-
Committed by zhenhailiu
-
- 30 Jun, 2023 1 commit
-
-
Committed by sneaxiy
-
- 29 Jun, 2023 3 commits
-
-
Committed by Yuang Liu
* fix vpp mp init prob
* trigger ci
-
Committed by TaoTao Li
* update dygraph collective fix ut
* remove debug log
-
Committed by zhaoyingli
* add skip_gc_vars for 1f1b schedule mode
* add pp_degree and pp_stage
-
- 28 Jun, 2023 2 commits
-
-
Committed by zhaoyingli
* add gc for multi jobs
* fix job.h
* update OpInfo to OpInOutInfo
* update get_skip_gc_vars algo order
-
Committed by LiYuRio
-
- 27 Jun, 2023 2 commits
-
-
Committed by JZ-LIANG
* base rule
* add sharding merge
* add sharding axis merge
* define unified data class for inferencing dist_attr
* test wrap DistTensorSpec in dygraph mode
* matmul main logic done
* define unified data class for inferencing dist_attr

---------

Co-authored-by: Yichen Zhang <zhangyichen03@baidu.com>
-
Committed by LoneRanger
* remove the NaturalExpDecay in fluid
* fix bug
* remove the ExponentialDecay in fluid
* remove the InverseTimeDecay in fluid
* remove the InverseTimeDecay class
* fix bug
-
- 25 Jun, 2023 2 commits
-
-
Committed by Chitsing KUI
-
Committed by zhaoyingli
* auto parallel support pipeline scheduler with standalone executor
* rm check_fetch
* update cmakelist and flags env
* rm set micro batch id
* rm import
* update utils func
* raise error when merge tensor for return_numpy is False
* fix _pipeline_opt
* fix unittest
-
- 20 Jun, 2023 2 commits
-
-
Committed by ShenLiang
-
Committed by Azure
* add auto tuner
* compare and record module
* revert launch main
* add prune rule
* add unit test
* add auto tuner
* revert launch main
* add prune rule
* modify unit test script
* fix bug for dump nodes; fix bug for checking log file
* fix bug

---------

Co-authored-by: caozhou <caozhou@radi.ac.cn>
-
- 19 Jun, 2023 1 commit
-
-
Committed by Chitsing KUI
* no endpoints in dy mode
* fix fleet api inconsistent
-
- 16 Jun, 2023 3 commits
-
-
Committed by Ruibiao Chen
* Run plan in standalone executor
* Update codes
* Update atol and rtol for py3-CI
* Add scope to cache key
* Fix CI errors
* Fix code style
* Update codes
* Remove fetch_name in standalone executor
* Fix UT
* Update codes
* Fix new IR bug
-
Committed by LiYuRio
-
Committed by Yuang Liu
-
- 15 Jun, 2023 3 commits
-
-
Committed by LiYuRio
-
Committed by Haohongxiang
-
Committed by LiYuRio
-
- 14 Jun, 2023 2 commits
- 13 Jun, 2023 1 commit
-
-
Committed by zhenhailiu
* polish
* polish
* polish
* polish
* polish
* polish
* polish
* polish
* polish
* polish
* polish
* polish
-