- 26 Oct, 2021, 2 commits
Committed by Haohongxiang
* fix bugs in HybridParallelClipGrad of hybrid_parallel_optimizer (#36237)
* fix bugs in HybridParallelClipGrad of hybrid_parallel_optimizer
* update
* update
* fix bugs in mp_layers, pp_layers and HybridParallelClipGrad (#36144)
* fix calling bug of HybridParallelClipGrad
* fix bugs of HybridParallelClipGrad
* add unittest of pp with HybridParallelClipGrad
* fix bugs in mp_layers.py
* update
* fix bugs in pp_layers.py
* update
* [HybridParallel]Rebuild code for pipeline (#36396)
* add no_sync for parameters sync
* add pipeline for moe
* [HybridParallel]Support fp16 in dygraph hybrid parallel (#36420)
* [HybridParallel]Support fp16 in dygraph hybrid parallel
* update
* update
* update for recompute
* add unittest of pp+fp16
* add unittest of recompute+fp16
* update
* modify ut
* modify ut of cond (#36475)
* fix bugs of ClipGradByGlobalNorm in HybridParallel (#36555)
* fix bugs of ClipGradByGlobalNorm
* add unittests
* add unittests
* [HybridParallel]fix bug of check_inf in fleet_base.py (#36651)
* fix bug of check_inf
* fix allreduce
* support ClipGradByGlobalNorm in sharding (#36012)
* support ClipGradByGlobalNorm in sharding
* support ClipGradByGlobalNorm in sharding
* test=allcase
* Update test_linalg_cond.py
* Update hybrid_parallel_util.py
* Update hybrid_parallel_util.py
Co-authored-by: ShenLiang <1422485404@qq.com>
Co-authored-by: zhaoyingli <86812880+zhaoyinglia@users.noreply.github.com>
-
Committed by Leo Chen
* refine amp level * fix typo * update tracer._amp_level
-
- 17 Sep, 2021, 1 commit
Committed by zhangbo9674
* add pure fp16 major function in auto_cast & tracer
* support master weight in dygraph for pure fp16
* check mix dtype of fp16&fp32 for check_finite_and_unscale op
* change pure fp16 function name
* refine some bug in auto_cast
* refine auto_cast interface logic
* add param _casted_by_pure_fp16 for class Layer
* support state_dict hook for save model by user appointed dtype in pure_fp16_decorator
* refine pure_fp16_decorator as decorator
* add unittest
* add comment
* add comment
* support recompute
* add comment for auto_cast and decorator
* support to_static_state_dict for paddle.jit.save
* unlimit models num and optimizers num
* add lookup_table in black_list
* fix momentum and layer state_dict
* fix bug in layer state_dict
* fix bug in layer state_dict_helper
* refine unittest
* refine test_momentun_op
* refine interface and some code
* refine amp_decorator interface
* refine pure fp16 interface
* refine master weight interface
-
- 15 Sep, 2021, 1 commit
Committed by Haohongxiang
-
- 14 Sep, 2021, 1 commit
Committed by Haohongxiang
* Add solutions to PyLayer which is unsupported in DataParallel
* modify note format for parallel.py
* modify docs of dataparallel
* add docs of dp with pylayer
* modify docs format
* modify example format
* change example of dp with pylayer
* add unittest for dp with pylayer
* modify ut
* merge latest codes
* update
* modify for CI-Coverage
* modify text-indent
-
- 13 Sep, 2021, 1 commit
Committed by Guoxia Wang
* support hybrid parallel inference helper class
-
- 09 Aug, 2021, 1 commit
Committed by JZ-LIANG
-
- 05 Jul, 2021, 1 commit
Committed by danleifeng
* fix hdfs and fleet_util for supporting save/load infer;test=develop
-
- 01 Jul, 2021, 1 commit
Committed by JZ-LIANG
* dygraph sharding * update unittest hybrid_parallel_communicate_group
-
- 21 Jun, 2021, 1 commit
Committed by tianshuo78520a
* del py2 code2 * fix test timeout
-
- 07 Jun, 2021, 1 commit
Committed by zhangchunle
-
- 03 Jun, 2021, 1 commit
Committed by JZ-LIANG
* Dygraph Recompute support AMP * dygraph recompute: update unittest
-
- 17 May, 2021, 1 commit
Committed by ShenLiang
* fix precision of mp * fix bug of seed * fix dp * print group
-
- 06 May, 2021, 1 commit
Committed by zhiboniu
-
- 30 Apr, 2021, 1 commit
Committed by Feiyu Chan
avoid polluting logging's root logger
-
- 26 Apr, 2021, 1 commit
Committed by ShenLiang
* fix model parallel * rm parallel_help.py * add embedding
-
- 25 Apr, 2021, 2 commits
- 22 Apr, 2021, 1 commit
Committed by ShenLiang
* add clip/check * add amp & clip grad in dygraph * add logging
-
- 19 Apr, 2021, 1 commit
Committed by ShenLiang
* support dp & mp
-
- 07 Apr, 2021, 1 commit
Committed by tangwei12
* add PullSparseValue for pull sparse * fix bug for PullSparseValue * add test mode in lookuptable * revert API change * add comment for is_training
-
- 02 Mar, 2021, 1 commit
Committed by danleifeng
* topo and memory performance for heterps; test=develop * add trainwithprofiler in heter trainer; test=develop
-
- 14 Jan, 2021, 1 commit
Committed by 123malin
* test=develop, add distributed_infer
-
- 24 Dec, 2020, 1 commit
Committed by tangwei12
* oneps (3/4)
Co-authored-by: MrChengmo <cmchengmo@163.com>
Co-authored-by: malin10 <malin10@baidu.com>
Co-authored-by: chengmo <chengmo@baidu.com>
-
- 08 Dec, 2020, 1 commit
Committed by lilong12
* update, test=develop
-
- 30 Nov, 2020, 1 commit
Committed by 123malin
* test=develop, rm pathlib
-
- 16 Oct, 2020, 1 commit
Committed by lilong12
* update, test=develop
-
- 29 Sep, 2020, 1 commit
Committed by lilong12
* add gloo initializer, test=develop
-
- 28 Sep, 2020, 3 commits
Committed by Qinghe JING
* set default value to strategy in distributed_optimizer test=develop
-
Committed by lilong12
-
Committed by lilong12
* add gloo initializer, test=develop
-
- 24 Sep, 2020, 1 commit
Committed by 123malin
* test=develop, bug fix
-
- 17 Sep, 2020, 1 commit
Committed by 123malin
* test=develop, util documents
-
- 16 Sep, 2020, 1 commit
Committed by gongweibao
-
- 29 Aug, 2020, 1 commit
Committed by Dong Daxiang
* fix api document
-
- 13 Aug, 2020, 1 commit
Committed by Dong Daxiang
* move paddle.fleet to paddle.distributed.fleet
-