- 11 11月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
-
- 10 11月, 2022 1 次提交
-
-
由 wuhuachaocoding 提交于
* cherry-pick recompute doc update. * update.
-
- 07 11月, 2022 3 次提交
-
-
由 Ligoml 提交于
* #46165 * #45752 * fix some doc bug test=document_fix (#45488) * fix some doc bug test=document_fix * fix some docs issues, test=document_fix * beta -> \beta in softplus * threshold -> \varepsilon in softplus * parameter name * delta -> \delta in smooth_l1_loss * fix some docs test=document_fix * fix docs test=document_fix * fix docs && 增加空行 test=document_fix * Update python/paddle/nn/functional/activation.py, test=document_fix * Update python/paddle/nn/layer/activation.py, test=document_fix Co-authored-by: NSigureMo <sigure.qaq@gmail.com> * [docs] add ipustrategy Hyperlink (#46422) * [docs] add ipustrategy Hyperlink * fix ipu_shard_guard docs; test=document_fix * [docs] add set_ipu_shard note * [docs] fix hyperlink * update framework.py * fix mlu_places docs; test=document_fix * fix put_along_axis docs; test=document_fix * fix flake8 W293 error, test=document_fix * fix typo in typing, test=document_fix Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com> Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com> * #46659 * Update README_cn.md (#46927) 修复了错别字 * #46738 * fix paddle.get_default_dtype (#47040) Chinese and English return values are inconsistent * fix bug Co-authored-by: N张春乔 <83450930+Liyulingyue@users.noreply.github.com> Co-authored-by: NInfinity_lee <luhputu0815@gmail.com> Co-authored-by: Nmrcangye <chenloong@88.com> Co-authored-by: NSigureMo <sigure.qaq@gmail.com> Co-authored-by: Ngouzil <66515297+gouzil@users.noreply.github.com> Co-authored-by: NHamid Zare <12127420+hamidzr@users.noreply.github.com> Co-authored-by: NSqhttwl <61459740+Sqhttwl@users.noreply.github.com> Co-authored-by: NOccupyMars2025 <31559413+OccupyMars2025@users.noreply.github.com> Co-authored-by: N超级码牛 <54444805+SuperCodebull@users.noreply.github.com> Co-authored-by: Njzhang533 <jzhang533@gmail.com>
-
由 Yuang Liu 提交于
* code format change * update the split logic for uniform (#47670)
-
由 Ligoml 提交于
* #46765 * #47042 * Remove redundant numpy import (#47483) * #47555 * resolve conflict * resolve conflict * resolve conflict * resolve conflict * resolve conflict * for_codestyle * fix sample code paddle.linalg.multi_dot Co-authored-by: NKevin吴嘉文 <417333277@qq.com>
-
- 04 11月, 2022 1 次提交
-
-
由 Ligoml 提交于
* only run pre-commit * only run pre-commit
-
- 03 11月, 2022 1 次提交
-
-
由 ShenLiang 提交于
* add unbalanced data * fix utest
-
- 01 11月, 2022 1 次提交
-
-
由 sneaxiy 提交于
-
- 29 10月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* reformat hybrid_parallel_util.py by black * add fused_allreduce_gradients_with_group * add scale * fix ci
-
- 24 10月, 2022 3 次提交
-
-
由 Yuang Liu 提交于
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * support pure bfloat16 * support bf16 linear * update PR to pass CI * tiny fix where_grad_kernel.cu * Support bfloat16 type for reducer and sharding. * Fix some bug. * Polish code. * Polise code. * Add bfloat16 datatype in fill_grad kernels. Co-authored-by: Nsneaxiy <sneaxiy@126.com> Co-authored-by: Nsneaxiy <sneaxiy@126.com>
-
由 Roc 提交于
-
- 21 10月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
-
- 19 10月, 2022 1 次提交
-
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * Support allow_partial switch, which can be configure in pipeline_configs. If sent tensor are not the same from different hosts, they shouldn't been sent partially and then concated as a whole tensor. * Change name allow_partial to enable_partial_send_recv. * Add global variable _enable_partial_send_recv
-
- 18 10月, 2022 2 次提交
-
-
由 Yuang Liu 提交于
* [dygraph sharding] Overlap the reduce and the caculation for sharding stage 2. (#46495) * [dygraph sharding stage 2] sharding broadcast overlap (#46656) * Multi groups for broadcast of sharding stage 2 (#46894)
-
由 Haohongxiang 提交于
* [Dygraph] Fix performance of pp+mp by using send/recv_calc_stream instead of send/recv (#46116) * [Dygraph] Fix Perf of FusedFeedForward and FusedAttention with AllReduce (#46780) * update
-
- 17 10月, 2022 1 次提交
-
-
由 Wen Sun 提交于
* Support both use_calc_stream and sync_op in send recv APIs (#46023) * Support both use_calc_stream and sync_op in allgather API (#46295) * Support both use_calc_stream and sync_op in collective communication API (#46761) * Move group and all reduce from collective to communication (#45848) * Completes bfloat16 dtype for collective api in eager mode (#45844) * Fix collective APIs cannot be recognized when building docs (#46962) Co-authored-by: NLiYuRio <63526175+LiYuRio@users.noreply.github.com>
-
- 11 10月, 2022 1 次提交
-
-
由 Yuang Liu 提交于
* bug fix for virtual pipeline parallel (#45922) * dont wait for send op under dygraph pp (#46209) * [interleave pp] sync recv for 1f1b (#46399) * [dygraph pp] all sync for allgather partial (#46483)
-
- 27 9月, 2022 1 次提交
-
-
由 LiYuRio 提交于
-
- 22 9月, 2022 2 次提交
-
-
由 Roc 提交于
uniform logger manager in FleetAPI. hidde API under distributed/utils which users don't need.
-
由 Haohongxiang 提交于
* fix bugs of mp * fix bugs of mp * update * update * fix bug
-
- 20 9月, 2022 2 次提交
-
-
由 HongyuJia 提交于
* polish code comments * polish data_device_transform.cc
-
由 zhaoyingli 提交于
* [Auto Parallel] Change the import way of Auto Parallel (#46115) * fix strategy (#46256) * [Auto Parallel] performance improvement for Sharding-DP hybrid parallelism (#46180) * remove no need grad allreduce communication when sharding-dp * remove no need grad allreduce communication when sharding-dp * bugfix * bugfix * bugfix Co-authored-by: NYulong Ao <aoyulong@baidu.com> Co-authored-by: NJZ-LIANG <jianzhongliang10@gmail.com>
-
- 19 9月, 2022 3 次提交
-
-
由 wuhuachaocoding 提交于
-
由 wuhuachaocoding 提交于
* refactor mp. * update setup.py. * update mp_layers.py for compatibility. * add documents for mp_layers.py * update init.py * update collective.py. * update. * update mp_ops.py * update. * update code style. * update code style.
-
由 ShenLiang 提交于
-
- 09 9月, 2022 1 次提交
-
-
由 Yuang Liu 提交于
-
- 07 9月, 2022 2 次提交
- 06 9月, 2022 1 次提交
-
-
由 Yuang Liu 提交于
-
- 02 9月, 2022 1 次提交
-
-
由 wuhuachaocoding 提交于
-
- 01 9月, 2022 1 次提交
-
-
由 wangguanqun 提交于
* config * fix unittest * zero init & cache & patch config * add barrier to save and load * add unittest
-
- 26 8月, 2022 3 次提交
-
-
由 Yuang Liu 提交于
-
由 wanghuancoder 提交于
-
由 Yuang Liu 提交于
-
- 23 8月, 2022 2 次提交
-
-
由 zhaoyingli 提交于
* add quant pass
-
由 LiYuRio 提交于
-
- 16 8月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
* reconstruct_of_fleet_api * update
-
- 15 8月, 2022 1 次提交
-
-
由 wuhuachaocoding 提交于
* refactor fleet. * refact fleet.py. * update fleet/__init__.py. * update fleet.py * update code style. * update fleet * update fleet * update fleet * update fleet * update model.py * update fleet. * update __init__.py * update fleet. * update fleet. * update fleet * update fleet * update fleet * update fleet. * update optimizer.py * update optimizer * update fleet.py * update scaler.py * update setup.py.in
-
- 13 8月, 2022 1 次提交
-
-
由 ziyoujiyi 提交于
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud commm ready * . * . * fl-ps v1.0 * . * support N + N mode * . * . * . * . * delete print * . * . * . * . * fix bug * . * . * fl-ps with coordinator ready * merge dev * update message parse only * update fl client scheduler * fix bug * update multithreads sync * fix ci errors * update role_maker.py * update role_maker.py * fix ci error: windows py import error * fix ci error: windows py import error * fix windows ci pylib import error * add dump fields & params * try to fix windows import fleet error * fix ps FLAGS error * fix logging risk * fix logging possible risk * write trainer_desc file * support split sparse params in local & remote * fix import paddle.fluid.core.PSGPU * fix import paddle.fluid.core.PSGPU * add remote_sparse & local_sparse config * fix unittest * fix test_dist_fleet_geo table error * fix PADDLE_ENFORCE error * fix other's pr conflict
-