- 19 10月, 2022 2 次提交
-
-
由 Nyakku Shigure 提交于
-
由 Roc 提交于
-
- 18 10月, 2022 6 次提交
-
-
由 caozhou 提交于
* add parallel tuner * add unittest * fix unittest * set timeout of unittest * set unittest timeout * fix auto_mode setting * update unittest * sync from develop and update unittest * remove unused import * update unittest * update cmakelist * add unittests
-
由 LiYuRio 提交于
-
由 caozhou 提交于
* add cost interface * update inferface and add unittest * update unittest * update inferface
-
由 Nyakku Shigure 提交于
-
由 Nyakku Shigure 提交于
* [CodeStyle][py2] remove `compat` module (to_text) * remove some unnecessary decode * remove to_text definition and unittest * Revert "remove to_text definition and unittest" This reverts commit a6b69cb8dca8b9b031ce10ea32d1040e7e0dd267. * remove an assertion * empty commit
-
由 zhaoyingli 提交于
* [AutoParallel] add callbacks * fix unittest * fix dist_context * fix engine * fix cmakelist * fix unittest's returns * fix cmakelist
-
- 17 10月, 2022 3 次提交
-
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * Support allow_partial switch, which can be configure in pipeline_configs. If sent tensor are not the same from different hosts, they shouldn't been sent partially and then concated as a whole tensor. * Change name allow_partial to enable_partial_send_recv. * Add global variable _enable_partial_send_recv
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * support pure bfloat16 * support bf16 linear * update PR to pass CI * tiny fix where_grad_kernel.cu * Support bfloat16 type for reducer and sharding. * Fix some bug. * Polish code. * Polise code. * Add bfloat16 datatype in fill_grad kernels. Co-authored-by: Nsneaxiy <sneaxiy@126.com>
-
由 Yulong Ao 提交于
* [Auto Parallel] Fix the bug for None labels * [Auto Parallel] Fix the completion bug
-
- 14 10月, 2022 3 次提交
-
-
由 Wen Sun 提交于
-
由 zhaoyingli 提交于
* for gpt-gen * fix reshard * adapt assign and shape op * add dist_assign & unittest * add conditional block unittest * rename unittest
-
由 Yulong Ao 提交于
-
- 13 10月, 2022 3 次提交
-
-
由 wuhuachaocoding 提交于
* combine dp and stage2 hybrid parallel. * update condition.
-
由 Xinger 提交于
* add rpc module in cpp side * add rpc module in python side * support win32 and mac for rpc * 代码优化 * 优化代码 * update rpc * update rpc launch * rpc remove rank and world_size api * fix logger import bug * remove support for win and mac * remove support for xpu, npu, cinn and rocm * remove support for xpu, npu, cinn and rocm * fix shutdown barrier timeout bug * update:python_rpc_handler to shared ptr * fix master shutodwn first bug * tests support for cpu * update log to vlog * update get service info api * add single process test case * remove process group * remove some useless dependencies * update rpc api comments * update rpc comments: Example to Examples * update rpc api comments * update rpc api comments * update launch api comments * update init_rpc comments * update rpc sync and async comments * fix bug: init_rpc cant be called repeatly in a process * update rpc api comment: make master endpoint unique * update rpc api:service to worker, timeout_ms to timeout * rename ServiceInfo to WorkerInfo * refactor: rename server to worker, log to vlog * add launch test * remove unused codes * refine
-
由 Nyakku Shigure 提交于
-
- 12 10月, 2022 5 次提交
-
-
由 JZ-LIANG 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Suppport different dataloaders * [Auto Parallel] Add num_shards config for dataset * [Auto Parallel] Unify the logger and outputs of Engine API * [Auto Parallel] Fix the bugs of to_static * [Auto Parallel] Adjust the test_to_static.py * [Auto Parallel] Add the prepare API and replace __call__ with run * [Auto Parallel] Improve the private implementations of Engine * [Auto Parallel] Set capacity of dataloader for opt tuning * [Auto Parallel] [WIP] Change the fine-grained API * [Auto Parallel] Improve APIs to support different user cases * [Auto Parallel] Add removed config * [Auto Parallel] Add imports * [Auto Parallel] Fix bugs for to_static * [Auto Parallel] Remove unnecessary imports
-
由 zhouweiwei2014 提交于
* [Zero-Dim] support input 0D Tensor for unary api * fix CI
-
由 Yuang Liu 提交于
-
由 Nyakku Shigure 提交于
* [CodeStyle][F401] remove unused import in python/paddle/distributed * remove pass * empty commit * Fix ValueError: list.remove(x): x not in list for meta_optimizer_names. Fix ValueError: list.remove(x): x not in list for meta_optimizer_names. * Fix split import. Fix split import. * add noqa after meta_optimizers in factory * restort collective ops * expand `import *` * add noqa after required imports * try to fix APIs without core.ops * Revert "try to fix APIs without core.ops" This reverts commit 6172beaf601e84bf61f2490c12c4739f0edaa5eb. * fix an increment * empty commit * add noqa after required imports * expand `import *`, fix ci error Co-authored-by: NShuangchi He <34329208+Yulv-git@users.noreply.github.com>
-
- 11 10月, 2022 5 次提交
- 10 10月, 2022 5 次提交
-
-
由 Tony Cao 提交于
* Update README.md * Update README.md * Fix F632: replace 'is', 'is not' with ==, != respectively
-
由 Tony Cao 提交于
* Update README.md * Update README.md * Fix F541 by converting f-string to normal strings
-
由 Yulong Ao 提交于
* [Auto Parallel] Unify the logger and outputs of Engine API * [Auto Parallel] Fix the bugs of to_static * [Auto Parallel] Adjust the test_to_static.py
-
由 wuhuachaocoding 提交于
-
由 LiYuRio 提交于
-
- 09 10月, 2022 1 次提交
-
-
由 Yuang Liu 提交于
-
- 08 10月, 2022 2 次提交
-
-
由 caozhou 提交于
* update comp cost and completion for gpt auto search * add unittest
-
由 Haohongxiang 提交于
-
- 30 9月, 2022 1 次提交
-
-
由 Wen Sun 提交于
-
- 29 9月, 2022 2 次提交
-
-
由 zhaoyingli 提交于
-
由 zhaoyingli 提交于
* [AutoParallel] fix reshard when train with eval * fix mppp
-
- 28 9月, 2022 2 次提交
-
-
由 zhaoyingli 提交于
-
由 zhaoyingli 提交于
-