- 01 11月, 2022 3 次提交
-
-
由 Nyakku Shigure 提交于
* [CodeStyle][E712] use `if cond`/`if cond is True` for comparison with `True` * revert changes in fluid * revert unrelated file * revert changes in norm * revert changes in auto_parallel_amp * fix norm and auto_parallel_amp * revert a typo fix due to fixed at #47477
-
由 Nyakku Shigure 提交于
* [CodeStyle][py2] remove `six` package (part2) * six.ensure_str * remove unused `import six` * remove six from BUILTIN_LIKELY_MODULES * remove six in example code * remove some decode * try to fix example code * fix MockEtcdClient get/get_prefix returns data type * fix MockEtcdClient get_prefix returns data * fix MockEtcdClient get returns data * remove `six` in pypi and conda requirements * fix MockEtcdClient add_watch_callback/add_watch_prefix_callback returns data type * refine MockEtcdClient
-
由 sneaxiy 提交于
-
- 31 10月, 2022 1 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Improve the c++ dist attr * [Auto Parallel] Modify test_program.py * [Auto Parallel] Add the missiong import
-
- 28 10月, 2022 3 次提交
-
-
由 sneaxiy 提交于
* add fused_allreduce_gradients_with_group * add scale * fix ci
-
由 zhaoyingli 提交于
* fix engine build method * fix import * update engine cost * update raise error * update cmakelist * revert optimizer * revert optimizer * fix unittest * fix unittest Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
-
由 LiYuRio 提交于
-
- 26 10月, 2022 1 次提交
-
-
由 Roc 提交于
-
- 24 10月, 2022 3 次提交
-
-
由 Haohongxiang 提交于
-
由 Nyakku Shigure 提交于
* [CodeStyle][black] format dy2static unittests * format some missing files * update lineno in test_origin_info * update lineno in test_error * update lineno
-
由 Tony Cao 提交于
* Fix F522: remove unused arguments * add redirect_stderr argument in _run_cmd
-
- 23 10月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* update config * re-blacken python code * temporarily disable date and diff_py_file * skip a format
-
- 21 10月, 2022 3 次提交
- 20 10月, 2022 4 次提交
-
-
由 zhaoyingli 提交于
* [AutoParallel] fix fp16 for subblock * fix engine * fix comment
-
由 Tony Cao 提交于
* Fix W605 in tools folder by adding escape symbols * Fix W605 in incubate and some other folders * Fix W605 in /fluid/test folders * Update tools/analysisPyXml.py Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com> * Add some changes to manual and auto escape symbols * revert changes in transformer.py * Fix new code with W605 error: add escape symbols * revert changes in transformer.py * revert changes in transformer.py Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>
-
由 Haohongxiang 提交于
-
由 wuhuachaocoding 提交于
* add test for stage2 + dp * update test for stage2 + dp. * update. * update.
-
- 19 10月, 2022 3 次提交
-
-
由 Nyakku Shigure 提交于
-
由 Nyakku Shigure 提交于
-
由 Roc 提交于
-
- 18 10月, 2022 6 次提交
-
-
由 caozhou 提交于
* add parallel tuner * add unittest * fix unittest * set timeout of unittest * set unittest timeout * fix auto_mode setting * update unittest * sync from develop and update unittest * remove unused import * update unittest * update cmakelist * add unittests
-
由 LiYuRio 提交于
-
由 caozhou 提交于
* add cost interface * update inferface and add unittest * update unittest * update inferface
-
由 Nyakku Shigure 提交于
-
由 Nyakku Shigure 提交于
* [CodeStyle][py2] remove `compat` module (to_text) * remove some unnecessary decode * remove to_text definition and unittest * Revert "remove to_text definition and unittest" This reverts commit a6b69cb8dca8b9b031ce10ea32d1040e7e0dd267. * remove an assertion * empty commit
-
由 zhaoyingli 提交于
* [AutoParallel] add callbacks * fix unittest * fix dist_context * fix engine * fix cmakelist * fix unittest's returns * fix cmakelist
-
- 17 10月, 2022 3 次提交
-
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * Support allow_partial switch, which can be configure in pipeline_configs. If sent tensor are not the same from different hosts, they shouldn't been sent partially and then concated as a whole tensor. * Change name allow_partial to enable_partial_send_recv. * Add global variable _enable_partial_send_recv
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * support pure bfloat16 * support bf16 linear * update PR to pass CI * tiny fix where_grad_kernel.cu * Support bfloat16 type for reducer and sharding. * Fix some bug. * Polish code. * Polise code. * Add bfloat16 datatype in fill_grad kernels. Co-authored-by: Nsneaxiy <sneaxiy@126.com>
-
由 Yulong Ao 提交于
* [Auto Parallel] Fix the bug for None labels * [Auto Parallel] Fix the completion bug
-
- 14 10月, 2022 3 次提交
-
-
由 Wen Sun 提交于
-
由 zhaoyingli 提交于
* for gpt-gen * fix reshard * adapt assign and shape op * add dist_assign & unittest * add conditional block unittest * rename unittest
-
由 Yulong Ao 提交于
-
- 13 10月, 2022 3 次提交
-
-
由 wuhuachaocoding 提交于
* combine dp and stage2 hybrid parallel. * update condition.
-
由 Xinger 提交于
* add rpc module in cpp side * add rpc module in python side * support win32 and mac for rpc * 代码优化 * 优化代码 * update rpc * update rpc launch * rpc remove rank and world_size api * fix logger import bug * remove support for win and mac * remove support for xpu, npu, cinn and rocm * remove support for xpu, npu, cinn and rocm * fix shutdown barrier timeout bug * update:python_rpc_handler to shared ptr * fix master shutodwn first bug * tests support for cpu * update log to vlog * update get service info api * add single process test case * remove process group * remove some useless dependencies * update rpc api comments * update rpc comments: Example to Examples * update rpc api comments * update rpc api comments * update launch api comments * update init_rpc comments * update rpc sync and async comments * fix bug: init_rpc cant be called repeatly in a process * update rpc api comment: make master endpoint unique * update rpc api:service to worker, timeout_ms to timeout * rename ServiceInfo to WorkerInfo * refactor: rename server to worker, log to vlog * add launch test * remove unused codes * refine
-
由 Nyakku Shigure 提交于
-
- 12 10月, 2022 3 次提交
-
-
由 JZ-LIANG 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Suppport different dataloaders * [Auto Parallel] Add num_shards config for dataset * [Auto Parallel] Unify the logger and outputs of Engine API * [Auto Parallel] Fix the bugs of to_static * [Auto Parallel] Adjust the test_to_static.py * [Auto Parallel] Add the prepare API and replace __call__ with run * [Auto Parallel] Improve the private implementations of Engine * [Auto Parallel] Set capacity of dataloader for opt tuning * [Auto Parallel] [WIP] Change the fine-grained API * [Auto Parallel] Improve APIs to support different user cases * [Auto Parallel] Add removed config * [Auto Parallel] Add imports * [Auto Parallel] Fix bugs for to_static * [Auto Parallel] Remove unnecessary imports
-
由 zhouweiwei2014 提交于
* [Zero-Dim] support input 0D Tensor for unary api * fix CI
-