- 14 7月, 2023 1 次提交
-
-
由 caozhou 提交于
* distribute best cfg * adapt to multi args transmission * update metric extracting * fix bugs of prune and reading log * fix time default value * remove time record * adjust the order of searching dim * fix prune bugs * fix adding cfg bug * fix multi nodes bug * reset status * remove alarm and set logdir * deepcopy ctx * change alarm * fix restart bug * add exit * best no need alarm * add warmup time
-
- 25 6月, 2023 1 次提交
-
-
由 Chitsing KUI 提交于
-
- 19 6月, 2023 1 次提交
-
-
由 Chitsing KUI 提交于
* no endpoints in dy mode * fix fleet api inconsistent
-
- 14 6月, 2023 1 次提交
-
-
由 caozhou 提交于
* add auto tuner * fix prune * fix sharding prune and mbs candidates * fix cfg * fix launch * fix launch * add unittest * fix code style
-
- 12 6月, 2023 1 次提交
-
-
由 Nyakku Shigure 提交于
-
- 08 6月, 2023 1 次提交
-
-
由 Chitsing KUI 提交于
-
- 10 5月, 2023 1 次提交
-
-
由 Chitsing KUI 提交于
* add log overwrite flag * use strtobool
-
- 23 4月, 2023 1 次提交
-
-
由 Chitsing KUI 提交于
* save env log for each worker * fix ut
-
- 13 4月, 2023 1 次提交
-
-
由 TaoTao Li 提交于
* add auto parallel tuner options in launch * add ut for launch in auto_parallel tuner fix code format * fix ci-converage
-
- 31 3月, 2023 1 次提交
-
-
由 张春乔 提交于
* autofix Co-authored-by: NLiyulingyue <83450930+Liyulingyue@users.noreply.github.com> * revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py * empty commit, trigger ci * fix test_slice --------- Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
- 30 3月, 2023 1 次提交
-
-
由 cyberslack_lee 提交于
[CodeStyle][C416][C417] rewrite unnecessary comprehension with function call and use generator instead of map (#52140) * codestyle c416 c417 * fix error * fix inc * unify all C4 rules into one * fix inc --------- Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
- 20 3月, 2023 1 次提交
-
-
由 Chitsing KUI 提交于
-
- 29 11月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* isort all files * revert conflicting files * revert conflicting files * revert conflicting files
-
- 09 11月, 2022 1 次提交
-
-
由 Tony Cao 提交于
* fix flake8 CodeStyle E266 * fix comments
-
- 08 11月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* [CodeStyle][py2][U004] unecessary explicit `object` inheritance in class definition * fix an increment
-
- 01 11月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* [CodeStyle][py2] remove `six` package (part2) * six.ensure_str * remove unused `import six` * remove six from BUILTIN_LIKELY_MODULES * remove six in example code * remove some decode * try to fix example code * fix MockEtcdClient get/get_prefix returns data type * fix MockEtcdClient get_prefix returns data * fix MockEtcdClient get returns data * remove `six` in pypi and conda requirements * fix MockEtcdClient add_watch_callback/add_watch_prefix_callback returns data type * refine MockEtcdClient
-
- 23 10月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* update config * re-blacken python code * temporarily disable date and diff_py_file * skip a format
-
- 19 10月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
-
- 13 10月, 2022 1 次提交
-
-
由 Xinger 提交于
* add rpc module in cpp side * add rpc module in python side * support win32 and mac for rpc * 代码优化 * 优化代码 * update rpc * update rpc launch * rpc remove rank and world_size api * fix logger import bug * remove support for win and mac * remove support for xpu, npu, cinn and rocm * remove support for xpu, npu, cinn and rocm * fix shutdown barrier timeout bug * update:python_rpc_handler to shared ptr * fix master shutodwn first bug * tests support for cpu * update log to vlog * update get service info api * add single process test case * remove process group * remove some useless dependencies * update rpc api comments * update rpc comments: Example to Examples * update rpc api comments * update rpc api comments * update launch api comments * update init_rpc comments * update rpc sync and async comments * fix bug: init_rpc cant be called repeatly in a process * update rpc api comment: make master endpoint unique * update rpc api:service to worker, timeout_ms to timeout * rename ServiceInfo to WorkerInfo * refactor: rename server to worker, log to vlog * add launch test * remove unused codes * refine
-
- 12 10月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* [CodeStyle][F401] remove unused import in python/paddle/distributed * remove pass * empty commit * Fix ValueError: list.remove(x): x not in list for meta_optimizer_names. Fix ValueError: list.remove(x): x not in list for meta_optimizer_names. * Fix split import. Fix split import. * add noqa after meta_optimizers in factory * restort collective ops * expand `import *` * add noqa after required imports * try to fix APIs without core.ops * Revert "try to fix APIs without core.ops" This reverts commit 6172beaf601e84bf61f2490c12c4739f0edaa5eb. * fix an increment * empty commit * add noqa after required imports * expand `import *`, fix ci error Co-authored-by: NShuangchi He <34329208+Yulv-git@users.noreply.github.com>
-
- 18 8月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 17 8月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 11 8月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 08 8月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
* make launch compatible * fix ut * fix log offset
-
- 04 8月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 03 8月, 2022 1 次提交
-
-
由 ronnywang 提交于
* [CustomDevice] add custom ccl 2/2 * update * update * update launch
-
- 29 7月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 28 7月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
* add launch test * launch test for cpu * bs 1
-
- 30 6月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 20 6月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 14 6月, 2022 1 次提交
-
-
由 yaozhixin 提交于
* update paddle.distributed.launch * add sample code * update shell * fix typo * fix typo * update docs * rm code * fix doc 2 * fix doc 3 * fix doc 4 Co-authored-by: Nroot <root@sgjur-pod004-1.ipu.graphcore.cn>
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
* use yapf to format all python file * yapf exclude two unittests file for they rely on writing and reading file, and format will break them * disable diff_py_file because too many diff files cause command following failed
-
- 01 6月, 2022 1 次提交
-
- 31 5月, 2022 1 次提交
-
-
由 yaozhixin 提交于
* [IPU] support paddle.distributed.launch with IPUs * add device_num to env_args_mapping
-
- 24 5月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 13 5月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
* add nvsmi * collect gpu info to log * fix unitest * rm ret_type
-
- 19 4月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 02 4月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 01 4月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 24 3月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-