- 24 7月, 2023 1 次提交
-
-
由 Windfarer 提交于
-
- 14 7月, 2023 1 次提交
-
-
由 caozhou 提交于
* distribute best cfg * adapt to multi args transmission * update metric extracting * fix bugs of prune and reading log * fix time default value * remove time record * adjust the order of searching dim * fix prune bugs * fix adding cfg bug * fix multi nodes bug * reset status * remove alarm and set logdir * deepcopy ctx * change alarm * fix restart bug * add exit * best no need alarm * add warmup time
-
- 30 6月, 2023 1 次提交
-
-
由 sneaxiy 提交于
-
- 25 6月, 2023 1 次提交
-
-
由 Chitsing KUI 提交于
-
- 14 6月, 2023 1 次提交
-
-
由 caozhou 提交于
* add auto tuner * fix prune * fix sharding prune and mbs candidates * fix cfg * fix launch * fix launch * add unittest * fix code style
-
- 08 6月, 2023 1 次提交
-
-
由 Chitsing KUI 提交于
-
- 10 5月, 2023 1 次提交
-
-
由 Chitsing KUI 提交于
* add log overwrite flag * use strtobool
-
- 13 4月, 2023 1 次提交
-
-
由 TaoTao Li 提交于
* add auto parallel tuner options in launch * add ut for launch in auto_parallel tuner fix code format * fix ci-converage
-
- 06 4月, 2023 1 次提交
-
-
由 Kim Yann 提交于
* rem is_compiled_with_npu * rem nup related code * make lint happy * rem test * remove some tests * Update grad_scaler.py * fix an error
-
- 03 4月, 2023 1 次提交
-
-
由 Kim Yann 提交于
* rem is_compiled_with_mlu * fix some mlu_place and mlu_device_coount * make lint happy
-
- 31 3月, 2023 1 次提交
-
-
由 张春乔 提交于
* autofix Co-authored-by: NLiyulingyue <83450930+Liyulingyue@users.noreply.github.com> * revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py * empty commit, trigger ci * fix test_slice --------- Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
- 08 12月, 2022 1 次提交
-
-
由 Ghost Screaming 提交于
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * Remove climits. * Clean fluid API in paddle/distributed and paddle/fleetx folders. Include following files: python/paddle/distributed/__init__.py python/paddle/distributed/collective.py python/paddle/distributed/fleet/utils/fs.py python/paddle/distributed/fleet/utils/hybrid_parallel_inference.py python/paddle/distributed/fleet/utils/hybrid_parallel_util.py python/paddle/distributed/fleet/utils/internal_storage.py python/paddle/distributed/launch/context/device.py python/paddle/distributed/parallel.py python/paddle/distributed/parallel_with_gloo.py python/paddle/distributed/spawn.py python/paddle/framework/__init__.py To be mentioned, 'paddle.fluid.dygraph.parallel.ParallelEnv' and 'fluid.framework.core' keeps unchanged in those files. ParallelEnv is used by paddle.fluid.dygraph.parallel.DataParallel. However, APIs in paddle.fluid.dygraph.parallel can't be migrated to paddle.distributed, as there exists cyclic import dependencies in modules like paddle.static, paddle.tensor. And 'fluid.framework.core' will be changed to import framework.core after fluid.core is transmitted. * Change TODO authors.
-
- 29 11月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* isort all files * revert conflicting files * revert conflicting files * revert conflicting files
-
- 08 11月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* [CodeStyle][py2][U004] unecessary explicit `object` inheritance in class definition * fix an increment
-
- 23 10月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* update config * re-blacken python code * temporarily disable date and diff_py_file * skip a format
-
- 19 10月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
-
- 22 8月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 19 8月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
* rewrite get free port strategy * hide the old one
-
- 17 8月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 11 8月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 08 8月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
* make launch compatible * fix ut * fix log offset
-
- 04 8月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 03 8月, 2022 1 次提交
-
-
由 ronnywang 提交于
* [CustomDevice] add custom ccl 2/2 * update * update * update launch
-
- 29 7月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 28 7月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
* add launch test * launch test for cpu * bs 1
-
- 30 6月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 14 6月, 2022 1 次提交
-
-
由 yaozhixin 提交于
* update paddle.distributed.launch * add sample code * update shell * fix typo * fix typo * update docs * rm code * fix doc 2 * fix doc 3 * fix doc 4 Co-authored-by: Nroot <root@sgjur-pod004-1.ipu.graphcore.cn>
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
* use yapf to format all python file * yapf exclude two unittests file for they rely on writing and reading file, and format will break them * disable diff_py_file because too many diff files cause command following failed
-
- 01 6月, 2022 1 次提交
-
- 31 5月, 2022 1 次提交
-
-
由 yaozhixin 提交于
* [IPU] support paddle.distributed.launch with IPUs * add device_num to env_args_mapping
-
- 24 5月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 19 4月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 02 4月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 01 4月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 28 3月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 24 3月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 23 3月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-
- 21 3月, 2022 1 次提交
-
-
由 kuizhiqing 提交于
-