- 22 9月, 2022 1 次提交
-
-
由 Roc 提交于
uniform logger manager in FleetAPI. hidde API under distributed/utils which users don't need.
-
- 26 8月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
* use yapf to format all python file * yapf exclude two unittests file for they rely on writing and reading file, and format will break them * disable diff_py_file because too many diff files cause command following failed
-
- 20 5月, 2022 1 次提交
-
-
由 zn 提交于
-
- 25 3月, 2022 1 次提交
-
-
由 Jiabin Yang 提交于
* refactor eager flags * fix flags error when we switch from eager to dygraph * fix ci problem * fix ci * fix ci * merge develop and fix code style * merge develop and fix code style * fix op test error * fix op test error * fix op test error * fix op test error * fix op test error * merge develop
-
- 07 3月, 2022 1 次提交
-
-
由 zhaoyingli 提交于
* engine support pp * fix format * avoid multi print * fix convert * bug fix * add pp unittest
-
- 22 12月, 2021 1 次提交
-
-
由 Zhanlue Yang 提交于
-
- 29 11月, 2021 1 次提交
-
-
由 李季 提交于
Co-authored-by: NChen Long <1300851984@qq.com>
-
- 21 10月, 2021 1 次提交
-
-
由 xiongkun 提交于
-
- 20 10月, 2021 1 次提交
-
-
由 李季 提交于
* fix global gather and global scatter operators
-
- 11 10月, 2021 1 次提交
-
-
由 李季 提交于
-
- 13 9月, 2021 2 次提交
-
-
由 danleifeng 提交于
-
由 李季 提交于
* upload global scatter and global gather operators related files
-
- 21 6月, 2021 1 次提交
-
-
由 Jiangxinz 提交于
-
- 15 6月, 2021 1 次提交
-
- 11 6月, 2021 1 次提交
-
-
由 Jiangxinz 提交于
* fix Undefined variables * fix Undefined variables
-
- 08 6月, 2021 1 次提交
-
-
由 Jiangxinz 提交于
Add 'self' parameters to function Cluster::update_pods, use variable name 'cur_proxy' instead of 'proxy' in file 'tools/get_pr_ut.py' (#33377) * undefined variable proxy * Cluster::update_pods lacks of parameter self
-
- 07 6月, 2021 1 次提交
-
-
由 zhangchunle 提交于
-
- 06 5月, 2021 1 次提交
-
-
由 zhiboniu 提交于
-
- 05 3月, 2021 1 次提交
-
-
由 liuyuhui 提交于
[Kunlun]Multi xpu dygraph performance optimization , add distributed.spawn support for multi xpu and some bug-fixes (#31130)
-
- 26 11月, 2020 1 次提交
-
-
由 gongweibao 提交于
-
- 16 9月, 2020 1 次提交
-
-
由 danleifeng 提交于
* fix ports conflict when launching multi-nodes in paddlecloud;test=develop * add DISTRIBUTED_TRAINER_ENDPOINTS env for cloud;test=develop
-
- 28 8月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add dygraph parallel run interface * polish implement & unified env property name * add print config arg * refactor init_parallel_env function * Compatible with multiprocessing and launch modes * set default trainer start port * support run in python 2 * polish python2 support code * remove python2 support * refine launch import * polish dome design details * refactor api implemention & path * use new method _set_expected_place * add spawn unittest framework & mnist test * add more unittests & doc * fix unittest failed * polish english doc * self review and polish details * refactor code by reviewer's comments * fix unittest failed * fix parallel_env unittest * fix several typos * fix error introduced when fixing typos * add unpublic note for start_processes * polish details by xiaoguang's comment * verify correctly when spawn nprocs=-1 * refactor spawn & init_parallel_env design * polish doc details * open spawn unittests * try to fix doc compile error * try to fix unknown doc format error * add skip unittest when not gpu
-
- 27 7月, 2020 1 次提交
-
-
由 Yi Liu 提交于
test=develop
-
- 07 7月, 2020 1 次提交
-
-
由 gongweibao 提交于
-
- 02 7月, 2020 1 次提交
-
-
由 Yi Liu 提交于
* fix the compatibility of PY2 and PY3 in paddle.distributed.launch test=develop * only pull log of local rank 0 test=develop * log exception if UnicodeEncodeError occurs when pulling log in paddle.distributed.launch test=develop Co-authored-by: NSunGaofeng <peakbee@gmail.com>
-
- 30 6月, 2020 1 次提交
-
-
由 Yi Liu 提交于
test=develop
-
- 28 5月, 2020 1 次提交
-
-
由 mapingshuo 提交于
replace join to terminate
-
- 21 4月, 2020 1 次提交
-
-
由 Kaipeng Deng 提交于
* add DataLoader, Dataset, BatchSampler
-
- 03 4月, 2020 1 次提交
-
-
由 gongweibao 提交于
-