- 28 3月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
* add uts for EagerReducer * add more uts * fix bugs * fix bugs * modify * modify uts * fix bugs * update * update * update * solve conflicts and merge * add some other uts * modify time of uts * update * update * update * remove uts of resnet
-
- 25 3月, 2022 1 次提交
-
-
由 Jiabin Yang 提交于
* refactor eager flags * fix flags error when we switch from eager to dygraph * fix ci problem * fix ci * fix ci * merge develop and fix code style * merge develop and fix code style * fix op test error * fix op test error * fix op test error * fix op test error * fix op test error * merge develop
-
- 15 3月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
* refactor reducer * modify cmakelists * solve conflicts * rename group and update process_group * fix bugs of ProcessGroupNCCL * modify for CIs * refactoring reducer
-
- 08 3月, 2022 1 次提交
-
-
由 mhhhh1 提交于
* [MLU] add fleet init api and collective api pytest for mlu * fix no value for argument 'data_type' in method call
-
- 01 3月, 2022 1 次提交
-
-
由 ShenLiang 提交于
* add reducer
-
- 29 12月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* fix bug of dp in pfp16 * fix topo
-
- 27 12月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 26 11月, 2021 1 次提交
-
-
由 Steffy-zxf 提交于
* fix data parallel when VOCAB var in program
-
- 23 11月, 2021 1 次提交
-
-
由 ronnywang 提交于
* Added HCCL backend support in dynamic graph mode * fix segmentation fault * add ut
-
- 14 10月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* add no_sync for parameters sync * add pipeline for moe
-
- 14 9月, 2021 1 次提交
-
-
由 Haohongxiang 提交于
* Add solutions to PyLayer which is unsupported in DataParallel * modify note format for parallel.py * modify docs of dataparallel * add docs of dp with pylayer * modify docs format * modify example format * change example of dp with pylayer * add unittest for dp with pylayer * modify ut * merge latest codes * update * modify for CI-Coverage * modify text-indent
-
- 24 8月, 2021 1 次提交
-
-
由 Haohongxiang 提交于
* Add no_sync in data parallel for dynamic graph * modify UT of no_sync * delete test_parallel_dygraph_dataparallel_no_sync.py * add test_parallel_dygraph_no_sync.py * modify run_trainer_with_spawn in UTs * Add UT of complex control flow in no_sync * add specific descriptions and notes for no_sync * check code style * modify UT's TIMEOUT in CMakeLists.txt
-
- 05 7月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 11 5月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* fix find_unused_parameters default value
-
- 22 4月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* add clip/check * add amp & clip grad in dygraph * add logging
-
- 19 4月, 2021 2 次提交
-
-
由 ShenLiang 提交于
* support dp & mp
-
由 Jiabin Yang 提交于
* fix sublayer error with include_sublayers=False * add ut * refactor include_sublayers related api * fix ut * fix ut of transformer * fix ut of transformer * remove useless code * change sublayer api * polish code * add test for include_self=True
-
- 01 4月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* support control flow * supoort sync_parameters_buffers * fix the bug of sparse embedding
-
- 19 2月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 03 2月, 2021 1 次提交
-
-
由 WangXi 提交于
-
- 13 1月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 22 12月, 2020 1 次提交
-
-
由 ShenLiang 提交于
* fix fleet for multi-stream * fix memcpy for ncclid * use sync to solve move operation
-
- 09 12月, 2020 2 次提交
- 04 12月, 2020 1 次提交
-
-
由 ShenLiang 提交于
-
- 01 12月, 2020 1 次提交
-
-
由 ShenLiang 提交于
-
- 27 11月, 2020 1 次提交
-
-
由 ShenLiang 提交于
* add reducer * refine envent for memorycopy * add concat&split for allreduce * apply concat & split for fuse tensor * fix nccl dep * fix the untest, compile problem and ddp initialize problem * fix untest for mac & add some comments & solve the repeated param in sublayers * fix untest for windows & fix document
-
- 26 11月, 2020 1 次提交
-
-
由 gongweibao 提交于
-
- 24 11月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* polish parallel api impl & doc details * add unittest for coverage * remove spawn test in py2.7 * add parallel api into white list
-
- 14 10月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
-
- 29 9月, 2020 2 次提交
-
-
由 Chen Weihang 提交于
* remove data parallel scale loss & apply collective_grads * move apply in minimize * fix failed unittests
-
由 Chen Weihang 提交于
* polish Program api doc & example * polish CompiledProgram api doc & example * polish ParallelEnv api doc & examples * polish details, test=document_fix * polish program doc details, test=document_fix * polish details, test=document_fix * fix note format error, test=document_fix * add lost example, test=document_fix * fix lost example, test=document_fix
-
- 28 9月, 2020 1 次提交
-
-
由 Leo Chen 提交于
* remove inplace argument * fix sample code * fix sample code
-
- 23 9月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* init commit of new save/load * fix failed unittests * fix save_load_v2 unittest failed * fix failed unittest & polish doc * add tests for coverage * add more tests & move static apis * fix example code error * polish emample code * fix detail example code problem
-
- 03 9月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* update set_dict method name & add aliases * fix var name error * fix alias formats * use set_state_dict in unittest * add decorator solve compatible problem * polish decorator * replace layer set_state_dict by patched method * remove import monkey path layer * fix import function error * add unittest for coverage
-
- 02 9月, 2020 1 次提交
-
-
由 Yang Zhang 提交于
-
- 31 8月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* remove backend argument of init_parallel_env * remove keep name table in transformer * add cpu version check * add skip unittest for init_parallel_env * polish doc: remove func use & update example
-
- 28 8月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add dygraph parallel run interface * polish implement & unified env property name * add print config arg * refactor init_parallel_env function * Compatible with multiprocessing and launch modes * set default trainer start port * support run in python 2 * polish python2 support code * remove python2 support * refine launch import * polish dome design details * refactor api implemention & path * use new method _set_expected_place * add spawn unittest framework & mnist test * add more unittests & doc * fix unittest failed * polish english doc * self review and polish details * refactor code by reviewer's comments * fix unittest failed * fix parallel_env unittest * fix several typos * fix error introduced when fixing typos * add unpublic note for start_processes * polish details by xiaoguang's comment * verify correctly when spawn nprocs=-1 * refactor spawn & init_parallel_env design * polish doc details * open spawn unittests * try to fix doc compile error * try to fix unknown doc format error * add skip unittest when not gpu
-
- 19 8月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
-
- 14 8月, 2020 1 次提交
-
-
由 Yang Zhang 提交于
* Upgrade `no_grad` decorator test=develop - match torch decorator usage (i.e., with parenthesis) - handle generator functions - add `paddle.no_grad` alias * Switch from `functools` to `decorator` preserves signature * Reword decorator usage note
-