- 28 10月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
* fix no sync bugs * update * update task chain fix: update wait chain feat: add `GetDeviceContext` for gloo * fix oom * fix dev * update * update Co-authored-by: NLiYuRio <liyuruijx@163.com> Co-authored-by: NForFishes <2282912238@qq.com>
-
- 23 10月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* update config * re-blacken python code * temporarily disable date and diff_py_file * skip a format
-
- 19 10月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
-
- 21 9月, 2022 1 次提交
-
-
由 LiYuRio 提交于
-
- 14 9月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* trim trailing whitespace * fix `.cmake-format.py` * revert npu ut changes, avoid npu ci error
-
- 26 8月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
-
- 03 8月, 2022 1 次提交
-
-
由 ronnywang 提交于
* [CustomDevice] add custom ccl 2/2 * update * update * update launch
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
* use yapf to format all python file * yapf exclude two unittests file for they rely on writing and reading file, and format will break them * disable diff_py_file because too many diff files cause command following failed
-
- 15 4月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
* refactor mp in eager mode * update * update * add uts
-
- 06 4月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
* remove unrequired ut cases * update * fix bugs * update
-
- 05 4月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
* support process group in dp with fleet api * update * fix uts * update
-
- 02 4月, 2022 1 次提交
-
-
由 lilong12 提交于
-
- 28 3月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
* add uts for EagerReducer * add more uts * fix bugs * fix bugs * modify * modify uts * fix bugs * update * update * update * solve conflicts and merge * add some other uts * modify time of uts * update * update * update * remove uts of resnet
-
- 25 3月, 2022 1 次提交
-
-
由 Jiabin Yang 提交于
* refactor eager flags * fix flags error when we switch from eager to dygraph * fix ci problem * fix ci * fix ci * merge develop and fix code style * merge develop and fix code style * fix op test error * fix op test error * fix op test error * fix op test error * fix op test error * merge develop
-
- 15 3月, 2022 1 次提交
-
-
由 Haohongxiang 提交于
* refactor reducer * modify cmakelists * solve conflicts * rename group and update process_group * fix bugs of ProcessGroupNCCL * modify for CIs * refactoring reducer
-
- 08 3月, 2022 1 次提交
-
-
由 mhhhh1 提交于
* [MLU] add fleet init api and collective api pytest for mlu * fix no value for argument 'data_type' in method call
-
- 01 3月, 2022 1 次提交
-
-
由 ShenLiang 提交于
* add reducer
-
- 29 12月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* fix bug of dp in pfp16 * fix topo
-
- 27 12月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 26 11月, 2021 1 次提交
-
-
由 Steffy-zxf 提交于
* fix data parallel when VOCAB var in program
-
- 23 11月, 2021 1 次提交
-
-
由 ronnywang 提交于
* Added HCCL backend support in dynamic graph mode * fix segmentation fault * add ut
-
- 14 10月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* add no_sync for parameters sync * add pipeline for moe
-
- 14 9月, 2021 1 次提交
-
-
由 Haohongxiang 提交于
* Add solutions to PyLayer which is unsupported in DataParallel * modify note format for parallel.py * modify docs of dataparallel * add docs of dp with pylayer * modify docs format * modify example format * change example of dp with pylayer * add unittest for dp with pylayer * modify ut * merge latest codes * update * modify for CI-Coverage * modify text-indent
-
- 24 8月, 2021 1 次提交
-
-
由 Haohongxiang 提交于
* Add no_sync in data parallel for dynamic graph * modify UT of no_sync * delete test_parallel_dygraph_dataparallel_no_sync.py * add test_parallel_dygraph_no_sync.py * modify run_trainer_with_spawn in UTs * Add UT of complex control flow in no_sync * add specific descriptions and notes for no_sync * check code style * modify UT's TIMEOUT in CMakeLists.txt
-
- 05 7月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 11 5月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* fix find_unused_parameters default value
-
- 22 4月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* add clip/check * add amp & clip grad in dygraph * add logging
-
- 19 4月, 2021 2 次提交
-
-
由 ShenLiang 提交于
* support dp & mp
-
由 Jiabin Yang 提交于
* fix sublayer error with include_sublayers=False * add ut * refactor include_sublayers related api * fix ut * fix ut of transformer * fix ut of transformer * remove useless code * change sublayer api * polish code * add test for include_self=True
-
- 01 4月, 2021 1 次提交
-
-
由 ShenLiang 提交于
* support control flow * supoort sync_parameters_buffers * fix the bug of sparse embedding
-
- 19 2月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 03 2月, 2021 1 次提交
-
-
由 WangXi 提交于
-
- 13 1月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 22 12月, 2020 1 次提交
-
-
由 ShenLiang 提交于
* fix fleet for multi-stream * fix memcpy for ncclid * use sync to solve move operation
-
- 09 12月, 2020 2 次提交
- 04 12月, 2020 1 次提交
-
-
由 ShenLiang 提交于
-
- 01 12月, 2020 1 次提交
-
-
由 ShenLiang 提交于
-
- 27 11月, 2020 1 次提交
-
-
由 ShenLiang 提交于
* add reducer * refine envent for memorycopy * add concat&split for allreduce * apply concat & split for fuse tensor * fix nccl dep * fix the untest, compile problem and ddp initialize problem * fix untest for mac & add some comments & solve the repeated param in sublayers * fix untest for windows & fix document
-
- 26 11月, 2020 1 次提交
-
-
由 gongweibao 提交于
-