- 08 Feb, 2023 1 commit
  - Committed by Huang Jiyi
- 05 Dec, 2022 1 commit
  - Committed by ShenLiang
- 21 Nov, 2022 1 commit
  - Committed by LiYuRio
- 10 Nov, 2022 1 commit
  - Committed by james
    * XPU support eager mode
    * add unittest for XPU eager mode
    * minor bugfix
    * minor bugfix, test=kunlun
    * correct copyright info
    * 1. remove unsed vars/funcs 2. ProcessGroupBKCL inherit from ProcessGroupStream
    * bugfix for fp16 in eager mode multi-card, test=kunlun
    * rebase & fix a few issues
    * use new processgroup interface, test=kunlun
    * fix compile issue, test=kunlun
- 28 Oct, 2022 2 commits
  - Committed by Haohongxiang
  - Committed by Haohongxiang
    * fix no sync bugs
    * update
    * update task chain
      fix: update wait chain
      feat: add `GetDeviceContext` for gloo
    * fix oom
    * fix dev
    * update
    * update

    Co-authored-by: LiYuRio <liyuruijx@163.com>
    Co-authored-by: ForFishes <2282912238@qq.com>
- 17 Oct, 2022 1 commit
  - Committed by Ghost Screaming
    * Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong.
    * support pure bfloat16
    * support bf16 linear
    * update PR to pass CI
    * tiny fix where_grad_kernel.cu
    * Support bfloat16 type for reducer and sharding.
    * Fix some bug.
    * Polish code.
    * Polise code.
    * Add bfloat16 datatype in fill_grad kernels.

    Co-authored-by: sneaxiy <sneaxiy@126.com>
- 01 Sep, 2022 1 commit
  - Committed by sneaxiy
    * make dense_contents_ lazy init
    * update legacy dygraph
    * fix legacy dygraph bug
- 03 Aug, 2022 1 commit
  - Committed by ronnywang
    * [CustomDevice] add custom ccl 2/2
    * update
    * update
    * update launch
- 01 Aug, 2022 1 commit
  - Committed by Leo Chen
    * remove cudaDeviceContext
    * remove more template
    * fix rocm compile
    * remove alias name CUDADeviceContext
    * fix compile
    * fix tests
    * revert changes
- 29 Jul, 2022 1 commit
  - Committed by Jiabin Yang
- 02 Jul, 2022 1 commit
  - Committed by Leo Chen
    * fix init()
    * delete test_device_context
    * replace CPUDeviceContext with CPUContext
    * fix test_scalar
    * remove dot_op.cc
    * fix compile
- 26 Jun, 2022 1 commit
  - Committed by Sing_chan
- 07 Jun, 2022 1 commit
  - Committed by Haohongxiang
    * fix bugs of reducer
    * update
    * update
- 05 Jun, 2022 1 commit
  - Committed by Sing_chan
- 11 May, 2022 1 commit
  - Committed by Haohongxiang
    * support diff batch for sparse of eagerreducer
    * fix
- 29 Apr, 2022 1 commit
  - Committed by Jiabin Yang
- 14 Apr, 2022 1 commit
  - Committed by Chen Weihang
- 13 Apr, 2022 2 commits
  - Committed by lilong12
  - Committed by Chen Weihang
    * remove old custom op placetype
    * replace dist placetype using
    * add with gpu macro
    * fix mutable_data error
    * fix set value error
    * add comment
- 04 Apr, 2022 1 commit
  - Committed by Haohongxiang
    * [Dygraph] Support sparse tensor in refactored reducer
    * add uts
    * refactor
    * update
    * fix bugs
- 31 Mar, 2022 1 commit
  - Committed by zyfncg
    * rename scalar_array to int_array
    * update cmake
    * fix conflict
    * remove useless log
- 22 Mar, 2022 1 commit
  - Committed by zyfncg
    * replace Backend by Place in C++ API
    * fix left code
    * fix test_to_api bug
- 18 Mar, 2022 1 commit
  - Committed by ShenLiang
    * fix bug
    * fix bug
- 15 Mar, 2022 1 commit
  - Committed by Haohongxiang
    * refactor reducer
    * modify cmakelists
    * solve conflicts
    * rename group and update process_group
    * fix bugs of ProcessGroupNCCL
    * modify for CIs
    * refactoring reducer
- 01 Mar, 2022 1 commit
  - Committed by ShenLiang
    * add reducer