- 24 Oct, 2022 3 commits
-
-
Committed by Yuang Liu
-
Committed by Ghost Screaming
* Fix a bug in the reduce_sum op: the result is wrong when input.numel() > INT32_MAX.
* support pure bfloat16
* support bf16 linear
* update PR to pass CI
* tiny fix where_grad_kernel.cu
* Support bfloat16 type for reducer and sharding.
* Fix some bugs.
* Polish code.
* Polish code.
* Add bfloat16 datatype in fill_grad kernels.
Co-authored-by: sneaxiy <sneaxiy@126.com>
-
Committed by Roc
-
- 21 Oct, 2022 1 commit
-
-
Committed by Haohongxiang
-
- 19 Oct, 2022 1 commit
-
-
Committed by Ghost Screaming
* Fix a bug in the reduce_sum op: the result is wrong when input.numel() > INT32_MAX.
* Support the allow_partial switch, which can be configured in pipeline_configs. If the tensors sent from different hosts are not the same, they should not be sent partially and then concatenated into a whole tensor.
* Rename allow_partial to enable_partial_send_recv.
* Add the global variable _enable_partial_send_recv.
-
- 18 Oct, 2022 2 commits
-
-
Committed by Yuang Liu
* [dygraph sharding] Overlap the reduce and the calculation for sharding stage 2. (#46495)
* [dygraph sharding stage 2] sharding broadcast overlap (#46656)
* Multi groups for broadcast of sharding stage 2 (#46894)
-
Committed by Haohongxiang
* [Dygraph] Fix performance of pp+mp by using send/recv_calc_stream instead of send/recv (#46116)
* [Dygraph] Fix Perf of FusedFeedForward and FusedAttention with AllReduce (#46780)
* update
-
- 11 Oct, 2022 1 commit
-
-
Committed by Yuang Liu
* bug fix for virtual pipeline parallel (#45922)
* don't wait for send op under dygraph pp (#46209)
* [interleave pp] sync recv for 1f1b (#46399)
* [dygraph pp] all sync for allgather partial (#46483)
-
- 27 Sep, 2022 1 commit
-
-
Committed by LiYuRio
-
- 22 Sep, 2022 1 commit
-
-
Committed by Roc
Unify the logger manager in FleetAPI. Hide APIs under distributed/utils that users don't need.
-
- 19 Sep, 2022 2 commits
-
-
Committed by wuhuachaocoding
-
Committed by wuhuachaocoding
* refactor mp.
* update setup.py.
* update mp_layers.py for compatibility.
* add documents for mp_layers.py
* update init.py
* update collective.py.
* update.
* update mp_ops.py
* update.
* update code style.
* update code style.
-
- 09 Sep, 2022 1 commit
-
-
Committed by Yuang Liu
-
- 07 Sep, 2022 1 commit
-
-
Committed by Yuang Liu
-
- 06 Sep, 2022 1 commit
-
-
Committed by Yuang Liu
-
- 02 Sep, 2022 1 commit
-
-
Committed by wuhuachaocoding
-
- 26 Aug, 2022 3 commits
-
-
Committed by Yuang Liu
-
Committed by wanghuancoder
-
Committed by Yuang Liu
-
- 16 Aug, 2022 1 commit
-
-
Committed by Haohongxiang
* reconstruct_of_fleet_api
* update
-
- 12 Aug, 2022 1 commit
-
-
Committed by hong
-
- 10 Aug, 2022 1 commit
-
-
Committed by Aurelius84
* [OpAttr] Support VarDesc* and vector<VarDesc*> in Attribute
* add unittest for inference predictor
-
- 09 Aug, 2022 1 commit
-
-
Committed by Yuang Liu
-
- 22 Jul, 2022 1 commit
-
-
Committed by Haohongxiang
-
- 13 Jul, 2022 2 commits
-
-
Committed by ShenLiang
-
Committed by Jiabin Yang
* fix sharding in eager
* support eager sharding
-
- 27 Jun, 2022 1 commit
-
-
Committed by wanghuancoder
* rename eagerpylayer
-
- 14 Jun, 2022 1 commit
-
-
Committed by zlsh80826
* Replace np.bool/np.bool8 with np.bool_
* Replace np.object with np.object_
* Replace np.complex with np.complex128
* Replace np.float with np.float64
* Replace np.int with np.int_
* Rerun pre-commit for newer pre-commit configuration
* Use builtin bool instead of np.bool_ based on the context
-
- 05 Jun, 2022 1 commit
-
-
Committed by Sing_chan
* Use yapf to format all Python files.
* Exclude two unit test files from yapf because they rely on writing and reading files, and formatting would break them.
* Disable diff_py_file because too many diff files cause the following command to fail.
-
- 26 May, 2022 1 commit
-
-
Committed by ShenLiang
-
- 25 May, 2022 1 commit
-
-
Committed by Baibaifan
* fix_sharding3_offload
* fix_fp16dtype_bug
-
- 16 May, 2022 1 commit
-
-
Committed by ShenLiang
* fix recompute in mp
* fix recompute
-
- 19 Apr, 2022 1 commit
-
-
Committed by ShenLiang
* fix utest
* fix time
-
- 16 Apr, 2022 1 commit
-
-
Committed by Baibaifan
-
- 13 Apr, 2022 1 commit
-
-
Committed by Baibaifan
-
- 25 Mar, 2022 1 commit
-
-
Committed by Jiabin Yang
* refactor eager flags
* fix flags error when we switch from eager to dygraph
* fix ci problem
* fix ci
* fix ci
* merge develop and fix code style
* merge develop and fix code style
* fix op test error
* fix op test error
* fix op test error
* fix op test error
* fix op test error
* merge develop
-
- 18 Mar, 2022 1 commit
-
-
Committed by Baibaifan
-
- 14 Mar, 2022 1 commit
-
-
Committed by Baibaifan
-
- 11 Mar, 2022 1 commit
-
-
Committed by Baibaifan
-
- 09 Mar, 2022 1 commit
-
-
Committed by Baibaifan
-