- 05 Feb, 2021 1 commit
-
-
Committed by Stas Bekman
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 03 Feb, 2021 1 commit
-
-
Committed by Jeff Rasley
-
- 02 Feb, 2021 3 commits
-
-
Committed by Jeff Rasley
-
Committed by Jon Eyolfson
* Add executable permission to `ds_elastic` and `ds_report` in `bin`.
* Automatic `ds_elastic` formatting
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Jeff Rasley
-
- 30 Jan, 2021 2 commits
-
-
Committed by Shaden Smith
-
Committed by Jeff Rasley
-
- 28 Jan, 2021 3 commits
-
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley
-
- 26 Jan, 2021 2 commits
-
-
Committed by Taka152
* fix wrong idx bug in invertible LayerNormBackward1; this index bug causes wrong scale grad
* fix unexpected deletion
* fix idx for LayerNormBackward1_fused_add
* move pos definition in LayerNormBackward1 kernels
* fix format error
Co-authored-by: Reza Yazdani <reyazda@microsoft.com>
-
Committed by sdtblck
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 21 Jan, 2021 3 commits
-
-
Committed by Leo Gao
* Fix ZeRO 2 + Pipelining
-
Committed by Shaden Smith
-
Committed by Stas Bekman
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 20 Jan, 2021 1 commit
-
-
Committed by Jeff Rasley
* Update README.md
* Update index.md
-
- 16 Jan, 2021 4 commits
-
-
Committed by Stas Bekman
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Stas Bekman
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Stas Bekman
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Jeff Rasley
-
- 15 Jan, 2021 2 commits
-
-
Committed by Olatunji Ruwase
-
Committed by Jeff Rasley
-
- 13 Jan, 2021 3 commits
-
-
Committed by Reza Yazdani
* move workspace memory-allocation to PyTorch
* refine the code based on the comments
* remove unnecessary options
* remove bsz from set_seq_len function
-
Committed by Cheng Li
Co-authored-by: Cheng Li <pistasable@gmail.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Shaden Smith
Special thanks to @g-karthik for tracking this issue down.
-
- 09 Jan, 2021 5 commits
-
-
Committed by Olatunji Ruwase
* Add Linear warmup+decay lr schedule. Update lr schedule unit tests
* LR scheduler unit tests for LR Range Test and 1Cycle
* Disable yapf to preserve parameterization
* Disable test_pipe.py for CI debugging
* Disable test_lr_scheduler for CI debugging
* Disable test_lr_scheduler for CI debugging
* Enable all unit tests for CI debugging
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Jeff Rasley
-
Committed by Ammar Ahmad Awan
* Remove a very verbose print statement.
* Update engine.py
-
Committed by Jeff Rasley
-
Committed by Stas Bekman
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 08 Jan, 2021 2 commits
-
-
Committed by Jeff Rasley
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
Committed by dependabot[bot]
Bumps [nokogiri](https://github.com/sparklemotion/nokogiri) from 1.10.10 to 1.11.0.
- [Release notes](https://github.com/sparklemotion/nokogiri/releases)
- [Changelog](https://github.com/sparklemotion/nokogiri/blob/master/CHANGELOG.md)
- [Commits](https://github.com/sparklemotion/nokogiri/compare/v1.10.10...v1.11.0)
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 07 Jan, 2021 2 commits
-
-
Committed by Xingjian Shi
-
Committed by Jeff Rasley
Co-authored-by: Reza Yazdani <reyazda@microsoft.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 06 Jan, 2021 4 commits
-
-
Committed by Olatunji Ruwase
-
Committed by brett koonce
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Ammar Ahmad Awan
-
Committed by gcooper-isi
Allow DeepSpeed models to be initialized with optimizer=None
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
-
- 05 Jan, 2021 2 commits
-
-
Committed by Olatunji Ruwase
-
Committed by Jeff Rasley
-