- 12 6月, 2020 1 次提交
-
-
由 Chunyang Wen 提交于
-
- 10 6月, 2020 1 次提交
-
-
由 Jeff Rasley 提交于
* install update: no-sudo + clean build files Co-authored-by: NShaden Smith <Shaden.Smith@microsoft.com>
-
- 09 6月, 2020 1 次提交
-
-
由 eltonzheng 提交于
-
- 06 6月, 2020 1 次提交
-
-
由 Olatunji Ruwase 提交于
* Debugging * Fix step() bug; Make step timing optional * Remove unnecessary changes * Format fixes * Replace list with scalar variable * Remove redundant code * Fix typo
-
- 05 6月, 2020 4 次提交
-
-
由 Jeff Rasley 提交于
-
由 Vidush Vishwanath 提交于
-
由 Chunyang Wen 提交于
* Add log util * replace all occurrences of print and logging * address format * disable propagate to avoid duplicate log
-
由 Shaden Smith 提交于
* links and formatting
-
- 04 6月, 2020 2 次提交
-
-
由 Ammar Ahmad Awan 提交于
Co-authored-by: NAmmar Ahmad Awan <ammar.awan@microsoft.com>
-
由 eltonzheng 提交于
-
- 30 5月, 2020 9 次提交
-
-
由 Jeff Rasley 提交于
-
由 Samyam Rajbhandari 提交于
-
由 Shaden Smith 提交于
-
由 Jeff Rasley 提交于
-
由 Shaden Smith 提交于
-
由 Shaden Smith 提交于
-
由 Jeff Rasley 提交于
-
由 Jeff Rasley 提交于
-
由 Jeff Rasley 提交于
* Transformer kernels release Co-authored-by: NShaden Smith <ShadenTSmith@gmail.com> Co-authored-by: NElton Zheng <eltonz@microsoft.com> Co-authored-by: NReza Yazdani <reyazda@microsoft.com> Co-authored-by: NRezaYazdaniAminabadi <44502768+RezaYazdaniAminabadi@users.noreply.github.com> Co-authored-by: NTunji Ruwase <olruwase@microsoft.com> Co-authored-by: NShaden Smith <ShadenTSmith@gmail.com> Co-authored-by: NShaden Smith <Shaden.Smith@microsoft.com> Co-authored-by: NSamyam Rajbhandari <samyamr@microsoft.com> Co-authored-by: NShaden Smith <ShadenTSmith@gmail.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com> Co-authored-by: NSamyam Rajbhandari <samyamr@microsoft.com> Co-authored-by: NShaden Smith <ShadenTSmith@gmail.com> Co-authored-by: NElton Zheng <eltonz@microsoft.com> Co-authored-by: NReza Yazdani <reyazda@microsoft.com> Co-authored-by: NRezaYazdaniAminabadi <44502768+RezaYazdaniAminabadi@users.noreply.github.com> Co-authored-by: NTunji Ruwase <olruwase@microsoft.com> Co-authored-by: NShaden Smith <Shaden.Smith@microsoft.com> Co-authored-by: NSamyam Rajbhandari <samyamr@microsoft.com>
-
- 29 5月, 2020 1 次提交
-
-
由 Chunyang Wen 提交于
* fix: typo in code docs * more pythonic code
-
- 28 5月, 2020 5 次提交
-
-
由 Chunyang Wen 提交于
Co-authored-by: NShaden Smith <Shaden.Smith@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Jeff Rasley 提交于
-
由 Jeff Rasley 提交于
* add support for predivide as a flag * add predivide json config, remove allgather_disable (as it's not currently used anymore)
-
由 Samyam Rajbhandari 提交于
Contiguous Gradients should be set to false by default. Its not useful unless the model is very large
-
由 Samyam Rajbhandari 提交于
* Fix for CPU memory Bloating Issue caused by pyorch backward graph creation in allgather. Fixed by calling detach on tensors before calling all_gather * Fix for CPU memory Bloating Issue caused by pyorch backward graph creation in allgather. Fixed by calling detach on tensors before calling all_gather * Fix for CPU memory Bloating Issue caused by pyorch backward graph creation in allgather. Fixed by calling detach on tensors before calling all_gather
-
- 27 5月, 2020 2 次提交
-
-
由 Jeff Rasley 提交于
* updates to support fp32 grad clipping and disable max_grad_norm
-
由 Shaden Smith 提交于
-
- 25 5月, 2020 1 次提交
-
-
由 Chunyang Wen 提交于
-
- 22 5月, 2020 2 次提交
-
-
由 Shaden Smith 提交于
-
由 Shaden Smith 提交于
-
- 21 5月, 2020 1 次提交
-
-
由 Jeff Rasley 提交于
-
- 19 5月, 2020 7 次提交
-
-
由 Shaden Smith 提交于
-
由 Shaden Smith 提交于
-
由 Shaden Smith 提交于
* BERT title
-
由 Shaden Smith 提交于
-
由 Shaden Smith 提交于
-
由 Jeff Rasley 提交于
Updates for ZeRO stage 2 + ZeRO stage 1 w. RS Co-authored-by: NTunji Ruwase <olruwase@microsoft.com> Co-authored-by: NSamyam Rajbhandari <samyamr@microsoft.com> Co-authored-by: NShaden Smith <ShadenTSmith@gmail.com> Co-authored-by: NElton Zheng <eltonz@microsoft.com> Co-authored-by: NShaden Smith <Shaden.Smith@microsoft.com> Co-authored-by: Nyuxionghe <yuxhe@microsoft.com> Co-authored-by: NArash Ashari <arashari@microsoft.com>
-
由 Arash Ashari 提交于
* adding BingSqaud e2e test * updating the draft test; bring final step under try section * finalizinf test for base deepspeed and deepspeed with ZeRO * applying the comment (thanks Jeff); fixed formatting
-
- 15 5月, 2020 1 次提交
-
-
由 Shaden Smith 提交于
-
- 14 5月, 2020 1 次提交
-
-
由 Jeff Rasley 提交于
-