- 18 2月, 2021 3 次提交
-
-
由 Conglong Li 提交于
-
由 Jeff Rasley 提交于
-
由 Takuya Makino 提交于
-
- 17 2月, 2021 2 次提交
-
-
由 Olatunji Ruwase 提交于
* Fix docstring * Make screenshots clickable for easier viewing * Navigation menu in alphabetical order; More clicable screenshots * Rename 1Cycle doc * Tweak naming
-
由 Cheng Li 提交于
* check none tensors when splitting buckets
-
- 13 2月, 2021 4 次提交
-
-
由 Olatunji Ruwase 提交于
* Activation checkpoint support for non tensor input/output * Format fixes * Address PR comments; Add ordering edge case tests
-
由 Jeff Rasley 提交于
* add -e/--examples flag to checkout submodules * bump DSE commit
-
由 Stas Bekman 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Sean Naren 提交于
* Use log dist function instead of print * Expose ranks Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 12 2月, 2021 2 次提交
-
-
由 Conglong Li 提交于
* 1-bit adam doc fix * 1-bit adam doc fix * 1-bit adam doc fix Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 sdtblck 提交于
-
- 11 2月, 2021 4 次提交
-
-
由 Sean Naren 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Cheng Li 提交于
* work on flops profiler tutorial * update flops profiler tutorial * add flops profiler tutorial and fix names * work on flops profiler tutorial * update flops profiler tutorial * add flops profiler tutorial and fix names * fix tailing ws * fix names * remove multistep profiling and update docs * fix cases where functionals and submodules coexist in a parent module, update readme * fix typo * always invoke post hook function * fix module flops sum and update tests * update tutorial
-
由 Olatunji Ruwase 提交于
* Fix docstring * Make screenshots clickable for easier viewing
-
由 Stas Bekman 提交于
-
- 09 2月, 2021 2 次提交
-
-
由 TheDudeFromCI 提交于
-
由 Jon Eyolfson 提交于
* Improve starred expressions `deepspeed/profiling/flops_profiler/profiler.py` uses starred expressions that are no longer valid with [PEP 617][1]. The new Python parser is in 3.9, and this change allows DeepSpeed to run with the newest Python version. I have not checked all locations that has this issue. However, this change allows me to run simple examples. [1]: https://www.python.org/dev/peps/pep-0617/ * Match style for "Improve starred expressions", although readability suffers The style guide might need to be updated for this new use case of expressions. Python [Issue 40631][1] includes more discussion on the change. [1]: https://bugs.python.org/issue40631Co-authored-by: NCheng Li <pistasable@gmail.com>
-
- 05 2月, 2021 1 次提交
-
-
由 Stas Bekman 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 03 2月, 2021 1 次提交
-
-
由 Jeff Rasley 提交于
-
- 02 2月, 2021 3 次提交
-
-
由 Jeff Rasley 提交于
-
由 Jon Eyolfson 提交于
* Add executable permission to `ds_elastic` and `ds_report` in `bin`. * Automatic `ds_elastic` formatting Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Jeff Rasley 提交于
-
- 30 1月, 2021 2 次提交
-
-
由 Shaden Smith 提交于
-
由 Jeff Rasley 提交于
-
- 28 1月, 2021 3 次提交
-
-
由 Jeff Rasley 提交于
-
由 Jeff Rasley 提交于
-
由 Jeff Rasley 提交于
-
- 26 1月, 2021 2 次提交
-
-
由 Taka152 提交于
* fix wrong idx bug in invertible LayerNormBackward1 this index bug cause wrong scale grad * fix unexpected deletion * fix idx for LayerNormBackward1_fused_add * move pos defination in LayerNormBackward1 kernels * fix format error Co-authored-by: NReza Yazdani <reyazda@microsoft.com>
-
由 sdtblck 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 21 1月, 2021 3 次提交
-
-
由 Leo Gao 提交于
* Fix ZeRO 2 + Pipelining
-
由 Shaden Smith 提交于
-
由 Stas Bekman 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 20 1月, 2021 1 次提交
-
-
由 Jeff Rasley 提交于
* Update README.md * Update index.md
-
- 16 1月, 2021 4 次提交
-
-
由 Stas Bekman 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Stas Bekman 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Stas Bekman 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Jeff Rasley 提交于
-
- 15 1月, 2021 2 次提交
-
-
由 Olatunji Ruwase 提交于
-
由 Jeff Rasley 提交于
-
- 13 1月, 2021 1 次提交
-
-
由 Reza Yazdani 提交于
* move workspace memory-allocation to PyTorch * refine the code based on the comments * remove unnecessary options * remove bsz from set_seq_len function
-