- 20 Nov, 2020 3 commits
-
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley
* zero-1 memory fix
* auto-tune max elems per comm to reduce padding/comm intervals
* clean-up and added previously missing reduction options
* fix testing backing to work with torch1.7
-
- 19 Nov, 2020 2 commits
-
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley
-
- 18 Nov, 2020 1 commit
-
-
Committed by Olatunji Ruwase
* Fix layout bug in ZeRO Stage 1 checkpoint logic. Add elastic checkpoint option for ZeRO stage 1, default to True
* Format fixes
-
- 14 Nov, 2020 2 commits
-
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley
* remove cpu-feature
* remove psutils requirement
-
- 13 Nov, 2020 3 commits
-
-
Committed by Shaden Smith
* Adds torch install requirement to documentation.
* build ops documentation
-
Committed by Jeff Rasley
* on cpu box error gracefully if cuda home doesn't exist
* guard against torch import issue
* fix syntax error
* fix import
-
Committed by Jeff Rasley
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Reza Yazdani <reyazda@microsoft.com>
-
- 12 Nov, 2020 2 commits
-
-
Committed by Samyam Rajbhandari
* Update zero.md: update to ZeRO tutorial to specify the use of activation checkpointing
* Update zero-offload.md: use activation checkpointing with ZeRO-Offload
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Jeff Rasley
-
- 11 Nov, 2020 1 commit
-
-
Committed by Olatunji Ruwase
* Progressive layer dropping docs (#499)
* test
* Adding tutorial and news page for pld
* updating the tutorial and posts of PLD
* update the finetune tutorial
* Update PLD tutorial (#512)
* Update installation instructions
* Format fix
* ZeRO tutorial
* Format fixes
* ZeRO-Offload
* ZeRO and ZeRO-Offload tutorials
* Update navigation page
* Format fixes
* Add yuxhe feedback
* Fix blog post link
* Fix OneBit-Adam link Tweak scheduler example
* Fix date link
* Add DeepSpeed_Adam
* Add PLD tutorial to navigation
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
* updating the pld docs
* DeepSpeed implementation of PLD (#508)
* DeepSpeed implementation of PLD
* Format fixes
* Formatting fixes
* Fix broken url
* Address PR feedback
* Bump DSE
Co-authored-by: Minjia Zhang <33713995+minjiaz@users.noreply.github.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Minjia Zhang <minjiaz@microsoft.com>
-
- 10 Nov, 2020 3 commits
-
-
Committed by Minjia Zhang
-
Committed by Olatunji Ruwase
* PLD documentation
* Formatting fixes
* Fix url bug
-
Committed by Olatunji Ruwase
* PLD documentation
* Formatting fixes
-
- 06 Nov, 2020 1 commit
-
-
Committed by Reza Yazdani
* fixing cpu-adam
* fixing copy with optimizer for data and model parallelism
* fixing cpu-adam
* fix cpu-adam
* fix cpu-adam
-
- 31 Oct, 2020 2 commits
-
-
Committed by Reza Yazdani
-
Committed by Reza Yazdani
* add adamW to CPU-ADAM implementation
* supporting cpu-adam optimizer for zero-offload on deepspeed side
* bump DSE to match cpu-adam updates
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 20 Oct, 2020 1 commit
-
-
Committed by Shaden Smith
-
- 15 Oct, 2020 1 commit
-
-
Committed by Jeff Rasley
add compute cap of 6.0, support p100
-
- 13 Oct, 2020 3 commits
-
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley
add compute cap of 6.0 to transformer kernels
-
Committed by Reza Yazdani
-
- 11 Oct, 2020 1 commit
-
-
Committed by Olatunji Ruwase
* Update installation instructions
* Format fix
* ZeRO tutorial
* Format fixes
* ZeRO-Offload
* ZeRO and ZeRO-Offload tutorials
* Update navigation page
* Format fixes
* Add yuxhe feedback
* Fix blog post link
* Fix OneBit-Adam link Tweak scheduler example
* Fix date link
* Add DeepSpeed_Adam
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 08 Oct, 2020 1 commit
-
-
Committed by Sylwester Klocek
* Fix printing momentum for non-deepspeed optimizer
* fix momentum access for Adam
-
- 07 Oct, 2020 3 commits
-
-
Committed by niumanar
* gan tutorial
* formatting fix
* adding pointer to repo; adding navigation link
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley
-
- 06 Oct, 2020 1 commit
-
-
Committed by Sylwester Klocek
Do not load optimizer states for fp32 if flag set to False.
-
- 01 Oct, 2020 1 commit
-
-
Committed by Bruno
* Towards Windows build
* formatting
Co-authored-by: Bruno Cabral <bruno@potelo.com.br>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 30 Sep, 2020 1 commit
-
-
Committed by Olatunji Ruwase
* Disable default installation of CPU Adam
* Handle cpufeature import/use errors separately
-
- 28 Sep, 2020 1 commit
-
-
Committed by Haibin Lin
-
- 25 Sep, 2020 5 commits
-
-
Committed by Shaden Smith
-
Committed by Shaden Smith
-
Committed by Haibin Lin
-
Committed by Conglong Li
-
Committed by Conglong Li
* url fix
* revert absolute path but keep some actual fix
* add real readme
-
- 22 Sep, 2020 1 commit
-
-
Committed by RezaYazdaniAminabadi
Co-authored-by: Conglong Li <conglong.li@gmail.com>
-