- 02 10月, 2021 1 次提交
-
-
由 Hyunwoong Ko 提交于
* Add flexibility of pipeline module and engine * Separate PRs * Separate PRs Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 01 10月, 2021 5 次提交
-
-
由 Jeff Rasley 提交于
-
由 Alex Hedges 提交于
* Fix spelling errors in inference tutorial * Remove unused imports in inference tutorial * Fix inference tutorial code to work with 1 GPU
-
由 Reza Yazdani 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Manuel R. Ciosici 提交于
-
由 Jeff Rasley 提交于
-
- 30 9月, 2021 2 次提交
-
-
由 Stas Bekman 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Jeff Rasley 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com> Co-authored-by: NShaden Smith <shaden.smith@microsoft.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NSamyam Rajbhandari <samyamr@microsoft.com> Co-authored-by: NReza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com> Co-authored-by: Neltonzheng <eltonz@microsoft.com> Co-authored-by: NStas Bekman <stas00@users.noreply.github.com>
-
- 29 9月, 2021 1 次提交
-
-
由 dependabot[bot] 提交于
Bumps [nokogiri](https://github.com/sparklemotion/nokogiri) from 1.11.4 to 1.12.5. - [Release notes](https://github.com/sparklemotion/nokogiri/releases) - [Changelog](https://github.com/sparklemotion/nokogiri/blob/main/CHANGELOG.md) - [Commits](https://github.com/sparklemotion/nokogiri/compare/v1.11.4...v1.12.5) --- updated-dependencies: - dependency-name: nokogiri dependency-type: indirect ... Signed-off-by: Ndependabot[bot] <support@github.com> Co-authored-by: Ndependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 28 9月, 2021 1 次提交
-
-
由 Jeff Rasley 提交于
* install HF w. dev extra to get all required packages * switch ds.init to use param dict instead of json file on disk * switch back to 'testing' extra
-
- 25 9月, 2021 1 次提交
-
-
由 Ammar Ahmad Awan 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com> Co-authored-by: NStas Bekman <stas00@users.noreply.github.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 23 9月, 2021 1 次提交
-
-
由 Ammar Ahmad Awan 提交于
-
- 22 9月, 2021 1 次提交
-
-
由 Stas Bekman 提交于
* [zero_to_fp32] fix padding removal * style * fix comments Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 21 9月, 2021 2 次提交
-
-
由 Cheng Li 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Jeff Rasley 提交于
Co-authored-by: NReza Yazdani <reyazda@microsoft.com>
-
- 18 9月, 2021 1 次提交
-
-
由 Adam Moody 提交于
-
- 17 9月, 2021 2 次提交
-
-
由 Jeff Rasley 提交于
-
由 Stas Bekman 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 16 9月, 2021 2 次提交
-
-
由 Stas Bekman 提交于
* [zero Init] fix regression * clean up the warning
-
由 Sean Naren 提交于
-
- 15 9月, 2021 1 次提交
-
-
由 Jeff Rasley 提交于
-
- 14 9月, 2021 1 次提交
-
-
由 Anurag Kumar 提交于
updated classifiers
-
- 11 9月, 2021 2 次提交
-
-
由 eltonzheng 提交于
-
由 Ammar Ahmad Awan 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 10 9月, 2021 6 次提交
-
-
由 Jeff Rasley 提交于
* pass GAS boundary state from PP -> ZeRO * formatting Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Jeff Rasley 提交于
-
由 Jeff Rasley 提交于
-
由 Hyunwoong Ko 提交于
-
由 Aswin John Mathews 提交于
* Added 4-byte alignment on NCCL/RCCL * pre-commit formatting fixes * Fix for checkpoint loading with optimizer partitioning * Better assert print * Added unit tests for nccl/rccl 4-byte alignment * bug * Updated alignment to implicit Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Jeff Rasley 提交于
-
- 09 9月, 2021 1 次提交
-
-
由 Jeff Rasley 提交于
-
- 03 9月, 2021 1 次提交
-
-
由 Jeff Rasley 提交于
-
- 02 9月, 2021 6 次提交
-
-
由 Jeff Rasley 提交于
-
由 Jeff Rasley 提交于
-
由 Jeff Rasley 提交于
-
由 Olatunji Ruwase 提交于
-
由 Olatunji Ruwase 提交于
-
由 Hari Prasad 提交于
* Added drop_last to DeepSpeedDataLoader This solves issue #326 * Updated drop_last in engine.py added drop_last as a ds_config as mentioned by @tjruwase * Update engine.py * Update engine.py * updated config.py and constants.py * Update constants.py * added dataloader_ prefix * Update dataloader.py * corrected yapf test errors * Update test_data.py Added dataloader_drop_last unit test * Corrected yapf and formatting issues * updated simple_model.py and test_data.py * Update simple_model.py * pre-commit fix * corrected issues * Update test_data.py * Update test_data.py * Update test_data.py * Update test_data.py * removed batch_size from test_data.py * Update simple_model.py * Update test_data.py * Update test_data.py * Fix unit test issues * Use fp32 to make things work Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 31 8月, 2021 2 次提交
-
-
由 Ammar Ahmad Awan 提交于
* Remove the wrong function with duplicate name * fix format. * add mpu check. fix tests.
-
由 Stas Bekman 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-