- 08 Jul, 2021 1 commit
-
-
Committed by Jeff Rasley
-
- 02 Jul, 2021 2 commits
-
-
Committed by Samyam Rajbhandari
* contiguous gradients should be set to True by default
* Set contiguous gradients to True by default. Features such as reduce_scatter depend on contiguous gradients being True. This is also the preferred default configuration.
-
Committed by Jeff Rasley
-
- 29 Jun, 2021 1 commit
-
-
Committed by Stas Bekman
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 26 Jun, 2021 1 commit
-
-
Committed by Stas Bekman
* undo noise
* another
-
- 24 Jun, 2021 5 commits
-
-
Committed by Hyunwoong Ko
* Fix bugs about non-contiguous tensor broadcasting
* Fix typo
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Reza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
-
Committed by Stas Bekman
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Stas Bekman
Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Stas Bekman
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Jeff Rasley
-
- 22 Jun, 2021 1 commit
-
-
Committed by Jeff Rasley
-
- 18 Jun, 2021 2 commits
-
-
Committed by Olatunji Ruwase
-
Committed by Jeff Rasley
-
- 17 Jun, 2021 4 commits
-
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley
-
Committed by Olatunji Ruwase
* Fix docstring
* Make screenshots clickable for easier viewing
* Navigation menu in alphabetical order; More clickable screenshots
* Rename 1Cycle doc
* Tweak naming
* Remove no longer used flag
* ZeRO3 Offload release
* Single GPU results
* Rearrange figures
* Single GPU text
* tweak intro
* zero3-offload section
* Add asynchronous i/o docs
* Fix print_per_steps doc
-
Committed by Samyam Rajbhandari
* largest_partitioned_params calculation fix: largest partitioned params was being calculated incorrectly
* Update stage3.py
* Update stage3.py
* formatting fix
* changing sub-group size default to 1e9
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 15 Jun, 2021 1 commit
-
-
Committed by Hyunwoong Ko
* Add `import os` to inference tutorials
* assign deepspeed-initialized model to hf model
-
- 10 Jun, 2021 2 commits
-
-
Committed by eltonzheng
* Add Windows support in README, use c++17 on Windows to support latest vc build tool
* Add detailed cpp build tools version in README
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Shaden Smith
* unit test for bugfix #1135
* formatter
* fix test in presence of mpi4py
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 09 Jun, 2021 7 commits
-
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley
-
Committed by Reza Yazdani
* fix links for inference tutorial
* Fix automatic injection. Add the local-attention for GPT-Neo
* fix the inference for generation of large sequences (>1K & <32K)
* fix format
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Jeff Rasley
* remove old unit tests, should have been removed in rebase
* formatting
-
Committed by Cody
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Stas Bekman
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 08 Jun, 2021 3 commits
-
-
Committed by Stas Bekman
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
Committed by Stas Bekman
* fix missed subclassed partitioning bug
* fix on exit
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
Committed by Stas Bekman
* allow minor cuda version differences
* cleanup
* typo
-
- 04 Jun, 2021 1 commit
-
-
Committed by Stas Bekman
fixes: s/micro_batch_per_gpu/train_micro_batch_size_per_gpu/
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 03 Jun, 2021 3 commits
-
-
Committed by Stas Bekman
-
Committed by Reza Yazdani
-
Committed by Reza Yazdani
* Change the sparse attention API to be compatible with latest changes on the triton side
* remove compatibility checks for CUDA 11
* Update requirements-sparse_attn.txt
Co-authored-by: Arash Ashari <arashari@microsoft.com>
-
- 28 May, 2021 1 commit
-
-
Committed by Reza Yazdani
-
- 26 May, 2021 1 commit
-
-
Committed by Reza Yazdani
-
- 25 May, 2021 4 commits
-
-
Committed by eltonzheng
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Jeff Rasley
* delay imports for replace policies and fix missing req
* fix issue with _orig_layer_class always being None
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley
* update with inference refs
* updates
-