- 29 6月, 2021 1 次提交
-
-
由 Stas Bekman 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 26 6月, 2021 1 次提交
-
-
由 Stas Bekman 提交于
* undo noise * another
-
- 24 6月, 2021 4 次提交
-
-
由 Hyunwoong Ko 提交于
* Fix bugs about non-contiguous tensor broadcasting * Fix typo Co-authored-by: NJeff Rasley <jerasley@microsoft.com> Co-authored-by: NReza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
-
由 Stas Bekman 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Stas Bekman 提交于
Co-authored-by: NSamyam Rajbhandari <samyamr@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Stas Bekman 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 18 6月, 2021 2 次提交
-
-
由 Olatunji Ruwase 提交于
-
由 Jeff Rasley 提交于
-
- 17 6月, 2021 2 次提交
-
-
由 Jeff Rasley 提交于
-
由 Samyam Rajbhandari 提交于
* largest_partitioned_params calculation fix largest partitioned params was getting calculated incorrectly * Update stage3.py * Update stage3.py * formatting fix * changing sub-group size default to 1e9 Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 09 6月, 2021 2 次提交
-
-
由 Reza Yazdani 提交于
* fix links for inference tutorial * Fix automatic injection. Add the local-attention for GPT-Neo * fix the inference for generation of large sequences (>1K & <32K) * fix format Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Stas Bekman 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 08 6月, 2021 2 次提交
-
-
由 Stas Bekman 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Stas Bekman 提交于
* fix missed subclassed partitioning bug * fix on exit Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 04 6月, 2021 1 次提交
-
-
由 Stas Bekman 提交于
fixes: s/micro_batch_per_gpu/train_micro_batch_size_per_gpu/ Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 03 6月, 2021 1 次提交
-
-
由 Reza Yazdani 提交于
* Change the sparse attention API to be compatible with latest changes on the triton side * remove compatibility checks for CUDA 11 * Update requirements-sparse_attn.txt Co-authored-by: NArash Ashari <arashari@microsoft.com>
-
- 26 5月, 2021 1 次提交
-
-
由 Reza Yazdani 提交于
-
- 25 5月, 2021 2 次提交
-
-
由 Jeff Rasley 提交于
* delay imports for replace policies and fix missing req * fix issue with _orig_layer_class always being None
-
由 Reza Yazdani 提交于
* Fix Inference and Quantization tutorial links * fix inference api * use correct attention scaling Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 24 5月, 2021 1 次提交
-
-
由 Reza Yazdani 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com> Co-authored-by: Neltonzheng <eltonz@microsoft.com> Co-authored-by: NReza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com> Co-authored-by: NShaden Smith <Shaden.Smith@microsoft.com> Co-authored-by: NReza Yazdani <reyazda@microsoft.com> Co-authored-by: NShaden Smith <Shaden.Smith@microsoft.com> Co-authored-by: NElton Zheng <eltonz@microsoft.com> Co-authored-by: NReza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com> Co-authored-by: Neltonzheng <eltonz@microsoft.com> Co-authored-by: NArash Ashari <arashari@microsoft.com> Co-authored-by: NReza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com> Co-authored-by: NShaden Smith <Shaden.Smith@microsoft.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NReza Yazdani <reyazda@microsoft.com> Co-authored-by: Nniumanar <60243342+niumanar@users.noreply.github.com> Co-authored-by: Neltonzheng <eltonz@microsoft.com> Co-authored-by: NReza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com> Co-authored-by: NShaden Smith <Shaden.Smith@microsoft.com> Co-authored-by: NReza Yazdani <reyazda@microsoft.com> Co-authored-by: NArash Ashari <arashari@microsoft.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: Nniumanar <60243342+niumanar@users.noreply.github.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com> Co-authored-by: Neltonzheng <eltonz@microsoft.com> Co-authored-by: NShaden Smith <Shaden.Smith@microsoft.com> Co-authored-by: NArash Ashari <arashari@microsoft.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: Nniumanar <60243342+niumanar@users.noreply.github.com>
-
- 22 5月, 2021 1 次提交
-
-
由 Meng, Peng 提交于
* fix Reduce Scatter default value * Update constants.py Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 21 5月, 2021 1 次提交
-
-
由 Olatunji Ruwase 提交于
* Align fp16 param wap buffers * Integrating swap buffer manager for fp16 params * Support swapping misaligned fp16 parameters * Support swap into unaligned fp16 buffer
-
- 20 5月, 2021 1 次提交
-
-
由 Jeff Rasley 提交于
-
- 19 5月, 2021 1 次提交
-
-
由 Olatunji Ruwase 提交于
* Align fp16 param wap buffers * Integrating swap buffer manager for fp16 params * Support swapping misaligned fp16 parameters
-
- 16 5月, 2021 1 次提交
-
-
由 Olatunji Ruwase 提交于
* Round robin partitioning to improve ZeRO-2 Offload CPU copy * Formatting fixes * Fix index issues in debug dumps * Remove debug prints * Code cleanup * Remove unintended stage3.py changes * Add TODO
-
- 14 5月, 2021 4 次提交
-
-
由 Shaden Smith 提交于
* is not -> != * Use pytest-randomly to seed unit tests.
-
由 Olatunji Ruwase 提交于
-
由 Stas Bekman 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Stas Bekman 提交于
* [configure_distributed_model] improve assert This PR changes the 2 asserts to actually print the names of the params that are wrong. e.g.: ``` fp16 is enabled but the following parameters have dtype that is not fp16: wav2vec2.masked_spec_embed ``` * style Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 13 5月, 2021 1 次提交
-
-
由 Cheng Li 提交于
* use the original function's name as the key to old_functions dict * update profile output format * print at global rank 0 * add flops calculation in bwd pass using time from ds timers * improve aggregated profiling out to show all depth * print samples/second * update readme and examples * update docs * fix typo and reorder printing * fix format
-
- 08 5月, 2021 1 次提交
-
-
由 Olatunji Ruwase 提交于
* Unused parameters assert should be disabled by default * Fix message * Invert assert logic in unit test * Change option for ignoring unused parameters Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 06 5月, 2021 1 次提交
-
-
由 Olatunji Ruwase 提交于
* NVMe intra-request validation should be on entire file Optimizer swap buffer sizes should be aligned * Add fix message for missing aio lib error.
-
- 04 5月, 2021 2 次提交
-
-
由 Stas Bekman 提交于
* fix assert The current assert "Model must initialized in fp16 mode for ZeRO Stage 3." needs TLC - I rewrote it completely to match its cousen assert, so now we have 2 consistent matching asserts: - f"fp16 is enabled but one or several model parameters have dtype that is not fp16" - f"fp16 is not enabled but one or several model parameters have dtype of fp16" * remove f
-
由 janEbert 提交于
Fix #1032
-
- 03 5月, 2021 1 次提交
-
-
由 Cheng Li 提交于
-
- 01 5月, 2021 5 次提交
-
-
由 Sean Naren 提交于
* Add additional conditions when checking types of output from the model * Add test * Modify test to use torch.tensor as well Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Jiangang Zhu 提交于
Co-authored-by: NJiangang Zhu <jiangazh@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Cheng Li 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Stas Bekman 提交于
-
由 Stas Bekman 提交于
-