- 12 4月, 2023 1 次提交
-
-
由 Ammar Ahmad Awan 提交于
Co-authored-by: NSamyam Rajbhandari <samyamr@microsoft.com> Co-authored-by: NReza Yazdani <reyazda@microsoft.com> Co-authored-by: NSamyam Rajbhandari <samyamr@microsoft.com> Co-authored-by: NConglong Li <conglong.li@gmail.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com> Co-authored-by: NZhewei Yao <zheweiyao@gmail.com>
-
- 11 4月, 2023 1 次提交
-
-
由 郭叶军 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 08 4月, 2023 3 次提交
-
-
由 Olatunji Ruwase 提交于
* Persist params in zero.Init * Disable debug prints * Formatting * Avoid offloading persisted params * Simplify world_size=1 * Formatting * Remove pdb * Restructure * Formating * Formatting * Apply persistence only if ds_config available * Fix typo * add util function for getting pydantic config default values --------- Co-authored-by: NMichael Wyatt <michaelwyatt@microsoft.com>
-
由 Logan Adams 提交于
This reverts commit 0f8b5da6.
-
由 Logan Adams 提交于
-
- 07 4月, 2023 6 次提交
-
-
由 Adam Moody 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Bing Xie 提交于
Co-authored-by: NBing Xie <bingxie@BINGHYPC014.redmond.corp.microsoft.com> Co-authored-by: NShaden Smith <Shaden.Smith@microsoft.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Michael Wyatt 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Gavin Goodship 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Michael Wyatt 提交于
* fix copyright script and add replace-copyright script
-
由 Molly Smith 提交于
cross attention kwargs and vae config for diffusers 0.14.0
-
- 06 4月, 2023 8 次提交
-
-
由 ShijieZZZZ 提交于
* submit changes * update format * fix fomrat * revert * test * add top * treat z1 as z2 * revert --------- Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Logan Adams 提交于
This reverts commit 1ec34e54.
-
由 Logan Adams 提交于
-
由 Vincent Hellendoorn 提交于
This was breaking the link to the Pyramid MoE paper on Arxiv
-
由 Earlee 提交于
Co-authored-by: NLogan Adams <114770087+loadams@users.noreply.github.com>
-
由 Logan Adams 提交于
* Replace old torch version checks with existing function * Clean up formatting
-
由 Stas Bekman 提交于
Co-authored-by: NLogan Adams <114770087+loadams@users.noreply.github.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Stas Bekman 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 05 4月, 2023 6 次提交
-
-
由 Wang, Yi 提交于
Signed-off-by: NWang, Yi A <yi.a.wang@intel.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Olatunji Ruwase 提交于
-
由 Molly Smith 提交于
* Simplify kernel * Coalesce memory attempt 1. Logits divergence. * Logits fix? * sync after every global mem access * template on iterations. Down to 8.3% cuda time for 8k tokens * Up to 64 iterations * Add alibi/mask check * fp32 * Revert builder.py * naming. precommit * Revert "naming. precommit" This reverts commit 150eb7d9. * naming. spacing * Spacing. simplify checks * remove bsyncs * missed bsyncs * precommit
-
由 Michael Wyatt 提交于
-
由 Olatunji Ruwase 提交于
-
由 Lev Kurilenko 提交于
-
- 01 4月, 2023 3 次提交
-
-
由 Jeff Rasley 提交于
-
由 Zhewei Yao 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Michael Wyatt 提交于
-
- 31 3月, 2023 1 次提交
-
-
由 Michael Wyatt 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 30 3月, 2023 2 次提交
-
-
由 Olatunji Ruwase 提交于
* Make fp32 default communication data type * Fix asserts
-
由 Mayank Mishra 提交于
*
💩 drop dead code *♻ replace has_all_gather_base with has_all_gather_into_tensor *♻ remove deprecated _all_gather_base *♻ remove deprecated _reduce_scatter_base *🎨 reformat files *🔧 fix _six * Trigger CI * Trigger CI * Trigger CI *🎨 formatting * incorporate suggestion * incorporate suggestion --------- Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 29 3月, 2023 1 次提交
-
-
由 Michael Wyatt 提交于
* disable CPUAdam pathways in optimizer copy/step * Update stage_1_and_2.py --------- Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 28 3月, 2023 1 次提交
-
-
由 Quentin Anthony 提交于
* Fix benchmark import issues and support MPI launching with pure torch.dist * Formatting * Update comms benchmark README * Formatting * Added better error handling and support MPI torch.dist backend * Update formatting versions * Formatting again * Trigger CI --------- Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 27 3月, 2023 1 次提交
-
-
由 Jeff Rasley 提交于
-
- 24 3月, 2023 6 次提交
-
-
由 Logan Adams 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Ma, Guokai 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Olatunji Ruwase 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Michael Wyatt 提交于
* bump torch18 -> torch19 * fix gptj --------- Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Satpal Singh Rathore 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 FreyaRao 提交于
Co-authored-by: NQinghuan Rao <qinghuanrao@microsoft.com>
-