- 06 4月, 2023 5 次提交
-
-
由 Vincent Hellendoorn 提交于
This was breaking the link to the Pyramid MoE paper on Arxiv
-
由 Earlee 提交于
Co-authored-by: NLogan Adams <114770087+loadams@users.noreply.github.com>
-
由 Logan Adams 提交于
* Replace old torch version checks with existing function * Clean up formatting
-
由 Stas Bekman 提交于
Co-authored-by: NLogan Adams <114770087+loadams@users.noreply.github.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Stas Bekman 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 05 4月, 2023 6 次提交
-
-
由 Wang, Yi 提交于
Signed-off-by: NWang, Yi A <yi.a.wang@intel.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Olatunji Ruwase 提交于
-
由 Molly Smith 提交于
* Simplify kernel * Coalesce memory attempt 1. Logits divergence. * Logits fix? * sync after every global mem access * template on iterations. Down to 8.3% cuda time for 8k tokens * Up to 64 iterations * Add alibi/mask check * fp32 * Revert builder.py * naming. precommit * Revert "naming. precommit" This reverts commit 150eb7d9. * naming. spacing * Spacing. simplify checks * remove bsyncs * missed bsyncs * precommit
-
由 Michael Wyatt 提交于
-
由 Olatunji Ruwase 提交于
-
由 Lev Kurilenko 提交于
-
- 01 4月, 2023 3 次提交
-
-
由 Jeff Rasley 提交于
-
由 Zhewei Yao 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Michael Wyatt 提交于
-
- 31 3月, 2023 1 次提交
-
-
由 Michael Wyatt 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 30 3月, 2023 2 次提交
-
-
由 Olatunji Ruwase 提交于
* Make fp32 default communication data type * Fix asserts
-
由 Mayank Mishra 提交于
*
💩 drop dead code *♻ replace has_all_gather_base with has_all_gather_into_tensor *♻ remove deprecated _all_gather_base *♻ remove deprecated _reduce_scatter_base *🎨 reformat files *🔧 fix _six * Trigger CI * Trigger CI * Trigger CI *🎨 formatting * incorporate suggestion * incorporate suggestion --------- Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 29 3月, 2023 1 次提交
-
-
由 Michael Wyatt 提交于
* disable CPUAdam pathways in optimizer copy/step * Update stage_1_and_2.py --------- Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 28 3月, 2023 1 次提交
-
-
由 Quentin Anthony 提交于
* Fix benchmark import issues and support MPI launching with pure torch.dist * Formatting * Update comms benchmark README * Formatting * Added better error handling and support MPI torch.dist backend * Update formatting versions * Formatting again * Trigger CI --------- Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 27 3月, 2023 1 次提交
-
-
由 Jeff Rasley 提交于
-
- 24 3月, 2023 6 次提交
-
-
由 Logan Adams 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Ma, Guokai 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Olatunji Ruwase 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Michael Wyatt 提交于
* bump torch18 -> torch19 * fix gptj --------- Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Satpal Singh Rathore 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 FreyaRao 提交于
Co-authored-by: NQinghuan Rao <qinghuanrao@microsoft.com>
-
- 22 3月, 2023 7 次提交
-
-
由 Connor Holmes 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Molly Smith 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Mor Zusman 提交于
Co-authored-by: NMor Zusman <morz@ai21.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Jeff Rasley 提交于
-
由 Molly Smith 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Logan Adams 提交于
Co-authored-by: NMichael Wyatt <michaelwyatt@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Quentin Anthony 提交于
-
- 18 3月, 2023 1 次提交
-
-
由 Satpal Singh Rathore 提交于
-
- 16 3月, 2023 1 次提交
-
-
由 Jeff Rasley 提交于
-
- 15 3月, 2023 5 次提交
-
-
由 Quentin Anthony 提交于
* Improve overflow logs * Trigger CI --------- Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Joe Mayer 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Stas Bekman 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Joe Mayer 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Stas Bekman 提交于
-