- 19 4月, 2023 2 次提交
-
-
由 Bhavya Medishetty 提交于
* temporary WAR workaround till __double2half support enabled in HIP * workaround only for hipcc --------- Co-authored-by: NLogan Adams <114770087+loadams@users.noreply.github.com>
-
由 Michael Wyatt 提交于
* updated cupy install * do non-isolated pip install * Update action.yml
-
- 18 4月, 2023 5 次提交
-
-
由 digger-yu 提交于
Optimization Code 1. Use #!/usr/bin/env bash instead of #!/bin/bash to make the script more portable. 2. Use rm -rf instead of rm -r to remove directories recursively. Co-authored-by: NLogan Adams <114770087+loadams@users.noreply.github.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Heyang Qin 提交于
* Fixes for asymmetric quantization * addtional offset to further improve accuracy * put the 0.5 into offset rather than applying it later * update unit test for quantization * fix format * attempt to fix format --------- Co-authored-by: NConnor Holmes <connorholmes@microsoft.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Ikko Eltociear Ashimine 提交于
resutls -> results Co-authored-by: NLogan Adams <114770087+loadams@users.noreply.github.com>
-
由 Jirka Borovec 提交于
update link to PL docs talking about DeepSpeed integration
-
由 Stas Bekman 提交于
-
- 15 4月, 2023 4 次提交
-
-
由 Dogukan Uraz Tuna 提交于
Co-authored-by: NMichael Wyatt <michaelwyatt@microsoft.com>
-
由 Michael Wyatt 提交于
-
由 Michael Wyatt 提交于
-
由 Logan Adams 提交于
* Update torch version check in building sparse_attn * Update triton check * Fix error message * Format fixes * Test with triton 2 * Change requirements back * fix to just prevent 2.0.0 * Fix formatting --------- Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 14 4月, 2023 3 次提交
-
-
由 Masahiro Tanaka 提交于
* support nesting zero.Init() and dynamically defined module * throw an error if model class defined in zero.Init is not wrapped * fix check on new classes that are not wrapped in zero.Init() * add tests of nesting zero.Init() and dynamically defined classes * fix tests for zero.Init * fix style --------- Co-authored-by: NLogan Adams <114770087+loadams@users.noreply.github.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Michael Wyatt 提交于
* update docs to reflect changes in deepspeed-chat training script * add blogs to ignored changes in unit tests
-
由 Michael Wyatt 提交于
-
- 13 4月, 2023 5 次提交
-
-
由 Jeff Rasley 提交于
-
由 Ma, Guokai 提交于
* add fallback path for kernels used in megatron * temporary numactl WA for SPR 56core * adapt core allocation according to number of ranks * add switch to turn on numactl * detect number of cores on the system * allow select a subset of the cores on the system to bind * remove unneeded changes * use current_env to set OMP_NUM_THREADS in subprocess * add test for ds_arguments * change --bind_cores_to_rank option to store_true * add test for parse_range_list * add comment for parse range list * add test for parse range list, rewrite parse_range_list * fix format error * fix format * add -m parameter to numactl when necessary * Check KMP_AFFINITY to avoid conflict with numactl * fix format * negative case for parse_range_list * detect whether numactl is installed before use numactl to bind cores * check numactl with package manager of distro --------- Co-authored-by: Nsdp <sdp@aia-sdp-spr-108864.jf.intel.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Logan Adams 提交于
* Update AMD workflows * Update MI200 test flow to use torch latest * Update tolerances to values that pass (will fix before completing PR) * Revert chyanges to atol * Rename workflows * Fix CI badges
-
由 Conglong Li 提交于
-
由 Alexander van Eck 提交于
* feat: Add support for `NamedTuple` when sharding parameters [#3029] * Formatting --------- Co-authored-by: NAlexander van Eck <alexander.vaneck@paige.ai> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 12 4月, 2023 10 次提交
-
-
由 Conglong Li 提交于
-
由 Masahiro Tanaka 提交于
Co-authored-by: NConglong Li <conglong.li@gmail.com>
-
由 Zhewei Yao 提交于
Co-authored-by: NAmmar Ahmad Awan <ammar.awan@microsoft.com>
-
由 dawei-wang 提交于
Fix microsoft/DeepSpeed#3163
-
由 Zaida Zhou 提交于
-
由 Masahiro Tanaka 提交于
Co-authored-by: NMichael Wyatt <michaelwyatt@microsoft.com>
-
由 Olatunji Ruwase 提交于
Co-authored-by: NReza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com> Co-authored-by: Nyaozhewei <zheweiy@berkeley.edu> Co-authored-by: NAmmar Ahmad Awan <ammar.awan@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com> Co-authored-by: NConnor Holmes <connorholmes@microsoft.com> Co-authored-by: NLok Chand Koppaka <lokoppak@microsoft.com> Co-authored-by: NMasahiro Tanaka <81312776+tohtana@users.noreply.github.com> Co-authored-by: NMichael Wyatt <michaelwyatt@microsoft.com>
-
由 Ammar Ahmad Awan 提交于
-
由 Ammar Ahmad Awan 提交于
-
由 Ammar Ahmad Awan 提交于
Co-authored-by: NSamyam Rajbhandari <samyamr@microsoft.com> Co-authored-by: NReza Yazdani <reyazda@microsoft.com> Co-authored-by: NSamyam Rajbhandari <samyamr@microsoft.com> Co-authored-by: NConglong Li <conglong.li@gmail.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com> Co-authored-by: NZhewei Yao <zheweiyao@gmail.com>
-
- 11 4月, 2023 1 次提交
-
-
由 郭叶军 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 08 4月, 2023 3 次提交
-
-
由 Olatunji Ruwase 提交于
* Persist params in zero.Init * Disable debug prints * Formatting * Avoid offloading persisted params * Simplify world_size=1 * Formatting * Remove pdb * Restructure * Formating * Formatting * Apply persistence only if ds_config available * Fix typo * add util function for getting pydantic config default values --------- Co-authored-by: NMichael Wyatt <michaelwyatt@microsoft.com>
-
由 Logan Adams 提交于
This reverts commit 0f8b5da6.
-
由 Logan Adams 提交于
-
- 07 4月, 2023 6 次提交
-
-
由 Adam Moody 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Bing Xie 提交于
Co-authored-by: NBing Xie <bingxie@BINGHYPC014.redmond.corp.microsoft.com> Co-authored-by: NShaden Smith <Shaden.Smith@microsoft.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Michael Wyatt 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Gavin Goodship 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Michael Wyatt 提交于
* fix copyright script and add replace-copyright script
-
由 Molly Smith 提交于
cross attention kwargs and vae config for diffusers 0.14.0
-
- 06 4月, 2023 1 次提交
-
-
由 ShijieZZZZ 提交于
* submit changes * update format * fix fomrat * revert * test * add top * treat z1 as z2 * revert --------- Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-