- 22 Apr, 2023 (3 commits)
-
-
Committed by Molly Smith
* diffusers 0.15.0 cross attention class check
* revert diffusers_attention.py
-
Committed by Ramya Ramineni
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
-
Committed by Michael Wyatt
* formatting
* fixing clang-format version
* update pre-commit URL
-
- 21 Apr, 2023 (7 commits)
-
-
Committed by Jeff Rasley
-
Committed by Connor Holmes
Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Connor Holmes
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
-
Committed by Michael Wyatt
* move dist init out of Engine
-
Committed by Olatunji Ruwase
* zero3 checkpoint frozen params
* Remove debug prints
* Move to cpu
* WIP
* WIP
* WIP
* Cleanup
* Cleanup
* Extend unit test for frozen params
* API fix
-
Committed by bobowwb
Line 98 should be curl -O https://bootstrap.pypa.io/pip/3.6/get-pip.py && \ to avoid the following error:
#16 106.9 ERROR: This script does not work on Python 3.6 The minimum supported Python version is 3.7. Please use https://bootstrap.pypa.io/pip/3.6/get-pip.py instead.
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
-
Committed by digger-yu
Optimization Code
GitHub's image caching mechanism caches images. Add a random number to the end of the modified link so that the contributor's image is refreshed in real time every time the link is visited.
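The cache-busting idea in the commit above can be sketched as follows. This is only an illustration under stated assumptions: the function name `add_cache_buster`, the query parameter name `r`, and the example URL are hypothetical and do not come from the commit itself.

```python
import random
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def add_cache_buster(url, param="r", rng=random):
    """Return `url` with a random query parameter appended.

    Hypothetical helper: a changing query string makes the URL look new
    to a cache, so a cached copy of the image is not reused.
    """
    parts = urlsplit(url)
    # Keep any query parameters the link already carries.
    query = parse_qsl(parts.query, keep_blank_values=True)
    query.append((param, str(rng.randint(0, 10**9))))
    return urlunsplit(parts._replace(query=urlencode(query)))


if __name__ == "__main__":
    # example.com is a placeholder, not the link edited in the commit.
    print(add_cache_buster("https://example.com/contributors.svg?a=1"))
```

A random value is the simplest choice; a timestamp would work equally well, since the only requirement is that the URL differs between visits.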
-
- 19 Apr, 2023 (7 commits)
-
-
Committed by Logan Adams
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Logan Adams
-
Committed by Jinzhen Lin
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by digger-yu
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
Committed by Logan Adams
-
Committed by Bhavya Medishetty
* temporary WAR workaround till __double2half support enabled in HIP
* workaround only for hipcc
---------
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
-
Committed by Michael Wyatt
* updated cupy install
* do non-isolated pip install
* Update action.yml
-
- 18 Apr, 2023 (5 commits)
-
-
Committed by digger-yu
Optimization Code
1. Use #!/usr/bin/env bash instead of #!/bin/bash to make the script more portable.
2. Use rm -rf instead of rm -r to remove directories recursively.
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
Committed by Heyang Qin
* Fixes for asymmetric quantization
* additional offset to further improve accuracy
* put the 0.5 into offset rather than applying it later
* update unit test for quantization
* fix format
* attempt to fix format
---------
Co-authored-by: Connor Holmes <connorholmes@microsoft.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
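One possible reading of "put the 0.5 into offset" is that the +0.5 rounding term is folded into the zero-point offset once, instead of being added per element before flooring. The sketch below illustrates that reading in plain Python. It is an assumption about the intent, not the DeepSpeed kernel, and the function names are hypothetical.

```python
import math


def quantize_asymmetric(values, bits=8):
    """Map floats to unsigned integer codes in [0, 2**bits - 1].

    Illustrative sketch only: asymmetric quantization uses the observed
    minimum as the zero point, so the range need not be centred on 0.
    """
    lo, hi = min(values), max(values)
    levels = (1 << bits) - 1
    scale = (hi - lo) / levels if hi > lo else 1.0
    # Fold the rounding term (0.5) into the offset once, so each element
    # needs only one multiply-add followed by a floor.
    offset = 0.5 - lo / scale
    codes = [min(levels, max(0, math.floor(v / scale + offset))) for v in values]
    return codes, scale, lo


def dequantize_asymmetric(codes, scale, lo):
    """Invert quantize_asymmetric up to the rounding error."""
    return [c * scale + lo for c in codes]


if __name__ == "__main__":
    data = [-1.0, -0.3, 0.0, 0.42, 2.0]
    codes, scale, lo = quantize_asymmetric(data)
    print(codes)
    print(dequantize_asymmetric(codes, scale, lo))
```

With this layout the reconstruction error per element stays within about half a quantization step, which is the usual bound for round-to-nearest.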
-
Committed by Ikko Eltociear Ashimine
resutls -> results
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
-
Committed by Jirka Borovec
update link to PL docs talking about DeepSpeed integration
-
Committed by Stas Bekman
-
- 15 Apr, 2023 (4 commits)
-
-
Committed by Dogukan Uraz Tuna
Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
-
Committed by Michael Wyatt
-
Committed by Michael Wyatt
-
Committed by Logan Adams
* Update torch version check in building sparse_attn
* Update triton check
* Fix error message
* Format fixes
* Test with triton 2
* Change requirements back
* fix to just prevent 2.0.0
* Fix formatting
---------
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 14 Apr, 2023 (3 commits)
-
-
Committed by Masahiro Tanaka
* support nesting zero.Init() and dynamically defined module
* throw an error if model class defined in zero.Init is not wrapped
* fix check on new classes that are not wrapped in zero.Init()
* add tests of nesting zero.Init() and dynamically defined classes
* fix tests for zero.Init
* fix style
---------
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
Committed by Michael Wyatt
* update docs to reflect changes in deepspeed-chat training script
* add blogs to ignored changes in unit tests
-
Committed by Michael Wyatt
-
- 13 Apr, 2023 (5 commits)
-
-
Committed by Jeff Rasley
-
Committed by Ma, Guokai
* add fallback path for kernels used in megatron
* temporary numactl WA for SPR 56core
* adapt core allocation according to number of ranks
* add switch to turn on numactl
* detect number of cores on the system
* allow select a subset of the cores on the system to bind
* remove unneeded changes
* use current_env to set OMP_NUM_THREADS in subprocess
* add test for ds_arguments
* change --bind_cores_to_rank option to store_true
* add test for parse_range_list
* add comment for parse range list
* add test for parse range list, rewrite parse_range_list
* fix format error
* fix format
* add -m parameter to numactl when necessary
* Check KMP_AFFINITY to avoid conflict with numactl
* fix format
* negative case for parse_range_list
* detect whether numactl is installed before use numactl to bind cores
* check numactl with package manager of distro
---------
Co-authored-by: sdp <sdp@aia-sdp-spr-108864.jf.intel.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
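The commit above mentions a `parse_range_list` helper for selecting a subset of cores, along with negative test cases. As a rough illustration of what such a parser might do, here is a minimal sketch. It is a hypothetical stand-in, not DeepSpeed's implementation: the name `parse_range_list_sketch`, the accepted syntax (comma-separated integers and `start-end` ranges), and the validation rules are all assumptions.

```python
def parse_range_list_sketch(spec):
    """Expand a string such as "0,2-4,7" into a list of core indices.

    Hypothetical sketch; the accepted syntax is an assumption. Malformed
    input raises ValueError.
    """
    cores = []
    for part in spec.split(","):
        part = part.strip()
        if "-" in part:
            start_text, end_text = part.split("-", 1)
            # int("") raises ValueError, which rejects inputs like "-3".
            start, end = int(start_text), int(end_text)
        else:
            start = end = int(part)
        if start < 0 or end < start:
            raise ValueError(f"invalid range: {part!r}")
        cores.extend(range(start, end + 1))
    if len(set(cores)) != len(cores):
        raise ValueError(f"overlapping entries in: {spec!r}")
    return cores


if __name__ == "__main__":
    print(parse_range_list_sketch("0,2-4,7"))
```

Rejecting reversed or overlapping ranges up front keeps a mistyped core list from silently binding two ranks to the same core.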
-
Committed by Logan Adams
* Update AMD workflows
* Update MI200 test flow to use torch latest
* Update tolerances to values that pass (will fix before completing PR)
* Revert changes to atol
* Rename workflows
* Fix CI badges
-
Committed by Conglong Li
-
Committed by Alexander van Eck
* feat: Add support for `NamedTuple` when sharding parameters [#3029]
* Formatting
---------
Co-authored-by: Alexander van Eck <alexander.vaneck@paige.ai>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 12 Apr, 2023 (6 commits)
-
-
Committed by Conglong Li
-
Committed by Masahiro Tanaka
Co-authored-by: Conglong Li <conglong.li@gmail.com>
-
Committed by Zhewei Yao
Co-authored-by: Ammar Ahmad Awan <ammar.awan@microsoft.com>
-
Committed by dawei-wang
Fix microsoft/DeepSpeed#3163
-
Committed by Zaida Zhou
-
Committed by Masahiro Tanaka
Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
-