- 18 Jan, 2023 4 commits
-
-
Committed by Jeff Rasley
-
Committed by Olatunji Ruwase
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 14 Jan, 2023 4 commits
-
-
Committed by Joe Mayer
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
Committed by Jeff Rasley
-
Committed by Michael Wyatt
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Stas Bekman
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 13 Jan, 2023 1 commit
-
-
Committed by LOK CHAND KOPPAKA
* Extend quantization utils features
* remove unwanted files
* fix cache setting
Co-authored-by: Connor Holmes <connorholmes@microsoft.com>
-
- 12 Jan, 2023 2 commits
-
-
Committed by LOK CHAND KOPPAKA
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Reza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
-
Committed by Masahiro Tanaka
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 11 Jan, 2023 2 commits
-
-
Committed by cassieesvelt
* add logging changes
* try w/out abspath
* undo last change
* start mlflow debug
* remove mlflow from export_envs
* add mlflow logging for reversed
* remove mlflow.start_run
* add back start run
* don't clean cmd
* print os environment variables
* remove first start run
* add run_id to mlflow star
* remove context managers
* move last end run
* add extra parent start_runs
* add run id logging
* add logging to run_ds_config
* change run_id to run_name
* add back context managers and run_id logs
* remove context mng
* debug environment variable
* reset environment variables
* add env variable deletion
* clean up
* remove unused import
* fix yapf/whitespace errors
Co-authored-by: Cheng Li <pistasable@gmail.com>
-
Committed by JackieWu
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 10 Jan, 2023 2 commits
-
-
Committed by Ma, Guokai
-
Committed by Jeff Rasley
-
- 09 Jan, 2023 4 commits
-
-
Committed by Xiaoxia (Shirley) Wu
double check the unit tests
-
Committed by Stas Bekman
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
Committed by JackieWu
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
Committed by li-yi-dong
* Remove unnecessary device synchronization for stage 2
* Remove unnecessary device synchronization for stage 2
Co-authored-by: liyidong.lyd <liyidong.lyd@alibaba-inc.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Joe Mayer <114769929+jomayeri@users.noreply.github.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 07 Jan, 2023 2 commits
-
-
Committed by Ma, Guokai
* Abstract accelerator (step 2)
* more flex op_builder path for both installation and runtime
* add SpatialInferenceBuilder into cuda_accelerator.py
* use reflection to make cuda_accelerator adapt to CUDA op builder change automatically
* clean up deepspeed/__init__.py
* add comments in cuda_accelerator for no torch path
* Update deepspeed/env_report.py Change env_report.py according to suggestion Co-authored-by: Michael Wyatt <mrwyattii@gmail.com>
* reduce the range of try...except for better code clarity
* Add porting for deepspeed/ops/random_ltd/dropping_utils.py
* move accelerator to top directory and create symlink under deepspeed
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Michael Wyatt <mrwyattii@gmail.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Reza Yazdani
* fix Opt injection & add injection verification check at inference test
* fix several issues
* remove fixture
* remove check_injection when no kernel is injected
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 04 Jan, 2023 2 commits
-
-
Committed by Jeff Rasley
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
Committed by Stas Bekman
* [doc] fix `min_loss_scale` default
* align
-
- 29 Dec, 2022 1 commit
-
-
Committed by Jeff Rasley
-
- 23 Dec, 2022 4 commits
-
-
Committed by Guanhua Wang
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley
-
Committed by Samyam Rajbhandari
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 22 Dec, 2022 1 commit
-
-
Committed by Ikko Ashimine
-
- 21 Dec, 2022 2 commits
-
-
Committed by mzl
-
Committed by Michael Wyatt
add reusable workflow that sets up fresh venv for each test and prints relevant environment info
-
- 20 Dec, 2022 2 commits
-
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley
-
- 18 Dec, 2022 1 commit
-
-
Committed by Michael Wyatt
* added megatron unit test
* Update nv-megatron.yml
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 17 Dec, 2022 4 commits
-
-
Committed by Connor Holmes
* Update cpuinfo AVX512 detection
* Missing conversion from `_mm256` to `_mm256i`
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
Committed by Alexander Jipa
taking gradient accumulation steps into account for throughput calculation
Co-authored-by: Alexander Jipa <azzhipa@amazon.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
Committed by Lev Kurilenko
This PR removes the zero-inference GatheredParameters context from replace_with_policy, since zero-inference is no longer needed after the introduction of meta tensor support for BLOOM.
-
Committed by 郭叶军
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 16 Dec, 2022 2 commits
-
-
Committed by 郭叶军
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
Committed by Jithun Nair
-