提交 · v0.7.6 · Greenplum / DeepSpeed

01 12月, 2022 1 次提交
- A
  Fix invalid check of recorded parameter orders in zero stage3. (#2550) · aeda7f9f
  由 AGUL 提交于 12月 01, 2022
```
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
```
  aeda7f9f
30 11月, 2022 3 次提交

Abstract accelerator (step 1) (#2504) · ffcf3846

由 Ma, Guokai 提交于 11月 30, 2022

* Establish building block of abstract accelerator

* Change .*Tensor variable to @property

* [op builder] add op builder reflection to allow enumerate of builders in all_ops.py and builder_names.py

* change @abstractproperty to @property @abstractmethod
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>

ffcf3846

M

add missing moe deprecated fields to inference config (#2556) · c5f85858
由 Michael Wyatt 提交于 11月 29, 2022

c5f85858

encoded ds config into command line argument when launching child processes in autotuning (#2524) · abe4fc6b

由 Cheng Li 提交于 11月 29, 2022

* rollback ds config changes

* fix format

* Fix error when output_file is a relative path without a prefix (#2397)
Co-authored-by: NBenjamin Steenhoek <benjaminjsteenhoek@gmail.com>

* fix restuls and exprs path to use absolute path

* use base64 encoded ds config as cmd arg

* fix format

* remove assert

* write out optimial config after tuning

* fix format

* no need to update ds config path when encoding ds config

* udpate

* do not use abs path for result and expr dir

* fix conflicts

* fix run mode

* fix format

* fix format
Co-authored-by: NBenjamin Steenhoek <benjaminjsteenhoek@gmail.com>
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>

abe4fc6b

29 11月, 2022 1 次提交
- S
  Report progress at gradient accumulation boundary (#2553) · 340fc0cf
  由 ShijieZZZZ 提交于 11月 28, 2022
```
* report progress at gradient accumulation boundary

* format

* format
```
  340fc0cf
28 11月, 2022 1 次提交

Adding Gradient Accumulation Data Type Config (#2512) · 21c28029

由 Joe Mayer 提交于 11月 27, 2022

* Adding gradient accumulation dtype config.

* Switching to new DtypeEnum

* Adding standalone check function, and unit tests

* Variable disambiguation

* Adding checks for unsupported states.

* Updating for PR comments.

* Reorganizing unit test.
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>

21c28029

24 11月, 2022 2 次提交

Pass down the new DS inference config to replace_transformer_layer. (#2539) · 90ae6884

由 Ammar Ahmad Awan 提交于 11月 23, 2022

* pass down the new DS inference config to replace_transformer_layer.

* remove quantize_settings and rename the ep_mp_group.

* Fix model_config passing. Fixes gptj issue with wrong output.

* fix small bug in gpt-neo.

Co-authored-by: Reza Yazdani and Michael Wyatt

90ae6884

Change Where DS/Triton is Used in Stable Diffusion (#2536) · 5df1eea7

由 Connor Holmes 提交于 11月 23, 2022

* Change utilization of DS/Triton kernels

* add config at Clip-encoder
Co-authored-by: NReza Yazdani <reyazda@microsoft.com>
Co-authored-by: NReza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com>

5df1eea7

23 11月, 2022 4 次提交
- A
  Remove mutable default parameter in init_inference() (#2540) · 4abf637f
  由 Alex Hedges 提交于 11月 22, 2022
```
A mutable default value is dangerous because editing it will change the
value for all future calls to the function. The value is itself edited
later in the function, so this problem will likely be encountered sooner
or later.
Co-authored-by: NMichael Wyatt <michaelwyatt@microsoft.com>
```
  4abf637f
- M
  Add MII tests (#2533) · c5ee27f7
  由 Michael Wyatt 提交于 11月 22, 2022
```
Adding MII tests to ensure changes to DS-Inference do not break MII
```
  c5ee27f7
- M
  
  Make DS-Inference config readable from JSON (#2537) · 8b4318b9
  由 Michael Wyatt 提交于 11月 22, 2022
  
  8b4318b9
- C
  
  Ensure is initialized for SD (#2534) · 57e0a550
  由 Connor Holmes 提交于 11月 22, 2022
  
  57e0a550
22 11月, 2022 1 次提交
- J
  Fixes for torch 1.14 due to new torch.numel return type (#2522) · 21105521
  由 Jeff Rasley 提交于 11月 21, 2022
```
* fixes for new torch.numel return type

* address comment
```
  21105521
19 11月, 2022 2 次提交
- C
  
  Initial dequant library implementation (#2521) · 30c8d8a8
  由 Connor Holmes 提交于 11月 18, 2022
  
  30c8d8a8
- J
  
  Update codeowners (#2525) · 0b265326
  由 Jeff Rasley 提交于 11月 18, 2022
  
  0b265326
18 11月, 2022 4 次提交

J

Add note about nvcc/hipcc requirement (#2519) · 7ce371b1
由 Jeff Rasley 提交于 11月 17, 2022

7ce371b1
M

Add missing Inference sub-configs (#2518) · fe678544
由 Michael Wyatt 提交于 11月 17, 2022

fe678544

Fix backward compatibility for InferenceConfig (#2516) · e59f8054

由 Michael Wyatt 提交于 11月 17, 2022

* Make new InferenceConfig backwards compatible with previous init_inference API
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>

e59f8054

Deepspeed quantization library v0.1 (#2450) · 78d4ca1f

由 lokoppakmsft 提交于 11月 17, 2022

* Initial commit Deepspeed quantization library

* Match function signatures

* Add Quantization Kernel

* adding offset comparision and precommit changes

* format fixes

* FIle name changes

* pt_binding_changes

* test name change

* Integer quantization, minor refactors

* Add directed test_case

* format fixes

* Move param calculation to constructor of params class

* Use local function and add elemsPerBlock

* change function to be specalized

* sub block reduce

* add new schedule

* Add new schedule test case

* fix illegal writes in sch1

* Style fixes in comments
Co-authored-by: NConnor Holmes <connorholmes@microsoft.com>

78d4ca1f

16 11月, 2022 2 次提交

Add max_tokens alias to max_out_tokens arg to maintain backwards compatibility (#2508) · d40a15fc

由 Lev Kurilenko 提交于 11月 15, 2022

This PR adds a max_tokens alias to the max_out_tokens argument in the init_inference API to support backwards compatibility after the config refactor PR https://github.com/microsoft/DeepSpeed/pull/2472.

Thanks @molly-smith and @mrwyattii.

d40a15fc

M
Update docs to autogenerate pydantic config model docs (#2509) · 43bf035c
由 Michael Wyatt 提交于 11月 15, 2022
```
* update zero config docs
* add autogenerated docs for pydantic models used in ZeRO and Inference configs
```
43bf035c

15 11月, 2022 2 次提交

DeepSpeed inference config. (#2459) (#2472) · b5d18a6a

由 Ammar Ahmad Awan 提交于 11月 14, 2022

Changes to inference API to use accept a config dict and cleaning up Inference Engine to utilize the newly added inference config.
Co-authored-by: NMichael Wyatt <michaelwyatt@microsoft.com>

b5d18a6a

J

bump to 0.7.6 · a4ceabb6
由 Jeff Rasley 提交于 11月 14, 2022

a4ceabb6

14 11月, 2022 1 次提交
- I
  
  Fix typos: deepseed -> deepspeed (#2499) · 06e00f61
  由 iLeGend 提交于 11月 14, 2022
  
  06e00f61
12 11月, 2022 1 次提交
- L
  Make data contiguous before the inplace reshape-copy_ function (#2489) · f2710bbe
  由 lokoppakmsft 提交于 11月 11, 2022
```
Co-authored-by: NMichael Wyatt <michaelwyatt@microsoft.com>
```
  f2710bbe
11 11月, 2022 2 次提交
- M
  Fix nightly CI tests (#2493) · be5ec506
  由 Michael Wyatt 提交于 11月 10, 2022
```
* fix for lm-eval nightly tests and add gpt-j to MPtest because OOM on single GPU

* add nv-nightly badge
```
  be5ec506
- O
  
  Make bf16_optimizer work for non pipeline (#2470) · ee39187d
  由 Olatunji Ruwase 提交于 11月 10, 2022
  
  ee39187d
10 11月, 2022 5 次提交
- stage_1_and_2.py: no allreduce needed when mp size is 1 (#2494) · 3ca9878d
  由郭叶军提交于 11月 10, 2022
  
  3ca9878d
- C
  Stable Diffusion Enhancements (#2491) · e7e75955
  由 Connor Holmes 提交于 11月 09, 2022
```
Co-authored-by: Ncmikeh2 <connorholmes@microsoft.com>
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
Co-authored-by: NReza Yazdani <reyazda@microsoft.com>
```
  e7e75955
- K
  Add `scale_attn_by_inverse_layer_idx` feature (#2486) · 6f77da1b
  由 Kevin Ko 提交于 11月 10, 2022
```
* Add scale_attn_by_inverse_layer_idx feature

* Fix layer_id bug

* Fix scaling value
Co-authored-by: NConnor Holmes <connorholmes@microsoft.com>
Co-authored-by: NReza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
```
  6f77da1b
- J
  
  [docs] add SD tutorial to deepspeed.ai news · d2d1b4c3
  由 Jeff Rasley 提交于 11月 09, 2022
  
  d2d1b4c3
- J
  
  [docs] add SD tutorial to news · a63cb07e
  由 Jeff Rasley 提交于 11月 09, 2022
  
  a63cb07e
09 11月, 2022 1 次提交

Fix CI issues related to cupy install (#2483) · 521d329b

由 Michael Wyatt 提交于 11月 08, 2022

* remove any cupy install when setting up environments

* revert previous changes to run on cu111 runners

* fix for when no cupy is installed

* remove cupy uninstall for workflows not using latest torch version

* update to cu116 for inference tests

* fix pip uninstall line

* move python environment list to after DS install

* remove cupy uninstall

* re-add --forked

* fix how we get cupy version (should be based on nvcc version)

521d329b

08 11月, 2022 2 次提交
- R
  Add correct memory-allocation at DeepSpeed-Attention (#2474) · 9cfcf743
  由 Reza Yazdani 提交于 11月 07, 2022
```
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
Co-authored-by: NConnor Holmes <connorholmes@microsoft.com>
```
  9cfcf743
- K
  
  fix accelerate link (#2481) · a47c3e03
  由 kyoto7250 提交于 11月 08, 2022
  
  a47c3e03
05 11月, 2022 2 次提交
- S
  Added MLFLOW environment variables for logging metrics within trainig… (#2477) · ffb6d987
  由 savitamittal1 提交于 11月 04, 2022
```
* Added MLFLOW environment variables for logging metrics within trainign script

* exporting MLFlow env variables from AML env
Co-authored-by: NCheng Li <pistasable@gmail.com>
```
  ffb6d987
- J
  Updating autotune json default in docs. (#2476) · 4a06ecf6
  由 Joe Mayer 提交于 11月 04, 2022
```
* Updating autotune default in docs.

* Running pre-commit.
```
  4a06ecf6
04 11月, 2022 2 次提交
- don't gather partitioned activations for mp size 1 (#2454) · f74ee318
  由郭叶军提交于 11月 04, 2022
```
* don't gather partitioned activations for mp size 1

* add inline comment for the change
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
```
  f74ee318
- A
  
  Create a new folder structure to isolate model-specific code in DS (#2464) · 35458da0
  由 Ammar Ahmad Awan 提交于 11月 03, 2022
  
  35458da0
03 11月, 2022 1 次提交
- R
  fixing the checkpoint loading at inference-engine (#2429) · 39bdc141
  由 Reza Yazdani 提交于 11月 02, 2022
```
Co-authored-by: NAmmar Ahmad Awan <ammar.awan@microsoft.com>
```
  39bdc141

Greenplum / DeepSpeed 上一次同步 11 个月

Greenplum / DeepSpeed
上一次同步 11 个月