What's Changed
- Fix for distributed tests on pytorch>=1.12 by @mrwyattii in https://github.com/microsoft/DeepSpeed/pull/2141
- delay torch import for inference compatibility check by @jeffra in https://github.com/microsoft/DeepSpeed/pull/2167
- Fix wrong unit of latency in flops-profiler (#2090) by @zionwu in https://github.com/microsoft/DeepSpeed/pull/2095
- [docs] adoption updates by @jeffra in https://github.com/microsoft/DeepSpeed/pull/2173
- Update for AMD CI workflow by @mrwyattii in https://github.com/microsoft/DeepSpeed/pull/2172
- [docs] update offload docs to include stage 1 by @jeffra in https://github.com/microsoft/DeepSpeed/pull/2178
- Fixing model partitioning without injection by @RezaYazdaniAminabadi in https://github.com/microsoft/DeepSpeed/pull/2179
- Match compute and reduce dtype by @tjruwase in https://github.com/microsoft/DeepSpeed/pull/2145
- Enable fused_lamb_cuda_kernel on ROCm by @rraminen in https://github.com/microsoft/DeepSpeed/pull/2148
- Update README to latest Composer version by @hanlint in https://github.com/microsoft/DeepSpeed/pull/2177
- [deepspeed/autotuner] Missing hjson import by @rahilbathwal5 in https://github.com/microsoft/DeepSpeed/pull/2175
- [docs] add more models to adoption by @jeffra in https://github.com/microsoft/DeepSpeed/pull/2189
- [CI] fix lightning tests by @jeffra in https://github.com/microsoft/DeepSpeed/pull/2190
- Fix typos on README.md by @gasparitiago in https://github.com/microsoft/DeepSpeed/pull/2192
- Fix the layer-past for GPT based models by @RezaYazdaniAminabadi in https://github.com/microsoft/DeepSpeed/pull/2196
- Add gradient_average flag support for sparse grads by @Dipet in https://github.com/microsoft/DeepSpeed/pull/2188
- Adding the compression tutorial on GPT distillation and quantization by @minjiaz in https://github.com/microsoft/DeepSpeed/pull/2197
- Log user config exactly by @tjruwase in https://github.com/microsoft/DeepSpeed/pull/2201
- Fix the tensor-slicing copy for qkv parameters by @RezaYazdaniAminabadi in https://github.com/microsoft/DeepSpeed/pull/2198
- Refactor Distributed Tests by @mrwyattii in https://github.com/microsoft/DeepSpeed/pull/2180
- fix table syntax by @kamalkraj in https://github.com/microsoft/DeepSpeed/pull/2204
- Correctly detect offload configuration by @tjruwase in https://github.com/microsoft/DeepSpeed/pull/2208
- add cuda 11.7 by @jeffra in https://github.com/microsoft/DeepSpeed/pull/2211
- use torch 1.9 in accelerate tests by @jeffra in https://github.com/microsoft/DeepSpeed/pull/2215
- [zero-3] print warning once and support torch parameter by @awan-10 in https://github.com/microsoft/DeepSpeed/pull/2127
- Add support of OPT models by @arashb in https://github.com/microsoft/DeepSpeed/pull/2205
- fix typos in readme. by @zhjohnchan in https://github.com/microsoft/DeepSpeed/pull/2218
- Fix regression w. dist_init_required by @jeffra in https://github.com/microsoft/DeepSpeed/pull/2225
- add doc for new bert example by @conglongli in https://github.com/microsoft/DeepSpeed/pull/2224
- Remove the random-generator from context during inference by @RezaYazdaniAminabadi in https://github.com/microsoft/DeepSpeed/pull/2228
- allow saving ckpt w/o ckpt json + bloom copy fix by @jeffra in https://github.com/microsoft/DeepSpeed/pull/2237
- Correctly detect zero_offload by @tjruwase in https://github.com/microsoft/DeepSpeed/pull/2213
- [docs] update community videos by @jeffra in https://github.com/microsoft/DeepSpeed/pull/2249
- Refactor dist tests: Checkpointing by @tjruwase in https://github.com/microsoft/DeepSpeed/pull/2202
- Make OPT policy backward compatible with pre-OPT transformers versions by @arashb in https://github.com/microsoft/DeepSpeed/pull/2254
- fix ds-inference without policy by @RezaYazdaniAminabadi in https://github.com/microsoft/DeepSpeed/pull/2247
New Contributors
- @zionwu made their first contribution in https://github.com/microsoft/DeepSpeed/pull/2095
- @hanlint made their first contribution in https://github.com/microsoft/DeepSpeed/pull/2177
- @rahilbathwal5 made their first contribution in https://github.com/microsoft/DeepSpeed/pull/2175
- @gasparitiago made their first contribution in https://github.com/microsoft/DeepSpeed/pull/2192
- @arashb made their first contribution in https://github.com/microsoft/DeepSpeed/pull/2205
- @zhjohnchan made their first contribution in https://github.com/microsoft/DeepSpeed/pull/2218
Full Changelog: https://github.com/microsoft/DeepSpeed/compare/v0.7.0...v0.7.1