- 08 Sep, 2022 1 commit
-
-
Committed by Michael Wyatt
-
- 07 Sep, 2022 1 commit
-
-
Committed by Molly Smith
-
- 05 Sep, 2022 2 commits
-
-
Committed by Olatunji Ruwase
-
Committed by Stas Bekman
when loading the non-sharded checkpoint update the progress bar (fix by @RezaYazdaniAminabadi) - I've just tested it to work. Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 04 Sep, 2022 1 commit
-
-
Committed by Olatunji Ruwase
* Refactor universal checkpointing and tensor fragments * Formatting
-
- 03 Sep, 2022 1 commit
-
-
Committed by Reza Yazdani
Co-authored-by: Arash Bakhtiari <arash@bakhtiari.org> Co-authored-by: Arash Bakhtiari <arashb@users.noreply.github.com>
-
- 02 Sep, 2022 1 commit
-
-
Committed by Connor Holmes
Co-authored-by: Ammar Ahmad Awan <ammar.awan@microsoft.com>
-
- 01 Sep, 2022 2 commits
-
-
Committed by Connor Holmes
-
Committed by Ammar Ahmad Awan
Co-authored-by: cmikeh2 <connorholmes@microsoft.com>
-
- 31 Aug, 2022 2 commits
-
-
Committed by Reza Yazdani
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Jeff Rasley
-
- 30 Aug, 2022 3 commits
-
-
Committed by trajep
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com> Co-authored-by: chenguo <chenguo@microsoft.com>
-
Committed by Reza Yazdani
-
Committed by Mikhail Druzhinin
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 27 Aug, 2022 2 commits
-
-
Committed by Molly Smith
Fix RuntimeError: Boolean value of Tensor with more than one value is ambiguous when running test-gptj.py
-
Committed by Michael Wyatt
Add blob storage to CI runners and enable for transformers cache on inference tests
-
- 26 Aug, 2022 2 commits
-
-
Committed by Jeff Rasley
-
Committed by 江户川闰土
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com> Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 25 Aug, 2022 2 commits
-
-
Committed by Connor Holmes
-
Committed by Olatunji Ruwase
* Correctly detect CPU optimizer usage * Update nv-transformers-v100.yml (#2259) Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 24 Aug, 2022 5 commits
-
-
Committed by Siddharth Singh
MoE training with zero stage 1 only works with `contiguous gradients=True`.
-
Committed by Jeff Rasley
-
Committed by Reza Yazdani
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Arash Bakhtiari
-
Committed by Olatunji Ruwase
Refactor distributed tests: checkpointing Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
-
- 23 Aug, 2022 1 commit
-
-
Committed by Jeff Rasley
-
- 22 Aug, 2022 1 commit
-
-
Committed by Olatunji Ruwase
* Correctly detect offload configuration * Correctly detect offload configuration * Handle deprecated cpu offload setting * Correcly detect zero_offload setting * Minor tweak Co-authored-by: Jeff Rasley <jerasley@microsoft.com> Co-authored-by: Ammar Ahmad Awan <ammar.awan@microsoft.com>
-
- 20 Aug, 2022 1 commit
-
-
Committed by Jeff Rasley
-
- 18 Aug, 2022 2 commits
-
-
Committed by Reza Yazdani
* Fix the tensor-slicing copy for qkv parameters * remove the random-generator from context during inference * formatting Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Conglong Li
-
- 17 Aug, 2022 1 commit
-
-
Committed by Jeff Rasley
-
- 16 Aug, 2022 1 commit
-
-
Committed by Zhihong Chen
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 15 Aug, 2022 1 commit
-
-
Committed by Arash Bakhtiari
* add opt replace policy * simplify inf. api * fix opt replace policy * fix use-cash & add relu * Add support of custom MLP act. function * Revert "simplify inf. api" This reverts commit 9e910fcbd5471dec9b3c92008426f5ba590bf0b6. * fix the inference API (temp. solution) * fix code formatting * add unit tests for OPT models. * refactor pre-attention layer norm configuration * add support of opt-350m model * refactor the HF model config initialization * fix hf model config issue Co-authored-by: Reza Yazdani <reyazda@microsoft.com> Co-authored-by: Jeff Rasley <jerasley@microsoft.com> Co-authored-by: Reza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
-
- 13 Aug, 2022 2 commits
-
-
Committed by Ammar Ahmad Awan
* print warning only once. * add support for torch param and only warn on gpu 0 * remove type checking. will be done on a new PR with more tests. Co-authored-by: Jeff Rasley <jerasley@microsoft.com> Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
Committed by Jeff Rasley
-
- 12 Aug, 2022 2 commits
-
-
Committed by Jeff Rasley
* add cuda 11.7 * formatting
-
Committed by Olatunji Ruwase
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 11 Aug, 2022 3 commits
-
-
Committed by Kamal Raj
Co-authored-by: Conglong Li <conglong.li@gmail.com> Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
Committed by Michael Wyatt
Refactor Distributed unit tests
-
Committed by Reza Yazdani
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-