- 13 Sep, 2022 4 commits
-
-
Committed by Michael Wyatt
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Jeff Rasley
-
Committed by Arash Bakhtiari
-
Committed by Arash Bakhtiari
-
- 10 Sep, 2022 3 commits
-
-
Committed by Michael Wyatt
Co-authored-by: Reza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com> Co-authored-by: Jeff Rasley <jerasley@microsoft.com> Co-authored-by: Sam Ade Jacobs <samjacobs@microsoft.com>
-
Committed by Molly Smith
-
Committed by Michael Wyatt
* added unit test * Update pt_binding.cpp * formatting * Update test_bias_add.py
-
- 08 Sep, 2022 2 commits
-
-
Committed by Quentin Anthony
Co-authored-by: anthony.301 <anthony.301@mri.cluster> Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Michael Wyatt
-
- 07 Sep, 2022 1 commit
-
-
Committed by Molly Smith
-
- 05 Sep, 2022 2 commits
-
-
Committed by Olatunji Ruwase
-
Committed by Stas Bekman
When loading a non-sharded checkpoint, update the progress bar (fix by @RezaYazdaniAminabadi). I've just tested that it works. Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 04 Sep, 2022 1 commit
-
-
Committed by Olatunji Ruwase
* Refactor universal checkpointing and tensor fragments * Formatting
-
- 03 Sep, 2022 1 commit
-
-
Committed by Reza Yazdani
Co-authored-by: Arash Bakhtiari <arash@bakhtiari.org> Co-authored-by: Arash Bakhtiari <arashb@users.noreply.github.com>
-
- 02 Sep, 2022 1 commit
-
-
Committed by Connor Holmes
Co-authored-by: Ammar Ahmad Awan <ammar.awan@microsoft.com>
-
- 01 Sep, 2022 2 commits
-
-
Committed by Connor Holmes
-
Committed by Ammar Ahmad Awan
Co-authored-by: cmikeh2 <connorholmes@microsoft.com>
-
- 31 Aug, 2022 2 commits
-
-
Committed by Reza Yazdani
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Jeff Rasley
-
- 30 Aug, 2022 3 commits
-
-
Committed by trajep
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com> Co-authored-by: chenguo <chenguo@microsoft.com>
-
Committed by Reza Yazdani
-
Committed by Mikhail Druzhinin
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 27 Aug, 2022 2 commits
-
-
Committed by Molly Smith
Fix RuntimeError: Boolean value of Tensor with more than one value is ambiguous when running test-gptj.py
-
Committed by Michael Wyatt
Add blob storage to CI runners and enable for transformers cache on inference tests
-
- 26 Aug, 2022 2 commits
-
-
Committed by Jeff Rasley
-
Committed by 江户川闰土
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com> Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 25 Aug, 2022 2 commits
-
-
Committed by Connor Holmes
-
Committed by Olatunji Ruwase
* Correctly detect CPU optimizer usage * Update nv-transformers-v100.yml (#2259) Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 24 Aug, 2022 5 commits
-
-
Committed by Siddharth Singh
MoE training with ZeRO stage 1 only works with `contiguous_gradients=True`.
-
Committed by Jeff Rasley
-
Committed by Reza Yazdani
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Arash Bakhtiari
-
Committed by Olatunji Ruwase
Refactor distributed tests: checkpointing Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
-
- 23 Aug, 2022 1 commit
-
-
Committed by Jeff Rasley
-
- 22 Aug, 2022 1 commit
-
-
Committed by Olatunji Ruwase
* Correctly detect offload configuration * Correctly detect offload configuration * Handle deprecated cpu offload setting * Correctly detect zero_offload setting * Minor tweak Co-authored-by: Jeff Rasley <jerasley@microsoft.com> Co-authored-by: Ammar Ahmad Awan <ammar.awan@microsoft.com>
-
- 20 Aug, 2022 1 commit
-
-
Committed by Jeff Rasley
-
- 18 Aug, 2022 2 commits
-
-
Committed by Reza Yazdani
* Fix the tensor-slicing copy for qkv parameters * remove the random-generator from context during inference * formatting Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Committed by Conglong Li
-
- 17 Aug, 2022 1 commit
-
-
Committed by Jeff Rasley
-
- 16 Aug, 2022 1 commit
-
-
Committed by Zhihong Chen
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-