- 19 Mar, 2022 (1 commit)
Committed by Michael Wyatt
* added concurrency to github actions
* fixed problem where one workflow can cancel another workflow
* added tmp file to help test concurrency policy on CI
* removed tmp file to finish testing concurrency policy
-
- 15 Mar, 2022 (1 commit)
Committed by Michael Wyatt
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 11 Mar, 2022 (1 commit)
Committed by Yucheng Lu
Co-authored-by: Conglong Li <conglong.li@gmail.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 08 Mar, 2022 (2 commits)
Committed by Jeff Rasley
-
Committed by Jithun Nair
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 03 Mar, 2022 (1 commit)
Committed by Jeff Rasley
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Jithun Nair <jithun.nair@amd.com>
Co-authored-by: rraminen <rraminen@amd.com>
Co-authored-by: Jeff Daily <jeff.daily@amd.com>
Co-authored-by: okakarpa <okakarpa@amd.com>
Co-authored-by: Jithun Nair <37884920+jithunnair-amd@users.noreply.github.com>
Co-authored-by: Ramya Ramineni <62723901+rraminen@users.noreply.github.com>
-
- 28 Jan, 2022 (1 commit)
Committed by Jeff Rasley
-
- 26 Jan, 2022 (1 commit)
Committed by Sean Naren
* Add a very simple PyTorch Lightning test
* Run for just one epoch
* Swap to using the plugin API till the strategy API makes it to 1.6
-
- 15 Dec, 2021 (1 commit)
Committed by Jeff Rasley
-
- 14 Dec, 2021 (1 commit)
Committed by Jeff Rasley
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 30 Nov, 2021 (1 commit)
Committed by Jeff Rasley
force set lf instead of crlf (https://github.com/pre-commit/pre-commit-hooks#mixed-line-ending) (#1598)
-
- 19 Nov, 2021 (1 commit)
Committed by Stas Bekman
-
- 17 Nov, 2021 (1 commit)
Committed by Jeff Rasley
-
- 13 Nov, 2021 (1 commit)
Committed by Cheng Li
* [squash] Staging autotuning v4
Co-authored-by: Cheng Li <pistasable@gmail.com>
Co-authored-by: Minjia Zhang <minjiaz@microsoft.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
* add new extra, guard xgboost, cleanup dead files (#268)
* Fix autotuning docs (#1553)
* fix docs
* rewording the goal
* fix typos
* fix typos (#1556)
* fix typos
* fix format
* fix bug (#1557)
* fix bug
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Minjia Zhang <minjiaz@microsoft.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 12 Nov, 2021 (1 commit)
Committed by Jeff Rasley
-
- 30 Oct, 2021 (1 commit)
Committed by Olatunji Ruwase
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 05 Oct, 2021 (1 commit)
Committed by Stas Bekman
HF tests `image-classification` requirements have been fixed to not require pt-1.9
-
- 01 Oct, 2021 (1 commit)
Committed by Jeff Rasley
-
- 30 Sep, 2021 (1 commit)
Committed by Stas Bekman
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 28 Sep, 2021 (1 commit)
Committed by Jeff Rasley
* install HF w. dev extra to get all required packages
* switch ds.init to use param dict instead of json file on disk
* switch back to 'testing' extra
-
- 25 Sep, 2021 (1 commit)
Committed by Ammar Ahmad Awan
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
-
- 21 Sep, 2021 (1 commit)
Committed by Jeff Rasley
Co-authored-by: Reza Yazdani <reyazda@microsoft.com>
-
- 10 Sep, 2021 (1 commit)
Committed by Jeff Rasley
-
- 09 Sep, 2021 (1 commit)
Committed by Jeff Rasley
-
- 03 Sep, 2021 (1 commit)
Committed by Jeff Rasley
-
- 02 Sep, 2021 (3 commits)
Committed by Jeff Rasley
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley
-
- 20 Aug, 2021 (1 commit)
Committed by Jeff Rasley
-
- 24 May, 2021 (1 commit)
Committed by Reza Yazdani
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: eltonzheng <eltonz@microsoft.com>
Co-authored-by: Reza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Reza Yazdani <reyazda@microsoft.com>
Co-authored-by: Elton Zheng <eltonz@microsoft.com>
Co-authored-by: Arash Ashari <arashari@microsoft.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: niumanar <60243342+niumanar@users.noreply.github.com>
-
- 09 Mar, 2021 (1 commit)
Committed by Samyam Rajbhandari
* Squash stage3 v1 (#146)
Co-authored-by: Samyam <samyamr@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Shaden Smith <ShadenTSmith@gmail.com>
Co-authored-by: eltonzheng <eltonz@microsoft.com>
* Fix correctness bug (#147)
* formatting fix (#150)
* stage3 bugfix (API) update and simplified FP16 Z3 tests (#151)
* fp16 Z3 API update and bugfix
* revert debug change
* ZeRO-3 detach and race condition bugfixes (#149)
* trying out ZeRO-3 race condition fix
* CUDA sync instead of stream
* reduction stream sync
* remove commented code
* Fix optimizer state_dict KeyError (#148)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
* fix for smaller SGS sizes, ensures each grad is backed by unique tensors (#152)
* Simplifying the logic for getting averaged gradients (#153)
* skip for now
* Z3 Docs redux (#154)
* removing some TODOs and commented code (#155)
* New Z3 defaults (#156)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
* formatting
* megatron external params
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Shaden Smith <ShadenTSmith@gmail.com>
Co-authored-by: eltonzheng <eltonz@microsoft.com>
-
- 28 Jan, 2021 (3 commits)
Committed by Jeff Rasley
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley
-
- 23 Dec, 2020 (1 commit)
Committed by Jeff Rasley
Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
-
- 12 Dec, 2020 (1 commit)
Committed by Jeff Rasley
-
- 25 Nov, 2020 (3 commits)
Committed by Jeff Rasley
-
Committed by Jeff Rasley
-
Committed by Jeff Rasley