- 27 Nov, 2021 1 commit
  - Committed by Mikhail Druzhinin
    * fp16 allreduce
    * Undo sparse sum in nan check
    * communication_data_type instead of fp32_allreduce and fp16_allreduce
    * sparse_allreduce with fp32 or fp16 data type
    * Fix communication_data_type checks
    * Allow only torch data types for communication_data_type
    * Fix Zero assert messages
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
- 26 Nov, 2021 1 commit
  - Committed by Jeff Rasley
    * allow external gas
    * init state
    * add docstring
    * add missing engine.step
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
- 25 Nov, 2021 1 commit
  - Committed by eltonzheng
    * fix partition activations issue when mp=2 and pp=2
    * change util function input and fix pre-commit errors
    * move print_backward_tensors() to debug.py
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
- 24 Nov, 2021 1 commit
  - Committed by Mikhail Druzhinin
- 23 Nov, 2021 5 commits
  - Committed by Wenhao Hu
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  - Committed by alexandremuzio
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  - Committed by Chunyang Wen
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  - Committed by Manuel R. Ciosici
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  - Committed by Stas Bekman
- 20 Nov, 2021 2 commits
  - Committed by Jeff Rasley
  - Committed by Jeff Rasley
- 19 Nov, 2021 2 commits
  - Committed by Jeff Rasley
  - Committed by Stas Bekman
- 18 Nov, 2021 5 commits
  - Committed by Jeff Rasley
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
  - Committed by James Reed
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  - Committed by Jeff Rasley
  - Committed by Stas Bekman
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
  - Committed by Jeff Rasley
    * guard tabulate package in case autotuning isn't installed
    * address comment
- 17 Nov, 2021 3 commits
  - Committed by Aswin John Mathews
    * Enforce nccl/rccl alignment of start location of each shard
    * Making yapf happy
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
  - Committed by Jeff Rasley
  - Committed by Jeff Rasley
- 16 Nov, 2021 3 commits
  - Committed by Mikhail Druzhinin
    Fix partial recovery of sparse_tensor_module_names and dynamically check if gradient data is sparse (#1562)
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  - Committed by Cheng Li
  - Committed by Stas Bekman
- 14 Nov, 2021 1 commit
  - Committed by Olatunji Ruwase
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
- 13 Nov, 2021 3 commits
  - Committed by Cheng Li
    * [squash] Staging autotuning v4
    Co-authored-by: Cheng Li <pistasable@gmail.com>
    Co-authored-by: Minjia Zhang <minjiaz@microsoft.com>
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
    * add new extra, guard xgboost, cleanup dead files (#268)
    * Fix autotuning docs (#1553)
    * fix docs
    * rewording the goal
    * fix typos
    * fix typos (#1556)
    * fix typos
    * fix format
    * fix bug (#1557)
    * fix bug
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
    Co-authored-by: Minjia Zhang <minjiaz@microsoft.com>
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
  - Committed by Manuel R. Ciosici
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  - Committed by Olatunji Ruwase
- 12 Nov, 2021 6 commits
  - Committed by Baizhou Huang
    * Add warmup_type arguments in WarmupLR and WarmupDecayLR
    * Add warmup_type unit test
    * replace hardcoded constants with global vars
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
  - Committed by Reza Yazdani
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  - Committed by Reza Yazdani
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  - Committed by Conglong Li
  - Committed by Jeff Rasley
  - Committed by Jeff Rasley
- 11 Nov, 2021 1 commit
  - Committed by Olatunji Ruwase
- 10 Nov, 2021 1 commit
  - Committed by Reza Yazdani
    * fixing the softmax masking when using triangular masking
    * move the TILE declaration outside of the SIMD loop
    * remove unrelated changes
    * fix Adagrad compile issue
- 09 Nov, 2021 3 commits
  - Committed by Chunyang Wen
    Co-authored-by: Reza Yazdani <reyazda@microsoft.com>
    Co-authored-by: Reza Yazdani <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  - Committed by Jeff Rasley
  - Committed by Chunyang Wen
- 08 Nov, 2021 1 commit
  - Committed by Stas Bekman
    * [docs] fix 404
      This PR fixes a few broken links
    * fix 404