- 01 12月, 2021 3 次提交
-
-
由 Alex Hedges 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Jeff Rasley 提交于
-
由 Alex Hedges 提交于
-
- 30 11月, 2021 2 次提交
-
-
由 Jeff Rasley 提交于
force set lf instead of crlf (https://github.com/pre-commit/pre-commit-hooks#mixed-line-ending) (#1598)
-
由 Paige Wang 提交于
wall_clock_breakdown disable failed in moe layer due to incorrectly used a function as property
-
- 28 11月, 2021 1 次提交
-
-
由 Stas Bekman 提交于
-
- 27 11月, 2021 1 次提交
-
-
由 Mikhail Druzhinin 提交于
* fp16 allreduce * Undo sparse sum in nan check * communication_data_type instead of fp32_allreduce and fp16_allreduce * sparse_allreduce with fp32 or fp16 data type * FIx communication_data_type checks * Allow only torch data types for communication_data_type * Fix Zero assert messages Co-authored-by: NJeff Rasley <jerasley@microsoft.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 26 11月, 2021 1 次提交
-
-
由 Jeff Rasley 提交于
* allow external gas * init state * add docstring * add missing engine.step Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 25 11月, 2021 1 次提交
-
-
由 eltonzheng 提交于
* fix partition activations issue when mp=2 and pp=2 * change util function input and fix pre-commit errors * move print_backward_tensors() to debug.py Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
- 24 11月, 2021 1 次提交
-
-
由 Mikhail Druzhinin 提交于
-
- 23 11月, 2021 5 次提交
-
-
由 Wenhao Hu 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 alexandremuzio 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Chunyang Wen 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Manuel R. Ciosici 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Stas Bekman 提交于
-
- 20 11月, 2021 2 次提交
-
-
由 Jeff Rasley 提交于
-
由 Jeff Rasley 提交于
-
- 19 11月, 2021 2 次提交
-
-
由 Jeff Rasley 提交于
-
由 Stas Bekman 提交于
-
- 18 11月, 2021 5 次提交
-
-
由 Jeff Rasley 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 James Reed 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Jeff Rasley 提交于
-
由 Stas Bekman 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Jeff Rasley 提交于
* guard tabulate package in case autotuning isn't installed * address comment
-
- 17 11月, 2021 3 次提交
-
-
由 Aswin John Mathews 提交于
* Enforce nccl/rccl alignment of start location of each shard * Making yapf happy Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Jeff Rasley 提交于
-
由 Jeff Rasley 提交于
-
- 16 11月, 2021 3 次提交
-
-
由 Mikhail Druzhinin 提交于
Fix partial recovery of sparse_tensor_module_names and dynamically check if gradient data is sparse (#1562) Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Cheng Li 提交于
-
由 Stas Bekman 提交于
-
- 14 11月, 2021 1 次提交
-
-
由 Olatunji Ruwase 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
- 13 11月, 2021 3 次提交
-
-
由 Cheng Li 提交于
* [squash] Staging autotuning v4 Co-authored-by: NCheng Li <pistasable@gmail.com> Co-authored-by: NMinjia Zhang <minjiaz@microsoft.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com> * add new extra, guard xgboost, cleanup dead files (#268) * Fix autotuning docs (#1553) * fix docs * rewording the goal * fix typos * fix typos (#1556) * fix typos * fix format * fix bug (#1557) * fix bug Co-authored-by: NJeff Rasley <jerasley@microsoft.com> Co-authored-by: NMinjia Zhang <minjiaz@microsoft.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Manuel R. Ciosici 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Olatunji Ruwase 提交于
-
- 12 11月, 2021 6 次提交
-
-
由 Baizhou Huang 提交于
* Add warmup_type arguments in WarmupLR and WarmupDecayLR * Add warmup_type unit test * replace hardcoded constants with global vars Co-authored-by: NJeff Rasley <jerasley@microsoft.com> Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
-
由 Reza Yazdani 提交于
Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Reza Yazdani 提交于
Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com> Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
-
由 Conglong Li 提交于
-
由 Jeff Rasley 提交于
-
由 Jeff Rasley 提交于
-