- 07 Jan, 2022 (1 commit)
  - Committed by Olatunji Ruwase
- 06 Jan, 2022 (4 commits)
  - Committed by Jeff Rasley
  - Committed by Conglong Li
  - Committed by Olatunji Ruwase
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  - Committed by Victor
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
- 05 Jan, 2022 (1 commit)
  - Committed by Jeff Rasley
- 04 Jan, 2022 (1 commit)
  - Committed by Manuel R. Ciosici
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
- 30 Dec, 2021 (1 commit)
  - Committed by Reza Yazdani
    * convert the fp16_params to group of parameters
    * fix typo
    * check the type of fp16_params
    * fix issue when fp16_param_groups is None
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
- 29 Dec, 2021 (2 commits)
  - Committed by Manuel R. Ciosici
  - Committed by Stas Bekman
- 22 Dec, 2021 (1 commit)
  - Committed by Jeff Rasley
- 21 Dec, 2021 (1 commit)
  - Committed by Jeff Rasley
- 18 Dec, 2021 (1 commit)
  - Committed by Conglong Li
- 17 Dec, 2021 (2 commits)
  - Committed by Alex Hedges
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  - Committed by Minjia Zhang
- 15 Dec, 2021 (3 commits)
  - Committed by Reza Yazdani
  - Committed by Cheng Li
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  - Committed by Jeff Rasley
- 14 Dec, 2021 (1 commit)
  - Committed by Jeff Rasley
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
- 13 Dec, 2021 (1 commit)
  - Committed by Gary Miguel
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
- 11 Dec, 2021 (4 commits)
  - Committed by Victor
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  - Committed by Ammar Ahmad Awan
  - Committed by Conglong Li
  - Committed by Jeff Rasley
- 10 Dec, 2021 (2 commits)
  - Committed by Jeff Rasley
  - Committed by Jeff Rasley
- 09 Dec, 2021 (1 commit)
  - Committed by Olatunji Ruwase
    * Control ds_report output with two flags, --hide_operators and --hide_errors_and_warnings; separate CLI and function entry points to ds_report
    * Formatting fixes
- 07 Dec, 2021 (1 commit)
  - Committed by Jeff Rasley
    See PyPI issue: https://status.python.org/incidents/2jj696st6yn5
- 02 Dec, 2021 (3 commits)
  - Committed by Pierce Stegman
  - Committed by Jeff Rasley
  - Committed by Reza Yazdani
    * fixing the softmax masking when using triangular masking
    * fix a bug in the layernorm backward kernels
    * revert back some changes & remove debug code
    * change the constants to a macro
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
- 01 Dec, 2021 (3 commits)
  - Committed by Alex Hedges
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  - Committed by Jeff Rasley
  - Committed by Alex Hedges
- 30 Nov, 2021 (2 commits)
  - Committed by Jeff Rasley
    Force-set LF instead of CRLF (https://github.com/pre-commit/pre-commit-hooks#mixed-line-ending) (#1598)
  - Committed by Paige Wang
    Disabling wall_clock_breakdown failed in the MoE layer because a function was incorrectly used as a property
- 28 Nov, 2021 (1 commit)
  - Committed by Stas Bekman
- 27 Nov, 2021 (1 commit)
  - Committed by Mikhail Druzhinin
    * fp16 allreduce
    * Undo sparse sum in nan check
    * communication_data_type instead of fp32_allreduce and fp16_allreduce
    * sparse_allreduce with fp32 or fp16 data type
    * Fix communication_data_type checks
    * Allow only torch data types for communication_data_type
    * Fix Zero assert messages
    Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
- 26 Nov, 2021 (1 commit)
  - Committed by Jeff Rasley
    * allow external gas
    * init state
    * add docstring
    * add missing engine.step
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
- 25 Nov, 2021 (1 commit)
  - Committed by eltonzheng
    * fix partition activations issue when mp=2 and pp=2
    * change util function input and fix pre-commit errors
    * move print_backward_tensors() to debug.py
    Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>