Unverified commit 40c381df, authored by Olatunji Ruwase, committed by GitHub

[Doc] round_robin_gradients (#1261)

* Fix docstring

* Make screenshots clickable for easier viewing

* Navigation menu in alphabetical order; More clickable screenshots

* Rename 1Cycle doc

* Tweak naming

* Remove no longer used flag

* ZeRO3 Offload release

* Single GPU results

* Rearrange figures

* Single GPU text

* tweak intro

* zero3-offload section

* Add asynchronous i/o docs

* Fix print_per_steps doc

* Document round_robin_gradients

* Tweak description

* Trigger CI
Parent e82060d0
......@@ -301,6 +301,7 @@ Enabling and configuring ZeRO memory optimizations
"elastic_checkpoint" : [true|false],
"stage3_gather_fp16_weights_on_model_save": [true|false],
"ignore_unused_parameters": [true|false],
"round_robin_gradients": [true|false]
}
```
......@@ -358,6 +359,12 @@ Enabling and configuring ZeRO memory optimizations
| ------------------------------------------------------------------------------------------------------------------------------------------ | ------- |
| For use with ZeRO stage 1, enable backward hooks to reduce gradients during the backward pass or wait until the end of the backward pass. | `True` |
***round_robin_gradients***: [boolean]
| Description | Default |
| ------------------------------------------------------------------------------------------------------------------------------------------ | ------- |
| Stage 2 optimization for CPU offloading that parallelizes gradient copying to CPU memory across ranks through fine-grained gradient partitioning. The performance benefit grows with the number of gradient accumulation steps (more copying between optimizer steps) or with the GPU count (increased parallelism). | `False` |
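
As an illustration of how the flag might be combined with the other settings the description mentions, here is a minimal sketch of a configuration built in Python. It assumes ZeRO stage 2 with optimizer offload to CPU; the batch size and accumulation values are placeholders chosen for the example, not recommendations, and the variable names `ds_config` and `config_json` are hypothetical.

```python
import json

# Hypothetical example configuration; numeric values are placeholders.
ds_config = {
    "train_micro_batch_size_per_gpu": 4,
    # More accumulation steps means more gradient copies to CPU
    # between optimizer steps, which is where the flag is said to help.
    "gradient_accumulation_steps": 8,
    "zero_optimization": {
        "stage": 2,
        # CPU offloading, the case the flag is described for.
        "offload_optimizer": {"device": "cpu"},
        # Defaults to False when omitted.
        "round_robin_gradients": True,
    },
}

# Serialize to JSON, e.g. for writing to a config file.
config_json = json.dumps(ds_config, indent=2)
print(config_json)
```

Whether the setting pays off depends on the workload; per the description above, the benefit is expected to be larger with more accumulation steps or more GPUs.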
***offload_param***: [dictionary]
| Description | Default |
......