Add fp16 grad compression with grad hook (!23019) · 合并请求 · PaddlePaddle / Paddle

Add fp16 grad compression with grad hook !23019

Created by: wangxicoding

Compress fp32 gradient to fp16 for communication, reduce communication size and bandwidth usage. Suitable for use on cards that do not support tensor core, such as P4.

This PR is the second version of fp16 compression. For previous versions, see https://github.com/PaddlePaddle/Paddle/pull/22434 . Version2 may better than version1.

PaddlePaddle / Paddle 大约 1 年 前同步成功

Add fp16 grad compression with grad hook !23019

PaddlePaddle / Paddle
大约 1 年前同步成功