PaddlePaddle / Paddle
1 年多前同步成功

代码
- 文件
- 提交
- 分支
- Tags
- 贡献者
- 分支图
- Diff
Issue 1423
- 列表
- 看板
- 标记
- 里程碑
合并请求 543
Wiki 0
- Wiki
分析
- 仓库
- DevOps
项目成员
Pages

Refine dropout gpu memory !17095

Created by: sneaxiy

This PR changes the output Mask of dropout_op to be type of uint8_t. (Furthermore, we can change Mask to be something like std::vector<bool>).

This PR makes the maximum batch size of Transformer model in benchmark repo reach 12000 stably.