PaddlePaddle / Paddle
1 年多前同步成功

代码
- 文件
- 提交
- 分支
- Tags
- 贡献者
- 分支图
- Diff
Issue 1423
- 列表
- 看板
- 标记
- 里程碑
合并请求 543
Wiki 0
- Wiki
分析
- 仓库
- DevOps
项目成员
Pages

Support sync batch norm. !16121

Created by: qingqing01

Now only support GPU, CPU will be add in nexted PR.
Use ncclAllReduce to sync E(x) and E(x^2) on multi-gpus in one machine. ~~3. Add ncclComm_t in DeviceContext and initialized by init.cc~~
The unit testing is to compare the forward outputs and backward outputs with batch_norm on one GPU.

build_strategy = fluid.BuildStrategy()
build_strategy.sync_batch_norm = True
binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
        loss_name=loss_mean.name,
        build_strategy=build_strategy)