提交 · b9fc80a13307461991bc2d091f70182b30f21128 · 机器未来 / Paddle

15 3月, 2019 1 次提交

Support sync batch norm. (#16121) · 8ad672a2

由 qingqing01 提交于 3月 15, 2019

* Support Sync Batch Norm.
* Note, do not enable it in one device.

Usage:

build_strategy = fluid.BuildStrategy()
build_strategy.sync_batch_norm = True
binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
        loss_name=loss_mean.name,
        build_strategy=build_strategy)

8ad672a2

28 1月, 2019 1 次提交
- S
  fix compile error in distributed mode · ba4f43fd
  由 sneaxiy 提交于 1月 28, 2019
```
test=develop
```
  ba4f43fd
26 12月, 2018 1 次提交

Fp16 training (#14992) · 856f0da0

由 Wu Yi 提交于 12月 26, 2018

* wip

* wip

* wip

* wip for test

* add fp16 tests test=develop

* fix cpu build test=develop

* fix test=develop

* fix py3 tests test=develop

* fix lr_scheduler dtype test=develop

* fix test=dvelop

* test fix ci compile test=develop

* fix build and merge test=develop

* fallback momentumop change to general test=develop

* make fp16 lr schedule simple test=develop

* fix ut test=develop

* fix tests test=develop

* remove fp16 learning rate cast test=develop

856f0da0

20 12月, 2018 2 次提交

T
Revert "[Feature] Fp16 training for resnet50 (#14850)" · da87f7a6
由 typhoonzero 提交于 12月 20, 2018
```
This reverts commit 3d750f9c.
```
da87f7a6

[Feature] Fp16 training for resnet50 (#14850) · 3d750f9c

由 Wu Yi 提交于 12月 20, 2018

* wip

* wip

* wip

* wip for test

* add fp16 tests test=develop

* fix cpu build test=develop

* fix test=develop

* fix py3 tests test=develop

* fix lr_scheduler dtype test=develop

* fix test=dvelop

* test fix ci compile test=develop

* fix build and merge test=develop

* fallback momentumop change to general test=develop

3d750f9c

14 12月, 2018 1 次提交
- Y
  
  update by comment test=develop · 4a4ccac1
  由 Yancey1989 提交于 12月 14, 2018
  
  4a4ccac1
12 12月, 2018 1 次提交
- Y
  Change tensor uses proto::VarType::type · 9bd70a1e
  由 Yu Yang 提交于 12月 11, 2018
```
test=develop
```
  9bd70a1e
07 12月, 2018 1 次提交
- Y
  
  clean code · cb8a24be
  由 Yancey1989 提交于 12月 07, 2018
  
  cb8a24be
06 12月, 2018 1 次提交
- Y
  
  init parallel graph mode · c9de6f1b
  由 Yancey1989 提交于 12月 06, 2018
  
  c9de6f1b
04 12月, 2018 1 次提交

[Feature] multi process multi gpu dist training, boost v100 performance by 20% (#14661) · 29d9fb53

由 Wu Yi 提交于 12月 04, 2018

* wip multi process multi gpu dist training

* workable for p2p

* update test=develop

* change back env name test=develop

* fix alloc init

* fix cpu build test=devlop

* fix mac tests test=develop

* refine code

* refine test=develop

29d9fb53

26 11月, 2018 1 次提交
- M
  Revert the changes of VLOG · 53433d7f
  由 minqiyang 提交于 11月 26, 2018
```
test=develop
```
  53433d7f
12 11月, 2018 1 次提交
- P
  
  fix style issue · 7840d181
  由 peizhilin 提交于 11月 12, 2018
  
  7840d181
08 11月, 2018 1 次提交
- M
  Change the origin VLOG level to 10 times · 0c3227a5
  由 minqiyang 提交于 11月 08, 2018
```
Fix code to support cpplint syntax check

test=develop
```
  0c3227a5
05 11月, 2018 1 次提交
- P
  
  cpu build support · 9d67c1fb
  由 peizhilin 提交于 11月 05, 2018
  
  9d67c1fb
08 9月, 2018 1 次提交

Benchmark tool for imgnet (#12305) · f90c7865

由 Wu Yi 提交于 9月 08, 2018

* support test using executor without reader

* run imgnet

* update fluid benchmark

* wip

* update

* update all models

* support pyreader

* update

* clean up

* make profile batches contollable

* update API.spec

* update scripts

* clean dockerfile

* update

* clean comments

* add scope argument docstring

* use num_trainers to determine nccl init comms

f90c7865

14 6月, 2018 1 次提交

Fix NCCLBcast hang up bug in Parallel Executor (#11377) · 046bb5c8

由 Qiyang Min 提交于 6月 13, 2018

* 1. Create buddy allocator in each places before NcclBcast the variables
2. Check the memory usage of ALL gpus rather than the first one

* 1. Make NCCLGroupGuard guards only the ncclBcast part, which avoid ncclGroupEnd blocking the exception throwing
2. NOTE the usage of NCCLGroupGuard

* Remove the memory usage check of gpus

* Fix code style

046bb5c8

01 6月, 2018 1 次提交
- G
  
  Move sync_mode device ctx from grpc server (#10881) · 4fb7cc7f
  由 gongweibao 提交于 5月 31, 2018
  
  4fb7cc7f
14 5月, 2018 2 次提交
- Y
  
  Add build strategy · 08295f98
  由 yuyang18 提交于 5月 14, 2018
  
  08295f98
- T
  
  update by comments · 7b0c0273
  由 typhoonzero 提交于 5月 14, 2018
  
  7b0c0273
11 5月, 2018 1 次提交
- T
  
  follow comments · f5840d89
  由 typhoonzero 提交于 5月 11, 2018
  
  f5840d89
07 5月, 2018 1 次提交
- T
  
  workable version · 17009d06
  由 typhoonzero 提交于 5月 07, 2018
  
  17009d06
05 5月, 2018 1 次提交
- T
  
  testing · 3667578e
  由 typhoonzero 提交于 5月 05, 2018
  
  3667578e
04 5月, 2018 1 次提交
- T
  
  complete code · d9320dcd
  由 typhoonzero 提交于 5月 04, 2018
  
  d9320dcd
16 4月, 2018 1 次提交
- Y
  
  Use mutex to stablize ncclCtxMap · 093d227a
  由 Yu Yang 提交于 4月 16, 2018
  
  093d227a
11 4月, 2018 2 次提交
- Y
  
  Polish NCCLHelper · c64190ec
  由 Yu Yang 提交于 4月 11, 2018
  
  c64190ec
- Q
  
  Support data type int64 in NCCL. (#9818) · 129859e7
  由 qingqing01 提交于 4月 11, 2018
  
  129859e7
27 3月, 2018 2 次提交
- Y
  
  Refine allreduce op · 7dcb217e
  由 Yu Yang 提交于 3月 27, 2018
  
  7dcb217e
- Y
  
  NCCL AllReduce · c0c2e159
  由 Yu Yang 提交于 3月 27, 2018
  
  c0c2e159
21 3月, 2018 4 次提交
- Y
  
  Extract NCCLCtxMap · fe7ed285
  由 Yu Yang 提交于 3月 21, 2018
  
  fe7ed285
- Y
  
  ReorganizeCode · 6ebc6bf5
  由 Yu Yang 提交于 3月 21, 2018
  
  6ebc6bf5
- Y
  
  Add NCCL Group Guard · 41ad6323
  由 Yu Yang 提交于 3月 21, 2018
  
  41ad6323
- Y
  
  Move nccl helper · 99fe83a0
  由 Yu Yang 提交于 3月 21, 2018
  
  99fe83a0
08 3月, 2018 1 次提交
- Y
  
  Fix CI · 5cb79524
  由 Yu Yang 提交于 3月 08, 2018
  
  5cb79524
07 3月, 2018 2 次提交
- Y
  Add Writer/Scanner · bcb80756
  由 Yu Yang 提交于 3月 07, 2018
```
Make vec<Tensor> can be serialized to RecordIO
```
  bcb80756
- F
  
  fix compile errors · af64f39b
  由 fengjiayi 提交于 3月 07, 2018
  
  af64f39b
06 3月, 2018 2 次提交
- F
  
  init double buffer · 3fcd16ed
  由 fengjiayi 提交于 3月 06, 2018
  
  3fcd16ed
- Y
  
  Extract create_reader_op to three files · 4d8345e3
  由 Yu Yang 提交于 3月 06, 2018
  
  4d8345e3
15 2月, 2018 1 次提交

Update tensor_util.h (#8422) · cfffb1a3

由 Yi Wang 提交于 2月 14, 2018

* Update tensor_util.h

* Update with moved TensorDesc

* Fix tensur_utils.cu

* Update

* Update

* Update

* Update

* Make tensor_util.cu a symbolic link

cfffb1a3

10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致