Commits · 7f7dfccf20347eb9f0600b15a6472c32f1c34c4b · SummerGao. / Paddle

08 Jan, 2021 1 commit

Support pure fp16 training for AMP API. (#29544) · 7f7dfccf

Committed by Zhen Wang on Jan 08, 2021

* add cast ops before and after unsupported fp16 ops.

* Keep partial net in FP32 pattern.

* Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode.

* Add fp16 support for adam op.

* add multi precision attr for adam.

* Fix the bug of test_multi_precision_fp16_train UT.

* Code format for CI.

* Fix the redefine error about MPTypeTrait on windows.

* fix bugs of the _create_accumulators func in Momentum.

* fix bug when inserting post cast op.

* Add the update_loss_scaling op in allow_set of UnusedVarCheck.

* Update for ci coverage.

* Add some doc for OptimizerWithMixedPrecision.

* Fix the code style.

* Improve the doc of `amp_init`.

* Change for fp16 testing if users have the infer program defined in separate way.

02 Dec, 2020 2 commits

Remove some useless log. (#29300) · 9b59a589

Committed by Zhen Wang on Dec 02, 2020

Add pure fp16 training with master weights. (#27712) · be3777a5

Committed by Zhen Wang on Dec 02, 2020

* add the weight decay func for the momentum op

* Add the multi_precision function in Momentum Optimizer.

* Make sure that the initial value of master weights are same with the fp16 weights.

* add static loss scaling.

* add the rescale_grad function in the pure fp16 training.

* use the original momentum updating method.

* Polish some codes, such as variable names.

* add docstring for apis.

* update the var creation details of _create_master_weight.

* not modify codes about imperative momentum updating.

* Fix the error of test_dist_sparse_tensor_load_momentum UT.

* add unit test for multi precision fp16 training.

* add more unit tests for CI.

* Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT.

* For CI Coverage Checking.
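
As an illustration of the master-weight idea in the bullets above (a minimal sketch, not code from this commit): the optimizer keeps a higher-precision copy of each weight, applies `rescale_grad` and weight decay to the gradient, updates the master copy, and re-derives the fp16 weight from it. The names `to_fp16` and `momentum_step`, the plain (non-Nesterov) update rule, and the use of Python floats as a stand-in for fp32 master weights are all assumptions made for this example.

```python
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("e", struct.pack("e", x))[0]


def momentum_step(master_w, velocity, grad_fp16, lr, mu=0.9,
                  rescale_grad=1.0, weight_decay=0.0):
    """One momentum update on a high-precision master weight (hypothetical helper).

    Returns (new_master_w, new_velocity, new_fp16_w).
    """
    # Undo static loss scaling, then fold L2 weight decay into the gradient.
    grad = grad_fp16 * rescale_grad + weight_decay * master_w
    velocity = mu * velocity + grad
    master_w = master_w - lr * velocity
    # The fp16 weight used by forward/backward is re-derived from the master copy.
    return master_w, velocity, to_fp16(master_w)


if __name__ == "__main__":
    lr, loss_scale = 0.125, 1024.0
    scaled_grad = 1.0  # true gradient 2**-10, multiplied by the loss scale

    # Pure fp16 weight: each update of 2**-13 is rounded away.
    w16_only = 1.0
    for _ in range(8):
        w16_only = to_fp16(w16_only - lr * scaled_grad / loss_scale)

    # Master-weight path: small updates accumulate, then show up in fp16.
    master, vel, w16 = 1.0, 0.0, 1.0
    for _ in range(8):
        master, vel, w16 = momentum_step(
            master, vel, scaled_grad, lr, mu=0.0, rescale_grad=1.0 / loss_scale)

    print(w16_only, master, w16)
```

In this demo each step is smaller than half the fp16 spacing just below 1.0, so the fp16-only weight never moves, while the master copy accumulates the eight steps and the derived fp16 weight eventually changes.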

23 Nov, 2020 1 commit

refactor momentum op to combine weight (#27414) · 8ff35506

Committed by furnace on Nov 23, 2020

* refactor momentum op to combine weight_decay (scale op and sum op)

27 Sep, 2020 1 commit

fix error message (#27318) · d014e29f

Committed by Chengmo on Sep 27, 2020

* fix sgd/momentum/dpsgd/rmsprop error message

28 Oct, 2019 1 commit

Replace risky GetInputType method with secure IndicateVarDataType interface (#20668) · 26cc1fe5

Committed by Chen Weihang on Oct 28, 2019

* replace part of the old implementation, test=develop

* restore concat op, test=develop

* update all ops implementation & delete GetDataTypeOfVar func, test=develop

24 Oct, 2019 1 commit

Fix DGC algorithm flow to make it the same as paper (#20758) · 250e72d2

Committed by WangXi on Oct 24, 2019

04 Sep, 2019 1 commit

Add user-friendly error message in optimizer ops to give a hint about the position sensitive problem of run(startup_program) (#19605) · 8cb54ede

Committed by Chen Weihang on Sep 04, 2019

* add extra error message hint in optimizer ops

* polish format & delete useless change, test=develop

* extract init judge from shape compare, test=develop

19 Mar, 2019 1 commit

add allocator flags · 22715487

Committed by zhhsplendid on Mar 19, 2019

test=develop

15 Mar, 2019 1 commit

fix const_cast · f0d108f5

Committed by sneaxiy on Mar 15, 2019

test=develop

26 Dec, 2018 1 commit

Fp16 training (#14992) · 856f0da0

Committed by Wu Yi on Dec 26, 2018

* wip

* wip

* wip

* wip for test

* add fp16 tests test=develop

* fix cpu build test=develop

* fix test=develop

* fix py3 tests test=develop

* fix lr_scheduler dtype test=develop

* fix test=dvelop

* test fix ci compile test=develop

* fix build and merge test=develop

* fallback momentumop change to general test=develop

* make fp16 lr schedule simple test=develop

* fix ut test=develop

* fix tests test=develop

* remove fp16 learning rate cast test=develop

20 Dec, 2018 2 commits

Revert "[Feature] Fp16 training for resnet50 (#14850)" · da87f7a6

Committed by typhoonzero on Dec 20, 2018

This reverts commit 3d750f9c.

[Feature] Fp16 training for resnet50 (#14850) · 3d750f9c

Committed by Wu Yi on Dec 20, 2018

* wip

* wip

* wip

* wip for test

* add fp16 tests test=develop

* fix cpu build test=develop

* fix test=develop

* fix py3 tests test=develop

* fix lr_scheduler dtype test=develop

* fix test=dvelop

* test fix ci compile test=develop

* fix build and merge test=develop

* fallback momentumop change to general test=develop

19 Dec, 2018 1 commit

rewrite variable type · ae6f46a1

Committed by sneaxiy on Dec 19, 2018

test=develop

26 Nov, 2018 1 commit

Revert the changes of VLOG · 53433d7f

Committed by minqiyang on Nov 26, 2018

test=develop

16 Nov, 2018 1 commit

Refine operator cmake (#14413) · a2d9b344

Committed by Wu Yi on Nov 16, 2018

* wip simplify operator framework

* wip

* wip

* done test=develop

* clean test=develop

* fix test=develop

* fix deps test=develop

* fix cpu build test=develop

* fix tensorrt build test=develop

* fix tests test=develop

* fix test=develop

* fix cpu build test=develop

08 Nov, 2018 1 commit

Change the origin VLOG level to 10 times · 0c3227a5

Committed by minqiyang on Nov 08, 2018

Fix code to support cpplint syntax check

test=develop

29 Oct, 2018 1 commit

[1.1] [project] train imagenet using large batch size (#13766) · 26200f2e

Committed by Wu Yi on Oct 29, 2018

* fix nccl2 lars dist support

* put lars in momentum op

* add tests lars

* fix ci

* fix cpu kernel

* soft warning

* remove lars in test_recognize_digits.py

* move to another op

* add file

* update api.spec test=develop

* update test=develop

* fix api.spec test=develop

* wip

* wip, finish grad merge ops

* wip, finish graph build

* wip test running

* work on 1 gpu

* workable version

* update

* fix tests

* fuse broadcast op

* fix compile failed

* refine

* add batch merge test mnist

* fix CI test=develop

* fix build

* use independent bn params for batch merge test=develop

* update api.spec

* follow comments and for test

* wip

* refine tests test=develop

* follow comments test=develop

* remove startup bn modify test=develop

* follow comments test=develop

* fix merge test=develop

17 Oct, 2018 3 commits

fix compile in cpu error. test=develop · 00e8791f

Committed by dzhwinter on Oct 17, 2018

use binary search. test=develop · d239cf2e

Committed by dzhwinter on Oct 17, 2018

use binary search. test=develop · a9f5f822

Committed by dzhwinter on Oct 17, 2018

15 Oct, 2018 1 commit

Add check for opt op (#13840) · 8e2fdc54

Committed by chengduo on Oct 15, 2018

* add check for opt op

* fix opt op
test=develop

* fix test fail
test=develop

* fix optimization doc
test=develop

* test=develop

14 Oct, 2018 1 commit

add sparse update momentum. test=develop · 8329a1f1

Committed by dzhwinter on Oct 14, 2018

20 Jul, 2018 1 commit

Fix serious bug in nesterov momentum optimizer. (#12231) · 873a50ce

Committed by qingqing01 on Jul 20, 2018

* Fix serious bug in nesterov momentum optimizer.

12 Feb, 2018 1 commit

Fix the grammar in copyright. (#8403) · 24509f4a

Committed by qingqing01 on Feb 12, 2018

10 Feb, 2018 2 commits

Correct #include path · fc374821

Committed by Yi Wang on Feb 09, 2018

Move file to fluid/; Edit CMakeLists.txt · 90648f33

Committed by Yi Wang on Feb 09, 2018

05 Dec, 2017 2 commits

Refine the Eigen usage for CPU implementation. · e03b574e

Committed by dangqingqing on Dec 05, 2017

Refine and speedup momentum operator. · 5bd1e73f

Committed by dangqingqing on Dec 05, 2017

10 Nov, 2017 1 commit

Fix attribute naming for momentum_op (#5453) · 2e355f03

Committed by Siddharth Goyal on Nov 09, 2017

* Fix attribute naming for momentum_op

* Fix minor typo in comment

* Fix attribute name

* Fix names in test_optimizer

* Fix python wrapper

20 Oct, 2017 1 commit

Adding Nesterov Momentum (#4948) · 5380a547

Committed by kavyasrinet on Oct 20, 2017

06 Oct, 2017 2 commits

Fix learning_rate usage for momentum · db77937e

Committed by sidgoyal78 on Oct 05, 2017

Modify implementation · c10da26c

Committed by sidgoyal78 on Oct 05, 2017

03 Oct, 2017 2 commits

Add momentum operator · d28b3094

Committed by sidgoyal78 on Oct 02, 2017

Changing learning rate from attribute to input(float) (#4568) · 42e7fe05

Committed by Abhinav Arora on Oct 02, 2017

* Changing learning rate from attribute to input(float)
* Removing obsolete code

28 Sep, 2017 1 commit

Add Skeleton of Double support · 3a5693e0

Committed by Yu Yang on Sep 27, 2017

06 Sep, 2017 1 commit

Change `Op::GetAttr` to `Op::Attr` · 9de6a4b3

Committed by Yu Yang on Sep 05, 2017

Fix #3902

04 Sep, 2017 1 commit

add GetAttr to InferShapeContext · d323831a

Committed by qiaolongfei on Sep 03, 2017

03 Sep, 2017 1 commit

add op() to InferShapeContext · 6fcdc916

Committed by qiaolongfei on Sep 02, 2017

18 Aug, 2017 1 commit

fix-sgd · 8b3d33a0

Committed by qiaolongfei on Aug 17, 2017
