1. 11 Jan 2021, 1 commit
    • [Cherry-Pick] Support pure fp16 training for AMP API. (#29544) (#30241) · d8dfef54
      Committed by Zhen Wang
      * Support pure fp16 training for AMP API. (#29544)
      
      * add cast ops before and after unsupported fp16 ops.
      
      * Keep partial net in FP32 pattern.
      
      * Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode.
      
      * Add fp16 support for adam op.
      
      * add multi precision attr for adam.
      
      * Fix the bug of test_multi_precision_fp16_train UT.
      
      * Code format for CI.
      
      * Fix the redefine error about MPTypeTrait on windows.
      
      * fix bugs of the _create_accumulators func in Momentum.
      
      * fix bug when inserting post cast op.
      
      * Add the update_loss_scaling op in allow_set of UnusedVarCheck.
      
      * Update for ci coverage.
      
      * Add some doc for OptimizerWithMixedPrecision.
      
      * Fix the code style.
      
      * Improve the doc of `amp_init`.
      
      * Change for fp16 testing if users have the infer program defined in separate way.
      
      * Remove tensor copy in the update_loss_scaling op. (#29426)
      
      * remove tensor copy in the update_loss_scaling op
      
      * not use thrust.
      
      * fix some cuda memory access error.
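The `check_finite_and_unscale` and `update_loss_scaling` steps mentioned in the commits above follow the standard dynamic loss-scaling pattern used in mixed-precision training. A minimal NumPy sketch of that pattern (function and parameter names are hypothetical, not Paddle's actual op signatures):

```python
import numpy as np

def update_loss_scaling(grads, loss_scale, good_steps,
                        incr_every_n=1000, incr_ratio=2.0, decr_ratio=0.5):
    """Dynamic loss scaling: unscale gradients, then grow or shrink the scale.

    Generic sketch of the technique the commits describe, not Paddle's
    update_loss_scaling op itself.
    """
    # check_finite_and_unscale: divide out the scale and detect inf/nan.
    unscaled = [g / loss_scale for g in grads]
    found_inf = any(not np.all(np.isfinite(g)) for g in unscaled)

    if found_inf:
        # Overflow: drop this step's gradients and shrink the scale.
        loss_scale *= decr_ratio
        good_steps = 0
        unscaled = None
    else:
        good_steps += 1
        if good_steps >= incr_every_n:
            # Enough consecutive finite steps: try a larger scale.
            loss_scale *= incr_ratio
            good_steps = 0
    return unscaled, loss_scale, good_steps
```

The cherry-picked PR additionally removes a tensor copy inside this op on GPU; the control flow above is unchanged by that optimization.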
  2. 07 Apr 2020, 1 commit
  3. 27 Feb 2020, 1 commit
    • Refine adam op to improve performance, test=develop (#22346) · 72dde4ab
      Committed by zhaoyuchen2018
      * Refine adam op, test=develop
      
      * Fuse kernels together to reduce cpu time.
      
      * Refine paddle enforce, test=develop
      
      * Remove some comments, test=develop
      
      * Refine code,test=develop
      
      * Refine cuda kernel, test=develop
      
      * Refine code according to comments, test=develop
  4. 24 Dec 2019, 1 commit
    • Optimize adam speed (#21777) · 51a86d2b
      Committed by Aurelius84
      * optimize adam speed by removing _finish_update test=develop
      
      * fix SparseAdamFunctor param list test=develop
      
      * Remove scale_op in expect_list of adam_op test=develop
      
      * fix test optimizer loss assert error test=develop
      
      * fix test optimizer loss assert error test=develop
      
      * modify PADDLE_ENFORCE usage test=develop
      
      * fix op_type in lamb_op.cc test=develop
      
      * fix errors ostream format bug test=develop
      
      * add betaPowOut in ngraph op test=develop
      
      * fix ngraph::op api for gcc8 test=develop
      
      * clean code test=develop
      
      * modify struct into class test=develop
      
      * remove code of beta1Tensor in lamb_op test=develop
  5. 28 Nov 2019, 1 commit
  6. 28 Oct 2019, 1 commit
  7. 04 Sep 2019, 1 commit
  8. 21 May 2019, 1 commit
    • Add LAMB Optimizer support (#17489) · f9796b12
      Committed by Yibing Liu
      * Add LAMB optimizer
      
      * Expose LAMB Optimizer's APIs
      
      test=develop, test=document_preview
      
      * Cleanup code & doc
      
      test=develop, test=document_preview
      
      * Update lamb optimizer's formula
      
      test=develop
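The LAMB optimizer added here combines Adam-style moment estimates with a per-layer trust ratio. An illustrative NumPy sketch of one LAMB step, under common default hyperparameters (details such as bias correction and ratio clipping vary by implementation, and this is not Paddle's `lamb_op` code):

```python
import numpy as np

def lamb_step(param, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999,
              eps=1e-6, weight_decay=0.01):
    """One LAMB step: Adam moments plus a layer-wise trust ratio.

    Sketch of the optimizer the commit adds, not the actual op.
    """
    # Adam-style first and second moment updates with bias correction.
    m = beta1 * m + (1 - beta1) * grad
    v = beta2 * v + (1 - beta2) * grad * grad
    m_hat = m / (1 - beta1 ** t)
    v_hat = v / (1 - beta2 ** t)
    # Adam direction plus decoupled weight decay.
    update = m_hat / (np.sqrt(v_hat) + eps) + weight_decay * param
    # Trust ratio rescales the step to the layer's weight norm.
    w_norm = np.linalg.norm(param)
    u_norm = np.linalg.norm(update)
    ratio = w_norm / u_norm if w_norm > 0 and u_norm > 0 else 1.0
    param = param - lr * ratio * update
    return param, m, v
```

The trust ratio is what distinguishes LAMB from Adam: layers with large weight norms take proportionally larger steps, which helps large-batch training.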
  9. 15 Jan 2019, 1 commit
  10. 07 Jan 2019, 1 commit
  11. 14 Dec 2018, 1 commit
  12. 13 Dec 2018, 1 commit
  13. 12 Dec 2018, 2 commits
  14. 11 Dec 2018, 1 commit
  15. 16 Nov 2018, 1 commit
    • Refine operator cmake (#14413) · a2d9b344
      Committed by Wu Yi
      * wip simplify operator framework
      
      * wip
      
      * wip
      
      * done test=develop
      
      * clean test=develop
      
      * fix test=develop
      
      * fix deps test=develop
      
      * fix cpu build test=develop
      
      * fix tensorrt build test=develop
      
      * fix tests test=develop
      
      * fix test=develop
      
      * fix cpu build test=develop
  16. 22 Oct 2018, 1 commit
  17. 27 Jun 2018, 1 commit
  18. 11 Jun 2018, 1 commit
  19. 08 May 2018, 1 commit
    • Clean OpProtoAndCheckerMaker · 0e78cb69
      Committed by Yu Yang
      Do not use ctor
      
      * Reduce line of codes.
      * We can use virtual function for Maker now.
      * The implementation does not care what the maker holds, so it is easier to refactor later.
  20. 06 May 2018, 1 commit
  21. 12 Feb 2018, 1 commit
  22. 10 Feb 2018, 2 commits
  23. 20 Dec 2017, 1 commit
  24. 12 Dec 2017, 2 commits
    • Refine device context (#6433) · 61ec0b95
      Committed by QI JUN
      There are mainly following fixes:
      
      - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
      - remove `eigen_device` interface in base class  `DeviceContext`
      - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
      - remove unused `platform::EigenDeviceConverter`
      - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
      - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
    • Updating the LaTeX equation for Adagrad (#6009) · 35420cdf
      Committed by kavyasrinet
      * Updating the LaTeX equation for Adagrad
      
      * Fixing LaTeX equations for adadelta, adam and adamax
  25. 21 Nov 2017, 1 commit
  26. 05 Nov 2017, 1 commit
  27. 20 Oct 2017, 1 commit
  28. 17 Oct 2017, 1 commit
  29. 13 Oct 2017, 1 commit
    • Adding the Adam Optimizer operator (#4733) · 11680037
      Committed by Abhinav Arora
      * add adam op
      
      moment1_out = beta1 * moment1 + (1 - beta1) * grad
      moment2_out = beta2 * moment2 + (1 - beta2) * grad * grad
      moment1_hat = moment1_out / (1 - beta1^t)
      moment2_hat = moment2_out / (1 - beta2^t)
      param_out = param - learning_rate * moment1_hat / (sqrt(moment2_hat) + epsilon)
      
      * fix moment 2
      
      * Adding the Adam optimization operator
      
      * Adding more tests for Adam op
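The update rule quoted in this commit message can be transcribed directly into a small NumPy sketch (variable names follow the commit's formulas; this is illustrative, not the op's actual kernel):

```python
import numpy as np

def adam_update(param, grad, moment1, moment2, t,
                learning_rate=1e-3, beta1=0.9, beta2=0.999, epsilon=1e-8):
    """One Adam step, transcribing the formulas from the commit message."""
    # Exponential moving averages of the gradient and its square.
    moment1_out = beta1 * moment1 + (1 - beta1) * grad
    moment2_out = beta2 * moment2 + (1 - beta2) * grad * grad
    # Bias correction for step t (1-indexed).
    moment1_hat = moment1_out / (1 - beta1 ** t)
    moment2_hat = moment2_out / (1 - beta2 ** t)
    # Parameter update.
    param_out = param - learning_rate * moment1_hat / (np.sqrt(moment2_hat) + epsilon)
    return param_out, moment1_out, moment2_out
```

At t=1 the bias correction cancels the (1 - beta) factors exactly, so the first step moves each parameter by roughly learning_rate in the direction of the gradient's sign.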