提交 · 0c31579c1c0242e184fe2dc7f8e14f4949da62a7 · BaiXuePrincess / Paddle

13 10月, 2021 1 次提交

由 limingshu 提交于 10月 13, 2021

* A leap of try for cudaLaunchCooperativeKernel

* fix bugs

* Totally replace the lar cuda kernel

* Fix bugs

* a test for lars merge

* Adding las_op_momentum infer_shape

* Fix codes

* use avg_numel instead of max_numel to acquire grid num

* modify unittest files about lars op

* Finally converge when merged-lars works

* fix ctest files

* add merged_operation kernel when cuda version is older than 11

* Fix code style

* fix ctest failure

* fix error

* fix all ctest error and change lars compute code of cpu

* fix bugs on v100.

* revert python modififation about lars

* revert python modification codes

0c31579c

13 9月, 2021 1 次提交

[RC22] Fix linear with matmul_op replace (#35445) · 53e294ca

由 zhulei 提交于 9月 13, 2021

* [RC22] Fix linear with matmul_op replace

* [RC22] Fix linear with matmul_op replace

* [RC22] Fix linear with matmul_op replace

* [RC22] Fix linear with matmul_op replace

* [RC22] Fix linear with matmul_op replace

53e294ca

10 6月, 2021 1 次提交

fuse L2Decay and momentum when param.regularizer is set (#32845) · a526b3e0

由 Zhang Ting 提交于 6月 10, 2021

* fuse L2Decay and momentum when param.regularizer is set

* add unittest

* refine

* refine _create_regularization_of_grad of momentum

* improve append_optimizer_op

a526b3e0

03 6月, 2021 1 次提交
- Y
  
  multi pricison for lars op and lars optimizer (#33280) · 4d805e6a
  由 Yuang Liu 提交于 6月 03, 2021
  
  4d805e6a
31 5月, 2021 1 次提交
- W
  support params groups, test=develop (#32830) · 2a771c06
  由 wangguanzhong 提交于 5月 31, 2021
```
* support params groups, test=develop

* simplify updating opt attr

* update according to review
```
  2a771c06
02 12月, 2020 1 次提交

Add pure fp16 training with master weights. (#27712) · be3777a5

由 Zhen Wang 提交于 12月 02, 2020

* add the weight decay func for the momentum op

* Add the multi_precision function in Momentum Optimizer.

* Make sure that the initial value of master weights are same with the fp16 weights.

* add static loss scaling.

* add the rescale_grad function in the pure fp16 training.

* use the original momentum updating method.

* Polish some codes, such as variable names.

* add docstring for apis.

* update the var creation details of _create_master_weight.

* not modify codes about imperative momentum updating.

* Fix the error of test_dist_sparse_tensor_load_momentum UT.

* add unit test for multi precision fp16 training.

* add more unit tests for CI.

* Use lower threshold values for allclose comparing in test_multi_precision_fp16_train UT.

* For CI Coverage Checking.

be3777a5

01 12月, 2020 2 次提交

J
Momentum Velocity init in Momentum.__init__() (#29223) · a5d13d59
由 Jiawei Wang 提交于 12月 01, 2020
```
* add lamb optimizer and unittest

* fix momentum resume training

* fix momentum acc
```
a5d13d59

accumulate gradient for leaf tensor with previous graph and expose leaf tensor concept (#28429) · c0a991c8

由 Zhou Wei 提交于 12月 01, 2020

* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor

* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor

* fix coverage

* fix api doc

* fix CI unittest

* fix CI unittest

* fix unitest

* empty tensor does’t need inner_var_

* fix some error message

c0a991c8

23 11月, 2020 1 次提交
- F
  refactor momentum op to combine weight (#27414) · 8ff35506
  由 furnace 提交于 11月 23, 2020
```
* refactor momentum op to combine weight_decay (scale op and sum op)
```
  8ff35506
29 8月, 2020 1 次提交

Adadelta Optimizer (#26590) · a1b99fae

由 Jiawei Wang 提交于 8月 29, 2020

* add doc; notest

* fix doc; notest

* update doc; notest

* refine optimizer && adam

* refine optimizer; notest

* add adam

* fix doc

* fix doc && add adamw; notest

* add error message

* bug fix

* refine rmsprop && adamax

* fix ci

* buf fix

* update comment

* unify arguments place; notest

* fix ut, test=develop

* bug fix

* fix conflicts, test=develop

* add examples code

* bug fix

* fix comments

* fix sample code

* add sample code for Optimizer

* add adamax ut, test=develop

* fix rmsprop ut, test=develop

* add ut for optimizer.py and adamw.py

* first commit of adadelta optimizer

* fix learning rate

* fix adadelta doc and add sgd momentum

* remove unused fluid

* fix codestyle

* Update test_adam_op.py

* Update test_adam_op.py

* fix SGD in 2 unittests

* fix SGD in 2 unittests

* fix ci

* fix ut
Co-authored-by: NMRXLT <xlt2024@gmail.com>
Co-authored-by: Nmapingshuo <mps2012@yeah.net>

a1b99fae

26 12月, 2018 1 次提交

Fp16 training (#14992) · 856f0da0

由 Wu Yi 提交于 12月 26, 2018

* wip

* wip

* wip

* wip for test

* add fp16 tests test=develop

* fix cpu build test=develop

* fix test=develop

* fix py3 tests test=develop

* fix lr_scheduler dtype test=develop

* fix test=dvelop

* test fix ci compile test=develop

* fix build and merge test=develop

* fallback momentumop change to general test=develop

* make fp16 lr schedule simple test=develop

* fix ut test=develop

* fix tests test=develop

* remove fp16 learning rate cast test=develop

856f0da0

20 12月, 2018 2 次提交

T
Revert "[Feature] Fp16 training for resnet50 (#14850)" · da87f7a6
由 typhoonzero 提交于 12月 20, 2018
```
This reverts commit 3d750f9c.
```
da87f7a6

[Feature] Fp16 training for resnet50 (#14850) · 3d750f9c

由 Wu Yi 提交于 12月 20, 2018

* wip

* wip

* wip

* wip for test

* add fp16 tests test=develop

* fix cpu build test=develop

* fix test=develop

* fix py3 tests test=develop

* fix lr_scheduler dtype test=develop

* fix test=dvelop

* test fix ci compile test=develop

* fix build and merge test=develop

* fallback momentumop change to general test=develop

3d750f9c

29 10月, 2018 1 次提交

[1.1] [project] train imagenet using large batch size (#13766) · 26200f2e

由 Wu Yi 提交于 10月 29, 2018

* fix nccl2 lars dist support

* put lars in momentum op

* add tests lars

* fix ci

* fix cpu kernel

* soft warning

* remove lars in test_recognize_digits.py

* move to another op

* add file

* update api.spec test=develop

* update test=develop

* fix api.spec test=develop

* wip

* wip, finish grad merge ops

* wip, finish graph build

* wip test running

* work on 1 gpu

* workable version

* update

* fix tests

* fuse broadcast op

* fix compile failed

* refine

* add batch merge test mnist

* fix CI test=develop

* fix build

* use independent bn params for batch merge test=develop

* update api.spec

* follow comments and for test

* wip

* refine tests test=develop

* follow comments test=develop

* remove startup bn modify test=develop

* follow comments test=develop

* fix merge test=develop

26200f2e

17 10月, 2018 1 次提交
- D
  
  fix compile in cpu error. test=develop · 00e8791f
  由 dzhwinter 提交于 10月 17, 2018
  
  00e8791f
14 10月, 2018 1 次提交
- D
  
  add sparse update momentum. test=develop · 8329a1f1
  由 dzhwinter 提交于 10月 14, 2018
  
  8329a1f1
15 8月, 2018 1 次提交
- M
  
  Add print_function for all python files · 99d3f089
  由 minqiyang 提交于 8月 15, 2018
  
  99d3f089
26 7月, 2018 2 次提交
- M
  
  Remove python3 relative import of unittest · 9fc13fde
  由 minqiyang 提交于 7月 26, 2018
  
  9fc13fde
- M
  
  Change iter_parameters back and port unittests code to Python3 · 35e6abd7
  由 minqiyang 提交于 7月 26, 2018
  
  35e6abd7
20 7月, 2018 1 次提交
- Q
  Fix serious bug in nesterov momentum optimizer. (#12231) · 873a50ce
  由 qingqing01 提交于 7月 20, 2018
```
* Fix serious bug in nesterov momentum optimizer.
```
  873a50ce
24 2月, 2018 1 次提交
- L
  
  move Fluid API code out of V2 API code · b11956a0
  由 Luo Tao 提交于 2月 24, 2018
  
  b11956a0
13 2月, 2018 1 次提交

Run Python OP tests in a single Python process to improve test time. (#8362) · cde6241a

由 Xin Pan 提交于 2月 13, 2018

Currently, our tests run with 2 GPUs, the init time is absurdly long:
about 4s for each process.  Currently, we run each OP test on
different processes. This PR:

1. create cmake function py_test_modules which will generate the
Makefile that runs a list of Python unittest module in a single Python
process.

2. move all "python unittest compatible" (e.g., used the unittest
package, not just a regular python file). from fluid/tests to
fluid/tests/unittests.

3. cmake now will run all OP tests in fluid/tests/unittests in a
single process, except the time-consuming tests, they are separated
into different processes to utilize parallelism. Please make sure to
use the unittest package if you put the python test file in
fluid/tests/unittests

4. remove all exit(0) from fluid/tests/unittests/*.py, exit(0) is used
to disable unittest, we can not do it when running all tests in a
single process since it will terminate the process without running the
other tests. Instead, the test is disabled in
fluid/tests/unittests/CMakeLists.txt. FIXME is added for each disabled
item. Please disable the unittest from
fluid/tests/unittests/CMakeLists.txt, instead of adding exit(0) to the
Python file, for all Python file in fluid/tests/unittests/.

5. add an option WITH_FAST_BUNDLE_TEST. When OFF, will run the unit
tests in separate process so that they can be tested individually.

cde6241a

12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
21 1月, 2018 1 次提交

"fix decode bug" (#7711) · e983cc90

由 dzhwinter 提交于 1月 21, 2018

* "fix decode bug"

* "follow commnet"

* "fix error"

* "fix hook bug"

* fix based comment

* fix copyright

* fix based on comment

e983cc90

15 1月, 2018 1 次提交

Feature/hooks (#7513) · b9b75377

由 dzhwinter 提交于 1月 15, 2018

* add copyright hook

* add copyright hook

* refine copyright hook

* "test copyright hook"

* fix check style

* fix ci

b9b75377

14 11月, 2017 1 次提交
- Q
  Change framework to fluid (#5637) · 4adc8a7a
  由 Qiao Longfei 提交于 11月 14, 2017
```
* init commit

* change some dir name
```
  4adc8a7a
10 11月, 2017 1 次提交

Fix attribute naming for momentum_op (#5453) · 2e355f03

由 Siddharth Goyal 提交于 11月 09, 2017

* Fix attribute naming for momentum_op

* Fix minor typo in comment

* Fix attribute name

* Fix names in test_optimizer

* Fix python wrapper

2e355f03

20 10月, 2017 1 次提交
- K
  
  Adding Nesterov Momentum (#4948) · 5380a547
  由 kavyasrinet 提交于 10月 20, 2017
  
  5380a547
06 10月, 2017 1 次提交
- S
  
  Modify implementation · c10da26c
  由 sidgoyal78 提交于 10月 05, 2017
  
  c10da26c
03 10月, 2017 1 次提交
- S
  
  Add momentum operator · d28b3094
  由 sidgoyal78 提交于 10月 02, 2017
  
  d28b3094

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致