提交 · b339dff2319b0bbcef467a352cace044acbe72ae · BaiXuePrincess / Paddle

04 1月, 2020 1 次提交

control flow: support optimizer called (#21851) · 7d8d4599

由 liym27 提交于 1月 04, 2020

* append optimize op in the grad block of current block if current block is in control flow. test=develop

* add conditional grad op when optimizer used in control flow. test=develop

* add comment and modify typo. test=develop

* fix append_backward to support control flow. test=develop

* add test. test=develop

* fix copy_var_to_parent_block and conditional_block_grad. test=develop

* fix bug: revert to append conditional_block_grad vars to sub grad block. test=develop

* fix bug: revert to assign var to parent block even if var already is in parent block

* fix bug: consider outputs is empty. test=develop

* move _rename_grad_ out. test=develop

* modify code according to reviews from Huihuang. test=develop

* modify code according to reviews from Jinle. test=develop

7d8d4599

01 1月, 2020 1 次提交
- C
  Uniform append_backward & gradients parameter_list type to Variable (#21938) · 9a2204ee
  由 Chen Weihang 提交于 1月 01, 2020
```
* update doc, test=develop

* fix related unittests, test=develop

* fix str incompatible error, test=develop
```
  9a2204ee
18 12月, 2019 1 次提交

Fix Backward Bugs in Conditional Block (#21809) · 557bce77

由 Huihuang Zheng 提交于 12月 18, 2019

The fixed bugs:

1. The condition sub-graph is not pruned
2. When backward graph is extremely simple, the whole backward ops are pruned.

557bce77

10 12月, 2019 1 次提交
- M
  Dropout with seed (#21590) · e2d849b9
  由 mapingshuo 提交于 12月 10, 2019
```
* add seed op
```
  e2d849b9
06 12月, 2019 1 次提交

Add Much Complex Test and Fix Bugs for Control Flow cond API (#21532) · 1dcf6a72

由 Huihuang Zheng 提交于 12月 06, 2019

Add tests to use dy/dx to make sure the gradient values calculated by the control flow backward is correct. Also fixed bugs detected by those tests.

Fix bugs:

1. Unlike sum_op, optimizer ops don't allow uninitialized input tensor. But in conditional_block_grad_op, since the conditional_block may not run, the output gradient tensor may be uninitialized, which will cause the optimizer op error. To fix it, we should let optimizer ops support uninitialized input like sum_op or assign the uninitialized gradient to 0 when the conditional_block_grad_op doesn't run. I found there are about 10+ optimizer ops. **To be simpler, I just assign output gradient of the conditional_block_grad_op to 0 in this PR**. But it can be further explored whether we can make optimizer ops like sum_op to support uninitialized input tensor because theoretically we can speed up without the assigning in conditional_block_grad_op.

2. Infer parameter shapes during append_backward. I didn't know that all our parameters are in global block. When op_desc is inferring shapes at the sub-block, it may not know the shape of gradients of parameters whose shape information is at global block. I fixed it by inferring shapes of gradients from forward var.

This PR also did some code clean up:
1. Print the var name when sgd_op catches shape error so that it is easier to debug
2. Fix a typo: dicta -> dict

1dcf6a72

29 11月, 2019 1 次提交

Fix Cond Bug for Nested Control Flow (#21340) · 630be319

由 Huihuang Zheng 提交于 11月 29, 2019

* Commit before merging develop

test=develop

* Backup after working with Huihuang logs

* Commit before deleting Huihuang debug loggings

* Commit before debug

test=develop

* Fix bug commit

test=develop

* Backup of fixing bugs

test=develop

* Clean up code

test=develop

* Fix a bug in sum_op

test=develop

630be319

30 10月, 2019 1 次提交
- L
  Fix gradients (#20857) · aadd81b6
  由 lvmengsi 提交于 10月 30, 2019
```
* fix_gradients

* fix_gradients, test=develop
```
  aadd81b6
19 10月, 2019 1 次提交
- A
  
  fix fill_constant shape with -1 and enhance cross_entropy test=develop (#20722) · 74a28f5e
  由 Aurelius84 提交于 10月 19, 2019
  
  74a28f5e
13 10月, 2019 1 次提交

fill_constant support Tensor; (#20521) · fc6ec3b9

由 liym27 提交于 10月 13, 2019

2. fix bug in backward.py: using fill_constant instead of fill_constant_batch_size_like
3. fix bug in ExpandGradOp.

test=develop

fc6ec3b9

09 10月, 2019 2 次提交

polish append_backward en doc (#20199) · 478e4d68

由 Youwei Song 提交于 10月 09, 2019

* polish append_backward, test=document_fix

* test=document_fix, test=develop

* test=document_fix, test=develop

* polish append_backward, test=document_fix, test=develop

478e4d68

RecomputeOptimizer: rm unused ckpt and sort ckpt (#20108) · 90be481b

由 mapingshuo 提交于 10月 09, 2019

* rm unused ckpt and sort ckpt

* use max op idx to sort, test=develop

* remove unsed code,test=develop

* add testcase, test_develop

* modify test case, test=develop

90be481b

26 9月, 2019 1 次提交

fix doc of apply_optimize (#19965) · d62360fe

由 mapingshuo 提交于 9月 26, 2019

* fix doc of apply_optimize
test=document_fix
test=document_preview

* modify doc of backward
test=develop
test=document_fix

* modify document hash
test=develop
test=document_preview

d62360fe

23 9月, 2019 1 次提交

Forward recompute3 (#19913) · 9901f696

由 mapingshuo 提交于 9月 23, 2019

* add recompute based checkpoints methods for large batch training
test=develop

* add append_backward_with_forward_recomputation
test=develop

* refine optimizer
test=develop

* update backward and optimizer
test=develop

* make Variable usable
test=develop

* add recompute code

* refine optimizer
test=develop

* refine addup _append_backward_ops_with_checkpoints_
1) for recompute part, just cache the grad_op_desc without appending to block
2) before appending grad_op_desc to backward part, addup_repetitive_vars, remove unused branch
test=develop

* make method private

* add recompute strategy into DistributedStrategy
test=develop

* checkpoint version3
test=develop

* remove some print information
test=develop

* remove unused sumop
test=develop

* try to fix recompute with graph building modules

* add input names to vars should be held

* add memory debug tool

* backup backward

* Fix bugs

* add backward desc for op not in any segments

* add exception info for sub_block

test=develop

* modify code style

test=develop

* modify code style

test=develop

* remove print functions

test=develop

* add API spec

test=develop
test=document_preview

* make Recompute a child class of Optimizer

test=develop
test=document_preview

* add API spec

test=develop
test=document_preview

* modify API spec

test=develop
test=document_preview

* add document for Recompute

test=develop
test=document_preview

* change API doc of Rcompute

test=develop
test=document_preview

* code cleaning

test=develop
test=document_preview

* modify API spec

* fix bugs when segments hold no element

* add testcase for Recompute Optimizer

test=develop
test=document_preview

* add test for apply_gradient, and code cleaning

test=develop
test=document_preview

* add test case for load function

* enable CI

test=develop
test=document

* add test case

test=develop
test=document_preview

* add sample code for 4 function of recompute optimizer

test=develop
test=document_preview

9901f696

11 9月, 2019 1 次提交

fix api-doc error for dygraph and backward (#19721) · 3e5fb636

由 Youwei Song 提交于 9月 11, 2019

* update dygraph api-doc and backward api-doc, test=develop

* update dygraph api-doc and backward api-doc, update api.spec, test=develop

* update dygraph api-doc and backward api-doc, update api.spec, test=develop

* update API.spec, test=develop

3e5fb636

26 8月, 2019 1 次提交
- C
  Fix optimizer bug (#19410) · bfb6ac81
  由 chengduo 提交于 8月 26, 2019
```
* fix optimizer bug
test=develop
```
  bfb6ac81
24 7月, 2019 1 次提交
- C
  Enhance backward process (#18700) · 8259f141
  由 chengduo 提交于 7月 24, 2019
```
* prun backward ops
test=develop
```
  8259f141
02 7月, 2019 1 次提交
- C
  Add find_no_grad_vars in backward.py (#17942) · e0d8c6ac
  由 chengduo 提交于 7月 02, 2019
```
* add not_been_used_vars to no_grad_set
test=develop
```
  e0d8c6ac
01 7月, 2019 1 次提交
- X
  
  add "import paddle.fluid as fluid" to examples lack of it · 47e2ef38
  由 xsrobin 提交于 7月 01, 2019
  
  47e2ef38
16 6月, 2019 1 次提交

Update backward appending stragety to support double backward and fix some bug. (#18104) · 80d2e66f

由 qingqing01 提交于 6月 16, 2019

* Update backward.py:
     - If there is no input grad var in all outputs of previous ops, do not append this op into graph.
     - Only apply this stragety when double backward.
* Update some double backward op.
* Update sum_op to judge whether a tensor is empty by numel or IsInitialized().

80d2e66f

16 5月, 2019 1 次提交
- Z
  
  fix recurrent_op,test=develop (#17433) · 712bfb17
  由 Zeng Jinle 提交于 5月 16, 2019
  
  712bfb17
08 5月, 2019 1 次提交

Repair api example (#17221) · e388a1fb

由 lujun 提交于 5月 08, 2019

Fix the following API examples:

paddle.fluid.scope_guard
paddle.fluid.backward.append_backward
paddle.fluid.cpu_places
paddle.fluid.cuda_pinned_places
paddle.fluid.cuda_places
paddle.fluid.in_dygraph_mode
paddle.fluid.CUDAPlace
paddle.fluid.CPUPlace
paddle.fluid.CUDAPinnedPlace

e388a1fb

23 4月, 2019 1 次提交

Support backward of backward for Relu and add a new gradient checker by... · c1c2633a

由 qingqing01 提交于 4月 23, 2019

Support backward of backward for Relu and add a new gradient checker by comparing theoretical and numerical Jacobian. (#16862)

* Support backward of backward and a new gradient checker
* Rename decorators.py to decorator_helper.py, since Python on Windows CI has decorators package.

1. Add ReluDoubleGradMaker when register relu_grad.
2. Add a new gradient checker by comparing theoretical and numerical Jacobian.  Check double gradients by double_grad_check.

c1c2633a

03 4月, 2019 1 次提交
- Z
  Fix some grad op desc makers (#16633) · 1c526e1d
  由 Zeng Jinle 提交于 4月 02, 2019
```
* fix some grad op desc maker
test=develop

* fix grad op desc makers
test=develop
```
  1c526e1d
19 12月, 2018 1 次提交
- M
  
  Shameless copy · 8d88c5a8
  由 minqiyang 提交于 12月 19, 2018
  
  8d88c5a8
18 12月, 2018 1 次提交
- X
  MLP forward backward · 63240326
  由 Xin Pan 提交于 12月 13, 2018
```
test=develop
```
  63240326
13 12月, 2018 1 次提交
- X
  clean parallel do · 47ea2534
  由 Xin Pan 提交于 12月 13, 2018
```
test=develop
```
  47ea2534
26 9月, 2018 1 次提交
- W
  hide operator API (#12543) · 16e73e0d
  由 Wu Yi 提交于 9月 26, 2018
```
* hide operator API

* update

* update api.spec

* fix merge

* fix test
```
  16e73e0d
18 9月, 2018 1 次提交
- W
  Hide program APIs (#12315) · efafc72f
  由 Wu Yi 提交于 9月 18, 2018
```
* hide program APIs

* fix merge error

* update
```
  efafc72f
15 8月, 2018 2 次提交
- M
  
  Add print_function for all python files · 99d3f089
  由 minqiyang 提交于 8月 15, 2018
  
  99d3f089
- G
  
  Fix clone() bug. (#12583) · 842fb021
  由 gongweibao 提交于 8月 15, 2018
  
  842fb021
14 8月, 2018 3 次提交
- M
  
  Move compat module to python/paddle · e0d5f8a8
  由 minqiyang 提交于 8月 14, 2018
  
  e0d5f8a8
- M
  
  Polish code style · 5338417b
  由 minqiyang 提交于 8月 14, 2018
  
  5338417b
- M
  
  Polish code · ae39709e
  由 minqiyang 提交于 8月 14, 2018
  
  ae39709e
10 8月, 2018 2 次提交
- M
  
  Fix six.iteritems problem · 5d4238cd
  由 minqiyang 提交于 8月 10, 2018
  
  5d4238cd
- M
  
  Replace items() with six.moves.iteritems() to improve memory usage · 6dc07e7f
  由 minqiyang 提交于 8月 10, 2018
  
  6dc07e7f
09 8月, 2018 1 次提交
- M
  Fix divide problem in CI · c3fdf3ae
  由 minqiyang 提交于 8月 09, 2018
```
Fix pb_protobuf2 FromString problem
```
  c3fdf3ae
07 8月, 2018 1 次提交

Fix pybind11 problem · 6abe819f

由 minqiyang 提交于 8月 07, 2018

Fix str and bytes problem
Fix sorted problem
Fix math problem
Fix CI problem

6abe819f

06 8月, 2018 1 次提交
- Y
  Do not set loss@Grad as persistable · 55ff03b7
  由 Yu Yang 提交于 8月 06, 2018
```
Revert part of a3ca4c99
```
  55ff03b7
26 7月, 2018 1 次提交
- M
  
  Apply 2to3 to current paddle main python code · 559d3632
  由 minqiyang 提交于 7月 26, 2018
  
  559d3632
17 7月, 2018 1 次提交

Remove block api (#12107) · db67d60e

由 Wu Yi 提交于 7月 17, 2018

* remove block api

* remove clone_variable

* hide block inner apis

* update

* fix tests

db67d60e

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致