- 01 1月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* update doc, test=develop * fix related unittests, test=develop * fix str incompatible error, test=develop
-
- 18 12月, 2019 1 次提交
-
-
由 Huihuang Zheng 提交于
The fixed bugs: 1. The condition sub-graph is not pruned 2. When backward graph is extremely simple, the whole backward ops are pruned.
-
- 10 12月, 2019 1 次提交
-
-
由 mapingshuo 提交于
* add seed op
-
- 06 12月, 2019 1 次提交
-
-
由 Huihuang Zheng 提交于
Add tests to use dy/dx to make sure the gradient values calculated by the control flow backward is correct. Also fixed bugs detected by those tests. Fix bugs: 1. Unlike sum_op, optimizer ops don't allow uninitialized input tensor. But in conditional_block_grad_op, since the conditional_block may not run, the output gradient tensor may be uninitialized, which will cause the optimizer op error. To fix it, we should let optimizer ops support uninitialized input like sum_op or assign the uninitialized gradient to 0 when the conditional_block_grad_op doesn't run. I found there are about 10+ optimizer ops. **To be simpler, I just assign output gradient of the conditional_block_grad_op to 0 in this PR**. But it can be further explored whether we can make optimizer ops like sum_op to support uninitialized input tensor because theoretically we can speed up without the assigning in conditional_block_grad_op. 2. Infer parameter shapes during append_backward. I didn't know that all our parameters are in global block. When op_desc is inferring shapes at the sub-block, it may not know the shape of gradients of parameters whose shape information is at global block. I fixed it by inferring shapes of gradients from forward var. This PR also did some code clean up: 1. Print the var name when sgd_op catches shape error so that it is easier to debug 2. Fix a typo: dicta -> dict
-
- 29 11月, 2019 1 次提交
-
-
由 Huihuang Zheng 提交于
* Commit before merging develop test=develop * Backup after working with Huihuang logs * Commit before deleting Huihuang debug loggings * Commit before debug test=develop * Fix bug commit test=develop * Backup of fixing bugs test=develop * Clean up code test=develop * Fix a bug in sum_op test=develop
-
- 30 10月, 2019 1 次提交
-
-
由 lvmengsi 提交于
* fix_gradients * fix_gradients, test=develop
-
- 19 10月, 2019 1 次提交
-
-
由 Aurelius84 提交于
-
- 13 10月, 2019 1 次提交
-
-
由 liym27 提交于
2. fix bug in backward.py: using fill_constant instead of fill_constant_batch_size_like 3. fix bug in ExpandGradOp. test=develop
-
- 09 10月, 2019 2 次提交
-
-
由 Youwei Song 提交于
* polish append_backward, test=document_fix * test=document_fix, test=develop * test=document_fix, test=develop * polish append_backward, test=document_fix, test=develop
-
由 mapingshuo 提交于
* rm unused ckpt and sort ckpt * use max op idx to sort, test=develop * remove unsed code,test=develop * add testcase, test_develop * modify test case, test=develop
-
- 26 9月, 2019 1 次提交
-
-
由 mapingshuo 提交于
* fix doc of apply_optimize test=document_fix test=document_preview * modify doc of backward test=develop test=document_fix * modify document hash test=develop test=document_preview
-
- 23 9月, 2019 1 次提交
-
-
由 mapingshuo 提交于
* add recompute based checkpoints methods for large batch training test=develop * add append_backward_with_forward_recomputation test=develop * refine optimizer test=develop * update backward and optimizer test=develop * make Variable usable test=develop * add recompute code * refine optimizer test=develop * refine addup _append_backward_ops_with_checkpoints_ 1) for recompute part, just cache the grad_op_desc without appending to block 2) before appending grad_op_desc to backward part, addup_repetitive_vars, remove unused branch test=develop * make method private * add recompute strategy into DistributedStrategy test=develop * checkpoint version3 test=develop * remove some print information test=develop * remove unused sumop test=develop * try to fix recompute with graph building modules * add input names to vars should be held * add memory debug tool * backup backward * Fix bugs * add backward desc for op not in any segments * add exception info for sub_block test=develop * modify code style test=develop * modify code style test=develop * remove print functions test=develop * add API spec test=develop test=document_preview * make Recompute a child class of Optimizer test=develop test=document_preview * add API spec test=develop test=document_preview * modify API spec test=develop test=document_preview * add document for Recompute test=develop test=document_preview * change API doc of Rcompute test=develop test=document_preview * code cleaning test=develop test=document_preview * modify API spec * fix bugs when segments hold no element * add testcase for Recompute Optimizer test=develop test=document_preview * add test for apply_gradient, and code cleaning test=develop test=document_preview * add test case for load function * enable CI test=develop test=document * add test case test=develop test=document_preview * add sample code for 4 function of recompute optimizer test=develop test=document_preview
-
- 11 9月, 2019 1 次提交
-
-
由 Youwei Song 提交于
* update dygraph api-doc and backward api-doc, test=develop * update dygraph api-doc and backward api-doc, update api.spec, test=develop * update dygraph api-doc and backward api-doc, update api.spec, test=develop * update API.spec, test=develop
-
- 26 8月, 2019 1 次提交
-
-
由 chengduo 提交于
* fix optimizer bug test=develop
-
- 24 7月, 2019 1 次提交
-
-
由 chengduo 提交于
* prun backward ops test=develop
-
- 02 7月, 2019 1 次提交
-
-
由 chengduo 提交于
* add not_been_used_vars to no_grad_set test=develop
-
- 01 7月, 2019 1 次提交
-
-
由 xsrobin 提交于
-
- 16 6月, 2019 1 次提交
-
-
由 qingqing01 提交于
* Update backward.py: - If there is no input grad var in all outputs of previous ops, do not append this op into graph. - Only apply this stragety when double backward. * Update some double backward op. * Update sum_op to judge whether a tensor is empty by numel or IsInitialized().
-
- 16 5月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 08 5月, 2019 1 次提交
-
-
由 lujun 提交于
Fix the following API examples: paddle.fluid.scope_guard paddle.fluid.backward.append_backward paddle.fluid.cpu_places paddle.fluid.cuda_pinned_places paddle.fluid.cuda_places paddle.fluid.in_dygraph_mode paddle.fluid.CUDAPlace paddle.fluid.CPUPlace paddle.fluid.CUDAPinnedPlace
-
- 23 4月, 2019 1 次提交
-
-
由 qingqing01 提交于
Support backward of backward for Relu and add a new gradient checker by comparing theoretical and numerical Jacobian. (#16862) * Support backward of backward and a new gradient checker * Rename decorators.py to decorator_helper.py, since Python on Windows CI has decorators package. 1. Add ReluDoubleGradMaker when register relu_grad. 2. Add a new gradient checker by comparing theoretical and numerical Jacobian. Check double gradients by double_grad_check.
-
- 03 4月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* fix some grad op desc maker test=develop * fix grad op desc makers test=develop
-
- 19 12月, 2018 1 次提交
-
-
由 minqiyang 提交于
-
- 18 12月, 2018 1 次提交
-
-
由 Xin Pan 提交于
test=develop
-
- 13 12月, 2018 1 次提交
-
-
由 Xin Pan 提交于
test=develop
-
- 26 9月, 2018 1 次提交
-
-
由 Wu Yi 提交于
* hide operator API * update * update api.spec * fix merge * fix test
-
- 18 9月, 2018 1 次提交
-
-
由 Wu Yi 提交于
* hide program APIs * fix merge error * update
-
- 15 8月, 2018 2 次提交
-
-
由 minqiyang 提交于
-
由 gongweibao 提交于
-
- 14 8月, 2018 3 次提交
- 10 8月, 2018 2 次提交
- 09 8月, 2018 1 次提交
-
-
由 minqiyang 提交于
Fix pb_protobuf2 FromString problem
-
- 07 8月, 2018 1 次提交
-
-
由 minqiyang 提交于
Fix str and bytes problem Fix sorted problem Fix math problem Fix CI problem
-
- 06 8月, 2018 1 次提交
-
- 26 7月, 2018 1 次提交
-
-
由 minqiyang 提交于
-
- 17 7月, 2018 1 次提交
-
-
由 Wu Yi 提交于
* remove block api * remove clone_variable * hide block inner apis * update * fix tests
-
- 11 7月, 2018 1 次提交
-
-
由 yuyang18 提交于
-