提交 · 797bd40d093189ce3c9f24fcd0f59bbe2878b2ca · 机器未来 / Paddle

20 10月, 2021 1 次提交

[Auto Parallel] Generalization for Partition and Completion (#35735) · 797bd40d

由 JZ-LIANG 提交于 10月 20, 2021

* default dist op

* add dist_attr for dist op

* add unitest

* update inputname

* update function name

* add unitest

* update CMakeLists.txt for CI

* fix dis_matmul

* fix compile error

* update matmul to matmul_v2

* unify api

* unify api

* todo

* update distop forward func

* update distop forward func

* auto parallel backward

* update dist op

* autoparallel backward

* add backward for embedding

* temp1

* temp2

* temp3

* temp4

* backward done1

* backward done2

* backward done3

* dist embedding remove mp mode

* dist matmul remove mp mode

* update dist embedding
『

* dist op init1

* dist op init 2

* update unitest

* context remove parallel mode

* partitioner remove parallel mode

* update unitest

* a more general method to support varying mesh in pipeline parallel

* support varying mesh in pipeline parallel

* embedding support varying mesh in pipeline parallel

* matmul support varying mesh in pipeline parallel

* default dist op support varying mesh in pipeline parallel

* dist attribute for startup program

* default dist op support varying mesh in pipeline parallel 2

* partitoner support varying mesh in pipeline parallel

* revise logic for auto compeletion

* revise framework.py

* revise reshard unitest

* revise unitest for parallelize

* chmod

* fixed bug for dist embedding name mapping
Co-authored-by: Nzhaoyingli <zhaoyingli@baidu.com>

797bd40d

19 10月, 2021 1 次提交
- W
  
  [hybrid] static model parallel dropout support deterministic RandomSeedGenerator (#36228) · 8cc8e411
  由 WangXi 提交于 10月 19, 2021
  
  8cc8e411
13 10月, 2021 1 次提交

[New Feature] Support triple grad in Paddle (#36187) · 2c44ee7e

由 Jiabin Yang 提交于 10月 13, 2021

* native commit for triple grad of sigmod

* Updated unittests files

* init functional jacobian api

* Updated trible_test func

* Updated gradient_checker & test_script

* finish test with dtype float32

* add float64 test case

* polish code

* use atol=1e-5 with dtype float64

* fix for ci

* set timeout for test_jacobian

* fix dygraph grad to support high differential

* polish API docstring

* Updated gradient checker and some related files

* fix double grad strip error for high differential

* fix double grad strip error for high differential

* Add Sigmoid triple grad tests

* fix dygraph double grad dtype error when calling for high differential senario

* Updated triple grad teses func

* Use np.random to initialize ddx

* Updated triple_grad_check func

* add todo for gradient checker and refine some comments

* remove additional code

* add test for warnging in backward.py

* format python code
Co-authored-by: Nveyron95 <veyron_wu@163.com>
Co-authored-by: Nlevi131 <limaolin01@baidu.com>

2c44ee7e

28 9月, 2021 1 次提交

[hybrid] seed and dropout op support force-cpu (#35820) · 58c8f6b3

由 xiayanming 提交于 9月 28, 2021

* [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid

* [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid

* [HIP] fix op not support AMD GPU bug

* [hybrid] seed and dropout op support force-cpu

* [hybrid] seed and dropout op support force-cpu

* [hybrid] seed and dropout op support force-cpu

* [hybrid] seed and dropout op support force-cpu

* [hybrid] seed and dropout op support force-cpu

* [hybrid] fix seed ci failed issue

* add AsExtra for force_cpu of seed op

58c8f6b3

05 8月, 2021 1 次提交
- W
  
  optimize pipeline performance with recompute and amp, test=allcase (#34519) · 911c8593
  由 WangXi 提交于 8月 05, 2021
  
  911c8593
04 8月, 2021 1 次提交

Add gradient with optimizer API (#34395) · d9e63a81

由 chentianyu03 提交于 8月 04, 2021

* add gradients_with_optimizer api

* modify gradients_with_optimizer

* add gradients_with_optimizer api into paddle.auto.backward_mode

* add gradients_with_optimizer test case

* add doc for gradients_with_optimizer

* add doc for gradients_with_optimizer

d9e63a81

14 7月, 2021 1 次提交
- S
  
  [Hybrid Parallel]add op_device in seed op for recompute · 52c1a950
  由 ShenLiang 提交于 7月 14, 2021
  
  52c1a950
05 7月, 2021 1 次提交
- W
  
  optimize grad add device (#33946) · 75d247b7
  由 WangXi 提交于 7月 05, 2021
  
  75d247b7
02 7月, 2021 1 次提交
- W
  
  fix shared param grad_add op_device is null (#33875) · cf4c6fb4
  由 WangXi 提交于 7月 02, 2021
  
  cf4c6fb4
09 6月, 2021 1 次提交
- W
  cache core.globals() to speed up dynamic graph (#32098) · b4954ce4
  由 wanghuancoder 提交于 6月 09, 2021
```
* modify API nn.Bilinear's doc, test=develop
```
  b4954ce4
26 4月, 2021 1 次提交
- X
  [2.1 API] Modified params of some APIs to support tuple and list. (#32528) · 400c3aa7
  由 xiemoyuan 提交于 4月 26, 2021
```
* Modified params of some APIs to support tuple and list.

* fixed bug.
```
  400c3aa7
07 4月, 2021 1 次提交
- J
  
  [3D-parallelism] Hybrid Model Parallelism (#32074) · 1e60a0c4
  由 JZ-LIANG 提交于 4月 07, 2021
  
  1e60a0c4
02 4月, 2021 1 次提交
- J
  
  [3D-Parallel:Sharding] Optimizations for supporting ERNIE 3.0 training (#31884) · 69c874fd
  由 JZ-LIANG 提交于 4月 02, 2021
  
  69c874fd
12 1月, 2021 1 次提交
- J
  
  Recompute Offload (#30233) · 75936d83
  由 JZ-LIANG 提交于 1月 12, 2021
  
  75936d83
24 12月, 2020 1 次提交

[Feature] one ps (3/4) (#29604) · 032414ca

由 tangwei12 提交于 12月 24, 2020

* oneps (3/4)
Co-authored-by: NMrChengmo <cmchengmo@163.com>
Co-authored-by: Nmalin10 <malin10@baidu.com>
Co-authored-by: Nchengmo <chengmo@baidu.com>

032414ca

26 11月, 2020 1 次提交

Add static_only decorator for static apis (#29015) · d0129fcd

由 Chen Weihang 提交于 11月 26, 2020

* add static_only for static api

* addd static_only for class init

* remove static_only for default_main_program

* remove creater_parameter & startup_program

* remove failed apis

* revert py_func import

* remove global scope

* remove some api

* remove cuda pinned place

d0129fcd

14 10月, 2020 1 次提交
- Y
  
  Update all the examples which use paddle.static.nn.fc. (#27904) · b301adc9
  由 Yiqun Liu 提交于 10月 14, 2020
  
  b301adc9
28 9月, 2020 1 次提交

[API 2.0]Migrate api example for gradients/append_backward/program_guard (#27570) · 7c516240

由 Aurelius84 提交于 9月 28, 2020

* modify sample code

* variable -> tensor

* migrate program_guard sample code

* refine error message

* migrate program_guard

* refine comment style

* fix indent

7c516240

21 9月, 2020 1 次提交

[Feature] Enhance inplace addto strategy for gradient accumulation in static graph (#27112) · aba759ba

由 Leo Chen 提交于 9月 21, 2020

* support use add instead of sum to do gradient accumulation

* add inplace addto pass

* add grad_add op and inplace addto pass

* remove debug code

* code refine

* fix bug when sereral sum ops inserts at same op_idx

* fix Flags type

* add addto attribute for conv3d

* fix ut

* code clean

* fix type

aba759ba

11 9月, 2020 1 次提交
- A
  fix unused var with zero gradient bug in fluid.gradient (#27246) · 20a84820
  由 Aurelius84 提交于 9月 11, 2020
```
* fix calcu_gradients

* fix code place

* fix embedding interface usage
```
  20a84820
13 7月, 2020 1 次提交

[while grad]Support pruning op in find_op_path about while sub-block when... · 435fc4f0

由 liym27 提交于 7月 13, 2020

[while grad]Support pruning op in find_op_path about while sub-block when appending backward (#25330)

Prune OPs which are not related with loss in while sub-block when constructing backward OP path.

435fc4f0

14 5月, 2020 1 次提交

English API Docs Optimization Part 1 (#24536) · 86ca31ab

由 Cindy Cai 提交于 5月 14, 2020

* test=develop, test=document_fix

* test=develop, test=document_fix
Co-authored-by: Nswtkiwi <1208425345@qq.com>

86ca31ab

30 4月, 2020 1 次提交

Fix double_grad bug in statig-graph (#24190) · 84cf5db8

由 qingqing01 提交于 4月 30, 2020

* Rename internal gradient variables in multiple backward
* so that they have different names with previous backward
* For example:
*  y = x * x, grad = fluid.gradients(fluid.gradients(y, x) + y * y, x)
* In second-time backward, gradient variable names of partial
* forward network (y * y) may be have same names with first-time
* fluid.gradients(y, x).

test=develop

84cf5db8

15 4月, 2020 1 次提交
- M
  fix AMP and recompute (#23551) · f0e743f1
  由 mapingshuo 提交于 4月 15, 2020
```
* allow amp and recompute working together
```
  f0e743f1
10 4月, 2020 1 次提交

API(append_backward) error message enhancement (#23446) · 2ca5801d

由 Aurelius84 提交于 4月 10, 2020

* API/OP (append_backward) error message enhancement test=develop

* polish check_type test=develop

* fix unittest failed test=develop

* merge develop test=develop

2ca5801d

09 4月, 2020 1 次提交
- A
  API(fluid.gridents) error message enhancement (#23450) · fab9464f
  由 Aurelius84 提交于 4月 09, 2020
```
* API(fluid.gridents) error message enhancement test=develop

* fix unitest failed test=develop
```
  fab9464f
20 3月, 2020 1 次提交

Add dygraph double grad implementation (#22939) · a31d7328

由 Zeng Jinle 提交于 3月 20, 2020

* add double grad implementation for dygraph, test=develop

* polish code, add uts, test=develop

* fix place bug, test=develop

* polish codes, add more uts for coverages, test=develop

* add no_grad_set, test=develop

* add star gan ut, test=develop

* follow comments, test=develop

a31d7328

19 3月, 2020 1 次提交
- Z
  
  add op_device attr for backward op_desc, test=develop (#23062) · 3f371db8
  由 Zhang Ting 提交于 3月 19, 2020
  
  3f371db8
17 3月, 2020 1 次提交
- Z
  
  set op_device for loss_op_desc (#23027) · eec10aab
  由 Zhang Ting 提交于 3月 17, 2020
  
  eec10aab
03 3月, 2020 1 次提交
- Z
  add fluid.device_guard to specify the device type for Op (#22254) · 4e8bc024
  由 Zhang Ting 提交于 3月 03, 2020
```
* add fluid.device_guard to specify the device type for Op
```
  4e8bc024
23 2月, 2020 1 次提交
- T
  
  fix typo words (#22653) · d2ba91aa
  由 tianshuo78520a 提交于 2月 23, 2020
  
  d2ba91aa
10 2月, 2020 1 次提交
- G
  
  Fix the leaving out of rnn_memory_helper_grad's output vars. test=develop (#22499) · e7bbad6c
  由 Guo Sheng 提交于 2月 10, 2020
  
  e7bbad6c
07 2月, 2020 1 次提交

polish no_grad_set of gradient and append_backward (#22440) · 50af6b5d

由 Aurelius84 提交于 2月 07, 2020

* polish backward api doc test=develop, test=document_preview,
       test=document_fix

* polish backward api doc test=develop, test=document_preview, test=document_fix

* no_grad supports set of Variable test=develop, test=document_preview

* polish sample code of append_backward test=develop, test=document_preview

* modify assert into Raise TypeError test=develop,test=document_preview

* fix unittest failed test=develop

* rm useless file test=develop

* polish en doc test=develop

* polish code of no_grad_set test=develop

* polish code of no_grad_set test=develop

50af6b5d

20 1月, 2020 1 次提交

Polish backward.py to prune more ops (#22246) · 039bb505

由 Zeng Jinle 提交于 1月 19, 2020

* polish backward prune, test=develop

* fix control flow op bug, test=develop

* add some unittests, test=develop

* fix unittest args, test=develop

* follow huihuang's comments, test=develop

039bb505

16 1月, 2020 1 次提交
- Z
  
  fix typo in error message (#22312) · 805328e1
  由 zhangchunle 提交于 1月 16, 2020
  
  805328e1
04 1月, 2020 1 次提交

control flow: support optimizer called (#21851) · 7d8d4599

由 liym27 提交于 1月 04, 2020

* append optimize op in the grad block of current block if current block is in control flow. test=develop

* add conditional grad op when optimizer used in control flow. test=develop

* add comment and modify typo. test=develop

* fix append_backward to support control flow. test=develop

* add test. test=develop

* fix copy_var_to_parent_block and conditional_block_grad. test=develop

* fix bug: revert to append conditional_block_grad vars to sub grad block. test=develop

* fix bug: revert to assign var to parent block even if var already is in parent block

* fix bug: consider outputs is empty. test=develop

* move _rename_grad_ out. test=develop

* modify code according to reviews from Huihuang. test=develop

* modify code according to reviews from Jinle. test=develop

7d8d4599

01 1月, 2020 1 次提交
- C
  Uniform append_backward & gradients parameter_list type to Variable (#21938) · 9a2204ee
  由 Chen Weihang 提交于 1月 01, 2020
```
* update doc, test=develop

* fix related unittests, test=develop

* fix str incompatible error, test=develop
```
  9a2204ee
18 12月, 2019 1 次提交

Fix Backward Bugs in Conditional Block (#21809) · 557bce77

由 Huihuang Zheng 提交于 12月 18, 2019

The fixed bugs:

1. The condition sub-graph is not pruned
2. When backward graph is extremely simple, the whole backward ops are pruned.

557bce77

10 12月, 2019 1 次提交
- M
  Dropout with seed (#21590) · e2d849b9
  由 mapingshuo 提交于 12月 10, 2019
```
* add seed op
```
  e2d849b9
06 12月, 2019 1 次提交

Add Much Complex Test and Fix Bugs for Control Flow cond API (#21532) · 1dcf6a72

由 Huihuang Zheng 提交于 12月 06, 2019

Add tests to use dy/dx to make sure the gradient values calculated by the control flow backward is correct. Also fixed bugs detected by those tests.

Fix bugs:

1. Unlike sum_op, optimizer ops don't allow uninitialized input tensor. But in conditional_block_grad_op, since the conditional_block may not run, the output gradient tensor may be uninitialized, which will cause the optimizer op error. To fix it, we should let optimizer ops support uninitialized input like sum_op or assign the uninitialized gradient to 0 when the conditional_block_grad_op doesn't run. I found there are about 10+ optimizer ops. **To be simpler, I just assign output gradient of the conditional_block_grad_op to 0 in this PR**. But it can be further explored whether we can make optimizer ops like sum_op to support uninitialized input tensor because theoretically we can speed up without the assigning in conditional_block_grad_op.

2. Infer parameter shapes during append_backward. I didn't know that all our parameters are in global block. When op_desc is inferring shapes at the sub-block, it may not know the shape of gradients of parameters whose shape information is at global block. I fixed it by inferring shapes of gradients from forward var.

This PR also did some code clean up:
1. Print the var name when sgd_op catches shape error so that it is easier to debug
2. Fix a typo: dicta -> dict

1dcf6a72

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致