Commits · 13d757362c6ba045bb2dace130175e5f9a90870f · PaddlePaddle / Paddle

15 Jan, 2021 · 1 commit

Add Inplace strategy (Output reuse Input Varbase) in dygraph (#30103) · 13d75736

Committed by pangyoki on Jan 15, 2021

* add view strategy on squeeze,unsqueeze,reshape,flatten

* add squeeze unittest

* add unittests

* use View strategy as name rather than Reuse Allocation

* fix view api doc

* fix format

* use core.ops when input of reshape2 is Tensor

* fix test_cross_entropy_loss error because of reshape2

* fix test_cross_entropy_loss error because of reshape2

* add inplace strategy

* add elementwise_add sub

* let backward op not use inplace

* grad op do not use inplace

* fix memory increase error and add leaf error message

* delete selected_rows

* change op_function

* little change

* solve HandleViewBetweenInputAndOutput

* add unittest and leaf error message

* merge view error

* optimize op_function_generator format and support sum inplace op

* fix format of basic_engine

* fix format for framework

* little change of variable wrapper

* add reshape, squeeze, unsqueeze, scatter api

* add relu elu tanh softmax inplace api

* fix test_squeeze_op unittest

* fix test_relu_op unittest

* fix comment problems

* delete sample code of inplace api

* add reference of grad_pending_nodes in basic_engine

* fix unittest name

* add inplace apis into wlist

* fix error message

* add PADDLE_ENFORCE for set grad op twice

* fix head file error

13 Jan, 2021 · 1 commit

Support unused parameters in dynamic graph distributed (#30224) · a60f17b8

Committed by ShenLiang on Jan 13, 2021

11 Jan, 2021 · 1 commit

fix header file paths of gflags, commit 1, test=develop (#30271) · 8ce2482b

Committed by 石晓伟 on Jan 11, 2021

08 Jan, 2021 · 2 commits

Fix dtype of ungenerated grad var (#28511) · 8696335f

Committed by Leo Chen on Jan 08, 2021

* fix dtype of ungenerated grad var

* update ut

* refine code

* set default dtype

* fix could_use_cudnn bug

* remove debug code

* re-implement

* fix bug

Add callback after TensorCopy (#30123) · 1f97d61c

Committed by Leo Chen on Jan 08, 2021

* change to tensor copy sync

* change to tensor copy sync

* make copy_to safe when use TensorCopy

* refine code

* add ut

* add cudapinned garbagecollector

* add testcase: cpu place -> cuda pinned place

07 Jan, 2021 · 1 commit

[Complex] Simplify prepared op impl to improve performance (#30153) · d0fb06b2

Committed by Chen Weihang on Jan 07, 2021

* simplify prepared op impl to improve performance

* fix kunlun compile error

* continue fix kunlun compile error

* only transform diff place when dtype diff

* fix failed unittests

* remove useless file

* polish impl by review comment

05 Jan, 2021 · 1 commit

support dygraph in xpu place (#30051) · 297fff1a

Committed by hong on Jan 05, 2021

* support dygraph in xpu place; test=develop

* fix cpu/gpu compile error; test=develop

* fix compile error; test=develop

* fix xpu compile error; testd=develop

29 Dec, 2020 · 1 commit

support grad accumulated across batch (#29942) · a1d9a14e

Committed by Chen Weihang on Dec 28, 2020

25 Dec, 2020 · 2 commits

[Complex] Handle complex to real after type promotion (#29855) · a6072055

Committed by Chen Weihang on Dec 25, 2020

* try to add fwd op input dtypes

* refactor base impl

* return tmp_ins after dygraph prepare data

* fix typo found in debug

* polish comment & add complex net test

* revert detail change

* fix unittest failed

* add complex kernel condition control

* fix xpu test failed & polish comment

* polish details by review comments

[Complex] Add support for complex grad accumulated (#29889) · 1a304e6c

Committed by Chen Weihang on Dec 25, 2020

* add support for complex grad accumulated

* add unittest for coverage

* update test dtype

* remove useless blank line

22 Dec, 2020 · 2 commits

opt sparse allreduce using ncclgather (#29819) · f65f1caa

Committed by ShenLiang on Dec 22, 2020

Support multi-stream communication for dynamic graph distributed (#29525) · 01e2874a

Committed by ShenLiang on Dec 22, 2020

* fix fleet for multi-stream

* fix memcpy for ncclid

* use sync to solve move operation

09 Dec, 2020 · 2 commits

support deepcopy for Layer/Tensor/Paramerbase (#29387) · e74e1a22

Committed by Zhou Wei on Dec 09, 2020

* support deepcopy for Layer/Tensor/Paramerbase

* fix some code

Rebuild group automatically in dynamic graph distributed (#29255) · 2ef9e0e2

Committed by ShenLiang on Dec 09, 2020

* add tensor_indices in AssignGroupBySize

* add rebuild group in reducer

07 Dec, 2020 · 1 commit

fix that parameters'grad has grad var (#29408) · 24ba9ed4

Committed by Zhou Wei on Dec 07, 2020

03 Dec, 2020 · 2 commits

use has_grad instead of train_mode (#29309) · b58cfff8

Committed by Leo Chen on Dec 03, 2020

* use has_grad instead of train_mode

* add vlog for debug

* fix ut

* fix ut

fix the warning of reducer (#29323) · 696dc4bb

Committed by ShenLiang on Dec 03, 2020

01 Dec, 2020 · 1 commit

accumulate gradient for leaf tensor with previous graph and expose leaf tensor concept (#28429) · c0a991c8

Committed by Zhou Wei on Dec 01, 2020

* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor

* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor

* fix coverage

* fix api doc

* fix CI unittest

* fix CI unittest

* fix unittest

* empty tensor doesn't need inner_var_

* fix some error message

30 Nov, 2020 · 1 commit

Check whether there is any inplace operation affecting gradient calculation. (#27901) · 865a4598

Committed by liym27 on Nov 30, 2020

* Add a class TensorInplaceVersion to count the inplace version and put it in framework::Tensor instead of Allocation or Variable.

* Add a new attribute `_inplace_version` for VarBase.

* Raise exception if an inplace operation can result in incorrect gradient computation.

* Add a new interface _bump_inplace_version() for VarBase to bump the version whenever the Tensor is modified through an inplace operation.

* For api assign, call _bump_inplace_version() when it's an inplace operation in dynamic mode.

* Use original var_wrapper if the inplace_version is not changed.

* Replace SnapshotVarWrapperList with SnapshotVarWrapper to optimize performance.
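
The version-counter idea in the bullets above can be illustrated with a small, framework-independent Python sketch. Every name here (`VersionedTensor`, `SavedForBackward`, `bump_inplace_version`, `square_forward`, `square_backward`) is hypothetical and chosen for illustration; this is not Paddle's implementation, only a toy model of the mechanism: each tensor carries a counter that is bumped on every in-place write, the backward machinery records the counter when it saves a tensor, and it raises if the counter has moved by the time the gradient is computed.

```python
class VersionedTensor:
    """Toy tensor: a list of floats plus an in-place version counter."""

    def __init__(self, values):
        self.values = list(values)
        # Hypothetical analogue of the `_inplace_version` attribute.
        self.inplace_version = 0

    def bump_inplace_version(self):
        """Record that the underlying data was modified in place."""
        self.inplace_version += 1

    def add_(self, delta):
        """In-place add: mutates the data, so the version is bumped."""
        self.values = [v + delta for v in self.values]
        self.bump_inplace_version()
        return self


class SavedForBackward:
    """A tensor saved by a forward op, with the version it had at that time."""

    def __init__(self, tensor):
        self.tensor = tensor
        self.saved_version = tensor.inplace_version

    def get(self):
        # If the version moved, the saved data no longer matches what the
        # forward op saw, so any gradient computed from it would be wrong.
        if self.tensor.inplace_version != self.saved_version:
            raise RuntimeError(
                "tensor was modified in place after being saved for backward "
                f"(saved version {self.saved_version}, "
                f"current version {self.tensor.inplace_version})"
            )
        return self.tensor


def square_forward(x):
    """y = x * x; the gradient needs the original x, so x is saved."""
    saved = SavedForBackward(x)
    y = VersionedTensor(v * v for v in x.values)
    return y, saved


def square_backward(saved, grad_out):
    """dx = 2 * x * dy, using the saved input after a staleness check."""
    x = saved.get()
    return [2.0 * v * g for v, g in zip(x.values, grad_out)]
```

In this sketch, calling `square_backward` right after `square_forward` succeeds, while calling `x.add_(1.0)` in between makes the check in `SavedForBackward.get` raise instead of silently returning a gradient computed from modified data.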

27 Nov, 2020 · 1 commit

Support dynamic graph distributed (#28997) · e2d01eb6

Committed by ShenLiang on Nov 27, 2020

* add reducer

* refine event for memorycopy

* add concat&split for allreduce

* apply concat & split for fuse tensor

* fix nccl dep

* fix the unittest, compile problem and ddp initialize problem

* fix unittest for mac & add some comments & solve the repeated param in sublayers

* fix unittest for windows & fix document

26 Nov, 2020 · 1 commit

Split train_mode and has_grad for tracer (#29064) · 770395cb

Committed by Leo Chen on Nov 26, 2020

* split train_mode and has_grad

* fix format

* fix ci problems

* fix sample code

18 Nov, 2020 · 1 commit

Add basic hook classes for dygraph & implement reduce hook (#28584) · 7eeb99fe

Committed by Chen Weihang on Nov 18, 2020

* add base hook classes and reduce hook impl

* fix constructor typo

* polish comment format

* refactor basic hook class design

* polish design details

16 Nov, 2020 · 1 commit

fix nccl init failed in parallel dygraph mode (#28497) · a24d1868

Committed by danleifeng on Nov 16, 2020

06 Nov, 2020 · 1 commit

Remove selected rows all reduce over height check (#28460) · 155b4f9b

Committed by Chen Weihang on Nov 06, 2020

* remove selected rows all reduce over height check

* polish unittest

05 Nov, 2020 · 1 commit

Add retry for dygraph parallel socket bind (#28404) · c42e6561

Committed by Chen Weihang on Nov 05, 2020

* add retry for dygraph parallel socket bind

* change to loop always

* fix writing error

04 Nov, 2020 · 1 commit

support cuda pinned place (#28416) · 44a476c2

Committed by Leo Chen on Nov 04, 2020

23 Oct, 2020 · 1 commit

use FLAGS_use_mkldnn to prevent unnecessary attrs copy (#28146) · 4ea23307

Committed by lidanqing on Oct 23, 2020

21 Oct, 2020 · 1 commit

dygraph nccl init support host domain name (#28107) · f29fb396

Committed by danleifeng on Oct 21, 2020

* nccl init support hostname and ip; test=develop

13 Oct, 2020 · 1 commit

Refine the format of printing tensor (#27673) · 049696bf

Committed by Leo Chen on Oct 13, 2020

* add summary feature

* refine printing tensor

* add sci_mode

* add sample code

* fix indent error

* fix _format_item

* polish code

* support item indent

* add ut

* set place for ut

* fix py2 issue

* fix ut

28 Sep, 2020 · 1 commit

Add support for mkldnn ops types selection with FLAGS in dygraph (#27482) · 0ecf441a

Committed by arlesniak on Sep 28, 2020

* Add support for mkldnn ops types selection with FLAGS in dygraph

* use regex to match DNNL verbose

* python3 encoding fix

25 Sep, 2020 · 1 commit

Refine error msg in paddle/fluid/imperative (#27521) · a5b32637

Committed by Leo Chen on Sep 25, 2020

* refine err msg

* follow comments

24 Sep, 2020 · 1 commit

use iwyu clean include (#27267) · df43905f

Committed by wanghuancoder on Sep 24, 2020

* use iwyu clean include, test=develop, test=win

* compilation error, test=develop

* fix compilation error2, test=develop

* fix compilation error3, test=develop

* fix compilation error4, test=develop

* fix compilation error5, test=develop

* fix compilation error6, test=develop

* fix compilation error7, test=develop

* fix compilation error8, test=develop

* fix compilation error8, test=develop

* fix compilation error10, test=develop

* fix compilation error11, test=develop

31 Aug, 2020 · 1 commit

Add use of global flag 'use_mkldnn' to layer_helper (#26497) · 885c61f0

Committed by arlesniak on Aug 31, 2020

* get use of global 'use_mkldnn' in layer_helper

* update for CI

* update for CI, relu test

* update for CI, relu test added, make FLAGS_use_mkldnn a public flag

* added more strict tests, fixes after review

* fixes after review

* fixes after review, CI stuff

28 Aug, 2020 · 2 commits

Remove `sorted_sum_gradient_` from BasicEngine and PartialGradTask. (#26766) · f32ae272

Committed by Zhen Wang on Aug 28, 2020

Use `Tensor` instead of `Variable` in the doc of paddle.grad.

Update the demo code and the doc of varbase.backward. (#26506) · f9066e6a

Committed by Zhen Wang on Aug 28, 2020

* update the demo code and the doc of varbase.backward.

* update the doc of the fake interface `paddle.fluid.Variable`.

* remove BackwardStrategy.

21 Aug, 2020 · 1 commit

support Baidu Kunlun AI Accelerator (#25959) · 138ecf24

Committed by QingshuChen on Aug 21, 2020

* support Baidu AI Accelerator
  * test=kunlun

* minor
  * test=kunlun

* support xpu op in separate file
  * test=kunlun

* update XPU error message and remove duplicated code
  * test=kunlun

* minor
  * test=kunlun

* minor
  * test=kunlun

18 Aug, 2020 · 1 commit

Enable mkldnn layout conversion (#25778) · 69742bd9

Committed by Sylwester Fraczek on Aug 18, 2020

* enable mkldnn layout conversion

* review fix: remove tmp_place

* fix test mkldnn swish

* add UT for PrepareData CPU->MKLDNN

* add #ifdef PADDLE_WITH_MKLDNN

* Force-push commit
Co-authored-by: Ngrygielski <adam.grygielski@gmail.com>

13 Aug, 2020 · 2 commits

[OpDevOptimize] Add common infershape functions (#26096) · ffe52b44

Committed by Leo Chen on Aug 13, 2020

* add unchanged infershape function

* add broadcast infershape function

* fix bug

* rename infershape functions

* add UnaryOpUnchangedInferShapeCheckAxis

* add error message

* add test for common infer shape functions

* dont update existed ops

* dont update op_desc.h

* add more test

* add error check, refine error message

Feature/Enable Auto-Mixed-Precision in dynamic graph (#24903) · 2d95280e

Committed by Leo Chen on Aug 13, 2020

* add auto_cast, test=develop

* add loss scaler, test=develop

* add comments, test=develop

* refine code, test=develop

* refine code, test=develop

* do not set flags automatically, test=develop

* fix custom op bug, test=develop

* add more test, test=develop

* refine enable logic, test=develop

* enable amp test with GPU, test=develop

* add unittest

* add test for found_inf

* follow comments

* follow comments

* remove global variable, use singleton

* add some notes

* update comments

* update comments

* update comments

* add use_dynamic_loss_scaling argument

* refine found_inf

* refine found_inf

11 Aug, 2020 · 1 commit

add more error info for these ops without double grad ops. (#25987) · a86e8c0e

Committed by Zhen Wang on Aug 11, 2020

PaddlePaddle / Paddle: last synced successfully about 1 year ago
