Commits · 1412d3bc2f731f33279c27b229e84b804e247760 · Crayon鑫 / Paddle

24 Jun, 2021 1 commit

[NPU] support dygraph execution on npu place (#33579) · 6aea6be2

Committed by houj04 on Jun 24, 2021

* in NPU environment, use CPUPlace for missing operators.

* in NPU environment, use CPUPlace for missing operators.

* fix TensorCopy bug and add unit test.

* fix code style.

* add more unit tests.

6aea6be2

15 Jun, 2021 1 commit

support convert core.Tensor to paddle.Tensor (#33430) · b7a54fc1

Committed by Zhou Wei on Jun 15, 2021

b7a54fc1
11 Jun, 2021 1 commit

use PYTHON_C_API in dygraph (#32524) · 08e81475

Committed by wanghuancoder on Jun 11, 2021
```
* use PYTHON_C_API in dygraph, test=develop
```
  08e81475
19 May, 2021 1 commit

add enforce check for set_value (#32972) · af89a943

Committed by Chen Weihang on May 19, 2021

af89a943
13 May, 2021 1 commit

add varbase_copy support CUDAPinnedPlace (#32883) · 48fc16f2

Committed by chentianyu03 on May 13, 2021

48fc16f2
12 May, 2021 1 commit

add varbasecopy func to fix the ParamBase type bug in layers.to API (#32789) · 067f558c

Committed by chentianyu03 on May 12, 2021

* add varbasecopy func to fix the ParamBase type bug in layers.to API

* overload _copy_to func for ParamBase

* add xpuplace

* add waiting varbasecopy completion when not blocking

* fix dst_device bug

* modify varbase to shared_ptr

067f558c

30 Apr, 2021 1 commit

add API Tensor.item() to convert Tensor element to a Python scalar (#32561) · 7e2b60a4

Committed by Zhou Wei on Apr 30, 2021

7e2b60a4
26 Apr, 2021 1 commit

[AMP] Autocast to fp32 for op has no fp16 kernel (#32543) · d2b31a14

Committed by Leo Chen on Apr 26, 2021
```
* skip op has no fp16 kernel

* add ut
```
  d2b31a14
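
The fallback named in this commit title can be illustrated with a small, framework-free sketch. Everything here is an assumption made for illustration: the `KERNELS` registry, its op entries, and the `autocast_dtype` helper are hypothetical and are not Paddle's AMP code, which also consults allow and block lists that are not modeled here.

```python
# Hypothetical kernel registry: op name -> dtypes that have a registered kernel.
KERNELS = {
    "matmul": {"float16", "float32"},
    "softmax": {"float16", "float32"},
    "cumsum": {"float32"},  # no fp16 kernel in this toy registry
}


def autocast_dtype(op_name, amp_enabled=True):
    """Pick the compute dtype for an op under mixed precision."""
    if not amp_enabled:
        return "float32"
    supported = KERNELS.get(op_name, set())
    # Use fp16 only when a kernel exists; otherwise fall back to fp32.
    return "float16" if "float16" in supported else "float32"


# Compute dtype chosen for every op in the registry.
plan = {op: autocast_dtype(op) for op in KERNELS}
```

Ops that are missing from the registry also resolve to `float32`, which keeps the fallback conservative.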
25 Apr, 2021 2 commits

[slice] Support index is Tensor for slice in dynamic mode (#32435) · aceec7fb

Committed by liym27 on Apr 25, 2021

aceec7fb

[Setitem] Support grad computation of op set_value (#32431) · 25e723e7

Committed by liym27 on Apr 25, 2021

25e723e7
15 Apr, 2021 1 commit

Customizable Python Layer in Dygraph (#32130) · 29f65225

Committed by WeiXin on Apr 14, 2021

* custom python backward

* polish up the code

* polish up the code

* polish up the code.

* Fix code format and comments.

* Delete redundant files.

* add unittest.

* edit unittest.

* edit unittest.

* Remove redundant header files.

* Improve coverage and remove redundant code.

* support saving for backward.

* polish code according to comments.

* Add support type for PyLayer.

* Modify the DOC.

* polish Doc.

* polish Doc.

* polish Doc.

* polish Doc.

* polish Doc.

* polish Doc.

* polish code and make the code robust.

* Modify the code format.

29f65225

14 Apr, 2021 1 commit

Add inner register backward hook method for Tensor (#32171) · 7ba85aca

Committed by Chen Weihang on Apr 14, 2021
```
* add register backward hook method

* add leaf grad accumulated test
```
  7ba85aca
13 Apr, 2021 1 commit

add layer.to api (#32040) · 6e946e9d

Committed by chentianyu03 on Apr 13, 2021

* add layer.to api

* add layer.to api

* add layer.to api

* add the doc for Layer.to

* add input type checking

* modify assert and import bug

* format code style

* format code style

* make place support str type

* add SetGradVarBase method to set the gradient after conversion

* modify argument place to device

* modify argument place to device

* modify doc of layers.to API

* add xpuplace to device argument

6e946e9d

01 Apr, 2021 3 commits

add custom init grad for backward function (#31540) · 83b953f5

Committed by chentianyu03 on Apr 01, 2021

* add custom init grad for backward function

* add custom init grad for backward function

* handle when the grad_tensor is none

* handle when the grad_tensor is none

* fix the args type error on windows platform

* modify the args order and doc

* format code

* add grad_tensor to xpu

* modify the grad_tensor type check

* add paddle.backward api to support multi tensors gradient compute

* add paddle.backward api to support multi tensors gradient compute

* add paddle.autograd module and backward api

* change tensor.backward func args

* modify tensor backward api

* remove create_graph inputs args

* add doc and example code for backward api

* when have the same tensor, throw error

* modify test Init func args

* modify the execute.Init func args in test files

* add paddle.autograd package in setup.py.in

* modify error msg, remove _run_backward method in class Tensor

* add test cases for backward api

83b953f5

new group (#31682) · 07741593

Committed by kuizhiqing on Apr 01, 2021
```
* new group

* ci compatible fix

* assert nccl
```
07741593

Refactor and simplify hook design & add Tensor.register_hook API (#31775) · dbeb3ea4

Committed by Chen Weihang on Mar 31, 2021

* refactor and simplify hook design

* fix reducer add hook error

* add Tensor.register_hook basic impl

* refine prepare data impl

* revert prepare data change

* support register_hook for Tensor

* add hook test in model

* polish tests and doc example

* fix double grad test failed

* remove reduce hook func

* fix set empty error

* polish code by comments

* change reduce_hook to mutable_hook

* remove useless tmp_ins

* fix shape code format error

* fix shape code format error

dbeb3ea4

31 Mar, 2021 1 commit

Polish tensor pipeline (#31701) · e973bd73

Committed by Kaipeng Deng on Mar 31, 2021
```
* polish tensor pipeline. test=develop
```
  e973bd73
30 Mar, 2021 1 commit

[dynamic setitem] Fix bug of dynamic setitem: Decrease axes to do right broadcast (#31960) · 57d4288a

Committed by liym27 on Mar 30, 2021

57d4288a
04 Mar, 2021 1 commit

[ROCM] update fluid platform for rocm (part5), test=develop (#31315) · 4d647ec1

Committed by Qi Li on Mar 04, 2021

4d647ec1
26 Feb, 2021 1 commit

[ROCM] update fluid framework for rocm (part6), test=develop (#31015) · 28b356b9

Committed by Qi Li on Feb 26, 2021

28b356b9
20 Feb, 2021 1 commit

[static setitem] Support the index is Tensor; step>1; step<0 (#30949) · 5b367dab

Committed by liym27 on Feb 20, 2021

* [static setitem] support the index step > 1. tensor_a[::3] = value

* [static setitem] support the index step < 0. Eg: tensor_a[::-3] = value

* [static setitem] support the index is Tensor. eg: tensor_a[tensor_3:0:-1] = value

* Add op version.

5b367dab
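
The three index forms listed in this commit message follow ordinary strided-slice semantics, which can be sketched on plain Python lists. This is only an illustration under stated assumptions: `set_strided` is a hypothetical helper rather than Paddle code, the value is a scalar written to every selected position, and the Tensor-valued index `tensor_3` is assumed to hold the integer 3.

```python
def set_strided(seq, value, start=None, stop=None, step=None):
    """Write a scalar `value` to every position selected by seq[start:stop:step]."""
    # slice.indices() resolves None and negative bounds for a given length.
    for i in range(*slice(start, stop, step).indices(len(seq))):
        seq[i] = value
    return seq


a = set_strided([0] * 8, 1, step=3)                     # like tensor_a[::3] = 1
b = set_strided([0] * 8, 2, step=-3)                    # like tensor_a[::-3] = 2
c = set_strided([0] * 8, 3, start=3, stop=0, step=-1)   # like tensor_a[3:0:-1] = 3
```

With a negative step the walk starts from the last element, so `b` marks positions 7, 4 and 1, while `c` marks positions 3, 2 and 1 and leaves position 0 untouched.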

05 Feb, 2021 1 commit

Performance optimization for dynamic setitem: Call op set_value to speed up... · 39f41cb4

Committed by liym27 on Feb 05, 2021

Performance optimization for dynamic setitem: Call op set_value to speed up because the original call to TensorToPyArray will introduce unnecessary data copy. (#30817)

39f41cb4

04 Feb, 2021 1 commit

fix xpu dygraph place (#30868) · 6e3856d3

Committed by WangXi on Feb 04, 2021

6e3856d3
03 Feb, 2021 1 commit

【kunlun】dygraph supports multi xpu card training (#30671) · b1026f64

Committed by WangXi on Feb 03, 2021

b1026f64
29 Jan, 2021 1 commit

rm Singleton of reducer (#30775) · 3858f458

Committed by ShenLiang on Jan 29, 2021

3858f458
19 Jan, 2021 1 commit

support layer_norm fp16 in dygraph amp (#30430) · 7043b8cf

Committed by Leo Chen on Jan 19, 2021
```
* support layer_norm fp16 in dygraph amp

* add ut

* refine code
```
  7043b8cf
13 Jan, 2021 2 commits

Set expected place in child thread for dataloader to avoid costing cuda memory... · 3d015f1c

Committed by Leo Chen on Jan 13, 2021

Set expected place in child thread for dataloader to avoid costing cuda memory on other card (#30338)

* set expected place in child thread for dataloader

* set device id when set tensor from numpy

* revert tensor_py change

* add compile guard

* fix ci

* fix bug

3d015f1c

Support unused parameters in dynamic graph distributed (#30224) · a60f17b8

Committed by ShenLiang on Jan 13, 2021

a60f17b8

08 Jan, 2021 1 commit

Add callback after TensorCopy (#30123) · 1f97d61c

Committed by Leo Chen on Jan 08, 2021

* change to tensor copy sync

* change to tensor copy sync

* make copy_to safe when use TensorCopy

* refine code

* add ut

* add cudapinned garbagecollector

* add testcase: cpu place -> cuda pinned place

1f97d61c

06 Jan, 2021 1 commit

Fix bug: In dynamic mode, if start or end is negative, __getitem__ returns wrong result (#30003) · 9922bd41

Committed by liym27 on Jan 06, 2021

1. when slice_item is a slice:
 1) the start of __getitem__ should be std::max(start, 0)
 2) the end of __getitem__ should be std::min(end, dim)
2. when slice_item is an integer, it should be in [-dim_len, dim_len)
3. Fix error message to use accurate data

9922bd41
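
The clamping rules in this commit message can be sketched in plain Python. This is a minimal illustration and not Paddle's C++ code: the names `normalize_slice` and `normalize_index` are hypothetical, the step is assumed to be 1, and negative bounds are assumed to be offset by the dimension length before clamping, as in Python slicing.

```python
def normalize_slice(start, end, dim):
    """Clamp slice bounds to [0, dim], assuming step == 1."""
    # Assumption: negative bounds count from the end of the dimension.
    if start < 0:
        start += dim
    if end < 0:
        end += dim
    start = max(start, 0)  # rule 1.1 in the commit message
    end = min(end, dim)    # rule 1.2 in the commit message
    return start, end


def normalize_index(index, dim_len):
    """Map an integer index in [-dim_len, dim_len) to [0, dim_len)."""
    if not -dim_len <= index < dim_len:
        raise IndexError(
            f"index {index} is out of range [{-dim_len}, {dim_len})")
    return index + dim_len if index < 0 else index


data = list(range(5))
start, end = normalize_slice(-10, 100, len(data))
clamped = data[start:end]  # same elements as data[-10:100]
last = data[normalize_index(-1, len(data))]
```

Reporting the offending index and the valid range in the error message mirrors the third point of the commit message.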

27 Dec, 2020 1 commit

[Dynamic Inplace] Support ShareInplaceVersionCounterWith for C++ Tensor (#29842) · 9602a182

Committed by liym27 on Dec 27, 2020

* Revert "[inplace] Add ShareHolderWith for class Variable and SharePlaceholderWith in VarBase.detach() to share the same Tensor/SelectedRows (#29267)"

This reverts commit b10ecd9d.

* Support ShareInplaceVersionCounterWith to share the same inplace version counter for VarBase

9602a182

22 Dec, 2020 1 commit

Support multi-stream communication for dynamic graph distributed (#29525) · 01e2874a

Committed by ShenLiang on Dec 22, 2020
```
* fix fleet for multi-stream

* fix memcpy for ncclid

* use sync to solve move operation
```
  01e2874a
09 Dec, 2020 2 commits

support deepcopy for Layer/Tensor/ParamBase (#29387) · e74e1a22

Committed by Zhou Wei on Dec 09, 2020
```
* support deepcopy for Layer/Tensor/ParamBase

* fix some code
```
  e74e1a22
Rebuild group automatically in dynamic graph distributed (#29255) · 2ef9e0e2

Committed by ShenLiang on Dec 09, 2020
```
* add tensor_indices in AssignGroupBySize

* add rebuild group in reducer
```
  2ef9e0e2
04 Dec, 2020 1 commit

[inplace] Add ShareHolderWith for class Variable and SharePlaceholderWith in... · b10ecd9d

Committed by liym27 on Dec 04, 2020

[inplace] Add ShareHolderWith for class Variable and SharePlaceholderWith in VarBase.detach() to share the same Tensor/SelectedRows (#29267)

b10ecd9d

01 Dec, 2020 1 commit

accumulate gradient for leaf tensor with previous graph and expose leaf tensor concept (#28429) · c0a991c8

Committed by Zhou Wei on Dec 01, 2020

* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor

* The leaf tensor concept is exposed and the gradient accumulation of leaf tensor

* fix coverage

* fix api doc

* fix CI unittest

* fix CI unittest

* fix unittest

* empty tensor doesn't need inner_var_

* fix some error message

c0a991c8

30 Nov, 2020 1 commit

Check whether there is any inplace operation affecting gradient calculation. (#27901) · 865a4598

Committed by liym27 on Nov 30, 2020

* Add a class TensorInplaceVersion to count the inplace version and put it in framework::Tensor instead of Allocation or Variable.

* Add a new attribute `_inplace_version` for VarBase.

* Raise exception if an inplace operation can result in incorrect gradient computation.

* Add a new interface _bump_inplace_version() for VarBase to bump the version whenever the Tensor is modified through an inplace operation.

* For api assign, call _bump_inplace_version() when it's an inplace operation in dynamic mode.

* Use original var_wrapper if the inplace_version is not changed.

* Replace SnapshotVarWrapperList with SnapshotVarWrapper to optimize performance.

865a4598
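
The version-counter check described in this commit message can be sketched without any framework. The toy `Tensor` and `SquareNode` classes below are hypothetical and are not Paddle's implementation; only the names `_inplace_version` and `_bump_inplace_version` are taken from the commit message. The sketch assumes the backward node records the version at forward time and compares it again before computing gradients.

```python
class Tensor:
    """Toy tensor that counts how many in-place writes it has seen."""

    def __init__(self, values):
        self.values = list(values)
        self._inplace_version = 0

    def _bump_inplace_version(self):
        self._inplace_version += 1

    def scale_(self, factor):
        """In-place op: changes the stored values, so it bumps the version."""
        self.values = [v * factor for v in self.values]
        self._bump_inplace_version()
        return self


class SquareNode:
    """Backward node for y = x * x, which needs the original x."""

    def __init__(self, x):
        self.x = x
        self.saved_version = x._inplace_version  # snapshot at forward time

    def backward(self, grad_out):
        if self.x._inplace_version != self.saved_version:
            raise RuntimeError(
                "tensor was modified in place after being saved for backward: "
                f"expected version {self.saved_version}, "
                f"got {self.x._inplace_version}")
        return [2 * v * g for v, g in zip(self.x.values, grad_out)]


x = Tensor([1.0, 2.0, 3.0])
node = SquareNode(x)
grads = node.backward([1.0, 1.0, 1.0])  # fine: x is unchanged

x.scale_(10.0)  # in-place write after the forward pass
try:
    node.backward([1.0, 1.0, 1.0])
    error = None
except RuntimeError as exc:
    error = str(exc)
```

Comparing two integers is cheap, which is one plausible reason to prefer a counter over copying the saved tensor for every operation.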

27 Nov, 2020 1 commit

Support dynamic graph distributed (#28997) · e2d01eb6

Committed by ShenLiang on Nov 27, 2020

* add reducer

* refine event for memorycopy

* add concat&split for allreduce

* apply concat & split for fuse tensor

* fix nccl dep

* fix the unittest, compile problem and ddp initialize problem

* fix unittest for mac & add some comments & solve the repeated param in sublayers

* fix unittest for windows & fix document

e2d01eb6
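
The "concat & split for allreduce" steps mentioned in this commit message correspond to a common gradient-fusion pattern, sketched here in plain Python. This is an illustration under assumptions, not the reducer's code: the helpers `concat`, `split` and `allreduce_sum` are hypothetical, gradients are modeled as flat lists, and the collective is simulated by an element-wise sum over two in-process "ranks".

```python
def concat(tensors):
    """Flatten a list of 1-D gradients into one buffer and record their sizes."""
    sizes = [len(t) for t in tensors]
    flat = [v for t in tensors for v in t]
    return flat, sizes


def split(flat, sizes):
    """Inverse of concat: cut the buffer back into per-tensor pieces."""
    out, offset = [], 0
    for n in sizes:
        out.append(flat[offset:offset + n])
        offset += n
    return out


def allreduce_sum(buffers):
    """Stand-in for a collective: element-wise sum across ranks."""
    return [sum(vals) for vals in zip(*buffers)]


# Two simulated ranks, each holding gradients for the same two parameters.
rank0 = [[1.0, 2.0], [3.0]]
rank1 = [[10.0, 20.0], [30.0]]

flat0, sizes = concat(rank0)
flat1, _ = concat(rank1)
reduced = split(allreduce_sum([flat0, flat1]), sizes)
```

Fusing several small gradients into one buffer means one collective call instead of one per parameter, which is the usual motivation for this pattern.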

26 Nov, 2020 1 commit

Split train_mode and has_grad for tracer (#29064) · 770395cb

Committed by Leo Chen on Nov 26, 2020
```
* split train_mode and has_grad

* fix format

* fix ci problems

* fix sample code
```
  770395cb
25 Nov, 2020 1 commit

fix tensor detach to zero copy (#27921) · 8ca0a8a8

Committed by Zhou Wei on Nov 25, 2020
```
* fix tensor detach to zero copy

* fix tensor detach to zero copy
```
  8ca0a8a8

Crayon鑫 / Paddle is in sync with the project it was forked from
