提交 · c0bcbd371414601e3c48047e272922ad06843fc8 · BaiXuePrincess / Paddle

21 3月, 2022 1 次提交

Add yaml config part0 (#40020) · cc853e95

由 hong 提交于 3月 21, 2022

* add add yaml

* add elementwise add yaml; test=develop

* add norm

* update

* add some yaml config; test=develop

* fix bug; test=develop

* fix compare error; test=develop

* revert erger_gen.py

* update; test=deveop

* remove usless code; test=deveop

* fix bug; test=develop

* fix test error; test=develop

* remove int_type; test=develop

* fix type error; test=develop

* format; test=develop

* remove type register; test=develop

* polish code; test=develop

* fix ci error; test=develop

cc853e95

19 3月, 2022 7 次提交

P

fix generator bug; · c43f8afb
由 phlrain 提交于 3月 19, 2022

c43f8afb
Z
Call sparse op from python (#40608) · 95fbbc5b
由 zhangkaihuo 提交于 3月 19, 2022
```
* call sparse api from python
```
95fbbc5b
P

fix some bugs; test=develop · 111ee988
由 phlrain 提交于 3月 19, 2022

111ee988
Z

move deformable_conv forward kernel to phi (#40700) · a8e5c9be
由 zyfncg 提交于 3月 19, 2022

a8e5c9be
C

fix python hook mem leak (#40716) · c46f2ddb
由 Chen Weihang 提交于 3月 19, 2022

c46f2ddb

Add infer meta (#40544) · 8e4e19ab

由 hong 提交于 3月 19, 2022

* add infer meta; test=develop

* add histogram infer meta; test=develop

* fix unitest bug; test=develop

* format; test=develop

* format; test=develop

* bn not use new infer meta; test=develop

* add infer meta; test=develop

* fixbug; test=develop

* fix bug;

* recover unitest; test=develop

8e4e19ab

support inplace in dygraph eager_fluid state (#40400) · 8e612903

由 pangyoki 提交于 3月 19, 2022

* [Eager] Support eager grad interface, draft version

* Support eager grad interface with allow_unused and multi startup_op

* Fix code format

* Fix allow_unused case, return PyNone if tensor not initialize

* Support output's stop_gradient related to create_graph

* Support grad exception case in eager mode, fix coverage CI

* Update ToPyObject, return PyNone if not initialize

* AccumulationNode add FLAGS_retain_grad_for_all_tensor

* Fix ci issue

* Fix CI issue

* fix, use core.eager.Tensor

* Add func SetBufferSlotRankZeros for GradTensorHolder

* Support retain_graph by using ClearTensorWrappers

* Support retain_graph by using ClearTensorWrappers

* Update retain_graph and no_grad_vars related test case

* Update code gen logic for ClearTensorWrappers

* Fix by override statement

* fix override func args

* Support retain_graph, update unit tests

* Updated ClearTensorWrappers logic

* fix grad python interface

* Use deep copy and update unit tests

* Polish code

* Polish code

* Fix CI issue, Deep copy only use when user set grad_tensors

* Fix CI, use Backward instead RunBackward

* Fix CI, Declare kernel explicitly in test file

* Polish, remove vector of TensorWrapper

* Refactor the logic of grad/backward, polish codes

* Update code after merge upstream develop

* Polish after merge upstream develop

* Update to adapt new GradNodeBase superclass

* Fix error introduced during conflict resolution

* support inplace strategy in eager_fluid state

* solve conflict

* nothing

* Update purify potential_startup_nodes logic

* Fix errors

* Polish code

* Remove useless args for ToPyObject

* Remove useless TensorWrappersSet

* fix record conflict

* Fix code-format, re-install pre-commit

* fix tensor_wrapper bug

* Fix pre-process logic for potential_startup_ops

* Update unit tests, use eager mode

* Fix conflicts

* fix unittest timeout

* little change
Co-authored-by: NWeilong Wu <veyron_wu@163.com>

8e612903

18 3月, 2022 15 次提交

[Phi] Migrate gelu/log_softmax/prelu op kernel and infershape (#40393) · aed6faf2

由 shentanyue 提交于 3月 18, 2022

* add gelu

* fix gelu

* add log_softmax

* add prelu kernel and prelu/gelu/logsoftmax infershape

* fix

* fix

* fix

* fix

* fix ci

* log_softmax rewrite

* fix

* fix

* fix conflict

* fix compile error

* fix comment

* fix

* ci_fix
Co-authored-by: NYan Li <liyan665@gmail.com>

aed6faf2

[Phi]Move hierarchical_sigmoid kernel to phi (#40553) · 64a7cbd3

由 Zhang Zheng 提交于 3月 18, 2022

* first commit

* fix compile error

* support std::vector<std::srting>

* fix

* fix op support on GPU by chenweihang

* pass test

* infershape

* add set_dtype

* fix order

* fix

* unify the impl of dt and sr

* fix

64a7cbd3

S

set +x to close showing command, update check_change code with linux (#40456) · 161d27dc
由 Sing_chan 提交于 3月 18, 2022

161d27dc
F
[NPU] fix fp16 (PART I) (#40259) · aaa71ea4
由 furnace 提交于 3月 18, 2022
```
[NPU] fix fp16 (PART I)
```
aaa71ea4
Z
[Phi] Move infershape of roi_pool to phi (#40682) · 579173d8
由 zyfncg 提交于 3月 18, 2022
```
* move infershape of roi_pool to phi

* polish code
```
579173d8

Supported Complex2Real Conversion for Eager Dygraph (#39878) · e3b2a035

由 Zhanlue Yang 提交于 3月 18, 2022

* Supported Complex2Real Conversion for Eager Dygraph

* Supported Complex2Real Conversion for Eager Dygraph

* Enabled complex type promotion test for matmul_v2

* Fix CI issues

* Merged adj_edges_ with GradSlotMeta

* Fixed monir issue

* Adjusted num runs

* Recovered Eager performance tests configurations

* Recovered Eager performance tests configurations

* Adjusted performance tests configurations

* Fixed Minor Issues with performance tests

* Moved out Edge from GradSlotMeta

* Fixed issues from merge

* Fixed typo

* Addressed review comments

* Fixed minor issues

e3b2a035

X
[phi] tranfer kthvalue from fluid to phi (#40676) · d7ccd6bf
由 xiongkun 提交于 3月 18, 2022
```
* tranfer kthvalue from fluid to phi

* transfer infershape
```
d7ccd6bf
A

[NPU] fix no allocator error (#40687) · 8c713223
由 Aganlengzi 提交于 3月 18, 2022

8c713223
S
[DataParallel]Support control flow in new DP (#40593) · 984eacb3
由 ShenLiang 提交于 3月 18, 2022
```
* fix bug

* fix bug
```
984eacb3
Z
Refactored Final State Python-C Code Generation Scripts (#40650) · 35a5e8ee
由 Zhanlue Yang 提交于 3月 18, 2022
```
* Refactored Final State Python-C Code Generation Scripts.

* Bug fix
```
35a5e8ee
L

Use store for gloo process group (#40629) · bb2cb762
由 lilong12 提交于 3月 18, 2022

bb2cb762

[Phi] move reduce_grad kernel into phi (#40522) · 70726696

由 chentianyu03 提交于 3月 18, 2022

* move reduce_mean_grad kernel into phi

* move reduce_max/min_grad into phi

* remove raw max/min grad kernel

* fix bug

* fix max/min grad error

* move all reduce_grad kernel into one file

* add prod grad kernel

* add infermeta for prod kernel

70726696

F
[NPU] fix fp16 (PART II) (#40537) · 1a13fa0f
由 furnace 提交于 3月 18, 2022
```
[NPU] fix fp16 (PART II)
```
1a13fa0f

王

[infrt] rename pd dialect from mlir to infrt. (#40651) · ef4ef154

由王明冬提交于 3月 18, 2022

* [infrt] rename pd dialect from mlir to infrt. test=develop

* [infrt] fix the kernel signature generator bug.

ef4ef154

Z
Optimize perf of softmax_with_cross_entropy_bwd (#40643) · 081e4307
由 Zhang Zheng 提交于 3月 18, 2022
```
* Optimize perf of softmax_with_cross_entropy_bwd

* fix

* fix
```
081e4307

17 3月, 2022 17 次提交

[Phi] Move assign kernel into phi (#40022) · 1904572a

由 Chen Weihang 提交于 3月 17, 2022

* move assign kernel init commit

* change vec<tensor> to vec<tensor*>

* support tensor array

* support api declare

* fix test_list failed

* fix npu and xpu failed

* fix infrt failed

* remove assign array size in operator

* move assign sr header into sr dir

* add infermeta for assign

* test op success

* fix test_list failed

* fix kunlun failed

* add set host allocator in tests

* support tensor array in arg ctx

* open set layout in share_meta

* fix meta tensor layout error

* fix test failed

1904572a

S
merge cpu and gpu graph engines (#40597) · 31776199
由 seemingwang 提交于 3月 17, 2022
```
* extract sub-graph

* graph-engine merging

* fix

* fix

* fix heter-ps config
```
31776199
C
Revert "Fix truncated norm operator (#40287)" (#40614) · 313bff6b
由 Chang Xu 提交于 3月 17, 2022
```
This reverts commit 0c333543.
```
313bff6b
T

fix double-free bug in variables of cinn subgraph (#40609) · 7dad9f70
由 TeFeng Chen 提交于 3月 17, 2022

7dad9f70

CopyFromCpu and CopyToCpu of Onnxruntime back-end optimize (#40561) · fcbb7440

由 heliqi 提交于 3月 17, 2022

* add onnxruntime predictor

* Add code comments

* support link paddle2onnx onnxruntime

* support onnxruntime with python

* support onnxruntime with python

* support onnxruntime with windows

* paddle2onnx compile with windows

* supoort windows compile

* supoort windows compile with onnxruntime

* supoort windows compile with paddle2onnx

* supoort mac compile

* compile with mac

* compile with mac

* add code comments

* fix remind word

* code optimization

* add test case

* add test case

* add inference demo_ci test case

* fix compile paddle2onnx with no python

* add inference demo_ci test case

* add inference demo_ci test case

* add inference infer_ut test case

* support c go api and test cases

* add converage test case

* add converage test case

* add capi test case

* add capi test case

* fix onnxruntime copyfromcpu and copytocpu

* fix goapi

* modify code

fcbb7440

Q

[ROCm] fix bfloat16 support, test=develop (#40401) · da558f0e
由 Qi Li 提交于 3月 17, 2022

da558f0e

[Bug fixes] Fix partial grad conflicts (#40655) · 60899549

由 Weilong Wu 提交于 3月 17, 2022

* [Eager] Support eager grad interface, draft version

* Support eager grad interface with allow_unused and multi startup_op

* Fix code format

* Fix allow_unused case, return PyNone if tensor not initialize

* Support output's stop_gradient related to create_graph

* Support grad exception case in eager mode, fix coverage CI

* Update ToPyObject, return PyNone if not initialize

* AccumulationNode add FLAGS_retain_grad_for_all_tensor

* Fix ci issue

* Fix CI issue

* fix, use core.eager.Tensor

* Add func SetBufferSlotRankZeros for GradTensorHolder

* Support retain_graph by using ClearTensorWrappers

* Support retain_graph by using ClearTensorWrappers

* Update retain_graph and no_grad_vars related test case

* Update code gen logic for ClearTensorWrappers

* Fix by override statement

* fix override func args

* Support retain_graph, update unit tests

* Updated ClearTensorWrappers logic

* fix grad python interface

* Use deep copy and update unit tests

* Polish code

* Polish code

* Fix CI issue, Deep copy only use when user set grad_tensors

* Fix CI, use Backward instead RunBackward

* Fix CI, Declare kernel explicitly in test file

* Polish, remove vector of TensorWrapper

* Refactor the logic of grad/backward, polish codes

* Update code after merge upstream develop

* Polish after merge upstream develop

* Update to adapt new GradNodeBase superclass

* Fix error introduced during conflict resolution

* Update purify potential_startup_nodes logic

* Fix errors

* Polish code

* Remove useless args for ToPyObject

* Remove useless TensorWrappersSet

* Fix code-format, re-install pre-commit

* Fix pre-process logic for potential_startup_ops

* Update unit tests, use eager mode

* Fix conflicts

60899549

Y

rename math (#40641) · 883a8eea
由 YuanRisheng 提交于 3月 17, 2022

883a8eea

[PHI] move roi_pool kernel to phi (#40574) · 7d0db629

由 zyfncg 提交于 3月 17, 2022

* move roi_pool forward kernel to phi

* move roi_pool_grad to phi

* fix compile bug

* fix compile bug

* fix register data_type

7d0db629

Move layer norm to phi (#40193) · 681a6865

由 hong 提交于 3月 17, 2022

* update

* fix bugs; test=develop

* update; test=develop

* fix test compile error; test=develop

* fix cpu compile error; test=develop

* fix test error; test=develo

* fix layer_norm_op plugin error; test=develop

* fix error; test=develop

* fix test bug; test=develop

* update; test=develop

* polish code; test=develop

* fix bugs; test=develop

* remove unused depency; test=develop

* polish code; test=develop

681a6865

Z

move infershape of set_value to phi (#40636) · c335288d
由 zyfncg 提交于 3月 17, 2022

c335288d
Y

move activation sigmoid (#40626) · ed8a9370
由 YuanRisheng 提交于 3月 17, 2022

ed8a9370
Z
[Phi]Move infershape of top_k/expand_as/kron/searchsorted to phi (#40632) · 9ee03302
由 Zhang Zheng 提交于 3月 17, 2022
```
* [Phi]Move infershape of top_k/expand_as/kron/searchsorted to phi

* add set_dtype

* fix order
```
9ee03302
Y

[fleet executor] fleet executor for npu (#40607) · 81848fff
由 Yuang Liu 提交于 3月 17, 2022

81848fff
B

support gpu mixed precision inference (#40531) · 06fee998
由 baoachun 提交于 3月 17, 2022

06fee998

[Eager Grad] Support eager grad interface (#40170) · 4db8cf24

由 Weilong Wu 提交于 3月 17, 2022

* [Eager] Support eager grad interface, draft version

* Support eager grad interface with allow_unused and multi startup_op

* Fix code format

* Fix allow_unused case, return PyNone if tensor not initialize

* Support output's stop_gradient related to create_graph

* Support grad exception case in eager mode, fix coverage CI

* Update ToPyObject, return PyNone if not initialize

* AccumulationNode add FLAGS_retain_grad_for_all_tensor

* Fix ci issue

* Fix CI issue

* fix, use core.eager.Tensor

* Add func SetBufferSlotRankZeros for GradTensorHolder

* Support retain_graph by using ClearTensorWrappers

* Support retain_graph by using ClearTensorWrappers

* Update retain_graph and no_grad_vars related test case

* Update code gen logic for ClearTensorWrappers

* Fix by override statement

* fix override func args

* Support retain_graph, update unit tests

* Updated ClearTensorWrappers logic

* fix grad python interface

* Use deep copy and update unit tests

* Polish code

* Polish code

* Fix CI issue, Deep copy only use when user set grad_tensors

* Fix CI, use Backward instead RunBackward

* Fix CI, Declare kernel explicitly in test file

* Polish, remove vector of TensorWrapper

* Refactor the logic of grad/backward, polish codes

* Update code after merge upstream develop

* Polish after merge upstream develop

* Update to adapt new GradNodeBase superclass

* Fix error introduced during conflict resolution

* Update purify potential_startup_nodes logic

* Fix errors

* Polish code

* Remove useless args for ToPyObject

* Remove useless TensorWrappersSet

* Fix code-format, re-install pre-commit

* Fix pre-process logic for potential_startup_ops

* Update unit tests, use eager mode

4db8cf24

J
fix copy_ problem by doing it with phi copy (#40521) · c1931beb
由 Jiabin Yang 提交于 3月 17, 2022
```
* fix copy_ problem by doing it with phi copy

* improve test coverage

* refactor copy with sr kernel
```
c1931beb

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致