提交 · 4b269baaad1c61184d62510afbb7dad514df07e6 · PaddlePaddle / Paddle

17 3月, 2022 6 次提交

W
Revert "[Eager Grad] Support eager grad interface (#40170)" · 4b269baa
由 Weilong Wu 提交于 3月 17, 2022
```
This reverts commit 4db8cf24.
```
4b269baa
B

support gpu mixed precision inference (#40531) · 06fee998
由 baoachun 提交于 3月 17, 2022

06fee998

[Eager Grad] Support eager grad interface (#40170) · 4db8cf24

由 Weilong Wu 提交于 3月 17, 2022

* [Eager] Support eager grad interface, draft version

* Support eager grad interface with allow_unused and multi startup_op

* Fix code format

* Fix allow_unused case, return PyNone if tensor not initialize

* Support output's stop_gradient related to create_graph

* Support grad exception case in eager mode, fix coverage CI

* Update ToPyObject, return PyNone if not initialize

* AccumulationNode add FLAGS_retain_grad_for_all_tensor

* Fix ci issue

* Fix CI issue

* fix, use core.eager.Tensor

* Add func SetBufferSlotRankZeros for GradTensorHolder

* Support retain_graph by using ClearTensorWrappers

* Support retain_graph by using ClearTensorWrappers

* Update retain_graph and no_grad_vars related test case

* Update code gen logic for ClearTensorWrappers

* Fix by override statement

* fix override func args

* Support retain_graph, update unit tests

* Updated ClearTensorWrappers logic

* fix grad python interface

* Use deep copy and update unit tests

* Polish code

* Polish code

* Fix CI issue, Deep copy only use when user set grad_tensors

* Fix CI, use Backward instead RunBackward

* Fix CI, Declare kernel explicitly in test file

* Polish, remove vector of TensorWrapper

* Refactor the logic of grad/backward, polish codes

* Update code after merge upstream develop

* Polish after merge upstream develop

* Update to adapt new GradNodeBase superclass

* Fix error introduced during conflict resolution

* Update purify potential_startup_nodes logic

* Fix errors

* Polish code

* Remove useless args for ToPyObject

* Remove useless TensorWrappersSet

* Fix code-format, re-install pre-commit

* Fix pre-process logic for potential_startup_ops

* Update unit tests, use eager mode

4db8cf24

J
fix copy_ problem by doing it with phi copy (#40521) · c1931beb
由 Jiabin Yang 提交于 3月 17, 2022
```
* fix copy_ problem by doing it with phi copy

* improve test coverage

* refactor copy with sr kernel
```
c1931beb
C

move grid sample op infershape (#40625) · b1b24463
由 Chen Weihang 提交于 3月 17, 2022

b1b24463

Improve the performance of fake quantize OP (#40491) · 827b6a0e

由 Leo Chen 提交于 3月 17, 2022

* Move the computation of moving average scale to device

* Use register to save local maximum in a thread

827b6a0e

16 3月, 2022 21 次提交

C

move determinant op infershape (#40624) · a09a93a1
由 Chen Weihang 提交于 3月 16, 2022

a09a93a1
L
[KP] Fix registry and add UT for thresholded_relu & softshrink (#40524) · bef6f2e1
由 Lijunhui 提交于 3月 16, 2022
```
* init commit

* correct namespace
```
bef6f2e1
F
Add yaml config for pool2d (#40563) · ac5cc136
由 From00 提交于 3月 16, 2022
```
* Add yaml config for pool2d

* Fix CI error

* Fix code format error
```
ac5cc136

[Phi] Migrate mode_op and mode_grad_op into Phi (#40571) · 00183a93

由 Aurelius84 提交于 3月 16, 2022

* [Phi] Migrate mode_op and mode_grad_op into Phi

* fix omp

* add ifdef

* migrate infershape

* modify according reviewer

00183a93

Refactor elementwise op grad classes (#40187) · 7004f65c

由 piotrekobi 提交于 3月 16, 2022

* Refactor elementwise op grad classes

* Add more refactor changes

* Revert set layout and format deletion

* Fix failing elementwise test

7004f65c

Quantize elementwise mul (#40546) · 2def79bc

由 Zuza 提交于 3月 16, 2022

* Quantize elementwise mul op

* Parametrize elementwise functions

* Fix code formatting

2def79bc

R

clean up DeviceManager in advance manually (#40504) · 23c036d6
由 ronnywang 提交于 3月 16, 2022

23c036d6
Z

Add tensor desc size check (#40518) · 849bfbbf
由 zlsh80826 提交于 3月 16, 2022

849bfbbf
Z
[Phi] Move roi_align grad kernel and infershape from fuild to phi (#40556) · 3898080e
由 zyfncg 提交于 3月 16, 2022
```
* move roi_align_grad kernel

* move roi_align grad kernel and infershape to phi

* remove roi_align infershape
```
3898080e

[PHI] Migrate roll op (#40257) · 44d46d03

由 chenenquan 提交于 3月 16, 2022

* [PHI] Migrate roll op

* 【phi】migrate eigh op to phi (#40213)

* migrate eigh to phi

* optimize code

* modify code according to comment

* conflict resolution

* [PHI] Migrate roll op

* [PHI] Fix converage of roll_sig

* [PHI] Fix infermate of roll_sig

* [Phi] Fix unittest coverage of roll op

* [PHI] Fix infermeta in unary

* [PHI] Fix parameter type of roll op

* [PHI] Fix parameter type of roll op

* [PHI] Fix parameter of roll op
Co-authored-by: Ncrystal <62974595+Zjq9409@users.noreply.github.com>

44d46d03

[PHI] Migrate index_select op (#40260) · 99452af7

由 chenenquan 提交于 3月 16, 2022

* [PHI] Migrate index_select op

* [PHI] Fix bug in test_variable

* [PHI] migrate index_select op

99452af7

Y

move activation kernel (#40565) · 57f54d3b
由 YuanRisheng 提交于 3月 16, 2022

57f54d3b
L
[KP]fix bug that cannot fallback to CPU normally in XPU KP (#40576) · 603f8425
由 Liu-xiandong 提交于 3月 16, 2022
```
* [kp]fix bug that cannot fallback to CPU normally in XPU KP

* fix bug in static graph
```
603f8425

[Phi] Migrate multiplex, qr, tril_triu op kernel to phi (#40007) · dce87e3d

由 caozhou 提交于 3月 16, 2022

* migrate multiplex op kernel

* migrate qr cpu kernel

* migrate tril_triu op kernel

* fix multiplex kernel

* add kernel sig

* fix dependence and bug

* fix multiplex error

* fix npu include error

* fix conflict

* fix conflict and delete tril_triu

* fix date and multiplex input

* adapt header file order

* fix header file include

* fix conflict

* delete cholesky_solve_op.h

* delete triangular_solve_op.h

dce87e3d

X

tranfer cumprod and kldiv_loss infershape to phi (#40575) · 6d205516
由 xiongkun 提交于 3月 16, 2022

6d205516
C

move isclose infershape (#40595) · c7637700
由 Chen Weihang 提交于 3月 16, 2022

c7637700

[Phi] Move grid sample op kernel into phi (#40585) · 8fd20b5b

由 Chen Weihang 提交于 3月 16, 2022

* add grid sample phi kernel

* add grid sample phi kernel and remove original kernel

* replace mutable_data by alloc

8fd20b5b

Q

[MLU] support amp O1 of mlu (#40461) · ad81f22c
由 qipengh 提交于 3月 16, 2022

ad81f22c
Z

Fixed issue with default-valued attributes (#40368) · f748b433
由 Zhanlue Yang 提交于 3月 16, 2022

f748b433
C

move gather infershape (#40594) · 59e5c49f
由 Chen Weihang 提交于 3月 16, 2022

59e5c49f

[Auto Parallel] Add the support for the auto completion of while_op (#39939) · ec6b8fbd

由 Yulong Ao 提交于 3月 16, 2022

* [Auto Parallel] Support the auto completion of while_op

* [Auto Parallel] Improve the completion algorithms

* [Auto Parallel] Fix bugs for ernie inference

* [Auto Parallel] Remove attrs which cannot be pickled

* [Auto Parallel] make the dims_mappings of LodTensorArray vars empty

* [Auto Parallel] Fix bugs for the ernie inference in the pipeline parallel

* [Auto Parallel] Remove unncessary comments

* [Auto Parallel] Fix a bug of the CMakeLists

* [Auto Parallel] Use the newest APIs to write the unit test

* [Auto Parallel] Remove unnecessary statements

ec6b8fbd

15 3月, 2022 13 次提交

[Phi] Move determinant op kernel into phi (#40539) · a04a6bd5

由 Chen Weihang 提交于 3月 15, 2022

* add determinant phi kernel

* remove original determinant op kernel

* add determinant grad [hi kernel

* fix determinant test failed

* remove original determinant grad op kernel

a04a6bd5

[phi] modify the shape OP and move inferMeta of shape,matrix_pow,multi_dot (#40506) · 31729a62

由 Liu-xiandong 提交于 3月 15, 2022

* [phi] move matrix_power op

* MatrixInverse fluid -> phi

* modify the CMake to fix compile bug

* delete useless comment

* mutable memory -> phi Alloc

* modify the include file

* modify the include file

* fix bug in CI compiler

* [phi]modify the shape OP and move inferMeta of shape,matrix_pow,multi_dot

* delete useless comment

* fix bug in CI

* modify after review

31729a62

add number count op (#39224) · 9bdee437

由 Roc 提交于 3月 15, 2022

* add expert count op

add ut for expert_count

* update UT only for cuda

* fix for rocm

* update ut

* add moe module

* add expert count op

add ut for expert_count

* update UT only for cuda

* update ut

* add moe module

* make expert count private

* rename expert count op
Co-authored-by: Nhlygit66666 <2570058140@qq.com>

9bdee437

X
run python api in eager model and filter the out in argument list (#40523) · 4d886f75
由 xiongkun 提交于 3月 15, 2022
```
* run python api in eager model and filter the out in argument list

* fix code
```
4d886f75
Z
Fixed issues with generated scale operator (#40482) · 30417999
由 Zhanlue Yang 提交于 3月 15, 2022
```
* Fixed issues with generated scale operator

* Fixed minor issues
```
30417999
F
[NPU] add AMP O1 support (#40362) · 69dd43d1
由 furnace 提交于 3月 15, 2022
```
* [NPU] add AMP O1 support

* [NPU] fix NOTE and warnings
```
69dd43d1

[Phi] Move gather op kernel into phi (#40500) · 0c703fe7

由 Chen Weihang 提交于 3月 15, 2022

* add phi gather kernel

* update year

* remove original gather opkernel

* add gather grad phi kernels

* remove origin gather grad kernel

* fix failed npu and xpu

* fix xpu compile failed

0c703fe7

oneDNN NHWC fixes (#40049) · dde9cec0

由 Jacek Czaja 提交于 3月 15, 2022

* - Prototype of third solution

- fix

- compilation fixes

- fix

- fixe

- fix

- fix

- compilation fix

- comment fix

- lint

update mkldnn conv_elementwise_add_fuse_pass ut

- NHWC changes to prelu

- alhpa dims

- UT fix

- fix to UT

- lint

- Some fixes

- added to BWD of prelu NHWC support

- reverted removal of resetting cu_layout in clearing of caching

* - Small changes

* - compilation fix

* - fix

* - fix

* lint

* - fixes after internal review

* - compilation fix

* - lint

dde9cec0

T
add shard_id (#40261) · 6b7d4845
由 Thunderbrook 提交于 3月 15, 2022
```
* shard_id

* format
```
6b7d4845

[phi] Transfer lgamma, kldiv_loss, isclose, cumprod kernels into phi and pass... · 64223620

由 xiongkun 提交于 3月 15, 2022

[phi] Transfer lgamma, kldiv_loss, isclose, cumprod kernels into phi and pass the tests of these four kernels (#39770)

* tranfer and pass the lgamma unittest

* merge and pass the test

* transfer kldiv_loss and kldiv_loss_grad; pass the unitest

* trafer the isclose and cumprod kernel

* change PT_REGISTER -> PD_REGISTER

* fix by code review

* fix by code review

* fix

* remove enforce include dependence from scalar

* fix

* fix by code review

* fix by code review

64223620

[Phi]move reduce_min/any/all kernel (#40374) · c46e661d

由 chentianyu03 提交于 3月 15, 2022

* add reduce_min kernel

* remove raw reduce_min kernel

* add reduce min

* add reduce any all impl

* add bool reduce Kernel

* remove raw any/all kernel

* add any all kernel

* rm comment

c46e661d

Added more profile signposts to dygraph (#40201) · 36db75b4

由 Zhanlue Yang 提交于 3月 15, 2022

* Added more signposts to dygraph profiling

* Fixed minor issues

* Refactored signpost names

* Fixed typo

* Removed debug codes

* Fixed typo

* Adjusted signpost names

* Fixed issues from branch merge

36db75b4

Move one hot to phi (#39876) · 7701db37

由 hong 提交于 3月 15, 2022

* move one hot to phi; test=develop

* fix bugs; test=develop

* fix bugs; test=develop

* add infer meta; test=develop

* fix bugs; test=develop

* resolve confilct

* resolve confilct

* fix bug;

* fix error; test=develop

* update; test=develop

* polish code; test=develop

* add one api in eager mode; test=develop

* add one hot test; test=develop

* remove use less code; test=develop

* fix bug; test=develop

* polish code; test=develop

* polish code; test=develop

7701db37

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功