Commits · 4db8cf240093a3258903d70a69af968aceab51be · PaddlePaddle / Paddle

17 Mar, 2022 10 commits

[Eager Grad] Support eager grad interface (#40170) · 4db8cf24

Committed by Weilong Wu on Mar 17, 2022

* [Eager] Support eager grad interface, draft version

* Support eager grad interface with allow_unused and multi startup_op

* Fix code format

* Fix allow_unused case, return PyNone if tensor not initialize

* Support output's stop_gradient related to create_graph

* Support grad exception case in eager mode, fix coverage CI

* Update ToPyObject, return PyNone if not initialize

* AccumulationNode add FLAGS_retain_grad_for_all_tensor

* Fix ci issue

* Fix CI issue

* fix, use core.eager.Tensor

* Add func SetBufferSlotRankZeros for GradTensorHolder

* Support retain_graph by using ClearTensorWrappers

* Support retain_graph by using ClearTensorWrappers

* Update retain_graph and no_grad_vars related test case

* Update code gen logic for ClearTensorWrappers

* Fix by override statement

* fix override func args

* Support retain_graph, update unit tests

* Updated ClearTensorWrappers logic

* fix grad python interface

* Use deep copy and update unit tests

* Polish code

* Polish code

* Fix CI issue, Deep copy only use when user set grad_tensors

* Fix CI, use Backward instead RunBackward

* Fix CI, Declare kernel explicitly in test file

* Polish, remove vector of TensorWrapper

* Refactor the logic of grad/backward, polish codes

* Update code after merge upstream develop

* Polish after merge upstream develop

* Update to adapt new GradNodeBase superclass

* Fix error introduced during conflict resolution

* Update purify potential_startup_nodes logic

* Fix errors

* Polish code

* Remove useless args for ToPyObject

* Remove useless TensorWrappersSet

* Fix code-format, re-install pre-commit

* Fix pre-process logic for potential_startup_ops

* Update unit tests, use eager mode

4db8cf24

Refine io for test_mnist.py (#40496) · 1e045cae

Committed by 0x45f on Mar 17, 2022

* for test_mnist.py

* remove comments

* using type() replace isinstance()

* valid vars for run program OP in io.py

* open test_mnist in eager_guard for coverage

1e045cae

[infrt] move pd_ops.td to pd folder. test=develop (#40613) · 4c01763c

Committed by 王明冬 on Mar 17, 2022

4c01763c

Optimize the performance of C++ API (#40640) · add304ed

Committed by zyfncg on Mar 17, 2022

* Optimize performance

* optimize c++ api performance

* remove unused code

* fix paddle throw

* update format

add304ed

fix copy_ problem by doing it with phi copy (#40521) · c1931beb

Committed by Jiabin Yang on Mar 17, 2022

* fix copy_ problem by doing it with phi copy

* improve test coverage

* refactor copy with sr kernel

c1931beb

move grid sample op infershape (#40625) · b1b24463

Committed by Chen Weihang on Mar 17, 2022

b1b24463

Improve the performance of fake quantize OP (#40491) · 827b6a0e

Committed by Leo Chen on Mar 17, 2022

* Move the computation of moving average scale to device

* Use register to save local maximum in a thread

827b6a0e

Trt engine. (#40532) · 3082ed46

Committed by Wilber on Mar 17, 2022

* infrt add trt engine

* fix register

* file generate

* fix ci error

* fix conflict

* add copyright

* update

* update

* update

* update engine name

* refactor trt code

* update

* update

* update

* update

* fix conflict

* update

* fix compile with cuda

3082ed46

[infrt] add default kernel argument remap feature in phi_op_convert_pass. (#40633) · 46abe798

Committed by 王明冬 on Mar 17, 2022

46abe798

[infrt] move pd dialect position. test=develop (#40616) · 3a256637

Committed by 王明冬 on Mar 17, 2022

3a256637

16 Mar, 2022 30 commits
- [Ops] segment pool op support for int int64 kernel. (#40577) · 6849d33b
  Committed by Zhong Hui on Mar 16, 2022
```
* segment pool support for int int64 kernel.

* add support in python api
```
  6849d33b
- Optimize the computation of log_softmax (#40612) · 2dec25db
  Committed by Zhang Zheng on Mar 16, 2022
```
* Optimize the computation of log_softmax

* modify the var name
```
  2dec25db
- move determinant op infershape (#40624) · a09a93a1
  Committed by Chen Weihang on Mar 16, 2022
  
  a09a93a1
- [KP] Fix registry and add UT for thresholded_relu & softshrink (#40524) · bef6f2e1
  Committed by Lijunhui on Mar 16, 2022
```
* init commit

* correct namespace
```
  bef6f2e1
- Add model check (#40398) · 9fc89b34
  Committed by huzhiqiang on Mar 16, 2022
  
  9fc89b34
- Add yaml config for pool2d (#40563) · ac5cc136
  Committed by From00 on Mar 16, 2022
```
* Add yaml config for pool2d

* Fix CI error

* Fix code format error
```
  ac5cc136
- Fix Jetson compilation error in pooling (#40586) · 8ffcf596
  Committed by From00 on Mar 16, 2022
  
  8ffcf596
- [Phi] Migrate mode_op and mode_grad_op into Phi (#40571) · 00183a93
  Committed by Aurelius84 on Mar 16, 2022
```
* [Phi] Migrate mode_op and mode_grad_op into Phi

* fix omp

* add ifdef

* migrate infershape

* modify according reviewer
```
  00183a93
- Refactor elementwise op grad classes (#40187) · 7004f65c
  Committed by piotrekobi on Mar 16, 2022
```
* Refactor elementwise op grad classes

* Add more refactor changes

* Revert set layout and format deletion

* Fix failing elementwise test
```
  7004f65c
- Quantize elementwise mul (#40546) · 2def79bc
  Committed by Zuza on Mar 16, 2022
```
* Quantize elementwise mul op

* Parametrize elementwise functions

* Fix code formatting
```
  2def79bc
- Modify save_quant_model to support different input and output filenames (#40542) · dec2b1ca
  Committed by joanna.wozna.intel on Mar 16, 2022
```
* Modify save_quant_model.py to support different input and output filenames

* Correct wrong order of arguments
```
  dec2b1ca
- clean up DeviceManager in advance manually (#40504) · 23c036d6
  Committed by ronnywang on Mar 16, 2022
  
  23c036d6
- Fix tile_op inferShape (#40589) · f5bf46e6
  Committed by Aurelius84 on Mar 16, 2022
```
* Fix tile_op inferShape

* fix style
```
  f5bf46e6
- fix paddle.optimizer.SGD en docs (#40479) · 8e631715
  Committed by Nyakku Shigure on Mar 16, 2022
```
* align to cn docs

* add parameter `weight_decay`
```
  8e631715
- Add tensor desc size check (#40518) · 849bfbbf
  Committed by zlsh80826 on Mar 16, 2022
  
  849bfbbf
- Restructure sparse conv (#40570) · 2f5fb031
  Committed by zhangkaihuo on Mar 16, 2022
```
restructure conv
```
  2f5fb031
- [Phi] Move roi_align grad kernel and infershape from fluid to phi (#40556) · 3898080e
  Committed by zyfncg on Mar 16, 2022
```
* move roi_align_grad kernel

* move roi_align grad kernel and infershape to phi

* remove roi_align infershape
```
  3898080e
- [PHI] Migrate roll op (#40257) · 44d46d03
  Committed by chenenquan on Mar 16, 2022
```
* [PHI] Migrate roll op

* [phi] migrate eigh op to phi (#40213)

* migrate eigh to phi

* optimize code

* modify code according to comment

* conflict resolution

* [PHI] Migrate roll op

* [PHI] Fix coverage of roll_sig

* [PHI] Fix infermeta of roll_sig

* [Phi] Fix unittest coverage of roll op

* [PHI] Fix infermeta in unary

* [PHI] Fix parameter type of roll op

* [PHI] Fix parameter type of roll op

* [PHI] Fix parameter of roll op

Co-authored-by: Ncrystal <62974595+Zjq9409@users.noreply.github.com>
```
  44d46d03
- [PHI] Migrate index_select op (#40260) · 99452af7
  Committed by chenenquan on Mar 16, 2022
```
* [PHI] Migrate index_select op

* [PHI] Fix bug in test_variable

* [PHI] migrate index_select op
```
  99452af7
- move activation kernel (#40565) · 57f54d3b
  Committed by YuanRisheng on Mar 16, 2022
  
  57f54d3b
- [KP]fix bug that cannot fallback to CPU normally in XPU KP (#40576) · 603f8425
  Committed by Liu-xiandong on Mar 16, 2022
```
* [kp]fix bug that cannot fallback to CPU normally in XPU KP

* fix bug in static graph
```
  603f8425
- Add Support Layer List to ASP (#40253) · c040bbd7
  Committed by Ming-Xu Huang on Mar 16, 2022
  
  c040bbd7
- fix xpu op test, *test=kunlun (#40409) · d1a98f0b
  Committed by TTerror on Mar 16, 2022
  
  d1a98f0b
- [Phi] Migrate multiplex, qr, tril_triu op kernel to phi (#40007) · dce87e3d
  Committed by caozhou on Mar 16, 2022
```
* migrate multiplex op kernel

* migrate qr cpu kernel

* migrate tril_triu op kernel

* fix multiplex kernel

* add kernel sig

* fix dependence and bug

* fix multiplex error

* fix npu include error

* fix conflict

* fix conflict and delete tril_triu

* fix date and multiplex input

* adapt header file order

* fix header file include

* fix conflict

* delete cholesky_solve_op.h

* delete triangular_solve_op.h
```
  dce87e3d
- [infrt] add parse for infrt.dense_tensor_type. test=develop (#40592) · 517b1a7c
  Committed by 王明冬 on Mar 16, 2022
  
  517b1a7c
- [Phi]move reduce kernels into one file (#40584) · 84e17a31
  Committed by chentianyu03 on Mar 16, 2022
```
* move reduce kernels into one file

* rename reduce_prod to prod

* move reduce sum/mean from math_kernel into reduce_kernel

* rm comment
```
  84e17a31
- transfer cumprod and kldiv_loss infershape to phi (#40575) · 6d205516
  Committed by xiongkun on Mar 16, 2022
  
  6d205516
- move isclose infershape (#40595) · c7637700
  Committed by Chen Weihang on Mar 16, 2022
  
  c7637700
- [Phi] Move grid sample op kernel into phi (#40585) · 8fd20b5b
  Committed by Chen Weihang on Mar 16, 2022
```
* add grid sample phi kernel

* add grid sample phi kernel and remove original kernel

* replace mutable_data by alloc
```
  8fd20b5b
- [MLU] support amp O1 of mlu (#40461) · ad81f22c
  Committed by qipengh on Mar 16, 2022
  
  ad81f22c

PaddlePaddle / Paddle synced successfully almost 2 years ago
