提交 · 6fe9dfb269bf6c635156b19af942e31f25dfa988 · PaddlePaddle / Paddle

08 11月, 2022 2 次提交
- C
  support pow double grad op (#47691) · 6fe9dfb2
  由 Charles-hit 提交于 11月 08, 2022
```
* support pow_double_grad op

* add unit test for pow double grad

* fix pow double grad

* optimize pow double grad kernel

* fix pow double grad kernel
```
  6fe9dfb2
- C
  
  normalize autotune tests dir (#47726) · 6bab3343
  由 Chen Weihang 提交于 11月 08, 2022
  
  6bab3343
07 11月, 2022 5 次提交
- Y
  Define ConvRunner to wrapper the call of cudnn conv functions. (#47576) · c331e2ce
  由 Yiqun Liu 提交于 11月 07, 2022
```
* Define ConvRunner to wrapper the call of cudnn conv functions.

* Use ConvKind in SearchAlgorithm.
```
  c331e2ce
- Q
  support kldiv_loss/kldiv_loss_grad for kunlun (#47638) · 5f0a8adc
  由 QingshuChen 提交于 11月 07, 2022
```
*test=kunlun
```
  5f0a8adc
- Y
  add roll and roll_grad kernels and strided_slice and strided_slice_grad... · 5a4d2186
  由 ykkk2333 提交于 11月 07, 2022
```
add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun (#47368)

* add stat tool

* add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun
```
  5a4d2186
- S
  [PHI] Migrate batch_norm (#47652) · 2337e609
  由 Sławomir Siwek 提交于 11月 07, 2022
```
* init changes

* bnorm

* method signature

* change order

* bnorm

* removed unused args
```
  2337e609
- S
  [PHI] Migrate depthwise_conv2d_grad and conv3d_grad kernels (#47686) · b0c38568
  由 Sławomir Siwek 提交于 11月 07, 2022
```
* remove fwd funcs

* migrate conv grads
```
  b0c38568
04 11月, 2022 8 次提交

Z
Generate static graph code for some activation ops by Yaml (part3) (#47640) · 40cd5271
由 zyfncg 提交于 11月 04, 2022
```
* generate static graph code for some activation op

* fix bug

* fix infermeta of selected_rows
```
40cd5271

Add sin double grad operator. (#47543) · 297f5efe

由 cyber-pioneer 提交于 11月 04, 2022

* add sin double grad operator

* add sin double grad test example

* move sindoublegradopmaker to backward.yaml

* fix sindoublegrad code

* simplify sindoublegrad functor

297f5efe

[XPU] add cumsum op. test=kunlun (#47585) · ac2a94c7

由 houj04 提交于 11月 04, 2022

* [XPU] add cumsum op. test=kunlun

* try to fix linker. test=kunlun

* try to fix linker. test=kunlun

* try to fix linker. test=kunlun

* debug. test=kunlun

* update xpu.cmake. remove unnecessary codes. test=kunlun.

ac2a94c7

S

migrate convs (#47658) · 4a4f3f80
由 Sławomir Siwek 提交于 11月 04, 2022

4a4f3f80

[PHI] Migrate pool2d and pool2d_grad kernels (#47423) · ca4bed7b

由 Piotr Paturej 提交于 11月 04, 2022

* add extra attr property set

* add type_info for all context

* add onednn context to all context

* fix context compile error

* simplify conv kernel args

* pass runtime attr into dev_ctx

* fix marco error

* clear conv_grad_kernel extra args

* merge conv_grad_grad into conv_grad

* clear conv2d_grad_grad extra attrs

* clear yaml and eager extra attr

* fix conv1d error

* change to thread local

* fix npu compile failed

* try to fix windows compile failed

* add conv2d onednn phi kernel

* fix ci bugs (#36)

* fix compile bugs (#38)

* fix extra input transform bug (#39)

* support dynamic created attr (#40)

* reset extra info gen code

* rm conv_grad_grad kernel

* reimpl pass attr adapting

* add int attr support

* remove vector inputnames creating

* fix map at error

* Update paddle/phi/kernels/onednn/conv_grad_kernel.cc
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>

* remove useless extra attrs

* replace mkldnn_engine by onednn_engine

* Migrate pool+grad to PHI

* Update paddle/fluid/operators/mkldnn/test_mkldnn_op_nhwc.cc
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>

* Update paddle/phi/kernels/onednn/pool_grad_kernel.cc
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>

* Update paddle/phi/kernels/onednn/pool_kernel.cc
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
Co-authored-by: NChen Weihang <chenweihang@baidu.com>
Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
Co-authored-by: NChen Weihang <chenwhpro@163.com>
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>

ca4bed7b

[PHI] Migrate softplus kernel (#47406) · 1831919f

由 Sławomir Siwek 提交于 11月 04, 2022

* add extra attr property set

* add type_info for all context

* add onednn context to all context

* fix context compile error

* simplify conv kernel args

* pass runtime attr into dev_ctx

* fix marco error

* clear conv_grad_kernel extra args

* merge conv_grad_grad into conv_grad

* clear conv2d_grad_grad extra attrs

* remove redundant imports

* migrate softmax

* clear yaml and eager extra attr

* fix conv1d error

* change to thread local

* fix npu compile failed

* try to fix windows compile failed

* add conv2d onednn phi kernel

* fix ci bugs (#36)

* fix compile bugs (#38)

* fix extra input transform bug (#39)

* support dynamic created attr (#40)

* reset extra info gen code

* rm conv_grad_grad kernel

* reimpl pass attr adapting

* add int attr support

* remove vector inputnames creating

* merge dev

* fix map at error

* adjust attribute

* adapt funcs to PHI

* init

* adjust imports

* support postops

* format codeblocks

* revert changes to softmax
Co-authored-by: NChen Weihang <chenweihang@baidu.com>
Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>

1831919f

Y

fix deepfm and deep_wide bug, add embedding_sparse_grad kernel, test=kunlun (#47365) · f53e920d
由 ykkk2333 提交于 11月 04, 2022

f53e920d
Z

matmul_v2 support new case and fix masked_select bug for xpu, test=kunlun (#47370) · 6916215e
由 zhangyikun02 提交于 11月 04, 2022

6916215e

03 11月, 2022 7 次提交

W

Weight and bias's stop_gradient of BatchNorm must be True or False at the same time (#47634) · 21277904
由 wanghuancoder 提交于 11月 03, 2022

21277904
sparse attention kernel is used from 11.8 (#47594) · 7648f429
由 zhouweiwei2014 提交于 11月 03, 2022

7648f429
S

fix gemm compute_type (#47613) · 954be40d
由 sneaxiy 提交于 11月 03, 2022

954be40d

[PHI] Migrate softmax kernel (#47339) · b8ae3858

由 Sławomir Siwek 提交于 11月 03, 2022

* add extra attr property set

* add type_info for all context

* add onednn context to all context

* fix context compile error

* simplify conv kernel args

* pass runtime attr into dev_ctx

* fix marco error

* clear conv_grad_kernel extra args

* merge conv_grad_grad into conv_grad

* clear conv2d_grad_grad extra attrs

* remove redundant imports

* migrate softmax

* clear yaml and eager extra attr

* fix conv1d error

* change to thread local

* fix npu compile failed

* try to fix windows compile failed

* add conv2d onednn phi kernel

* fix ci bugs (#36)

* fix compile bugs (#38)

* fix extra input transform bug (#39)

* support dynamic created attr (#40)

* reset extra info gen code

* rm conv_grad_grad kernel

* reimpl pass attr adapting

* add int attr support

* remove vector inputnames creating

* merge dev

* fix map at error

* adjust attribute

* adapt funcs to PHI
Co-authored-by: NChen Weihang <chenweihang@baidu.com>
Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>

b8ae3858

Z

[Sparse] Unified api args name (#47529) · f9a0605d
由 zhangkaihuo 提交于 11月 03, 2022

f9a0605d
[Zero-Dim] support input 0D Tensor for min/max/amin/amax/prod/logsumexp/all/any (#47501) · a7509ce3
由 zhouweiwei2014 提交于 11月 03, 2022

a7509ce3
Y

fix xpu ci bugs, test=kunlun (#47581) · da083436
由 YuanRisheng 提交于 11月 03, 2022

da083436

02 11月, 2022 5 次提交
- Z
  fix ci bug (#47583) · 0967506e
  由 zhangbo9674 提交于 11月 02, 2022
```
* fix ci bug

* test
```
  0967506e
- T
  
  fix amax/amin/max/min write overflow (#47570) · 6f7a80c3
  由 Tao Luo 提交于 11月 02, 2022
  
  6f7a80c3
- Y
  [PHI]Standardise some C++ API (Part3) (#47532) · fe8c6796
  由 YuanRisheng 提交于 11月 02, 2022
```
* Standardise batch norm

* standardize conv3d and depwise_conv2d

* fix ci bugs
```
  fe8c6796
- [Zero-Dim] support input 0D Tensor for some binary api (#46909) · cad2e68d
  由 zhouweiwei2014 提交于 11月 02, 2022
  
  cad2e68d
- H
  [XPU] add int64 support for slice and subtract. (#47409) · 77395619
  由 houj04 提交于 11月 02, 2022
```
* [XPU] add int64 support for slice and subtract. test=kunlun

* try to fix xpu compile. test=kunlun

* try to fix xpu compile. test=kunlun

* try to fix xpu compile. test=kunlun

* remove unnecessary modification. test=kunlun
```
  77395619
01 11月, 2022 7 次提交

S

[geometric] Optimize graph sample speed (#47531) · 2a932e55
由 Siming Dai 提交于 11月 01, 2022

2a932e55

Fix bugs in tranpose kernel (#47212) · ec7fe888

由 limingshu 提交于 11月 01, 2022

* first commit

* transpose_kernel_optimization

* first complishment of transpose op

* second commit

* refine code logics of tranpose_kernel

* refine transpose kernel

* first commit

* fix DtoD copy bugs for hip

* refine code according to the PR advice

* change dim to int64_t type.

* fix some type error

ec7fe888

Y
[PHI]Standardise some C++ API (Part2) (#47510) · 399047d7
由 YuanRisheng 提交于 11月 01, 2022
```
* standard_api

* add hardtanh
```
399047d7

[EinsumOp] Einsum support complex grad (#47514) · e930c576

由 xiongkun 提交于 11月 01, 2022

* Einsum Support Complex

* code fix

* add unittest for complex grad with einsum

* set rtol=1e-4

* fix

e930c576

W

remove unused-local-typedefs warning on linux (#47513) · 96f36962
由 Wang Xin 提交于 11月 01, 2022

96f36962

Adapting device-specific Extra Attributes for the PHI kernel (#46342) · c923e6c9

由 Chen Weihang 提交于 10月 31, 2022

* add extra attr property set

* add type_info for all context

* add onednn context to all context

* fix context compile error

* simplify conv kernel args

* pass runtime attr into dev_ctx

* fix marco error

* clear conv_grad_kernel extra args

* merge conv_grad_grad into conv_grad

* clear conv2d_grad_grad extra attrs

* clear yaml and eager extra attr

* fix conv1d error

* change to thread local

* fix npu compile failed

* try to fix windows compile failed

* add conv2d onednn phi kernel

* fix ci bugs (#36)

* fix compile bugs (#38)

* fix extra input transform bug (#39)

* support dynamic created attr (#40)

* reset extra info gen code

* rm conv_grad_grad kernel

* reimpl pass attr adapting

* add int attr support

* remove vector inputnames creating

* fix map at error

* Update paddle/phi/kernels/onednn/conv_grad_kernel.cc
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>

* remove useless extra attrs

* replace mkldnn_engine by onednn_engine
Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>

c923e6c9

U

summer-ospp 2022: 飞桨PaddlePaddle Sparse Conv开发和优化: gather-gemm-scatter fuse (#46679) · 5158fa4f
由 umiswing 提交于 11月 01, 2022

5158fa4f

31 10月, 2022 6 次提交

Y
[PHI]Standardise some C++ API (#47385) · 60e0c506
由 YuanRisheng 提交于 10月 31, 2022
```
* standard api

* fix ci bugs

* fix ci bugs

* fix ce bugs
```
60e0c506

[Einsum] Einsum support repeated labels. (#47290) · 6e1c14e3

由 xiongkun 提交于 10月 31, 2022

* add unittest for einsum-v2-trace and diagonal

* repeat labels.

* einsum support repeated labels.

* forward is ok for diagonal and undiagonalized.
TODO: check backward is ok by our theorem.

* backward is ok!

* fix by PR suggestions.

* fix ci error

* fix ci error

* fix ci warning

6e1c14e3

R
[CustomDevice] GetCCLComm add custom device support (#47168) · 34d13d6a
由 ronnywang 提交于 10月 31, 2022
```
* [CustomDevice] GetCCLComm add custom device support

* update

* update

* update
```
34d13d6a

[ControlFlow] replace executor in run method of control flow ops with standalone_executor (#45696) · 3b219e5e

由 kangguangli 提交于 10月 31, 2022

* replace executor in conditional_block_op.run with standalone_executor

* add block_id as the argument of standalone executor's method run; add print for program

* fix scope bug about conditional block op

* fix bug: unnecessary return of fetch value

* fix typo

* fix: quantization will set variable persistable, and these variables must exist in global scope

* add interpretercore cache for conditional block op but not activate in default

* fix bug: local scope reuse for conditional block op

* reset scope when conditional block op runs

* fix typo

* fix typo and code style

* add build scope for conditional block op

* add skip for transfer_layout kernel

* refind code

* fix reset_scope

* fix reset_scope

* refine code

* refine code

* refine code

1. remove flag use in conditional_block_op
2. pass execution_config to BuildOpFuncList instead of individual parameter

* refine code

* remove the use of FLAGS_control_flow_use_new_executor_cache

* change FLAGS_control_flow_use_new_executor to false

3b219e5e

[Zero-Dim] support input 0D Tensor for reduce_sum/reduce_mean (#47219) · c8fc3379
由 zhouweiwei2014 提交于 10月 31, 2022

c8fc3379
W

remove boost compiler flags in flags.cmake (#47468) · 91096ae2
由 Wang Xin 提交于 10月 31, 2022

91096ae2

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功