提交 · b96a21df4e7a42b2445104426e2be407534705e6 · PaddlePaddle / Paddle

10 11月, 2022 3 次提交

change cudnn error to cuda error if compiled cuda version is incompatible with... · b96a21df

由 pangyoki 提交于 11月 10, 2022

change cudnn error to cuda error if compiled cuda version is incompatible with installed cuda version (#47743)

* fix cudnn error

* fix

* fix

* fix

b96a21df

XPU multi-card support eager mode (#47445) · 3b91f8f3

由 james 提交于 11月 10, 2022

* XPU support eager mode

* add unittest for XPU eager mode

* minor bugfix

* minor bugfix, test=kunlun

* correct copyright info

* 1. remove unsed vars/funcs
2. ProcessGroupBKCL inherit from ProcessGroupStream

* bugfix for fp16 in eager mode multi-card, test=kunlun

* rebase & fix a few issues

* use new processgroup interface, test=kunlun

* fix compile issue, test=kunlun

3b91f8f3

C

support pow_triple_grad op (#47799) · 7964119b
由 Charles-hit 提交于 11月 10, 2022

7964119b

09 11月, 2022 12 次提交

[PHI decoupling] remove "paddle/fluid/platform/dynload/xxx.h" in phi (#47787) · 7c302538

由 huangjiyi 提交于 11月 09, 2022

* rm "paddle/fluid/platform/dynload/cudnn.h" in phi

* rm "paddle/fluid/platform/dynload/mklml.h" in phi

* rm "paddle/fluid/platform/dynload/rocblas.h" in phi

* replace "paddle::platform::dynload::" with "phi::dynload::" in phi

* revert "blas_impl.cu.h"

7c302538

[PHI decoupling] remove framework/data_type.h from phi (#47776) · 1631836f

由 Wang Xin 提交于 11月 09, 2022

* remove framework/data_type.h from phi

* fix CI fail: map proto::VarType to phi::DataType

* refactor code to add more detailed comments

1631836f

J

fix for missing reorders in profiling (#47777) · a97b3630
由 jakpiase 提交于 11月 09, 2022

a97b3630

Final changes to introduce mem_desc to be hold in Tensor (#46768) · 14f261ad

由 Jacek Czaja 提交于 11月 09, 2022

* first commit

- more fixes

- compilation fix

- compilation fix

- fix

- another fix

- yet another fix

- Fix

- fix to fused ops

- compilation fix

- compilation fix

- another compilation fix

- another fix

- fix

- fix

- fix

- fix

- yet another fix

- fix

- fix

- cosmetic fix

:- lint

- Revert some changes (to be brought back later)

- fix to build

- Added prototype of slice

- fix

compilation fix

- compilation fix

- fix

- fix

- Fix

- fix

 fix
	modified:   cmake/flags.cmake

* lint

* rerun of CI

* - Fix

* - lint

* - lint2

14f261ad

H

rm "paddle/fluid/platform/dynload/cublas.h" in phi (#47778) · 692a9632
由 huangjiyi 提交于 11月 09, 2022

692a9632
Z
Generate static graph code for some ops by yaml (part2) (#47752) · ccb47076
由 zyfncg 提交于 11月 09, 2022
```
* generate static graph code of some op

* polish code

* fix bug

* update default value
```
ccb47076
H

rm #include "paddle/fluid/framework/data_layout.h" in phi (#47770) · fd80288e
由 huangjiyi 提交于 11月 09, 2022

fd80288e
C

add sin triple grad operator (#47753) · 267b218f
由 cyber-pioneer 提交于 11月 09, 2022

267b218f

[PHI decoupling] Move fluid op generator into fluid (#47714) · f369b2b1

由 Chen Weihang 提交于 11月 09, 2022

* move fluid op generator into fluid

* remove parsed op

* resolve sig undef error

* append python interp find logic

* remove dup code

f369b2b1

Q
Revert "[NPU] add more attrs into npu storiages, test=develop (#47645)" (#47751) · 87d97246
由 Qi Li 提交于 11月 09, 2022
```
This reverts commit 1568d64f.
```
87d97246

fix ScaleKernel configuration error where input numel is 0 (#47111) · 38ba5f2e

由 FlyingQianMM 提交于 11月 09, 2022

* fix scale kernel configuration error where input numel is 0

* fix code stype

* add unit test case for scale op when numel of input x is zero

* fix ci codestyle check

* add cpu and gpu unit test case for scale op when numel of input x is zero

* add uninitialized judgment for input of scale

38ba5f2e

Z

[Sparse]optimize sparse convolution and fix MaskHelper bug (#47703) · 1aa64d13
由 zhangkaihuo 提交于 11月 09, 2022

1aa64d13

08 11月, 2022 11 次提交
- R
  
  [CustomDevice] fix the not ready kernel can not register. (#47758) · 4b0f1b0c
  由 ronnywang 提交于 11月 08, 2022
  
  4b0f1b0c
- [Zero-Dim] support input 0D Tensor for sundary api (#47734) · 3198af20
  由 zhouweiwei2014 提交于 11月 08, 2022
```
* [Zero-Dim] support input 0D Tensor for sundary api

* fix comment
```
  3198af20
- Z
  
  add adadelta op for xpu, test=kunlun (#47661) · 047971f0
  由 zhangyikun02 提交于 11月 08, 2022
  
  047971f0
- Z
  
  argsort support n > 16384 and add argsort_grad op for xpu, test=kunlun (#47701) · 6a6a3ff1
  由 zhangyikun02 提交于 11月 08, 2022
  
  6a6a3ff1
- L
  
  Fix bug of abs_double_grad in eager mode for kunlun, test=kunlun (#47722) · aba3c806
  由 Leo Guo 提交于 11月 08, 2022
  
  aba3c806
- N
  [CodeStyle][py2][U004] unecessary explicit `object` inheritance in class definition (#47642) · 888272b5
  由 Nyakku Shigure 提交于 11月 08, 2022
```
* [CodeStyle][py2][U004] unecessary explicit `object` inheritance in class definition

* fix an increment
```
  888272b5
- P
  Split quant (#47449) · 130db92a
  由 Paulina Gacek 提交于 11月 08, 2022
```
* Split kernel registered, tests for uint/int added

* Split quantized

* Split output scales calculated only once

* NearestInterp test fix reversed

* DequantizeOutputs corrected
```
  130db92a
- J
  removing dependent to fluid/framework/eigen.h in phi (#47675) · c7cd8d98
  由 jzhang533 提交于 11月 08, 2022
```
* removing dependent to fluid/framework/eigen.h in phi

* more fix according to PR-CI-Py3 fail
```
  c7cd8d98
- C
  support pow double grad op (#47691) · 6fe9dfb2
  由 Charles-hit 提交于 11月 08, 2022
```
* support pow_double_grad op

* add unit test for pow double grad

* fix pow double grad

* optimize pow double grad kernel

* fix pow double grad kernel
```
  6fe9dfb2
- W
  
  remove <fluid/eager/api/utils/global_utils.h> from phi (#47739) · 42d9fe2f
  由 Wang Xin 提交于 11月 08, 2022
  
  42d9fe2f
- C
  
  normalize autotune tests dir (#47726) · 6bab3343
  由 Chen Weihang 提交于 11月 08, 2022
  
  6bab3343
07 11月, 2022 5 次提交
- Y
  Define ConvRunner to wrapper the call of cudnn conv functions. (#47576) · c331e2ce
  由 Yiqun Liu 提交于 11月 07, 2022
```
* Define ConvRunner to wrapper the call of cudnn conv functions.

* Use ConvKind in SearchAlgorithm.
```
  c331e2ce
- Q
  support kldiv_loss/kldiv_loss_grad for kunlun (#47638) · 5f0a8adc
  由 QingshuChen 提交于 11月 07, 2022
```
*test=kunlun
```
  5f0a8adc
- Y
  add roll and roll_grad kernels and strided_slice and strided_slice_grad... · 5a4d2186
  由 ykkk2333 提交于 11月 07, 2022
```
add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun (#47368)

* add stat tool

* add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun
```
  5a4d2186
- S
  [PHI] Migrate batch_norm (#47652) · 2337e609
  由 Sławomir Siwek 提交于 11月 07, 2022
```
* init changes

* bnorm

* method signature

* change order

* bnorm

* removed unused args
```
  2337e609
- S
  [PHI] Migrate depthwise_conv2d_grad and conv3d_grad kernels (#47686) · b0c38568
  由 Sławomir Siwek 提交于 11月 07, 2022
```
* remove fwd funcs

* migrate conv grads
```
  b0c38568
04 11月, 2022 9 次提交

Q
[NPU] add more attrs into npu storiages, test=develop (#47645) · 1568d64f
由 Qi Li 提交于 11月 04, 2022
```
* [NPU] add more attrs into npu storiages, test=develop

* rename to storage_properties_initialized
```
1568d64f
Z
Generate static graph code for some activation ops by Yaml (part3) (#47640) · 40cd5271
由 zyfncg 提交于 11月 04, 2022
```
* generate static graph code for some activation op

* fix bug

* fix infermeta of selected_rows
```
40cd5271

Add sin double grad operator. (#47543) · 297f5efe

由 cyber-pioneer 提交于 11月 04, 2022

* add sin double grad operator

* add sin double grad test example

* move sindoublegradopmaker to backward.yaml

* fix sindoublegrad code

* simplify sindoublegrad functor

297f5efe

[XPU] add cumsum op. test=kunlun (#47585) · ac2a94c7

由 houj04 提交于 11月 04, 2022

* [XPU] add cumsum op. test=kunlun

* try to fix linker. test=kunlun

* try to fix linker. test=kunlun

* try to fix linker. test=kunlun

* debug. test=kunlun

* update xpu.cmake. remove unnecessary codes. test=kunlun.

ac2a94c7

P

add cudnn error (#47666) · eb9e4601
由 pangyoki 提交于 11月 04, 2022

eb9e4601
S

migrate convs (#47658) · 4a4f3f80
由 Sławomir Siwek 提交于 11月 04, 2022

4a4f3f80

[PHI] Migrate pool2d and pool2d_grad kernels (#47423) · ca4bed7b

由 Piotr Paturej 提交于 11月 04, 2022

* add extra attr property set

* add type_info for all context

* add onednn context to all context

* fix context compile error

* simplify conv kernel args

* pass runtime attr into dev_ctx

* fix marco error

* clear conv_grad_kernel extra args

* merge conv_grad_grad into conv_grad

* clear conv2d_grad_grad extra attrs

* clear yaml and eager extra attr

* fix conv1d error

* change to thread local

* fix npu compile failed

* try to fix windows compile failed

* add conv2d onednn phi kernel

* fix ci bugs (#36)

* fix compile bugs (#38)

* fix extra input transform bug (#39)

* support dynamic created attr (#40)

* reset extra info gen code

* rm conv_grad_grad kernel

* reimpl pass attr adapting

* add int attr support

* remove vector inputnames creating

* fix map at error

* Update paddle/phi/kernels/onednn/conv_grad_kernel.cc
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>

* remove useless extra attrs

* replace mkldnn_engine by onednn_engine

* Migrate pool+grad to PHI

* Update paddle/fluid/operators/mkldnn/test_mkldnn_op_nhwc.cc
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>

* Update paddle/phi/kernels/onednn/pool_grad_kernel.cc
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>

* Update paddle/phi/kernels/onednn/pool_kernel.cc
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
Co-authored-by: NChen Weihang <chenweihang@baidu.com>
Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
Co-authored-by: NChen Weihang <chenwhpro@163.com>
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>

ca4bed7b

[PHI] Migrate softplus kernel (#47406) · 1831919f

由 Sławomir Siwek 提交于 11月 04, 2022

* add extra attr property set

* add type_info for all context

* add onednn context to all context

* fix context compile error

* simplify conv kernel args

* pass runtime attr into dev_ctx

* fix marco error

* clear conv_grad_kernel extra args

* merge conv_grad_grad into conv_grad

* clear conv2d_grad_grad extra attrs

* remove redundant imports

* migrate softmax

* clear yaml and eager extra attr

* fix conv1d error

* change to thread local

* fix npu compile failed

* try to fix windows compile failed

* add conv2d onednn phi kernel

* fix ci bugs (#36)

* fix compile bugs (#38)

* fix extra input transform bug (#39)

* support dynamic created attr (#40)

* reset extra info gen code

* rm conv_grad_grad kernel

* reimpl pass attr adapting

* add int attr support

* remove vector inputnames creating

* merge dev

* fix map at error

* adjust attribute

* adapt funcs to PHI

* init

* adjust imports

* support postops

* format codeblocks

* revert changes to softmax
Co-authored-by: NChen Weihang <chenweihang@baidu.com>
Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>

1831919f

Y

fix deepfm and deep_wide bug, add embedding_sparse_grad kernel, test=kunlun (#47365) · f53e920d
由 ykkk2333 提交于 11月 04, 2022

f53e920d

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功