提交 · be3a6fa7e494df3871685c13cfc0c19a87c86083 · PaddlePaddle / Paddle

12 7月, 2023 1 次提交

[clang-tidy] enable `readability-container-size-empty` check (#55279) · be3a6fa7

由 Wang Xin 提交于 7月 12, 2023

* [clang-tidy] enable readability-container-size-empty check

* fix test_custom_kernel Failed

* add clang-tid-10 in dockerfile

* add clang-tidy in dockerfile

* fix bug

be3a6fa7

27 6月, 2023 1 次提交
- Z
  delete swish_raw (#54536) · 0cdaafea
  由 zhangyuqin1998 提交于 6月 27, 2023
```
* delete swish_raw

* fix

* Update activation_kernel.cc

* fix
```
  0cdaafea
20 6月, 2023 1 次提交

static graph autogen code support for matmul op (#54338) · ad80fbfe

由 Wang Xin 提交于 6月 20, 2023

* static graph autogen code support for matmul op

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

ad80fbfe

09 6月, 2023 1 次提交

Auto generate code for elementwise_max (#54412) · 57564bdf

由 lzydev 提交于 6月 09, 2023

* auto generate code for elementwise_max

* auto generate code for elementwise_max

* fix composite ops

* fix bug of fmax

57564bdf

05 6月, 2023 2 次提交
- G
  
  [static op generation] pool2d, pool3d (#54070) · 30881647
  由 gouzil 提交于 6月 05, 2023
  
  30881647
- H
  Support code generation for op conv2d_transpose, conv3d_transpose,... · 1075d35d
  由 huangjiyi 提交于 6月 05, 2023
```
Support code generation for op conv2d_transpose, conv3d_transpose, depthwise_conv2d_transpose (#54242)
```
  1075d35d
02 6月, 2023 1 次提交
- W
  static graph autogen code for shape op (#54221) · f5342918
  由 Wang Xin 提交于 6月 02, 2023
```
* static graph autogen code for shape op

* fix onednn

* fix onednn
```
  f5342918
01 6月, 2023 1 次提交

Support static graph code generation for conv2d, conv3d, depthwise_conv2d (#54201) · f3eccb3f

由 huangjiyi 提交于 6月 01, 2023

* update

* update cmake

* update

* update

* update

* update

* Revert "update cmake"

This reverts commit 1e1dc1b2bc9967b725201272607f939260070fd4.

* update

* update

* update

* update

f3eccb3f

24 5月, 2023 1 次提交
- Z
  
  move reduce raw kernels to legacy (#53961) · f488e3fd
  由 zhangyuqin1998 提交于 5月 24, 2023
  
  f488e3fd
23 5月, 2023 2 次提交
- W
  
  Enabel memory optimize pass although MkLDNN is enabled (#53615) · 5996f623
  由 weishengying 提交于 5月 23, 2023
  
  5996f623
- W
  static graph autogen code support for pad3d op (#53733) · bcf67536
  由 Wang Xin 提交于 5月 23, 2023
```
* static graph autogen code support for pad3d op

* bug fixed

* add ut for pad3d mkldnn op

* fix coverage

* fix bug

* fix bug

* Delete test_pad3d_mkldnn_op.py
```
  bcf67536
19 5月, 2023 2 次提交
- G
  
  test,test=develop (#53839) · c174aa22
  由 Galaxy1458 提交于 5月 19, 2023
  
  c174aa22
- G
  
  test,test=develop (#53818) · 63ffd733
  由 Galaxy1458 提交于 5月 19, 2023
  
  63ffd733
18 5月, 2023 1 次提交

Fused elementwises kernels and ops (#51427) · fb4a6ecf

由 Hulek 提交于 5月 18, 2023

* Fused elementwises kernels and ops

* change fuse pass name

* adjust .pbtxt files

* adjust quantization attributes

* add missing arguments and fix others, review fixed

* simplify fused kernel registration

* fix elementwise unit tests

* reuse one fused elementwise op

* adjust proto

* Add supported datatypes

* Change 'Scale' to 'scale' in tests, change some tests to onednn

* Revert breaking changes

* Fix unit tests

* Delete obsolete test cases

* Delete commented out code

* Fix codestyle

* delete temporary condition

* fix conflicts and delete duplicate fusing

* Fix code after merge

* Move tests to new directory

* fix tests volatility

* Rename test_elementwise_add_onednn_op.py to test_elementwise_add_mkldnn_op.py

* Update CMakeLists.txt add mkldnn op test

---------
Co-authored-by: NSilv3S <slawomir.siwek@intel.com>

fb4a6ecf

15 5月, 2023 3 次提交
- H
  move dequantize kernel to phi (#53739) · efd410c8
  由 huangjiyi 提交于 5月 15, 2023
```
* update

* fix bug

* fix output type def
```
  efd410c8
- G
  remove some [-Wunsed-parameter] warning (#53689) · 3e1fffea
  由 Galaxy1458 提交于 5月 15, 2023
```
* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop
```
  3e1fffea
- G
  remove some [-Wunused-paramter]warning (#53681) · 96188fc1
  由 Galaxy1458 提交于 5月 15, 2023
```
* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop
```
  96188fc1
11 5月, 2023 1 次提交

remove some [-Wunused-parameter] warning (#53683) · dbb62692

由 Galaxy1458 提交于 5月 11, 2023

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

dbb62692

26 4月, 2023 1 次提交

remove some [-Wunused-parameter] waring (#53319) · f9e5072b

由 Galaxy1458 提交于 4月 26, 2023

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

f9e5072b

24 4月, 2023 2 次提交
- [Zero-Dim] Support paddle.max output 0D, test=allcase (#53242) · 9f9cd919
  由 zhouweiwei2014 提交于 4月 24, 2023
  
  9f9cd919
- Y
  [Zero-Dim] support 0d tensor for shape and squeeze onednn kernel (#52832) · c0a604e7
  由 YangQun 提交于 4月 24, 2023
```
* support 0d tensor for shape and squeeze onednn kernel

* set python api for shape op ut
```
  c0a604e7
17 4月, 2023 1 次提交
- Z
  
  rename_SliceKernel (#52863) · d2b0d63f
  由 zhangyuqin1998 提交于 4月 17, 2023
  
  d2b0d63f
14 4月, 2023 2 次提交

[Zero-Dim] support 0-D tensor for... · 6f41e177

由 YangQun 提交于 4月 14, 2023

[Zero-Dim] support 0-D tensor for reduce/reshape/stack/prelu/expand_v2/gaussion onednn kernels (#52185)

* support 0-D tensor for reduce/reshape/stack/prelu/expand_v2/gaussion ops

* fix gaussian random mkldnn op ut

6f41e177

Z

delete unused param from swish_grad and relu6_grad (#52805) · 54e4360a
由 zhangyuqin1998 提交于 4月 14, 2023

54e4360a

13 4月, 2023 2 次提交

[enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h (#52651) · 5664ea26

由 HongyuJia 提交于 4月 13, 2023

* [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h

* Add logging.h for profiler.cc

* Add logging.h for gloo_utils.h

* Add logging.h for addmm_kernel_impl.h

* Add logging.h for addmm_grad_kernel_impl.h

* Add logging.h for p_send_kernel.cu

* Add logging.h for determinant_grad_kernel_impl.h

* Add logging.h for p_recv_kernel.cu

* Add logging.h for elementwise_grad_base.h

* Add logging.h for transfer_layout_kernel.cc

* Add logging.h for eigvals_kernel.cc and index_select_impl.h

* Add logging.h for all files in kernel directory

* Add logging.h for xpu_info.cc

* Add logging.h for xpu

5664ea26

Z

rename_bilinear_tensor_op (#52745) · eb93b5c9
由 zhangyuqin1998 提交于 4月 13, 2023

eb93b5c9

06 4月, 2023 1 次提交

Remove oneDNN-specific attributes from matmul (#49444) · 4d97b25d

由 Sławomir Siwek 提交于 4月 06, 2023

* replace matmul with matmul_v2 in fuse passes

* Remove fusion logic from matmul

* removing fusion methods

* add proper name

* adjust namespaces

* clean attrs in python tests

* delete checkpoint and restore matmul version

* remove unused code

* matmul and reshape/transpose fuses migrated

* split MatmulOneDNN headers

* fuse activation and eltwise_add

* add fuse_activation

* matmul_transpose_reshape/reshape_transpose_matmul

* matmul + elementwise_add (fused)

* activation temporary modifciation

* restore matmul(v1) version 0

* merge newest develop

* remove depedency from other PR

* revert pbtxt

* remove placeholders from matmul_v2

* add description in OPMaker

* remove matmul_v2_op.h and all depedencies

* remove dims changing in base op

* add possibility to fuse already fused_matmul

* restart broken CI

* Empty-Commit

* revert matmul_utils.h

* codestyle

* adjust imports

* add pbtxt file

* 100% matmul unit tests coverage

* trigger CI with minimal changes to develop

* adjust changes to develop

* add fused_matmul op

* inherit base ops

* add "v2"

* move OPMaker

* Gradually add fused_matmul files

* second batch of fused_matmul changes

* split infershapes of matmul_v2 and fused_matmul

* merge code from other PR

* 2023

* inherit fused_matmul from matmul_v2

* Update paddle/phi/backends/onednn/onednn_reuse.h
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

* Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

* resolve conflicts

* codestyle

* simplify isgemmlinear

* 2023

* remove import

* reuse methods

* matmul_v2_mkldnn cleanup

* simplify ExecuteMatMulV1Grad

* matmul refactored

* fc

* SetOutMemDescWithLogicalLayoutFusesSupport

* matmul_v2

* alpha support

* group repetetive funcs

* matmul utils

* execute matmul methods

* restore registered kernel names

* split header and impl files

* remove double negatives

* reduce numer of modified files

* adjust ExecuteMatmul

* add scales for ut

* dates

* limit number of modified files

* fluid imports

* remove alpha

* codestyle

---------
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

4d97b25d

04 4月, 2023 1 次提交

Improve new executor static build (#51149) · 5bac67d4

由 Ruibiao Chen 提交于 4月 04, 2023

* Improve new executor static build

* Skip GC for static build

* Skip infershape for static build

* Handle read_op

* Add fused_attention to OpsWithFluidKernelNeedMoveToPhi

* Fix argsort typos

* Add sequence_pool to OpsWithFluidKernelNeedMoveToPhi

* Fix skip share lod errors

* Fix errors for adam

* Fix errors for eigvals, memcpy and fake_quantize

* Add static_build.cc

* Add black list

* Fix CI errors

* Fix CI errors

* Fix CI errors

* Fix TensorArray

* Fix TensorArray

* Add update_loss_scaling to OpsNeedSetOutputDtypeWhenRegisterPhiKernel

* Fix copy

* Fix errors

* Fix momentum

* Skip mkldnn

* Fix CI errors

* Fix c_sync_calc_stream_op

* Fix CINN

* Fix while op

* All CI pass, disable FLAGS to merge code, enable it after more tests in future

* Add UTs

* Fix typos

* Fix typos

* Add mkldnn UT

* Remove mkldnn test

* Fix typos

* Fix dist test

* Fix typos

* Fix CI errors

* Fix CI errors

* Add UTs

* Fix typos

* Fix typos

* Add sparse tests

* ToComplexType -> ToComplex

* Add test_matmul_op_static_build to disable_win_inference_test

5bac67d4

29 3月, 2023 1 次提交

[AMP OP&Test] pad3d add unittests of fp16 and bf16 (#51015) · f86d0be7

由 zengshao0622 提交于 3月 29, 2023

* pad3d add unittests of fp16 and bf16

* pad3d add unittests of fp16 and bf16

* fix cuda place

* fix random to uniform

* fix class name

* fix fp16 max relative error to 1.5e-3

* add dytpe register for onednn

* add pad uint16 check of common.py

* remove check_eager

* test_check_grad --> test_check_grad_normal

f86d0be7

27 3月, 2023 2 次提交

X

elementwise: onednn: support zero dimension inputs (#51656) · 2c1d494e
由 Xinyu Chen 提交于 3月 27, 2023

2c1d494e

Fused elementwise_(mul/div) (#50428) · 968f7f24

由 Sławomir Siwek 提交于 3月 27, 2023

* extract Op and OPMaker to .h

* extend pattern for fused_op

* set "with_residual" default to false

* adjust fuse passes

* remove fc+eltwise flag

* fused_output_scale

* activation attrs

* remove extra attrs

* fix int8/bf16 unit tests

* simplify RecomputeOutputDims

* remove unused method

* Add description for attributes

* add extra check

* adjust op compats

* update quantize test

* fix protobuf parsing error

* fix int8 performance

* fused elementwises

* merge develop

* remove activation

* restore activation for existing add/sub ops

968f7f24

22 3月, 2023 2 次提交

[Zero-Dim] Support 0-D tensor for some oneDNN unary kernels (#51687) · 2a3d75bc

由 YangQun 提交于 3月 22, 2023

* support 0-d tensor for element wise unary ops

* fix python code style check

* fix approval check

* support 0-d tensor for onednn softmax and logsoftmax kernels

* fix commnets

* fix some unittests

2a3d75bc

Extract fused_transpose op dedicated for oneDNN fuse passes (#50021) · 02296977

由 Sławomir Siwek 提交于 3月 22, 2023

* extract common methods to reuse

* add header for transpose ops

* fused_transpose

* Split big function

* transpose2 tests

* fused_transpose

* Apply extra attributes

* add pbtxt file

* update pbtxt

* Merge develop

* add more strict op compats

* code  style

* remove mkldnn_data_type

* unify SetOutMemDescWithReshape2FuseSupport

* adjust quantize-dequantize for transpose

* remove appendact

* transpose2 quantization

* fix int8 tests

* adjust transpose_op to current develop

* delete fusion code from transpose_kernel

* add fused transpose to NHWC unittest

* change order

02296977

21 3月, 2023 1 次提交

[PHI decoupling] Move DataType* from paddle:experimental to phi namespace (#51716) · 4638a62e

由 iSerendipity 提交于 3月 21, 2023

* move DataType from paddle::experimental to phi

* convert namespace

* convert namespace

* convert namespace

* clarify namespace

* convert more datatype

* Revert "convert more datatype"

This reverts commit 083b462959e6a22d4d8767707b628b95b396642e.

* convert more in auto_code_generator

* fix conflicts for XPU

* fix namespace conflicts

* fix errors

* Revert "fix errors"

This reverts commit f9d9958b54ee32141112274c8a5c3c381ab0f876.

* fix errors

* fix formatting

4638a62e

15 3月, 2023 1 次提交
- Z
  Delete hardswish_raw op (#51634) · 3e636ec9
  由 zhangyuqin1998 提交于 3月 15, 2023
```
* Delete hardswish_raw op

* fix ut
```
  3e636ec9
13 3月, 2023 2 次提交

[PHI]Remove OneDNN code in Transpose infershape (#50836) · 5a39365a

由 YuanRisheng 提交于 3月 13, 2023

* remove transpose infershape

* fix ci bugs

* fix ci bugs

* delete transpose infershape

* fix ci bugs

* fix ci bugs

5a39365a

Fused softplus (#51087) · fdcfa04f

由 Sławomir Siwek 提交于 3月 13, 2023

* mkldnn->onednn

* fused softplus op + kernel

* remove extra attributes

* add missing handler

* change var name

fdcfa04f

10 3月, 2023 1 次提交

[New features]Add function node in phi_kernel for MKLDNN (#51073) · a0a6dc6a

由 HappyHeavyRain 提交于 3月 10, 2023

* Add function node in phi_kernel for MKLDNN

* fix the bug in 'BuildInferVarKernelContext'

* add infer_varkernel_utils.cc

* fix the bug:the first two parametes of 'BuildInferVarKernelContext' can't be template variable

* change the code according to first review

* change the code according to first review

* change the mode of paddle_build.sh

* change 'infer_var_kernel_fn_' to 'get_kerneltype_forvar_fn_'

* add the error information

* fix NotFound infomation warning

* fix NotFound infomation warning

* fix NotFound infomation warning

a0a6dc6a

09 3月, 2023 1 次提交
- support ONEDNN 0D for full_kernel (#51265) · cc511f24
  由 zhouweiwei2014 提交于 3月 09, 2023
  
  cc511f24
06 3月, 2023 1 次提交
- 傅
  [AMP OP&Test] add bf16 fp16 type support for interpolate (#51153) · 2f2bf4e8
  由傅剑寒提交于 3月 06, 2023
```
* add bf16 fp16 type support for interpolate

* add bf16 fp16 support for interpolate in phi on cpu
```
  2f2bf4e8

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功