提交 · 138bdf40e5e9b9830cb73730c53678748591d6a2 · PaddlePaddle / Paddle

29 8月, 2023 1 次提交
- C
  [clang-tidy] No.26,27 enable misc-unused-using-decls,misc-unused-alias-decls (#56485) · 138bdf40
  由 cyberslack_lee 提交于 8月 29, 2023
```
* fix

* fix
```
  138bdf40
09 8月, 2023 1 次提交
- X
  [oneDNN]rename macro to PADDLE_WITH_DNNL (#52208) · 6ff4c130
  由 Xinyu Chen 提交于 8月 09, 2023
```
* onednn: rename macro to PADDLE_WITH_DNNL

* onednn: rename macro to CINN_WITH_DNNL
```
  6ff4c130
04 8月, 2023 1 次提交
- Z
  
  [clang-tidy] NO.12 enable modernize-use-nullptr check(#55800) · 1e4f627d
  由 Zhenghai Zhang 提交于 8月 04, 2023
  
  1e4f627d
03 8月, 2023 1 次提交
- W
  
  [clang-tidy] [No.4] enable `modernize-loop-convert` (#55704) · 81ccd99e
  由 Wang Xin 提交于 8月 03, 2023
  
  81ccd99e
31 7月, 2023 1 次提交

rename BatchNormGradFunctor (#55717) · eee4b8fb

由 zhangyuqin1998 提交于 7月 31, 2023

* rename BatchNormGradFunctor

* Update batch_norm_grad_kernel.cc

* Update batch_norm_grad_kernel.cu

* Update batch_norm_grad_kernel.cc

* fix

* Update batch_norm_grad_kernel.cc

eee4b8fb

26 7月, 2023 1 次提交
- G
  
  add modernize-redundant-void-arg check (#55652) · 12fb18dd
  由 gouzil 提交于 7月 26, 2023
  
  12fb18dd
19 7月, 2023 1 次提交

delete relu6_raw (#55383) · 56d46ccc

由 zhangyuqin1998 提交于 7月 19, 2023

* delete relu6_raw

* fix codestyle

* Update test_mkldnn_matmul_activation_fuse_pass.py

* fix

* Update backward.yaml

* Update ops.yaml

* Update backward.yaml

56d46ccc

12 7月, 2023 2 次提交

[ONEDNN] Upgrade oneDNN version to v3.1 (#52463) · cfa513f7

由 YangQun 提交于 7月 12, 2023

* squash pick the poc code
* fix build after rebase
* fix int8 conv and fc uts
* Fix and clean-up Get_SRC_Scale_Memory
* fix floating point fc uts
* fix test_analyzer_int8_googlenet
* test_analyzer_int8_mobilenetv1
* fix int8 mobilenet v2 and v3
* fix build error after rebase
* [oneDNN] rename library version
* fix conv bias datatype
* try to fix import error
* fix rebase error
* [oneDNN] pack library into python wheel
* add MKLDNN_SHARED_LIB_3 to env_dict
* fix test_analyzer_bert
* fix fill_constant op kernel
* fix ernie and matmul op ut
* fix softplus ut
* fix conv+relu6 fusion ut
* fix hardswish fusion
* fix quant+transpose fusion ut
* fixsgd ut
* fix int8 matmul with flatten
* fix fc+scale fusion
* fix conv/matmul+gelu fusion uts
* fix rebase error
* Revert "fix conv/matmul+gelu fusion uts"
This reverts commit 47eb5e49972bd8f7271a233def9bfb3e98ce78e1.
* upgrade to onednn v3.1
* remove older version onednn
* use densetensor::data() for achieving mean and var in layernorm impl
* comments for atol of integer tests
* fix clang-format
* Revert "remove older version onednn"
This reverts commit 783e57ddfd4401254596eae7d47adb9b03590c09.
* improve binary handle
* fix expand kernel
* Revert "use densetensor::data() for achieving mean and var in layernorm impl"
* always use forward_inference for conv
* remove activation scales
* rollback changes to mkldnn.cmake
* address comments
* port changes to dequantize kernel
* fix merge error
* fix fused_elementwise_kernel
* upgrade onednn version to v3.1.1
* fix some approval error
* fix error msg format
* remove old onednn libs
* try to fix symbolic link issue
* fix cinn test case segfault
* do not explicit link test with onednn
* remove unnecessary changes
* integrate CINN with onednn v3
* link with mkldnn project
* fix cinn build file

---------
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
Co-authored-by: NChen, Xinyu1 <xinyu1.chen@intel.com>
Co-authored-by: Ntianshuo78520a <707759223@qq.com>

cfa513f7

[clang-tidy] enable `readability-container-size-empty` check (#55279) · be3a6fa7

由 Wang Xin 提交于 7月 12, 2023

* [clang-tidy] enable readability-container-size-empty check

* fix test_custom_kernel Failed

* add clang-tid-10 in dockerfile

* add clang-tidy in dockerfile

* fix bug

be3a6fa7

27 6月, 2023 1 次提交
- Z
  delete swish_raw (#54536) · 0cdaafea
  由 zhangyuqin1998 提交于 6月 27, 2023
```
* delete swish_raw

* fix

* Update activation_kernel.cc

* fix
```
  0cdaafea
20 6月, 2023 1 次提交

static graph autogen code support for matmul op (#54338) · ad80fbfe

由 Wang Xin 提交于 6月 20, 2023

* static graph autogen code support for matmul op

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

ad80fbfe

09 6月, 2023 1 次提交

Auto generate code for elementwise_max (#54412) · 57564bdf

由 lzydev 提交于 6月 09, 2023

* auto generate code for elementwise_max

* auto generate code for elementwise_max

* fix composite ops

* fix bug of fmax

57564bdf

05 6月, 2023 2 次提交
- G
  
  [static op generation] pool2d, pool3d (#54070) · 30881647
  由 gouzil 提交于 6月 05, 2023
  
  30881647
- H
  Support code generation for op conv2d_transpose, conv3d_transpose,... · 1075d35d
  由 huangjiyi 提交于 6月 05, 2023
```
Support code generation for op conv2d_transpose, conv3d_transpose, depthwise_conv2d_transpose (#54242)
```
  1075d35d
02 6月, 2023 1 次提交
- W
  static graph autogen code for shape op (#54221) · f5342918
  由 Wang Xin 提交于 6月 02, 2023
```
* static graph autogen code for shape op

* fix onednn

* fix onednn
```
  f5342918
01 6月, 2023 1 次提交

Support static graph code generation for conv2d, conv3d, depthwise_conv2d (#54201) · f3eccb3f

由 huangjiyi 提交于 6月 01, 2023

* update

* update cmake

* update

* update

* update

* update

* Revert "update cmake"

This reverts commit 1e1dc1b2bc9967b725201272607f939260070fd4.

* update

* update

* update

* update

f3eccb3f

24 5月, 2023 1 次提交
- Z
  
  move reduce raw kernels to legacy (#53961) · f488e3fd
  由 zhangyuqin1998 提交于 5月 24, 2023
  
  f488e3fd
23 5月, 2023 2 次提交
- W
  
  Enabel memory optimize pass although MkLDNN is enabled (#53615) · 5996f623
  由 weishengying 提交于 5月 23, 2023
  
  5996f623
- W
  static graph autogen code support for pad3d op (#53733) · bcf67536
  由 Wang Xin 提交于 5月 23, 2023
```
* static graph autogen code support for pad3d op

* bug fixed

* add ut for pad3d mkldnn op

* fix coverage

* fix bug

* fix bug

* Delete test_pad3d_mkldnn_op.py
```
  bcf67536
19 5月, 2023 2 次提交
- G
  
  test,test=develop (#53839) · c174aa22
  由 Galaxy1458 提交于 5月 19, 2023
  
  c174aa22
- G
  
  test,test=develop (#53818) · 63ffd733
  由 Galaxy1458 提交于 5月 19, 2023
  
  63ffd733
18 5月, 2023 1 次提交

Fused elementwises kernels and ops (#51427) · fb4a6ecf

由 Hulek 提交于 5月 18, 2023

* Fused elementwises kernels and ops

* change fuse pass name

* adjust .pbtxt files

* adjust quantization attributes

* add missing arguments and fix others, review fixed

* simplify fused kernel registration

* fix elementwise unit tests

* reuse one fused elementwise op

* adjust proto

* Add supported datatypes

* Change 'Scale' to 'scale' in tests, change some tests to onednn

* Revert breaking changes

* Fix unit tests

* Delete obsolete test cases

* Delete commented out code

* Fix codestyle

* delete temporary condition

* fix conflicts and delete duplicate fusing

* Fix code after merge

* Move tests to new directory

* fix tests volatility

* Rename test_elementwise_add_onednn_op.py to test_elementwise_add_mkldnn_op.py

* Update CMakeLists.txt add mkldnn op test

---------
Co-authored-by: NSilv3S <slawomir.siwek@intel.com>

fb4a6ecf

15 5月, 2023 3 次提交
- H
  move dequantize kernel to phi (#53739) · efd410c8
  由 huangjiyi 提交于 5月 15, 2023
```
* update

* fix bug

* fix output type def
```
  efd410c8
- G
  remove some [-Wunsed-parameter] warning (#53689) · 3e1fffea
  由 Galaxy1458 提交于 5月 15, 2023
```
* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop
```
  3e1fffea
- G
  remove some [-Wunused-paramter]warning (#53681) · 96188fc1
  由 Galaxy1458 提交于 5月 15, 2023
```
* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop
```
  96188fc1
11 5月, 2023 1 次提交

remove some [-Wunused-parameter] warning (#53683) · dbb62692

由 Galaxy1458 提交于 5月 11, 2023

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

dbb62692

26 4月, 2023 1 次提交

remove some [-Wunused-parameter] waring (#53319) · f9e5072b

由 Galaxy1458 提交于 4月 26, 2023

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

f9e5072b

24 4月, 2023 2 次提交
- [Zero-Dim] Support paddle.max output 0D, test=allcase (#53242) · 9f9cd919
  由 zhouweiwei2014 提交于 4月 24, 2023
  
  9f9cd919
- Y
  [Zero-Dim] support 0d tensor for shape and squeeze onednn kernel (#52832) · c0a604e7
  由 YangQun 提交于 4月 24, 2023
```
* support 0d tensor for shape and squeeze onednn kernel

* set python api for shape op ut
```
  c0a604e7
17 4月, 2023 1 次提交
- Z
  
  rename_SliceKernel (#52863) · d2b0d63f
  由 zhangyuqin1998 提交于 4月 17, 2023
  
  d2b0d63f
14 4月, 2023 2 次提交

[Zero-Dim] support 0-D tensor for... · 6f41e177

由 YangQun 提交于 4月 14, 2023

[Zero-Dim] support 0-D tensor for reduce/reshape/stack/prelu/expand_v2/gaussion onednn kernels (#52185)

* support 0-D tensor for reduce/reshape/stack/prelu/expand_v2/gaussion ops

* fix gaussian random mkldnn op ut

6f41e177

Z

delete unused param from swish_grad and relu6_grad (#52805) · 54e4360a
由 zhangyuqin1998 提交于 4月 14, 2023

54e4360a

13 4月, 2023 2 次提交

[enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h (#52651) · 5664ea26

由 HongyuJia 提交于 4月 13, 2023

* [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h

* Add logging.h for profiler.cc

* Add logging.h for gloo_utils.h

* Add logging.h for addmm_kernel_impl.h

* Add logging.h for addmm_grad_kernel_impl.h

* Add logging.h for p_send_kernel.cu

* Add logging.h for determinant_grad_kernel_impl.h

* Add logging.h for p_recv_kernel.cu

* Add logging.h for elementwise_grad_base.h

* Add logging.h for transfer_layout_kernel.cc

* Add logging.h for eigvals_kernel.cc and index_select_impl.h

* Add logging.h for all files in kernel directory

* Add logging.h for xpu_info.cc

* Add logging.h for xpu

5664ea26

Z

rename_bilinear_tensor_op (#52745) · eb93b5c9
由 zhangyuqin1998 提交于 4月 13, 2023

eb93b5c9

06 4月, 2023 1 次提交

Remove oneDNN-specific attributes from matmul (#49444) · 4d97b25d

由 Sławomir Siwek 提交于 4月 06, 2023

* replace matmul with matmul_v2 in fuse passes

* Remove fusion logic from matmul

* removing fusion methods

* add proper name

* adjust namespaces

* clean attrs in python tests

* delete checkpoint and restore matmul version

* remove unused code

* matmul and reshape/transpose fuses migrated

* split MatmulOneDNN headers

* fuse activation and eltwise_add

* add fuse_activation

* matmul_transpose_reshape/reshape_transpose_matmul

* matmul + elementwise_add (fused)

* activation temporary modifciation

* restore matmul(v1) version 0

* merge newest develop

* remove depedency from other PR

* revert pbtxt

* remove placeholders from matmul_v2

* add description in OPMaker

* remove matmul_v2_op.h and all depedencies

* remove dims changing in base op

* add possibility to fuse already fused_matmul

* restart broken CI

* Empty-Commit

* revert matmul_utils.h

* codestyle

* adjust imports

* add pbtxt file

* 100% matmul unit tests coverage

* trigger CI with minimal changes to develop

* adjust changes to develop

* add fused_matmul op

* inherit base ops

* add "v2"

* move OPMaker

* Gradually add fused_matmul files

* second batch of fused_matmul changes

* split infershapes of matmul_v2 and fused_matmul

* merge code from other PR

* 2023

* inherit fused_matmul from matmul_v2

* Update paddle/phi/backends/onednn/onednn_reuse.h
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

* Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

* resolve conflicts

* codestyle

* simplify isgemmlinear

* 2023

* remove import

* reuse methods

* matmul_v2_mkldnn cleanup

* simplify ExecuteMatMulV1Grad

* matmul refactored

* fc

* SetOutMemDescWithLogicalLayoutFusesSupport

* matmul_v2

* alpha support

* group repetetive funcs

* matmul utils

* execute matmul methods

* restore registered kernel names

* split header and impl files

* remove double negatives

* reduce numer of modified files

* adjust ExecuteMatmul

* add scales for ut

* dates

* limit number of modified files

* fluid imports

* remove alpha

* codestyle

---------
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

4d97b25d

04 4月, 2023 1 次提交

Improve new executor static build (#51149) · 5bac67d4

由 Ruibiao Chen 提交于 4月 04, 2023

* Improve new executor static build

* Skip GC for static build

* Skip infershape for static build

* Handle read_op

* Add fused_attention to OpsWithFluidKernelNeedMoveToPhi

* Fix argsort typos

* Add sequence_pool to OpsWithFluidKernelNeedMoveToPhi

* Fix skip share lod errors

* Fix errors for adam

* Fix errors for eigvals, memcpy and fake_quantize

* Add static_build.cc

* Add black list

* Fix CI errors

* Fix CI errors

* Fix CI errors

* Fix TensorArray

* Fix TensorArray

* Add update_loss_scaling to OpsNeedSetOutputDtypeWhenRegisterPhiKernel

* Fix copy

* Fix errors

* Fix momentum

* Skip mkldnn

* Fix CI errors

* Fix c_sync_calc_stream_op

* Fix CINN

* Fix while op

* All CI pass, disable FLAGS to merge code, enable it after more tests in future

* Add UTs

* Fix typos

* Fix typos

* Add mkldnn UT

* Remove mkldnn test

* Fix typos

* Fix dist test

* Fix typos

* Fix CI errors

* Fix CI errors

* Add UTs

* Fix typos

* Fix typos

* Add sparse tests

* ToComplexType -> ToComplex

* Add test_matmul_op_static_build to disable_win_inference_test

5bac67d4

29 3月, 2023 1 次提交

[AMP OP&Test] pad3d add unittests of fp16 and bf16 (#51015) · f86d0be7

由 zengshao0622 提交于 3月 29, 2023

* pad3d add unittests of fp16 and bf16

* pad3d add unittests of fp16 and bf16

* fix cuda place

* fix random to uniform

* fix class name

* fix fp16 max relative error to 1.5e-3

* add dytpe register for onednn

* add pad uint16 check of common.py

* remove check_eager

* test_check_grad --> test_check_grad_normal

f86d0be7

27 3月, 2023 2 次提交

X

elementwise: onednn: support zero dimension inputs (#51656) · 2c1d494e
由 Xinyu Chen 提交于 3月 27, 2023

2c1d494e

Fused elementwise_(mul/div) (#50428) · 968f7f24

由 Sławomir Siwek 提交于 3月 27, 2023

* extract Op and OPMaker to .h

* extend pattern for fused_op

* set "with_residual" default to false

* adjust fuse passes

* remove fc+eltwise flag

* fused_output_scale

* activation attrs

* remove extra attrs

* fix int8/bf16 unit tests

* simplify RecomputeOutputDims

* remove unused method

* Add description for attributes

* add extra check

* adjust op compats

* update quantize test

* fix protobuf parsing error

* fix int8 performance

* fused elementwises

* merge develop

* remove activation

* restore activation for existing add/sub ops

968f7f24

22 3月, 2023 1 次提交

[Zero-Dim] Support 0-D tensor for some oneDNN unary kernels (#51687) · 2a3d75bc

由 YangQun 提交于 3月 22, 2023

* support 0-d tensor for element wise unary ops

* fix python code style check

* fix approval check

* support 0-d tensor for onednn softmax and logsoftmax kernels

* fix commnets

* fix some unittests

2a3d75bc

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功