Commits · 4d97b25d1838ec89af4f4e156f9eb004fb314841 · PaddlePaddle / Paddle

06 Apr, 2023 · 11 commits

Remove oneDNN-specific attributes from matmul (#49444) · 4d97b25d

Committed by Sławomir Siwek on Apr 06, 2023

* replace matmul with matmul_v2 in fuse passes

* Remove fusion logic from matmul

* removing fusion methods

* add proper name

* adjust namespaces

* clean attrs in python tests

* delete checkpoint and restore matmul version

* remove unused code

* matmul and reshape/transpose fuses migrated

* split MatmulOneDNN headers

* fuse activation and eltwise_add

* add fuse_activation

* matmul_transpose_reshape/reshape_transpose_matmul

* matmul + elementwise_add (fused)

* activation temporary modification

* restore matmul(v1) version 0

* merge newest develop

* remove dependency from other PR

* revert pbtxt

* remove placeholders from matmul_v2

* add description in OPMaker

* remove matmul_v2_op.h and all dependencies

* remove dims changing in base op

* add possibility to fuse already fused_matmul

* restart broken CI

* Empty-Commit

* revert matmul_utils.h

* codestyle

* adjust imports

* add pbtxt file

* 100% matmul unit tests coverage

* trigger CI with minimal changes to develop

* adjust changes to develop

* add fused_matmul op

* inherit base ops

* add "v2"

* move OPMaker

* Gradually add fused_matmul files

* second batch of fused_matmul changes

* split infershapes of matmul_v2 and fused_matmul

* merge code from other PR

* 2023

* inherit fused_matmul from matmul_v2

* Update paddle/phi/backends/onednn/onednn_reuse.h
Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>

* Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc
Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>

* resolve conflicts

* codestyle

* simplify isgemmlinear

* 2023

* remove import

* reuse methods

* matmul_v2_mkldnn cleanup

* simplify ExecuteMatMulV1Grad

* matmul refactored

* fc

* SetOutMemDescWithLogicalLayoutFusesSupport

* matmul_v2

* alpha support

* group repetitive funcs

* matmul utils

* execute matmul methods

* restore registered kernel names

* split header and impl files

* remove double negatives

* reduce number of modified files

* adjust ExecuteMatmul

* add scales for ut

* dates

* limit number of modified files

* fluid imports

* remove alpha

* codestyle

---------
Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>

4d97b25d

Move fused_attention op to phi [Migrate forward GPU OpKernel] (#51743) · a7ec8958

Committed by Sonder on Apr 06, 2023

* add kernel functions

* update kernel functions

* update func parameters' name

* create codes for gpu device

* adjust file locations

* fix include error

* remove dependent files to phi/

* restore fused_attention_op.cu

* fix dependence errors

* fix dependence errors

* fix include error

* fix all dependence errors[build success]

* remove useless include

* recover useless include

* use phi::ToNCCLDataType

* fix namespace

* update new register code

* fix error in fused_gemm_epilogue_utils

* fix error in FusedAttentionKernel parm

* finish fused_attention register code[build success]

* add paddle::optional

* add sig file

* fix build error

* fix a include error

* update CMakeLists

* fix parameter sequence

* add include file

* update #if before include

* fix grammar error

* update codes for DropoutParam

* remove const cast

* trans some fluid api to phi api

* add #if

* update test code

* update test codes

* recover test codes

* trans fused_attention to fluid

* move #endif to end

* move #endif

* delete useless files

* use fused attention utils and recover random seed

* remove fluid include in phi

a7ec8958

add autogen code support for logical_and, logical_not, logical_or and logical_xor (#52451) · 6df4a667
Committed by scotty on Apr 06, 2023

6df4a667

support auto generate static for assign_value (#52534) · d394c9ed
Committed by RedContritio on Apr 06, 2023

d394c9ed

support auto generate static for decode_jpeg (#52542) · c1f97a9b
Committed by RedContritio on Apr 06, 2023

c1f97a9b

mv PADDLE_WITH_ASCEND_CL (#52535) · 80dd1672
Committed by 张春乔 on Apr 06, 2023

80dd1672

support more custom vjp (#52533) · 29c28e2f
Committed by Jiabin Yang on Apr 06, 2023

29c28e2f

[Retirement of Ascend- and Cambricon-related code] No.9 Clean up PADDLE_WITH_ASCEND-related code (#52403) · 262ea02a
Committed by 陈沧夜 on Apr 06, 2023

262ea02a

support auto generate static for empty (#52524) · 2ad66a42
Committed by RedContritio on Apr 06, 2023

2ad66a42

support auto generate static for randint (#52529) · 535915aa
Committed by RedContritio on Apr 06, 2023
```
* support auto generate static for randint

* move seed from extra to attrs
```
535915aa

support auto generate static for uniform (uniform_random) (#52522) · 838b2c83
Committed by RedContritio on Apr 06, 2023

838b2c83

04 Apr, 2023 · 7 commits

Autogen embedding static graph code (#52460) · 5b7c8f9e
Committed by lzydev on Apr 04, 2023
```
* autogen embedding

* deal

* fix bug in CompatMetaTensor::share_lod
```
5b7c8f9e

register fluid kernels to phi [part 5] (#52486) · eb38c85f
Committed by huangjiyi on Apr 04, 2023
```
* update

* fix bug

* update

* fix bug
```
eb38c85f

Improve new executor static build (#51149) · 5bac67d4

Committed by Ruibiao Chen on Apr 04, 2023

* Improve new executor static build

* Skip GC for static build

* Skip infershape for static build

* Handle read_op

* Add fused_attention to OpsWithFluidKernelNeedMoveToPhi

* Fix argsort typos

* Add sequence_pool to OpsWithFluidKernelNeedMoveToPhi

* Fix skip share lod errors

* Fix errors for adam

* Fix errors for eigvals, memcpy and fake_quantize

* Add static_build.cc

* Add black list

* Fix CI errors

* Fix CI errors

* Fix CI errors

* Fix TensorArray

* Fix TensorArray

* Add update_loss_scaling to OpsNeedSetOutputDtypeWhenRegisterPhiKernel

* Fix copy

* Fix errors

* Fix momentum

* Skip mkldnn

* Fix CI errors

* Fix c_sync_calc_stream_op

* Fix CINN

* Fix while op

* All CI pass, disable FLAGS to merge code, enable it after more tests in future

* Add UTs

* Fix typos

* Fix typos

* Add mkldnn UT

* Remove mkldnn test

* Fix typos

* Fix dist test

* Fix typos

* Fix CI errors

* Fix CI errors

* Add UTs

* Fix typos

* Fix typos

* Add sparse tests

* ToComplexType -> ToComplex

* Add test_matmul_op_static_build to disable_win_inference_test

5bac67d4

support auto generate for bce_loss (#52231) · 3ebd5af8
Committed by cyberslack_lee on Apr 04, 2023
```
* bce_loss

* fix error

* fix

* fix

* fix

* resolve conflict
```
3ebd5af8

[FIX BUG]Delete "USE_OP_ITSELF(equal_all);\n"in CMakeLists.txt (#52468) · c85a0c5c
Committed by lzydev on Apr 04, 2023
```
* fix bug of redefine use_equal_all

* fix bug of redefine use_equal_all
```
c85a0c5c

Committed by huangjiyi on Apr 04, 2023

* update

* fix bug

* fix bug

* revert diag_op

* revert expand_op and expand_as_op

* fix bug

* fix bug

63efdaee

rename_bilinear_tensor_product (#52375) · 34069c46
Committed by zhangyuqin1998 on Apr 04, 2023
```
* rename_bilinear_tensor_product

* fix
```
34069c46

03 Apr, 2023 · 9 commits
- remove WITH_ASCEND_CL PADDLE_WITH_ASCEND_CL WITH_ASCEND_CXX11 (#52448) · 0b60f28c
  Committed by engineer1109 on Apr 03, 2023
  
  0b60f28c
- Add margin_cross_entropy, transfer_layout, dropout_nd tests (#52369) · 648563dd
  Committed by chenxujun on Apr 03, 2023
  
  648563dd
- support auto generate static for gaussian (gaussian_random) (#52422) · 3c949ba9
  Committed by RedContritio on Apr 03, 2023
```
* support auto generate static for gaussian (gaussian_random)

* move gaussian_random_batch_size_like Kernels from gaussian_random_op.* to gaussian_random_batch_size_like_op.*
```
  3c949ba9
- add autogen code support for accuracy (#52424) · 1f3b9ef5
  Committed by gouzil on Apr 03, 2023
```
* add autogen code support for accuracy

* fix input
```
  1f3b9ef5
- Fix gcc12 error when compiling using gcc12 and cuda12 (#50817) · 2f850990
  Committed by risemeup1 on Apr 03, 2023
```
* fix_gcc12_error

* fix_gcc12_error

* fix gcc12_error

* fix_gcc12_error
```
  2f850990
- Fix gcc12_error (#52085) · 7500ff61
  Committed by risemeup1 on Apr 03, 2023
```
* fix error,test=document_fix

* test

* fix gcc12_error

* fix gcc12_error

* fix gcc12_error

* fix_gcc12_py3_error

* fix_range-loop-construct_error

* fix_gcc12_error
```
  7500ff61
- add autogen code support for auc_op (#52437) · 672bb07e
  Committed by xiaoyuanzi914 on Apr 03, 2023
```
* add autogen code support for auc_op

* update

---------
Co-authored-by: wqgo <1552367872@qq.com>
```
  672bb07e
- delete paddle/fluid/operators/*_mlu.* files (#52435) · bb48b596
  Committed by Young-Flash on Apr 03, 2023
  
  bb48b596
- 【PaddlePaddle Hackathon 4】No.56 : add fp16 test and bf16 test for diag,... · 0e3f7ab1
  Committed by LoneRanger on Apr 03, 2023
```
【PaddlePaddle Hackathon 4】No.56 : add fp16 test and bf16 test for diag, diagonal, fill and fill_diagonal_tensor (#51649)
```
  0e3f7ab1

01 Apr, 2023 · 1 commit

Delete the /paddle/fluid/platform/device/npu directory (#52384) · 69436bf5

Committed by jjyaoao on Apr 01, 2023

* Delete the /paddle/fluid/platform/device/npu directory

* clear Cmakelists

* Try removing npu in the header file

69436bf5

31 Mar, 2023 · 9 commits

support auto generate static for eye (#52370) · 20ee0d7f
Committed by RedContritio on Mar 31, 2023

20ee0d7f

Committed by huangjiyi on Mar 31, 2023

* update bipartite_match

* update

* fix bug

* fix test

* fix bug

* fix Kunlun-KP-Build

* Revert "fix Kunlun-KP-Build"

This reverts commit ceab63cc23079fd6839c826bb52db893fb056355.

* update

d05b73e4

[kunlun] prevent overflow in collective softmax_with_ce (#52356) · fb276f23
Committed by jameszhang on Mar 31, 2023
```
* [kunlun] prevent numerical overflow in collective softmax_with_ce

* add fix in another branch
```
fb276f23
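
The commit message above does not say how the overflow is avoided. As a general illustration only (an assumption, not the code from this PR), one common way to keep softmax cross-entropy finite is to subtract the maximum logit before exponentiating. The function name `softmax_cross_entropy` is hypothetical.

```python
import math


def softmax_cross_entropy(logits, label):
    """Cross-entropy of softmax(logits) against an integer class label.

    Subtracting the maximum logit first keeps every exponent <= 0, so
    math.exp cannot overflow. This is a generic technique and is only
    assumed to resemble what the commit does.
    """
    max_logit = max(logits)
    shifted = [value - max_logit for value in logits]
    log_sum_exp = math.log(sum(math.exp(value) for value in shifted))
    # -log(softmax[label]) = log_sum_exp - shifted[label]
    return log_sum_exp - shifted[label]


# math.exp(1000.0) on its own raises OverflowError; the shifted form does not.
loss = softmax_cross_entropy([1000.0, 999.0, 0.0], 0)
print(loss)
```

The shift does not change the result mathematically, because the same constant is removed from both the numerator and the denominator of the softmax.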

[Prim] Add prod backward composite rule (#51238) · a0069278

Committed by chenjian on Mar 31, 2023

* first commit

* add registry

* add unit test

* fix format

* add unit test

* fix bug

* replace unsqueeze with reshape

* fix

* fix unit test

* update test

* update test

* fix unit test

* fix

* fix

a0069278

Add Yaml config for some op (#52347) · 967dee45

Committed by zyfncg on Mar 31, 2023

* add yaml for some op

* fix inplace_abn

* fix test_leaky_relu_grad_grad_functor

* fix yaml

* fix typo

967dee45

[PHI Decoupling]Remove distribute header (#52202) · e923642e
Committed by YuanRisheng on Mar 31, 2023
```
* remove distribute

* fix py3 bugs

* fix gpu-ps bugs

* fix compile bugs

* fix unittest bugs
```
e923642e

[GCC9][Werror]fix -Werror=maybe-uninitialized (#52265) · 74d87a61
Committed by engineer1109 on Mar 31, 2023
```
fix with auto&
```
74d87a61

[CodeStyle][UP030][UP031][UP032] using f-string (#52062) · 40e4f5a5

Committed by 张春乔 on Mar 31, 2023

* autofix
Co-authored-by: Liyulingyue <83450930+Liyulingyue@users.noreply.github.com>

* revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py

* empty commit, trigger ci

* fix test_slice

---------
Co-authored-by: SigureMo <sigure.qaq@gmail.com>

40e4f5a5
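
For readers unfamiliar with the rule codes in the title above: UP030, UP031 and UP032 are pyupgrade-style lint rules that flag older string-formatting forms. The snippet below is a small sketch of the kind of rewrite they target; the variable names are made up for illustration and are not taken from the PR.

```python
op_name, run_count = "matmul", 3

# UP030: explicit positional indices in str.format are unnecessary.
indexed = "{0} ran {1} times".format(op_name, run_count)

# UP031: printf-style % formatting.
percent = "%s ran %d times" % (op_name, run_count)

# UP032: a str.format call that can be written as an f-string.
formatted = "{} ran {} times".format(op_name, run_count)

# All three forms can be replaced by a single f-string.
modern = f"{op_name} ran {run_count} times"

print(modern)
```

All four expressions build the same string, so the change is stylistic rather than behavioural.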

use int64 for c split (#52279) (#52340) · 9fd4fd5f
Committed by Yuang Liu on Mar 31, 2023

9fd4fd5f

30 Mar, 2023 · 3 commits

[XPU] add delete_cast_op_pass (#52305) · 8b622d58
Committed by zhupengyang on Mar 30, 2023

8b622d58

support complex data types for libpaddle.Tensor's element get and set (#52324) · 13b12457

Committed by Feiyu Chan on Mar 30, 2023

1. Add a type caster for Paddle's complex types so that pybind converts them to and from Python's `complex` automatically.
2. Add complex64 and complex128 support to `libpaddle.Tensor`'s element get and set (required to perturb a single element when computing a numerical derivative).
3. Add support for the CUDA pinned place in `libpaddle.Tensor` element get and set.

---
4. Fix a bug in op code generation (creation of the output folder concurrent with parsing the op YAML files).

13b12457
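
To illustrate why per-element get and set matters for numerical derivatives, here is a minimal sketch in plain Python. It uses a list of `complex` values in place of a tensor and does not call any `libpaddle` API; `numeric_grad` and `squared_norm` are hypothetical names introduced for this example.

```python
def numeric_grad(func, values, eps=1e-6):
    """Central-difference gradient of a real-valued func over complex inputs.

    Each element is read, perturbed along the real and then the imaginary
    axis, and written back, which is the get/set pattern described above.
    """
    grads = []
    for index in range(len(values)):
        original = values[index]              # element get
        partials = []
        for step in (eps, eps * 1j):
            values[index] = original + step   # element set (perturb up)
            upper = func(values)
            values[index] = original - step   # element set (perturb down)
            lower = func(values)
            partials.append((upper - lower) / (2 * eps))
        values[index] = original              # restore the element
        grads.append(complex(partials[0], partials[1]))
    return grads


def squared_norm(values):
    # Real-valued test function: sum of |v|^2.
    return sum(abs(v) ** 2 for v in values)


points = [1 + 2j, -0.5 + 0.25j]
grads = numeric_grad(squared_norm, points)
print(grads)
```

For this test function the partial derivatives with respect to the real and imaginary parts are `2 * Re(v)` and `2 * Im(v)`, so each packed gradient entry should be close to `2 * v`.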

add autogen code support for spectral_norm (#52145) · 28927209
Committed by Wang Xin on Mar 30, 2023
```
* add autogen code support for spectral_norm

* bug fixed

* fix PR-CI-Static-Check fail
```
28927209

PaddlePaddle / Paddle · synced successfully more than 1 year ago