提交 · e75c01f91350fce6d6051e4eec351514db005692 · PaddlePaddle / Paddle

07 4月, 2023 1 次提交
- W
  
  clean up WITH_MLU (#52546) · e75c01f9
  由 Wang Xin 提交于 4月 07, 2023
  
  e75c01f9
06 4月, 2023 10 次提交

Y

fix build bug (#52566) · 6c01ce8a
由 yuehuayingxueluo 提交于 4月 06, 2023

6c01ce8a

Remove oneDNN-specific attributes from matmul (#49444) · 4d97b25d

由 Sławomir Siwek 提交于 4月 06, 2023

* replace matmul with matmul_v2 in fuse passes

* Remove fusion logic from matmul

* removing fusion methods

* add proper name

* adjust namespaces

* clean attrs in python tests

* delete checkpoint and restore matmul version

* remove unused code

* matmul and reshape/transpose fuses migrated

* split MatmulOneDNN headers

* fuse activation and eltwise_add

* add fuse_activation

* matmul_transpose_reshape/reshape_transpose_matmul

* matmul + elementwise_add (fused)

* activation temporary modifciation

* restore matmul(v1) version 0

* merge newest develop

* remove depedency from other PR

* revert pbtxt

* remove placeholders from matmul_v2

* add description in OPMaker

* remove matmul_v2_op.h and all depedencies

* remove dims changing in base op

* add possibility to fuse already fused_matmul

* restart broken CI

* Empty-Commit

* revert matmul_utils.h

* codestyle

* adjust imports

* add pbtxt file

* 100% matmul unit tests coverage

* trigger CI with minimal changes to develop

* adjust changes to develop

* add fused_matmul op

* inherit base ops

* add "v2"

* move OPMaker

* Gradually add fused_matmul files

* second batch of fused_matmul changes

* split infershapes of matmul_v2 and fused_matmul

* merge code from other PR

* 2023

* inherit fused_matmul from matmul_v2

* Update paddle/phi/backends/onednn/onednn_reuse.h
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

* Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

* resolve conflicts

* codestyle

* simplify isgemmlinear

* 2023

* remove import

* reuse methods

* matmul_v2_mkldnn cleanup

* simplify ExecuteMatMulV1Grad

* matmul refactored

* fc

* SetOutMemDescWithLogicalLayoutFusesSupport

* matmul_v2

* alpha support

* group repetetive funcs

* matmul utils

* execute matmul methods

* restore registered kernel names

* split header and impl files

* remove double negatives

* reduce numer of modified files

* adjust ExecuteMatmul

* add scales for ut

* dates

* limit number of modified files

* fluid imports

* remove alpha

* codestyle

---------
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

4d97b25d

Move fused_attention op to phi [迁移前向 GPU OpKernel] (#51743) · a7ec8958

由 Sonder 提交于 4月 06, 2023

* add kernel functions

* update kernel functions

* update func parameters' name

* create codes for gpu device

* 调整文件位置

* fix include error

* remove dependent files to phi/

* restore fused_attention_op.cu

* fix dependence errors

* fix dependence errors

* fix include error

* fix all depandence errors[build success]

* remove useless include

* recover useless include

* use phi::ToNCCLDataType

* fix namespace

* update new register code

* fix error in fused_gemm_epilogue_utils

* fix error in FusedAttentionKernel parm

* finish fused_attention registe code[build success]

* add paddle::optional

* add sig file

* fix build error

* fix a include error

* update CMkaeList

* fix parameter sequence

* add include file

* update #if before include

* fix grammly error

* update codes for DropoutParam

* remove const cast

* trans some fluid api to phi api

* add #if

* update test code

* update test codes

* recover test codes

* trans fused_attention to fluid

* move #endif to end

* move #endif

* delete useless files

* use fused attention utils and recover random seed

* remove fluid include in phi

a7ec8958

张

mv PADDLE_WITH_ASCEND_CL (#52535) · 80dd1672
由张春乔提交于 4月 06, 2023

80dd1672
Z
Rename conv2d transpose grad grad (#52371) · 49bbd466
由 zhangyuqin1998 提交于 4月 06, 2023
```
* Rename conv2d transpose grad grad

* fix
```
49bbd466
C

fix backend bug (#52526) · 380a9bf7
由 Chitsing KUI 提交于 4月 06, 2023

380a9bf7
S
Fix flash attention bug (#52551) · 8ac5a6b6
由 sneaxiy 提交于 4月 06, 2023
```
* fix flash attn

* fix another API
```
8ac5a6b6

[PHI] Adjust files of fusion kernel in PHI (#52420) · 84bb7a96

由 zyfncg 提交于 4月 06, 2023

* update readme

* remove unused header file

* fix bug

* fix onednn

* fix onednn

* rename header file

84bb7a96

【PaddlePaddle Hackathon 4】No.63 add fp16 and bf16 for eye and frame (#51819) · ae10133a

由 LoneRanger 提交于 4月 06, 2023

* add fp16 and bf16 for eye and frame

* fix bug

* fix bug

* fix bug

* Update test_frame_op.py

fix code style

* fix bug

* fix bug

ae10133a

[AMP OP&Test]Add fp16/bf16 support logical op (#52112) · b10e4577

由 WJJ1995 提交于 4月 06, 2023

* fixed glog

* add

* add bfloat16 test for logical op

* rm useless code

* add uint16

* deal with comments

* fixed code style

* fixed code style

* fixed for ci

* deal with comments

* fixed for ci

b10e4577

04 4月, 2023 4 次提交

C
【Hackathon No.62】增加pool3d算子BF16及单测，lgamma, masked_select FP16/BF16算子单测 (#51837) · b0dbf9fe
由 chenxujun 提交于 4月 04, 2023
```
* Add pool3d lgamma masked_select tests

* Fix code
```
b0dbf9fe
Y

fix xpu compile bugs (#52501) · 81054ad4
由 YuanRisheng 提交于 4月 04, 2023

81054ad4

Improve new executor static build (#51149) · 5bac67d4

由 Ruibiao Chen 提交于 4月 04, 2023

* Improve new executor static build

* Skip GC for static build

* Skip infershape for static build

* Handle read_op

* Add fused_attention to OpsWithFluidKernelNeedMoveToPhi

* Fix argsort typos

* Add sequence_pool to OpsWithFluidKernelNeedMoveToPhi

* Fix skip share lod errors

* Fix errors for adam

* Fix errors for eigvals, memcpy and fake_quantize

* Add static_build.cc

* Add black list

* Fix CI errors

* Fix CI errors

* Fix CI errors

* Fix TensorArray

* Fix TensorArray

* Add update_loss_scaling to OpsNeedSetOutputDtypeWhenRegisterPhiKernel

* Fix copy

* Fix errors

* Fix momentum

* Skip mkldnn

* Fix CI errors

* Fix c_sync_calc_stream_op

* Fix CINN

* Fix while op

* All CI pass, disable FLAGS to merge code, enable it after more tests in future

* Add UTs

* Fix typos

* Fix typos

* Add mkldnn UT

* Remove mkldnn test

* Fix typos

* Fix dist test

* Fix typos

* Fix CI errors

* Fix CI errors

* Add UTs

* Fix typos

* Fix typos

* Add sparse tests

* ToComplexType -> ToComplex

* Add test_matmul_op_static_build to disable_win_inference_test

5bac67d4

Z
rename_bilinear_tensor_product (#52375) · 34069c46
由 zhangyuqin1998 提交于 4月 04, 2023
```
* rename_bilinear_tensor_product

* fix
```
34069c46

03 4月, 2023 9 次提交
- remove WITH_ASCEND_CL PADDLE_WITH_ASCEND_CL WITH_ASCEND_CXX11 (#52448) · 0b60f28c
  由 engineer1109 提交于 4月 03, 2023
  
  0b60f28c
- C
  
  Add margin_cross_entropy, transfer_layout, dropout_nd tests (#52369) · 648563dd
  由 chenxujun 提交于 4月 03, 2023
  
  648563dd
- D
  【Hackathon No.50】为 Paddle lerp 算子实现 float16 数据类型支持 (#50925) · a2cbc81a
  由 denglianbin 提交于 4月 03, 2023
```
* finish task

* fix error

* pre-commit fix code style

* add unittest.

* change unittest.

* delete unittest case.
```
  a2cbc81a
- C
  
  Add kron float16/bfloat16, unbind float16 tests (#52413) · f547ee92
  由 chenxujun 提交于 4月 03, 2023
  
  f547ee92
- Z
  Kernel registrar (#52079) · a725c9a5
  由 zhangyuqin1998 提交于 4月 03, 2023
```
* add kernel register macro for all backend

* fix msvc bug

* fix

---------
Co-authored-by: Nzhangyuqin1998 <2368719379@qq.com>
```
  a725c9a5
- T
  
  【Hackathon 4th No.24】为 Paddle 新增 paddle.sparse.is_nan 稀疏 API (#51513) · b7db6af2
  由 thunder95 提交于 4月 03, 2023
  
  b7db6af2
- L
  【PaddlePaddle Hackathon 4】No.56 : add fp16 test and bf16 test for diag,... · 0e3f7ab1
  由 LoneRanger 提交于 4月 03, 2023
```
【PaddlePaddle Hackathon 4】No.56 : add fp16 test and bf16 test for diag, diagonal, fill and fill_diagonal_tensor (#51649)
```
  0e3f7ab1
- Z
  
  rename_batch_norm_grad_grad (#52372) · cf7c431f
  由 zhangyuqin1998 提交于 4月 03, 2023
  
  cf7c431f
- W
  
  [XPU]add conv_fuse pass && kernel (#52247) · eddf1ad6
  由 wz1qqx 提交于 4月 03, 2023
  
  eddf1ad6
31 3月, 2023 5 次提交
- Z
  
  rename_conv2d_grad_grad (#52374) · ea5e1ebb
  由 zhangyuqin1998 提交于 3月 31, 2023
  
  ea5e1ebb
- C
  
  [XPU] interpolate support fp16 (#52358) · 3996f0de
  由 csy0225 提交于 3月 31, 2023
  
  3996f0de
- Y
  [PHI Decoupling]Remove distribute header (#52202) · e923642e
  由 YuanRisheng 提交于 3月 31, 2023
```
* remove distribute

* fix py3 bugs

* fix gpu-ps bugs

* fix compile bugs

* fix unittest bugs
```
  e923642e
- R
  
  [CustomDevice] fix set_constant (#52360) · f22b9666
  由 ronnywang 提交于 3月 31, 2023
  
  f22b9666
- 张
  [CodeStyle][UP030][UP031][UP032] using f-string (#52062) · 40e4f5a5
  由张春乔提交于 3月 31, 2023
```
* autofix
Co-authored-by: NLiyulingyue <83450930+Liyulingyue@users.noreply.github.com>

* revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py

* empty commit, trigger ci

* fix test_slice

---------
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
```
  40e4f5a5
30 3月, 2023 9 次提交
- Z
  move elementwise_raw_kernel to new dir (#51965) · 49461a02
  由 zhangyuqin1998 提交于 3月 30, 2023
```
* move elementwise raw

* fix

* fix
```
  49461a02
- [Zero-Dim] Support broadcast_tensors input 0D and distribution API output 0D (#51721) · 2bd0a946
  由 zhouweiwei2014 提交于 3月 30, 2023
  
  2bd0a946
- Z
  
  [Sparse]Fix the bug of elementwise_grad (#52102) · aeb8c2e2
  由 zhangkaihuo 提交于 3月 30, 2023
  
  aeb8c2e2
- R
  
  [AMP OP&Test] add fp16 test for linspace (#52161) · 40b30f50
  由 Roc 提交于 3月 30, 2023
  
  40b30f50
- Y
  
  add xpu cumprod, group norm grad (#52089) · fb16bdc7
  由 ykkk2333 提交于 3月 30, 2023
  
  fb16bdc7
- Y
  [AMP OP&Test] Register FP16 for multinomial. (#52107) · 7788b65e
  由 yunyaoXYY 提交于 3月 30, 2023
```
* add FP16 for multinomial

* fix input data

* update code

* fix FP16

* fix code
```
  7788b65e
- W
  [AMP OP&Test] Strided slice fp16 and bf16 unitest (#52220) · 5cdd9f2c
  由 Wang Xinyu 提交于 3月 30, 2023
```
* stride slice fp16 and bf16 unitest

* fix code style

* add self.dtype
```
  5cdd9f2c
- D
  
  fix the compare in PD_MEA_CHECK_OVERFLOW (#52300) · 155018ee
  由 Danyang Zhang 提交于 3月 30, 2023
  
  155018ee
- C
  [CodeStyle][C416][C417] rewrite unnecessary comprehension with function call... · 929892c3
  由 cyberslack_lee 提交于 3月 30, 2023
```
[CodeStyle][C416][C417] rewrite unnecessary comprehension with function call and use generator instead of map (#52140)

* codestyle c416 c417

* fix error

* fix inc

* unify all C4 rules into one

* fix inc

---------
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
```
  929892c3
29 3月, 2023 2 次提交

[AMP OP&Test] pad3d add unittests of fp16 and bf16 (#51015) · f86d0be7

由 zengshao0622 提交于 3月 29, 2023

* pad3d add unittests of fp16 and bf16

* pad3d add unittests of fp16 and bf16

* fix cuda place

* fix random to uniform

* fix class name

* fix fp16 max relative error to 1.5e-3

* add dytpe register for onednn

* add pad uint16 check of common.py

* remove check_eager

* test_check_grad --> test_check_grad_normal

f86d0be7

Add output defines for graph_sample_neighbors and group_norm (#51503) · 37bd7e78

由 hjyp 提交于 3月 29, 2023

* regist output type for GraphSampleNeighbors and GroupNorm

* Update return type

* fix return type

* update

* fix detail

37bd7e78

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功