提交 · 5c19bfc8015b79f861aae9be6c98371de7ccef19 · PaddlePaddle / Paddle

06 4月, 2023 25 次提交

Y

fix build bug (#52566) · 6c01ce8a
由 yuehuayingxueluo 提交于 4月 06, 2023

6c01ce8a

[StandaloneExe] improving sequentialRun mode of standaloneExecutor (#52111) · 14fe4b54

由 kangguangli 提交于 4月 06, 2023

* Verify SequentialRun Model of StandaloneExecutor

* fix

* fix

* fix

* remove redundant code

* fix CI

* fix CI

* recover multi-step dependency

14fe4b54

由 huangjiyi 提交于 4月 06, 2023

* update

* fix compile bug

* fix bug

* fix bug

* revert crop_op

* fix xpu compile

* fix cinn compile

* fix bug

* fix bug

* fix bug

* fix bug

* update

* update

* update

058ca61d

Remove oneDNN-specific attributes from matmul (#49444) · 4d97b25d

由 Sławomir Siwek 提交于 4月 06, 2023

* replace matmul with matmul_v2 in fuse passes

* Remove fusion logic from matmul

* removing fusion methods

* add proper name

* adjust namespaces

* clean attrs in python tests

* delete checkpoint and restore matmul version

* remove unused code

* matmul and reshape/transpose fuses migrated

* split MatmulOneDNN headers

* fuse activation and eltwise_add

* add fuse_activation

* matmul_transpose_reshape/reshape_transpose_matmul

* matmul + elementwise_add (fused)

* activation temporary modifciation

* restore matmul(v1) version 0

* merge newest develop

* remove depedency from other PR

* revert pbtxt

* remove placeholders from matmul_v2

* add description in OPMaker

* remove matmul_v2_op.h and all depedencies

* remove dims changing in base op

* add possibility to fuse already fused_matmul

* restart broken CI

* Empty-Commit

* revert matmul_utils.h

* codestyle

* adjust imports

* add pbtxt file

* 100% matmul unit tests coverage

* trigger CI with minimal changes to develop

* adjust changes to develop

* add fused_matmul op

* inherit base ops

* add "v2"

* move OPMaker

* Gradually add fused_matmul files

* second batch of fused_matmul changes

* split infershapes of matmul_v2 and fused_matmul

* merge code from other PR

* 2023

* inherit fused_matmul from matmul_v2

* Update paddle/phi/backends/onednn/onednn_reuse.h
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

* Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

* resolve conflicts

* codestyle

* simplify isgemmlinear

* 2023

* remove import

* reuse methods

* matmul_v2_mkldnn cleanup

* simplify ExecuteMatMulV1Grad

* matmul refactored

* fc

* SetOutMemDescWithLogicalLayoutFusesSupport

* matmul_v2

* alpha support

* group repetetive funcs

* matmul utils

* execute matmul methods

* restore registered kernel names

* split header and impl files

* remove double negatives

* reduce numer of modified files

* adjust ExecuteMatmul

* add scales for ut

* dates

* limit number of modified files

* fluid imports

* remove alpha

* codestyle

---------
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

4d97b25d

Move fused_attention op to phi [迁移前向 GPU OpKernel] (#51743) · a7ec8958

由 Sonder 提交于 4月 06, 2023

* add kernel functions

* update kernel functions

* update func parameters' name

* create codes for gpu device

* 调整文件位置

* fix include error

* remove dependent files to phi/

* restore fused_attention_op.cu

* fix dependence errors

* fix dependence errors

* fix include error

* fix all depandence errors[build success]

* remove useless include

* recover useless include

* use phi::ToNCCLDataType

* fix namespace

* update new register code

* fix error in fused_gemm_epilogue_utils

* fix error in FusedAttentionKernel parm

* finish fused_attention registe code[build success]

* add paddle::optional

* add sig file

* fix build error

* fix a include error

* update CMkaeList

* fix parameter sequence

* add include file

* update #if before include

* fix grammly error

* update codes for DropoutParam

* remove const cast

* trans some fluid api to phi api

* add #if

* update test code

* update test codes

* recover test codes

* trans fused_attention to fluid

* move #endif to end

* move #endif

* delete useless files

* use fused attention utils and recover random seed

* remove fluid include in phi

a7ec8958

S

add autogen code support for logical_and, logical_not, logical_or and logical_xor (#52451) · 6df4a667
由 scotty 提交于 4月 06, 2023

6df4a667
R

support auto generate static for assign_value (#52534) · d394c9ed
由 RedContritio 提交于 4月 06, 2023

d394c9ed
R

support auto generate static for decode_jpeg (#52542) · c1f97a9b
由 RedContritio 提交于 4月 06, 2023

c1f97a9b
张

mv PADDLE_WITH_ASCEND_CL (#52535) · 80dd1672
由张春乔提交于 4月 06, 2023

80dd1672
J

support more custom vjp (#52533) · 29c28e2f
由 Jiabin Yang 提交于 4月 06, 2023

29c28e2f

feat: add composite rule of roll grad (#52532) · 348a36b5

由 Kang Zhao 提交于 4月 06, 2023

* feat: add relu composite rule

* feat: add relu composite rule, maximum op

* feat: add relu composite rule, maximum op

* feat: add relu composite rule, polish comments

* feat: add relu composite rule, polish comments

* feat: add relu composite rule, add python api of relu

* feat: add relu composite rule, commit hook

* fix: maximum type error & ban cinn test

* fix: maximum input sequence bugs

* resolve conflicts

* fix: code style bugs

* add: relu fp16 test

* feat: add rsqrt composite rule

* feat: add rsqrt composite rule

* resolve conflicts of composite rule

* fix: delete check eager

* feat: add roll grad composite rule

* fix minus shift

* fix test roll op

348a36b5

Z
Rename conv2d transpose grad grad (#52371) · 49bbd466
由 zhangyuqin1998 提交于 4月 06, 2023
```
* Rename conv2d transpose grad grad

* fix
```
49bbd466
陈

【昇腾和寒武纪相关代码退场】No.9 清理 PADDLE_WITH_ASCEND 相关代码 (#52403) · 262ea02a
由陈沧夜提交于 4月 06, 2023

262ea02a
C

fix backend bug (#52526) · 380a9bf7
由 Chitsing KUI 提交于 4月 06, 2023

380a9bf7
S
Fix flash attention bug (#52551) · 8ac5a6b6
由 sneaxiy 提交于 4月 06, 2023
```
* fix flash attn

* fix another API
```
8ac5a6b6

rem is_compiled_with_npu (#52385) · 7976e2a3

由 Kim Yann 提交于 4月 06, 2023

* rem is_compiled_with_npu

* rem nup related code

* make lint happy

* rem test

* remove some tests

* Update grad_scaler.py

* fix an error

7976e2a3

J

[CINN] fix CINN graph symbolization topo sort fixed (#52556) · 2acc2b14
由 jiangcheng 提交于 4月 06, 2023

2acc2b14

[PHI] Adjust files of fusion kernel in PHI (#52420) · 84bb7a96

由 zyfncg 提交于 4月 06, 2023

* update readme

* remove unused header file

* fix bug

* fix onednn

* fix onednn

* rename header file

84bb7a96

X

[oneDNN]disable interpolate operators by default (#52462) · 690767ed
由 Xinyu Chen 提交于 4月 06, 2023

690767ed

【PaddlePaddle Hackathon 4】No.63 add fp16 and bf16 for eye and frame (#51819) · ae10133a

由 LoneRanger 提交于 4月 06, 2023

* add fp16 and bf16 for eye and frame

* fix bug

* fix bug

* fix bug

* Update test_frame_op.py

fix code style

* fix bug

* fix bug

ae10133a

R

support auto generate static for empty (#52524) · 2ad66a42
由 RedContritio 提交于 4月 06, 2023

2ad66a42
R
support auto generate static for randint (#52529) · 535915aa
由 RedContritio 提交于 4月 06, 2023
```
* support auto generate static for randint

* move seed from extra to attrs
```
535915aa
R

support auto generate static for uniform (uniform_random) (#52522) · 838b2c83
由 RedContritio 提交于 4月 06, 2023

838b2c83

fix protobuf error (#52499) · 8575837d

由 risemeup1 提交于 4月 06, 2023

* ix protobuf error

* fix protobuf error gpups

* fix_protobuf_error

* Update paddle_build.sh

* Update paddle_build.sh

8575837d

[AMP OP&Test]Add fp16/bf16 support logical op (#52112) · b10e4577

由 WJJ1995 提交于 4月 06, 2023

* fixed glog

* add

* add bfloat16 test for logical op

* rm useless code

* add uint16

* deal with comments

* fixed code style

* fixed code style

* fixed for ci

* deal with comments

* fixed for ci

b10e4577

05 4月, 2023 2 次提交
- fix Tensor.item to np.array(Tensor).item (#52483) · d95eaa17
  由 zhouweiwei2014 提交于 4月 05, 2023
  
  d95eaa17
- H
  
  [DDim Decouple enforce.h] Change enforce.h->exception.h (#52492) · 2236286c
  由 HongyuJia 提交于 4月 05, 2023
  
  2236286c
04 4月, 2023 13 次提交

Add Gloo Gather Function (#52334) · 5f6376b7

由 yuehuayingxueluo 提交于 4月 04, 2023

* add gloo gather

* add gloo_tools

* fix CI bug

* use gloo gather

* remove redundant code

* fix process_group_gloo.py

* rename send_recv

* fix conflict

* fix conflict

* fix codestyle

* fix CI bug

* add PADDLE_ENFORCE_NE

5f6376b7

C
【Hackathon No.62】增加pool3d算子BF16及单测，lgamma, masked_select FP16/BF16算子单测 (#51837) · b0dbf9fe
由 chenxujun 提交于 4月 04, 2023
```
* Add pool3d lgamma masked_select tests

* Fix code
```
b0dbf9fe
X
fix set value convert out of bound (#51885) · 2a9c7b5d
由 xjmxyt 提交于 4月 04, 2023
```
* fix out of bound

* fix bug

* fix bug

* fix
```
2a9c7b5d
Y

fix xpu compile bugs (#52501) · 81054ad4
由 YuanRisheng 提交于 4月 04, 2023

81054ad4
G
delete [-Wno-error=terminate], test=develop (#52490) · 15aa73df
由 Galaxy1458 提交于 4月 04, 2023
```
* delete [-Wno-error=terminate], test=develop

* remove GPUps[-Wterminate],test=develop
```
15aa73df
L
Autogen embedding static graph code (#52460) · 5b7c8f9e
由 lzydev 提交于 4月 04, 2023
```
* autogen embedding

* deal

* fix bug in CompatMetaTensor::share_lod
```
5b7c8f9e
H
register fluid kerenls to phi [part 5] (#52486) · eb38c85f
由 huangjiyi 提交于 4月 04, 2023
```
* update

* fix bug

* update

* fix bug
```
eb38c85f

Improve new executor static build (#51149) · 5bac67d4

由 Ruibiao Chen 提交于 4月 04, 2023

* Improve new executor static build

* Skip GC for static build

* Skip infershape for static build

* Handle read_op

* Add fused_attention to OpsWithFluidKernelNeedMoveToPhi

* Fix argsort typos

* Add sequence_pool to OpsWithFluidKernelNeedMoveToPhi

* Fix skip share lod errors

* Fix errors for adam

* Fix errors for eigvals, memcpy and fake_quantize

* Add static_build.cc

* Add black list

* Fix CI errors

* Fix CI errors

* Fix CI errors

* Fix TensorArray

* Fix TensorArray

* Add update_loss_scaling to OpsNeedSetOutputDtypeWhenRegisterPhiKernel

* Fix copy

* Fix errors

* Fix momentum

* Skip mkldnn

* Fix CI errors

* Fix c_sync_calc_stream_op

* Fix CINN

* Fix while op

* All CI pass, disable FLAGS to merge code, enable it after more tests in future

* Add UTs

* Fix typos

* Fix typos

* Add mkldnn UT

* Remove mkldnn test

* Fix typos

* Fix dist test

* Fix typos

* Fix CI errors

* Fix CI errors

* Add UTs

* Fix typos

* Fix typos

* Add sparse tests

* ToComplexType -> ToComplex

* Add test_matmul_op_static_build to disable_win_inference_test

5bac67d4

H
change skip-layernorm to adapt a new method (#52456) · 8a66d999
由 handiz 提交于 4月 04, 2023
```
* change skip-layernorm to adapt a new method

* fix review problem and add vlog

* fix review problem
```
8a66d999
C
support auto generate for bce_loss (#52231) · 3ebd5af8
由 cyberslack_lee 提交于 4月 04, 2023
```
* bce_loss

* fix error

* fix

* fix

* fix

* reslove confilict
```
3ebd5af8
C

Fix inplace op dims not changed (#52416) · 8e7aa296
由 csy0225 提交于 4月 04, 2023

8e7aa296
L
[FIX BUG]Delete "USE_OP_ITSELF(equal_all);\n"in CMakeLists.txt (#52468) · c85a0c5c
由 lzydev 提交于 4月 04, 2023
```
* fix bug of redefine use_equal_all

* fix bug of redefine use_equal_all
```
c85a0c5c

由 huangjiyi 提交于 4月 04, 2023

* update

* fix bug

* fix bug

* revert diag_op

* revert expand_op and expand_as_op

* fix bug

* fix bug

63efdaee

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功