Commits · 3f4917f69ad4ad91afa7d8451992e305f78dd2b5 · PaddlePaddle / Paddle

10 Mar, 2023 1 commit

【Hackathon No.67】remove operator.h in blas.h (#50989) · 3f4917f6

Committed by iSerendipity on Mar 10, 2023

* remove operator.h from blas.h and remove paddle::framework::ExecutionContext

* remove the deps for GetBlas(exe_ctx)

* fix error

3f4917f6

09 Mar, 2023 2 commits
- fix maybe-uninitialized compiler warning in Linux (#51336) · 7e56147d
  Committed by Wang Xin on Mar 09, 2023
  
  7e56147d
- fix fused linear bfloat16 (#51384) · 3328a3d5
  Committed by sneaxiy on Mar 09, 2023
  
  3328a3d5
07 Mar, 2023 1 commit
- Fix fused_gemm_epilogue compilation not declared problem (#51280) · e43a527b
  Committed by Aurelius84 on Mar 07, 2023
  
  e43a527b
06 Mar, 2023 2 commits

convert todos to internal tasks (#51174) · 6b393e45
Committed by Sławomir Siwek on Mar 06, 2023

6b393e45

[phi decoupling] decouple dependency to device_context in phi (Part 1) (#50865) · a1006b2b

Committed by Huang Jiyi on Mar 06, 2023

* move DeviceContextPool to phi

* add EmplaceExternalContextFunc

* update namespace

* update cmake

* fix bugs and create context_pool_impl.h

* replace platform::is_xxx_place

* fix bugs

* update generator

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix enforce usage

* Revert "fix enforce usage"

This reverts commit 5f521f08a69713cee506e64a00ec6d9fba709e27.

* fix bugs

* rm XPUDeviceContext and CustomDeviceContext

* fix bugs

* fix fix context init bug

* fix bugs after merge

* fix bugs

* fix name

* fix mutable_data

* update and fix bugs

* fix bugs

* update

* fix bugs

* fix name

* fix bugs

* merge

* fix bugs

* create context_pool in phi/backends

* create context_pool in phi/backends

* fix bugs

* fix xpu bugs

* fix rocm bugs

* fix bugs

* fix bugs

* fix bugs

* fix xpu bugs

* update

* update

* fix bugs

* fix bugs

a1006b2b

03 Mar, 2023 1 commit

【Hackathon No.70】[PHI decoupling] move jit kernels from fluid to phi (#50911) · 2d36c9a9

Committed by gouzil on Mar 03, 2023

* [phi] move jit kernels from fluid to phi

* [phi] fix paddle::phi err

* [phi] fix windows 'posix_memalign': identifier not found

* [phi] fix windows 'posix_memalign_free': identifier not found

* [phi] fix readme directory structure, fc_functor  paddle::platform

2d36c9a9

28 Feb, 2023 1 commit
- fix bug in fused_gemm_epilogue_op.cc (#50980) · 064a5434
  Committed by yuehuayingxueluo on Feb 28, 2023
  
  064a5434
26 Feb, 2023 1 commit

Enable matmul + bias fusion in fused_gat_attention. (#50755) · 57f6a469

Committed by Yiqun Liu on Feb 26, 2023

* Enable matmul + bias fusion in fused_gat_attention.

* Add a variable to control whether using fused matmul + bias.

57f6a469

23 Feb, 2023 1 commit

[phi decoupling] move generator implementation from fluid to phi (#50746) · 4e417409

Committed by Huang Jiyi on Feb 23, 2023

* move fluid generator to phi

* move fluid generator to phi

* update .gitignore

* fix bugs

* fix cannot find "glog/logging.h" in "generator.h"

* fix bugs

4e417409

22 Feb, 2023 1 commit

Fix some typos. (#50429) · 93b2bf4b

Committed by Shuangchi He on Feb 22, 2023

* Fix some typos.
Signed-off-by: Yulv-git <yulvchi@qq.com>

* pre-commit
Signed-off-by: Yulv-git <yulvchi@qq.com>

---------
Signed-off-by: Yulv-git <yulvchi@qq.com>

93b2bf4b

17 Feb, 2023 1 commit

Rename MultiTensorAdam To FusedAdam (#50449) · e6af9bd2

Committed by yuehuayingxueluo on Feb 17, 2023

* rename multi_tensor_adam to fused_adam

* fix some bugs

* fix CI coverage

* rename test_fused_adam.py

* fix some bug

* add test_fused_adam_op.py

* fix some bugs

* fix fused_adam_op.cc

* fix CI bugs

* fix CI bug

* fix CI bug

e6af9bd2

16 Feb, 2023 1 commit

[Phi decouple] move layer_norm_kernel.cu.h to phi (#50506) · 8910bb4a

Committed by Huang Jiyi on Feb 16, 2023

* move layer_norm_kernel.cu.h to phi

* fix bugs

* fix namespace

* fix bugs

* fix CI-Windows

* replace mutable_data

* fix bugs

* fix bugs

8910bb4a

15 Feb, 2023 1 commit

make FusedMultiTransformer supports variable-lengths. (#49560) · 53df50c7

Committed by lzy on Feb 15, 2023

* make FusedMultiTransformer supports variable-lengths.

* modify ffn2 when cuda_version >= 11.6 because of #49392.

* code style

* delete remove_padding

53df50c7

14 Feb, 2023 1 commit

Decrease usage of GetVecSize for optimizing host computation efficiency (#50353) · 976606fe

Committed by limingshu on Feb 14, 2023

* first commit.

* a little changes

* add some changes for get vec_size efficiently

* fix bugs

---------
Co-authored-by: zhangbopd <1299246947@qq.com>

976606fe

08 Feb, 2023 3 commits
- Fused attention pass mp support (#50320) · e44ff495
  Committed by Yuang Liu on Feb 08, 2023
  
  e44ff495
- move mixed_vector (#50282) · 35d7d1f0
  Committed by Huang Jiyi on Feb 08, 2023
  
  35d7d1f0
- [PHI]Unify Fluid and PHI kernel (#49328) · e92e3aab
  Committed by YuanRisheng on Feb 08, 2023
```
* unify_kernel

* fix compile bugs

* modify macro name

* perfect code according comment

* fix compile bugs

* fix compile bugs

* fix ci bugs

* fix ci bug

* fix ci bugs

* fix ci bugs

* modify code according comment

* rm conv_fusion_op
```
  e92e3aab
06 Feb, 2023 2 commits

Delete extra input (Bias, ResidualData) in OpMaker of conv2d (#49121) · 2deada9a

Committed by zyfncg on Feb 06, 2023

* remove extra input of conv2d

* fix bug

* fix unittest bug

* adjust conv2d.pbtxt

* fix cpu_quantize_pass_tester

* revert use_addto of conv2d

* fix runtime attribute

* fix bug

* recover force_fp32_output in conv2d

* refine error info

* fix bug

2deada9a

phi move ReshapeToMatrix & GetValue (#50139) · d09962a1
Committed by engineer1109 on Feb 06, 2023

d09962a1

03 Feb, 2023 2 commits

Replace matmul(v2) with fused_matmul during oneDNN fuse passes (#49515) · 5cfe1645

Committed by Sławomir Siwek on Feb 03, 2023

* replace matmul with matmul_v2 in fuse passes

* Remove fusion logic from matmul

* removing fusion methods

* add proper name

* adjust namespaces

* clean attrs in python tests

* delete checkpoint and restore matmul version

* remove unused code

* matmul and reshape/transpose fuses migrated

* split MatmulOneDNN headers

* fuse activation and eltwise_add

* add fuse_activation

* matmul_transpose_reshape/reshape_transpose_matmul

* matmul + elementwise_add (fused)

* activation temporary modification

* merge newest develop

* remove dependency from other PR

* revert pbtxt

* remove placeholders from matmul_v2

* add description in OPMaker

* remove matmul_v2_op.h and all dependencies

* remove dims changing in base op

* add possibility to fuse already fused_matmul

* restart broken CI

* Empty-Commit

* revert matmul_utils.h

* codestyle

* adjust imports

* add pbtxt file

* 100% matmul unit tests coverage

* trigger CI with minimal changes to develop

* adjust changes to develop

* add fused_matmul op

* inherit base ops

* add "v2"

* move OPMaker

* Gradually add fused_matmul files

* second batch of fused_matmul changes

* split infershapes of matmul_v2 and fused_matmul

* inherit fused_matmul from matmul_v2

* Update paddle/phi/backends/onednn/onednn_reuse.h
Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>

* Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc
Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>

---------
Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>

5cfe1645

Fused attention pass backward op replace. (#50186) · 7e8ef328
Committed by Yuang Liu on Feb 03, 2023

7e8ef328

01 Feb, 2023 1 commit

Preln fix (#49802) · e03718f5

Committed by Wang Bojun on Feb 01, 2023

* preln_residual 2 fused_bias_residual

* skip layernorm fix and ut

* code refine

* code style refine

* fix ut

* fix output

* add trt layer fall back info

* refine op teller and ut

* DropoutMaskOut output fix

e03718f5

13 Jan, 2023 1 commit
- fix fc and fused_fc_elementwise_layernorm kernel diff (#49778) · 0b24d167
  Committed by Yuanle Liu on Jan 13, 2023
  
  0b24d167
06 Jan, 2023 1 commit
- revert back ffn2 (#49392) · 0019ef0c
  Committed by MarDino on Jan 06, 2023
  
  0019ef0c
05 Jan, 2023 1 commit
- Add transpose_qkv_wb flags to the fused_attention_op. (#49494) · ec857b85
  Committed by Yuang Liu on Jan 05, 2023
  
  ec857b85
04 Jan, 2023 3 commits

[Inference] Add conv_fusion nhwc impl. (#49047) · 4a8708bb
Committed by Wilber on Jan 04, 2023

4a8708bb

[Paddle Inference] fix mixed precision diff (#49475) · ac75a9a6
Committed by Yuanle Liu on Jan 04, 2023

ac75a9a6

[Unify KernelKey] change OpKernelType->KernelKey (#49138) · 4383494f

Committed by HongyuJia on Jan 04, 2023

* execute use kernel_key first

* change OpKernelType->KernelKey

* fix py3 compile error, remove redundant header files

* fix build_strategy_test

* fix DataType::RAW

* fix custom_type test: operator_test.cc

* fix transform place

* fix backends_are_same_class

* try fix place TransDataDevice

* support all KernelKey

* fix TransformData

* fix place_are_same_class

* fix merge

* fix test_params_no_grad

* fix specific place of GetExpectedKernelType

* fix specific place of GetExpectedKernelType

* fix GetKernelTypeForVar

* fix dtype error

* fix fetch_v2

* change GetKernelTypeForVar

* fix interpreter

* fix typo error

* polish codes

* polish codes

* polish codes

* fix conflict

4383494f

03 Jan, 2023 1 commit

[Paddle Inference] Implement conv2d_fusion NHWC format using cutlass (#47989) · c123dd1e

Committed by zhoutianzi666 on Jan 03, 2023

* Implement conv2d_fusion NHWC format using CUTLASS
* Add unit testing for CUTLASS Conv in inference
* Add experimental API for CUTLASS.

c123dd1e

29 Dec, 2022 2 commits
- fix ambiguous symbol error (#49406) · 6f07960c
  Committed by MarDino on Dec 29, 2022
  
  6f07960c
- fused_attention_op parameters stop grad support (#49351) · 0bb999b6
  Committed by Wang Bojun on Dec 29, 2022
```
* fusedAttenGrad_noGrad

* code style fix

* add ut

* remove unnecessary log
```
  0bb999b6
23 Dec, 2022 1 commit
- make FusedMultiTransformer supports RoPE (#48842) · 644dfc60
  Committed by lzy on Dec 23, 2022
  
  644dfc60
20 Dec, 2022 1 commit

[PHI decouple] move dropout_impl and cuda_graph_with_memory_pool from fluid to phi (#49139) · 579784e2

Committed by huangjiyi on Dec 20, 2022

* move dropout_impl from fluid to phi

* move cuda_graph_with_memory_pool from fluid to phi

* update namespace

* remove cuda_graph in fluid

* fix mac-build

* fix bugs

* correct CodeStyle

* fix mac-build

* fix mutable_data

* fix stl include

* fix copy param

579784e2

19 Dec, 2022 1 commit
- refactor: rename process group (#49137) · 22e416cf
  Committed by Wen Sun on Dec 19, 2022
  
  22e416cf
16 Dec, 2022 1 commit
- refactor: rename files (#49117) · 40f3f4f0
  Committed by Wen Sun on Dec 16, 2022
  
  40f3f4f0
15 Dec, 2022 2 commits

[PHI decoupling] move softmax from fluid to phi and remove cpu_vec.h in fluid (#48970) · 344b99e1
Committed by huangjiyi on Dec 15, 2022

344b99e1

[PHI decoupling] Remove fluid imports from MKLDNN code (#48981) · 4d5a5533

Committed by Sławomir Siwek on Dec 15, 2022

* fix wrong handler name

* mkldnn_engine -> onednn_engine

* remove fluid/errors.h imports

* remove fluid/enforce.h imports

* remove note and unnecessary import

* remove fluid/pretty_log.h imports

* remove fluid/place.h imports

* remove fluid/data_layout_transform.h imports

* remove fluid/device_context.h imports

* remove mkldnn_helper code

* remove fluid/mkldnn_reuse.h imports

* pretty_log import

4d5a5533

14 Dec, 2022 2 commits
- Fix nullptr to TestFuseGemmEpilogueReluBWDFP* (#48997) · e61df289
  Committed by Ming-Xu Huang on Dec 14, 2022
  
  e61df289
- modify cmake file for cuda11.8 compile (#49020) · d0284f85
  Committed by zqw_1997 on Dec 14, 2022
```
* modify cmake file for cuda11.8 compile

* add op_library(fused_embedding_eltwise_layernorm_op DEPS bert_encoder_functor)
```
  d0284f85

PaddlePaddle / Paddle synced successfully more than 1 year ago
