Commits · caf2008b55fe0ee34cd6fa71090701517ac33eda · PaddlePaddle / Paddle

06 Feb, 2023 5 commits

【Pglbox】merge gpugraph to develop (#50091) · caf2008b

Committed by zmxdream on Feb 06, 2023

* add dump_walk_path  (#193)

* add dump_walk_path; test=develop

* add dump_walk_path; test=develop

* add dump_walk_path; test=develop

* Add multiple CPU communication, parameter query and merging functions, support batch alignment between multiple cards (#194)

* compatible with edge_type of src2dst and src2etype2dst (#195)

* do not merge_feature_shard when using metapath_split_opt (#198)

* support only load reverse_edge (#199)

* refactor GraphTable (#201)

* fix

* fix

* fix code style

* fix code style

* fix test_dataset

* fix hogwild worker

* fix code style

* fix code style

* fix code style

* fix code style

* fix code style.

* fix code style.

---------
Co-authored-by: danleifeng <52735331+danleifeng@users.noreply.github.com>
Co-authored-by: qingshui <qshuihu@gmail.com>
Co-authored-by: Webbley <liwb5@foxmail.com>
Co-authored-by: huwei02 <53012141+huwei02@users.noreply.github.com>

caf2008b

Delete extra input (Bias, ResidualData) in OpMaker of conv2d (#49121) · 2deada9a

Committed by zyfncg on Feb 06, 2023

* remove extra input of conv2d

* fix bug

* fix unittest bug

* adjust conv2d.pbtxt

* fix cpu_quantize_pass_tester

* revert use_addto of conv2d

* fix runtime attribute

* fix bug

* recover force_fp32_output in conv2d

* refine error info

* fix bug

2deada9a

Fused attn pass single ut (#50227) · fcec564c
Committed by Yuang Liu on Feb 06, 2023

fcec564c
Fix to_dlpack (#50138) · 35ce2bd9
Committed by Siming Dai on Feb 06, 2023
```
* fix to_dlpack for loop

* fix reference count
```
35ce2bd9

phi move ReshapeToMatrix & GetValue (#50139) · d09962a1
Committed by engineer1109 on Feb 06, 2023

d09962a1

04 Feb, 2023 1 commit
- Add Some Default Parameters to CINN Interface for Country Standard (#50182) · fb69204f
  Committed by Huihuang Zheng on Feb 04, 2023
```
As the title
```
  fb69204f
03 Feb, 2023 4 commits

Replace matmul(v2) with fused_matmul during oneDNN fuse passes (#49515) · 5cfe1645

Committed by Sławomir Siwek on Feb 03, 2023

* replace matmul with matmul_v2 in fuse passes

* Remove fusion logic from matmul

* removing fusion methods

* add proper name

* adjust namespaces

* clean attrs in python tests

* delete checkpoint and restore matmul version

* remove unused code

* matmul and reshape/transpose fuses migrated

* split MatmulOneDNN headers

* fuse activation and eltwise_add

* add fuse_activation

* matmul_transpose_reshape/reshape_transpose_matmul

* matmul + elementwise_add (fused)

* activation temporary modification

* merge newest develop

* remove dependency from other PR

* revert pbtxt

* remove placeholders from matmul_v2

* add description in OPMaker

* remove matmul_v2_op.h and all dependencies

* remove dims changing in base op

* add possibility to fuse already fused_matmul

* restart broken CI

* Empty-Commit

* revert matmul_utils.h

* codestyle

* adjust imports

* add pbtxt file

* 100% matmul unit tests coverage

* trigger CI with minimal changes to develop

* adjust changes to develop

* add fused_matmul op

* inherit base ops

* add "v2"

* move OPMaker

* Gradually add fused_matmul files

* second batch of fused_matmul changes

* split infershapes of matmul_v2 and fused_matmul

* inherit fused_matmul from matmul_v2

* Update paddle/phi/backends/onednn/onednn_reuse.h
Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>

* Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc
Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>

---------
Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>

5cfe1645

Rewrite conv testers from cpp to python (#49582) · aa8cef4a

Committed by Paulina Gacek on Feb 03, 2023

* conv_bias_mkldnn_fuse_pass_tester rewritten

* conv_concat_relu_mkldnn_fuse_pass_tester rewritten

* conv_elementwise_add_fuse_pass_tester rewritten

* mkldnn changed to onednn

* tests added to cmakeLists, style fix

* got rid of unnecessary UT, some style changes

* changes in naming convention

* max_examples reduced

* time out added

aa8cef4a

Fused attention pass backward op replace. (#50186) · 7e8ef328
Committed by Yuang Liu on Feb 03, 2023

7e8ef328
Reduce time cost of BuildOpHappensBefore (#50137) · 6b151c0e
Committed by Ruibiao Chen on Feb 03, 2023
```
* Reduce time cost of BuildOpHappensBefore

* Update code

* Update code

* Improve data struct
```
6b151c0e

02 Feb, 2023 1 commit
- [BugFix]Fix bugs when compile with OneDNN (#50096) · 3c557e2f
  Committed by YuanRisheng on Feb 02, 2023
```
* fix bugs

* fix ci bugs
```
  3c557e2f
01 Feb, 2023 2 commits

Fused attention pass fwd, create the fused_attention op. (#50125) · 2b848aef
Committed by Yuang Liu on Feb 01, 2023

2b848aef

Preln fix (#49802) · e03718f5

Committed by Wang Bojun on Feb 01, 2023

* preln_residual 2 fused_bias_residual

* skip layernorm fix and ut

* code refine

* code style refine

* fix ut

* fix output

* add trt layer fall back info

* refine op teller and ut

* DropoutMaskOut output fix

e03718f5

31 Jan, 2023 6 commits

gn_silu (#49928) · 111075a3

Committed by wenbin on Jan 31, 2023

* gn_silu

* add ut

* set TIMEOUT

* correct comments

* comments

* disable windows ut

* rename parameter

111075a3

[pass] Upgrade Constant Folding Pass (#49908) · c3cd8502
Committed by Zhang Jun on Jan 31, 2023

c3cd8502
Save nan log to file when output_dir is setted (#49200) · c18fddd3
Committed by niuliling123 on Jan 31, 2023

c18fddd3

Integrate static code gen info (#49858) · 0e51f398

Committed by Charles-hit on Jan 31, 2023

* polish static grad op maker gen

* fix some bugs

* fix static code gen

* solve conflict

* modify composite grad maker name

* integrate phi and fluid info in static code gen

* rename some composite maker

* modify static code gen format

0e51f398

support inplaced variable in cinn_launch (#49912) · 754ab705

Committed by TeFeng Chen on Jan 31, 2023

* support inplaced variable in cinn_launch

* fix error hint when compiling

* fix inplaced output variable of the subgraph

* skip CinnCompiler check

* using existed definition

* fix namespace reference error

* modify error message

* update cinn tag

* fix namespace

* skip enforce check

* fix unittest attribute throw

754ab705

change no_event GC to fast GC for xpu (#49871) · eba7b584
Committed by pangyoki on Jan 31, 2023

eba7b584

30 Jan, 2023 5 commits

[CINN] fix build_cinn_pass collect inplace var bug (#50072) · ac84dce9
Committed by jiangcheng on Jan 30, 2023

ac84dce9
add phi tensor vector array api from fluid (#49885) · 094e3b8c
Committed by engineer1109 on Jan 30, 2023
```
replace all TensorFromVector & TensorToVector

AssignKernel async copy
```
094e3b8c

Support stream priority for standalone executor (#49939) · 172d1de6

Committed by Ruibiao Chen on Jan 30, 2023

* Support stream priority for standalone executor

* Fix compile error

* Fix compile error

* Fix compile error

* Fix compile error

* Fix compile error

172d1de6

[Pglbox2.0] merge gpugraph to develop (#49946) · cb525d4e

Committed by zmxdream on Jan 30, 2023

* add set slot_num for psgpuwraper (#177)

* add set slot_num_for_pull_feature for psgpuwarper

* Add get_epoch_finish python interface (#182)

* add get_epoch_finish interface

* add return

* delete return

* add unzip op (#183)

* fix miss key for error dataset (#186)

* fix miss key for error dataset

* fix miss key for error dataset
Co-authored-by: yangjunchao <yangjunchao@baidu.com>

* add excluded_train_pair and infer_node_type (#187)

* support return of degree (#188)

* fix task stuck in barrier (#189)
Co-authored-by: yangjunchao <yangjunchao@baidu.com>

* check node/feature format when loading (#190)

* check node&feature format when loading

* check node&feature format when loading (2)

* degrade log (#191)

* [PGLBOX]fix conflict

* [PGLBOX]fix conflict

* [PGLBOX]replace LodTensor with phi::DenseTensor

* [PGLBOX]fix gpu_primitives.h include path

* [PGLBOX]from platform::PADDLE_CUDA_NUM_THREADS to phi::PADDLE_CUDA_NUM_THREADS

* [PGLBOX]fix unzip example code

* [PGLBOX]fix unzip example code

* [PGLBOX]fix unzip example code

* [PGLBOX]fix unzip example code

* [PGLBOX]fix unzip ut

* [PGLBOX]fix unzip ut

* [PGLBOX]fix code style

* [PGLBOX]fix code style

* [PGLBOX]fix code style

* fix code style

* fix code style

* fix unzip ut

* fix unzip ut

* fix unzip ut

* fix unzip

* fix code style

* add ut

* add c++ ut & fix train_mode_ set

* fix load into memory

* fix c++ ut

* fix c++ ut

* fix c++ ut

* fix c++ ut

* fix code style

* fix collective

* fix unzip_op.cc

* fix barrier

* fix code style

* fix barrier

* fix barrier

* fix code style

* fix unzip

* add unzip.py

* add unzip.py

* fix unzip.py

---------
Co-authored-by: chao9527 <33347532+chao9527@users.noreply.github.com>
Co-authored-by: Siming Dai <908660116@qq.com>
Co-authored-by: huwei02 <53012141+huwei02@users.noreply.github.com>
Co-authored-by: yangjunchao <yangjunchao@baidu.com>

cb525d4e

Add a cuDNN version check to the logic that maps depthwise_conv to conv (#50058) · 320958eb
Committed by gem5 on Jan 30, 2023

320958eb

29 Jan, 2023 4 commits
- [CINN] BuildCinnPass collect inplace var from all cluster instead op (#50057) · 6d13992e
  Committed by jiangcheng on Jan 29, 2023
  
  6d13992e
- Add the missing ps.proto and remove ps_pb2.py (#50040) · ba67361b
  Committed by sneaxiy on Jan 29, 2023
```
* add missing proto file

* fix windows ci

* fix ci compile error
```
  ba67361b
- [CINN] collect inplace var into cinn op desc's kInplaceVarNames attribute (#49898) · bad49b51
  Committed by jiangcheng on Jan 29, 2023
```
* [CINN] collect inplace var into cinn op desc's kInplaceVarNames attribute

* attr move from op desc to subgraph

* GetFetchIds from var_map instead of var_model_to_program_map_
```
  bad49b51
- Fused attention pass backward pattern (#49855) · 8e02f290
  Committed by Yuang Liu on Jan 29, 2023
  
  8e02f290
18 Jan, 2023 2 commits

Handle repetitive code in oneDNN activation fuse passes (#49824) · a1b2e1e2

Committed by Sławomir Siwek on Jan 18, 2023

* extract fuse pass logic to header file

* adjust namespaces

* Update paddle/fluid/framework/ir/mkldnn/activation_onednn_fuse_pass.h

update date
Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>

* add inline remove static
Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>

a1b2e1e2

fix cinn compilation with py38 (#49883) · bc93452d
Committed by Leo Chen on Jan 18, 2023

bc93452d

17 Jan, 2023 3 commits

Rewrite mat reshape transpose testers (#49580) · d9d47dc6

Committed by Paulina Gacek on Jan 17, 2023

* reshape_transpose_matmul_pass_tester rewritten

* matmul_transpose_reshape_pass_tester rewritten

* mkldnn to onednn

d9d47dc6

support CUDA Graph for new executor (#49708) · 8e5ed04d

Committed by pangyoki on Jan 17, 2023

* new exe supports CUDA Graph

* fix

* fix

* fix

* fix FLAGS_use_stream_safe_cuda_allocator in unittest

* insert output of coalesce_tensor op to skip_gc_var

* fix

8e5ed04d

[PHI]Change feed_op to phi kernel (#49116) · f7f1dc03

Committed by YuanRisheng on Jan 17, 2023

* change feed_op to phi kernel

* fix ci bugs

* fix build bugs

* fix ci bugs

* fix compile bugs

* fix ci bugs

* perfect code

* perfect comment code

* fix install bugs

* modify code according comment

* remove visitor in feed_op

* modify according comment

* perfect code according comment

* add infershape

* fix py3 bugs

* fix getexpected kernel type

* fix getexpected kernel type

* fix ci bugs

* add registry for custom device

* fix py3 bugs

* fix floating point error

* fix py3 test bugs

f7f1dc03

16 Jan, 2023 5 commits
- [inference] Use output var name to mark the NVTX flag (#49825) · ea2e2495
  Committed by Zhang Jun on Jan 16, 2023
```
* add outvar name for nvtx mark

* only network created with kEXPLICIT_BATCH can setsetMaxBatchSize
```
  ea2e2495
- [CINN]Switch cinn GIT_TAG from v0.2 into develop (#49775) · c8187ac7
  Committed by Aurelius84 on Jan 16, 2023
```
* [CINN]Switch cinn GIT_TAG from v0.2 into develop

* fix branch name

* specify commit

* disable unittest

* disable unittest
```
  c8187ac7
- [Paddle-TRT] support nhwc (#49633) · e43f7102
  Committed by Yuanle Liu on Jan 16, 2023
```
* add trt_support_nhwc_pass
```
  e43f7102
- Revert "[static code gen]Add phi and fluid info in static code gen (#49763)" (#49848) · 0355bb90
  Committed by Jiabin Yang on Jan 16, 2023
```
This reverts commit 4d5265b8.
```
  0355bb90
- [static code gen]Add phi and fluid info in static code gen (#49763) · 4d5265b8
  Committed by Charles-hit on Jan 16, 2023
```
* polish static grad op maker gen

* fix some bugs

* fix static code gen

* solve conflict

* modify composite grad maker name
```
  4d5265b8
13 Jan, 2023 2 commits
- add oss flash fmha and fmhca support (#49438) · a48b8e2c
  Committed by Wang Bojun on Jan 13, 2023
```
* add fmha_flashattention oss plugin
```
  a48b8e2c
- remove ps_core dependency (#49716) · 93f20a07
  Committed by LiYuRio on Jan 13, 2023
  
  93f20a07

PaddlePaddle / Paddle synced successfully about 1 year ago