提交 · 6d70761e63221f6b0c035bbc2e3141afc9d6e9c6 · PaddlePaddle / Paddle

04 2月, 2023 1 次提交
- H
  Add Some Default Parameters to CINN Interface for Country Standard (#50182) · fb69204f
  由 Huihuang Zheng 提交于 2月 04, 2023
```
As the title
```
  fb69204f
03 2月, 2023 8 次提交

Replace matmul(v2) with fused_matmul during oneDNN fuse passes (#49515) · 5cfe1645

由 Sławomir Siwek 提交于 2月 03, 2023

* replace matmul with matmul_v2 in fuse passes

* Remove fusion logic from matmul

* removing fusion methods

* add proper name

* adjust namespaces

* clean attrs in python tests

* delete checkpoint and restore matmul version

* remove unused code

* matmul and reshape/transpose fuses migrated

* split MatmulOneDNN headers

* fuse activation and eltwise_add

* add fuse_activation

* matmul_transpose_reshape/reshape_transpose_matmul

* matmul + elementwise_add (fused)

* activation temporary modifciation

* merge newest develop

* remove depedency from other PR

* revert pbtxt

* remove placeholders from matmul_v2

* add description in OPMaker

* remove matmul_v2_op.h and all depedencies

* remove dims changing in base op

* add possibility to fuse already fused_matmul

* restart broken CI

* Empty-Commit

* revert matmul_utils.h

* codestyle

* adjust imports

* add pbtxt file

* 100% matmul unit tests coverage

* trigger CI with minimal changes to develop

* adjust changes to develop

* add fused_matmul op

* inherit base ops

* add "v2"

* move OPMaker

* Gradually add fused_matmul files

* second batch of fused_matmul changes

* split infershapes of matmul_v2 and fused_matmul

* inherit fused_matmul from matmul_v2

* Update paddle/phi/backends/onednn/onednn_reuse.h
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

* Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

---------
Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>

5cfe1645

Rewrite conv testers from cpp to python (#49582) · aa8cef4a

由 Paulina Gacek 提交于 2月 03, 2023

* conv_bias_mkldnn_fuse_pass_tester rewritten

* conv_concat_relu_mkldnn_fuse_pass_tester rewritten

* conv_elementwise_add_fuse_pass_tester rewritten

* mkldnn changed to onednn

* tests added to cmakeLists, style fix

* got rid of unnecessary UT, some style changes

* changes in naming convention

* max_examples reduced

* time out added

aa8cef4a

R

Fix div 0 error of case20: paddle.min (#50013) · 50c43dd3
由 RedContritio 提交于 2月 03, 2023

50c43dd3

Generate some static graph ops (#49906) · 85490f70

由 HappyHeavyRain 提交于 2月 03, 2023

* generate some static graph ops

* fix the bug of pow

* add REGISTER_ACTIVATION_OP in operators.cmake

* modify the file operators.cmake

85490f70

Y

Fused attention pass backward op replace. (#50186) · 7e8ef328
由 Yuang Liu 提交于 2月 03, 2023

7e8ef328
R
Reduce time cost of BuildOpHappensBefore (#50137) · 6b151c0e
由 Ruibiao Chen 提交于 2月 03, 2023
```
* Reduce time cost of BuildOpHappensBefore

* Update code

* Update code

* Improve data struct
```
6b151c0e
J
【Prim】optimize log (#50160) · 80310541
由 Jiabin Yang 提交于 2月 03, 2023
```
* optimize log

* fix type error

* fix type error2
```
80310541

【Prim】Blacklist bwd comp (#50148) · cc8a7858

由 Jiabin Yang 提交于 2月 03, 2023

* refactor dir for prim

* support blacklist for bwd comp

* fix type error

* remove additional file

* fix git ignore

* add more test

* merge develop

cc8a7858

02 2月, 2023 3 次提交

【PRIM】Support use operator's output metadata info in constructing static... · d8643cb6

由 Xiaoxu Chen 提交于 2月 02, 2023

【PRIM】Support use operator's output metadata info  in constructing static backward composite (#50043)

* [prim] support custom target_gradients

* support infershape after append one gradop

* [prim] add simple net test

* fix test_loop segment fault bug

* [prim] fix infer shape segment fault bug when output of grad_op_desc is empty

d8643cb6

Y
[BugFix]Fix bugs when compile with OneDNN (#50096) · 3c557e2f
由 YuanRisheng 提交于 2月 02, 2023
```
* fix bugs

* fix ci bugs
```
3c557e2f
H
jit layer optimzer model param memory usage (#50135) · ec6e0a2c
由 Hui Zhang 提交于 2月 02, 2023
```
* jit layer support multi thread
```
ec6e0a2c

01 2月, 2023 7 次提交
- Y
  
  Fused attention pass fwd, create the fused_attention op. (#50125) · 2b848aef
  由 Yuang Liu 提交于 2月 01, 2023
  
  2b848aef
- R
  Fix div 0 error of case11: paddle.nn.functional.max_pool1d/max_pool2d/max_pool3d (#50010) · 3ab6faa8
  由 RedContritio 提交于 2月 01, 2023
```
* add stride check for MaxPool

* add unittests
```
  3ab6faa8
- W
  Preln fix (#49802) · e03718f5
  由 Wang Bojun 提交于 2月 01, 2023
```
* preln_residual 2 fused_bias_residual

* skip layernorm fix and ut

* code refine

* code style refine

* fix ut

* fix output

* add trt layer fall back info

* refine op teller and ut

* DropoutMaskOut output fix
```
  e03718f5
- H
  jit layer support multi thread and fix predictor clone (#50095) · 9fa2eb38
  由 Hui Zhang 提交于 2月 01, 2023
```
* jit layer support multi thread

* fix bug

* clone prediector not do graph optimizer

* format

* fix comment and format

* fix override and fromat

* fix

* fix
```
  9fa2eb38
- Z
  
  add dynamic shape support for running paddle-trt in calib_mode (#50033) · af673090
  由 zhoutianzi666 提交于 2月 01, 2023
  
  af673090
- L
  
  fix gc and infinite buffer size (#50122) · 3e9d8548
  由 LiYuRio 提交于 2月 01, 2023
  
  3e9d8548
- A
  [PrimCinn]Fix some vars are wrongly gc in CINN+InterpreterCore (#50116) · 9f231147
  由 Aurelius84 提交于 2月 01, 2023
```
* [PrimCinn]Fix some vars are wrongly gc in CINN+InterpreterCore

* fix baseline unittest config

* fix code style
```
  9f231147
31 1月, 2023 15 次提交
- W
  gn_silu (#49928) · 111075a3
  由 wenbin 提交于 1月 31, 2023
```
* gn_silu

* add ut

* set TIMEOUT

* correct comments

* comments

* disable windows ut

* rename parameter
```
  111075a3
- W
  Unary (#49914) · 0d9185b9
  由 wenbin 提交于 1月 31, 2023
```
* disable integer

* disable integer

* add cast layer
```
  0d9185b9
- Z
  
  [pass] Upgrade Constant Folding Pass (#49908) · c3cd8502
  由 Zhang Jun 提交于 1月 31, 2023
  
  c3cd8502
- N
  
  Save nan log to file when output_dir is setted (#49200) · c18fddd3
  由 niuliling123 提交于 1月 31, 2023
  
  c18fddd3
- C
  Integrate static code gen info (#49858) · 0e51f398
  由 Charles-hit 提交于 1月 31, 2023
```
* polish static grad op maker gen

* fix some bugs

* fix static code gen

* solve conflict

* modify composite grad maker name

* integrate phi and fluid info in static code gen

* rename some composite maker

* modify static code gen format
```
  0e51f398
- Z
  
  [inference][trt] add elementwise input data type check (#49675) · 5822e15c
  由 Zhang Jun 提交于 1月 31, 2023
  
  5822e15c
- P
  [Numpy] Add FP16 dtype for CastNumpy2Scalar (#50002) · 86a23818
  由 PuQing 提交于 1月 31, 2023
```
* add FP16 dtype for CastNumpy2Scalar

* fix throw message

* add test

* fix SyntaxWarning

* test skip for float16

* fix dtype mistakes
```
  86a23818
- R
  Add unified device management api (#48651) · 7aaaa1c6
  由 ronnywang 提交于 1月 31, 2023
```
* [CustomDevice] add custom device api

* update

* update

* test=document_fix

* update

* update

* add  examples
```
  7aaaa1c6
- R
  
  fix send start msg (#50085) · 1048b166
  由 Roc 提交于 1月 31, 2023
  
  1048b166
- Y
  
  [Paddle Inference] change the default values of some gflags (#50074) · a1f28a48
  由 Yuanle Liu 提交于 1月 31, 2023
  
  a1f28a48
- T
  support inplaced variable in cinn_launch (#49912) · 754ab705
  由 TeFeng Chen 提交于 1月 31, 2023
```
* support inplaced variable in cinn_launch

* fix error hint when compiling

* fix inplaced output variable of the subgraph

* skip CinnCompiler check

* using existed definition

* fix namespace reference error

* modify error message

* update cinn tage

* fix namespace

* skip enforce check

* fix unittest attribute throw
```
  754ab705
- P
  
  change no_event GC to fast GC for xpu (#49871) · eba7b584
  由 pangyoki 提交于 1月 31, 2023
  
  eba7b584
- H
  [Decouple phi] Decouple custom_op in fluid and phi (#49866) · 48b3e869
  由 HongyuJia 提交于 1月 31, 2023
```
* decouple phi custom_op

* decouple phi custom_op, remove codes

* delete custom symbol of inference
```
  48b3e869
- L
  
  add multi fetch (#50070) · a8078bbd
  由 LiYuRio 提交于 1月 31, 2023
  
  a8078bbd
- 姜
  rm flags retain grad in pybind (#49888) · 9c3a35b9
  由姜永久提交于 1月 31, 2023
```
* rm flags_retain grad in pybind

* retain grads for xpu test

* set retain grad for xpu

* rm flag

* lint

---------
Co-authored-by: Nwanghuancoder <wanghuan29@baidu.com>
```
  9c3a35b9
30 1月, 2023 5 次提交

J

[CINN] fix build_cinn_pass collect inplace var bug (#50072) · ac84dce9
由 jiangcheng 提交于 1月 30, 2023

ac84dce9
E
add phi tensor vector array api from fluid (#49885) · 094e3b8c
由 engineer1109 提交于 1月 30, 2023
```
replace all TensorFromVector & TensorToVector

AssignKernel async copy
```
094e3b8c

Support stream priority for standalone executor (#49939) · 172d1de6

由 Ruibiao Chen 提交于 1月 30, 2023

* Support stream priority for standalone executor

* Fix compile error

* Fix compile error

* Fix compile error

* Fix compile error

* Fix compile error

172d1de6

[Pglbox2.0] merge gpugraph to develop (#49946) · cb525d4e

由 zmxdream 提交于 1月 30, 2023

* add set slot_num for psgpuwraper (#177)

* add set slot_num_for_pull_feature for psgpuwarper

* Add get_epoch_finish python interface (#182)

* add get_epoch_finish interface

* add return

* delete return

* add unzip op (#183)

* fix miss key for error dataset (#186)

* fix miss key for error dataset

* fix miss key for error dataset
Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>

* add excluded_train_pair and infer_node_type (#187)

* support return of degree (#188)

* fix task stuck in barrier (#189)
Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>

* check node/feature format when loading (#190)

* check node&feature format when loading

* check node&feature format when loading (2£ (2)

* degrade log (#191)

* [PGLBOX]fix conflict

* [PGLBOX]fix conflict

* [PGLBOX]replace LodTensor with phi::DenseTensor

* [PGLBOX]fix gpu_primitives.h include path

* [PGLBOX]from platform::PADDLE_CUDA_NUM_THREADS to phi::PADDLE_CUDA_NUM_THREADS

* [PGLBOX]fix unzip example code

* [PGLBOX]fix unzip example code

* [PGLBOX]fix unzip example code

* [PGLBOX]fix unzip example code

* [PGLBOX]fix unzip ut

* [PGLBOX]fix unzip ut

* [PGLBOX]fix code style

* [PGLBOX]fix code style

* [PGLBOX]fix code style

* fix code style

* fix code style

* fix unzip ut

* fix unzip ut

* fix unzip ut

* fix unzip

* fix code stype

* add ut

* add c++ ut & fix train_mode_ set

* fix load into memory

* fix c++ ut

* fix c++ ut

* fix c++ ut

* fix c++ ut

* fix code style

* fix collective

* fix unzip_op.cc

* fix barrier

* fix code style

* fix barrier

* fix barrier

* fix code styple

* fix unzip

* add unzip.py

* add unzip.py

* fix unzip.py

---------
Co-authored-by: Nchao9527 <33347532+chao9527@users.noreply.github.com>
Co-authored-by: NSiming Dai <908660116@qq.com>
Co-authored-by: Nhuwei02 <53012141+huwei02@users.noreply.github.com>
Co-authored-by: Nyangjunchao <yangjunchao@baidu.com>

cb525d4e

G

depthwise_conv 映射成 conv的逻辑中添加下cudnn版本的判断 (#50058) · 320958eb
由 gem5 提交于 1月 30, 2023

320958eb

29 1月, 2023 1 次提交
- J
  
  [CINN] BuildCinnPass collect inplace var from all cluster instead op (#50057) · 6d13992e
  由 jiangcheng 提交于 1月 29, 2023
  
  6d13992e

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功