提交 · fb4a6ecfd8216f3cd07c9a99cb29e997e57b8197 · PaddlePaddle / Paddle

18 5月, 2023 8 次提交

Fused elementwises kernels and ops (#51427) · fb4a6ecf

由 Hulek 提交于 5月 18, 2023

* Fused elementwises kernels and ops

* change fuse pass name

* adjust .pbtxt files

* adjust quantization attributes

* add missing arguments and fix others, review fixed

* simplify fused kernel registration

* fix elementwise unit tests

* reuse one fused elementwise op

* adjust proto

* Add supported datatypes

* Change 'Scale' to 'scale' in tests, change some tests to onednn

* Revert breaking changes

* Fix unit tests

* Delete obsolete test cases

* Delete commented out code

* Fix codestyle

* delete temporary condition

* fix conflicts and delete duplicate fusing

* Fix code after merge

* Move tests to new directory

* fix tests volatility

* Rename test_elementwise_add_onednn_op.py to test_elementwise_add_mkldnn_op.py

* Update CMakeLists.txt add mkldnn op test

---------
Co-authored-by: NSilv3S <slawomir.siwek@intel.com>

fb4a6ecf

H

move fusion_group kernel to phi (#53781) · 26da689d
由 huangjiyi 提交于 5月 18, 2023

26da689d
C

Fix typos, test=document_fix (#53927) · e916e80c
由 co63oc 提交于 5月 18, 2023

e916e80c
T
Del test_async_read_write in CPU (#53882) · acb5039a
由 tianshuo78520a 提交于 5月 18, 2023
```
* fix

* fix
```
acb5039a

[Dy2static-Fallback] add set_eval_frame function in pybind. (#52006) · 7b1695af

由 xiongkun 提交于 5月 18, 2023

* [Dy2static-Fallback] add set_eval_frame function in pybind.
1. add set_eval_frame function in pybind.

* add unittest for eval frame hooker.

* [support py38]

* fix-GeneratorExit error in eval frame hooker

* support python == 3.9

* support 3.10

* fix some comments

7b1695af

H

[CustomOp Unittest] Fix XPU unittest, discard static backward (#53899) · 2d0c6948
由 HongyuJia 提交于 5月 18, 2023

2d0c6948

support auto generate for op layer_norm (#53178) · 4f07b653

由 RedContritio 提交于 5月 18, 2023

* simplify layer_norm_op.cc

* support auto generate for op layer_norm

* update unittest for composite_layer_norm

* remove layer_norm_op.cc from scripts

* replace layer_norm_op with generated_op

* add get_expected_kernel for layer_norm

* update cmake kernel register function for layer_norm_mkldnn_op

4f07b653

[AMP]Master grad in static graph (#53362) · 972581d8

由 shaojie_wang 提交于 5月 17, 2023

* add master gradients on static graph

* add unit test for bf16 master grad static graph

* use float16 as v100 test dtype

* only skip GPU which do not support bf16

* use linear layer to test master grad

* 1.push master grad creation before all optimizer ops; 2.remove useless unittest; 3.use a function to create master grad states

972581d8

17 5月, 2023 2 次提交

[IR] Program & Parameter & PaddleDialect (#53557) · 78967ad2

由 zhangbo9674 提交于 5月 17, 2023

* add program parameter dialect_interface

* fix op create bug

* add ir parameter convert pd variable methods

* refine code

* fix bug

* refine by ut

* refine ut

* delete unused code

* refine code

* refine code by comment

* reset WITH_NEW_IR

* refine op attribute map

* refine program and op create

* refine program and op create

78967ad2

6

test(cinn): fix resnet50 precision (#53649) · 56e8affe
由 6clc 提交于 5月 17, 2023

56e8affe

16 5月, 2023 6 次提交
- X
  【static】modify backward prune logic for EmptygradOpMaker (#53746) · 69161a96
  由 xiaoguoguo626807 提交于 5月 16, 2023
```
* add rules

* modify no kernel yaml parse

* success op generate

* success test_silu_double

* modify bug

* modify static error

* modify silu_grad input

* modify kernel signature

* modify kernel signature

* code style

* code style

* review

* delete opinfo modify

* modify gradOpMaker

* modify gradOpMaker

* modify genarated-j2

* add approve rules

* modify aytograd_functional_static_test
```
  69161a96
- S
  
  [XPU] Add sigmoid_elementmul_xpu_fuse_pass (#53580) · 32e36b15
  由 sprouteer 提交于 5月 16, 2023
  
  32e36b15
- N
  
  [AMP] support OD level for static (#53768) · c2c3bd43
  由 niuliling123 提交于 5月 16, 2023
  
  c2c3bd43
- W
  static graph autogen code support for softmax op (#53581) · 312f0187
  由 Wang Xin 提交于 5月 16, 2023
```
* static graph autogen code support for softmax op

* bug fixed

* fix PR-CI-Windows error

* fix CI error

* bug fixed

* fix conflicts
```
  312f0187
- [dygraph]remove legacy code : _in_eager_mode_ and _in_eager_without_dygraph_check() (#53761) · b1333175
  由 meteor135 提交于 5月 16, 2023
```
* remove _in_eager_mode_

* remove _in_eager_mode_
```
  b1333175
- Y
  [AMP] Allow to switch whether to use promote strategy to choose kernel for O2 training. (#53742) · db407bf0
  由 Yiqun Liu 提交于 5月 16, 2023
```
* Allow to switch whether to use promote strategy to choose kernel for O2 training.

* Fix comparing error and add unittest.
```
  db407bf0
15 5月, 2023 6 次提交

[AMP]fix embedding model weight type mismatch error (#53770) · 848deecd

由 shaojie_wang 提交于 5月 15, 2023

* fix embedding model weight type mismatch error

* Update fp16_utils.py

---------
Co-authored-by: NZhang Ting <zhangting_2017@163.com>

848deecd

[inference Zero-Dim][trt] Add Zero-Dim tensor support for clip, cast,... · cc9aedaf

由 bukejiyu 提交于 5月 15, 2023

[inference Zero-Dim][trt] Add Zero-Dim tensor support for clip, cast, flatten_contiguous_range (#53769)

* [inference Zero-Dim][trt]clip,cast,flatten_contiguous_range trt op converter support zero dim

cc9aedaf

Silu double grad (#53605) · 94c38803

由 xiaoguoguo626807 提交于 5月 15, 2023

* add rules

* modify no kernel yaml parse

* success op generate

* success test_silu_double

* modify bug

* modify static error

* modify silu_grad input

* modify kernel signature

* modify kernel signature

* code style

* code style

* review

* delete opinfo modify

94c38803

A

[UnitTest]Fix deprecate fluid.regularizer test=document_fix (#53805) · 3d4d7c19
由 Aurelius84 提交于 5月 15, 2023

3d4d7c19
A

[CI]Fix test_bert_primm_cinn gt loss value (#53796) · 359f43a9
由 Aurelius84 提交于 5月 15, 2023

359f43a9

relocate python/paddle/fluid/regularizer.py (#53106) · 00e415de

由 LoneRanger 提交于 5月 15, 2023

* relocate regularizer.py

* fix bug

* fix bug

* fix bug

* relocate the import

* replace _regularization_coeff with coeff

* remove the L1DecayRegularizer and L2DecayRegularizer

00e415de

13 5月, 2023 1 次提交

Revert elementwise add (#53745) · b75d8c7e

由 xiaoguoguo626807 提交于 5月 13, 2023

* modify concat_grad add sum comp rule

* delete default mul_double_grad

* delete high grad test

* recover yaml

* modify yaml

* recover add_double_grad prim

b75d8c7e

12 5月, 2023 7 次提交
- 6
  test(prim-cinn): split test_resnet and test_bert into three tests (#53723) · 60cf9b50
  由 6clc 提交于 5月 12, 2023
```
* test(prim-cinn): split test_resnet and test_bert into three tests

* test(prim-cinn): fix cmake file to run prim test in CINN-CI
```
  60cf9b50
- H
  
  [XPU] remove clip of c_softmax_with_cross_entropy_op (#53734) · 1019b264
  由 houj04 提交于 5月 12, 2023
  
  1019b264
- W
  
  [Inference] Update switch stream logical. (#53589) · eb97f4f0
  由 Wilber 提交于 5月 12, 2023
  
  eb97f4f0
- Y
  [inference zero dim] softmax, stack op trt converter support zero dim (#53729) · 05d3fc81
  由 Yuanle Liu 提交于 5月 12, 2023
```
* softmax support

* support stack
```
  05d3fc81
- W
  sequence_mask functionalization (#53478) · d2b1e3c2
  由 Wang Xin 提交于 5月 12, 2023
```
* sequence_mask functionalization

* fix sequence_mask test
```
  d2b1e3c2
- A
  Revert "[CINN]Adjust Bert unittest loss ground truth (#53628)" (#53731) · 95ae5d5c
  由 Aurelius84 提交于 5月 12, 2023
```
This reverts commit 45ce0ad5.
```
  95ae5d5c
- X
  【Prim】support higher order autodiff for dy2static+composite (#53171) · b73594b4
  由 Xiaoxu Chen 提交于 5月 12, 2023
```
* [Dy2St]Fix x grad names when high order gradient

* Polish error msg

* Add inputs var to backward in dy2st

* Fix error

* Get grad names for backward API

* Fix save load

* Polish code

* Add ut

* [prim] fix not support optional grad bugs in higher order autodiff

* [prim] remove duplicate fill_any_like caused by infershape_for_composite

* fix _strip_grad_suffix_ bugs in higher-order autodiff

* [prim] create output for test_static_prim.cc

---------
Co-authored-by: N0x45f <wangzhen45@baidu.com>
```
  b73594b4
11 5月, 2023 10 次提交
- X
  [Inference Zero-Dim] Support trt 0dim of gelu, hard_swish, hard_sigmoid and leaky_relu (#53714) · b150b168
  由 xiaoxiaohehe001 提交于 5月 11, 2023
```
* support_act
* delete_silu
```
  b150b168
- Y
  [inference Zero-Dim]prelu trt converter support zero dim tensor (#53634) · 82c73884
  由 Yuanle Liu 提交于 5月 11, 2023
```
* prelu op trt converter support zero dim
```
  82c73884
- Z
  
  [inference][trt]add trt sparse weights switch (#53562) · 4a69a536
  由 Zhang Jun 提交于 5月 11, 2023
  
  4a69a536
- Z
  
  [inference Zero-Dim]add equal, elementwise_op trt 0d (#53704) · 04e5e7b7
  由 Zhang Jun 提交于 5月 11, 2023
  
  04e5e7b7
- K
  move DataLoader code to paddle.io (#48699) · 793f3b93
  由 Kaipeng Deng 提交于 5月 11, 2023
```
* move DataLoader to paddle.io. test=develop
```
  793f3b93
- L
  [XPU][PHI Kernels] add pad op for xpu (#53684) · 6f28eb70
  由 lijin23 提交于 5月 11, 2023
```
* add pad op for xpu

* add pad op for xpu

* add pad op for xpu
```
  6f28eb70
- X
  Revert elementwise (#53663) · b4024aaf
  由 xiaoguoguo626807 提交于 5月 11, 2023
```
* modify concat_grad add sum comp rule

* delete default mul_double_grad

* delete high grad test

* recover yaml

* modify yaml
```
  b4024aaf
- Z
  
  add unitest for reshpe 0 dims (#53685) · 32dae48a
  由 zhoutianzi666 提交于 5月 11, 2023
  
  32dae48a
- G
  [test]mv fluid [controlflow,detection,dlnne,tensorrt] tests to tests (#53470) · 80757527
  由 gouzil 提交于 5月 11, 2023
```
* [test]mv fluid controlflow detection dlnne tensorrt tests to tests

* [test]clean dlnne

* [test] fix test_tensorrt_engine_op

* [test] try fix path error

* [test] RollBACK test_tensorrt_engine_op

* [test] RollBACK test_tensorrt_engine_op

* [test]add todo

* Empty-Commit; test=document_fix
```
  80757527
- X
  [Paddle-Inference] Support trt 0dims of expand_as_v2 and mish. (#53627) · aebff6d7
  由 xiaoxiaohehe001 提交于 5月 11, 2023
```
* support_expand_mish
```
  aebff6d7

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功