提交 · 2bc91cc5c44558be0bb000f8cdf8301ed1a6de5e · 机器未来 / Paddle

15 2月, 2022 2 次提交

[Paddle-Inference] support preln_ernie: add... · 2bc91cc5

由 Wangzheee 提交于 2月 15, 2022

[Paddle-Inference] support preln_ernie: add preln_embedding_eltwise_layernorm_fuse_pass, preln_skip_layernorm_fuse_pass (#39508)

* support preln_ernie

* support preln_ernie

2bc91cc5

[PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404

由 Aurelius84 提交于 2月 15, 2022

* #1 migrate dist-related type()-> dtype()

* move datatype function from pten -> fluid/framework

* change type() in imperative into convert(dtype())

* modify xx_tensor->type into xx_tensor->dtype

* change the set_type interface and the caller

* modify xx_tensor.type into xx_tensor.dtype

* fix mutable_data(place, dtype())

* change caller of mutable_data in pten and distributed

* change the caller of mutable_data in fluid/framework

* change the caller of mutable_data in imperative directory

* mutable_data: inference

* update the call of mutable_data

* transfer MakePenScalarArray MakePtenScalar ResetHolderWithType

* pass the compile. the next step is remove VarType in Pten

* fix all and remove VarType from pten. success in linux. Next task is other platform

* fix conflict with develop

* fix compiled error

* Fix reset conversion

* fix conflict

* fix compiled problem

* fix typo

* Fix << in tensor_utils.cc

* fix type->dtype

* fix unittest

* fix tensor init constructor

* fix DataTypeSize for BFloat16

* fix code style

* fix npu compiled error

* fix npu

* compile npu sucessfully

* fix conflict

* fix conflict
Co-authored-by: Nxiongkun <xiongkun03@baidu.com>

7e7e9404

14 2月, 2022 1 次提交

[Bug fix] prevent squashing pair u8 dequantize -> s8 quantize (#39346) · 66b5348e

由 Sylwester Fraczek 提交于 2月 14, 2022

* prevent squashing pair u8 dequantize -> s8 quantize

* add relu op to check for uint8

* fix ptq fc attr name fuse_activation->activation_type

* fix

* add unit test

* remove unused variable

* test fix unsuccessful

* fix test and logic

* multiline comment

* remove cout

* Revert "fix ptq fc attr name fuse_activation->activation_type"

This reverts commit ffd023353a5e9b0fd15e81b9e9f9fe1794035017.

* fix ptq fc attr name fuse_activation->activation_type

66b5348e

11 2月, 2022 2 次提交

F
[Pten] move operators/math/math_function_* to pten/kernels/func (#39300) · d25a7f9e
由 Feiyu Chan 提交于 2月 11, 2022
```
* move operators/math/math_function_* to pten/kernels/func
* namespace from `paddle::operators::math` to `pten::funcs`
```
d25a7f9e

[Paddle Inference] support ernie quant model with interleaved (#39424) · 1c44d3e2

由 Wangzheee 提交于 2月 11, 2022

* support ernie quant model with interleaved

* support ernie quant model with interleaved

* support ernie quant model with interleaved

* support ernie quant model with interleaved

* support ernie quant model with interleaved

* support ernie quant model with interleaved

* support ernie quant model with interleaved

1c44d3e2

10 2月, 2022 1 次提交

share MemOptVarInfos of external variables into cinn_launch subgraph (#39209) · 35b03e1c

由 TeFeng Chen 提交于 2月 10, 2022

* add a graph pass to share MemOptVarInfos of external variables into subgraph

* update pass name

* fix compile failed

* add share_mem_opt_info_to_subgraph_pass test

* share_mem_opt_info_to_subgraph_pass_test pass

* modify some codes for better style and more robust

* update cmake

35b03e1c

09 2月, 2022 1 次提交

[Paddle-Inference] rebuild matmul pass: trt and gpu_cpu (#39369) · db7d129e

由 Wangzheee 提交于 2月 09, 2022

* rebuild matmul pass: trt and gpu_cpu

* rebuild matmul pass: trt and gpu_cpu

* rebuild matmul pass: trt and gpu_cpu

* rebuild matmul pass: trt and gpu_cpu

db7d129e

08 2月, 2022 3 次提交

Add FuseOptimizerPass and test_dist_fuse_adam_pass unittest. (#39208) · ccdcfa2d

由 hlygit66666 提交于 2月 08, 2022

* add fuse_relu_depthwise_conv_pass unittest

* fix atol and rtol

* fix according to review

* Add FuseOptimizerPass and fuse_adam_pass unittest

* add sgd and momentum unittest

* add fuse_optimizer_pass

* close amp

* close amp

* update

* fix run on two cards

* Update test_dist_fuse_adam_pass.py

* Update test_dist_fuse_momentum_pass.py

* Update test_dist_fuse_sgd_pass.py

* Create test_dist_fuse_sgd_pass.py

* Create test_dist_fuse_sgd_pass.py

* Create test_dist_fuse_sgd_pass.py

* Update test_dist_fuse_adam_pass.py

* Update test_dist_fuse_momentum_pass.py

* Update test_dist_fuse_sgd_pass.py

ccdcfa2d

J
[Bug fix] Fixed handling of one of the cases in the quantization process (#39342) · e4d475ea
由 joanna.wozna.intel 提交于 2月 08, 2022
```
* Fix quantization next op findings

* Corrections according to the review
```
e4d475ea

[PTen] Support SelectedRows in execution and remove scale OpKernel and InferShape (#39351) · 41eb2595

由 Chen Weihang 提交于 2月 08, 2022

* adapt selectedrows in execution

* impl selected rows branch

* support selectedrow in infershape utils

* fix device compile failed

* fix new exe test failed

* revert some changes

41eb2595

02 2月, 2022 1 次提交
- Z
  
  Fix fc_mkldnn format issue (#38890) · 633c71c2
  由 Zuza 提交于 2月 02, 2022
  
  633c71c2
28 1月, 2022 1 次提交
- W
  compile fix (#39272) · 91dd0f0d
  由 wenbin 提交于 1月 28, 2022
```
* slice

* shuffle pass enhancement
```
  91dd0f0d
27 1月, 2022 1 次提交
- W
  fix shuffle_channel_detect_pass (#39242) · af9ddeb7
  由 wenbin 提交于 1月 27, 2022
```
* shuffle channel pass

* add ut

* timeout fix

* makefile fix
```
  af9ddeb7
26 1月, 2022 1 次提交

[IPU] sync misc changes 01 (#38876) · 4efbebea

由 Allen Guo 提交于 1月 26, 2022

* sync misc changes

* apply comments 01

* fix compile error

* remove is_ipu_place check

* add authors
Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai>
Co-authored-by: NAllen Guo <alleng@graphcore.ai>
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai>
Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>

* sync changes

* restore cmake

* update ir cmake and setup.py

* update inference_lib cmake

* split PR
Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai>
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai>
Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>

4efbebea

24 1月, 2022 1 次提交
- S
  
  fix test allreduce tests (#39166) · c00303ec
  由 sneaxiy 提交于 1月 24, 2022
  
  c00303ec
18 1月, 2022 2 次提交

Mish FP32/BF16 kernel, conv and fc fuse passes (#38623) · 1d18bc2c

由 Sławomir Siwek 提交于 1月 18, 2022

* Mish

* Change exp() library

* mish fuse pass

* mish attrs

* fixes

* mishop maker

* remove attrs

* mish kernal for bf16

* fc+mish fuse

* fix code format error

* Resolve merge conflicts

* Update mish operator version

* update mish variable to new naming convention

1d18bc2c

[Unify Tensors PR ] Merged Tensor into DenseTensor, test=allcases (#38914) · 2052f1e3

由 Zhanlue Yang 提交于 1月 18, 2022

* Merged LoDTensor with Tensor,test=allcases

* Patched python level LoDTensor

* Patched python level LoDTensor

* Merge Tensor into DenseTensor

* Fixed namespace issues,test=allcases

* Fixed merge issues

* Fixed inference issues

* Fixed NPU test issues

* Fixed merge issues

2052f1e3

17 1月, 2022 4 次提交

[Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5

由 Wilber 提交于 1月 17, 2022

* add pten::Place data structure.

* update ci problem

* fix ci problem

* update

* using platform::Place=pten::Place

* remove BOOST_GET_CONST for CPUPlace and GPUPlace

* compile pass 25%.

* compile pass 45%

* compile pass 60%

* remove boost_get for xpu npu mlu and ipu

* compile pass on cpu and gpu.

* fix compile problem

* fix compile error.

* update

* fix ci problem

* update

* ci approve

* fix ci problem

* fix ci eager test problem

* remove BOOST_GET_CONST

* fix npu compile

c48a9ad5

add TransferCastOpPass, DeleteScaleOpPass (#38985) · 1006383b

由 Allen Guo 提交于 1月 17, 2022

Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai>
Co-authored-by: NAllen Guo <alleng@graphcore.ai>
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai>
Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>
Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai>
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai>
Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>

1006383b

[IPU] update ipu releated passes p0 (#38846) · 84f257bd

由 Allen Guo 提交于 1月 17, 2022

* update ipu releated passes
Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai>
Co-authored-by: NAllen Guo <alleng@graphcore.ai>
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai>
Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>

* remove ipu_pass_base

* update error msg

* update error msg 02

* split pr 01

* restore ipu_pass_base
Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai>
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai>
Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>

84f257bd

S
Add NoReduce mode for ParallelExecutor (#38969) · e50d883e
由 sneaxiy 提交于 1月 17, 2022
```
* add no reduce mode for pe

* add NoReduce ut
```
e50d883e

15 1月, 2022 1 次提交

[Unify Tensors PR #7] Merged LoDTensor with Tensor, test=allcases (#38880) · 88966b28

由 Zhanlue Yang 提交于 1月 15, 2022

* Merged LoDTensor with Tensor,test=allcases

* Patched python level LoDTensor

* Fixed example code failure

* Polished function names, removed duplicated forward declarations

88966b28

12 1月, 2022 2 次提交

[IPU] add more ops (#38831) · 050fd168

由 Allen Guo 提交于 1月 12, 2022

* support more ops

* Co-authored-by: Xiaobing Wang <xiaobingw@graphcore.ai>
Co-authored-by: NAllen Guo <alleng@graphcore.ai>
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai>
Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>

* add authors
Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai>
Co-authored-by: NAllen Guo <alleng@graphcore.ai>
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai>
Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>

* update date
Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai>
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai>
Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>

050fd168

S
Fix conv act int8 scale (#38331) · 4825addd
由 Sylwester Fraczek 提交于 1月 12, 2022
```
* fix conv act int8 scale

* add unit test for conv+hard_swish
```
4825addd

05 1月, 2022 2 次提交
- J
  
  Add input data type checking in BF16 placement pass (#38702) · 60c51de5
  由 joanna.wozna.intel 提交于 1月 05, 2022
  
  60c51de5
- J
  Quantize nearest_interp and nearest_interp_v2 (#38622) · 1456b02d
  由 joanna.wozna.intel 提交于 1月 05, 2022
```
* Quantize nearest_interp and nearest_interp_v2

* Check if avx_core supported

* Add depthwise_conv2d to supported quantization list
```
  1456b02d
31 12月, 2021 1 次提交

add mul_gru_fuse_pass ut (#37772) · bc827307

由 baoachun 提交于 12月 31, 2021

* add mul_gru_fuse_pass ut

* update ut

* update ut

* update ut timeout setting

* update ut

bc827307

30 12月, 2021 2 次提交
- J
  
  Refactor cpu_quantize_pass (#38019) · 1fa6900e
  由 joanna.wozna.intel 提交于 12月 30, 2021
  
  1fa6900e
- Y
  [Auto parallel] Make sure the id semantics of every var and op unique (#38132) · 5620214e
  由 Yulong Ao 提交于 12月 30, 2021
```
* [Auto parallel] Make the id of var and op unique

* [Auto Parallel] Rename back dist_context to distop_context
```
  5620214e
28 12月, 2021 1 次提交

add mul_lstm_fuse_pass ut (#37795) · 1db61c3e

由 baoachun 提交于 12月 28, 2021

* add mul_lstm_fuse_pass ut

* update mul_lstm_fuse_pass ut

* update ut

* update ut

* update ut

* add CPU ut cmake setting

* update ut

1db61c3e

27 12月, 2021 1 次提交
- B
  
  add attr check for infer in batch_norm_act mkldnn fuse pass (#38443) · 04527ee3
  由 baoachun 提交于 12月 27, 2021
  
  04527ee3
24 12月, 2021 1 次提交

add conv+hard_sigmoid and conv+hard_swish fuse pass ut (#37553) · a858326a

由 baoachun 提交于 12月 24, 2021

* add conv+hard_sigmoid fuse pass ut

* update conv_elementwise_add_mkldnn_fuse_pass ut

* update conv_hard_sigmoid_mkldnn_fuse_pass ut

* update conv+hard_sigmoid and conv+hard_swish fuse pass ut

* update ut

* update ut

a858326a

23 12月, 2021 2 次提交

add mkldnn conv_elementwise_add_mkldnn_fuse_pass ut (#37612) · f88065d3

由 baoachun 提交于 12月 23, 2021

* add mkldnn conv_elementwise_add_mkldnn_fuse_pass ut

* update mkldnn conv_elementwise_add_mkldnn_fuse_pass ut

* update conv_elementwise_add_mkldnn_fuse_pass ut

* update conv_elementwise_add_mkldnn_fuse_pass ut

* update conv_elementwise_add_mkldnn_fuse_pass ut

* restrict conv2d data_format in conv_elementwise_add_mkldnn_fuse_pass

* update conv_elementwise_add_mkldnn_fuse_pass OpCompat

* update conv_elementwise_add_mkldnn_fuse_pass ut

* update ut

f88065d3

Add unittest for flatten2_matmul squeeze2_matmul reshape2_matmul pass (#37644) · aa059885

由 heliqi 提交于 12月 23, 2021

* add flatten2_matmul squeeze2_matmul reshape2_matmul test case

* modify skip func to ignore_pass_case func

* rebuild CI

* add test_xx_matmul_fuse_pass timeout

* add test_map_xx_pass timeout

* add max_duration of test cast

* add trt skip

* add timeout

* del commented code

aa059885

22 12月, 2021 3 次提交

add mkldnn reshape_transpose_matmul fuse pass ut and op version check (#37468) · 274b135b

由 baoachun 提交于 12月 22, 2021

* add mkldnn reshape_transpose_matmul fuse pass ut and op version check

* update reshape_transpose_matmul_mkldnn_fuse_pass ut

* update ut

274b135b

update mkldnn batch_norm_activation fuse pass ut (#37402) · 3d7e737c

由 baoachun 提交于 12月 22, 2021

* update mkldnn batch_norm_activation fuse pass ut

* update ut

* update mkldnn batch_norm_act_fuse_pass ut

* update batch_norm_act_fuse_pass ut

* update ut

3d7e737c

W
CE fix (#38324) · 90e9a486
由 wenbin 提交于 12月 22, 2021
```
* CE fix

* more format
```
90e9a486

21 12月, 2021 3 次提交
- B
  update seqconv_eltadd_relu_fuse_pass ut (#37907) · 4e578c2b
  由 baoachun 提交于 12月 21, 2021
```
* update seqconv_eltadd_relu_fuse_pass ut

* update ut

* update ut

* update ut
```
  4e578c2b
- B
  update squared_mat_sub_fuse_pass ut (#37838) · aadc8674
  由 baoachun 提交于 12月 21, 2021
```
* update squared_mat_sub_fuse_pass ut

* update ut

* update ut
```
  aadc8674
- B
  add seqpool_cvm_concat_fuse_pass ut (#37902) · 06cf314a
  由 baoachun 提交于 12月 21, 2021
```
* add seqpool_cvm_concat_fuse_pass ut

* rename ut name
```
  06cf314a

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致