Commits · e89baf91db01dbfc29f3adce88e74757918babd1 · PaddlePaddle / Paddle

18 Feb, 2023 1 commit

[Autogen Prim Operants] Autogen prim eager and static tensor operants (#50558) · e89baf91

Authored by HongyuJia on Feb 18, 2023

* support prim eager codegen

* support prim static codegen

* fix static header file

* polish code style, support arthmetic

* add subtract to yaml

* fix merge conflict


17 Feb, 2023 13 commits
- Rename MultiTensorAdam To FusedAdam (#50449) · e6af9bd2
  Authored by yuehuayingxueluo on Feb 17, 2023
```
* rename multi_tensor_adam to fused_adam

* fix some bugs

* fix CI coverage

* rename test_fused_adam.py

* fix some bug

* add test_fused_adam_op.py

* fix some bugs

* fix fused_adam_op.cc

* fix CI bugs

* fix CI bug

* fix CI bug
```
- upgrade oneDNN to 2.7.3 (#46301) · f803b239
  Authored by Sławomir Siwek on Feb 17, 2023
```
* change SHA

* update to oneDNN 2.7

* update to 2.7.1

* update to 2.7.2

* add supported hardsigmoid

* update to 2.7.3

* limit cpu threads for int8 test

* group activations
```
- [phi decoupling] move platform/transform to phi (#50498) · fe332794
  Authored by Huang Jiyi on Feb 17, 2023
```
* move platform::transform to phi

* fix bugs

* move transform_test to phi

* fix cmake

* update namespace

* fix cmake
```
- [XPU] add multi_encoder_xpu_slice_fuse_pass, generate_sequence_xpu_fuse_pass,... · 61469eec
  Authored by zhupengyang on Feb 17, 2023
```
[XPU] add multi_encoder_xpu_slice_fuse_pass, generate_sequence_xpu_fuse_pass, generate_sequence_xpu kernel (#50570)
```
- delete white op list (#50561) · cea6a7c6
  Authored by xiaoguoguo626807 on Feb 17, 2023
- [Dy2St]Fix re-defined var from merge conflict with two PR (#50597) · 53df9884
  Authored by Aurelius84 on Feb 17, 2023
- Consider kernel argument def for data device transform in standalone Executor (#50471) · af1ace59
  Authored by Ruibiao Chen on Feb 17, 2023
```
* Consider kernel argument def for data device transform in standalone executor

* Fix ALL_BACKEND errors

* Fix CI errors
```
- [bugfix] fix unuseful inputs causes segment error. (#50531) · 3adee6c9
  Authored by xiongkun on Feb 17, 2023
- [CINN] support int8/uint8/int16/uint16 dtype (#50566) · 9e73be65
  Authored by jiangcheng on Feb 17, 2023
- fix ninja error (#49181) · b5d0d8c8
  Authored by risemeup1 on Feb 17, 2023
- fix gcc12 error (#49452) · 6a2da84c
  Authored by risemeup1 on Feb 17, 2023
- [RM FLUID] rm fluid_pslib_init (#50549) · b90c988c
  Authored by wangzhen38 on Feb 17, 2023
```
* [RM FLUID] rm fluid_pslib_init

* [RM FLUID] for ci

* [RM FLUID] for ci
```
- [Dy2St]Remove PE logic in @to_static (#50512) · 4230bd87
  Authored by Aurelius84 on Feb 17, 2023
```
* [Dy2St]Remove PE logic in @to_static

* fix typo

* fix infer_program

* fix typo

* fix op_size
```
16 Feb, 2023 11 commits

Add matmul_v2 and fused_matmul to the quantization process and adjust Ernie model test (#50354) · 8686a745

Authored by joanna.wozna.intel on Feb 16, 2023

* Add matmul_v2 to the quantization process and adjust Ernie model test

* Correct cpu_quantize_pass test

* Move op to fuse transformation to placement pass

* Correct test


Rewrite mkldnn conv bn fuse pass tester (#50034) · e2aacd21

Authored by Hulek on Feb 16, 2023

* New onednn test

* checkopoint

* added new test, fixed issue with onednn bias

* fix bias check

* remove prints, refactor code

* delete old test

* update python tests cmake

* Delete depracated conv bias

* Delete outdated bias from convolution test


Export paddle_proto symbols (#50031) · dd1410d7

Authored by Tomasz Socha on Feb 16, 2023

[XPU][Fleet] Support multi-card infer for xpu (#50490) · 517d8074
Authored by shentanyue on Feb 16, 2023
```
* support xpu multi-card infer

* add ut

* clean code

* clean code

* fix

* fix

* fix

* fix
```

[Tensor Operator] Support add, minus, and divide (#50487) · 3b6ebc9d

Authored by HongyuJia on Feb 16, 2023

* polish namespace

* change static_tensor_operants

* polish namespace

* support add, subtract, divide

* add unit test

* polish unittest

* fix cmake error

* polish unittest


fix cross step sync problem on npu (#50517) · 383a08e1

Authored by Leo Chen on Feb 16, 2023

[XPU] update xccl to 1.0.8 and xdnn to 20230215 (#50247) · b8008580
Authored by houj04 on Feb 16, 2023
```
* [XPU] update xccl to 1.0.8

* update xdnn. add uint8 for concat and split.

* update xdnn to 20230215.
```

[Phi decouple] move layer_norm_kernel.cu.h to phi (#50506) · 8910bb4a

Authored by Huang Jiyi on Feb 16, 2023

* move layer_norm_kernel.cu.h to phi

* fix bugs

* fix namespace

* fix bugs

* fix CI-Windwos

* replace mutable_data

* fix bugs

* fix bugs


[XPU] fix dropout pass; add multi_encoder_xpu_fuse_pass & multi_encoder_xpu kernel (#50499) · c8aa6405

Authored by zhupengyang on Feb 16, 2023


Use StandaloneExecutor in FleetExecutor (#50239) · df207283

Authored by Ruibiao Chen on Feb 16, 2023

* Use StandaloneExecutor in FleetExecutor

* Update FLAGS

* Fix CI errors

* Update code

* Add force_root_scope_vars config

* Update code

* Fix CI errors

* Fix test_layer_new errors


[phi decoupling] remove variable.h in phi (#50407) · 905cefd4

Authored by Huang Jiyi on Feb 16, 2023

* move variable_utils from phi_api_utils to fluid

* fix coment

* update include

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* update

* update

* fix CI-Windows-OpenBLAS

* fix bugs

* fix bugs

* fix bugs

* update include

* move variable_utils to phi_utils

* fix namespace


15 Feb, 2023 10 commits

fix npu save_combine (#50496) · 3c14b38e

Authored by duanyanhui on Feb 15, 2023


Add Cpu tensor cast when amp_type isn't float32 (#50401) · 3d5faa88
Authored by niuliling123 on Feb 15, 2023

make cinn_launch_op run interpretercore in tracing mode to reduce number of threads (#50472) · bf38175e
Authored by Leo Chen on Feb 15, 2023
```
* make cinn_launch_op run interpretercore in tracing mode to reduce number of threads

* skip getWorkqueue in tracing mode
```

Rewrite conv activation mkldnn fuse pass tester (#49278) · 84beef80

Authored by Hulek on Feb 15, 2023

* Done

* Deleted old python test, fixed new python test, changed names in parallel_UT

* Revert parallel UT changes

* Revert parallel UT changes v2

* Review fixes and simplification of conv output shape calculation, disabled sqrt from conv_act_duse_pass

* delete sqrt from possible activations from conv_concat_relu test

* review refactor

* merge main

* delete sqrt from list of compatible activations

* Test with no outdated inputs


delete onednn kernel of feed (#50503) · 8decfb78

Authored by zyfncg on Feb 15, 2023


[PHI Decoupling]Remove Profiler header (Part2) (#50183) · 8fabca11

Authored by YuanRisheng on Feb 15, 2023

* move profiler

* add file

* fix mac compile bugs

* fix ci bugs

* fix mac bugs

* fix ci bugs

* fix compile bugs

* perfect code according comment


fix ninja problem (#50431) · 96006f77

Authored by risemeup1 on Feb 15, 2023


make FusedMultiTransformer supports variable-lengths. (#49560) · 53df50c7

Authored by lzy on Feb 15, 2023

* make FusedMultiTransformer supports variable-lengths.

* modify ffn2 when cuda_version >= 11.6 because of #49392.

* code style

* delete remove_padding


fix some protobuf update problems (#49875) · d84b918b

Authored by risemeup1 on Feb 15, 2023

* Improved prootbuf upgrades

* Improved prootbuf upgrades

* Improved prootbuf upgrades

* limit protobuf version>=3.20.0


[CUSTOM]custom device add black_list (#50409) · 66d3c56e
Authored by YuhangLi on Feb 15, 2023
```
* [CUSTOM]custom device add black_list

* change log level

* fix some issues
```

14 Feb, 2023 5 commits
- Expand mixed_precision to custom device (#50378) · fcb746cb
  Authored by duanyanhui on Feb 14, 2023
```
* expand mix_precision to custom_device

* fix bug

* fix bug

* fix comment

* fix DEFINE bug
```
- [Polish Namespace] Polish operants namespace (#50420) · 61a933ac
  Authored by HongyuJia on Feb 14, 2023
```
* polish namespace

* change static_tensor_operants

* polish namespace
```
- fix windows copysign error (part2) (#50468) · abad724e
  Authored by HongyuJia on Feb 14, 2023
- Decrease usage of GetVecSize for optimizing host computation efficiency (#50353) · 976606fe
  Authored by limingshu on Feb 14, 2023
```
* first commit.

* a little changes

* add some changes for get vec_size efficiently

* fix bugs

---------
Co-authored-by: Nzhangbopd <1299246947@qq.com>
```
- add setvalue trt converter (#50341) · 2548657e
  Authored by xjmxyt on Feb 14, 2023
```
* add cast setvalue op

* add set_value to op teller

* renew test and add description

* add setAxis and add complex test

* change test
```

PaddlePaddle / Paddle synced successfully about 1 year ago
