提交 · 4230bd87ff0ac843851f31cd849b13aa4068e9b2 · PaddlePaddle / Paddle

17 2月, 2023 1 次提交
- A
  [Dy2St]Remove PE logic in @to_static (#50512) · 4230bd87
  由 Aurelius84 提交于 2月 17, 2023
```
* [Dy2St]Remove PE logic in @to_static

* fix typo

* fix infer_program

* fix typo

* fix op_size
```
  4230bd87
16 2月, 2023 16 次提交

Add matmul_v2 and fused_matmul to the quantization process and adjust Ernie model test (#50354) · 8686a745

由 joanna.wozna.intel 提交于 2月 16, 2023

* Add matmul_v2 to the quantization process and adjust Ernie model test

* Correct cpu_quantize_pass test

* Move op to fuse transformation to placement pass

* Correct test

8686a745

[Polist Unittest] Polish test_phi_tensor (#50440) · 6fa29c55

由 HongyuJia 提交于 2月 16, 2023

* fix py::array_t calling bug

* polish test_phi_tensor

* stop fix inference bug in this PR

* polish unittest

* change int->int32_t

* fix unittest

* fix compile error

* modify cmake

* remove redundancy codes

* fix selectedRow unittest

* fix cmake relay

* declare kernel

6fa29c55

[dy2static-bugfix] fix backward gradient aggregation bugs (#50474) · d4c7774f

由 xiongkun 提交于 2月 16, 2023

* [dy2static-bugfix] fix backward gradient aggregation bugs
1. Yolov3 and Yolov5 all face the same problem.

* remove set_device

* code review fix

d4c7774f

Rewrite mkldnn conv bn fuse pass tester (#50034) · e2aacd21

由 Hulek 提交于 2月 16, 2023

* New onednn test

* checkopoint

* added new test, fixed issue with onednn bias

* fix bias check

* remove prints, refactor code

* delete old test

* update python tests cmake

* Delete depracated conv bias

* Delete outdated bias from convolution test

e2aacd21

T

Export paddle_proto symbols (#50031) · dd1410d7
由 Tomasz Socha 提交于 2月 16, 2023

dd1410d7
C
Add logspace yaml (#49194) · c284d42a
由 Chen Weihang 提交于 2月 16, 2023
```
* add logspace yaml

* update by comments

* resolve test framework conflicct
```
c284d42a
S
[XPU][Fleet] Support multi-card infer for xpu (#50490) · 517d8074
由 shentanyue 提交于 2月 16, 2023
```
* support xpu multi-card infer

* add ut

* clean code

* clean code

* fix

* fix

* fix

* fix
```
517d8074

[Tensor Operator] Support add, minus, and divide (#50487) · 3b6ebc9d

由 HongyuJia 提交于 2月 16, 2023

* polish namespace

* change static_tensor_operants

* polish namespace

* support add, subtract, divide

* add unit test

* polish unittest

* fix cmake error

* polish unittest

3b6ebc9d

L

fix cross step sync problem on npu (#50517) · 383a08e1
由 Leo Chen 提交于 2月 16, 2023

383a08e1
Z

polish some useless code (#50533) · 16986d6b
由 zyfncg 提交于 2月 16, 2023

16986d6b
H
[XPU] update xccl to 1.0.8 and xdnn to 20230215 (#50247) · b8008580
由 houj04 提交于 2月 16, 2023
```
* [XPU] update xccl to 1.0.8

* update xdnn. add uint8 for concat and split.

* update xdnn to 20230215.
```
b8008580
R
[XPU] add group_norm, sin, cos, linspace, randint kernels (#50465) · c86a5140
由 ronnywang 提交于 2月 16, 2023
```
* [XPU] add group_norm kernel

* update

* add xpu sin, cos, randint, linspace kernels

* update

* update
```
c86a5140

[Phi decouple] move layer_norm_kernel.cu.h to phi (#50506) · 8910bb4a

由 Huang Jiyi 提交于 2月 16, 2023

* move layer_norm_kernel.cu.h to phi

* fix bugs

* fix namespace

* fix bugs

* fix CI-Windwos

* replace mutable_data

* fix bugs

* fix bugs

8910bb4a

Z

[XPU] fix dropout pass; add multi_encoder_xpu_fuse_pass & multi_encoder_xpu kernel (#50499) · c8aa6405
由 zhupengyang 提交于 2月 16, 2023

c8aa6405

Use StandaloneExecutor in FleetExecutor (#50239) · df207283

由 Ruibiao Chen 提交于 2月 16, 2023

* Use StandaloneExecutor in FleetExecutor

* Update FLAGS

* Fix CI errors

* Update code

* Add force_root_scope_vars config

* Update code

* Fix CI errors

* Fix test_layer_new errors

df207283

[phi decoupling] remove variable.h in phi (#50407) · 905cefd4

由 Huang Jiyi 提交于 2月 16, 2023

* move variable_utils from phi_api_utils to fluid

* fix coment

* update include

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* update

* update

* fix CI-Windows-OpenBLAS

* fix bugs

* fix bugs

* fix bugs

* update include

* move variable_utils to phi_utils

* fix namespace

905cefd4

15 2月, 2023 13 次提交
- D
  
  fix npu save_combine (#50496) · 3c14b38e
  由 duanyanhui 提交于 2月 15, 2023
  
  3c14b38e
- N
  
  Add Cpu tensor cast when amp_type isn't float32 (#50401) · 3d5faa88
  由 niuliling123 提交于 2月 15, 2023
  
  3d5faa88
- L
  make cinn_launch_op run interpretercore in tracing mode to reduce number of threads (#50472) · bf38175e
  由 Leo Chen 提交于 2月 15, 2023
```
* make cinn_launch_op run interpretercore in tracing mode to reduce number of threads

* skip getWorkqueue in tracing mode
```
  bf38175e
- H
  Rewrite conv activation mkldnn fuse pass tester (#49278) · 84beef80
  由 Hulek 提交于 2月 15, 2023
```
* Done

* Deleted old python test, fixed new python test, changed names in parallel_UT

* Revert parallel UT changes

* Revert parallel UT changes v2

* Review fixes and simplification of conv output shape calculation, disabled sqrt from conv_act_duse_pass

* delete sqrt from possible activations from conv_concat_relu test

* review refactor

* merge main

* delete sqrt from list of compatible activations

* Test with no outdated inputs
```
  84beef80
- C
  fix composite op map (#50397) · ff86aeab
  由 cyber-pioneer 提交于 2月 15, 2023
```
* map output from composite rule to origin op

add mean layer_norm dropout op map

add input map check

composite softmax support input shape []

* composite softmax support shape []

* polish log

* solve conflict

* polish code

* polish op map output

* add check dtype
```
  ff86aeab
- Z
  
  delete onednn kernel of feed (#50503) · 8decfb78
  由 zyfncg 提交于 2月 15, 2023
  
  8decfb78
- Y
  [PHI Decoupling]Remove Profiler header (Part2) (#50183) · 8fabca11
  由 YuanRisheng 提交于 2月 15, 2023
```
* move profiler

* add file

* fix mac compile bugs

* fix ci bugs

* fix mac bugs

* fix ci bugs

* fix compile bugs

* perfect code according comment
```
  8fabca11
- R
  
  fix ninja problem (#50431) · 96006f77
  由 risemeup1 提交于 2月 15, 2023
  
  96006f77
- Z
  
  add gather_nd_grad op and where_grad support zero_dim for xpu (#50454) · 055d0c2d
  由 zhangyikun02 提交于 2月 15, 2023
  
  055d0c2d
- Q
  
  remove duplicated op in xpu2_op_list (#50450) · 47c23ccb
  由 QingshuChen 提交于 2月 15, 2023
  
  47c23ccb
- L
  make FusedMultiTransformer supports variable-lengths. (#49560) · 53df50c7
  由 lzy 提交于 2月 15, 2023
```
* make FusedMultiTransformer supports variable-lengths.

* modify ffn2 when cuda_version >= 11.6 because of #49392.

* code style

* delete remove_padding
```
  53df50c7
- R
  fix some protobuf update problems (#49875) · d84b918b
  由 risemeup1 提交于 2月 15, 2023
```
* Improved prootbuf upgrades

* Improved prootbuf upgrades

* Improved prootbuf upgrades

* limit protobuf version>=3.20.0
```
  d84b918b
- Y
  [CUSTOM]custom device add black_list (#50409) · 66d3c56e
  由 YuhangLi 提交于 2月 15, 2023
```
* [CUSTOM]custom device add black_list

* change log level

* fix some issues
```
  66d3c56e
14 2月, 2023 9 次提交
- E
  decouple tensor_utils (#50264) · 057cdb95
  由 engineer1109 提交于 2月 14, 2023
```
fix X

remove TensorCopy

codestyle

add fluid memory header

fix symbol

fix cmake

fix cmake

fix context

fix header

fix place

fix context

fix context

fix context

fix code

fix custom context

fix custom context

fix copy

fix data_transform

fix style

remove changes of custom

fix scalar
```
  057cdb95
- D
  Expand mixed_precision to custom device (#50378) · fcb746cb
  由 duanyanhui 提交于 2月 14, 2023
```
* expand mix_precision to custom_device

* fix bug

* fix bug

* fix comment

* fix DEFINE bug
```
  fcb746cb
- H
  
  fix operants_manager.cc compile error (#50492) · 4a7d9cd8
  由 HongyuJia 提交于 2月 14, 2023
  
  4a7d9cd8
- H
  [Polish Namespace] Polish operants namespace (#50420) · 61a933ac
  由 HongyuJia 提交于 2月 14, 2023
```
* polish namespace

* change static_tensor_operants

* polish namespace
```
  61a933ac
- S
  
  support int8 for embedding (#50413) · 78eb2d87
  由 seemingwang 提交于 2月 14, 2023
  
  78eb2d87
- H
  
  fix windows copysign error (part2) (#50468) · abad724e
  由 HongyuJia 提交于 2月 14, 2023
  
  abad724e
- R
  
  fix failed tests in precise_test (#50406) · 47364149
  由 risemeup1 提交于 2月 14, 2023
  
  47364149
- L
  Decrease usage of GetVecSize for optimizing host computation efficiency (#50353) · 976606fe
  由 limingshu 提交于 2月 14, 2023
```
* first commit.

* a little changes

* add some changes for get vec_size efficiently

* fix bugs

---------
Co-authored-by: Nzhangbopd <1299246947@qq.com>
```
  976606fe
- X
  add setvalue trt converter (#50341) · 2548657e
  由 xjmxyt 提交于 2月 14, 2023
```
* add cast setvalue op

* add set_value to op teller

* renew test and add description

* add setAxis and add complex test

* change test
```
  2548657e
13 2月, 2023 1 次提交
- Z
  Delete axis of fmin kernel (#50358) · 8df8cb10
  由 zyfncg 提交于 2月 13, 2023
```
* delete axis of fmin

* fix bug
```
  8df8cb10

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功