提交 · 4230bd87ff0ac843851f31cd849b13aa4068e9b2 · PaddlePaddle / Paddle

17 2月, 2023 2 次提交
- A
  [Dy2St]Remove PE logic in @to_static (#50512) · 4230bd87
  由 Aurelius84 提交于 2月 17, 2023
```
* [Dy2St]Remove PE logic in @to_static

* fix typo

* fix infer_program

* fix typo

* fix op_size
```
  4230bd87
- X
  add approve rules (#50534) · bc731487
  由 xiaoguoguo626807 提交于 2月 17, 2023
```
* add approve rules

* add attr
```
  bc731487
16 2月, 2023 24 次提交

Add matmul_v2 and fused_matmul to the quantization process and adjust Ernie model test (#50354) · 8686a745

由 joanna.wozna.intel 提交于 2月 16, 2023

* Add matmul_v2 to the quantization process and adjust Ernie model test

* Correct cpu_quantize_pass test

* Move op to fuse transformation to placement pass

* Correct test

8686a745

[Polist Unittest] Polish test_phi_tensor (#50440) · 6fa29c55

由 HongyuJia 提交于 2月 16, 2023

* fix py::array_t calling bug

* polish test_phi_tensor

* stop fix inference bug in this PR

* polish unittest

* change int->int32_t

* fix unittest

* fix compile error

* modify cmake

* remove redundancy codes

* fix selectedRow unittest

* fix cmake relay

* declare kernel

6fa29c55

W

Add stub for quantization (#50510) · b5809912
由 whs 提交于 2月 16, 2023

b5809912
C

add owner of composite rules (#50525) · 2451841f
由 cyber-pioneer 提交于 2月 16, 2023

2451841f

[dy2static-bugfix] fix backward gradient aggregation bugs (#50474) · d4c7774f

由 xiongkun 提交于 2月 16, 2023

* [dy2static-bugfix] fix backward gradient aggregation bugs
1. Yolov3 and Yolov5 all face the same problem.

* remove set_device

* code review fix

d4c7774f

Rewrite mkldnn conv bn fuse pass tester (#50034) · e2aacd21

由 Hulek 提交于 2月 16, 2023

* New onednn test

* checkopoint

* added new test, fixed issue with onednn bias

* fix bias check

* remove prints, refactor code

* delete old test

* update python tests cmake

* Delete depracated conv bias

* Delete outdated bias from convolution test

e2aacd21

T

Export paddle_proto symbols (#50031) · dd1410d7
由 Tomasz Socha 提交于 2月 16, 2023

dd1410d7
C
Add logspace yaml (#49194) · c284d42a
由 Chen Weihang 提交于 2月 16, 2023
```
* add logspace yaml

* update by comments

* resolve test framework conflicct
```
c284d42a
C

Update trt version to 8.5.3.1 (#50530) · aded3338
由 chalsliu 提交于 2月 16, 2023

aded3338

Add Post-Training Quantization and export function in dygraph mode (#50107) · b7030257

由 whs 提交于 2月 16, 2023

Add PTQ and exporting function
1. Add Post-Training Quantization
1.1 Abstract some functions from QAT to Quantization class
1.2 Add Post-Training Quantization by extending Quantization class
1.3 Add observers for PTQ
1.4 Add unittest for PTQ
2. Add exporting function for QAT and PTQ

b7030257

S
[XPU][Fleet] Support multi-card infer for xpu (#50490) · 517d8074
由 shentanyue 提交于 2月 16, 2023
```
* support xpu multi-card infer

* add ut

* clean code

* clean code

* fix

* fix

* fix

* fix
```
517d8074

[Tensor Operator] Support add, minus, and divide (#50487) · 3b6ebc9d

由 HongyuJia 提交于 2月 16, 2023

* polish namespace

* change static_tensor_operants

* polish namespace

* support add, subtract, divide

* add unit test

* polish unittest

* fix cmake error

* polish unittest

3b6ebc9d

L

fix cross step sync problem on npu (#50517) · 383a08e1
由 Leo Chen 提交于 2月 16, 2023

383a08e1
R

fix gcc12 compile problem (#49423) · 7cc47a1d
由 risemeup1 提交于 2月 16, 2023

7cc47a1d
Z

polish some useless code (#50533) · 16986d6b
由 zyfncg 提交于 2月 16, 2023

16986d6b
A

[API]Support is_tensor() static branch (#50520) · c72e2a15
由 Aurelius84 提交于 2月 16, 2023

c72e2a15
H
[XPU] update xccl to 1.0.8 and xdnn to 20230215 (#50247) · b8008580
由 houj04 提交于 2月 16, 2023
```
* [XPU] update xccl to 1.0.8

* update xdnn. add uint8 for concat and split.

* update xdnn to 20230215.
```
b8008580
R
[XPU] add group_norm, sin, cos, linspace, randint kernels (#50465) · c86a5140
由 ronnywang 提交于 2月 16, 2023
```
* [XPU] add group_norm kernel

* update

* add xpu sin, cos, randint, linspace kernels

* update

* update
```
c86a5140

[Phi decouple] move layer_norm_kernel.cu.h to phi (#50506) · 8910bb4a

由 Huang Jiyi 提交于 2月 16, 2023

* move layer_norm_kernel.cu.h to phi

* fix bugs

* fix namespace

* fix bugs

* fix CI-Windwos

* replace mutable_data

* fix bugs

* fix bugs

8910bb4a

Z

[XPU] fix dropout pass; add multi_encoder_xpu_fuse_pass & multi_encoder_xpu kernel (#50499) · c8aa6405
由 zhupengyang 提交于 2月 16, 2023

c8aa6405

Use StandaloneExecutor in FleetExecutor (#50239) · df207283

由 Ruibiao Chen 提交于 2月 16, 2023

* Use StandaloneExecutor in FleetExecutor

* Update FLAGS

* Fix CI errors

* Update code

* Add force_root_scope_vars config

* Update code

* Fix CI errors

* Fix test_layer_new errors

df207283

[phi decoupling] remove variable.h in phi (#50407) · 905cefd4

由 Huang Jiyi 提交于 2月 16, 2023

* move variable_utils from phi_api_utils to fluid

* fix coment

* update include

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* update

* update

* fix CI-Windows-OpenBLAS

* fix bugs

* fix bugs

* fix bugs

* update include

* move variable_utils to phi_utils

* fix namespace

905cefd4

姜
disable deprecated ops dygraph tests (#50521) · df0ed4d6
由姜永久提交于 2月 16, 2023
```
* disable unewanted dygraph tests

* mine_hard_exa
```
df0ed4d6

Add mean composite rule (#50298) · f7f67b72

由 zqw_1997 提交于 2月 16, 2023

* beta

* small commit

* add batch_norm composite rule

move composite test case

remove unuseful var

add composite op blacklist

* small change v2

* finish the test_composite_mean and test_composite_mean_grad

* add ops assertion to the tests

* add cinn test

* fix the error and inappropriate usage in func: mean_composite

* remove the ref of outer lib in primtives.py

* modify sample code of reduce_sum

* fix composite mean op map

* modify testcases to test more float type

* remove cpu float16 test

* cinn test fix

* remove reduce_max

* change the name sum to sum_x

* change the use of reduce_sum to sum

---------
Co-authored-by: Ncyber-pioneer <chenzhuo@tju.edu.cn>

f7f67b72

15 2月, 2023 14 次提交

D

fix npu save_combine (#50496) · 3c14b38e
由 duanyanhui 提交于 2月 15, 2023

3c14b38e
N

Add Cpu tensor cast when amp_type isn't float32 (#50401) · 3d5faa88
由 niuliling123 提交于 2月 15, 2023

3d5faa88
L
make cinn_launch_op run interpretercore in tracing mode to reduce number of threads (#50472) · bf38175e
由 Leo Chen 提交于 2月 15, 2023
```
* make cinn_launch_op run interpretercore in tracing mode to reduce number of threads

* skip getWorkqueue in tracing mode
```
bf38175e

Rewrite conv activation mkldnn fuse pass tester (#49278) · 84beef80

由 Hulek 提交于 2月 15, 2023

* Done

* Deleted old python test, fixed new python test, changed names in parallel_UT

* Revert parallel UT changes

* Revert parallel UT changes v2

* Review fixes and simplification of conv output shape calculation, disabled sqrt from conv_act_duse_pass

* delete sqrt from possible activations from conv_concat_relu test

* review refactor

* merge main

* delete sqrt from list of compatible activations

* Test with no outdated inputs

84beef80

align tool (#49865) · 4632ca13

由 xu98bin 提交于 2月 15, 2023

* auto parallel align tool

* modify function get_var's return

* add save and load in align_tool

* modify load function and save function

* add finding different ops in align tool

* full auto parallel align tool

add test file for auto parallel align tool

set timeout for test

modify get_backward_tmp_var function

add annotation for align tool

modify test file

modify code to restart CI

remove timeout

* set timeout

4632ca13

W

[gpups update] add gpups ci log print (#50522) · 41902dda
由 wangzhen38 提交于 2月 15, 2023

41902dda

fix composite op map (#50397) · ff86aeab

由 cyber-pioneer 提交于 2月 15, 2023

* map output from composite rule to origin op

add mean layer_norm dropout op map

add input map check

composite softmax support input shape []

* composite softmax support shape []

* polish log

* solve conflict

* polish code

* polish op map output

* add check dtype

ff86aeab

Z

delete onednn kernel of feed (#50503) · 8decfb78
由 zyfncg 提交于 2月 15, 2023

8decfb78

[PHI Decoupling]Remove Profiler header (Part2) (#50183) · 8fabca11

由 YuanRisheng 提交于 2月 15, 2023

* move profiler

* add file

* fix mac compile bugs

* fix ci bugs

* fix mac bugs

* fix ci bugs

* fix compile bugs

* perfect code according comment

8fabca11

R

fix ninja problem (#50431) · 96006f77
由 risemeup1 提交于 2月 15, 2023

96006f77
Z

add gather_nd_grad op and where_grad support zero_dim for xpu (#50454) · 055d0c2d
由 zhangyikun02 提交于 2月 15, 2023

055d0c2d
Q

remove duplicated op in xpu2_op_list (#50450) · 47c23ccb
由 QingshuChen 提交于 2月 15, 2023

47c23ccb

make FusedMultiTransformer supports variable-lengths. (#49560) · 53df50c7

由 lzy 提交于 2月 15, 2023

* make FusedMultiTransformer supports variable-lengths.

* modify ffn2 when cuda_version >= 11.6 because of #49392.

* code style

* delete remove_padding

53df50c7

Z
remove incubate.data_generator (#50325) · a3989b5e
由 zqw_1997 提交于 2月 15, 2023
```
* remove incubate.data_generator

* modify the setup.py

* modifyt the setup.py.in
```
a3989b5e

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功