提交 · eda8df7133680ba56e7ab4cc762d265759621a6b · PaddlePaddle / Paddle

06 5月, 2023 4 次提交
- Y
  use int64 to calc dim for c softmax (#53541) · da963eab
  由 Yuang Liu 提交于 5月 06, 2023
```
* use int64 to calc dim for c softmax

* fix complie bug
```
  da963eab
- fix brpc double link (#53512) · 03fe3ce5
  由 zhenhailiu 提交于 5月 06, 2023
```
* polish

* polish

* polish

* polish

* polish

* polish

* polish

* polish

* polish

* polish

* polish
```
  03fe3ce5
- Z
  
  [inference][trt] add reduce_all and reduce_any (#53088) · 12406cad
  由 Zhang Jun 提交于 5月 06, 2023
  
  12406cad
- W
  Add trt pow converter. (#53462) · 5a44bf7e
  由 Wilber 提交于 5月 06, 2023
```
* Add trt pow converter.

* update to use AddConstantLayer

* add dims=0 ut
```
  5a44bf7e
05 5月, 2023 10 次提交
- G
  remove some [-Wunused-parameter]warning (#53397) · 58435ae5
  由 Galaxy1458 提交于 5月 05, 2023
```
* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop
```
  58435ae5
- S
  
  [XPU] Fusion of gather and assign operators to fused_mt op for reducing memory usage (#53262) · 2039115c
  由 shentanyue 提交于 5月 05, 2023
  
  2039115c
- S
  
  [XPU] Fix the out_max of the branch in xpu_conv2d op(#53343) · d27f15ed
  由 sprouteer 提交于 5月 05, 2023
  
  d27f15ed
- H
  
  [XPU] add accuracy phi kernel and delete accuracy fluid kernel (#53508) · 8eb3ce5b
  由 haosicheng 提交于 5月 05, 2023
  
  8eb3ce5b
- X
  【prim】modify the signature of cast_grad for keeping consistent with yaml config. (#53498) · 8c5c03c2
  由 xiaoguoguo626807 提交于 5月 05, 2023
```
* modify concat_grad add sum comp rule

* modify cast
```
  8c5c03c2
- X
  【prim】modify assign api setOutput in by_pass (#53417) · aa887717
  由 xiaoguoguo626807 提交于 5月 05, 2023
```
* modify concat_grad add sum comp rule

* cast and by_pass modify

* only modify by_pass

* modify by_pass
```
  aa887717
- G
  [test]mv fluid *test* to test (#53472) · 0ded3f04
  由 gouzil 提交于 5月 05, 2023
```
* [test]mv fluid *test* to test/cpp/fluid

* [phi] fix link error
```
  0ded3f04
- G
  [test]mv fluid op pscore to test (#53460) · 5cc6a512
  由 gouzil 提交于 5月 05, 2023
```
* [test]mv fluid op pscore to test/cpp/fluid/pscore

* [test]add -faligned-new

* [test] fix brpc link
```
  5cc6a512
- G
  
  [test]mv fluid op fused to test/cpp/fluid/fused (#53434) · 903c5638
  由 gouzil 提交于 5月 05, 2023
  
  903c5638
- G
  
  [test]mv fluid op math to test/cpp/fluid/math (#53446) · decc4c38
  由 gouzil 提交于 5月 05, 2023
  
  decc4c38
04 5月, 2023 4 次提交
- G
  
  mv fluid op elementwise to test/cpp/fluid/elementwise (#53448) · 19950e65
  由 gouzil 提交于 5月 04, 2023
  
  19950e65
- W
  
  Fix a bug in constant folding pass (#53456) · ace61b8b
  由 weishengying 提交于 5月 04, 2023
  
  ace61b8b
- G
  [test]mv fluid op [reader, prim_ops, nccl, reduce_ops, lite] to test (#53429) · 49d7bc5c
  由 gouzil 提交于 5月 04, 2023
```
* [test]mv fluid reader to test/

* [test]mv fluid op prim_ops to test/cpp/fluid/prim_ops

* [test]mv fluid op nccl to /test/cpp/fluid/nccl/

* [test]mv fluid op reduce_ops to test/cpp/fluid/reduce_ops

* [test]mv fluid op lite to test/cpp/fluid/lite

* [test]fix lite

* [test]fix prim op path

* [fluid]clean prim ops cmakelists
```
  49d7bc5c
- Y
  
  tensor should be defined (#53449) · 72e235d0
  由 Yuanle Liu 提交于 5月 04, 2023
  
  72e235d0
30 4月, 2023 1 次提交
- [Zero-Dim] Support paddle.sum/mean/loss api output 0D,test=allcase (#52739) · ddf94ae4
  由 zhouweiwei2014 提交于 4月 30, 2023
  
  ddf94ae4
29 4月, 2023 1 次提交
- G
  [tests]mv fluid benchmark to tests (#53426) · 0c2ab714
  由 gouzil 提交于 4月 29, 2023
```
* [tests]mv fluid benchmark to tests

* [test]Add placeholder

* [test]Add placeholder
```
  0c2ab714
28 4月, 2023 9 次提交

H

[CINN Support 0D-Tensor] CINN hack squeeze2 with trick temporarily (#53454) · 09f8e31d
由 HongyuJia 提交于 4月 28, 2023

09f8e31d

Dropout optimize & clean broadcast inT and ElementwiseType (#52969) · d611e48c

由 Bo Zhang 提交于 4月 28, 2023

* change judgement for DropoutGradGPUKernelDriver

* add UnrollerWithoutVecSize and after this Loaddata to be refined

* pass unittest

* use same unroller with XPU

* BroadcastWithInt64Index

* BroadcastDataLoader template partial specialization

* fix compile errs in ROCms

* clean ElementwiseT and InT for BroadcastKernel

* default axis and clean inT

* remove redundant fast divmod computation

* optimize drop_nd & drop_nd_grad

* optimize BroadcastDataLoader bf16 fp16

* rm InT etc. after merge develop

* delete constexpr for windows ci

* fix conflict

* fix conflic with develop

* fix conflic

* new clean

* clean

d611e48c

G

[test]mv fluid op cinn to test/cpp/fluid/cinn (#53443) · a53ee944
由 gouzil 提交于 4月 28, 2023

a53ee944
H
Support static graph code generation for op edit_distance (#53297) · 396fe483
由 huangjiyi 提交于 4月 28, 2023
```
* update

* fix bug

* support parsing fixed kernel data_type

* update op_compat

* update
```
396fe483
S

Support static graph code-gen for unpool (#52947) · 005fee12
由 Sanbu 提交于 4月 28, 2023

005fee12
J

remove is_npu_pinned_place (#53391) · 4ccbcce5
由 jjyaoao 提交于 4月 28, 2023

4ccbcce5
Z
[inference][trt]trt support 0 dims (#53383) · 64adfe7a
由 Zhang Jun 提交于 4月 28, 2023
```
* trt support 0 dim

* trt support 0 dim

* update activation ut
```
64adfe7a
S

fix c_softmax deterministic (#53419) · f1e3575e
由 sneaxiy 提交于 4月 28, 2023

f1e3575e

【Prim】comp_elementwise_double_grad (first part) (#53385) · 05499c71

由 xiaoguoguo626807 提交于 4月 28, 2023

* add mul doubel grad

* add sub_double_grad

* add add sub high test

* add mutiply test

* modify other unsqueeze

* delete api.yaml

* only for make ci run

* midify unsqueeze

* modify unsqueeze

* tmp

* modify operants gen

05499c71

27 4月, 2023 11 次提交

[phi] Move sequence_pool to phi - Step 3 ：sequence_pool_grad_op (#52680) · fe053396

由 gouzil 提交于 4月 27, 2023

* [phi] move sequence_pool kernel to phi

* mv kernels impl

* fix parameter error

* clean include

* fix compat filename

* [phi] move fluid sequence_pool_grad to phi

* [phi][compat] sig rm GradVarName

* [phi] fix sequence_pool out type

* [phi] rm impl, add const string

* [phi] fix const str

* fix sequence_pooling cmake

* [phi] mv sequence_pooling_test

* [phi] fix grad sig

* [phi] fix sequence_pool is_test error

* [phi] fix sequence_pooling gpu include

* [phi] mv to impl

* [phi] fix SequencePoolFunctor cu include

* [phi] modify out max_index int32_t

* [phi] add pooltype mapping determine

* [phi] fix sequence_pool_sig

* [phi] fix sequence_pool_sig sum

* [phi] try ci

* [phi] fix max_index optional

fe053396

Y

scale trt converter support int64 (#53388) · 182b6f83
由 Yuanle Liu 提交于 4月 27, 2023

182b6f83
Z

xpu quant weight only (#53306) · 1c97aa69
由 zhupengyang 提交于 4月 27, 2023

1c97aa69
W
[Dy2St]Get grad names when call append backward to fix high order gradient (#53250) · 2d17df97
由 WangZhen 提交于 4月 27, 2023
```
[Dy2St]Get grad names when call append backward to fix high order gradient (#53250)
```
2d17df97
W

set sync_param default true (#53335) · 421f56a8
由 wuhuachaocoding 提交于 4月 27, 2023

421f56a8
H

[XPU] c_sync_calc_stream support more types (#53389) · 9c1eb98a
由 houj04 提交于 4月 27, 2023

9c1eb98a

[static op generation] triangular_solve (#53328) · 18968e7e

由 gouzil 提交于 4月 27, 2023

* [static op generation] triangular_solve

* [phi] mv triangular_solve_grad to static_backward

* [phi] fix import

* [phi] mv to ops.yaml、 backward.yaml

* fix forward attr

* [phi] fix triangular_solve_grad args

18968e7e

H
[CINN Support 0D-Tensor] CINN supports 0D-Tensor with trick temporarily (#53382) · 9ab14865
由 HongyuJia 提交于 4月 27, 2023
```
* [CINN Support 0D-Tensor] CINN supports 0D-Tensor with trick temporarily

* Add unittest
```
9ab14865
W

autogen code support for max_pool[2,3]_with_index op (#53359) · cf6cbc34
由 Wang Xin 提交于 4月 27, 2023

cf6cbc34

Move fused feedforward (#53166) · 25b4ba7f

由 Sonder 提交于 4月 27, 2023

* trans fused_feedward Compute function to phi

* add register info

* remove maxfunctor

* move fused feedward to phi

* remove sig file

* remove fliud include

* add include

* add include

* add sig file

* add output register info

* fix sig file

* Update fused_feedforward_sig.cc

* fix grad kernel

* update output register info

* fix

* open fused_feedforward static build

* add optional and fix code style

* fix output info for fused attention

* add optional param

* merge

25b4ba7f

Z
[AMP] support OD level and skip dynamic loss scaling for bf16 (#53289) · 18e9dcdc
由 Zhang Ting 提交于 4月 27, 2023
```
* support OD level and skip dynamic loss scaling for bf16
```
18e9dcdc

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功