提交 · e041ffca8df68afe2cb914416989079cfaf7faf9 · PaddlePaddle / Paddle

11 4月, 2023 4 次提交
- J
  remove paddle/infrt/ (#52719) · e041ffca
  由 jjyaoao 提交于 4月 11, 2023
```
* remove paddle/infrt/

* delete .lit_test_times.txt
```
  e041ffca
- 石
  
  check the precision of cast operator test (#52317) · 0cb0f70a
  由石晓伟提交于 4月 11, 2023
  
  0cb0f70a
- C
  
  fix c_embedding bug (#52742) · 4a790cba
  由 Chitsing KUI 提交于 4月 11, 2023
  
  4a790cba
- A
  
  [Dy2St]Ignore os as not to_static module (#52715) · 94a8177f
  由 Aurelius84 提交于 4月 11, 2023
  
  94a8177f
10 4月, 2023 36 次提交

L
Autogen segment_pool (#52538) · 1bc00955
由 lzydev 提交于 4月 10, 2023
```
* autogen segment_pool

* delete legacy_dygraph about segment_pool
```
1bc00955
J
remove legacy profiler (#52624) · 0b89cb1d
由 JYChen 提交于 4月 10, 2023
```
* remove legacy profiler

* rm test_parallel_executor_profiler
```
0b89cb1d
H
[CustomOP unittest] Polish unit test, phi->custom (#52670) · bc9956cc
由 HongyuJia 提交于 4月 10, 2023
```
* [CustomOP unittest] Polish unit test, phi->custom

* Change phi->custom in custom_linear_op.cc
```
bc9956cc
R

fix cuda compule error (#52654) · 1ad943dd
由 risemeup1 提交于 4月 10, 2023

1ad943dd
J
delete paddle/fluid/operators/*_npu.* (#52678) · a7707efb
由 jjyaoao 提交于 4月 10, 2023
```
* delete paddle/fluid/operators/*_npu.*

* try pass CI

* try pass CI
```
a7707efb
D
【Hackathon No57】 add fp16 & bf16 for flip, fp16 for gaussian (#52380) · 2b0fffc2
由 Difer 提交于 4月 10, 2023
```
* add_fp_bf_for_flip_gaussian_random

* forget convert uint

* fix some error

* fix some error
```
2b0fffc2
J
delete paddle/fluid/operators/amp/*_npu.* (#52673) · d7a1a178
由 jjyaoao 提交于 4月 10, 2023
```
* delete paddle/fluid/operators/*_npu.*

* try pass code-style
```
d7a1a178
J
[Auto Parallel] Randomness Control for Distributed Training (#52554) · 03afb41c
由 JZ-LIANG 提交于 4月 10, 2023
```
* unique id for mesh

* rng ctrl

* support dropout

* register op

* adopt for recompute

* update unitest

* support pp
```
03afb41c

【Hackathon No.16】add PoissonNLLLoss API (#51117) · 349a059d

由 LyndonKong 提交于 4月 10, 2023

* add PoissonNLLLoss API

* update unittests

* Fix poisson_nll_loss init and update data type support

* remove type comment

* Update doc string

* Fix doc string erro

* Fix doc string math equation format

* Add float16 and bfloat16 support

349a059d

[AMP] support master_grad for amp training (#52235) · 4970dd65

由 Zhang Ting 提交于 4月 10, 2023

* support set master_grad

* move register_hook to auto_cast

* update unittest

* fix fp16 test

* update for review comments

4970dd65

X
[Paddle Inference] Support two inputs of multihead attention named qk_multihead. (#52455) · 6934ac79
由 xiaoxiaohehe001 提交于 4月 10, 2023
```
* Support two inputs of multihead attention named qk_multihead
```
6934ac79

[Opt Performance] Optimize custom operator performance (#52597) · 01247e33

由 HongyuJia 提交于 4月 10, 2023

* [Opt Performance] Optimize custom operator performance, reconstruct python API auto-gen, add cache and use const inference

* opt AutoGradMeta implementation

* remove profiler codes

* fix unit test

* change year, 2021->2023

* fix int64_t parse bug

01247e33

G
Autogen code bilinear_tensor_product (#52690) · 90c3bddf
由 gouzil 提交于 4月 10, 2023
```
* add autogen code bilinear_tensor_product

* [phi] rm cc file
```
90c3bddf
C

【Hackathon4 No58】fix exponential and pad (#51300) · 3ee2b237
由 cyberslack_lee 提交于 4月 10, 2023

3ee2b237
L
Autogen softmax_with_cross_entropy (#52515) · 351ccb63
由 lzydev 提交于 4月 10, 2023
```
* autogen softmax_with_cross_entropy

* fix error in softmax_with_cross_entropy version
```
351ccb63
H

[Approval For Phi] Add approval check for including third-party in phi headerfiles (#52653) · f9aaa1e4
由 HongyuJia 提交于 4月 10, 2023

f9aaa1e4

[StandaloneExe] Remove flag about Executor (#52671) · d6ee0a13

由 kangguangli 提交于 4月 10, 2023

* add strategy force_sequential_run

* remove flag

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

d6ee0a13

[enforce.h Decouple gflags.h] Move gflags.h from enforce.h to enforce.cc (#52573) · 3c0b1795

由 HongyuJia 提交于 4月 10, 2023

* [enforce.h Decouple gflags.h] Move gflags.h from enforce.h to enforce.cc

* Add gflags.h for other files

* Add gflags.h for other files

* Add gflags.h for blas_impl.hip.h

* Add gflags.h for miopen_helper.h

3c0b1795

[AMP OP&Test] Add fp16 and bf16 test to activation (#52521) · 6bd5fd75

由 Vvsmile 提交于 4月 10, 2023

* adjust defalut tolerance of output and grad

* fix a bug in the grad of OpTest

* fix the type of setting defalut value in optest, both forward and
backward

* add defalut

* fix test_sum_op

* adjust tolerance

* fix the tolerance of eager

* add bf16 and fp16 to the activation tests

* remove some fixs

* fix activation

* fix fp16

* fix gelu

* fix the activation tests

* add bfloat16 specialization to singrad and cosgrad

* fix bugs

* fix bugs

* add unittest

* add skip

* add fp/bf to rrelu/rrelu_grad

* git add rrelu

* fix bugs

6bd5fd75

W

update (#51297) · 70eaf9de
由 Wilber 提交于 4月 10, 2023

70eaf9de

【AMP OP&Test】instance_norm fp16 and bf16 support. (#52241) · 7c98abd9

由 qizhaoaoe 提交于 4月 10, 2023

* add fp16 and bf16 support for instance_norm

* fix /= operator which not support bf16

* fix instance_norm_grad kernel and unittests.

* fix fp32 unittests.

* fix instance_norm_kernel and unittests.

* fix instance_norm_grad_kernel and unittest threshold.

* add fp16/bf16 for instance_norm_grad_grad op.

* add bf16 dtype check.

* fix conflicts.

* fix cpu support for fp32 op and fix type in instance_norm_grad_kernel.

* fix type in instance_norm_kernel.

* fix bf16 outputs in unittests and refine codes.

* fix dx computation.

* delete unuseful params and head including.

* add fp16/bf16 for static graph.

* fix device condiction for instance_norm op.

* fix instance_norm_grad_grad and bf16 op tests.

* fix op_test to support grad of bf16 can be compared with fp32.

* remove updates.

* add self-defined grad.

7c98abd9

C

fix version message (#50318) · de44b3ac
由 chalsliu 提交于 4月 10, 2023

de44b3ac
W

add autogen code support for logcumsumexp op (#52682) · 891cf433
由 Wang Xin 提交于 4月 10, 2023

891cf433
H
register fluid kerenls to phi [part 7] (#52577) · aa35331f
由 huangjiyi 提交于 4月 10, 2023
```
* update

* fix bug

* fix ci-windows-openblas

* fix test_partial_sum_op

* fix codestyle
```
aa35331f
J

remove infrt V1.1 (#52672) · 6913feb0
由 jjyaoao 提交于 4月 10, 2023

6913feb0

【PaddlePaddle Hackathon 4 No.36】为 Paddle 优化 tile op 在 GPU 上的计算性能 (#52482) · 61fe2198

由 Zero Rains 提交于 4月 10, 2023

* fix divide zero bug for softmax_with_cross_entropy

* change the single test way

* can run but slow. the most important is that I do not know why it slow

* remove some useless commet

* change the copyright to correct

* remove some useless change

* if repeat_times == 1, we will not use BroadcastKernel

61fe2198

C

support auto generate for eigvalsh (#52687) · 93404a61
由 cyberslack_lee 提交于 4月 10, 2023

93404a61
A
【PaddlePaddle Hackathon 4 No.44】为 Paddle 优化 logsumexp op 在 GPU 上的计算性能 (#52509) · 0e776965
由 Asthestarsfalll 提交于 4月 10, 2023
```
* Optimize the performance of logsumexp

* Support zero-dim tensor
```
0e776965
L

support custom device on macos (#52620) · 575cafb4
由 lishicheng1996 提交于 4月 10, 2023

575cafb4
Z

add tensor_utils.h into all.h (#52600) · 3cbcaf1a
由 zyfncg 提交于 4月 10, 2023

3cbcaf1a

add autogen code support for affine_grid op (#52560) · 90280542

由 Wang Xin 提交于 4月 10, 2023

* add autogen code support for affine_grid op

* update op_compat.yaml for affine_grid

* update op_compat.yaml for affine_grid

* fix AffineGridGradInferMeta

* fix CI error

* update AffineGridInferMeta

90280542

R

[AMP OP & Test] Tril & Triu (#52411) · ec008a71
由 Roc 提交于 4月 10, 2023

ec008a71
W

Fix shape error when check no shape var type (#52629) · 648f58aa
由 WangZhen 提交于 4月 10, 2023

648f58aa

modify ~MatmulDescriptor and remove [-Wunused-function] (#52618) · 45f660dd

由 Galaxy1458 提交于 4月 10, 2023

* delete [-Wno-error=terminate], test=develop

* remove GPUps[-Wterminate],test=develop

* remove some -Wno-, test=develop

* modify ~MatmulDescriptor

* mess

45f660dd

H

[CustomOP unittest] Remove useless comment in custom operator's unit test (#52710) · 50ef5c5a
由 HongyuJia 提交于 4月 10, 2023

50ef5c5a
R

fix gcc12 error (#52646) · 66a4804b
由 risemeup1 提交于 4月 10, 2023

66a4804b

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功