提交 · 5664ea26a0c2ed61bca5857877a3bc6ef0a1d01c · PaddlePaddle / Paddle

13 4月, 2023 2 次提交

[enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h (#52651) · 5664ea26

由 HongyuJia 提交于 4月 13, 2023

* [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h

* Add logging.h for profiler.cc

* Add logging.h for gloo_utils.h

* Add logging.h for addmm_kernel_impl.h

* Add logging.h for addmm_grad_kernel_impl.h

* Add logging.h for p_send_kernel.cu

* Add logging.h for determinant_grad_kernel_impl.h

* Add logging.h for p_recv_kernel.cu

* Add logging.h for elementwise_grad_base.h

* Add logging.h for transfer_layout_kernel.cc

* Add logging.h for eigvals_kernel.cc and index_select_impl.h

* Add logging.h for all files in kernel directory

* Add logging.h for xpu_info.cc

* Add logging.h for xpu

5664ea26

Z

rename_bilinear_tensor_op (#52745) · eb93b5c9
由 zhangyuqin1998 提交于 4月 13, 2023

eb93b5c9

12 4月, 2023 3 次提交

Z
Optimize performance of unique kernel (#52736) · 8cbeefea
由 Zhang Zheng 提交于 4月 12, 2023
```
* Optimize performance of unique kernel

* fix ci
```
8cbeefea

[AMP OP&Test] add fp16/bf16 unittest for pool2d op (#52288) · f9b155f9

由 Wei Shengyu 提交于 4月 12, 2023

* add bf16 support and bf16/fp16 unittest for pool2d

* add include files

* dbg

* reformat

* reformat

* modify code according to review comment

* remove duplicate code

* remove dup code

* remove useless include

* dbg

f9b155f9

[AMP OP&Test] support bf16 for batch norm (#52407) · 523f8a26

由 Guoxia Wang 提交于 4月 12, 2023

* [AMP OP&Test] support bf16 for batchnorm

* codestyle

* Update batch_norm_grad_kernel.cu

* Update batch_norm_kernel.cu

* fix codestyle

* fix

* fix

* fix

* fix

* fix

* Update batch_norm_kernel.cc

523f8a26

11 4月, 2023 3 次提交
- W
  [AMP OP&Test]Add fp16/bf16 support isnan/isfinite/isinf op (#52259) · aaf873b2
  由 WJJ1995 提交于 4月 11, 2023
```
* add bfp16 test for isfinite

* fixed for ci

* deal with comments

* fixed test

* skip test in cpu

* deal with comments

* fixed for ci

* fixed testcase

* fixed for ci

* fixed for testcase
```
  aaf873b2
- L
  Add output defs for eigh kernel (#51362) · da0c7e14
  由 LinearTemporalLogic 提交于 4月 11, 2023
```
* Add output defs for eigh kernel

* fix

* update

* update

* fix

* fix
```
  da0c7e14
- T
  
  [AMP OP&Test] add bf16 fp16 type support for expand_v2_op and top_k_v2_op (#51263) · 5b09dd56
  由 Thomas Young 提交于 4月 11, 2023
  
  5b09dd56
10 4月, 2023 8 次提交

D
【Hackathon No57】 add fp16 & bf16 for flip, fp16 for gaussian (#52380) · 2b0fffc2
由 Difer 提交于 4月 10, 2023
```
* add_fp_bf_for_flip_gaussian_random

* forget convert uint

* fix some error

* fix some error
```
2b0fffc2
C

【Hackathon4 No58】fix exponential and pad (#51300) · 3ee2b237
由 cyberslack_lee 提交于 4月 10, 2023

3ee2b237

[enforce.h Decouple gflags.h] Move gflags.h from enforce.h to enforce.cc (#52573) · 3c0b1795

由 HongyuJia 提交于 4月 10, 2023

* [enforce.h Decouple gflags.h] Move gflags.h from enforce.h to enforce.cc

* Add gflags.h for other files

* Add gflags.h for other files

* Add gflags.h for blas_impl.hip.h

* Add gflags.h for miopen_helper.h

3c0b1795

[AMP OP&Test] Add fp16 and bf16 test to activation (#52521) · 6bd5fd75

由 Vvsmile 提交于 4月 10, 2023

* adjust defalut tolerance of output and grad

* fix a bug in the grad of OpTest

* fix the type of setting defalut value in optest, both forward and
backward

* add defalut

* fix test_sum_op

* adjust tolerance

* fix the tolerance of eager

* add bf16 and fp16 to the activation tests

* remove some fixs

* fix activation

* fix fp16

* fix gelu

* fix the activation tests

* add bfloat16 specialization to singrad and cosgrad

* fix bugs

* fix bugs

* add unittest

* add skip

* add fp/bf to rrelu/rrelu_grad

* git add rrelu

* fix bugs

6bd5fd75

【AMP OP&Test】instance_norm fp16 and bf16 support. (#52241) · 7c98abd9

由 qizhaoaoe 提交于 4月 10, 2023

* add fp16 and bf16 support for instance_norm

* fix /= operator which not support bf16

* fix instance_norm_grad kernel and unittests.

* fix fp32 unittests.

* fix instance_norm_kernel and unittests.

* fix instance_norm_grad_kernel and unittest threshold.

* add fp16/bf16 for instance_norm_grad_grad op.

* add bf16 dtype check.

* fix conflicts.

* fix cpu support for fp32 op and fix type in instance_norm_grad_kernel.

* fix type in instance_norm_kernel.

* fix bf16 outputs in unittests and refine codes.

* fix dx computation.

* delete unuseful params and head including.

* add fp16/bf16 for static graph.

* fix device condiction for instance_norm op.

* fix instance_norm_grad_grad and bf16 op tests.

* fix op_test to support grad of bf16 can be compared with fp32.

* remove updates.

* add self-defined grad.

7c98abd9

【PaddlePaddle Hackathon 4 No.36】为 Paddle 优化 tile op 在 GPU 上的计算性能 (#52482) · 61fe2198

由 Zero Rains 提交于 4月 10, 2023

* fix divide zero bug for softmax_with_cross_entropy

* change the single test way

* can run but slow. the most important is that I do not know why it slow

* remove some useless commet

* change the copyright to correct

* remove some useless change

* if repeat_times == 1, we will not use BroadcastKernel

61fe2198

C

support auto generate for eigvalsh (#52687) · 93404a61
由 cyberslack_lee 提交于 4月 10, 2023

93404a61
A
【PaddlePaddle Hackathon 4 No.44】为 Paddle 优化 logsumexp op 在 GPU 上的计算性能 (#52509) · 0e776965
由 Asthestarsfalll 提交于 4月 10, 2023
```
* Optimize the performance of logsumexp

* Support zero-dim tensor
```
0e776965

09 4月, 2023 1 次提交
- add bf16 for some ops in static mode (#51582) · 6cd095fc
  由 shaojie_wang 提交于 4月 08, 2023
  
  6cd095fc
07 4月, 2023 1 次提交
- add distributed p_send/p_recv/reduce_scatter operator (#51858) · 2b12a117
  由 TaoTao Li 提交于 4月 07, 2023
```
fix merge conflicts
```
  2b12a117
06 4月, 2023 4 次提交
- Z
  Rename conv2d transpose grad grad (#52371) · 49bbd466
  由 zhangyuqin1998 提交于 4月 06, 2023
```
* Rename conv2d transpose grad grad

* fix
```
  49bbd466
- C
  
  fix backend bug (#52526) · 380a9bf7
  由 Chitsing KUI 提交于 4月 06, 2023
  
  380a9bf7
- S
  Fix flash attention bug (#52551) · 8ac5a6b6
  由 sneaxiy 提交于 4月 06, 2023
```
* fix flash attn

* fix another API
```
  8ac5a6b6
- L
  【PaddlePaddle Hackathon 4】No.63 add fp16 and bf16 for eye and frame (#51819) · ae10133a
  由 LoneRanger 提交于 4月 06, 2023
```
* add fp16 and bf16 for eye and frame

* fix bug

* fix bug

* fix bug

* Update test_frame_op.py

fix code style

* fix bug

* fix bug
```
  ae10133a
04 4月, 2023 3 次提交

C
【Hackathon No.62】增加pool3d算子BF16及单测，lgamma, masked_select FP16/BF16算子单测 (#51837) · b0dbf9fe
由 chenxujun 提交于 4月 04, 2023
```
* Add pool3d lgamma masked_select tests

* Fix code
```
b0dbf9fe

Improve new executor static build (#51149) · 5bac67d4

由 Ruibiao Chen 提交于 4月 04, 2023

* Improve new executor static build

* Skip GC for static build

* Skip infershape for static build

* Handle read_op

* Add fused_attention to OpsWithFluidKernelNeedMoveToPhi

* Fix argsort typos

* Add sequence_pool to OpsWithFluidKernelNeedMoveToPhi

* Fix skip share lod errors

* Fix errors for adam

* Fix errors for eigvals, memcpy and fake_quantize

* Add static_build.cc

* Add black list

* Fix CI errors

* Fix CI errors

* Fix CI errors

* Fix TensorArray

* Fix TensorArray

* Add update_loss_scaling to OpsNeedSetOutputDtypeWhenRegisterPhiKernel

* Fix copy

* Fix errors

* Fix momentum

* Skip mkldnn

* Fix CI errors

* Fix c_sync_calc_stream_op

* Fix CINN

* Fix while op

* All CI pass, disable FLAGS to merge code, enable it after more tests in future

* Add UTs

* Fix typos

* Fix typos

* Add mkldnn UT

* Remove mkldnn test

* Fix typos

* Fix dist test

* Fix typos

* Fix CI errors

* Fix CI errors

* Add UTs

* Fix typos

* Fix typos

* Add sparse tests

* ToComplexType -> ToComplex

* Add test_matmul_op_static_build to disable_win_inference_test

5bac67d4

Z
rename_bilinear_tensor_product (#52375) · 34069c46
由 zhangyuqin1998 提交于 4月 04, 2023
```
* rename_bilinear_tensor_product

* fix
```
34069c46

03 4月, 2023 4 次提交
- D
  【Hackathon No.50】为 Paddle lerp 算子实现 float16 数据类型支持 (#50925) · a2cbc81a
  由 denglianbin 提交于 4月 03, 2023
```
* finish task

* fix error

* pre-commit fix code style

* add unittest.

* change unittest.

* delete unittest case.
```
  a2cbc81a
- C
  
  Add kron float16/bfloat16, unbind float16 tests (#52413) · f547ee92
  由 chenxujun 提交于 4月 03, 2023
  
  f547ee92
- L
  【PaddlePaddle Hackathon 4】No.56 : add fp16 test and bf16 test for diag,... · 0e3f7ab1
  由 LoneRanger 提交于 4月 03, 2023
```
【PaddlePaddle Hackathon 4】No.56 : add fp16 test and bf16 test for diag, diagonal, fill and fill_diagonal_tensor (#51649)
```
  0e3f7ab1
- Z
  
  rename_batch_norm_grad_grad (#52372) · cf7c431f
  由 zhangyuqin1998 提交于 4月 03, 2023
  
  cf7c431f
31 3月, 2023 2 次提交
- Z
  
  rename_conv2d_grad_grad (#52374) · ea5e1ebb
  由 zhangyuqin1998 提交于 3月 31, 2023
  
  ea5e1ebb
- Y
  [PHI Decoupling]Remove distribute header (#52202) · e923642e
  由 YuanRisheng 提交于 3月 31, 2023
```
* remove distribute

* fix py3 bugs

* fix gpu-ps bugs

* fix compile bugs

* fix unittest bugs
```
  e923642e
30 3月, 2023 3 次提交
- R
  
  [AMP OP&Test] add fp16 test for linspace (#52161) · 40b30f50
  由 Roc 提交于 3月 30, 2023
  
  40b30f50
- Y
  [AMP OP&Test] Register FP16 for multinomial. (#52107) · 7788b65e
  由 yunyaoXYY 提交于 3月 30, 2023
```
* add FP16 for multinomial

* fix input data

* update code

* fix FP16

* fix code
```
  7788b65e
- W
  [AMP OP&Test] Strided slice fp16 and bf16 unitest (#52220) · 5cdd9f2c
  由 Wang Xinyu 提交于 3月 30, 2023
```
* stride slice fp16 and bf16 unitest

* fix code style

* add self.dtype
```
  5cdd9f2c
29 3月, 2023 2 次提交
- H
  Add output defines for graph_sample_neighbors and group_norm (#51503) · 37bd7e78
  由 hjyp 提交于 3月 29, 2023
```
* regist output type for GraphSampleNeighbors and GroupNorm

* Update return type

* fix return type

* update

* fix detail
```
  37bd7e78
- Y
  
  [AMP OP&Test]label_smooth op fp/bf16 support (#52193) · c4b6d1ae
  由 YuhangLi 提交于 3月 29, 2023
  
  c4b6d1ae
28 3月, 2023 3 次提交

H
fix int8 support for full kernel (#52194) · c145fd1e
由 houj04 提交于 3月 28, 2023
```
* fix int8 support for full kernel

* fix ut.
```
c145fd1e
H

[API/OP] Support FP16/BF16 in paddle.nonzero API/OP (#51640) · 2e92357b
由 Haohongxiang 提交于 3月 28, 2023

2e92357b

[AMP OP&Test] add fp16/bf16 unittest for conv ops (#51787) · ad5536eb

由 wangxinxin08 提交于 3月 28, 2023

* add unittest for conv2d/depthwise_conv2d/conv2d_transpose

* add bf16 for DWConv and ConvTranspose

* fix unitest of conv2d_transpose

* modify DWConv2d op and unittest

* fix unittest of conv2d_transpose_bf16

* modify unittest name according to review

* modify atol of DWConv2D unittest

ad5536eb

27 3月, 2023 1 次提交
- L
  unbind support bool dtype (#52080) · 553630aa
  由 Leo Chen 提交于 3月 27, 2023
```
* unbind support bool dtype

* replace np.array_equal
```
  553630aa

PaddlePaddle / Paddle 接近 2 年 前同步成功

PaddlePaddle / Paddle
接近 2 年前同步成功