Commits · 2ddd047346cf6fb99a13f2cdd218a0b8764df646 · PaddlePaddle / Paddle

12 Jun, 2023 3 commits

log/Log10/log2/log1p support int32/int64/float16/bfloat16 forward (#54089) · 2ddd0473

Committed by Hui Zhang on Jun 12, 2023

* fix for log xxx

* add int32/int64 for cpu/gpu; add float16/bfloat16 for cpu forward

* fix docstring

* fix bug

* fix bugs

* fix bugs

* fix bugs

* fix bugs

* fix bug

* using cast

* fix test

* fix api

* fix other bugs

* fix ci bug for not using dygraph guard

* add bfloat16 test

* fix ut

* bf16

2ddd0473

[inference]conv_fusion support bias's rank equal to input's rank (#54477) · 03dbdbd1
Committed by Zhang Jun on Jun 12, 2023
```
* support bias's rank equal to input's rank
```
03dbdbd1

bump black to 2023 style (#54523) · 44e0393c
Committed by Nyakku Shigure on Jun 12, 2023

44e0393c

09 Jun, 2023 5 commits
- support add(x_float32, bfloa16_) or add(x_float32, y_float16) (#54415) · b3232936
  Committed by pangengzheng on Jun 09, 2023
```
* support add(x_float32, bfloa16_) or add(x_float32, y_float16)

* polish

* fix test
```
  b3232936
- Auto generate code for elementwise_max (#54412) · 57564bdf
  Committed by lzydev on Jun 09, 2023
```
* auto generate code for elementwise_max

* auto generate code for elementwise_max

* fix composite ops

* fix bug of fmax
```
  57564bdf
- fix reduce.h (#54476) · 2dcc622d
  Committed by jiangfan06 on Jun 09, 2023
  
  2dcc622d
- [IR] Refine IR builder and throw methods (#54396) · 3a452e4e
  Committed by zhangbo9674 on Jun 09, 2023
```
* refine code

* refine code

* refine code

* refine code

* refine code

* refine code

* refine code

* fix bug

* refine code

* refine code

* refine code

* refine code

* refine code

* delete unused code

* delete unused code

* refine code
```
  3a452e4e
- [XPU] add registration of SplitWithNumKernel with int64. (#54478) · 4a77cf53
  Committed by houj04 on Jun 09, 2023
  
  4a77cf53
08 Jun, 2023 3 commits

[XPU]add fp16 kernels (#54410) · fd9c555c
Committed by wz1qqx on Jun 08, 2023

fd9c555c

xpu support auto growth allocator (#54121) · 168fac13
Committed by ykkk2333 on Jun 08, 2023

168fac13

[AMP] Add check_numerics API. (#54301) · a5444592

Committed by Yiqun Liu on Jun 08, 2023

* Add outputs to check_numerics_kernel.

* Add check_numerics to yaml.

* Add API and unittest.

* Add check_nan_inf_level as argument of check_numerics_kernel.

* Add more unittests.

* Fix static API implementation and unittest.

* Move the implementation of check_numerics to paddle.amp.

* Fix import error.

a5444592

07 Jun, 2023 1 commit
- support some prim ops bf16 dtype (#54399) · 791963ab
  Committed by Charles-hit on Jun 07, 2023
  
  791963ab
06 Jun, 2023 2 commits
- [XPU] support approximate for gelu activation. (#54376) · 87d24878
  Committed by houj04 on Jun 06, 2023
  
  87d24878
- Fix compilation error by using thrust (#54364) · 3ea7d577
  Committed by Zhang Zheng on Jun 06, 2023
```
* Fix compilation error by using thrust

* fix
```
  3ea7d577
05 Jun, 2023 7 commits

【Hackathon 4 No.19】Add polygamma API to Paddle (#53791) · ed604569

Committed by PommesPeter on Jun 05, 2023

* feat: added polygamma init code

* feat: added polygamma unittest code

* test: added more test cases

* refactor: added forward impl

* refactor: added backward impl

* test: updated cases

* refactor: updated test cases

* refactor: added more case and fixed some bugs

* test: updated ref func

* refactor: updated code style

* refactor: move the code

* refactor: updated test

* refactor: updated test

* docs: updated en doc
Co-authored-by: Nzachary sun <70642955+sunzhongkai588@users.noreply.github.com>

* docs: updated math eq

---------
Co-authored-by: Nzachary sun <70642955+sunzhongkai588@users.noreply.github.com>

ed604569

[static op generation] pool2d, pool3d (#54070) · 30881647
Committed by gouzil on Jun 05, 2023

30881647

[bug fix] group norm backward (#54341) · d338b2f8
Committed by wangzhen38 on Jun 05, 2023

d338b2f8

[XPU] fix unittest of shape op. (#54323) · f55eb06f
Committed by houj04 on Jun 05, 2023

f55eb06f

Add macro SPCONV_WITH_CUTLASS (#54274) · e7a38f15
Committed by umiswing on Jun 05, 2023

e7a38f15

Support code generation for op conv2d_transpose, conv3d_transpose,... · 1075d35d
Committed by huangjiyi on Jun 05, 2023
```
Support code generation for op conv2d_transpose, conv3d_transpose, depthwise_conv2d_transpose (#54242)
```
1075d35d

optimize logsumexp in small data scale (#52952) · 93e1bb98

Committed by Asthestarsfalll on Jun 05, 2023

* optimize logsumexp in small data scale

* fix

* fix

* add #pragma once

* swith to use aligned_vector and support arbitrarily shape

* fix store

* fix store

* refine for special cases

* try

* fix

* update

* fix

* fix all_reduce

* try

* fix rocm bug

* fix rocm bug

* fix rocm bug

* fix rocm bug

* fix rocm bug

* fix rocm bug

* fix rocm bug

* fix rocm bug

93e1bb98

03 Jun, 2023 1 commit
- 【Hackathon 4th No.29】Add the paddle.sparse.slice sparse API to Paddle (#53794) · d71baff6
  Committed by Scotty on Jun 03, 2023
  
  d71baff6
02 Jun, 2023 8 commits
- fix typo (#54299) · 06304ade
  Committed by RedContritio on Jun 02, 2023
  
  06304ade
- 【PaddlePaddle Hackathon 4】No.56 :add fp and bf16 for bernoulli (#54232) · 85d5f26d
  Committed by Difer on Jun 02, 2023
```
* add fp&bf16 bernoulli

* add check_dtype & fix error

* fix rocm error
```
  85d5f26d
- [XPU]Add yolo box fuse pass && kernel (#54163) · a087b9cb
  Committed by wz1qqx on Jun 02, 2023
  
  a087b9cb
- floor div support int8/int16/int32/int64/uint8/float32/float64/bfloat16/float16 (#53854) · 6310419b
  Committed by Hui Zhang on Jun 02, 2023
```
* floor div support float/double/bfloat16/float16

* add ut

* fix bug

* fix fft.ifftshift for floor_divide upgrade

* fix comment

* fix bugs

* fix bug
```
  6310419b
- Optimize perf of broadcast matmul (#54126) · 9f76d050
  Committed by Zhang Zheng on Jun 02, 2023
```
* Optimize perf of broadcast matmul

* support more dtype
```
  9f76d050
- add mixed bool and int index support for index_put (#54195) · 8fd4ef91
  Committed by 傅剑寒 on Jun 02, 2023
  
  8fd4ef91
- [AMP] support master_grad for adam and momentum (#54240) · 703a64a3
  Committed by Zhang Ting on Jun 02, 2023
```
* support master_grad for adam and momentum

Co-authored-by: zhangting_2017@163.com <zhangting2020>
```
  703a64a3
- static graph autogen code for shape op (#54221) · f5342918
  Committed by Wang Xin on Jun 02, 2023
```
* static graph autogen code for shape op

* fix onednn

* fix onednn
```
  f5342918
01 Jun, 2023 5 commits
- [Sparse] Support sparse conv 2d. (#54158) · 4f25604e
  Committed by umiswing on Jun 01, 2023
  
  4f25604e
- [Zero-Dim] OpTest support shape check and fix previous case problem (#54117) · d4451cb0
  Committed by zhouweiwei2014 on Jun 01, 2023
  
  d4451cb0
- [ROCM] fix multihead_matmul (#54108) · effebd41
  Committed by ronnywang on Jun 01, 2023
```
* [ROCM] fix multihead_matmul

* skip bf16 uts

* update
```
  effebd41
- fix xpu-kp bugs (#54234) · e8735ddf
  Committed by YuanRisheng on Jun 01, 2023
  
  e8735ddf
- Support static graph code generation for conv2d, conv3d, depthwise_conv2d (#54201) · f3eccb3f
  Committed by huangjiyi on Jun 01, 2023
```
* update

* update cmake

* update

* update

* update

* update

* Revert "update cmake"

This reverts commit 1e1dc1b2bc9967b725201272607f939260070fd4.

* update

* update

* update

* update
```
  f3eccb3f
31 May, 2023 1 commit
- support activation prim op bf16 dtype (#54193) · cbeff5fc
  Committed by Charles-hit on May 31, 2023
```
* support activation prim op bf16 dtype

* remove useless code
```
  cbeff5fc
30 May, 2023 4 commits

update_c++17 (#53892) · 950b563b

Committed by risemeup1 on May 30, 2023

* update_c++17

* update_c++17

* fix windows bug

* solve cirle depend

* solve cirle depend

* solve cirle depend

* solve cirle depend

* solve cirle depend

* fix windows bug

* fix compiler error

* fix compiler error

* update eigen3

* update eigen3

* update eigen3

* fix mac-py3 compiler error

* update C++17

* fix mac compiler error

* fix compile error

* fix coverage_compiler error

* fix coverage_ci_problem

* fix coverage_error

* fix_kunlun200 compile error

* fix kunlun200 compiler error

* fix compile error

* fix compiler error

* fix py3 failed test

* fix kunlun200 compiler error

* test

* fix test error

* fix test error

* fix test error

* test

* test

* fix mac py3 error

* fix mac py3 error

* fix mac py3 error

* fix test error

* fix test error

* fix compile error

* fix compile error

* fix compile error

* test

* test

* fix compiler error

* test

* test

* debug on ci

* fix compiler error

* fix compiler error

* test

* fix cinn compiler error

* test

* fix rocm cmpile error

* fix cinn and kunlun compile error

* update c++14

* Update flags.cmake

950b563b

softmax fwd: force vec size to 1 when dtype is float (#54183) · f5a3b427
Committed by shaojie_wang on May 30, 2023
```
* softmax fwd: force vec size to 1 when dtype is float

* use 1024 as threshold to use cudnn
```
f5a3b427

[AMP] Reimplement check_nan_inf as check_numerics_kernel. (#52245) · 44bd5927

Committed by Yiqun Liu on May 30, 2023

* Reimplement the check_nan_inf function as check_numerics kernel.

* Remove the cpu implemention to phi.

* Add ifdef for the including of omp.h.

* Move the use of FLAGS_check_nan_inf_level out of header file.

* Implement a common PrintAndThrowError function.

* Fix the error using of __NVCC__, which should be instead with __CUDA_ARCH__.

* Add dependency of phi.

* Polish codes and unittest.

44bd5927

[XPU] using xpu::normal in gaussian kernel. (#54176) · 060e4fab
Committed by houj04 on May 30, 2023

060e4fab

PaddlePaddle / Paddle synced successfully about 1 year ago