提交 · d338b2f8097a1ff9696d98afc057a9cf3ece4048 · PaddlePaddle / Paddle

05 6月, 2023 7 次提交
- W
  
  [bug fix] group norm backward (#54341) · d338b2f8
  由 wangzhen38 提交于 6月 05, 2023
  
  d338b2f8
- H
  
  [XPU] fix unittest of shape op. (#54323) · f55eb06f
  由 houj04 提交于 6月 05, 2023
  
  f55eb06f
- U
  
  Add macro SPCONV_WITH_CUTLASS (#54274) · e7a38f15
  由 umiswing 提交于 6月 05, 2023
  
  e7a38f15
- H
  Support code generation for op conv2d_transpose, conv3d_transpose,... · 1075d35d
  由 huangjiyi 提交于 6月 05, 2023
```
Support code generation for op conv2d_transpose, conv3d_transpose, depthwise_conv2d_transpose (#54242)
```
  1075d35d
- H
  Support code generation for op multiclass_nms3 (#54272) · 587f66ee
  由 huangjiyi 提交于 6月 05, 2023
```
* update

* update eager_gen

* update

* rm intermediate
```
  587f66ee
- A
  optimize logsumexp in small data scale (#52952) · 93e1bb98
  由 Asthestarsfalll 提交于 6月 05, 2023
```
* optimize logsumexp in small data scale

* fix

* fix

* add #pragma once

* swith to use aligned_vector and support arbitrarily shape

* fix store

* fix store

* refine for special cases

* try

* fix

* update

* fix

* fix all_reduce

* try

* fix rocm bug

* fix rocm bug

* fix rocm bug

* fix rocm bug

* fix rocm bug

* fix rocm bug

* fix rocm bug

* fix rocm bug
```
  93e1bb98
- C
  
  polish tensor comment, test=document_fix (#54298) · 0a0fbb1a
  由 Chen Weihang 提交于 6月 05, 2023
  
  0a0fbb1a
03 6月, 2023 2 次提交
- R
  support auto generate for static op reduce_sum (#54304) · 4f848aa9
  由 RedContritio 提交于 6月 03, 2023
```
* remove reduce_sum_op.h

* support auto generate for static op reduce_sum

* remove reduce_sum_op in CMakeLists.txt
```
  4f848aa9
- S
  
  【Hackathon 4th No.29】为 Paddle 新增 paddle.sparse.slice 稀疏 API (#53794) · d71baff6
  由 Scotty 提交于 6月 03, 2023
  
  d71baff6
02 6月, 2023 14 次提交
- K
  [IR] Refine some IR code (#54303) · 1f82bc37
  由 kangguangli 提交于 6月 02, 2023
```
* add vector type support for program translator

* polish

* support basic attribute type

* resolve conflicts

* add verify for combine/slice and unittests

* polish

* support more type in attribute translator

* modify by reviews

* fix merge mistakes

* refine code

* refine code

* add interface

* fix: op name normalization

* fix typo

* refactor input translator

* fix merge conflicts

* fix op normalizer bug

* refactor attribute translator

* fix bug

* refactor output translator

* fix typo

* fix

* fix approval error

* fix coverage

* fix op_compat parser

* fix merge conflicts

* fix merge conflicts

* fix merge conflicts

* fix merge conflicts

* fix merge conflicts

* revert some changes

---------
Co-authored-by: Nzhangbo9674 <zhangbo54@baidu.com>
```
  1f82bc37
- R
  
  fix typo (#54299) · 06304ade
  由 RedContritio 提交于 6月 02, 2023
  
  06304ade
- R
  support auto generate for static op reduce_any (#54284) · 877948fa
  由 RedContritio 提交于 6月 02, 2023
```
* decouple reduce_any_op.h and reduce_op.h from reduce_any_op.cc

* support auto generate for static op reduce_any
```
  877948fa
- R
  
  support auto generate for static op reduce_max (#54286) · 1e34f9e1
  由 RedContritio 提交于 6月 02, 2023
  
  1e34f9e1
- R
  
  support auto generate for activation_op swish (#53983) · d6c8ca82
  由 RedContritio 提交于 6月 02, 2023
  
  d6c8ca82
- Y
  [BugFix]Fix TypeInfo errors in MacOS (#54279) · 4f56e7c2
  由 YuanRisheng 提交于 6月 02, 2023
```
* fix mac typeinfo bugs

* add file

* move code to cc

* fix compile bugs
```
  4f56e7c2
- D
  【PaddlePaddle Hackathon 4】No.56 :add fp and bf16 for bernoulli (#54232) · 85d5f26d
  由 Difer 提交于 6月 02, 2023
```
* add fp&bf16 bernoulli

* add check_dtype & fix error

* fix rocm error
```
  85d5f26d
- W
  
  [XPU]Add yolo box fuse pass && kernel (#54163) · a087b9cb
  由 wz1qqx 提交于 6月 02, 2023
  
  a087b9cb
- H
  floor div support int8/int16/int32/int64/uint8/float32/float64/bfloat16/float16 (#53854) · 6310419b
  由 Hui Zhang 提交于 6月 02, 2023
```
* floor div support float/double/bfloat16/float16

* add ut

* fix bug

* fix fft.ifftshift for floor_divide upgrade

* fix comment

* fix bugs

* fix bug
```
  6310419b
- Z
  Optimize perf of broadcast matmul (#54126) · 9f76d050
  由 Zhang Zheng 提交于 6月 02, 2023
```
* Optimize perf of broadcast matmul

* support more dtype
```
  9f76d050
- 傅
  
  add mixed bool and int index support for index_put (#54195) · 8fd4ef91
  由傅剑寒提交于 6月 02, 2023
  
  8fd4ef91
- Z
  [AMP] support master_grad for adam and momentum (#54240) · 703a64a3
  由 Zhang Ting 提交于 6月 02, 2023
```
* support master_grad for adam and momentum

Co-authored-by: zhangting_2017@163.com <zhangting2020>
```
  703a64a3
- W
  static graph autogen code for shape op (#54221) · f5342918
  由 Wang Xin 提交于 6月 02, 2023
```
* static graph autogen code for shape op

* fix onednn

* fix onednn
```
  f5342918
- X
  
  add_triple_grad rules (#54164) · c642aa17
  由 xiaoguoguo626807 提交于 6月 02, 2023
  
  c642aa17
01 6月, 2023 8 次提交
- Z
  [IR] Support static build function for op builder (#54197) · 4bd5b695
  由 zhangbo9674 提交于 6月 01, 2023
```
* add build

* add build

* refine code

* refine code

* refine code

* refine code

* refine interface

* fix bug

* fix bug

* fix bug

* refine yaml
```
  4bd5b695
- U
  
  [Sparse] Support sparse conv 2d. (#54158) · 4f25604e
  由 umiswing 提交于 6月 01, 2023
  
  4f25604e
- [Zero-Dim] OpTest support shape check and fix previous case problem (#54117) · d4451cb0
  由 zhouweiwei2014 提交于 6月 01, 2023
  
  d4451cb0
- R
  [ROCM] fix multihead_matmul (#54108) · effebd41
  由 ronnywang 提交于 6月 01, 2023
```
* [ROCM] fix multihead_matmul

* skip bf16 uts

* update
```
  effebd41
- W
  static graph autogen code for check_finite_and_unscale_ op (#54145) · 2186fe16
  由 Wang Xin 提交于 6月 01, 2023
```
* static graph autogen code for check_finite_and_unscale_ op

* bug fixed
```
  2186fe16
- Y
  
  fix xpu-kp bugs (#54234) · e8735ddf
  由 YuanRisheng 提交于 6月 01, 2023
  
  e8735ddf
- R
  
  support auto generate for static op reduce_amin (#54187) · 6e210f92
  由 RedContritio 提交于 6月 01, 2023
  
  6e210f92
- H
  Support static graph code generation for conv2d, conv3d, depthwise_conv2d (#54201) · f3eccb3f
  由 huangjiyi 提交于 6月 01, 2023
```
* update

* update cmake

* update

* update

* update

* update

* Revert "update cmake"

This reverts commit 1e1dc1b2bc9967b725201272607f939260070fd4.

* update

* update

* update

* update
```
  f3eccb3f
31 5月, 2023 3 次提交
- R
  support auto generate for static op reduce_amax (#54179) · 455a6735
  由 RedContritio 提交于 5月 31, 2023
```
* support auto generate for static op reduce_amax

* set reduce_amax attr 'axis' type as IntArray
```
  455a6735
- Y
  [BugFix]Fix inference static lib bugs (#54207) · 5f54a7fe
  由 YuanRisheng 提交于 5月 31, 2023
```
* fix inference static lib bugs

* add if for copy

* fix py3 bugs
```
  5f54a7fe
- C
  support activation prim op bf16 dtype (#54193) · cbeff5fc
  由 Charles-hit 提交于 5月 31, 2023
```
* support activation prim op bf16 dtype

* remove useless code
```
  cbeff5fc
30 5月, 2023 6 次提交

由 risemeup1 提交于 5月 30, 2023

* update_c++17

* update_c++17

* fix windows bug

* solve cirle depend

* solve cirle depend

* solve cirle depend

* solve cirle depend

* solve cirle depend

* fix windows bug

* fix compiler error

* fix compiler error

* update eigen3

* update eigen3

* update eigen3

* fix mac-py3 compiler error

* update C++17

* fix mac compiler error

* fix compile error

* fix coverage_compiler error

* fix coverage_ci_problem

* fix coverage_error

* fix_kunlun200 compile error

* fix kunlun200 compiler error

* fix compile error

* fix compiler error

* fix py3 failed test

* fix kunlun200 compiler error

* test

* fix test error

* fix test error

* fix test error

* test

* test

* fix mac py3 error

* fix mac py3 error

* fix mac py3 error

* fix test error

* fix test error

* fix compile error

* fix compile error

* fix compile error

* test

* test

* fix compiler error

* test

* test

* debug on ci

* fix compiler error

* fix compiler error

* test

* fix cinn compiler error

* test

* fix rocm cmpile error

* fix cinn and kunlun compile error

* update c++14

* Update flags.cmake

950b563b

softmax fwd: force vec size to 1 when dtype is float (#54183) · f5a3b427
由 shaojie_wang 提交于 5月 30, 2023
```
* softmax fwd: force vec size to 1 when dtype is float

* use 1024 as threshold to use cudnn
```
f5a3b427

[AMP] Reimplement check_nan_inf as check_numerics_kernel. (#52245) · 44bd5927

由 Yiqun Liu 提交于 5月 30, 2023

* Reimplement the check_nan_inf function as check_numerics kernel.

* Remove the cpu implemention to phi.

* Add ifdef for the including of omp.h.

* Move the use of FLAGS_check_nan_inf_level out of header file.

* Implement a common PrintAndThrowError function.

* Fix the error using of __NVCC__, which should be instead with __CUDA_ARCH__.

* Add dependency of phi.

* Polish codes and unittest.

44bd5927

L
add timer to log deps in executor (#54188) · 1ba1627d
由 Leo Chen 提交于 5月 30, 2023
```
* add timer to log deps

* rename flag

* add ut
```
1ba1627d
[BUG] Optimize GPU error message file search path (#54180) · ecda253a
由 zhouweiwei2014 提交于 5月 30, 2023

ecda253a

[IR] Refine OP auto code gen (#54186) · 14425c06

由 zhangbo9674 提交于 5月 30, 2023

* refine auto gen

* refine code

* refine code

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

14425c06

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功