提交 · a64a722a57573807414c018e2e62521e00f1c4fb · PaddlePaddle / Paddle

25 5月, 2023 5 次提交
- [Zero-Dim] support ReshapeTransform/nll_loss/matmul support 0D (#53828) · a64a722a
  由 zhouweiwei2014 提交于 5月 25, 2023
  
  a64a722a
- L
  add log for memory stats (#54083) · 5745a63f
  由 Leo Chen 提交于 5月 25, 2023
```
* add log for memory stats

* fix string_split in einsum
```
  5745a63f
- 张
  
  fix return bool (#54096) · ae360000
  由张春乔提交于 5月 25, 2023
  
  ae360000
- R
  
  Fix the custom pass with empty type (#54065) · 43d6bdca
  由 ronnywang 提交于 5月 25, 2023
  
  43d6bdca
- Z
  [Paddle Inference] Move down the transfer_layout (#52997) · 44044d80
  由 zhoutianzi666 提交于 5月 25, 2023
```
* add tranfer_elim
* transfer layout elimination
```
  44044d80
24 5月, 2023 14 次提交
- L
  
  suppport optional input for unbind_grad (#54085) · f2ed4011
  由 Leo Chen 提交于 5月 24, 2023
  
  f2ed4011
- Y
  Try to increase the repeat of autotune and fix the setting of allow_tf32_cublas. (#53622) · f4abe34b
  由 Yiqun Liu 提交于 5月 24, 2023
```
* Try to increase the repeat of autotune and fix the setting of allow_tf32_cublas.

* Change the repeat of cublaslt to 10.

* Use FLAGS_cublaslt_exhaustive_search_times as repeats.

* Fix compiling error on CI.

* Polish the key and simplify codes.
```
  f4abe34b
- Z
  
  move reduce raw kernels to legacy (#53961) · f488e3fd
  由 zhangyuqin1998 提交于 5月 24, 2023
  
  f488e3fd
- Z
  move raw kernels to legacy (#53913) · 48f5af99
  由 zhangyuqin1998 提交于 5月 24, 2023
```
* move raw kernels to legacy

* Update elementwise_add_kernel.cu

* fix
```
  48f5af99
- W
  
  [XPU]Add act add fuse (#53965) · f55f9d79
  由 wz1qqx 提交于 5月 24, 2023
  
  f55f9d79
- L
  Fixed the bug in the api.cc file where there was an inconsistency between the... · 75fc4bf0
  由 Leo Guo 提交于 5月 24, 2023
```
Fixed the bug in the api.cc file where there was an inconsistency between the specified type (std::vector<DenseTensor*>&) in the function pointer kernel_signature and the type of the phi kernel parameter (std::vector<DenseTensor*>) when the phi kernel is set to output as std::vector<DenseTensor*>. test=kunlun (#54053)
```
  75fc4bf0
- K
  [IR] Add vector type support for program translator (#54035) · 7e1dd338
  由 kangguangli 提交于 5月 24, 2023
```
* add vector type support for program translator

* polish

* resolve conflicts

* add verify for combine/slice and unittests

* polish
```
  7e1dd338
- 王
  
  [IR] fine-tune the interface of ir-context class. (#54031) · d73db135
  由王明冬提交于 5月 24, 2023
  
  d73db135
- X
  
  revert_tanh_double_grad (#54062) · e862753c
  由 xiaoguoguo626807 提交于 5月 24, 2023
  
  e862753c
- Z
  
  fix reshape error: (Repeated layer name: reshape (layers must have distinct names)) (#54072) · dc3c0de1
  由 Zhang Jun 提交于 5月 24, 2023
  
  dc3c0de1
- W
  Update lerp_kernel.cu (#54071) · a299797d
  由 Winters Montagne 提交于 5月 24, 2023
```
Removed unnecessary header files introduced
```
  a299797d
- L
  [XPU][PHI Kernels] bind bitwise_add kernel & add int32/int64 support to... · 0a06140f
  由 lijin23 提交于 5月 24, 2023
```
[XPU][PHI Kernels] bind bitwise_add kernel & add int32/int64 support to scatter_nd_add kernel for xpu (#54066)

* bind new kernels to xpu

* refine code

* fix bugs in unittest
```
  0a06140f
- F
  
  Fix bugs on ubuntu22.04 (#54057) · e419f434
  由 Frank Lin 提交于 5月 24, 2023
  
  e419f434
- H
  [XPU] add retry for unittests (#54044) · 6e5b9478
  由 houj04 提交于 5月 24, 2023
```
* [XPU] add retry for unittests

* revert debug code.
```
  6e5b9478
23 5月, 2023 21 次提交

Z
[AMP OP&Test] Support float16 in selu (#54030) · 6133ca4e
由 Zhang Zheng 提交于 5月 23, 2023
```
* [AMP OP&Test] Support float16 in selu

* fix
```
6133ca4e

[CINN] Enable check_cinn on some tests (#53710) · 97fe79a9

由 Fisher 提交于 5月 23, 2023

* Enable check_cinn on some tests

Tests: bitwise, compare, shape, assign_value, sum, expand_v2,
lookup_table, lookup_table_v2

* Enable more CINN tests

Tests with CINN: expand_v2, matmul, matmul_v2, mul, norm, one_hot_v2
Add target select in cinn_launch_op

* Revert test_mul_op

* Improve op unit tests

97fe79a9

L

fix nccl version (#53942) · 89da2f19
由 LiYuRio 提交于 5月 23, 2023

89da2f19
R

[PHI] bind nll_loss xpu kernel (#54043) · 73d706ce
由 RuohengMa 提交于 5月 23, 2023

73d706ce
W

fix 2 bug of eager (#54041) · 626ea800
由 wanghuancoder 提交于 5月 23, 2023

626ea800

[IR] Add op definition auto code generator (#54026) · b49a7e26

由 zhangbo9674 提交于 5月 23, 2023

* Use copy_if_different to avoid recompilation of generated cutlass
kernels.

* add program parameter dialect_interface

* fix op create bug

* add conv2d

* draft of paddle converter

* fix CI

* fix windows CI

* fix program destructor

* printer draft

* fix bug

* printer draft finish

* fix windows CI

* reserve inplace semantics

* revert program::destroy since no need to do topology sort

* revert

* modify by reviews

* commit printer and resnet50 related ops

* fix

* fix

* fix op definition

* refine op dyn_cast

* fix bug

* refine code

* refine code

* refine code

* refine code

* add code gen

* refine code

* refine code

* refine code

---------
Co-authored-by: Numiswing <umiswing@foxmail.com>
Co-authored-by: Nkangguangli <kangguangli@hotmail.com>

b49a7e26

[dist attr 迁移到 phi]Dist attr (#53848) · be1152a4

由 zhenhailiu 提交于 5月 23, 2023

* merge code from forsish

* polish

* paddle/fluid/pybind/auto_parallel_py.cc

* polish

* polish

* polish

* polish

* polish

* polish

* polish

* polish

* polish

* polish

* polish

be1152a4

[static op generation] tril_triu (#54033) · 4af0f140

由 gouzil 提交于 5月 23, 2023

* [phi] autogen code tril_triu

* [phi][api]fix tril_triu_grad args

* [fluid] clean cmake; [phi] fix infer_meta

4af0f140

C

Fix typos (#54015) · adca3654
由 co63oc 提交于 5月 23, 2023

adca3654
Y
Fix inference fp16 io (#54042) · ae241565
由 Yuanle Liu 提交于 5月 23, 2023
```
* fix trt inference fp16 io

* fix inference fp16 io
```
ae241565
C

Fix typos (#53960) · d89e0367
由 co63oc 提交于 5月 23, 2023

d89e0367
W

Enabel memory optimize pass although MkLDNN is enabled (#53615) · 5996f623
由 weishengying 提交于 5月 23, 2023

5996f623
C
Fix typos, Betweeen to Between (#53952) · ee4eecef
由 co63oc 提交于 5月 23, 2023
```
* Fix typos

* Fix
```
ee4eecef
C

fix typos(#53967) · c36a000d
由 cyberslack_lee 提交于 5月 23, 2023

c36a000d

Functionalize distributed_fused_lamb kernel (#53896) · 5f8e7d8f

由 huangjiyi 提交于 5月 23, 2023

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update HostAlloc

* update param name

* update cpu kernel

* remove kernel header

* update

* update

5f8e7d8f

T

Fix trt runtime destroy issue (#53937) · 6e0cf610
由 Tian Zheng 提交于 5月 23, 2023

6e0cf610
L
add host memory stats (#54036) · 01345a51
由 Leo Chen 提交于 5月 23, 2023
```
* add host memory stats

* add ut
```
01345a51
H
move fusion_group infershape to phi (#53934) · 3dc99088
由 huangjiyi 提交于 5月 23, 2023
```
* update

* update

* update

* set out dtype
```
3dc99088

static graph autogen code support for pad3d op (#53733) · bcf67536

由 Wang Xin 提交于 5月 23, 2023

* static graph autogen code support for pad3d op

* bug fixed

* add ut for pad3d mkldnn op

* fix coverage

* fix bug

* fix bug

* Delete test_pad3d_mkldnn_op.py

bcf67536

Z

[XPU] silu op support to use fast_swish (#53980) · 1ef0de81
由 zhangyikun02 提交于 5月 23, 2023

1ef0de81
R
[CustomDevice] fix auto_paralell (#53842) · 3aa5d64e
由 ronnywang 提交于 5月 23, 2023
```
* [CustomDevice] fix auto_paralell

* update

* update

* update
```
3aa5d64e

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功