Commits · afc2c5984ea1c0919462eeb3d052b36e5e231237 · PaddlePaddle / Paddle

18 Apr, 2023 14 commits
- T
  
  fix xpu test;test=document_fix (#53016) · afc2c598
  Committed by tianshuo78520a on Apr 18, 2023
  
  afc2c598
- H
  register fluid kerenls to phi [part 6.4] (#52881) · 37ca3b4c
  Committed by huangjiyi on Apr 18, 2023
```
* update

* revert lookup_table_op
```
  37ca3b4c
- 张
  
  remove mlu(#53007) · 4d5a3ad6
  Committed by 张春乔 on Apr 18, 2023
  
  4d5a3ad6
- M
  rename _varbase_creator as _create_tensor (#52938) · 240e13a2
  Committed by Meteor Liu on Apr 18, 2023
```
* rename _varbase_creator as create_tensor

* rename _varbase_creator as create_tensor
```
  240e13a2
- G
  【0D output】add 0D output support for linalg.slogdet (#52891) · a7155c5c
  Committed by GGBond8488 on Apr 18, 2023
```
* add 0D output support for inalg.slogdet,test=allcase

* fix zerom dime test error test=allcase

* fix test error test=allcase

* add static backward test, test=allcase
```
  a7155c5c
- R
  
  Set random seed for test_tensordot (#53004) · f1b6a76b
  Committed by Ruibiao Chen on Apr 18, 2023
  
  f1b6a76b
- T
  del read (#52943) · 188efd11
  Committed by tianshuo78520a on Apr 18, 2023
```
* del read

* fix

* test log

* fix

* fix bug
```
  188efd11
- J
  fix the set_value error in cpu (#49804) · 239dbc4e
  Committed by JYChen on Apr 18, 2023
```
* fix the set_value error in cpu

* add a unitest for set_value OP

* fix platform::is_gpu_place

* add todo note for set_value
```
  239dbc4e
- Z
  add autogen code support for rnn op (#52799) · aba6af4f
  Committed by Zhenghai Zhang on Apr 18, 2023
```
* add autogen code support for rnn op

* fix bug

* fix bug
```
  aba6af4f
- L
  add autogen code support for lu (#52802) · f9fadfc4
  Committed by LoneRanger on Apr 18, 2023
```
* add autogen code support for lu

* fix bug

* fix bug

* fix bug

* fix bug
```
  f9fadfc4
- R
  [CustomDevice] add c_identity op (#52982) · 77b4d0f1
  Committed by ronnywang on Apr 18, 2023
```
* [CustomDevice] add c_identity op

* fix use calc stream
```
  77b4d0f1
- X
  
  [prim add instance_norm custom vjp] (#52935) · f7b80ada
  Committed by Xiaoxu Chen on Apr 18, 2023
  
  f7b80ada
- Y
  [AMP] Support overload of paddle.static.amp.decorate function. (#52918) · 79a01d6c
  Committed by Yiqun Liu on Apr 18, 2023
```
* Implement a common AmpTestBase.

* Support overload of decorate.

* Change the ignore list of flake and fix an error.
```
  79a01d6c
- Z
  reorder_prior_box (#52749) · a70d9db9
  Committed by zhangyuqin1998 on Apr 18, 2023
```
* reorder_prior_box

* fix
```
  a70d9db9
17 Apr, 2023 26 commits

Y

[Auto Parallel] Add the micro-bathsize config (#52912) · 94afa5ab
Committed by Yulong Ao on Apr 17, 2023

94afa5ab

mv ps distributed dir (#52885) · 1765d5d1

Committed by tianshuo78520a on Apr 17, 2023

* mv ps distributed dir

* fix

* add del auto_parallel

* add auto_parallel

* fix ps

* fix bug

* fix test bug

* fix test bug

* merge develop fix error

* merge develop fix error

* merge develop fix error

1765d5d1

[Paddle-Inference] Add cutlass conv2d_depthwise (#51792) · bd3b096a

Committed by zhoutianzi666 on Apr 17, 2023

* initial commit for cutlass_teller

* second commit for cutlass_teller

* add conv2d_depthwise python template

* add conv2d_depthwise cutlass template

* /zhoukangkang/paddle_cutlass/Paddle/paddle/fluid/framework/ir/cutlass_teller.h

* refine code in Conv2dFusionCanSupport

* add macro in cutlass_teller.h

* add 3x3 5x5 teller

* add groups not 1 or conv2d_depthwise teller

* only generate conv2d_depthwise kernels where ic is a multiple of 8

* add EXPLICIT in cutlass_teller.h

* final commit

* add split_k_slices in conv2d_depthwise

* make stages == 2

* refactor part of the code

* add CutlassFusionType

* solve illegal memory

* make stride_h=stride_w && make dilation==1

* must check HasAttr(use_cutlass) before GetAttrIfExists

* add CONV2D_DEPTHWISE_BIAS_SILU to OpType2String

* modify decl.h and util.cu

bd3b096a

L
cherry-pick fleet executor from 2.4 (#52896) · bafe287a
Committed by LiYuRio on Apr 17, 2023
```
* cherry-pick fleet executor from 2.4

* fix test case
```
bafe287a
S

Support static graph code-gen for matrix_rank (#52659) · a2aa0087
Committed by Sanbu on Apr 17, 2023

a2aa0087
S
Add unique counter for shared memory used in DataLoader (#52976) · b0911ecb
Committed by sneaxiy on Apr 17, 2023
```
* fix ipc counter

* fix missing std::to_string
```
b0911ecb
Y
[PHI]Unify fluid kernel (Part4) (#52626) · 1b5eba8a
Committed by YuanRisheng on Apr 17, 2023
```
* unify kernel

* fix ci bugs

* fix py3 bugs

* fix py3 bugs

* perfect code
```
1b5eba8a
L
[Test Mv] remove rnn (#52967) · 5e29f30c
Committed by liulinduo on Apr 17, 2023
```
* [Test Mv] remove rnn

* Update test_rnn_cell_api.py
```
5e29f30c
L
【fix bug】Fix bug in parse args with '{,}' (#52968) · be04f258
Committed by lzydev on Apr 17, 2023
```
* fix bug in parse args

* fix bug

* recover legacy_*.yaml

* change 'Out' to Output
```
be04f258
L

add autogen code support for uniform_inplace (#52955) · b9830634
Committed by LoneRanger on Apr 17, 2023

b9830634
G

remove some [-Wunused-paramter] warning (#52924) · 337cc2ca
Committed by Galaxy1458 on Apr 17, 2023

337cc2ca

[CINN] fix concat (#52341) · 31fc763a

Committed by wangzhen38 on Apr 17, 2023

* [CINN] fix concat&pow

* update concat

* composite_backward_api

* for ci

* for ci

* update test & fix opmaker

31fc763a

T

mv ir test (#52834) · b8a848bb
Committed by tianshuo78520a on Apr 17, 2023

b8a848bb
C
[Fused] controlled randomness for fused dropout add (#52903) · e36f80c6
Committed by Chitsing KUI on Apr 17, 2023
```
* add random control for fused dropout add

* add __init__
```
e36f80c6
V
[AMP OP&Test]Add BF16 implementation and unit tests of multinomial (#52898) · d19d2486
Committed by Vvsmile on Apr 17, 2023
```
* fix multinomial

* fix test_elementwise

* fix convert_float_to_uint16

* aadd test_multimial_op

* fix code style
```
d19d2486

【PaddlePaddle Hackathon 4 No.49】: Add float16 data type support for Paddle bce_loss (#50930) · 44e6de98

Committed by thunder95 on Apr 17, 2023

* untracked files

* bce_loss_fp16

* remove unused files

* back max_rel_erro still big

* simplify code

* upd

* fix max_relative_error

* restart ci

* Update test_bce_loss.py

* Update test_bce_loss.py

* Update test_bce_loss.py

* Update test_bce_loss.py

* try to pass test

* restore file

* remove error value

* fix bug

---------
Co-authored-by: Zhang Ting <Douyaer2020@qq.com>

44e6de98

J

Support trt engine auto build in runtime for dynamic shape (#52162) · ebc58548
Committed by JingZhuangzhuang on Apr 17, 2023

ebc58548
W

Remove cinn deny ops for bert test (#52897) · 3de2206c
Committed by WangZhen on Apr 17, 2023

3de2206c
J
【Eager】fix multiply double grad error (#52870) · cf3ddf24
Committed by Jiabin Yang on Apr 17, 2023
```
* fix multiply double grad error

* fix multiply dy only kenrel
```
cf3ddf24

【Hackathon No.32】Optimize the GPU compute performance of the expand_as forward & backward ops for Paddle (#52700) · 3c44e948

Committed by Hanchiao on Apr 17, 2023

* Implement optimized kernel for OP-expand_as.

* Support fp16.
Co-authored-by: Timber-Ye <ye_hanqiao@163.com>
Co-authored-by: BrianQian1999 <brianqianhitsz@gmail.com>

* remove fp16 support

* remove MAX_RANK_SUPPORTED

---------
Co-authored-by: BrianQian1999 <brianqianhitsz@gmail.com>

3c44e948

K

rem cncl keyword in py (#52939) · ea04bef8
Committed by Kim Yann on Apr 17, 2023

ea04bef8
Z

rename_SliceKernel (#52863) · d2b0d63f
Committed by zhangyuqin1998 on Apr 17, 2023

d2b0d63f
张

remove hccl in some .cc files (#52942) · 514d83de
Committed by 张春乔 on Apr 17, 2023

514d83de
张
remove hccl in .py files (#52934) · 27a601e8
Committed by 张春乔 on Apr 17, 2023
```
* remove hccl in .py files

* remove ascend in setup.py.in

* remove ascend in setup.py
```
27a601e8

Add output defs for some kernelsPhi register (#52941) · 23f87442

Committed by Sonder on Apr 17, 2023

* add register info for eigh and eig_gard

* add sync_batch_norm_op.cu register info

* add lamb output register info

* add unique register info

* change type name

* change type name

* add output register info for check_finite_and_unscale

* update cmake and config file

* add register info for adagrad

* fix build error

* add sync to run_unittests.sh

* add register info for unique_consecutive

* fix build error

* add eigh to STATIC_BUILD_TESTS

* update eig_kernel.cc

* update eig_kernel.cc

* fix infer mate error

* fix unique register error

* fix lamb register info error

* fix lamb register info

* update lamb register info

* fix lamb

* remove one Output Register

* update static build file

* add eigh op to disable_wingpu_test

* update run_unittests

23f87442

C

Fix typos, test=document_fix (#52937) · 002f2185
Committed by chenxujun on Apr 17, 2023

002f2185

PaddlePaddle / Paddle synced successfully over 1 year ago
