Commits · 0480ff5de0e919b5bcba96a7b54284d0dcdd55fd · PaddlePaddle / Paddle

23 Mar, 2023 5 commits
- 【Hackathon No.45】Implement float16 data type support for Paddle logical operators (#50926) · 0480ff5d
  Committed by denglianbin on Mar 23, 2023
```
* finish pr

* skip cpu test for logical

* change test style

* fix error.
```
- Change custom op flag & fix path bug (#51799) · 56257b3a
  Committed by haosicheng on Mar 23, 2023
- 【Eager】Fix error raise (#51963) · 3704471d
  Committed by Jiabin Yang on Mar 23, 2023
```
* allow return none when stop_gradient=True

* remove useless code

* refine code

* refine code

* fix test cast

* change more test

* add more tests

* fix error msg in pylayer
```
- [CodeStyle][C404] Unnecessary list comprehension (rewrite as a dict comprehension) (#51969) · 1f8e6ad6
  Committed by Infinity_lee on Mar 23, 2023
- [CodeStyle][UP012] Unnecessary call to encode as UTF-8 (#51994) · 9796980c
  Committed by 张春乔 on Mar 23, 2023
22 Mar, 2023 35 commits

Support optimizers operator to be generated (#51767) · 0b008e0c

Committed by HappyHeavyRain on Mar 22, 2023

* test_get_kernel

* add invoke signature

* change reduce_max

* change frobenius_norm

* reset reduce_max according to composite and change reduce_all

* fix the bug when Scalar(*)

* fix 'scalar when support_tensor'

* change code according to review

* change 'keep_signature' to 'manual_signature' and add some error info

* support optimizers autogen

* change sgd yaml

* change generate signature

* fix test/cpp/new_executor/CM

* reset signature generated function

* change signature function

* change signature function

[Zero-Dim] Support 0-D tensor for some oneDNN unary kernels (#51687) · 2a3d75bc

Committed by YangQun on Mar 22, 2023

* support 0-d tensor for element wise unary ops

* fix python code style check

* fix approval check

* support 0-d tensor for onednn softmax and logsoftmax kernels

* fix comments

* fix some unittests

Correct lstm qat test (#51499) · 31f81685
Committed by joanna.wozna.intel on Mar 22, 2023

add fused dropout add (#51752) · 6ba0507d
Committed by ShenLiang on Mar 22, 2023

[XPU] fix distribute_fpn_proposals (#51873) · a10718e8
Committed by duanyanhui on Mar 22, 2023
```
* fix distribute_fpn_proposals

* fix bug
```

Add fused_feed_forward pass (#50423) · 5dda0ef6

Committed by Ghost Screaming on Mar 22, 2023

* Add fused_feed_forward pass for semi-automatic static graph training.

* Add fused_feedforward property in parallel_executor.cc

* Polish code.

* Polish fused feed_forward pass code. Support use_dropout1 and use_dropout2 option.

* Support model parallel in fused_feedforward pass.

Extract fused_transpose op dedicated for oneDNN fuse passes (#50021) · 02296977

Committed by Sławomir Siwek on Mar 22, 2023

* extract common methods to reuse

* add header for transpose ops

* fused_transpose

* Split big function

* transpose2 tests

* fused_transpose

* Apply extra attributes

* add pbtxt file

* update pbtxt

* Merge develop

* add more strict op compats

* code style

* remove mkldnn_data_type

* unify SetOutMemDescWithReshape2FuseSupport

* adjust quantize-dequantize for transpose

* remove appendact

* transpose2 quantization

* fix int8 tests

* adjust transpose_op to current develop

* delete fusion code from transpose_kernel

* add fused transpose to NHWC unittest

* change order

[PHI] Add multiclass_nms3 output defs (#51355) · 06cb6553
Committed by PuQing on Mar 22, 2023
```
* add nms3 register output defs

* remove nms from set

* remove nms from set
```
【AMP OP&Test】unit test for test_logit_op (#51051) · 289677e2
Committed by Bo Zhang on Mar 22, 2023
```
* test_logit_op

* add cudaKernel to replace eigen impl

* bf16 unit test CI
```
[XPU] fix unit test of test_pad3d_op_xpu. (#51962) · de2166c0
Committed by houj04 on Mar 22, 2023

[AMP OP&Test] Fix fp16 check_grad when user_defined_grads is not None (#51959) · 153351e1
Committed by Zhang Zheng on Mar 22, 2023
```
* [AMP OP&Test] Fix fp16 check_grad when user_defined_grads are not None

* fix cond
```
[CustomOP Optional] CustomOP supports optional Tensor (#51923) · b74e00e1
Committed by HongyuJia on Mar 22, 2023
```
* [CustomOP Optional] CustomOP supports optional Tensor

* fix test_custom_concat, add pytest to CMakeLists
```
remove net_drawer.py, memory_analysis.py (#51869) · af2fa429
Committed by LoneRanger on Mar 22, 2023
```
* remove net_drawer.py

* remove memory_analysis.py

* remove test_memory_analysis.py
```
add autogen code support for index_add op (#51887) · 3065fa2c
Committed by Wang Xin on Mar 22, 2023
```
* add autogen code for index_add op

* bug fixed
```
Fix type error in adagrad_kernel (#51790) · 8ef020c1
Committed by niuliling123 on Mar 22, 2023

Revert "[AMP OP&Test] Support float & bfloat16 when using thrust (#51627)" (#51897) · 57e368b8
Committed by Zhang Zheng on Mar 22, 2023
```
This reverts commit 3b2cd23a.
```
[BugFix] fix raw_program_optimizer not apply when using amp (#51865) · 202c06a2
Committed by kangguangli on Mar 22, 2023
```
* fix raw_program_optimizer not apply when using amp

* fix CI
```
fix dtype checking for softmax (#51929) · 59841444
Committed by Zhang Ting on Mar 22, 2023

support auto generate for p_norm (#51590) · 2b98993b
Committed by RedContritio on Mar 22, 2023
```
* support auto generate p_norm

* fix bug in backward
```
support auto generate for dirichlet (#51601) · ec877d1f
Committed by RedContritio on Mar 22, 2023
```
* support auto generate for dirichlet

* use uppercase in args

* use op_compat for name mapping
```
Add reduce_max_grad composite rule (#51653) · d04c9cda
Committed by wangxiaoning on Mar 22, 2023
```
* max comp

* fix

* add test

* fix

* fix

* fix

* fix

* fix test

* fix api
```

fix ninja error (#50617) · 9b2b3dad

Committed by risemeup1 on Mar 22, 2023

* fix ninja error

* fix_ninja_error

* fix ninja error

* fix r-200 ci ninja error

Add fused_linear_param_grad_add_kernel (#51805) · f59c5d8b

Committed by sneaxiy on Mar 22, 2023

* add fused_linear_param_grad_add_kernel

* fix compile error

* remove flag

* fix ci compile error

* fix ci compile error

* revert pylayer revision

* fix ci ut

* improve performance

inference support double data type (#51786) · a765eb26
Committed by Yuanle Liu on Mar 22, 2023

[IR] Attribute system (#51636) · 586d9018

Committed by zhangbo9674 on Mar 22, 2023

* add Attribute system to new ir

* set StorageType to Storage in Type and Attribute

* refine strAttr

* refine name of StrAttribute

* add DictionaryAttribute

* refine code

* refine dic_attr

* refine code

* Set DictionaryAttribute ParamKey is map

* refine code

* refine code by comment

* refine code

* refine code

* refine code

* refine code

* fix compile bug

* refine code

* add const for Attribute storage

[CodeStyle][UP018] Unnecessary call to `str` (#51922) · 52a31b87
Committed by iSerendipity on Mar 22, 2023

【Eager】Allow return none when stop_gradient=False (#51740) · db599258

Committed by Jiabin Yang on Mar 22, 2023

* allow return none when stop_gradient=True

* remove useless code

* refine code

* refine code

* fix test cast

* change more test

* add more tests

【AMP OP&Test】unit test for accuracy_op (#51009) · 8c61a95a

Committed by Bo Zhang on Mar 22, 2023

* test_accuracy_op

* add create_test_fp/bf16_class

* cast after calculation

* change convert_uint16_to_float_ifneed

* delete TestAccuracyOpFp32 according to PR comment

* fix the rtol setting rules in bfloat16 forward

first commit (#51947) · 320a5b23
Committed by limingshu on Mar 22, 2023

[Test Mv] legacy_test (#51941) · 1617ba76
Committed by Zheng-Bicheng on Mar 22, 2023

[AMP OP&Test] Fix the rtol setting rules in bfloat16 forward (#51875) · f29c0ca1
Committed by Zhang Zheng on Mar 22, 2023

Replace OpTest.assertTrue(numpy.allclose) to numpy.testing.assert_allclose (#51690) · 75fb2ed9
Committed by Zhang Zheng on Mar 22, 2023

[Dygraph] Support main_grad in hybrid_parallel for BF16 training (#51204) · e335ae29
Committed by Haohongxiang on Mar 22, 2023

[AutoParallel] BF16-o1/FP16-o1 PASS support training and generation (#51147) · 5cb0f3aa

Committed by zhaoyingli on Mar 22, 2023

* [AutoParallel] support bloom

* fix import

* align amp and bf16

* update func name

* clipbyglobalnorm and add_n support bf16

* upgrade amp strategy api

* update bf16 unittest

* fix static clip

---------
Co-authored-by: liangjianzhong <liangjianzhong@baidu.com>
Co-authored-by: Aurelius84 <zhangliujie@baidu.com>

Case7:paddle.distribution.Beta：fix beta(true stack) (#51847) · 32baca93
Committed by Difer on Mar 22, 2023

PaddlePaddle / Paddle synced successfully over 1 year ago
