提交 · b413733824452d624a06c0fa74aec1035b4af253 · PaddlePaddle / Paddle

30 3月, 2023 32 次提交

support prim & cinn test for layer_norm (#51272) · 84504f35

由 Weilong Wu 提交于 3月 30, 2023

* support layer_norm prim and cinn test

* enable cinn test

* fix merge conflict

* polish input for check_output_with_place

* fix merge conflict

* add more test case

* fix merge conflict

* polish test case

* polish op_test

* change ln_g rules

* modify scale is none case

* modify scale is none case

* add public_python_api for check prim

* modify setoutputgrad and fp64bug

* add todo & delete log

* recover

* fix some errors

* recover

* recover

* recover

* recover

* fix merge conflicts

---------
Co-authored-by: Nwangruting <wangruting@baidu.com>

84504f35

[Prim] fix loss of composite rule (#52120) · a4e0f666

由 cyber-pioneer 提交于 3月 30, 2023

* fix_prim

* fix bug

* add note

* fix logic

* fix

* add note

* fix check

* fix bug

* fix bug

* fix bug

* add debug

* fix check

* fix bug

* sync print log

* fix test case

* change default

* change test case time

a4e0f666

Z
move elementwise_raw_kernel to new dir (#51965) · 49461a02
由 zhangyuqin1998 提交于 3月 30, 2023
```
* move elementwise raw

* fix

* fix
```
49461a02
[Zero-Dim] Support broadcast_tensors input 0D and distribution API output 0D (#51721) · 2bd0a946
由 zhouweiwei2014 提交于 3月 30, 2023

2bd0a946
[Bug-fix] fix bug of Tensor.item() when CUDAPinnedPlace (#52322) · 0f9ec013
由 zhouweiwei2014 提交于 3月 30, 2023

0f9ec013
W
[AMP OP&Test] Transpose OP fp16 unitest (#52315) · f1cdd654
由 Wang Xinyu 提交于 3月 30, 2023
```
* transpose fp16 test

* transpose auto tune fp16 test
```
f1cdd654
Z

[Sparse]Fix the bug of elementwise_grad (#52102) · aeb8c2e2
由 zhangkaihuo 提交于 3月 30, 2023

aeb8c2e2
Z
[Move Test] Move prim (#52167) · 3e2d0195
由 Zheng-Bicheng 提交于 3月 30, 2023
```
* update

* update
```
3e2d0195

support complex data types for libpaddle.Tensor's element get and set (#52324) · 13b12457

由 Feiyu Chan 提交于 3月 30, 2023

1. add type caster for paddle's complex type, to allow pybind to automatically cast it with python's complex type;
2. add complex64 and complex128 data type for `libpaddle.Tensor`'s element get and set(which is required to perturb an element to get the numerical derivative)
3. add support for cuda pinned place in `libpaddle.Tensor` element get and set

---
4. fix a bug in op code generation.(Creation of output folder in concurrent with parsing op yamls.)

13b12457

R

[AMP OP&Test] add fp16 test for linspace (#52161) · 40b30f50
由 Roc 提交于 3月 30, 2023

40b30f50

[AMP] Add python API for collecting operator stats. (#52215) · 73544322

由 Yiqun Liu 提交于 3月 30, 2023

* [AMP] Add python API for collecting operator stats.

* Fix import and polish codes.

* Add more unittest.

* Add doc for the new APIs.

73544322

W
add autogen code support for spectral_norm (#52145) · 28927209
由 Wang Xin 提交于 3月 30, 2023
```
* add autogen code support for spectral_norm

* bug fixed

* fix PR-CI-Static-Check fail
```
28927209

[AMP OP&Test]Modify the FP16 and BF16 OpTest of Add_N (#52311) · e3217e3e

由 Vvsmile 提交于 3月 30, 2023

* adjust defalut tolerance of output and grad

* fix a bug in the grad of OpTest

* fix the type of setting defalut value in optest, both forward and
backward

* add defalut

* fix test_sum_op

* fix test_sum_op test for testing add_n

* modify the add_n op_test

e3217e3e

add scatter composite rule. (#52005) · e16eb22c

由 zxcd 提交于 3月 30, 2023

* add scatter composite rule.

* add public_python_api

* add python unit16 support.

* fix code style.

* add cinn to makelist

* cinn unsupport uint16, forbidden cinn when dtype==uint16.

e16eb22c

Y

add xpu cumprod, group norm grad (#52089) · fb16bdc7
由 ykkk2333 提交于 3月 30, 2023

fb16bdc7
Z

[XPU] add delete_concat_op_pass (#52304) · 70ebef81
由 zhupengyang 提交于 3月 30, 2023

70ebef81

Fix bug of c_softmax_with_cross_entropy_op_xpu_op (#52296) · 8ef97088

由 Ghost Screaming 提交于 3月 30, 2023

* Support ignore_index for c_softmax_with_cross_entropy_op.

* Polish code. Remove useless comments and add Testcase.

* Polish code for TestCase.

* Polish code.

* Polish code style.

* Polish code.

* Change loss calculation formula and ignore_index dtype.

* Polish TestCase.

* Fix bug of c_softmax_with_cross_entropy_op_xpu_op. Attribute 'ignore_index'
dtype is int64_t.

8ef97088

傅
[AMP&OP_TEST] Fix interp test case (#52282) · dfa893fd
由傅剑寒提交于 3月 30, 2023
```
* delete check_dygraph and use default atol,max_relative_error

* add test case for bicubic_interp
```
dfa893fd
Y
[AMP OP&Test] Register FP16 for multinomial. (#52107) · 7788b65e
由 yunyaoXYY 提交于 3月 30, 2023
```
* add FP16 for multinomial

* fix input data

* update code

* fix FP16

* fix code
```
7788b65e
K
[Perf] remove sync_calc_stream and sync_comm_stream (#51989) · 0f4229c5
由 kangguangli 提交于 3月 30, 2023
```
* remove sync_calc_stream and sync_comm_stream

* fix ci bug

* fix

* fix

* fix
```
0f4229c5
Z

[AMP] use promote dtype when amp_level=O2 (#51063) · 6f8ab1fa
由 Zhang Ting 提交于 3月 30, 2023

6f8ab1fa
W
[AMP OP&Test] Strided slice fp16 and bf16 unitest (#52220) · 5cdd9f2c
由 Wang Xinyu 提交于 3月 30, 2023
```
* stride slice fp16 and bf16 unitest

* fix code style

* add self.dtype
```
5cdd9f2c

[Test Mv] ipu_test (#52143) · 38a477e2

由 gouzil 提交于 3月 30, 2023

* [Test Mv] ipu_test

* [Test Mv] cmake add py_test_modules

* [Move Test] rm py_test_modules

* rm asp

38a477e2

[AMP OP&Test] assign op add fp16 、bfp16 test (#52233) · 41f0e3c3
由 zhenhailiu 提交于 3月 30, 2023
```
* add fp16 bfp16 test

* polish

* polish

* polish
```
41f0e3c3
[AMP OP&Test] Arg min max bf16 test (#52276) · 3161e6c3
由 zhenhailiu 提交于 3月 30, 2023
```
* polish

* add type check
```
3161e6c3
[AMP OP&Test] element_wise_add_fp16_test (#52240) · bed54a70
由 zhenhailiu 提交于 3月 30, 2023

bed54a70
S
[BugFix]Fix segment fault in order setting (#52293) · d2cdc7e3
由 ShenLiang 提交于 3月 29, 2023
```
* fix bug in proto

* add utest
```
d2cdc7e3

support python object input data broadcast for model parallel (#51765) · 8baf33a4

由 Guoxia Wang 提交于 3月 30, 2023

* support python object input data broadcast for model parallel

* add unittest

* fix

* fix concat 0D tensor

* fix codestyle

8baf33a4

[CodeStyle][C416][C417] rewrite unnecessary comprehension with function call... · 929892c3

由 cyberslack_lee 提交于 3月 30, 2023

[CodeStyle][C416][C417] rewrite unnecessary comprehension with function call and use generator instead of map (#52140)

* codestyle c416 c417

* fix error

* fix inc

* unify all C4 rules into one

* fix inc

---------
Co-authored-by: NSigureMo <sigure.qaq@gmail.com>

929892c3

J

[Test Mv] remove remaining tests in unittests/mlu(#52291) · b6ae6a5d
由 jjyaoao 提交于 3月 30, 2023

b6ae6a5d
W
Del old dygraph varbase (#52236) · d4571470
由 wanghuancoder 提交于 3月 30, 2023
```
* delete old dygraph op test
```
d4571470

Add Gloo SendRecv Function (#52221) · b8850521

由 yuehuayingxueluo 提交于 3月 30, 2023

* add gloo  send_recv

* fix code_stype

* fix CI bug

* fix send_recv.cc

* add send_recv without sync_op

* fix send_recv test

* fix gather.cc

b8850521

29 3月, 2023 8 次提交
- G
  
  fix QAT export bug (#52218) · a523f6b3
  由 Guanghua Yu 提交于 3月 29, 2023
  
  a523f6b3
- Z
  [AMP OP&Test] pad3d add unittests of fp16 and bf16 (#51015) · f86d0be7
  由 zengshao0622 提交于 3月 29, 2023
```
* pad3d add unittests of fp16 and bf16

* pad3d add unittests of fp16 and bf16

* fix cuda place

* fix random to uniform

* fix class name

* fix fp16 max relative error to 1.5e-3

* add dytpe register for onednn

* add pad uint16 check of common.py

* remove check_eager

* test_check_grad --> test_check_grad_normal
```
  f86d0be7
- Z
  
  [Test Mv] custom_runtime (#52021) · 7f86c1dc
  由 Zheng-Bicheng 提交于 3月 29, 2023
  
  7f86c1dc
- Y
  Add group_norm composite rule (#51874) · cabf3921
  由 Yichen Zhang 提交于 3月 29, 2023
```
* add group_norm composite rule

* add test for scale_grad and bias_grad

* resolve conflicts

* remove amp in composite_rule.py

* add float16 test

* deal with NHWC format

* keep the composite rule in float16 identical as original kernel

* resolve conflicts
```
  cabf3921
- R
  
  [KUNLUN]fix cast bf16 (#52246) · 548d5522
  由 Roc 提交于 3月 29, 2023
  
  548d5522
- W
  Del old dygraph optest8 (#52094) · d612faf5
  由 wanghuancoder 提交于 3月 29, 2023
```
* delete old dygraph op test
```
  d612faf5
- H
  [XPU] fix ut: test_kldiv_loss_op_xpu, test_temporal_shift_op_xpu (#52258) · b83e506b
  由 houj04 提交于 3月 29, 2023
```
* fix test_kldiv_loss_op_xpu

* fix test_temporal_shift_op_xpu
```
  b83e506b
- W
  [AMP OP&Test] Add fp16/bf16 to clip op (#52158) · ad01eccd
  由 wuyefeilin 提交于 3月 29, 2023
```
* add fp16/bf16 to clip op

* fix as reviewed

* update test_clip_op.py

* update test_clip_op.py
```
  ad01eccd

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功