Commits · e492ee2485d93664b0bb7513e5e92bc3db9cc126 · PaddlePaddle / Paddle

28 Mar, 2023 · 21 commits
- fix a typo, `sheduler` -> `scheduler` (#52149) · e492ee24
  Committed by Nyakku Shigure on Mar 28, 2023
  
  e492ee24
- Polish ResNet and Bert prim_cinn test (#52030) · e57051b4
  Committed by WangZhen on Mar 28, 2023
```
* Polish ResNet and Bert prim_cinn test
```
  e57051b4
- support auto generate for huber_loss (#51951) · 2ba4515e
  Committed by cyberslack_lee on Mar 28, 2023
```
* fix huber_loss

* fix

* fix ops.yaml add intermediate

* fix

* fix test
```
  2ba4515e
- 【Hackathon 4th 】add Trapezoid API && add Cumulative_trapezoid API (#51195) · cfe3ff48
  Committed by Ainavo on Mar 28, 2023
  
  cfe3ff48
- [CodeStyle][PLR1701] unify multiple isinstance expressions as one (#52150) · c1838da6
  Committed by Kim on Mar 28, 2023
  
  c1838da6
- [CodeStyle][B020] rename iterable and it's iterates when they have same name (#52128) · c05feb90
  Committed by gouzil on Mar 28, 2023
```
* [CodeStyle][B020] rename for

* [CodeStyle][B020] rename for
```
  c05feb90
- [API/OP] Support FP16/BF16 in paddle.nonzero API/OP (#51640) · 2e92357b
  Committed by Haohongxiang on Mar 28, 2023
  
  2e92357b
- [AMP OP&Test] add fp16/bf16 unittest for conv ops (#51787) · ad5536eb
  Committed by wangxinxin08 on Mar 28, 2023
```
* add unittest for conv2d/depthwise_conv2d/conv2d_transpose

* add bf16 for DWConv and ConvTranspose

* fix unitest of conv2d_transpose

* modify DWConv2d op and unittest

* fix unittest of conv2d_transpose_bf16

* modify unittest name according to review

* modify atol of DWConv2D unittest
```
  ad5536eb
- [Test Mv] autograd_test (#52142) · b94ef537
  Committed by gouzil on Mar 28, 2023
```
* [Test Mv] autograd_test

* [Move Test] rm py_test_modules
```
  b94ef537
- delete old dygraph mkldnn op test (#51953) · 67a105f9
  Committed by wanghuancoder on Mar 28, 2023
```
* delete old dygraph mkldnn op test
```
  67a105f9
- [AMP OP&Test] Add float16 OpTest for squeeze, unsqueeze (#52018) · 866c2877
  Committed by Wang Xinyu on Mar 28, 2023
```
* add squeeze, unsqueeze, transpose fp16 unitest

* Update test_transpose_op.py
```
  866c2877
- [Auto Parallel] Add o1 level tune (#52041) · d6011cb6
  Committed by caozhou on Mar 28, 2023
```
* add tune o1 level

* add unittest
```
  d6011cb6
- [AMP OP&Test]shape op fp/bf16 support (#52184) · 418b983c
  Committed by YuhangLi on Mar 28, 2023
  
  418b983c
- [Test Mv] move collective/multinode to test dir (#51982) · ceca55c5
  Committed by Ainavo on Mar 28, 2023
```
* [Test Mv] move collective/multinode to test dir

* add CMakeList.txt to test/collective

* add bash_test_modules

* adjust the order

* recover bash_test_modules

* add_subdirectory(collective)

* resolve conflicts

* resolve conflicts
```
  ceca55c5
- [AMP] add fp16&bf16 support for flatten op (#52035) · a33a4d01
  Committed by jiangcheng on Mar 28, 2023
```
* [AMP] add fp16&bf16 support for flatten op

* fix ci bug

* fix inpute should astype self.dtype bug and fix zerodim test name

* remove 0D-tensor bf16 test for window-inference-ci pass

* remove flatten from op_accuracy_white_list
```
  a33a4d01
- [Move Test] move rpc (#52166) · a34abdb5
  Committed by Zheng-Bicheng on Mar 28, 2023
```
* update

* update
```
  a34abdb5
- [CodeStyle][B015] replace pointless comparisons with appropriate statements (#52126) · 3957007c
  Committed by gouzil on Mar 28, 2023
```
* [CodeStyle][B015] delete unused

* [CodeStyle][B015] add assert
```
  3957007c
- [CodeStyle][PLC0414] remove self-alias and some discussion (#52122) · 888b8b6b
  Committed by 张春乔 on Mar 28, 2023
  
  888b8b6b
- [CodeStyle][C405] Unnecessary <list/tuple> literal - rewrite as a set literal (#51972) · 9fa98349
  Committed by Infinity_lee on Mar 28, 2023
  
  9fa98349
- 【Prim】Optimize composite rule by making scalar shape as 1 (#51960) · 45acb717
  Committed by Jiabin Yang on Mar 28, 2023
```
* optimize composite rule by making scalar shape as []1

* fix shape usage for 0D

* fix rules

* fix 0D error

* fix flatten 0D error

* fix bn eval mode

* fix bn test

* fix flatten
```
  45acb717
- [Hackathon NO.77] Add bitwise operator to Paddle-TRT (#51971) · 864b50c3
  Committed by Young-Flash on Mar 28, 2023
```
* add bitwise_not trt converter

* run pre-commit

* modify neg_one_tensor_dims init way

* fix BOOL type support requires TensorRT 8.4

* fix int8 & uint8 type

* improve data type readability

* modify filter logic

* fix coverage CI
```
  864b50c3
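
Several of the 28 Mar commits above are named after lint rules (PLR1701, B020, B015, C405). The snippet below is a minimal sketch of what each rule asks for, as a before/after pair where that is practical. The function and variable names are hypothetical and are not taken from the Paddle diffs; the rule descriptions follow the commit titles rather than any linter's exact wording.

```python
# Hypothetical before/after illustrations of the lint rules named in the
# commit titles above. None of these functions exist in Paddle.


def is_number_before(x):
    # PLR1701 flags repeated isinstance calls on the same object.
    return isinstance(x, int) or isinstance(x, float)


def is_number_after(x):
    # Fixed: one isinstance call with a tuple of types.
    return isinstance(x, (int, float))


def first_items_after(rows):
    # B020 flags loops such as `for rows in rows:` where the loop variable
    # reuses the name of the iterable. Fixed: the loop variable gets its
    # own name.
    result = []
    for row in rows:
        result.append(row[0])
    return result


def check_positive_after(value):
    # B015 flags a bare comparison statement such as `value > 0`, whose
    # result is discarded. Fixed: assert it (or delete it if unneeded).
    assert value > 0
    return value


# C405 flags set([...]) or set((...)) built from a literal.
SUPPORTED_BEFORE = set(["float16", "bfloat16"])
# Fixed: use a set literal directly.
SUPPORTED_AFTER = {"float16", "bfloat16"}
```

In each pair the behaviour is unchanged; only the spelling the linter prefers differs.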
27 Mar, 2023 · 8 commits

add prim test for some ops (#51749) · e1674e8b

Committed by Charles-hit on Mar 27, 2023

* add tanh and cast prim test

* fix tanh test

* fix 0-d test

* add sqrt fp16 prim test

* add public_python_api in prim test

* fix test_squeeze2_op

* add tanh prim test

* add dropout prim test

* [Dy2St]Fix clone for test state problem

* clean code

* modify test_cumsum_op

* modify test_cumsum_op

* fix dropout test

* add dropout in cmake

* fix dropout test

---------
Co-authored-by: Aurelius84 <zhangliujie@baidu.com>

e1674e8b

fix eval branch of composite rule of batch_norm (#52154) · 20befdef
Committed by cyber-pioneer on Mar 27, 2023

20befdef

[Zero-Dim] add FLAGS_set_to_1d, control whether to hack process to 1D, add ut for xpu (#51899) · 134c9c0c
Committed by zhouweiwei2014 on Mar 27, 2023

134c9c0c

elementwise: onednn: support zero dimension inputs (#51656) · 2c1d494e
Committed by Xinyu Chen on Mar 27, 2023

2c1d494e

unbind support bool dtype (#52080) · 553630aa
Committed by Leo Chen on Mar 27, 2023
```
* unbind support bool dtype

* replace np.array_equal
```
553630aa

Add data type of int, int64 for add kernel. Modify the code style of (#50443) · 62bff0e0
Committed by Leo Guo on Mar 27, 2023
```
instance_norm_grad kernel. Fix bugs that the data type of input is different from output in reduce_sum kernel. test=kunlun
```
62bff0e0

Fused elementwise_(mul/div) (#50428) · 968f7f24

Committed by Sławomir Siwek on Mar 27, 2023

* extract Op and OPMaker to .h

* extend pattern for fused_op

* set "with_residual" default to false

* adjust fuse passes

* remove fc+eltwise flag

* fused_output_scale

* activation attrs

* remove extra attrs

* fix int8/bf16 unit tests

* simplify RecomputeOutputDims

* remove unused method

* Add description for attributes

* add extra check

* adjust op compats

* update quantize test

* fix protobuf parsing error

* fix int8 performance

* fused elementwises

* merge develop

* remove activation

* restore activation for existing add/sub ops

968f7f24

[XPU] layer_norm support fp16 input of scale and bias. (#52091) · 14abafa1
Committed by houj04 on Mar 27, 2023

14abafa1

25 Mar, 2023 · 5 commits
- [CodeStyle][PLR0402] import a.b to from a import b (#52125) · 8c17fc0b
  Committed by 张春乔 on Mar 25, 2023
  
  8c17fc0b
- [CodeStyle][UP027] Replace unpacked list comprehension with a generator expression (#52025) · 3dbc0e46
  Committed by Infinity_lee on Mar 25, 2023
```
* codestyle up027

* add to pyproject.toml
```
  3dbc0e46
- [CodeStyle][UP028] using yield from (#52059) · 85e20755
  Committed by 张春乔 on Mar 25, 2023
  
  85e20755
- [CodeStyle][UP009] mv unnecessary utf8 declaration (#52050) · 33b289d7
  Committed by 张春乔 on Mar 25, 2023
  
  33b289d7
- [Test Mv] remove mlu (#52064) · e5414f76
  Committed by jjyaoao on Mar 25, 2023
  
  e5414f76
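
The 25 Mar CodeStyle commits above name the rules PLR0402, UP027, UP028 and UP009. As a rough illustration only, and assuming the usual meaning of those rule codes, the sketch below shows the rewrites on small made-up examples; none of the names come from the Paddle code base.

```python
# Hypothetical illustrations of the rules named in the 25 Mar commit titles.

# UP009: a `# -*- coding: utf-8 -*-` line at the top of a Python 3 file is
# redundant, because UTF-8 is already the default source encoding.

# PLR0402: `import os.path as path` can be written as a from-import.
from os import path

# UP027: unpacking a list comprehension builds a throwaway list, e.g.
#   first, second = [n * 2 for n in (1, 2)]
# A generator expression unpacks the same values without the extra list.
first, second = (n * 2 for n in (1, 2))


def chain_before(groups):
    # UP028 flags a for loop whose body only yields each item.
    for group in groups:
        for item in group:
            yield item


def chain_after(groups):
    # Fixed: delegate to the inner iterable with `yield from`.
    for group in groups:
        yield from group
```

Both generator functions produce the same sequence; `yield from` simply removes the inner loop.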
24 Mar, 2023 · 6 commits

add phi operator allreduce/reduce (#51857) · 47f87ad3

Committed by TaoTao Li on Mar 24, 2023

* add all_reduce, reduce kernel and api

* fix all_reduce reduce ut

fix reduce op maker conflict

fix merge conflicts

* fix conflicts, rename ReduceOp->ReduceBaseOp in reduce_ops

rename allreduce op, to remove

* fix code format

fix comments

* modify test_collective_reduce_api ut timeout

* fix PR-CI-Build

fix comments: format phi operator

47f87ad3

Del old dygraph optest5 (#51686) · 6261076c
Committed by wanghuancoder on Mar 24, 2023
```
* delete old dygraph op test
```
6261076c

Del old dygraph MLU NPU (#51958) · 611f7ccc
Committed by wanghuancoder on Mar 24, 2023
```
* delete old dygraph, mlu npu do not use dygraph
```
611f7ccc

Memory Efficient Attention (#51867) · e5ad3859

Committed by ZhangDY-6483 on Mar 24, 2023

* first version, notest

* return final rst, notest

* use infinity() instead of max

* ut structure

* start up of ut

* generate lse

* update

* add depense

* reconstruct cmake

* move file

* add memory efficient attention and fix blasimpl

* update

* update cmake

* add namespace

* update cmake

* use .cu

* update for pad3d

* bug fix

* bug fix

* update

* bug fix

* update enforce

* add test case

* merge the lse pad

* fix kernel_fn of backward

* fix PADDLE_ENFORCE_EQ and phi_api

* fix PADDLE_ENFORCE

* fix PADDLE_ENFORCE

* rerun coverage

* fix memory efficient attention test

* rerun ci

* add cuda version condition

* add cuda version condition

* delete WIP test

* replace PADDLE_ENFORCE

* edit the namespace of datatype in multiple.cc

* rerun

* rerun

---------
Co-authored-by: liuyuang <liuyuang@baidu.com>

e5ad3859

do not test dygraph in dygraph (#52027) · 298a1a0b
Committed by wanghuancoder on Mar 24, 2023
```
* xpu do not test dygraph in dygraph
```
298a1a0b

Fix roll kernel gpu bug. (#52012) · b6d0dac9
Committed by Yuang Liu on Mar 24, 2023

b6d0dac9

PaddlePaddle / Paddle · synced successfully about 1 year ago