Commits · a33a4d013c41f821692cef2799f63319a89a4186 · PaddlePaddle / Paddle

28 Mar, 2023 · 10 commits
- J
  [AMP] add fp16&bf16 support for flatten op (#52035) · a33a4d01
  Committed by jiangcheng on Mar 28, 2023
```
* [AMP] add fp16&bf16 support for flatten op

* fix ci bug

* fix inpute should astype self.dtype bug and fix zerodim test name

* remove 0D-tensor bf16 test for window-inference-ci pass

* remove flatten from op_accuracy_white_list
```
  a33a4d01
- Z
  [Move Test] move rpc (#52166) · a34abdb5
  Committed by Zheng-Bicheng on Mar 28, 2023
```
* update

* update
```
  a34abdb5
- Z
  [Move Test] move contrib (#52168) · 28a76556
  Committed by Zheng-Bicheng on Mar 28, 2023
```
* update

* update
```
  28a76556
- G
  [CodeStyle][B015] replace pointless comparisons with appropriate statements (#52126) · 3957007c
  Committed by gouzil on Mar 28, 2023
```
* [CodeStyle][B015] delete unused

* [CodeStyle][B015] add assert
```
  3957007c
- 张
  
  [CodeStyle][PLC0414] remove self-alias and some discussion (#52122) · 888b8b6b
  Committed by 张春乔 on Mar 28, 2023
  
  888b8b6b
- I
  
  [CodeStyle][C405] Unnecessary <list/tuple> literal - rewrite as a set literal (#51972) · 9fa98349
  Committed by Infinity_lee on Mar 28, 2023
  
  9fa98349
- Z
  
  [Test Mv] python/paddle/fluid/tests/custom_op/*.py to test/custom_op (#51948) · 7aa7fc49
  Committed by Zheng-Bicheng on Mar 28, 2023
  
  7aa7fc49
- J
  【Prim】Optimize composite rule by making scalar shape as 1 (#51960) · 45acb717
  Committed by Jiabin Yang on Mar 28, 2023
```
* optimize composite rule by making scalar shape as []1

* fix shape usage for 0D

* fix rules

* fix 0D error

* fix flatten 0D error

* fix bn eval mode

* fix bn test

* fix flatten
```
  45acb717
- Y
  [Hackathon NO.77] Add bitwise operator for Paddle-TRT (#51971) · 864b50c3
  Committed by Young-Flash on Mar 28, 2023
```
* add bitwise_not trt converter

* run pre-commit

* modify neg_one_tensor_dims init way

* fix BOOL type support requires TensorRT 8.4

* fix int8 & uint8 type

* improve data type readability

* modify filter logic

* fix coverage CI
```
  864b50c3
- I
  
  [CodeStyle][UP024] Replace aliased errors with OSError (#52024) · 8c888eea
  Committed by Infinity_lee on Mar 28, 2023
  
  8c888eea
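
For readers unfamiliar with the UP024 rule named in the last entry above, the following is a minimal sketch of the idea, not code taken from the commit. It relies on the fact that, since Python 3.3, names such as `IOError`, `EnvironmentError`, and `socket.error` are aliases of `OSError`. The helper `read_missing` and the path used with it are hypothetical and exist only for illustration.

```python
import socket

# Since Python 3.3 these names are plain aliases of OSError,
# so catching OSError covers all of them.
aliases = [IOError, EnvironmentError, socket.error]
all_same = all(alias is OSError for alias in aliases)


def read_missing(path):
    """Hypothetical helper: try to read a file, report the error class name."""
    try:
        with open(path) as handle:
            return handle.read()
    except OSError as exc:  # preferred spelling instead of IOError
        return type(exc).__name__
```

Because the aliases are the same object as `OSError`, a rewrite of this kind should not change which exceptions are caught.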
27 Mar, 2023 · 11 commits

G
[CodeStyle][PLC3002][PLE1205] simplify lambda and add missing placeholder to... · b166581a
Committed by gouzil on Mar 27, 2023
```
[CodeStyle][PLC3002][PLE1205] simplify lambda and add missing placeholder to logger template (#52133)
```
b166581a

[CodeStyle][C413][C414] Unnecessary <list/reversed> call around... · af6f262d

Committed by Infinity_lee on Mar 27, 2023

[CodeStyle][C413][C414] Unnecessary <list/reversed> call around sorted(),<list/reversed/set/sorted/tuple> call within <list/set/sorted/tuple>() (#52065)

af6f262d

add prim test for some ops (#51749) · e1674e8b

Committed by Charles-hit on Mar 27, 2023

* add tanh and cast prim test

* fix tanh test

* fix 0-d test

* add sqrt fp16 prim test

* add public_python_api in prim test

* fix test_squeeze2_op

* add tanh prim test

* add dropout prim test

* [Dy2St]Fix clone for test state problem

* clean code

* modify test_cumsum_op

* modify test_cumsum_op

* fix dropout test

* add dropout in cmake

* fix dropout test

---------
Co-authored-by: Aurelius84 <zhangliujie@baidu.com>

e1674e8b

C

fix eval branch of composite rule of batch_norm (#52154) · 20befdef
Committed by cyber-pioneer on Mar 27, 2023

20befdef
[Zero-Dim] add FLAGS_set_to_1d, control whether to hack process to 1D, add ut for xpu (#51899) · 134c9c0c
Committed by zhouweiwei2014 on Mar 27, 2023

134c9c0c
X

elementwise: onednn: support zero dimension inputs (#51656) · 2c1d494e
Committed by Xinyu Chen on Mar 27, 2023

2c1d494e

[CustomOP Inplace] Automap inplace dtype and shape, support vector<Tensor> output (#52114) · 04025237

Committed by HongyuJia on Mar 27, 2023

* [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output

* delete dtype,shape func of multi_inplace op

* [CustomOP Inplace] Automap inplace dtype and shape, support vector<Tensor> output

04025237

L
unbind support bool dtype (#52080) · 553630aa
Committed by Leo Chen on Mar 27, 2023
```
* unbind support bool dtype

* replace np.array_equal
```
553630aa
L
Add data type of int, int64 for add kernel. Modify the code style of (#50443) · 62bff0e0
Committed by Leo Guo on Mar 27, 2023
```
instance_norm_grad kernel. Fix bugs that the data type of input is different from output in reduce_sum kernel. test=kunlun
```
62bff0e0

Fused elementwise_(mul/div) (#50428) · 968f7f24

Committed by Sławomir Siwek on Mar 27, 2023

* extract Op and OPMaker to .h

* extend pattern for fused_op

* set "with_residual" default to false

* adjust fuse passes

* remove fc+eltwise flag

* fused_output_scale

* activation attrs

* remove extra attrs

* fix int8/bf16 unit tests

* simplify RecomputeOutputDims

* remove unused method

* Add description for attributes

* add extra check

* adjust op compats

* update quantize test

* fix protobuf parsing error

* fix int8 performance

* fused elementwises

* merge develop

* remove activation

* restore activation for existing add/sub ops

968f7f24

H

[XPU] layer_norm support fp16 input of scale and bias. (#52091) · 14abafa1
Committed by houj04 on Mar 27, 2023

14abafa1

25 Mar, 2023 · 8 commits
- 张
  
  [CodeStyle][PLR0402] import a.b to from a import b (#52125) · 8c17fc0b
  Committed by 张春乔 on Mar 25, 2023
  
  8c17fc0b
- Z
  
  [Test Mv] quantization (#51942) · 0c5a4bac
  Committed by Zheng-Bicheng on Mar 25, 2023
  
  0c5a4bac
- 张
  
  [CodeStyle][C411] replace list() with [] (#52057) · b811043a
  Committed by 张春乔 on Mar 25, 2023
  
  b811043a
- G
  
  [CodeStyle][B016] Clear raise (#52118) · bf1771d0
  Committed by gouzil on Mar 25, 2023
  
  bf1771d0
- I
  [CodeStyle][UP027] Replace unpacked list comprehension with a generator expression (#52025) · 3dbc0e46
  Committed by Infinity_lee on Mar 25, 2023
```
* codestyle up027

* add to pyproject.toml
```
  3dbc0e46
- 张
  
  [CodeStyle][UP028] using yield from (#52059) · 85e20755
  Committed by 张春乔 on Mar 25, 2023
  
  85e20755
- 张
  
  [CodeStyle][UP009] mv unnecessary utf8 declaration (#52050) · 33b289d7
  Committed by 张春乔 on Mar 25, 2023
  
  33b289d7
- J
  
  [Test Mv] remove mlu (#52064) · e5414f76
  Committed by jjyaoao on Mar 25, 2023
  
  e5414f76
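
As an illustration of the "[CodeStyle][UP028] using yield from" entry in the list above, here is a small sketch of the kind of rewrite that rule suggests. It is not taken from the Paddle diff; the function names `chain_loop` and `chain_yield_from` are hypothetical.

```python
def chain_loop(iterables):
    """Older style: re-yield each item with an explicit inner loop."""
    for iterable in iterables:
        for item in iterable:
            yield item


def chain_yield_from(iterables):
    """Style preferred by UP028: delegate to the inner iterable."""
    for iterable in iterables:
        yield from iterable


data = [[1, 2], (3,), "ab"]
loop_result = list(chain_loop(data))
delegated_result = list(chain_yield_from(data))
```

For plain iteration like this, both generators produce the same sequence of items; `yield from` is simply the shorter spelling.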
24 Mar, 2023 · 11 commits

Z

[Test Mv] python/paddle/fluid/tests/custom_kernel/*.py to test/custom_kernel (#51946) · 24740ccd
Committed by Zheng-Bicheng on Mar 24, 2023

24740ccd

add phi operator allreduce/reduce (#51857) · 47f87ad3

Committed by TaoTao Li on Mar 24, 2023

* add all_reduce, reduce kernel and api

* fix all_reduce reduce ut

fix reduce op maker conflict

fix merge conflicts

* fix conflicts, rename ReduceOp->ReduceBaseOp in reduce_ops

rename allreduce op, to remove

* fix code format

fix comments

* modify test_collective_reduce_api ut timeout

* fix PR-CI-Build

fix comments: format phi operator

47f87ad3

W
Del old dygraph optest5 (#51686) · 6261076c
Committed by wanghuancoder on Mar 24, 2023
```
* delete old dygraph op test
```
6261076c

[PHI Decoupling]Remove memory header (Part3) (#51288) · 3d78e759

Committed by YuanRisheng on Mar 24, 2023

* decouple memory copy

* fix ci bugs

* fix ci compile bugs

* fix rocm compile

* fix ci bugs

* decouple memory

* deal with conflict

* fix xpu compile bugs

* fix xpu bugs

* deal with xpu bugs

* fix cmake bugs

* fix windows bugs

* fix ci bugs

* fix ci bugs

* delete redundance code

* add code for pybind

* fix py3 bugs

* fix ci bugs

3d78e759

W
Del old dygraph MLU NPU (#51958) · 611f7ccc
Committed by wanghuancoder on Mar 24, 2023
```
* delete old dygraph, mlu npu do not use dygraph
```
611f7ccc
Z
[Test Mv] python/paddle/fluid/tests to test/legacy_test (#51944) · ba7c62f8
Committed by Zheng-Bicheng on Mar 24, 2023
```
update
```
ba7c62f8
Z

[Test Mv] python/paddle/fluid/tests/book to test/book (#51945) · 0da62ab7
Committed by Zheng-Bicheng on Mar 24, 2023

0da62ab7

Memory Efficient Attention (#51867) · e5ad3859

Committed by ZhangDY-6483 on Mar 24, 2023

* first version, notest

* return final rst, notest

* use infinity() instead of max

* ut structure

* start up of ut

* generate lse

* update

* add depense

* reconstruct cmake

* move file

* add memory efficient attention and fix blasimpl

* update

* update cmake

* add namespace

* update cmake

* use .cu

* update for pad3d

* bug fix

* bug fix

* update

* bug fix

* update enforce

* add test case

* merge the lse pad

* fix kernel_fn of backward

* fix PADDLE_ENFORCE_EQ and phi_api

* fix PADDLE_ENFORCE

* fix PADDLE_ENFORCE

* rerun coverage

* fix memory efficient attention test

* rerun ci

* add cuda version condition

* add cuda version condition

* delete WIP test

* replace PADDLE_ENFORCE

* edit the namespace of datatype in multiple.cc

* rerun

* rerun

---------
Co-authored-by: liuyuang <liuyuang@baidu.com>

e5ad3859

Y
[AMP] Add uint16 dtype check for compare ops (#52016) · 40fea722
Committed by yeliang2258 on Mar 24, 2023
```
* add uint16 dtype check for compare ops

* update doc
```
40fea722
W
do not test dygraph in dygraph (#52027) · 298a1a0b
Committed by wanghuancoder on Mar 24, 2023
```
* xpu do not test dygraph in dygraph
```
298a1a0b
Y

Fix roll kernel gpu bug. (#52012) · b6d0dac9
Committed by Yuang Liu on Mar 24, 2023

b6d0dac9

PaddlePaddle / Paddle · synced successfully about 1 year ago
