Commits · 866c28773d2a5c1a85b85f3f34e0d005b41c8970 · PaddlePaddle / Paddle

28 Mar, 2023 12 commits
- [AMP OP&Test] Add float16 OpTest for squeeze, unsqueeze (#52018) · 866c2877
  Committed by Wang Xinyu on Mar 28, 2023
```
* add squeeze, unsqueeze, transpose fp16 unitest

* Update test_transpose_op.py
```
- [Auto Parallel] Add o1 level tune (#52041) · d6011cb6
  Committed by caozhou on Mar 28, 2023
```
* add tune o1 level

* add unittest
```
- [AMP OP&Test]shape op fp/bf16 support (#52184) · 418b983c
  Committed by YuhangLi on Mar 28, 2023
- [Test Mv] move collective/multinode to test dir (#51982) · ceca55c5
  Committed by Ainavo on Mar 28, 2023
```
* [Test Mv] move collective/multinode to test dir

* add CMakeList.txt to test/collective

* add bash_test_modules

* adjust the order

* recover bash_test_modules

* add_subdirectory(collective)

* resolve conflicts

* resolve conflicts
```
- [AMP] add fp16&bf16 support for flatten op (#52035) · a33a4d01
  Committed by jiangcheng on Mar 28, 2023
```
* [AMP] add fp16&bf16 support for flatten op

* fix ci bug

* fix inpute should astype self.dtype bug and fix zerodim test name

* remove 0D-tensor bf16 test for window-inference-ci pass

* remove flatten from op_accuracy_white_list
```
- [Move Test] move rpc (#52166) · a34abdb5
  Committed by Zheng-Bicheng on Mar 28, 2023
```
* update

* update
```
- [CodeStyle][B015] replace pointless comparisons with appropriate statements (#52126) · 3957007c
  Committed by gouzil on Mar 28, 2023
```
* [CodeStyle][B015] delete unused

* [CodeStyle][B015] add assert
```
- [CodeStyle][PLC0414] remove self-alias and some discussion (#52122) · 888b8b6b
  Committed by 张春乔 on Mar 28, 2023
- [CodeStyle][C405] Unnecessary <list/tuple> literal - rewrite as a set literal (#51972) · 9fa98349
  Committed by Infinity_lee on Mar 28, 2023
- [Test Mv] python/paddle/fluid/tests/custom_op/*.py to test/custom_op (#51948) · 7aa7fc49
  Committed by Zheng-Bicheng on Mar 28, 2023
- 【Prim】Optimize composite rule by making scalar shape as 1 (#51960) · 45acb717
  Committed by Jiabin Yang on Mar 28, 2023
```
* optimize composite rule by making scalar shape as []1

* fix shape usage for 0D

* fix rules

* fix 0D error

* fix flatten 0D error

* fix bn eval mode

* fix bn test

* fix flatten
```
- [Hackathon NO.77] Add bitwise operator for Paddle-TRT (#51971) · 864b50c3
  Committed by Young-Flash on Mar 28, 2023
```
* add bitwise_not trt converter

* run pre-commit

* modify neg_one_tensor_dims init way

* fix BOOL type support requires TensorRT 8.4

* fix int8 & uint8 type

* improve data type readability

* modify filter logic

* fix coverage CI
```
27 Mar, 2023 9 commits

add prim test for some ops (#51749) · e1674e8b

Committed by Charles-hit on Mar 27, 2023

* add tanh and cast prim test

* fix tanh test

* fix 0-d test

* add sqrt fp16 prim test

* add public_python_api in prim test

* fix test_squeeze2_op

* add tanh prim test

* add dropout prim test

* [Dy2St]Fix clone for test state problem

* clean code

* modify test_cumsum_op

* modify test_cumsum_op

* fix dropout test

* add dropout in cmake

* fix dropout test

---------
Co-authored-by: Aurelius84 <zhangliujie@baidu.com>

fix eval branch of composite rule of batch_norm (#52154) · 20befdef
Committed by cyber-pioneer on Mar 27, 2023

[Zero-Dim] add FLAGS_set_to_1d, control whether to hack process to 1D, add ut for xpu (#51899) · 134c9c0c
Committed by zhouweiwei2014 on Mar 27, 2023

elementwise: onednn: support zero dimension inputs (#51656) · 2c1d494e
Committed by Xinyu Chen on Mar 27, 2023

[CustomOP Inplace] Automap inplace dtype and shape, support vector<Tensor> output (#52114) · 04025237

Committed by HongyuJia on Mar 27, 2023

* [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output

* delete dtype,shape func of multi_inplace op

* [CustomOP Inplace] Automap inplace dtype and shape, support vector<Tensor> output

unbind support bool dtype (#52080) · 553630aa
Committed by Leo Chen on Mar 27, 2023
```
* unbind support bool dtype

* replace np.array_equal
```
Add data type of int, int64 for add kernel. Modify the code style of (#50443) · 62bff0e0
Committed by Leo Guo on Mar 27, 2023
```
instance_norm_grad kernel. Fix bugs that the data type of input is different from output in reduce_sum kernel. test=kunlun
```

Fused elementwise_(mul/div) (#50428) · 968f7f24

Committed by Sławomir Siwek on Mar 27, 2023

* extract Op and OPMaker to .h

* extend pattern for fused_op

* set "with_residual" default to false

* adjust fuse passes

* remove fc+eltwise flag

* fused_output_scale

* activation attrs

* remove extra attrs

* fix int8/bf16 unit tests

* simplify RecomputeOutputDims

* remove unused method

* Add description for attributes

* add extra check

* adjust op compats

* update quantize test

* fix protobuf parsing error

* fix int8 performance

* fused elementwises

* merge develop

* remove activation

* restore activation for existing add/sub ops

[XPU] layer_norm support fp16 input of scale and bias. (#52091) · 14abafa1
Committed by houj04 on Mar 27, 2023

25 Mar, 2023 5 commits
- [CodeStyle][PLR0402] import a.b to from a import b (#52125) · 8c17fc0b
  Committed by 张春乔 on Mar 25, 2023
- [CodeStyle][UP027] Replace unpacked list comprehension with a generator expression (#52025) · 3dbc0e46
  Committed by Infinity_lee on Mar 25, 2023
```
* codestyle up027

* add to pyproject.toml
```
- [CodeStyle][UP028] using yield from (#52059) · 85e20755
  Committed by 张春乔 on Mar 25, 2023
- [CodeStyle][UP009] mv unnecessary utf8 declaration (#52050) · 33b289d7
  Committed by 张春乔 on Mar 25, 2023
- [Test Mv] remove mlu (#52064) · e5414f76
  Committed by jjyaoao on Mar 25, 2023
24 Mar, 2023 9 commits

[Test Mv] python/paddle/fluid/tests/custom_kernel/*.py to test/custom_kernel (#51946) · 24740ccd
Committed by Zheng-Bicheng on Mar 24, 2023

add phi operator allreduce/reduce (#51857) · 47f87ad3

Committed by TaoTao Li on Mar 24, 2023

* add all_reduce, reduce kernel and api

* fix all_reduce reduce ut

fix reduce op maker conflict

fix merge conflicts

* fix conflicts, rename ReduceOp->ReduceBaseOp in reduce_ops

rename allreduce op, to remove

* fix code format

fix comments

* modify test_collective_reduce_api ut timeout

* fix PR-CI-Build

fix comments: format phi operator

Del old dygraph optest5 (#51686) · 6261076c
Committed by wanghuancoder on Mar 24, 2023
```
* delete old dygraph op test
```
Del old dygraph MLU NPU (#51958) · 611f7ccc
Committed by wanghuancoder on Mar 24, 2023
```
* delete old dygraph, mlu npu do not use dygraph
```
[Test Mv] python/paddle/fluid/tests to test/legacy_test (#51944) · ba7c62f8
Committed by Zheng-Bicheng on Mar 24, 2023
```
update
```
[Test Mv] python/paddle/fluid/tests/book to test/book (#51945) · 0da62ab7
Committed by Zheng-Bicheng on Mar 24, 2023

Memory Efficient Attention (#51867) · e5ad3859

Committed by ZhangDY-6483 on Mar 24, 2023

* first version, notest

* return final rst, notest

* use infinity() instead of max

* ut structure

* start up of ut

* generate lse

* update

* add depense

* reconstruct cmake

* move file

* add memory efficient attention and fix blasimpl

* update

* update cmake

* add namespace

* update cmake

* use .cu

* update for pad3d

* bug fix

* bug fix

* update

* bug fix

* update enforce

* add test case

* merge the lse pad

* fix kernel_fn of backward

* fix PADDLE_ENFORCE_EQ and phi_api

* fix PADDLE_ENFORCE

* fix PADDLE_ENFORCE

* rerun coverage

* fix memory efficient attention test

* rerun ci

* add cuda version condition

* add cuda version condition

* delete WIP test

* replace PADDLE_ENFORCE

* edit the namespace of datatype in multiple.cc

* rerun

* rerun

---------
Co-authored-by: liuyuang <liuyuang@baidu.com>

do not test dygraph in dygraph (#52027) · 298a1a0b
Committed by wanghuancoder on Mar 24, 2023
```
* xpu do not test dygraph in dygraph
```
Fix roll kernel gpu bug. (#52012) · b6d0dac9
Committed by Yuang Liu on Mar 24, 2023

23 Mar, 2023 5 commits
- [CustomOP Optional] CustomOP supports optional vector<Tensor> input (#51973) · 6a10e604
  Committed by HongyuJia on Mar 23, 2023
- add paddle-trt convert op: greater_equal (#52000) · 4dfbdb04
  Committed by Wangzheee on Mar 23, 2023
- 【prim】delete high order prim flag && add special prune rules for node.cc (#51676) · 978d544b
  Committed by xiaoguoguo626807 on Mar 23, 2023
```
* delete prim flag for matmul_2_grad

* delete prim flag for matmul_2_grad

* add new setgradoutmeta for matmul_double_grad_node

* modify test and delete log

* deal with review
```
- [Prim] add meshgrid composite rule (#51061) · 53bb883d
  Committed by chenjian on Mar 23, 2023
```
* add meshgrid composite rule

* add meshgrid composite rule

* update

* add into CMakeLists

* fix

* update

* update

* optimize code

* fix meshgrid op

* update test
```
- delete old dygraph xpu op test (#51955) · f8a8dd5e
  Committed by wanghuancoder on Mar 23, 2023
```
* delete old dygraph xpu op test
```

PaddlePaddle / Paddle synced successfully about 1 year ago