Commits · 2c1d494edb3324d1a8a2c7ac2163653cfbd7bd7d · PaddlePaddle / Paddle

27 Mar, 2023 13 commits

elementwise: onednn: support zero dimension inputs (#51656) · 2c1d494e
Committed by Xinyu Chen on Mar 27, 2023

[CustomOP Inplace] Automap inplace dtype and shape, support vector<Tensor> output (#52114) · 04025237

Committed by HongyuJia on Mar 27, 2023

* [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output

* delete dtype,shape func of multi_inplace op

* [CustomOP Inplace] Automap inplace dtype and shape, support vector<Tensor> output

Automatically generate 'assign' operator (#51940) · 888a30c9

Committed by HappyHeavyRain on Mar 27, 2023

* support assign op

* support assign infer_var_type

* change code according to review

* change code according to review

* only save 'get_infer_var_type_func'

* rest file mode

fix scope reuse problem (#52119) · 97fc2a0f
Committed by Leo Chen on Mar 27, 2023

Revert "fix softmaxce null point in shape test (#51850)" (#52086) · d92c6477
Committed by wanghuancoder on Mar 27, 2023
```
This reverts commit 9c238d2b.
```

unbind support bool dtype (#52080) · 553630aa
Committed by Leo Chen on Mar 27, 2023
```
* unbind support bool dtype

* replace np.array_equal
```

add custom device mixed precision inference api (#50884) · a6449634
Committed by engineer1109 on Mar 27, 2023
```
fix bug

remove useless

fix bug

add pybind

remove log

fix style

fix style

change api
```

Add data type of int, int64 for add kernel. Modify the code style of (#50443) · 62bff0e0
Committed by Leo Guo on Mar 27, 2023
```
instance_norm_grad kernel. Fix bugs that the data type of input is different from output in reduce_sum kernel. test=kunlun
```

fix_gcc12_error (#52083) · f7267412
Committed by risemeup1 on Mar 27, 2023
```
* fix_gcc12_error

* fix gcc12 error

* fix gcc12 error
```

fix_gcc12_error (#52007) · b2bd74f7

Committed by risemeup1 on Mar 27, 2023

* fix_gcc12_error

* patch on eigen3 for fixing gcc12 error

* Update multiary.cc

Fused elementwise_(mul/div) (#50428) · 968f7f24

Committed by Sławomir Siwek on Mar 27, 2023

* extract Op and OPMaker to .h

* extend pattern for fused_op

* set "with_residual" default to false

* adjust fuse passes

* remove fc+eltwise flag

* fused_output_scale

* activation attrs

* remove extra attrs

* fix int8/bf16 unit tests

* simplify RecomputeOutputDims

* remove unused method

* Add description for attributes

* add extra check

* adjust op compats

* update quantize test

* fix protobuf parsing error

* fix int8 performance

* fused elementwises

* merge develop

* remove activation

* restore activation for existing add/sub ops

[XPU] layer_norm support fp16 input of scale and bias. (#52091) · 14abafa1
Committed by houj04 on Mar 27, 2023

Fix memory efficient attention bug (#52117) · 019e1cf5

Committed by sneaxiy on Mar 27, 2023

* fix mea compile error

* support 2-D bias

* add inline to avoid compile error

* polish codes

26 Mar, 2023 2 commits
- [Move Test] Sorted test/CMakeLists.txt (#52152) · a5b88cba
  Committed by Zheng-Bicheng on Mar 26, 2023
- Add cmake (#52131) · 3676a443
  Committed by Zheng-Bicheng on Mar 26, 2023
25 Mar, 2023 10 commits
- [CodeStyle] update ruff config for pylint rules (#52144) · c88ed587
  Committed by Nyakku Shigure on Mar 25, 2023
```
* [CodeStyle] update ruff config for pylint rules

* empty commit; test=document_fix
```
- [CodeStyle][PLR0402] import a.b to from a import b (#52125) · 8c17fc0b
  Committed by 张春乔 on Mar 25, 2023
- [Test Mv] quantization (#51942) · 0c5a4bac
  Committed by Zheng-Bicheng on Mar 25, 2023
- [Fix Bug] fix get_new_shape and get_new_data_from_tensor not support fallback... · db5204ec
  Committed by Ruibin Cheung on Mar 25, 2023
```
[Fix Bug] fix get_new_shape and get_new_data_from_tensor not support fallback to CPU on custom device (#52002)
```
- [CodeStyle][C411] replace list() with [] (#52057) · b811043a
  Committed by 张春乔 on Mar 25, 2023
- [CodeStyle][B016] Clear raise (#52118) · bf1771d0
  Committed by gouzil on Mar 25, 2023
- [CodeStyle][UP027] Replace unpacked list comprehension with a generator expression (#52025) · 3dbc0e46
  Committed by Infinity_lee on Mar 25, 2023
```
* codestyle up027

* add to pyproject.toml
```
- [CodeStyle][UP028] using yield from (#52059) · 85e20755
  Committed by 张春乔 on Mar 25, 2023
- [CodeStyle][UP009] mv unnecessary utf8 declaration (#52050) · 33b289d7
  Committed by 张春乔 on Mar 25, 2023
- [Test Mv] remove mlu (#52064) · e5414f76
  Committed by jjyaoao on Mar 25, 2023
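
Several of the [CodeStyle] commits above are named after ruff rule codes. As a rough illustration only (none of this is taken from the Paddle diffs, and every function name is hypothetical), the rewrites described by the PLR0402, UP027 and UP028 titles might look like the following, with each "before" and "after" pair behaving identically:

```python
# Illustrative sketch; not code from the Paddle repository.
# All function names below are hypothetical.

# PLR0402 ("import a.b to from a import b"): a form such as
# `import os.path as path` is written as a from-import instead.
from os import path


def flatten_before(chunks):
    # Style before UP028: re-yield each inner item by hand.
    for chunk in chunks:
        for item in chunk:
            yield item


def flatten_after(chunks):
    # Style after UP028: delegate to the inner iterable with `yield from`.
    for chunk in chunks:
        yield from chunk


def split_before(name):
    # Style before UP027: unpack from a list comprehension.
    stem, ext = [part for part in path.splitext(name)]
    return stem, ext


def split_after(name):
    # Style after UP027: unpack from a generator expression instead.
    stem, ext = (part for part in path.splitext(name))
    return stem, ext
```

These are behavior-preserving style changes, which is presumably why they could be applied across the codebase one rule per commit.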
24 Mar, 2023 15 commits

[Test Mv] python/paddle/fluid/tests/custom_kernel/*.py to test/custom_kernel (#51946) · 24740ccd
Committed by Zheng-Bicheng on Mar 24, 2023

add phi operator allreduce/reduce (#51857) · 47f87ad3

Committed by TaoTao Li on Mar 24, 2023

* add all_reduce, reduce kernel and api

* fix all_reduce reduce ut

fix reduce op maker conflict

fix merge conflicts

* fix conflicts, rename ReduceOp->ReduceBaseOp in reduce_ops

rename allreduce op, to remove

* fix code format

fix comments

* modify test_collective_reduce_api ut timeout

* fix PR-CI-Build

fix comments: format phi operator

Del old dygraph optest5 (#51686) · 6261076c
Committed by wanghuancoder on Mar 24, 2023
```
* delete old dygraph op test
```

[PHI Decoupling]Remove memory header (Part3) (#51288) · 3d78e759

Committed by YuanRisheng on Mar 24, 2023

* decouple memory copy

* fix ci bugs

* fix ci compile bugs

* fix rocm compile

* fix ci bugs

* decouple memory

* deal with conflict

* fix xpu compile bugs

* fix xpu bugs

* deal with xpu bugs

* fix cmake bugs

* fix windows bugs

* fix ci bugs

* fix ci bugs

* delete redundance code

* add code for pybind

* fix py3 bugs

* fix ci bugs

[CUSTOM DEVICE]analysis predictor custom device support (#52015) · 3ab19ab4
Committed by YuhangLi on Mar 24, 2023
```
* [CUSTOM DEVICE]analysis predictor custom device support

* del debug log
```

remove py::array::forcecast flag (#52039) · 5d503ec9
Committed by Yuanle Liu on Mar 24, 2023

Del old dygraph MLU NPU (#51958) · 611f7ccc
Committed by wanghuancoder on Mar 24, 2023
```
* delete old dygraph, mlu npu do not use dygraph
```

[PHI]fix momentum dtype infer (#51353) · 648ec795
Committed by PuQing on Mar 24, 2023
```
* fix momentum dtype infer

* fix momentum datatype

* fix on cpu

* add momentum
```

[PaddlePaddle Hackathon 4 No.40] Optimize the GPU compute performance of the kthvalue op for Paddle (#51835) · e18f5339
Committed by thunder95 on Mar 24, 2023
```
* untracked files

* kthvalue perf

* remove unused files

* fix isnan

* fix isnan2

* fix bug

* try to fix rocm error
```

Fix ninja error (#49499) · 7415b101
Committed by risemeup1 on Mar 24, 2023
```
* fix ninja error

* fix_lite_ninja_error
```

[Test Mv] python/paddle/fluid/tests to test/legacy_test (#51944) · ba7c62f8
Committed by Zheng-Bicheng on Mar 24, 2023
```
update
```

[Test Mv] python/paddle/fluid/tests/book to test/book (#51945) · 0da62ab7
Committed by Zheng-Bicheng on Mar 24, 2023

Memory Efficient Attention (#51867) · e5ad3859

Committed by ZhangDY-6483 on Mar 24, 2023

* first version, notest

* return final rst, notest

* use infinity() instead of max

* ut structure

* start up of ut

* generate lse

* update

* add depense

* reconstruct cmake

* move file

* add memory efficient attention and fix blasimpl

* update

* update cmake

* add namespace

* update cmake

* use .cu

* update for pad3d

* bug fix

* bug fix

* update

* bug fix

* update enforce

* add test case

* merge the lse pad

* fix kernel_fn of backward

* fix PADDLE_ENFORCE_EQ and phi_api

* fix PADDLE_ENFORCE

* fix PADDLE_ENFORCE

* rerun coverage

* fix memory efficient attention test

* rerun ci

* add cuda version condition

* add cuda version condition

* delete WIP test

* replace PADDLE_ENFORCE

* edit the namespace of datatype in multiple.cc

* rerun

* rerun

---------
Co-authored-by: Nliuyuang <liuyuang@baidu.com>

[AMP] Add uint16 dtype check for compare ops (#52016) · 40fea722
Committed by yeliang2258 on Mar 24, 2023
```
* add uint16 dtype check for compare ops

* update doc
```

remove copy of index for gather_nd_grad and scatter_nd_add op in xpu (#51871) · b110085f
Committed by zhangyikun02 on Mar 24, 2023

PaddlePaddle / Paddle · synced successfully more than 1 year ago