Commits · e7c249cb5017cdb39fc3ddb51acfa7afe5235d09 · PaddlePaddle / Paddle

28 Mar, 2023 3 commits
- [CustomDevice] fix reducer (#52115) · e7c249cb
  Committed by ronnywang on Mar 28, 2023
  
  e7c249cb
- [CodeStyle][C405] Unnecessary <list/tuple> literal - rewrite as a set literal (#51972) · 9fa98349
  Committed by Infinity_lee on Mar 28, 2023
  
  9fa98349
- [Hackathon NO.77] Add bitwise operators to Paddle-TRT (#51971) · 864b50c3
  Committed by Young-Flash on Mar 28, 2023
```
* add bitwise_not trt converter

* run pre-commit

* modify neg_one_tensor_dims init way

* fix BOOL type support requires TensorRT 8.4

* fix int8 & uint8 type

* improve data type readability

* modify filter logic

* fix coverage CI
```
  864b50c3
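
The C405 rule named in commit 9fa98349 above comes from flake8-comprehensions: it flags `set()` calls that wrap a list or tuple literal and suggests a set literal instead. The following is a minimal sketch of that kind of rewrite; the variable names and values are illustrative assumptions and are not taken from the Paddle diff.

```python
# Before (flagged by C405): a list or tuple literal passed to set().
dtypes_before = set(["float32", "float64"])
ops_before = set(("add", "mul"))

# After: the equivalent set literals.
dtypes_after = {"float32", "float64"}
ops_after = {"add", "mul"}

# Both spellings build equal sets; only the construction differs.
assert dtypes_before == dtypes_after
assert ops_before == ops_after
```
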
27 Mar, 2023 12 commits

[PHI]Support register functor kernel into PHI (#51914) · bcea3b89
Committed by YuanRisheng on Mar 27, 2023
```
* perfect structure kernel registry

* fix ci bugs
```
bcea3b89
[NewExe]Adjust ExecutorCache Capacity from 4 into 10 (#52104) · 897fb6ab
Committed by Aurelius84 on Mar 27, 2023

897fb6ab
[Zero-Dim] add FLAGS_set_to_1d, control whether to hack process to 1D, add ut for xpu (#51899) · 134c9c0c
Committed by zhouweiwei2014 on Mar 27, 2023

134c9c0c

Add fuse_ops.yaml and fused_backward.yaml (#52010) · 10145cb6

Committed by HappyHeavyRain on Mar 27, 2023

* add fused_yaml fused_backward

* fix eager_funciton bug

* add some comment of fused yaml file

* add 'support_dygraph_mode' configuration in fused yaml

* delete some 'fused_api.h' in include file

* add fused flag in api_gen

10145cb6

elementwise: onednn: support zero dimension inputs (#51656) · 2c1d494e
Committed by Xinyu Chen on Mar 27, 2023

2c1d494e

[CustomOP Inplace] Automap inplace dtype and shape, support vector<Tensor> output (#52114) · 04025237

Committed by HongyuJia on Mar 27, 2023

* [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output

* delete dtype,shape func of multi_inplace op

* [CustomOP Inplace] Automap inplace dtype and shape, support vector<Tensor> output

04025237

Automatically generate 'assign' operator (#51940) · 888a30c9

Committed by HappyHeavyRain on Mar 27, 2023

* support assign op

* support assign infer_var_type

* change code according to review

* change code according to review

* only save 'get_infer_var_type_func'

* rest file mode

888a30c9

fix scope reuse problem (#52119) · 97fc2a0f
Committed by Leo Chen on Mar 27, 2023

97fc2a0f
Revert "fix softmaxce null point in shape test (#51850)" (#52086) · d92c6477
Committed by wanghuancoder on Mar 27, 2023
```
This reverts commit 9c238d2b.
```
d92c6477
add custom device mixed precision inference api (#50884) · a6449634
Committed by engineer1109 on Mar 27, 2023
```
fix bug

remove useless

fix bug

add pybind

remove log

fix style

fix style

change api
```
a6449634
fix_gcc12_error (#52083) · f7267412
Committed by risemeup1 on Mar 27, 2023
```
* fix_gcc12_error

* fix gcc12 error

* fix gcc12 error
```
f7267412

Fused elementwise_(mul/div) (#50428) · 968f7f24

Committed by Sławomir Siwek on Mar 27, 2023

* extract Op and OPMaker to .h

* extend pattern for fused_op

* set "with_residual" default to false

* adjust fuse passes

* remove fc+eltwise flag

* fused_output_scale

* activation attrs

* remove extra attrs

* fix int8/bf16 unit tests

* simplify RecomputeOutputDims

* remove unused method

* Add description for attributes

* add extra check

* adjust op compats

* update quantize test

* fix protobuf parsing error

* fix int8 performance

* fused elementwises

* merge develop

* remove activation

* restore activation for existing add/sub ops

968f7f24

25 Mar, 2023 1 commit
- [CodeStyle][UP027] Replace unpacked list comprehension with a generator expression (#52025) · 3dbc0e46
  Committed by Infinity_lee on Mar 25, 2023
```
* codestyle up027

* add to pyproject.toml
```
  3dbc0e46
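
UP027 in the commit above appears to be the pyupgrade-derived rule as numbered by Ruff, which matches the commit's note about `pyproject.toml`. It targets list comprehensions that are built only to be unpacked immediately. The following is a minimal sketch of the rewrite, using illustrative names that are assumptions and do not come from the Paddle diff.

```python
shape = (3, 4)

# Before (flagged by UP027): the list exists only to be unpacked.
rows_before, cols_before = [dim * 2 for dim in shape]

# After: a generator expression yields the same values
# without materializing a temporary list.
rows_after, cols_after = (dim * 2 for dim in shape)

assert (rows_before, cols_before) == (rows_after, cols_after)
```
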
24 Mar, 2023 4 commits

add phi operator allreduce/reduce (#51857) · 47f87ad3

Committed by TaoTao Li on Mar 24, 2023

* add all_reduce, reduce kernel and api

* fix all_reduce reduce ut

fix reduce op maker conflict

fix merge conflicts

* fix conflicts, rename ReduceOp->ReduceBaseOp in reduce_ops

rename allreduce op, to remove

* fix code format

fix comments

* modify test_collective_reduce_api ut timeout

* fix PR-CI-Build

fix comments: format phi operator

47f87ad3

[PHI Decoupling]Remove memory header (Part3) (#51288) · 3d78e759

Committed by YuanRisheng on Mar 24, 2023

* decouple memory copy

* fix ci bugs

* fix ci compile bugs

* fix rocm compile

* fix ci bugs

* decouple memory

* deal with conflict

* fix xpu compile bugs

* fix xpu bugs

* deal with xpu bugs

* fix cmake bugs

* fix windows bugs

* fix ci bugs

* fix ci bugs

* delete redundance code

* add code for pybind

* fix py3 bugs

* fix ci bugs

3d78e759

[CUSTOM DEVICE]analysis predictor custom device support (#52015) · 3ab19ab4
Committed by YuhangLi on Mar 24, 2023
```
* [CUSTOM DEVICE]analysis predictor custom device support

* del debug log
```
3ab19ab4
remove py::array::forcecast flag (#52039) · 5d503ec9
Committed by Yuanle Liu on Mar 24, 2023

5d503ec9

23 Mar, 2023 15 commits
- [CustomOP Optional] CustomOP supports optional vector<Tensor> input (#51973) · 6a10e604
  Committed by HongyuJia on Mar 23, 2023
  
  6a10e604
- add paddle-trt convert op: greater_equal (#52000) · 4dfbdb04
  Committed by Wangzheee on Mar 23, 2023
  
  4dfbdb04
- 【prim】delete high order prim flag && add special prune rules for node.cc (#51676) · 978d544b
  Committed by xiaoguoguo626807 on Mar 23, 2023
```
* delete prim flag for matmul_2_grad

* delete prim flag for matmul_2_grad

* add new setgradoutmeta for matmul_double_grad_node

* modify test and delete log

* deal with review
```
  978d544b
- add output defs for clip_by_norm kernel (#51993) · 33897a95
  Committed by iSerendipity on Mar 23, 2023
  
  33897a95
- [XPU] support lod_reset (#51967) · c491b361
  Committed by ZhouMengLei1999 on Mar 23, 2023
  
  c491b361
- Remove fluid deps in fused_linear_param_grad_add_kernel.cu (#51975) · 5da1a27b
  Committed by sneaxiy on Mar 23, 2023
```
* remove fluid deps in fused_linear_param_grad_add_kernel

* fix compile error

* fix ut error

* follow comments
```
  5da1a27b
- register fluid kerenls to phi (#51976) · cc9bbd5b
  Committed by Huang Jiyi on Mar 23, 2023
```
* unify add_position_encoding

* unify affine_channel

* unify alloc_float_status

* unify allreduce

* unify alltoall

* unify anchor_generator

* unify ascend_trigger

* fix bug

* fix test
```
  cc9bbd5b
- register fluid activation kernel to phi (#51927) · aaa14780
  Committed by Huang Jiyi on Mar 23, 2023
```
* update

* update

* update

* update

* update

* fix test
```
  aaa14780
- [prim] add gelu vjp rule · 2add31f4
  Committed by cxxly on Mar 06, 2023
  
  2add31f4
- To support py3.11, pybind need to upgrade to v2.10.0 (#51350) · 13b8b5e0
  Committed by zqw_1997 on Mar 23, 2023
```
* to support cuda12, pybind need to upgrade to v2.10.0

* add DEPS of pybind in test_custom_plugin_creater.cc

* only change the tag

* please let CI pass

* try pybind v2.10/3

* modify the include header in test

* code check
```
  13b8b5e0
- support auto generate for nms (#51891) · 4bf1c163
  Committed by Infinity_lee on Mar 23, 2023
  
  4bf1c163
- [Bug fixes] fix distributed graph engine (#51956) · 9c853d1d
  Committed by Huang Zhengjie on Mar 23, 2023
```
* fix distributed graph engine
```
  9c853d1d
- [PHI] Add nanmedian output defs (#51358) · a82911a5
  Committed by PuQing on Mar 23, 2023
```
* add nanmedian output defs

* remove the multiclass_nms3 momentum
```
  a82911a5
- [CodeStyle][C408][C409][C410] Fix unnecessary <dict/list/tuple> call and... · cf391b81
  Committed by PuQing on Mar 23, 2023
```
[CodeStyle][C408][C409][C410] Fix unnecessary <dict/list/tuple> call and unnecessary <list/tuple> passed to <list/tupule>() (#51928)

* autofix

* add select config

* autofix C410

* add C410 select
```
  cf391b81
- 【Eager】Fix error raise (#51963) · 3704471d
  Committed by Jiabin Yang on Mar 23, 2023
```
* allow return none when stop_gradient=True

* remove useless code

* refine code

* refine code

* fix test cast

* change more test

* add more tests

* fix error msg in pylayer
```
  3704471d
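
The C408, C409 and C410 codes in commit cf391b81 above are flake8-comprehensions rules about constructor calls that a plain literal can replace. The following is a minimal sketch with one before/after pair per rule; the names and values are illustrative assumptions, not code from the Paddle diff.

```python
# C408: dict()/list()/tuple() called where a literal would do.
attrs_before = dict(axis=0, keepdim=True)
attrs_after = {"axis": 0, "keepdim": True}

# C409: a list literal passed to tuple().
dims_before = tuple([1, 2, 3])
dims_after = (1, 2, 3)

# C410: a tuple literal passed to list().
names_before = list(("x", "y"))
names_after = ["x", "y"]

# Each rewrite produces an equal value of the same type.
assert attrs_before == attrs_after
assert dims_before == dims_after
assert names_before == names_after
```
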
22 Mar, 2023 5 commits

Support optimizers operator to be generated (#51767) · 0b008e0c

Committed by HappyHeavyRain on Mar 22, 2023

* test_get_kernel

* add invoke signature

* change reduce_max

* change frobenius_norm

* reset reduce_max according to composite and change reduce_all

* fix the bug when Scalar(*)

* fix 'scalar when support_tensor'

* change code according to review

* change 'keep_signature' to 'manual_signature' and add some erro info

* support optimizers autogen

* change sgd yaml

* change generate signature

* fix test/cpp/new_executor/CM

* reset signature generated function

* change signature funciton

* change signature funciton

0b008e0c

[Zero-Dim] Support 0-D tensor for some oneDNN unary kernels (#51687) · 2a3d75bc

Committed by YangQun on Mar 22, 2023

* support 0-d tensor for element wise unary ops

* fix python code style check

* fix approval check

* support 0-d tensor for onednn softmax and logsoftmax kernels

* fix commnets

* fix some unittests

2a3d75bc

Correct lstm qat test (#51499) · 31f81685
Committed by joanna.wozna.intel on Mar 22, 2023

31f81685

Add fused_feed_forward pass (#50423) · 5dda0ef6

Committed by Ghost Screaming on Mar 22, 2023

* Add fused_feed_forward pass for semi-automatic static graph training.

* Add fused_feedforward property in parallel_executor.cc

* Polish code.

* Polish fused feed_forward pass code. Support use_dropout1 and
use_dropout2 option.

* Support model parallel in fused_feedforward pass.

5dda0ef6

Extract fused_transpose op dedicated for oneDNN fuse passes (#50021) · 02296977

Committed by Sławomir Siwek on Mar 22, 2023

* extract common methods to reuse

* add header for transpose ops

* fused_transpose

* Split big function

* transpose2 tests

* fused_transpose

* Apply extra attributes

* add pbtxt file

* update pbtxt

* Merge develop

* add more strict op compats

* code  style

* remove mkldnn_data_type

* unify SetOutMemDescWithReshape2FuseSupport

* adjust quantize-dequantize for transpose

* remove appendact

* transpose2 quantization

* fix int8 tests

* adjust transpose_op to current develop

* delete fusion code from transpose_kernel

* add fused transpose to NHWC unittest

* change order

02296977

PaddlePaddle / Paddle synced successfully about 1 year ago