提交 · 0a2dfa380332ea38e0df3076cbcc1cac3417202f · PaddlePaddle / Paddle

06 12月, 2022 8 次提交

Clear extra input (Bias, ResidualData) in OpMaker of conv2d (#47579) · 0a2dfa38

由 zyfncg 提交于 12月 06, 2022

* delete Bias and ResidualData in OpMaker of conv2d

* delete extra input of conv3d

* refactor pass of conv_bias_fusion

* fix mkldnn dependency

* fix mkldnn compile

* fix test_conv_bias_mkldnn_fuse_pass

* police some code

* remove useless log

* fix analyzer_vit_ocr_tester

* fix conv_activation_mkldnn_fuse_pass

* fix test_analyzer_ocr

* add fused_conv_sig

* fix performence regression

* fix performance regression

0a2dfa38

Q
add xpu_support op function (#48606) · 06b32b38
由 QingshuChen 提交于 12月 06, 2022
```
*test=kunlun
```
06b32b38
S
[PHI] Migrate elementwise_(add/mul) kernels (#48625) · 7575d37c
由 Sławomir Siwek 提交于 12月 06, 2022
```
* remove fluid code

* init

* typo

* fix merge conflicts
```
7575d37c
H

[XPU] add tile_grad op (#48720) · 8de336f9
由 houj04 提交于 12月 06, 2022

8de336f9

Remove fluid matmul (#47988) · 8fb829ba

由 kangguangli 提交于 12月 06, 2022

* remove layers.matmul in nets.py

* remove layers.matmul in rnn_impl/test_quantization_pass/auto_parallel_gpt_model/test_auto_parallel_completion_gpt

* remove layers.matmul in other files

* fix

* fix

* remove layers.matmul itself

* remove ref in CMakeLists.txt and tools directory

* remove matmul in fluid.layers.nn.py

* remove matmul in fluid.dygraph.rnn.py && resotre test_matmul_op.py

* replace matmul in fluid.dygraph.rnn.py && clean api_test in test_matmul_op.py

* fix error && restore empty test_auto_search_dist_matmul_op.py

* fix check in test_auto_parallel_partitioner.py

* fix test_dist_matmul && test_flags_mkldnn_ops_on_off

* fix test_fused_attention_op_xpu.py && test_matmul_op_xpu.py

* remove test_auto_search_dist_matmul_op.py

* remove layers.matmul in auto_parallel_gpt_model.py && fix doc in fluid/io.py

* fix for matmul_grad

* fix codestyle

* fix codestyle

* resolve conflicts error

* restore unit test file but not compiled it for later remove

* fix codestyle

* fix wrong unittest skip

* fix unittest delete

* fix scale cost

* fix scale cost

* resolve conflicts error

* resolve conflicts error
Co-authored-by: Njakpiase <jakpia21@gmail.com>

8fb829ba

Z
[inference][trt] add reduce max for trt (#48684) · dd304f31
由 Zhang Jun 提交于 12月 06, 2022
```
* add reduce max for trt
```
dd304f31
Y

[Paddle Inference] Add float_to_half_pass to support inference with mixed precision (#47993) · c5a45cc6
由 Yuanle Liu 提交于 12月 06, 2022

c5a45cc6

add xpu centered rmsprop (#48658) · 54b756e2

由 ykkk2333 提交于 12月 06, 2022

* add stat tool

* add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun

* add xpu rmsprop centered, test=kunlun

54b756e2

05 12月, 2022 20 次提交

Transpose optimization for AlphaFold2 (#45230) · a0f43889

由 limingshu 提交于 12月 05, 2022

* first commit

* fix bugs according to ci

* add some changes

* change file name into function.cu.h

* remove const_cast

a0f43889

Z

support nhwc in conv2d_fusion (#48642) · 30f4ef7f
由 zhoutianzi666 提交于 12月 05, 2022

30f4ef7f
R

[0D Tensor]support 0d tensor for dist.scatter and dist.broadcast (#48638) · 22ec915c
由 Roc 提交于 12月 05, 2022

22ec915c
Y

fix onednn bugs (#48714) · 35ebf2b4
由 YuanRisheng 提交于 12月 05, 2022

35ebf2b4

Reverse roll fuse (#46914) · feb68dd1

由 Wang Bojun 提交于 12月 05, 2022

* pass

* pass

* draft version

* share mem opt

* remove sharemem

* add pattern for the case with circle_shift=0

* add UT

* pass opt

* test_fix

* code-commit

* code-style

* code style

* code-style

* ut-fix

* op teller refine

* resolve conflict

* adjust position op_teller list and pass order for swin

* ut code style update

* adjust paddle pass order

* refine pass order

* refine pass order

* refine pass order

feb68dd1

W

fix error when share buffer but modify the dtype (#48666) · 65ffc3f5
由 Wilber 提交于 12月 05, 2022

65ffc3f5
H

move device_memory_aligment from fluid to phi (#48694) · 796499fd
由 huangjiyi 提交于 12月 05, 2022

796499fd
六
fix bug in paddle/phi/api/yaml/generator (#48659) · 595338c6
由六个骨头提交于 12月 05, 2022
```
* fix bug

* fix bugs in api_gen tools
```
595338c6

Replace mutable_data with DeviceContext.Alloc in phi kernels (#48500) · 34a957e3

由 Ruibiao Chen 提交于 12月 05, 2022

* Replace mutable_data with DeviceContext.Alloc in phi kernels

* Fix CI errors

* Fix CI errors

* Fix CI errors, test=kunlun

* Fix CI errors, test=kunlun

* Handle rnn_functor

* Update approvals

34a957e3

S
Register exp/expm1/logit bf16 activation op kernels (#48702) · d1e2ba8a
由 sneaxiy 提交于 12月 05, 2022
```
* register more bf16 ops

* update to register coresponding backward ops
```
d1e2ba8a

Generate static graph code of some ops by yaml (#48698) · 97aa938f

由 HappyHeavyRain 提交于 12月 05, 2022

* generate static graph code of some ops by yaml, test = develop

* generate static graph code of some ops by yaml, test = develop

97aa938f

Setuptools (#48301) · 9913da02

由 risemeup1 提交于 12月 05, 2022

* test

* test

* test

* test

* test

* suport setuptools for paddle

* modify paddle_build.sh

* modify paddle_build.sh

* modify paddle_build.sh

* modify paddle_build.sh

* modify paddle_build.sh

* test

* modify setup.py

* modify build_options

* modify build_options

* modify paddle_build.sh

* modify setup.py

* modify paddle_build.sh

* modify setup.py

* modify setup.py

* modify setup.py

* modify setup.py

* modfiy paddle_build.sh

* debug

* debug

* debug

* dddd

* debug

* debug

* debug

* debug

* debug

* debug

* debug

* debug

* debug

* fix bug that no version.py

* debug

* debug

* debug

* debug

* debug

* debug

* Delete .pre-commit-config.yaml

* debug

* support ninja

* support ninja

* debug

* debug

* debug

* support setuptools for paddle

* modify code style

* debug

* debug

* debug

* debug

* 取消make clean

* 取消make clean

* debug

* debug

* debug

* debug for py3

* debug

* debug

* debug

* 将mkdir_and_copy_file单独封装一个函数

* modify paddle_build.sh

* modify setup.py after zhangbo reviewd

9913da02

H

fix custom operator backward=None (#48656) · 0c1d68e1
由 HongyuJia 提交于 12月 05, 2022

0c1d68e1
H
[Fluid Clean] remove nn.topk, nn.ctc_greedy_decoder, nn.im2sequence,... · 93027d9f
由 heyanru 提交于 12月 05, 2022
```
[Fluid Clean] remove nn.topk, nn.ctc_greedy_decoder, nn.im2sequence, nn.multiplex, nn.smooth_l1 (#48289)
```
93027d9f
N
[PHI decoupling] migrate poly_util.h to phi (#48499) · d6aa0d43
由 Netpunk 提交于 12月 05, 2022
```
* rm poly_util.h

* format code

* fix some problems

* format code
```
d6aa0d43
S

fix bug of reducer in best_fit (#48668) · cee7a3db
由 ShenLiang 提交于 12月 05, 2022

cee7a3db
柠

DenseTensor (#48419) · 6cdaa371
由柠檬味~ 提交于 12月 05, 2022

6cdaa371
X
[Paddle Inference] Support range trt converter and add scalar interface. (#48697) · aee2db01
由 xiaoxiaohehe001 提交于 12月 05, 2022
```
* add_range

* add_range
```
aee2db01
X

release_ (#48383) · 7507956b
由 xiaoxiaohehe001 提交于 12月 05, 2022

7507956b
X
[Paddle Inference] Support fill_any_like bool input. (#48671) · a842c1d0
由 xiaoxiaohehe001 提交于 12月 05, 2022
```
* fill_any_like_bool

* fill_any_like_bool
```
a842c1d0

04 12月, 2022 1 次提交

[Eager] fix set_value logic when input's dtype is different (#48519) · 46371c53

由 Weilong Wu 提交于 12月 04, 2022

* [Eager] fix set_value logic when input's dtype is different

* value_tensor

* fix set_value logic when input's dtype is different

46371c53

03 12月, 2022 2 次提交
- W
  Refactor collective communication static check (#48646) · 4552be48
  由 Wen Sun 提交于 12月 03, 2022
```
* refactor: classify static check

* refactor: rename to static_check & use forward decl

* refactor: switch to unary & binary funcs
```
  4552be48
- Y
  
  Scatter 0D index for gather, 0D index and 0D updates for scatter. (#48452) · f9815bfe
  由 Yuang Liu 提交于 12月 03, 2022
  
  f9815bfe
02 12月, 2022 9 次提交

Y

[Paddle-TRT] Support engine sharing memory of multiple predictors (#47631) · ea5ca555
由 Yuanle Liu 提交于 12月 02, 2022

ea5ca555
P
[PHI] Migrate elementwise_sub kernel (#48611) · 493825a5
由 Piotr Paturej 提交于 12月 02, 2022
```
* Add migrations

* Fix build errors

* Remove elementwise_mul from migration
```
493825a5

Migrate mul_mkldnn_op to phi matmul_kernel (#48299) · e8edbb09

由 Hulek 提交于 12月 02, 2022

* Migrate mul_mkldnn_op to matmul_kernel

* Review fixes - changed mutable_data, changed ctx to dev_ctx, fixed namespaces

* switched some funcs to phi

* Deleted not needed phi:: and changed place checking according to standards

e8edbb09

[XPU ]Fix xpu compile error (#48621) · 2af82190

由 Jiabin Yang 提交于 12月 02, 2022

* [Eager] Fix paddle.grad interface

* [Eager] Support minimum SubGraph for GeneralGrad

* Add needed_nodes to prune grad graph more thoroughly

* [Eager] Add grad_node_trans_mapping_ to record which grad_node has been transformed to AccumulationNode

* [Eager] Fix paddle.grad interface

* Polish code

* remove potential_stop_node

* Add endding_nodes to enhance genSugraph logic

* clear endding_nodes_

* polish code

* rename endding_nodes to endding_nades_

* Refactor grad interface

* Add register_hook case to fix coverage-ci

* Fix code format

* Refactor general_grad

* Add more code comments

* call clear directly to release GradSlotMeta

* fix a mistake

* fix matmul/ multiply kernel logic and optional input in yaml, fill zeros logic and so on.

* fix batch_norm_double_grad yaml optional config

* fix tanh_triple_grad yaml and kernels

* fix MultiplyTripleGradKernel optional logic

* fix merge mistake

* fix compile error

* remove legacy attr for bn

* polish code

* fix some kernel

* merge develop

* fix error

* remote log

* fix kernel with full like

* hide value log behind

* hide value log behind

* fix matmul_triple grad

* fix xpu compile error

* fix xpu compile error

* fix xpu ut

* fix xpu ut

* fix_xpu_compile_error
Co-authored-by: NWeilong Wu <veyron_wu@163.com>

2af82190

Split common funcs from reduction and structure modification (#46970) · ef575d6a

由 Bo Zhang 提交于 12月 02, 2022

* profile reduce kernel for fp16 and reduceHigherdim

* use reinterpret_cast

* fix for CI on ROCm

* add Macro for ROCm

* ROCm CI config

* ROCm CI config

* unit test repair

* pull

* add common_funcs.h

* reduceType

* Update reduce_function.h

* not higher

* rename

ef575d6a

Fix fuse_gemm_epilogue (#47805) · 6efc2888

由 Shijie 提交于 12月 02, 2022

* Fix fuse_gemm_epilogue

* update tests

* Update CMakeLists.txt

* Update CMakeLists.txt

* Update CMakeLists.txt

* fix random seed

* use assert_allclose

* Update test_dist_fuse_gemm_epilogue_pass.py

* Update cpp_pass.py

* Update test_dist_fuse_gemm_epilogue_pass.py

* fix codestyle

* update seed and atol

6efc2888

G

add some compare and logical trt converter (#48592) · 4c38b87e
由 gem5 提交于 12月 02, 2022

4c38b87e
R
fix phi capi kernel registration macro error (#48616) · 0f3b1ad6
由 ronnywang 提交于 12月 02, 2022
```
* fix capi kernel registration macro error

* update
```
0f3b1ad6
W
[Eager, Performance Optimization] modify AllocateFrom to reduce deconstruction... · 708c4f88
由 Weilong Wu 提交于 12月 02, 2022
```
[Eager, Performance Optimization] modify AllocateFrom to reduce deconstruction of shared_ptr (#48548)
```
708c4f88

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功