Commits · 4a09da02441a1b0c2afd83d3cdc83aa57e9040ad · Crayon鑫 / Paddle

02 Apr, 2022 · 1 commit
- filter unsupported inputs for elementwise op in op teller (#41253) · 56f108ff
  Committed by wangxinxin08 on Apr 02, 2022
```
* filter unsupported inputs for elementwise op in op teller

* add unittest for corner case
```
01 Apr, 2022 · 3 commits

reshape_opteller (#41090) · 15d5f6b9
Committed by xiaoxiaohehe001 on Apr 01, 2022
```
fix_reshape: for paddle-trt
```

[Eager] Support pinned (#41035) · f3270fc8

Committed by wanghuancoder on Apr 01, 2022

* support pinned, test=develop

* support async_write, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine,test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

- Enabled fc of oneDNN for bert test (#41235) · 597d7efd
Committed by Jacek Czaja on Apr 01, 2022

31 Mar, 2022 · 4 commits

add multiclass nms3 trt converter (#41181) · 08c3edb3
Committed by wangxinxin08 on Mar 31, 2022
```
* add multiclass_nms3 converter
```

Using DistConfig in Paddle Inference (#41128) · dc0702fe

Committed by TeslaZhao on Mar 31, 2022

* Pass compat of conv_transpose_bias_mkldnn_fuse_pass

* Fix a bug of strided_slice op, about the axes parameter access memory out of bounds

* Fix a bug of strided_slice op, about the axes parameter access memory out of bounds

* Fix a bug of transpose op, about accessing memory out of bounds of the perm param

* op:transpose_op supports bool type

* op:transpose_op supports bool type

* Keep strided_slice op behavior consistent with slice op when starts input is less than -rank

* Using DistConfig in inference

add flatten2,reshape2,squueze2_trt_fuse_pass test cast (#41031) · 7ef69202

Committed by heliqi on Mar 31, 2022

* add flatten2,reshape2,squueze2_trt_fuse_pass  test cast

* add flatten2,reshape2,squueze2_trt_fuse_pass  test cast

* add flatten2,reshape2,squueze2_trt_fuse_pass  test cast

remove shape check (#41143) · 4b9e748a
Committed by wenbin on Mar 31, 2022

30 Mar, 2022 · 2 commits
- move elementwise_mul selected rows input (#41042) · 13f1641d
  Committed by YuanRisheng on Mar 30, 2022
- Optimize the onnxruntime code (#41044) · f12b5260
  Committed by heliqi on Mar 30, 2022
29 Mar, 2022 · 1 commit
- add elementwise sub and elementwise div in tensorrt op teller (#40806) · f3022dfa
  Committed by wangxinxin08 on Mar 29, 2022
```
* add elementwise sub and elementwise div in tensorrt op teller

* add unittest of elementwise mul, sub and div
```
24 Mar, 2022 · 1 commit

[Phi] Move mul op kernel into phi (#40833) · 1b491818

Committed by Chen Weihang on Mar 24, 2022

* add mul phi kernel

* remove mul op kernel

* remove original mul grad op

* fix cinn test

* fix dygraph test failed

21 Mar, 2022 · 1 commit
- Move conv-transpose OPs to phi (#40675) · 1eb96eec
  Committed by From00 on Mar 21, 2022
```
* Move conv-transpose OPs to phi

* Fix CI errors

* Fix CI errors
```
18 Mar, 2022 · 1 commit
- set +x to close showing command, update check_change code with linux (#40456) · 161d27dc
  Committed by Sing_chan on Mar 18, 2022
17 Mar, 2022 · 5 commits

CopyFromCpu and CopyToCpu of Onnxruntime back-end optimize (#40561) · fcbb7440

Committed by heliqi on Mar 17, 2022

* add onnxruntime predictor

* Add code comments

* support link paddle2onnx onnxruntime

* support onnxruntime with python

* support onnxruntime with python

* support onnxruntime with windows

* paddle2onnx compile with windows

* supoort windows compile

* supoort windows compile with onnxruntime

* supoort windows compile with paddle2onnx

* supoort mac compile

* compile with mac

* compile with mac

* add code comments

* fix remind word

* code optimization

* add test case

* add test case

* add inference demo_ci test case

* fix compile paddle2onnx with no python

* add inference demo_ci test case

* add inference demo_ci test case

* add inference infer_ut test case

* support c go api and test cases

* add converage test case

* add converage test case

* add capi test case

* add capi test case

* fix onnxruntime copyfromcpu and copytocpu

* fix goapi

* modify code

Move layer norm to phi (#40193) · 681a6865

Committed by hong on Mar 17, 2022

* update

* fix bugs; test=develop

* update; test=develop

* fix test compile error; test=develop

* fix cpu compile error; test=develop

* fix test error; test=develo

* fix layer_norm_op plugin error; test=develop

* fix error; test=develop

* fix test bug; test=develop

* update; test=develop

* polish code; test=develop

* fix bugs; test=develop

* remove unused depency; test=develop

* polish code; test=develop

move activation sigmoid (#40626) · ed8a9370
Committed by YuanRisheng on Mar 17, 2022

[fleet executor] fleet executor for npu (#40607) · 81848fff
Committed by Yuang Liu on Mar 17, 2022

support gpu mixed precision inference (#40531) · 06fee998
Committed by baoachun on Mar 17, 2022

15 Mar, 2022 · 1 commit

[Phi]Move Tanh/BRelu/LeakyRelu/ThresholdedRelu Kernels to Phi (#40385) · d7112180

Committed by YuanRisheng on Mar 15, 2022

* move activation op

* adjust code format

* fix compile bugs

* fix ci bugs

* code format adjust

* code format adjust2

* activate ci status

* modify according to comment

* move activation kernel

* revert relu6

* reduce add code

* perfect use_phi_functor

* completing func name

* fix bugs when run ci

* fix bugs when run infr

* modifpy infrt get kernel signature

14 Mar, 2022 · 2 commits

Add an elementwise + activation fusion pass. (#36541) · 3f219160

Committed by Tomasz Socha on Mar 14, 2022

* Add elementwise add and activation fuse pass

* Fix copy ellision

* More flexible pattern detector

* More flexible fusion pass

* Update lists for pass

* Add support for Pow operator

* Add support for more activation types

* Style

* Rename fusion pass

* First version of tests

* Dirty version of pass

* Polished version

* Update pbtxt

* Style

* Update names

* Style

* Use PADDLE_ENFORCE_EQ

* Save error message to variable

* WO for error checks

* CR

* Static style check

* Add missing 'activation_scale' attribute

* Add relu6 and sigmoid activations

* Style

* Fix fuse list formating

* Sync filenames for fuse pass files

* Fix cmake after move

* Fix registration

* Fix pass name in tests

* Add missing activations to checker

* WIPS

* Working mul op

* Working sub

* Working Add

* Remove pten includes

* Remove some forward declarations

* Remove Includes

* Fixes

* Remove default kernels

* Add check if post_ops attributes are avaliable

* Style

* Code adjustment

* Register default kernels

* We have year 2022 not 2021...
Co-authored-by: jakpiase <jakpia21@gmail.com>
Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>

* Fast review fixes
Co-authored-by: jakpiase <jakpia21@gmail.com>
Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>

* Review Fix

* Rename one_dnn -> onednn

* Style after review

* Fast and dirty fix for quantization

* Update tests

* Style

* Fix mkldnn_quantizer config

* Add Joanna's suggestion.

* Check if operator is explicitly disables on OneDNN

* Try to use unregistered attributes

* Style

* Test new framework

* FXI

* FXII

* Update test

* Style
Co-authored-by: jakpiase <jakpia21@gmail.com>
Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>

Move Pool OPs to phi (#40208) · 88ec08a7
Committed by From00 on Mar 14, 2022
```
* Move Pool OPs to phi

* Fix CI error

* Fix conflicts
```

10 Mar, 2022 · 2 commits

Inference add ONNXRuntime back-end (#39988) · 431afc39

Committed by heliqi on Mar 10, 2022

* add onnxruntime predictor

* Add code comments

* support link paddle2onnx onnxruntime

* support onnxruntime with python

* support onnxruntime with python

* support onnxruntime with windows

* paddle2onnx compile with windows

* supoort windows compile

* supoort windows compile with onnxruntime

* supoort windows compile with paddle2onnx

* supoort mac compile

* compile with mac

* compile with mac

* add code comments

* fix remind word

* code optimization

* add test case

* add test case

* add inference demo_ci test case

* fix compile paddle2onnx with no python

* add inference demo_ci test case

* add inference demo_ci test case

* add inference infer_ut test case

* support c go api and test cases

* add converage test case

* add converage test case

* add capi test case

* add capi test case

Move dropout to phi (#40148) · 99fc1b08

Committed by hong on Mar 10, 2022

* move dropout to phi; test=develop

* fix xpu, npu compile error; test=develop

08 Mar, 2022 · 2 commits

[Phi]Move Relu/Cos/Sin/Tan/Acos/Asin/Atan/Sinh/Cosh/Asinh/Acosh/Atanh kernels... · 975f99ab

Committed by YuanRisheng on Mar 08, 2022

[Phi]Move Relu/Cos/Sin/Tan/Acos/Asin/Atan/Sinh/Cosh/Asinh/Acosh/Atanh kernels in Activation to Phi (#40175)

* move activation op

* adjust code format

* fix compile bugs

* fix ci bugs

* code format adjust

* code format adjust2

* activate ci status

* modify according to comment

[custom kernel]Upgrade support for multiple libs (#40223) · c39aa18e
Committed by Aganlengzi on Mar 08, 2022
```
* [custom kernel]Upgade support for multi libs

* upgrade phi_custom_kernel deps
```

04 Mar, 2022 · 3 commits

Move yolo box to phi (#40112) · faece382
Committed by hong on Mar 04, 2022
```
* add yolo box kernel; test=develop

* fix comile error; test=develop
```

[paddle-inference]support setting fully connected in multi-head attention... · 8dbfc2ae

Committed by ceci3 on Mar 04, 2022

[paddle-inference]support setting fully connected in multi-head attention static shape branch to int8  (#39660)

* fix inference int

* update

* add unittest

Move conv to pten (#39354) · d50fb43e

Committed by hong on Mar 04, 2022

* move conv to pten

* move conv to pten; test=develop

* fix bug;

* add conv cudnn impl; test=develop

* update

* update operator; test=develop

* fix bug; test=develop

* move operator and prepared_operator to develop; test=develop

* resolve conflict; test=develop

* remove useless code;test=develop

* add depency ; test=develop

* fix bug;

* add sig.cc ; test=develop

* fix use_op error; test=develop

* fix bug; test=develop

* fix bug; test=develop

* add conv3d register; test=develop

* fix star gan and conv_nn_grad test failed; test=develop

* add header; test=develop

* manul to recover to develop;

* resolve confilct; test=develop

* remove useless code

* fix bug;

* remove conv2d_cudnn; test=develop

* fix bugs; test=develop

* fix cpu rocm compile bugs; test=develop

* fix blas error; test=develop

* fix compile bug; test=develop

* fix windows compile error; test=develop

* fix windows error; test=develop

* resolve confilct; test=develop

03 Mar, 2022 · 3 commits
- fix_trt_engine_op_bug (#40067) · d8b40223
  Committed by JingZhuangzhuang on Mar 03, 2022
- [CustomRuntime] migrate CustomRuntime into phi (#39908) · b4665d23
  Committed by ronnywang on Mar 03, 2022
- EmbEltwiseLayernorm fix (#40015) · c3f3643b
  Committed by wenbin on Mar 03, 2022
```
* emb fix

* fix trt6 compile

* fix half

* absolute error fix
```
02 Mar, 2022 · 3 commits
- [fleet_executor] Add entrance of FleetExecutor in AnalysisPredictor for... · 244ae318
  Committed by Yuang Liu on Mar 02, 2022
```
[fleet_executor] Add entrance of FleetExecutor in AnalysisPredictor for distributed inference (#39992)
```
- ernie: revert skip_layernorm_fp16 (#39991) · 26e2b918
  Committed by Wangzheee on Mar 02, 2022
- add share external data interface (#39809) · 1ff1c1e0
  Committed by JingZhuangzhuang on Mar 02, 2022
01 Mar, 2022 · 2 commits
- Add mobilenetv3_large performance test for bf16 and int8 (#39738) · eb7c211a
  Committed by joanna.wozna.intel on Mar 01, 2022
```
* Add mobilenetv3_large performance test

* Disable the BF16 test if the device does not support BF16 computations

* Change test timeout
```
- remove conv_affine_channel_fuse_pass (#39817) · fc06be9d
  Committed by wenbin on Mar 01, 2022
```
* remove

* pass

* more pass
```
28 Feb, 2022 · 1 commit

[Pten->Phi PR4] Rename pten in funcs to phi (#39961) · eb42dd52

Committed by Chen Weihang on Feb 28, 2022

* rename pten_utils to phi_utils

* rename pten_utils target

* rename Pten to Phi

* replace pten with phi

* resolve conflict

25 Feb, 2022 · 2 commits

Disable dist ut cases (#39906) · 4fe465cb

Committed by YUNSHEN XIE on Feb 25, 2022

* disable some distribute test case when in CPU test env

* disable some test case when in CPU test env

* fix

[Phi] Support cudnn kernel moving & move softmax kernels (#39547) · 8895379a

Committed by Chen Weihang on Feb 25, 2022

* support cudnn kernel moving

* polish cmake rules

* add unittest for coverage

* remove orig kernel

* remove softmax cudnn kernel

* fix softmax test failed

* fix npu func error

* resolve conflict

* rename gpu dnn kernels

* fix name rule error

* fix compile error

* update fp16 namespace

Crayon鑫 / Paddle: in sync with the fork source project