提交 · b1ee9d5e02bd9e16e2680dc65aeb795ab468cdc8 · Crayon鑫 / Paddle

29 3月, 2022 1 次提交
- W
  add elementwise sub and elementwise div in tensorrt op teller (#40806) · f3022dfa
  由 wangxinxin08 提交于 3月 29, 2022
```
* add elementwise sub and elementwise div in tensorrt op teller

* add unittest of elementwise mul, sub and div
```
  f3022dfa
24 3月, 2022 1 次提交

[Phi] Move mul op kernel into phi (#40833) · 1b491818

由 Chen Weihang 提交于 3月 24, 2022

* add mul phi kernel

* remove mul op kernel

* remove original mul grad op

* fix cinn test

* fix dygraph test failed

1b491818

21 3月, 2022 1 次提交
- F
  Move conv-transpose OPs to phi (#40675) · 1eb96eec
  由 From00 提交于 3月 21, 2022
```
* Move conv-transpose OPs to phi

* Fix CI errors

* Fix CI errors
```
  1eb96eec
18 3月, 2022 1 次提交
- S
  
  set +x to close showing command, update check_change code with linux (#40456) · 161d27dc
  由 Sing_chan 提交于 3月 18, 2022
  
  161d27dc
17 3月, 2022 5 次提交

CopyFromCpu and CopyToCpu of Onnxruntime back-end optimize (#40561) · fcbb7440

由 heliqi 提交于 3月 17, 2022

* add onnxruntime predictor

* Add code comments

* support link paddle2onnx onnxruntime

* support onnxruntime with python

* support onnxruntime with python

* support onnxruntime with windows

* paddle2onnx compile with windows

* supoort windows compile

* supoort windows compile with onnxruntime

* supoort windows compile with paddle2onnx

* supoort mac compile

* compile with mac

* compile with mac

* add code comments

* fix remind word

* code optimization

* add test case

* add test case

* add inference demo_ci test case

* fix compile paddle2onnx with no python

* add inference demo_ci test case

* add inference demo_ci test case

* add inference infer_ut test case

* support c go api and test cases

* add converage test case

* add converage test case

* add capi test case

* add capi test case

* fix onnxruntime copyfromcpu and copytocpu

* fix goapi

* modify code

fcbb7440

Move layer norm to phi (#40193) · 681a6865

由 hong 提交于 3月 17, 2022

* update

* fix bugs; test=develop

* update; test=develop

* fix test compile error; test=develop

* fix cpu compile error; test=develop

* fix test error; test=develo

* fix layer_norm_op plugin error; test=develop

* fix error; test=develop

* fix test bug; test=develop

* update; test=develop

* polish code; test=develop

* fix bugs; test=develop

* remove unused depency; test=develop

* polish code; test=develop

681a6865

Y

move activation sigmoid (#40626) · ed8a9370
由 YuanRisheng 提交于 3月 17, 2022

ed8a9370
Y

[fleet executor] fleet executor for npu (#40607) · 81848fff
由 Yuang Liu 提交于 3月 17, 2022

81848fff
B

support gpu mixed precision inference (#40531) · 06fee998
由 baoachun 提交于 3月 17, 2022

06fee998

15 3月, 2022 1 次提交

[Phi]Move Tanh/BRelu/LeakyRelu/ThresholdedRelu Kernels to Phi (#40385) · d7112180

由 YuanRisheng 提交于 3月 15, 2022

* move activation op

* adjust code format

* fix compile bugs

* fix ci bugs

* code format adjust

* code format adjust2

* activate ci status

* modify according to comment

* move activation kernel

* revert relu6

* reduce add code

* perfect use_phi_functor

* completing func name

* fix bugs when run ci

* fix bugs when run infr

* modifpy infrt get kernel signature

d7112180

14 3月, 2022 2 次提交

Add an elementwise + activation fusion pass. (#36541) · 3f219160

由 Tomasz Socha 提交于 3月 14, 2022

* Add elementwise add and activation fuse pass

* Fix copy ellision

* More flexible pattern detector

* More flexible fusion pass

* Update lists for pass

* Add support for Pow operator

* Add support for more activation types

* Style

* Rename fusion pass

* First version of tests

* Dirty version of pass

* Polished version

* Update pbtxt

* Style

* Update names

* Style

* Use PADDLE_ENFORCE_EQ

* Save error message to variable

* WO for error checks

* CR

* Static style check

* Add missing 'activation_scale' attribute

* Add relu6 and sigmoid activations

* Style

* Fix fuse list formating

* Sync filenames for fuse pass files

* Fix cmake after move

* Fix registration

* Fix pass name in tests

* Add missing activations to checker

* WIPS

* Working mul op

* Working sub

* Working Add

* Remove pten includes

* Remove some forward declarations

* Remove Includes

* Fixes

* Remove default kernels

* Add check if post_ops attributes are avaliable

* Style

* Code adjustment

* Register default kernels

* We have year 2022 not 2021...
Co-authored-by: Njakpiase <jakpia21@gmail.com>
Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com>

* Fast review fixes
Co-authored-by: Njakpiase <jakpia21@gmail.com>
Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com>

* Review Fix

* Rename one_dnn -> onednn

* Style after review

* Fast and dirty fix for quantization

* Update tests

* Style

* Fix mkldnn_quantizer config

* Add Joanna's suggestion.

* Check if operator is explicitly disables on OneDNN

* Try to use unregistered attributes

* Style

* Test new framework

* FXI

* FXII

* Update test

* Style
Co-authored-by: Njakpiase <jakpia21@gmail.com>
Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com>

3f219160

F
Move Pool OPs to phi (#40208) · 88ec08a7
由 From00 提交于 3月 14, 2022
```
* Move Pool OPs to phi

* Fix CI error

* Fix conflicts
```
88ec08a7

10 3月, 2022 2 次提交

Inference add ONNXRuntime back-end (#39988) · 431afc39

由 heliqi 提交于 3月 10, 2022

* add onnxruntime predictor

* Add code comments

* support link paddle2onnx onnxruntime

* support onnxruntime with python

* support onnxruntime with python

* support onnxruntime with windows

* paddle2onnx compile with windows

* supoort windows compile

* supoort windows compile with onnxruntime

* supoort windows compile with paddle2onnx

* supoort mac compile

* compile with mac

* compile with mac

* add code comments

* fix remind word

* code optimization

* add test case

* add test case

* add inference demo_ci test case

* fix compile paddle2onnx with no python

* add inference demo_ci test case

* add inference demo_ci test case

* add inference infer_ut test case

* support c go api and test cases

* add converage test case

* add converage test case

* add capi test case

* add capi test case

431afc39

Move dropout to phi (#40148) · 99fc1b08

由 hong 提交于 3月 10, 2022

* move dropout to phi; test=develop

* fix xpu, npu compile error; test=develop

99fc1b08

08 3月, 2022 2 次提交

[Phi]Move Relu/Cos/Sin/Tan/Acos/Asin/Atan/Sinh/Cosh/Asinh/Acosh/Atanh kernels... · 975f99ab

由 YuanRisheng 提交于 3月 08, 2022

[Phi]Move Relu/Cos/Sin/Tan/Acos/Asin/Atan/Sinh/Cosh/Asinh/Acosh/Atanh kernels in Activation to Phi (#40175)

* move activation op

* adjust code format

* fix compile bugs

* fix ci bugs

* code format adjust

* code format adjust2

* activate ci status

* modify according to comment

975f99ab

A
[custom kernel]Upgrade support for multiple libs (#40223) · c39aa18e
由 Aganlengzi 提交于 3月 08, 2022
```
* [custom kernel]Upgade support for multi libs

* upgrade phi_custom_kernel deps
```
c39aa18e

04 3月, 2022 3 次提交

H
Move yolo box to phi (#40112) · faece382
由 hong 提交于 3月 04, 2022
```
* add yolo box kernel; test=develop

* fix comile error; test=develop
```
faece382

[paddle-inference]support setting fully connected in multi-head attention... · 8dbfc2ae

由 ceci3 提交于 3月 04, 2022

[paddle-inference]support setting fully connected in multi-head attention static shape branch to int8  (#39660)

* fix inference int

* update

* add unittest

8dbfc2ae

Move conv to pten (#39354) · d50fb43e

由 hong 提交于 3月 04, 2022

* move conv to pten

* move conv to pten; test=develop

* fix bug;

* add conv cudnn impl; test=develop

* update

* update operator; test=develop

* fix bug; test=develop

* move operator and prepared_operator to develop; test=develop

* resolve conflict; test=develop

* remove useless code;test=develop

* add depency ; test=develop

* fix bug;

* add sig.cc ; test=develop

* fix use_op error; test=develop

* fix bug; test=develop

* fix bug; test=develop

* add conv3d register; test=develop

* fix star gan and conv_nn_grad test failed; test=develop

* add header; test=develop

* manul to recover to develop;

* resolve confilct; test=develop

* remove useless code

* fix bug;

* remove conv2d_cudnn; test=develop

* fix bugs; test=develop

* fix cpu rocm compile bugs; test=develop

* fix blas error; test=develop

* fix compile bug; test=develop

* fix windows compile error; test=develop

* fix windows error; test=develop

* resolve confilct; test=develop

d50fb43e

03 3月, 2022 3 次提交
- J
  
  fix_trt_engine_op_bug (#40067) · d8b40223
  由 JingZhuangzhuang 提交于 3月 03, 2022
  
  d8b40223
- R
  
  [CustomRuntime] migrate CustomRuntime into phi (#39908) · b4665d23
  由 ronnywang 提交于 3月 03, 2022
  
  b4665d23
- W
  EmbEltwiseLayernorm fix (#40015) · c3f3643b
  由 wenbin 提交于 3月 03, 2022
```
* emb fix

* fix trt6 compile

* fix half

* absolute error fix
```
  c3f3643b
02 3月, 2022 3 次提交
- Y
  [fleet_executor] Add entrance of FleetExecutor in AnalysisPredictor for... · 244ae318
  由 Yuang Liu 提交于 3月 02, 2022
```
[fleet_executor] Add entrance of FleetExecutor in AnalysisPredictor for distributed inference (#39992)
```
  244ae318
- W
  
  ernie: revert skip_layernorm_fp16 (#39991) · 26e2b918
  由 Wangzheee 提交于 3月 02, 2022
  
  26e2b918
- J
  
  add share external data interface (#39809) · 1ff1c1e0
  由 JingZhuangzhuang 提交于 3月 02, 2022
  
  1ff1c1e0
01 3月, 2022 2 次提交
- J
  Add mobilenetv3_large performance test for bf16 and int8 (#39738) · eb7c211a
  由 joanna.wozna.intel 提交于 3月 01, 2022
```
* Add mobilenetv3_large performance test

* Disable the BF16 test if the device does not support BF16 computations

* Change test timeout
```
  eb7c211a
- W
  remove conv_affine_channel_fuse_pass (#39817) · fc06be9d
  由 wenbin 提交于 3月 01, 2022
```
* remove

* pass

* more pass
```
  fc06be9d
28 2月, 2022 1 次提交

[Pten->Phi PR4] Rename pten in funcs to phi (#39961) · eb42dd52

由 Chen Weihang 提交于 2月 28, 2022

* rename pten_utils to phi_utils

* rename pten_utils target

* rename Pten to Phi

* replace pten with phi

* resolve conflict

eb42dd52

25 2月, 2022 2 次提交

Disable dist ut cases (#39906) · 4fe465cb

由 YUNSHEN XIE 提交于 2月 25, 2022

* disable some distribute test case when in CPU test env

* disable some test case when in CPU test env

* fix

4fe465cb

[Phi] Support cudnn kernel moving & move softmax kernels (#39547) · 8895379a

由 Chen Weihang 提交于 2月 25, 2022

* support cudnn kernel moving

* polish cmake rules

* add unittest for coverage

* remove orig kernel

* remove softmax cudnn kernel

* fix softmax test failed

* fix npu func error

* resolve conflict

* rename gpu dnn kernels

* fix name rule error

* fix compile error

* update fp16 namespace

8895379a

24 2月, 2022 2 次提交
- C
  [PTen->Phi PR3] Rename pten make target to phi (#39832) · f77019a0
  由 Chen Weihang 提交于 2月 24, 2022
```
* rename pten to phi

* fix infrt compile failed

* resolve conflict
```
  f77019a0
- W
  [Paddle-Inference] fix special_slice plugin (#39875) · 1255e7d6
  由 Wangzheee 提交于 2月 24, 2022
```
* fix plugin: special slice for ernie
```
  1255e7d6
23 2月, 2022 1 次提交
- A
  [IPU] update inference demos (#39792) · 24f55aed
  由 Allen Guo 提交于 2月 23, 2022
```
* update inference part

* restore white space
```
  24f55aed
22 2月, 2022 3 次提交

F
delete gather_ut skip_case (#39657) · da43e065
由 feng_shuai 提交于 2月 22, 2022
```
* delete gather_ut skip_case

* add trt version limit
```
da43e065

change Vector to std::vector and provide MixVector class as a helper … (#39559) · 728c0624

由 xiongkun 提交于 2月 22, 2022

* change Vector to std::vector and provide MixVector class as a helper wrapper class

* solve the multi-gpu hang problem

* remove the duplicate template instantialize

* Copy vector to cpu

* add CopyToCPU

* xxx

* final version: fix the problem of all reduce

* remove mixvector dependence

* fix

* merge

* fix code

* fix by CI

728c0624

W
[Paddle-Inference] fix pass and convert_op for preln_ernie (#39733) · 574f3402
由 Wangzheee 提交于 2月 22, 2022
```
* fix pass and convert_op for preln_ernie and add preln_ernie'flag in pass
```
574f3402

21 2月, 2022 1 次提交

Update record interface using part2 (#39694) · c984cd85

由 chenjian 提交于 2月 21, 2022

* fix RecordEvent interface

* modify default level to 4

* update interface use

* add const default trace level

* update record event interface using

* update record event interface using

* update operator.cc

* update part2

* update part1

* fix include profiler.h header in ps server

* fix include profiler.h header in ps server

* fix profiler.h header

c984cd85

20 2月, 2022 1 次提交

[PTen->Phi PR1] Change pten dirname and namespace to phi (#39748) · dcfe1986

由 Chen Weihang 提交于 2月 20, 2022

* rename pten dir to phi

* rename namespace to phi

* rename infrt pten dir to phi

* resolve conflict

* rename pten to phi in cmake

* revert all infrt change

* change needed files

* fix infrt failed

* fix inference failed

dcfe1986

19 2月, 2022 1 次提交

[Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264

由 Aurelius84 提交于 2月 19, 2022

* Unify paddle/pten::framework::ddim into pten::ddim

* fix paddle namespace

* compile sucessfully

* fix npu src file

* fix conflict

* fix conflict

* fix tensorrt compiler error

* fix conflict

* fix conflict

* fix tesst file conflict

* fix conflict

* fix mlu file conflict

* fix mlu file conflict

* fix cinn header file conflict

* fix conflict

* fix conflict

* fix conflict

* fix conflict

2fe04264

18 2月, 2022 1 次提交
- F
  [Pten] blas and lapck migration (#39587) · 8c7ee8c2
  由 Feiyu Chan 提交于 2月 18, 2022
```
* move blas related files
* move lapack related files
```
  8c7ee8c2

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致