提交 · 4427f1b1726bbe148d2c9663b841f939fd0eeda8 · 机器未来 / Paddle

19 5月, 2022 1 次提交

[TensorRT] Support yolov5s (#42688) · a7778930

由 shentanyue 提交于 5月 19, 2022

* support yolov5s static/int8

* fix eltwise_sub and div weight compute

* fix delete_fill_constant_pass

a7778930

17 5月, 2022 1 次提交
- Z
  
  add yolo_box_fuse_pass, yolo_box_head_op, yolo_box_post_op (#42641) · 6b58de95
  由 zhupengyang 提交于 5月 17, 2022
  
  6b58de95
13 5月, 2022 1 次提交
- W
  
  add gpu resources. (#42723) · 1280f294
  由 Wilber 提交于 5月 13, 2022
  
  1280f294
12 5月, 2022 3 次提交
- S
  
  Fix some typos in paddle/. (#42408) · 2012672c
  由 Shuangchi He 提交于 5月 12, 2022
  
  2012672c
- Z
  
  add exp,log trt converter (#42655) · 6e90ba1b
  由 zhupengyang 提交于 5月 12, 2022
  
  6e90ba1b
- W
  [Paddle-Inference] support transformer generation: some passes (#42664) · 5914b18a
  由 Wangzheee 提交于 5月 12, 2022
```
* [Paddle-Inference] support transformer generation: some passes
```
  5914b18a
11 5月, 2022 1 次提交

Move weights and biases scale computing into pass (#42241) · c0652972

由 Zuza Gawrysiak 提交于 5月 11, 2022

* Add int8 scales gathering pass for convolution

* Fix typo

* Add unittest

* Add corrected unit test

* Change test name

* Remove enabling mkldnn in test

* Speed up test

* Change max examples

* Add functional test

* Change test name

* Add new test case

* Rename pass

c0652972

10 5月, 2022 2 次提交

R

[CustomDevice] add inference support (#42036) · 02e5c4be
由 ronnywang 提交于 5月 10, 2022

02e5c4be

Rea-dd conv_affine_channel fuse pass as oneDNN only pass (#41998) · 3540d33b

由 piotrekobi 提交于 5月 10, 2022

* Readd conv_affine_channel fuse pass as mkldnn pass

* Fix formatting

* Add new test to parallel_UT_rule.py

* Fix Coverage and Windows CI issues

* Revert "Fix Coverage and Windows CI issues"

This reverts commit f33459846385c9fd51c07f9f44e7ff283a652637.

* Fix CI errors

* Remove unnecessary conv_eltwise_add_affine_channel fuse pass

* Remove test from parallel_UT_rule.py

3540d33b

04 5月, 2022 1 次提交
- H
  fix paddle-ort python bug (#42464) · e7eb0e25
  由 heliqi 提交于 5月 04, 2022
```
* fix paddle-ort python bug

* fix paddle-ort python bug
```
  e7eb0e25
28 4月, 2022 1 次提交
- W
  
  fix error report. (#42333) · afa846d9
  由 Wilber 提交于 4月 28, 2022
  
  afa846d9
25 4月, 2022 1 次提交

Fix compiling ort test cases error on Windows (#42186) · 3241cea2

由 heliqi 提交于 4月 25, 2022

* fix windows compile test case error

* test windows ci

* cmake add onnxruntime

* cmake add onnxruntime

* test windows ci

* auto_code_generator add ort lib copy

* fallback modify windows ci bat

* ci notest;test=document_fix;test=windows_ci_inference;test=windows_ci;test=windows_op

3241cea2

21 4月, 2022 2 次提交
- H
  
  fix onnxruntime bug (#42095) · c51f55f9
  由 heliqi 提交于 4月 21, 2022
  
  c51f55f9
- W
  infer add io stream. (#42031) · 0d28ee29
  由 Wilber 提交于 4月 21, 2022
```
* infer add io stream.

* add macro
```
  0d28ee29
20 4月, 2022 1 次提交
- B
  
  update demo_ci ut threshold (#41981) · 65a5492a
  由 baoachun 提交于 4月 20, 2022
  
  65a5492a
19 4月, 2022 2 次提交
- B
  update gpu fp16 op blacklist (#41703) · 55096a1c
  由 baoachun 提交于 4月 19, 2022
```
* update gpu fp16 op blacklist

* update blacklist
```
  55096a1c
- J
  
  fix infer gpu strage (#41924) · 84c8096c
  由 JingZhuangzhuang 提交于 4月 19, 2022
  
  84c8096c
17 4月, 2022 1 次提交

[Perf] Optimize dygraph scheduling performance (#41696) · 7ee31a96

由 Chen Weihang 提交于 4月 17, 2022

* split phi and fluid infermeta context

* resolve conflict

* fix type error

* optimize scheduling perf

* spec small vector size

* replace all grad var name

* fix test failed

* move init defalut signature

* polish details

* polish details

* fix no init bug

* init sig for tests

* add init sig for infer

* fix infrt error

* fix infrt failed

* fix kunlun error

* fix infrt failed

7ee31a96

14 4月, 2022 5 次提交

Fix to #38693 (minimal UT) (#41026) · d0f3296b

由 Jacek Czaja 提交于 4月 14, 2022

* Add UT

- Added missed data_layout

- Added missing conversions

- NDHWC added

- NDHWC support in data_transform

- another fix

- condddate change

- fix

u- fix

- fix

- fix

- fix

- fix

- fix to hack

- compilation fix

- fix to automatic merge

* - reduced UT

* - fix

* - lint

* - fix to lint

d0f3296b

FC+elementwise_add (residual connection) (#41776) · 92d8d0bc

由 Sławomir Siwek 提交于 4月 14, 2022

* Change tensor name to match activation

* declare fc_eltwise_add pass

* merge conv_eltwise refactor PR

* first compilable draft

* unittest feedback tools

* Fuse pass tester

* Move IsReachable() to shared file

* 100% coverage of fuse_pass_tester.cc

* register pass

* Add bias node

* Improve unit tests / remove bias node from pattern

* improve fc_eltwiseadd_unittest

* cancel eltwise_add fuse if act is already fused

* Add elementwise_input scale

* Residual MVP

* Add new FC attrs

* Add more test cases

* Add missing op attrs

* Adapt code to new Elementwise pattern

* reuse existing fcpattern

* improve code style

* remove unused arguments

* fix typo

* remove whitespace

* remove int8 related code

* Remove attributes from base ops

* style

* style check

* Remove input from base op

* Set attribute during fuse

* ut timeout

* download and test model

* DRY

* apply feedback from review

* Style check

* fix typo

* cosmetic changes

* explicitly set residual as output

* VIT-OCR accuracy check

* trigger CI

* remove whitespaces

* fix missing data file

92d8d0bc

S

fix bug of set cuda lib in demo_ci and infer_ut (#41677) · bda4965a
由 Sing_chan 提交于 4月 14, 2022

bda4965a

add mkldnn int8 pass [step3] (#41599) · 8e2d4d30

由 baoachun 提交于 4月 14, 2022

* add mkldnn int8 pass [step3]

* Add test for compute_propagate_scales_mkldnn_pass

* update pass

* update api comment and python api
Co-authored-by: Nwozna <joanna.wozna@intel.com>

8e2d4d30

Added shuffle_channel BF16/FP32 FWD oneDNN kernel (#39756) · c7623d72

由 jakpiase 提交于 4月 14, 2022

* added shuffle_channel bf16/fp32 fwd kernel

* added missing files

* CI fix

* changed from pten to phi

* tmp save

* added reviewers suggestions

* fix for test

c7623d72

13 4月, 2022 1 次提交

init roll convert (#41689) · 14c3c450

由 feng_shuai 提交于 4月 13, 2022

* init roll convert

* add ut for roll convert

* roll convert don't support trt6.0

* fix: change ut for trt 7.0.0.1

14c3c450

12 4月, 2022 2 次提交

strided_slice (#41573) · b861022a

由 feng_shuai 提交于 4月 12, 2022

* strided_slice

* fix: compiler error because of size()

* fix: warning

* fix : warning

* init input_shape

* fix:forget punctuation

b861022a

add python share_data interface (#41626) · be4a2077

由 JingZhuangzhuang 提交于 4月 12, 2022

* add python share_data interface

* Update inference_api.cc

* Update inference_api.cc

* add python share_data interface

be4a2077

07 4月, 2022 3 次提交

modify inference model test build method to support multi version (#41027) · c9e0e10e

由 Sing_chan 提交于 4月 07, 2022

* change inference demo_test build method to ninja to choose visual studio version automaticly

* notest;test=windows_ci_inference

* set cuda of demo_ci by arg,fix bug of ninja compile,test=document_fix;test=windows_ci;test=windows_ci_inference

* fix bug;test=document_fix;test=windows_ci;test=windows_ci_inference

* fix bug;test=document_fix;test=windows_ci_inference"

* set lib_path according to generator

c9e0e10e

Z

remove cudnn_deterministic=True (#41341) · cefa91fd
由 Zhang Jun 提交于 4月 07, 2022

cefa91fd
J
modify infer gpu memory strategy (#41427) · 56e72b20
由 JingZhuangzhuang 提交于 4月 07, 2022
```
* modify infer gpu memory strategy

* modify infer gpu memory strategy
```
56e72b20

06 4月, 2022 1 次提交
- A
  [IPU] remove paddle_ipu shared library (#41307) · 229e91bf
  由 Allen Guo 提交于 4月 06, 2022
```
* remove paddle_ipu shared library

* fix unique_name
```
  229e91bf
02 4月, 2022 1 次提交
- W
  [Paddle inference] support new quant_model (#41049) · 1b58ce14
  由 Wangzheee 提交于 4月 02, 2022
```
* paddle inference support new quant_model
```
  1b58ce14
31 3月, 2022 2 次提交

W
add multiclass nms3 trt converter (#41181) · 08c3edb3
由 wangxinxin08 提交于 3月 31, 2022
```
* add multiclass_nms3 converter
```
08c3edb3

Using DistConfig in Paddle Inference (#41128) · dc0702fe

由 TeslaZhao 提交于 3月 31, 2022

* Pass compat of conv_transpose_bias_mkldnn_fuse_pass

* Fix a bug of strided_slice op, about the axes parameter access memory out of bounds

* Fix a bug of strided_slice op, about the axes parameter access memory out of bounds

* Fix a bug of transpose op, about accessing memory out of bounds of the perm param

* op:transpose_op supports bool type

* op:transpose_op supports bool type

* Keep strided_slice op behavior consistent with slice op when starts input is less than -rank

* Using DistConfig in inference

dc0702fe

30 3月, 2022 1 次提交
- H
  
  Optimize the onnxruntime code (#41044) · f12b5260
  由 heliqi 提交于 3月 30, 2022
  
  f12b5260
18 3月, 2022 1 次提交
- S
  
  set +x to close showing command, update check_change code with linux (#40456) · 161d27dc
  由 Sing_chan 提交于 3月 18, 2022
  
  161d27dc
17 3月, 2022 3 次提交

CopyFromCpu and CopyToCpu of Onnxruntime back-end optimize (#40561) · fcbb7440

由 heliqi 提交于 3月 17, 2022

* add onnxruntime predictor

* Add code comments

* support link paddle2onnx onnxruntime

* support onnxruntime with python

* support onnxruntime with python

* support onnxruntime with windows

* paddle2onnx compile with windows

* supoort windows compile

* supoort windows compile with onnxruntime

* supoort windows compile with paddle2onnx

* supoort mac compile

* compile with mac

* compile with mac

* add code comments

* fix remind word

* code optimization

* add test case

* add test case

* add inference demo_ci test case

* fix compile paddle2onnx with no python

* add inference demo_ci test case

* add inference demo_ci test case

* add inference infer_ut test case

* support c go api and test cases

* add converage test case

* add converage test case

* add capi test case

* add capi test case

* fix onnxruntime copyfromcpu and copytocpu

* fix goapi

* modify code

fcbb7440

Y

[fleet executor] fleet executor for npu (#40607) · 81848fff
由 Yuang Liu 提交于 3月 17, 2022

81848fff
B

support gpu mixed precision inference (#40531) · 06fee998
由 baoachun 提交于 3月 17, 2022

06fee998

14 3月, 2022 1 次提交

Add an elementwise + activation fusion pass. (#36541) · 3f219160

由 Tomasz Socha 提交于 3月 14, 2022

* Add elementwise add and activation fuse pass

* Fix copy ellision

* More flexible pattern detector

* More flexible fusion pass

* Update lists for pass

* Add support for Pow operator

* Add support for more activation types

* Style

* Rename fusion pass

* First version of tests

* Dirty version of pass

* Polished version

* Update pbtxt

* Style

* Update names

* Style

* Use PADDLE_ENFORCE_EQ

* Save error message to variable

* WO for error checks

* CR

* Static style check

* Add missing 'activation_scale' attribute

* Add relu6 and sigmoid activations

* Style

* Fix fuse list formating

* Sync filenames for fuse pass files

* Fix cmake after move

* Fix registration

* Fix pass name in tests

* Add missing activations to checker

* WIPS

* Working mul op

* Working sub

* Working Add

* Remove pten includes

* Remove some forward declarations

* Remove Includes

* Fixes

* Remove default kernels

* Add check if post_ops attributes are avaliable

* Style

* Code adjustment

* Register default kernels

* We have year 2022 not 2021...
Co-authored-by: Njakpiase <jakpia21@gmail.com>
Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com>

* Fast review fixes
Co-authored-by: Njakpiase <jakpia21@gmail.com>
Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com>

* Review Fix

* Rename one_dnn -> onednn

* Style after review

* Fast and dirty fix for quantization

* Update tests

* Style

* Fix mkldnn_quantizer config

* Add Joanna's suggestion.

* Check if operator is explicitly disables on OneDNN

* Try to use unregistered attributes

* Style

* Test new framework

* FXI

* FXII

* Update test

* Style
Co-authored-by: Njakpiase <jakpia21@gmail.com>
Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com>

3f219160

10 3月, 2022 1 次提交

Inference add ONNXRuntime back-end (#39988) · 431afc39

由 heliqi 提交于 3月 10, 2022

* add onnxruntime predictor

* Add code comments

* support link paddle2onnx onnxruntime

* support onnxruntime with python

* support onnxruntime with python

* support onnxruntime with windows

* paddle2onnx compile with windows

* supoort windows compile

* supoort windows compile with onnxruntime

* supoort windows compile with paddle2onnx

* supoort mac compile

* compile with mac

* compile with mac

* add code comments

* fix remind word

* code optimization

* add test case

* add test case

* add inference demo_ci test case

* fix compile paddle2onnx with no python

* add inference demo_ci test case

* add inference demo_ci test case

* add inference infer_ut test case

* support c go api and test cases

* add converage test case

* add converage test case

* add capi test case

* add capi test case

431afc39

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致