Commits · 173b39bb5703c297ae89c6ef442f634c56f2f2bf · BaiXuePrincess / Paddle

22 Sep, 2022 1 commit
- TensorRT engine context memory sharing (#45842) · 173b39bb
  Committed by Yuanle Liu on Sep 22, 2022
  
  173b39bb
05 Sep, 2022 2 commits

New format quant model support for MKLDNN (#45416) · 4e4f4586

Committed by yeliang2258 on Sep 05, 2022

* support onnx format quantized model

* update code

* add test

* add test

* fix

* fix test

* fix cmake

* update code

* change scale file path to calibration file path

* update code

* update code

* fix build bug

* fix build bugs

* fix

* fix

4e4f4586

Update DlNNE engine (#45027) · 638965c5

Committed by denglin-github on Sep 05, 2022

* add config param for enable_dlnne and support calibration mode
* remove useless file
* refine code and add annotation
* refine code of Warning tips

638965c5

05 Aug, 2022 1 commit

update trt workspace size param (#44469) · bdce552b

Committed by Zhang Jun on Aug 05, 2022

* update trt workspace size param

* update

* update

* update

* use int64_t

* use int64_t

* update

* update

bdce552b

08 Jul, 2022 1 commit
- Inference support mixed-precision model [3] (#44057) · 7f958728
  Committed by Wilber on Jul 08, 2022
  
  7f958728
29 Jun, 2022 1 commit
- inference support mixed-precision model [1]. (#43814) · c7694b82
  Committed by Wilber on Jun 29, 2022
```
* inference add convert to mixed model ability.
```
  c7694b82
28 Jun, 2022 1 commit

Enable Bert on bfloat16 datatype (#43455) · 6d31dc93

Committed by Tomasz Socha on Jun 28, 2022

* Remove output arguments from functions.
Replace pointers with references

* Name used bool flags

* Reorder functions

* Enable bfloat16 data type

* Give declarations some space

* Style

* Style

6d31dc93

24 Jun, 2022 1 commit
- revert 40531 (#43807) · 7985407b
  Committed by Wilber on Jun 24, 2022
```
* revert 40531

* update
```
  7985407b
08 Jun, 2022 1 commit
- thread_local method to support predictor stream. (#42785) · cab0f2f5
  Committed by Wilber on Jun 08, 2022
  
  cab0f2f5
05 Jun, 2022 1 commit
- [code format check upgrade] step2: clang-format (#42840) · a3730dc8
  Committed by Sing_chan on Jun 05, 2022
  
  a3730dc8
02 Jun, 2022 1 commit
- [Paddle-Inference] new general transformer inference support (#43077) · 2810dfea
  Committed by Wangzheee on Jun 02, 2022
```
* new general transformer inference support
```
  2810dfea
12 May, 2022 1 commit
- Fix some typos in paddle/. (#42408) · 2012672c
  Committed by Shuangchi He on May 12, 2022
  
  2012672c
10 May, 2022 1 commit
- [CustomDevice] add inference support (#42036) · 02e5c4be
  Committed by ronnywang on May 10, 2022
  
  02e5c4be
19 Apr, 2022 1 commit
- update gpu fp16 op blacklist (#41703) · 55096a1c
  Committed by baoachun on Apr 19, 2022
```
* update gpu fp16 op blacklist

* update blacklist
```
  55096a1c
14 Apr, 2022 1 commit

add mkldnn int8 pass [step3] (#41599) · 8e2d4d30

Committed by baoachun on Apr 14, 2022

* add mkldnn int8 pass [step3]

* Add test for compute_propagate_scales_mkldnn_pass

* update pass

* update api comment and python api

Co-authored-by: wozna <joanna.wozna@intel.com>

8e2d4d30

17 Mar, 2022 1 commit
- support gpu mixed precision inference (#40531) · 06fee998
  Committed by baoachun on Mar 17, 2022
  
  06fee998
10 Mar, 2022 1 commit

Inference add ONNXRuntime back-end (#39988) · 431afc39

Committed by heliqi on Mar 10, 2022

* add onnxruntime predictor

* Add code comments

* support link paddle2onnx onnxruntime

* support onnxruntime with python

* support onnxruntime with python

* support onnxruntime with windows

* paddle2onnx compile with windows

* support windows compile

* support windows compile with onnxruntime

* support windows compile with paddle2onnx

* support mac compile

* compile with mac

* compile with mac

* add code comments

* fix remind word

* code optimization

* add test case

* add test case

* add inference demo_ci test case

* fix compile paddle2onnx with no python

* add inference demo_ci test case

* add inference demo_ci test case

* add inference infer_ut test case

* support c go api and test cases

* add coverage test case

* add coverage test case

* add capi test case

* add capi test case

431afc39

02 Mar, 2022 1 commit
- [fleet_executor] Add entrance of FleetExecutor in AnalysisPredictor for... · 244ae318
  Committed by Yuang Liu on Mar 02, 2022
```
[fleet_executor] Add entrance of FleetExecutor in AnalysisPredictor for distributed inference (#39992)
```
  244ae318
23 Feb, 2022 1 commit
- [IPU] update inference demos (#39792) · 24f55aed
  Committed by Allen Guo on Feb 23, 2022
```
* update inference part

* restore white space
```
  24f55aed
11 Feb, 2022 1 commit
- Add TensorRT inspector into Paddle-TRT (#38362) · 69793a27
  Committed by Leo Chen on Feb 11, 2022
  
  69793a27
13 Jan, 2022 1 commit
- [Paddle-Inference] add Paddle Trt config: with_interleaved (#38884) · dccdc719
  Committed by Wangzheee on Jan 13, 2022
```
* add Paddle Trt config: with_interleaved
```
  dccdc719
15 Dec, 2021 1 commit

ipu_inference (#37102) · 141b2854

Committed by jianghaicheng on Dec 15, 2021

* add ipu_inference

* resolve comments

* resolve comments

* add EnableIpu introduction

* rm line

* restore npu update

* add ernie and resnet50 test

* fix copyright time

Co-authored-by: yaozhixin <522190855@qq.com>

141b2854

24 Nov, 2021 1 commit
- fix lite with xpu or nnadapter (#37449) · 93aefceb
  Committed by zhupengyang on Nov 24, 2021
  
  93aefceb
22 Sep, 2021 1 commit
- [Inference] Support NNAdapter and ascend310 (#35226) · 10e53044
  Committed by JingZhuangzhuang on Sep 22, 2021
  
  10e53044
15 Sep, 2021 1 commit
- add set-xpu-device-id function for inference config. (#35572) · a74d7fb6
  Committed by houj04 on Sep 15, 2021
  
  a74d7fb6
14 Sep, 2021 1 commit
- [Inference] Add tuned trt_dynamic_shape mode. (#34806) · 7c96efed
  Committed by Wilber on Sep 14, 2021
  
  7c96efed
06 Sep, 2021 1 commit
- update trt ut. (#35458) · 18934c53
  Committed by Wilber on Sep 06, 2021
  
  18934c53
04 Sep, 2021 1 commit
- update inference trt ut framework (#35418) · e8772486
  Committed by Wilber on Sep 04, 2021
  
  e8772486
19 Jul, 2021 1 commit
- [Inference] Add config.Summary api (#34122) · 831c1c6c
  Committed by Wilber on Jul 19, 2021
  
  831c1c6c
14 Jul, 2021 1 commit
- Inference support Ascend910 (#34101) · 4e3fb219
  Committed by Wilber on Jul 14, 2021
  
  4e3fb219
23 Jun, 2021 1 commit
- modify mkldnn default capacity (#33729) · 0722297d
  Committed by Wilber on Jun 23, 2021
  
  0722297d
21 Jun, 2021 1 commit
- fix emb_eltwise_ln gpu_id bug (#33701) · 1b0c5ef2
  Committed by Pei Yang on Jun 21, 2021
  
  1b0c5ef2
17 Jun, 2021 1 commit
- [Inference] Update go inference api based on new capi. (#33113) · c7e3c918
  Committed by Wilber on Jun 17, 2021
  
  c7e3c918
25 Apr, 2021 2 commits

update lite subgraph api. (#32513) · 92dc9b2b
Committed by Wilber on Apr 25, 2021

92dc9b2b

Nne integration (#32255) · feb2e476

Committed by denglin-github on Apr 25, 2021

* Add dlnne engine runtime

* Fix log

* Remove <const_cast> and remove unrelated modify with dlnne, +clang-format

* Fix CMakeList format error

* Add copyright message

* Fix dlnne CMakeList.txt

* Add some paddlepaddle_pass to support more networks

* Fix some format bug

feb2e476

07 Feb, 2021 1 commit

bug fix of xpu lite engine, test=develop (#30918) · 99bd16eb

Committed by 石晓伟 on Feb 07, 2021

* bug fix of xpu lite engine, test=develop

* xpu zero copy tensor, test=develop

* revert paddle/fluid/inference/tests/api/CMakeLists.txt

99bd16eb

03 Feb, 2021 1 commit

support xpu with analysis predictor, test=develop (#30832) · 2ac4143b

Committed by 石晓伟 on Feb 03, 2021

* support xpu inference with analysis predictor, test=develop

* merge the cmake of the xpu toolchain, test=develop

* add c-apis, test=develop

* fix a bug in extern_xpu, test=develop

2ac4143b

25 Jan, 2021 1 commit

add DLA support: C++ && Python api (#30165) · ae0f88a9

Committed by Shang Zhizhou on Jan 25, 2021

* add dla

* add dla done

* add python api

Co-authored-by: shangzhizhou <root@szth-rp-fanyi-opera49.szth.baidu.com>

ae0f88a9

06 Jan, 2021 1 commit

add inference api: DisableTensorRtOps (#30109) · 05b27695

Committed by Shang Zhizhou on Jan 06, 2021

* snap

* add inference api: DisableTensorRtOPs

* fix code style

* update api to experimental

* update variable name

05b27695

03 Nov, 2020 1 commit

Optimize ERNIE model inference performance in TensorRT, support variable-length input (#28367) · ea851796

Committed by Shang Zhizhou on Nov 03, 2020

* fp16 result ok

* change -DWITH_NVINFER_PLUGIN to config.EnableTensorRtOSS

* auto detect special slice op converter for ernie with trt oss

* ernie oss only support fp16

* fix special_slice_plugin serialize bug

* matmul in tensorrt ok

* ernie unittest ok

* add matmul tensorrt unittest

* remove demo code

ea851796
