提交 · 8aa1772cf33170e4dc8fc6566bf60b18598b4bb2 · PaddlePaddle / Paddle

25 8月, 2023 1 次提交

[Inference] auto mixed precision inference support white list (#56535) · ecff21e7

由 Yuanle Liu 提交于 8月 25, 2023

* auto mixed precision inference support white list

* update

* update

* update

* move down identity_op_clean_pass

* fix code style

ecff21e7

23 8月, 2023 1 次提交

Integrate TRT qdq layers (#54803) · ae84c603

由 Leo Chen 提交于 8月 23, 2023

* Integrate quantize/dequantize linear and add config for explicit quantization

* Fix the build error

* Add macro for TRT version < 8.0

* Remove qdq UT from windows

* Fix UT failure

* Check TRT version in qdq UT

* Test tensorrt_explicit_enabled API

* Disable QDQ UT if TRT version < 8.5

* Add quantization postfix into public APIs

* Apply code formatter

* Fix the UT failure for explicit quantization

* Apply code formatter on modified files

* Correct the year in copyright

ae84c603

17 8月, 2023 1 次提交

Add MarkTrtEngineOutputs API (#56188) · 2abf4326

由 ming1753 提交于 8月 17, 2023

* [paddle-TRT] support mark output

* [fix bug] hook function only call one in different predictor

* add api test

2abf4326

09 8月, 2023 1 次提交
- X
  [oneDNN]rename macro to PADDLE_WITH_DNNL (#52208) · 6ff4c130
  由 Xinyu Chen 提交于 8月 09, 2023
```
* onednn: rename macro to PADDLE_WITH_DNNL

* onednn: rename macro to CINN_WITH_DNNL
```
  6ff4c130
19 6月, 2023 1 次提交
- A
  
  [XPU] add context_gm_size in XpuConfig, don't alloc gm in pass. (#54674) · 52ad918b
  由 AlbertVan 提交于 6月 19, 2023
  
  52ad918b
14 6月, 2023 1 次提交
- Z
  
  set xpu context at runtime (#54587) · d0d7d01f
  由 zhupengyang 提交于 6月 14, 2023
  
  d0d7d01f
09 6月, 2023 1 次提交
- Z
  
  refine xpu inference api (#54342) · b62b384b
  由 zhupengyang 提交于 6月 09, 2023
  
  b62b384b
22 5月, 2023 1 次提交
- Y
  [Inference] add config.enable_low_precision_io api and remove rely on... · d1bbd900
  由 Yuanle Liu 提交于 5月 22, 2023
```
[Inference] add config.enable_low_precision_io api and remove rely on AnalysisConfig::Precison in trt (#52485)
```
  d1bbd900
19 5月, 2023 1 次提交
- S
  
  [Inference] Save optimized model by pass (#53696) · fa08a514
  由 shentanyue 提交于 5月 19, 2023
  
  fa08a514
18 5月, 2023 1 次提交

[inference][trt]Remove trt sparse weight api (#53905) · 1007690b

由 Zhang Jun 提交于 5月 18, 2023

* Revert "[inference][trt]add trt sparse weights switch (#53562)"

This reverts commit 4a69a536.

* remove kSPARSE_WEIGHTS

* remove kFASTER_DYNAMIC_SHAPES_0805 and add 'TrtMajorVersion' function

1007690b

11 5月, 2023 2 次提交
- Z
  
  [inference][trt]add trt sparse weights switch (#53562) · 4a69a536
  由 Zhang Jun 提交于 5月 11, 2023
  
  4a69a536
- 张
  
  昇腾和寒武纪相关代码退场 npu相关代码退场2 (#53568) · 0d45ac73
  由张春乔提交于 5月 11, 2023
  
  0d45ac73
09 5月, 2023 1 次提交
- W
  
  Support trt cuda graph. (#53406) · ea0abf93
  由 Wilber 提交于 5月 09, 2023
  
  ea0abf93
27 4月, 2023 1 次提交
- Z
  
  xpu quant weight only (#53306) · 1c97aa69
  由 zhupengyang 提交于 4月 27, 2023
  
  1c97aa69
24 4月, 2023 1 次提交

remove some [-Wunused-parameter] (#53185) · 834eb2ba

由 Galaxy1458 提交于 4月 24, 2023

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test,test=develop

* test ,test=develop

834eb2ba

27 3月, 2023 1 次提交
- E
  add custom device mixed precision inference api (#50884) · a6449634
  由 engineer1109 提交于 3月 27, 2023
```
fix bug

remove useless

fix bug

add pybind

remove log

fix style

fix style

change api
```
  a6449634
03 1月, 2023 1 次提交

[Paddle Inference] Implement conv2d_fusion NHWC format using cutlass (#47989) · c123dd1e

由 zhoutianzi666 提交于 1月 03, 2023

* Implement conv2d_fusion NHWC format using CUTLASS
* Add unit testing for CUTLASS Conv in inference
* Add experimental API for CUTLASS.

c123dd1e

14 12月, 2022 1 次提交
- Y
  
  [Paddle Inference] rewrite convert_to_mixed_precision (#48853) · 28ea9aad
  由 Yuanle Liu 提交于 12月 14, 2022
  
  28ea9aad
13 12月, 2022 1 次提交
- E
  
  enable custom device save model on device memory && fix conflict (#48221) · b6aa9f53
  由 engineer1109 提交于 12月 13, 2022
  
  b6aa9f53
08 12月, 2022 1 次提交
- W
  
  [Inference] inference add cinn interface (#48741) · 3a387df6
  由 Wilber 提交于 12月 08, 2022
  
  3a387df6
06 12月, 2022 1 次提交
- Y
  
  [Paddle Inference] Add float_to_half_pass to support inference with mixed precision (#47993) · c5a45cc6
  由 Yuanle Liu 提交于 12月 06, 2022
  
  c5a45cc6
01 12月, 2022 1 次提交
- W
  [Inference] Optimize memory_optimize pass. (#48476) · aa892113
  由 Wilber 提交于 12月 01, 2022
```
* update memory_optimize pass
```
  aa892113
30 11月, 2022 1 次提交
- Y
  
  [Paddle Inference] clean unused code (#48392) · 5de01e8a
  由 Yuanle Liu 提交于 11月 30, 2022
  
  5de01e8a
16 11月, 2022 1 次提交
- C
  
  feat(ipu): add paddle inference support for model_runtime. (#47364) · 39c85064
  由 czr-gc 提交于 11月 16, 2022
  
  39c85064
14 11月, 2022 1 次提交
- E
  
  add lite opencl support api (#47112) · 798ab3f9
  由 engineer1109 提交于 11月 14, 2022
  
  798ab3f9
01 11月, 2022 1 次提交
- S
  
  [Lite][XPU] Upgrade lite subgraph api of xpu (#47373) · 8a1124b1
  由 shentanyue 提交于 11月 01, 2022
  
  8a1124b1
27 10月, 2022 1 次提交

[JIT] Add Predictor for JITLayer (#47379) · b160d09e

由 Aurelius84 提交于 10月 27, 2022

* add predictor_engine

* add predictor_engine

* fix zero shape

* fix lodTensor

* fix unittest

* fix code style

* update CmakeList

b160d09e

11 10月, 2022 1 次提交
- C
  Remove LoDTensor using in fluid (Part 1) (#46663) · 940d8f25
  由 Chen Weihang 提交于 10月 11, 2022
```
* remove using lodtensor part1

* polish history code format
```
  940d8f25
30 9月, 2022 1 次提交

[IPU] paddle-inference support custom-ops (#45235) · a6b4bee3

由 Allen Guo 提交于 9月 30, 2022

* paddle-inference support custom-ops
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>

* fix tolower
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>

a6b4bee3

29 9月, 2022 1 次提交
- Y
  Remove calibration file path when deploy quantize model (#46283) · d71f1b3f
  由 yeliang2258 提交于 9月 29, 2022
```
* remove calibration file path

* remove useless code
```
  d71f1b3f
22 9月, 2022 1 次提交
- Y
  
  TensorRT engine context memory sharing (#45842) · 173b39bb
  由 Yuanle Liu 提交于 9月 22, 2022
  
  173b39bb
05 9月, 2022 2 次提交

New format quant model support for MKLDNN (#45416) · 4e4f4586

由 yeliang2258 提交于 9月 05, 2022

* support onnx format quantized model

* update code

* add test

* add test

* fix

* fix test

* fix cmake

* update code

* change scale file path to calibration file path

* update code

* update code

* fix build bug

* fix build bugs

* fix

* fix

4e4f4586

Update DlNNE engine (#45027) · 638965c5

由 denglin-github 提交于 9月 05, 2022

* add config param for enable_dlnne and support calibration mode
* remove useless file
* refine code and add annotation
* refine code of Warnning tips

638965c5

05 8月, 2022 1 次提交

update trt workspace size param (#44469) · bdce552b

由 Zhang Jun 提交于 8月 05, 2022

* update trt workspace size param

* update

* update

* update

* use int64_t

* use int64_t

* upate

* update

bdce552b

08 7月, 2022 1 次提交
- W
  
  Inference support mixed-precision model [3] (#44057) · 7f958728
  由 Wilber 提交于 7月 08, 2022
  
  7f958728
05 7月, 2022 1 次提交
- R
  
  Remove header file including for boost (#44052) · 52607cf8
  由 Ruibiao Chen 提交于 7月 05, 2022
  
  52607cf8
29 6月, 2022 1 次提交
- W
  inference support mixed-precision model [1]. (#43814) · c7694b82
  由 Wilber 提交于 6月 29, 2022
```
* inference add convert to mixed model ability.
```
  c7694b82
24 6月, 2022 1 次提交
- W
  revert 40531 (#43807) · 7985407b
  由 Wilber 提交于 6月 24, 2022
```
* revert 40531

* update
```
  7985407b
02 6月, 2022 1 次提交
- W
  [Paddle-Inference] new general transformer inference support (#43077) · 2810dfea
  由 Wangzheee 提交于 6月 02, 2022
```
* new general transformer inference support
```
  2810dfea
14 4月, 2022 1 次提交

add mkldnn int8 pass [step3] (#41599) · 8e2d4d30

由 baoachun 提交于 4月 14, 2022

* add mkldnn int8 pass [step3]

* Add test for compute_propagate_scales_mkldnn_pass

* update pass

* update api comment and python api
Co-authored-by: Nwozna <joanna.wozna@intel.com>

8e2d4d30

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功