提交 · c7694b82eb47eedaeb5866d90d104aa0d747af31 · 机器未来 / Paddle

29 6月, 2022 1 次提交
- W
  inference support mixed-precision model [1]. (#43814) · c7694b82
  由 Wilber 提交于 6月 29, 2022
```
* inference add convert to mixed model ability.
```
  c7694b82
24 6月, 2022 1 次提交
- W
  revert 40531 (#43807) · 7985407b
  由 Wilber 提交于 6月 24, 2022
```
* revert 40531

* update
```
  7985407b
05 6月, 2022 1 次提交
- S
  
  【code format check upgrade】 step2：clang-format (#42840) · a3730dc8
  由 Sing_chan 提交于 6月 05, 2022
  
  a3730dc8
02 6月, 2022 1 次提交
- W
  [Paddle-Inference] new general transformer inference support (#43077) · 2810dfea
  由 Wangzheee 提交于 6月 02, 2022
```
* new general transformer inference support
```
  2810dfea
30 5月, 2022 1 次提交
- S
  [TensorRT] Fix delete fill_constant pass (#43053) · 1448520d
  由 shentanyue 提交于 5月 30, 2022
```
* update lite compile cmake

* Update delete_fill_constant_op_pass.cc

* Update analysis_config.cc
```
  1448520d
14 4月, 2022 1 次提交

add mkldnn int8 pass [step3] (#41599) · 8e2d4d30

由 baoachun 提交于 4月 14, 2022

* add mkldnn int8 pass [step3]

* Add test for compute_propagate_scales_mkldnn_pass

* update pass

* update api comment and python api
Co-authored-by: Nwozna <joanna.wozna@intel.com>

8e2d4d30

31 3月, 2022 1 次提交

add flatten2,reshape2,squueze2_trt_fuse_pass test cast (#41031) · 7ef69202

由 heliqi 提交于 3月 31, 2022

* add flatten2,reshape2,squueze2_trt_fuse_pass  test cast

* add flatten2,reshape2,squueze2_trt_fuse_pass  test cast

* add flatten2,reshape2,squueze2_trt_fuse_pass  test cast

7ef69202

17 3月, 2022 1 次提交
- B
  
  support gpu mixed precision inference (#40531) · 06fee998
  由 baoachun 提交于 3月 17, 2022
  
  06fee998
22 2月, 2022 1 次提交
- W
  [Paddle-Inference] fix pass and convert_op for preln_ernie (#39733) · 574f3402
  由 Wangzheee 提交于 2月 22, 2022
```
* fix pass and convert_op for preln_ernie and add preln_ernie'flag in pass
```
  574f3402
11 2月, 2022 1 次提交
- L
  
  Add TensorRT inspector into Paddle-TRT (#38362) · 69793a27
  由 Leo Chen 提交于 2月 11, 2022
  
  69793a27
13 1月, 2022 1 次提交
- W
  [Paddle-Inference] add Paddle Trt config: with_interleaved (#38884) · dccdc719
  由 Wangzheee 提交于 1月 13, 2022
```
* add Paddle Trt config: with_interleaved
```
  dccdc719
27 10月, 2021 1 次提交
- W
  
  enable trt test check and fix trt ut error（3/3） (#36581) · 8c1c72af
  由 Wilber 提交于 10月 27, 2021
  
  8c1c72af
22 10月, 2021 1 次提交
- W
  
  support lite xpu choose device id (#36610) · f46311b0
  由 Wilber 提交于 10月 22, 2021
  
  f46311b0
14 10月, 2021 1 次提交
- P
  
  clean inference logs when config.DisableGlogInfo is triggered (#36356) · 7f5128f4
  由 Pei Yang 提交于 10月 14, 2021
  
  7f5128f4
22 9月, 2021 1 次提交
- J
  
  [Inference] Support NNAdapter and ascend310 (#35226) · 10e53044
  由 JingZhuangzhuang 提交于 9月 22, 2021
  
  10e53044
14 9月, 2021 1 次提交
- W
  
  [Inference] Add tuned trt_dynamic_shape mode. (#34806) · 7c96efed
  由 Wilber 提交于 9月 14, 2021
  
  7c96efed
30 4月, 2021 1 次提交
- P
  
  remove check for optim_cache_dir in trt slim int8 (#32676) · c6713bc0
  由 Pei Yang 提交于 4月 30, 2021
  
  c6713bc0
25 4月, 2021 2 次提交

W

update lite subgraph api. (#32513) · 92dc9b2b
由 Wilber 提交于 4月 25, 2021

92dc9b2b

Nne integration (#32255) · feb2e476

由 denglin-github 提交于 4月 25, 2021

* Add dlnne engine runtime

* Fix log

* Remove <const_cast> and remove unrelated modify with dlnne, +clang-format

* Fix CMakeList format error

* Add copyright message

* Fix dlnne CMakeList.txt

* Add some paddlepaddle_pass to support more networks

* Fix some format bug

feb2e476

02 3月, 2021 1 次提交

support trt serialize when load model from memory (#31342) · 6404c438

由 Shang Zhizhou 提交于 3月 02, 2021

* support trt serialize when load model from memory

* delete conv_bn_fuse_pass before tensorrt, with which trt serialize engine id is not stable

* Revert "delete conv_bn_fuse_pass before tensorrt, with which trt serialize engine id is not stable"

performance degradation, fix in the future

This reverts commit fa6cd17e60b15df351efda379ddd00e9e9c1fea9.

* add delete conv_bn

* delete path when delete_cache_files

6404c438

18 2月, 2021 1 次提交
- P
  
  add trt transpose and flatten converter (#31022) · 9b54fe41
  由 Pei Yang 提交于 2月 18, 2021
  
  9b54fe41
25 1月, 2021 1 次提交

add DLA support：C++&&Python api (#30165) · ae0f88a9

由 Shang Zhizhou 提交于 1月 25, 2021

* add dla

* add dla done

* add python api
Co-authored-by: Nshangzhizhou <root@szth-rp-fanyi-opera49.szth.baidu.com>

ae0f88a9

06 1月, 2021 1 次提交

add inference api： DisableTensorRtOps (#30109) · 05b27695

由 Shang Zhizhou 提交于 1月 06, 2021

* snap

* add inference api: DisableTensorRtOPs

* fix code style

* update api to experimental

* update variable name

05b27695

06 11月, 2020 1 次提交
- J
  Add bfloat16 softmax and gelu (#28394) · 7821759d
  由 joanna.wozna.intel 提交于 11月 06, 2020
```
* Add bfloat16 softmax and gelu

* Add pass attr bfloat16_enabled_op_types

* Changes from review
```
  7821759d
03 11月, 2020 1 次提交

TensorRT中ernie模型推理性能优化，支持变长输入 (#28367) · ea851796

由 Shang Zhizhou 提交于 11月 03, 2020

* fp16 result ok

* change -DWITH_NVINFER_PLUGIN toconfig.EnableTensorRtOSS

* auto detect special slice op converter for ernie with trt oss

* ernie oss only support fp16

* fix special_slice_plugin serialize bug

* matmul in tensorrt ok

* ernie unittest ok

* add matmul tensorrt unittest

* remove demo code

ea851796

16 9月, 2020 1 次提交
- W
  
  Enhance infer error info message (#26731) · dae62556
  由 Wilber 提交于 9月 16, 2020
  
  dae62556
11 9月, 2020 1 次提交
- W
  
  Lite subgraph refine predictor (#27167) · 1b84c0bf
  由 Wilber 提交于 9月 11, 2020
  
  1b84c0bf
22 7月, 2020 1 次提交

石

supports xpu runtime, test=develop (#25554) · 72064172

由石晓伟提交于 7月 22, 2020

* update ResetHolder, test=develop

* add TensorShare for lite engine, test=develop

* tensor data changed from copying to sharing, test=develop

* supports xpu runtime, test=develop

* fix code styles, test=develop

72064172

09 4月, 2020 1 次提交

Remove: NGraph engine from PDPD repository (#23545) · 3baaee9a

由 mozga-intel 提交于 4月 09, 2020

* Remove the NGraph engine from PDPD repository
1. Each operator was removed from the operator's directory
2. Each test was removed from the unittest directory
3. The parallel executor support was removed from the PDPD
4. The CMake file was removed from the PDPD
5. The NG flags were removed from the repository
test=develop

* Remove ngraph from:
1. Cmake file
2. Python file
test=develop

3baaee9a

26 3月, 2020 1 次提交

[Paddle-TRT]: Ernie Dynamic shape support. (#23138) · 430b0099

由 Zhaolong Xing 提交于 3月 26, 2020

* add dynamic plugin support.
test=develop

* change emb eltwise layernorm to math function
test=develop

* add emb eltwise layernorm
test=develop

* can run dynamic shape ernie
test=develop

* fix ci
test=develop

* add ut for trt ernie dynamic

test=develop

* refine dynamic shape c++ interface.
test=develop

* fix comments
test=develop

* fix comments
test=develop

430b0099

09 3月, 2020 1 次提交

[Paddle-TRT] : (Part1) Dynamic shape support (#22868) · dd67d44a

由 Zhaolong Xing 提交于 3月 09, 2020

* change the ci trt from version 5. to 6.0

* paddle-trt dynamic shape support init

* conv+bias or conv+bn dynamic shape support
test=develop

* modity trt engine opconvert
test=develop

* fix ci error
test=develop

dd67d44a

24 2月, 2020 1 次提交

Add an inference interface to disable FC padding (#22097) · cdf5f6fb

由 GaoWei8 提交于 2月 24, 2020

* Add an interface of disabling FC padding
* fix bert regression
* polish fc padding interface
* recover pass function
* fix argument error
* fix mkldnn error

cdf5f6fb

04 2月, 2020 1 次提交
- 石
  
  remove anakin from code, test=develop (#22420) · e1b0d7cb
  由石晓伟提交于 2月 04, 2020
  
  e1b0d7cb
09 1月, 2020 1 次提交
- 石
  
  [Feature] Lite subgraph (#22114) · ad0dfb17
  由石晓伟提交于 1月 09, 2020
  
  ad0dfb17
07 1月, 2020 1 次提交
- Y
  Remove subgraph_detector from inference/analysis to the common framework/ir directory. (#22094) · b1401fb7
  由 Yiqun Liu 提交于 1月 07, 2020
```
test=develop
```
  b1401fb7
03 1月, 2020 1 次提交
- M
  
  [DNNL] 3D Fully-Connected (#21746) · 61921084
  由 Michał Gallus 提交于 1月 03, 2020
  
  61921084
04 12月, 2019 1 次提交
- P
  make config option DisableGlogInfo() able to mute all inference logs (#21318) · 122b37ce
  由 Pei Yang 提交于 12月 04, 2019
```
* make DisableGlogInfo able to mute all logs in inference. 
```
  122b37ce
26 11月, 2019 1 次提交

Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972) · 234060f8

由 GaoWei8 提交于 11月 26, 2019

* Add fc padding to solve mkl performance
test=develop

* fix gpu pass and error information
test=develop

* fix fc_fuse_pass_test
test=develop

* fix error information
test=develop

* fix error information
test=develop

* fix name and add fc op padding test
test=develop

* fix attributes
test=develop

* optimize fc padding
test=develop

* fix test
test=develop

234060f8

03 9月, 2019 1 次提交

A a pass to enable the use of cudnn (#19346) · c5548178

由 Yiqun Liu 提交于 9月 03, 2019

* Add a interface to enable cudnn for inference.

* Add cudnn_placement_pass.
test=develop

* Set the default value of cudnn_enabled_op_types to null.
test=develop

* Write the common basic class, placement_pass_base, to refine the codes.
test=develop

* Call EnableCUDNN in unittest.
test=develop

* Refine cudnn_placement_pass tester.

* Enable the testing of cudnn_placement_pass in inference's unittest.
test=develop

* Add the check of op kernels.
test=develop

c5548178

31 7月, 2019 1 次提交

Trt fp16 support (#18860) · 61238d31

由 Zhaolong Xing 提交于 7月 31, 2019

* Fix Mask rcnn predictor
    1. refine memory optim algorithm to support the model with the block op.
    2. output diff : modify the affine channel fuse
    3. add condition_block_infer op
add interface for setting trt calib table dir
test=develop

* add the missing files.
test=develop

* 1 add trt fp16 support
test=develop

61238d31

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致