Commits · 9cbc1eff2dc11142893805c2260a4386e39ebfcd · Crayon鑫 / Paddle

17 Sep, 2019 1 commit

zerocopytensor support uint8, analysis config support profile, analysis... · 9cbc1eff

Authored by Pei Yang on Sep 17, 2019

zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)

9cbc1eff

03 Sep, 2019 1 commit

Add a pass to enable the use of cudnn (#19346) · c5548178

Authored by Yiqun Liu on Sep 03, 2019

* Add a interface to enable cudnn for inference.

* Add cudnn_placement_pass.
test=develop

* Set the default value of cudnn_enabled_op_types to null.
test=develop

* Write the common basic class, placement_pass_base, to refine the codes.
test=develop

* Call EnableCUDNN in unittest.
test=develop

* Refine cudnn_placement_pass tester.

* Enable the testing of cudnn_placement_pass in inference's unittest.
test=develop

* Add the check of op kernels.
test=develop

c5548178

31 Jul, 2019 1 commit

Trt fp16 support (#18860) · 61238d31

Authored by Zhaolong Xing on Jul 31, 2019

* Fix Mask rcnn predictor
    1. refine memory optim algorithm to support the model with the block op.
    2. output diff : modify the affine channel fuse
    3. add condition_block_infer op
add interface for setting trt calib table dir
test=develop

* add the missing files.
test=develop

* 1 add trt fp16 support
test=develop

61238d31

11 Jul, 2019 1 commit

add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy (#18580) · 076f8331

Authored by Tao Luo on Jul 11, 2019

* add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy

test=develop

* enhance MkldnnPostReset

test=develop

* add comments for mkldnn_cache_capacity field

test=develop

076f8331

08 Jul, 2019 1 commit

Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532) · 88b52a27

Authored by Zhaolong Xing on Jul 08, 2019

* Fix Mask rcnn predictor
    1. refine memory optim algorithm to support the model with the block op.
    2. output diff : modify the affine channel fuse
    3. add condition_block_infer op
add interface for setting trt calib table dir
test=develop

* add the missing files.
test=develop

88b52a27

06 Jun, 2019 2 commits

fix: when use the load model from memory mode, the RAM occupy is high (#17788) · ae576f3c
Authored by Zhaolong Xing on Jun 06, 2019
```
test=develop
```
ae576f3c

INT8 MKL-DNN v2 integrate to slim (#17634) · 993c703b

Authored by 翟飞跃 on Jun 06, 2019

* refactor PR 16865

* delete mergetool files

* test=develop

* test=develop

* test=develop

* test=develop

* create dir for int8 model before call SaveOptimModel

* test=develop

* mkldnn int8 only support linux; test=develop

* refine code; test=develop

* remove comment; test=develop

* refine code; test=develop

* fix bug; test=develop

* add exception for mkldnn_post_training_strategy

* reuse int8v2 CAPI dataset; test=develop

* fix accuracy check bug; test=develop

* remove tab

* convert files to unix format

* test=develop

* reduce CI time;test=develop

* reduce CI time and refine code;test=develop

* refine comment; test=develop

* add cmake FLAGS;test=develop

* remove predict_num;test=develop

993c703b

29 May, 2019 1 commit
- Capi for a ngraph engine (#17037) · 5eb81fe5
  Authored by mozga-intel on May 28, 2019
  
  5eb81fe5
25 May, 2019 1 commit

TRT: Support set dynamic range in int8 mode. (#17524) · 61221ebc

Authored by Zhaolong Xing on May 25, 2019

* fluid int8 train and trt int8 predict align.
trt int8 predict init
op converter

* 2. align fluid int8 train and trt int8 inference.
enhance quant dequant fuse pass
enhance op converter, trt engine, trt engine op, trt subgraph pass.

* 3. add delete_quant_dequant_pass for trt

test=develop

* 4. add the missing file
test=develop

* 5. i modify the c++ interface, but forget to modify the pybind code
fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter
test=develop

61221ebc

07 May, 2019 1 commit

Cherry-pick benchmark related changes from release/1.4 (#17156) · a72dbe9a

Authored by 石晓伟 on May 07, 2019

* cherry-pick commit from 88770542

* cherry-pick commit from 3f0b97df

* cherry-pick from 16691:Anakin subgraph support yolo_v3 and faster-rcnn

(cherry picked from commit 8643dbc2)

* Cherry-Pick from 16662 : Anakin subgraph cpu support

(cherry picked from commit 7ad182e1)

* Cherry-pick from 1662, 16797.. : add anakin int8 support

(cherry picked from commit e14ab180)

* Cherry-pick from 16813 : change singleton to graph RegistBlock
test=release/1.4

(cherry picked from commit 4b9fa423)

* Cherry Pick : 16837 Support ShuffleNet and MobileNet-v2

Support ShuffleNet and MobileNet-v2, test=release/1.4

(cherry picked from commit a6fb066f)

* Cherry-pick : anakin subgraph add opt config layout argument #16846
test=release/1.4

(cherry picked from commit 8121b3ec)

* 1. add shuffle_channel_detect

(cherry picked from commit 6efdea89)

* update shuffle_channel op convert, test=release/1.4

(cherry picked from commit e4726a06)

* Modify symbol export rules

test=develop

a72dbe9a

29 Mar, 2019 1 commit
- resolve conflicts with the develop branch test=develop · bddb2cd3
  Authored by Shixiaowei02 on Mar 28, 2019
  
  bddb2cd3
28 Mar, 2019 1 commit

C-API quantization core 2 (#16396) · 09dfc7a2

Authored by Wojciech Uss on Mar 27, 2019

* C-API quantization core

test=develop
Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com>

* Decouple Quantizer from AnalysisPredictor

test=develop

* fixes after review

test=develop

* renamed mkldnn quantize stuff

test=develop

* remove ifdef from header file

test=develop

09dfc7a2

20 Mar, 2019 4 commits
- git cherry-pick from feature/anakin-engine: update anakin subgraph #16278 · 07dcf285
  Authored by nhzlx on Mar 20, 2019
  
  07dcf285
- cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189 · a25331bc
  Authored by nhzlx on Mar 20, 2019
  
  a25331bc
- cherry-pick from feature/anakin-engine: add batch interface for pd-anakin #16178 · c79f06d3
  Authored by nhzlx on Mar 20, 2019
  
  c79f06d3
- cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157 · 69d37f81
  Authored by nhzlx on Mar 20, 2019
```
support change input size
```
  69d37f81
19 Mar, 2019 2 commits
- add allocator flags · 22715487
  Authored by zhhsplendid on Mar 19, 2019
```
test=develop
```
  22715487
- Revert "cache runtime_context" · 7d2740db
  Authored by Tao Luo on Mar 19, 2019
  
  7d2740db
15 Mar, 2019 1 commit
- set enable_runtime_context_cache_ default false · 5ecdc49c
  Authored by luotao1 on Mar 15, 2019
```
test=develop
```
  5ecdc49c
13 Mar, 2019 1 commit
- add runtime_context_cache_pass · d94fd972
  Authored by luotao1 on Mar 13, 2019
```
test=develop
```
  d94fd972
08 Mar, 2019 1 commit
- cant not pass ci · 2891070c
  Authored by nhzlx on Mar 07, 2019
```
add if use static engine for trt
test=develop
```
  2891070c
07 Mar, 2019 1 commit
- cant not pass ci · a9ed4277
  Authored by nhzlx on Mar 07, 2019
```
add if use static engine for trt
test=develop
```
  a9ed4277
21 Feb, 2019 1 commit
- fix typo releated->related · 543e53db
  Authored by Sylwester Fraczek on Feb 21, 2019
  
  543e53db
31 Jan, 2019 1 commit
- fix ir debug config (#15571) · e887d719
  Authored by Yan Chunwei on Jan 31, 2019
  
  e887d719
29 Jan, 2019 1 commit
- AnalysisConfig remove contrib namespace (#15540) · 65517908
  Authored by Yan Chunwei on Jan 29, 2019
  
  65517908
26 Jan, 2019 1 commit
- add dynamic memory optim (#15457) · e2818c86
  Authored by Yan Chunwei on Jan 26, 2019
  
  e2818c86
21 Jan, 2019 1 commit
- fea/infer memory optim2 (#14953) · 885c4e57
  Authored by Yan Chunwei on Jan 21, 2019
  
  885c4e57
16 Jan, 2019 1 commit
- add trt int8 calibration support · 312fe0ec
  Authored by nhzlx on Jan 16, 2019
```
fix comments

test=develop
```
  312fe0ec
09 Jan, 2019 1 commit
- add trt int8 support · 4e3522e5
  Authored by nhzlx on Jan 09, 2019
```
test=develop
```
  4e3522e5
08 Jan, 2019 1 commit
- make inference api work with Doxygen (#15195) · d09d6ead
  Authored by Yan Chunwei on Jan 08, 2019
  
  d09d6ead
07 Jan, 2019 1 commit
- refactor inference analysis api (#14634) · 875a07c3
  Authored by Yan Chunwei on Jan 07, 2019
  
  875a07c3
26 Dec, 2018 1 commit
- add min_subgraph_size attr to tensorrt config · 71636e67
  Authored by nhzlx on Dec 26, 2018
```
test=develop
```
  71636e67
08 Dec, 2018 1 commit

One possible solution to add flexibility for mkldnn placement pass (#14768) · 943ad478

Authored by bingyanghuang on Dec 08, 2018

* Choose to turn on use_mkldnn attribute v1

* Fix mkldnn_op empty bug

* format change test=develop

* fix ci test=develop

* fix ci test and add test in dam test=develop

* add example to dam compare test test=develop

* review changes test=develop

943ad478

06 Dec, 2018 2 commits
- update with comments · 743cb840
  Authored by Tao Luo on Dec 06, 2018
```
test=develop
```
  743cb840
- support loading from memory · 405b2486
  Authored by Tao Luo on Dec 04, 2018
```
test=develop
```
  405b2486
23 Nov, 2018 1 commit
- add SetMKLDNNThreadId api · a5c4b463
  Authored by luotao1 on Nov 22, 2018
  
  a5c4b463
15 Nov, 2018 1 commit

Refine tester of TensorRT engine (#14390) · 9e6b1c5f

Authored by Yiqun Liu on Nov 15, 2018

* Refine the tester for MixedRTPredictor.
test=develop

* Enable the profiler in TensorRT engine.

* Support the use of combined inference model in TensorRT unittest, and print the shape of feed targets.

9e6b1c5f

14 Nov, 2018 1 commit
- Combine Inference Analysis with IR (#13914) · 9f252e00
  Authored by Yan Chunwei on Nov 14, 2018
  
  9f252e00
