提交 · 720641725941b7e49b8926c65476c631ab6bef03 · PaddlePaddle / Paddle

22 7月, 2020 1 次提交

supports xpu runtime, test=develop (#25554) · 72064172

由石晓伟提交于 7月 22, 2020

* update ResetHolder, test=develop

* add TensorShare for lite engine, test=develop

* tensor data changed from copying to sharing, test=develop

* supports xpu runtime, test=develop

* fix code styles, test=develop

72064172

26 3月, 2020 1 次提交

[Paddle-TRT]: Ernie Dynamic shape support. (#23138) · 430b0099

由 Zhaolong Xing 提交于 3月 26, 2020

* add dynamic plugin support.
test=develop

* change emb eltwise layernorm to math function
test=develop

* add emb eltwise layernorm
test=develop

* can run dynamic shape ernie
test=develop

* fix ci
test=develop

* add ut for trt ernie dynamic

test=develop

* refine dynamic shape c++ interface.
test=develop

* fix comments
test=develop

* fix comments
test=develop

430b0099

09 3月, 2020 1 次提交

[Paddle-TRT] : (Part1) Dynamic shape support (#22868) · dd67d44a

由 Zhaolong Xing 提交于 3月 09, 2020

* change the ci trt from version 5. to 6.0

* paddle-trt dynamic shape support init

* conv+bias or conv+bn dynamic shape support
test=develop

* modity trt engine opconvert
test=develop

* fix ci error
test=develop

dd67d44a

24 2月, 2020 1 次提交

Add an inference interface to disable FC padding (#22097) · cdf5f6fb

由 GaoWei8 提交于 2月 24, 2020

* Add an interface of disabling FC padding
* fix bert regression
* polish fc padding interface
* recover pass function
* fix argument error
* fix mkldnn error

cdf5f6fb

04 2月, 2020 1 次提交
- 石
  
  remove anakin from code, test=develop (#22420) · e1b0d7cb
  由石晓伟提交于 2月 04, 2020
  
  e1b0d7cb
09 1月, 2020 1 次提交
- 石
  
  [Feature] Lite subgraph (#22114) · ad0dfb17
  由石晓伟提交于 1月 09, 2020
  
  ad0dfb17
04 12月, 2019 1 次提交
- P
  make config option DisableGlogInfo() able to mute all inference logs (#21318) · 122b37ce
  由 Pei Yang 提交于 12月 04, 2019
```
* make DisableGlogInfo able to mute all logs in inference. 
```
  122b37ce
25 9月, 2019 1 次提交

FIx C++ inference BUG: When open memory optim and enable trt subgraph at the... · e89b1288

由 Zhaolong Xing 提交于 9月 25, 2019

FIx C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969)

* fix memory optimization type
test=develop

* 1. fix BUG: open trt and memory optim will trigger bug.
2. Clean memory optim bug.
test=develop

e89b1288

19 8月, 2019 1 次提交

Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213) · 76c95af0

由 Zhaolong Xing 提交于 8月 19, 2019

* fix mask rcnn bug:
1. affine channel fuse (diff)
2. condition block op (memory leak)
3. merge lod tensor op (diff)
4. memroy optim (diff)
test=develop

* fix ci aboud PADDLE_ENFOCE
fix merge lod infer op ut
test=develop

76c95af0

11 7月, 2019 1 次提交

add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy (#18580) · 076f8331

由 Tao Luo 提交于 7月 11, 2019

* add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy

test=develop

* enhance MkldnnPostReset

test=develop

* add comments for mkldnn_cache_capacity field

test=develop

076f8331

08 7月, 2019 1 次提交

Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532) · 88b52a27

由 Zhaolong Xing 提交于 7月 08, 2019

* Fix Mask rcnn predictor
    1. refine memory optim algorithm to support the model with the block op.
    2. output diff : modify the affine channel fuse
    3. add condition_block_infer op
add interface for setting trt calib table dir
test=develop

* add the missing files.
test=develop

88b52a27

06 6月, 2019 1 次提交
- Z
  fix: when use the load model from memory mode, the RAM occupy is high (#17788) · ae576f3c
  由 Zhaolong Xing 提交于 6月 06, 2019
```
test=develop
```
  ae576f3c
25 5月, 2019 1 次提交

TRT: Support set dynamic range in int8 mode. (#17524) · 61221ebc

由 Zhaolong Xing 提交于 5月 25, 2019

* fluid int8 train and trt int8 predict align.
trt int8 predict init
op converter

* 2. align fluid int8 train and trt int8 inference.
enhance quant dequant fuse pass
enhance op converter, trt engine, trt engine op, trt subgraph pass.

* 3. add delete_quant_dequant_pass for trt

test=develop

* 4. add the missing file
test=develop

* 5. i modify the c++ interface, but forget to modify the pybind code
fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter
test=develop

61221ebc

07 5月, 2019 1 次提交

石

Cherry-pick benchmark related changes from release/1.4 (#17156) · a72dbe9a

由石晓伟提交于 5月 07, 2019

* cherry-pick commit from 88770542

* cherry-pick commit from 3f0b97df

* cherry-pick from 16691:Anakin subgraph support yolo_v3 and faster-rcnn

(cherry picked from commit 8643dbc2)

* Cherry-Pick from 16662 : Anakin subgraph cpu support

(cherry picked from commit 7ad182e1)

* Cherry-pick from 1662, 16797.. : add anakin int8 support

(cherry picked from commit e14ab180)

* Cherry-pick from 16813 : change singleton to graph RegistBlock
test=release/1.4

(cherry picked from commit 4b9fa423)

* Cherry Pick : 16837 Support ShuffleNet and MobileNet-v2

Support ShuffleNet and MobileNet-v2, test=release/1.4

(cherry picked from commit a6fb066f)

* Cherry-pick : anakin subgraph add opt config layout argument #16846
test=release/1.4

(cherry picked from commit 8121b3ec)

* 1. add shuffle_channel_detect

(cherry picked from commit 6efdea89)

* update shuffle_channel op convert, test=release/1.4

(cherry picked from commit e4726a06)

* Modify symbol export rules

test=develop

a72dbe9a

29 3月, 2019 1 次提交
- S
  
  resolve conflicts with the develop branch test=develop · bddb2cd3
  由 Shixiaowei02 提交于 3月 28, 2019
  
  bddb2cd3
25 3月, 2019 1 次提交
- W
  Move cpu_quantize_* passes into mkldnn subfolder · 46677fb0
  由 Wojciech Uss 提交于 3月 25, 2019
```
test=develop
```
  46677fb0
21 3月, 2019 1 次提交
- W
  Add enabling quantization (#16326) · cbe2dbf0
  由 Wojciech Uss 提交于 3月 21, 2019
```
* Add enabling quantization

test=develop

* remove unused (here) function
```
  cbe2dbf0
20 3月, 2019 3 次提交
- N
  
  cherry-pick from feature/anakin-engine: deal the changing shape when using anakin #16189 · a25331bc
  由 nhzlx 提交于 3月 20, 2019
  
  a25331bc
- N
  
  cherry-pick from feature/anakin-engine: add batch interface for pd-anakin #16178 · c79f06d3
  由 nhzlx 提交于 3月 20, 2019
  
  c79f06d3
- N
  cherry-pick from feature/anakin-engine: refine anakin subgraph. #16157 · 69d37f81
  由 nhzlx 提交于 3月 20, 2019
```
support change input size
```
  69d37f81
19 3月, 2019 1 次提交
- Z
  add allocator flags · 22715487
  由 zhhsplendid 提交于 3月 19, 2019
```
test=develop
```
  22715487
18 3月, 2019 1 次提交

Add cpu_quantize_pass for C-API quantization (#16127) · 2579ade4

由 Wojciech Uss 提交于 3月 18, 2019

* Add cpu_quantize_pass for C-API quantization

test=develop

* add cpu_quantize_pass test

* fix lint: add include memory unorderd_map and unordered_set

test=develop

* fuse_relu 1

test=develop

* tuned 2 without squash

* fixes

test=develop

* remove unused vars

test=develop

* refactored

test=develop

* fix lint c-style cast -> C++ style cast

test=develop

* remove QuantMax and c style casts

test=develop

* last usage of QuantMax removed

test=develop

* Fix Analysis Predictor UT

Check if memory_optimize_pass has already been added
to the analysis config before adding a new one, so
that it is not added multiple times.
test=develop

* change map to unordered_map

fix the forgotten part of cpu_quantize_pass_tester.cc

test=develop

* removed quantized attribute

* fixed cpu_quantize_pass_tester and op attr comments

test=develop

* removed redundant line

test=debug

* removed gmock

test=develop

* fix after merge

2579ade4

08 3月, 2019 3 次提交
- N
  cant not pass ci · 2891070c
  由 nhzlx 提交于 3月 07, 2019
```
add if use static engine for trt
test=develop
```
  2891070c
- N
  6. delete useless predictor id · 5863c861
  由 nhzlx 提交于 2月 26, 2019
```
test=develop
```
  5863c861
- N
  4. do the trt_engine optim during init. · 31008100
  由 nhzlx 提交于 2月 18, 2019
```
add simple static mode loading
test=develop
```
  31008100
07 3月, 2019 1 次提交
- N
  cant not pass ci · a9ed4277
  由 nhzlx 提交于 3月 07, 2019
```
add if use static engine for trt
test=develop
```
  a9ed4277
26 2月, 2019 1 次提交
- N
  6. delete useless predictor id · 0ed63b21
  由 nhzlx 提交于 2月 26, 2019
```
test=develop
```
  0ed63b21
18 2月, 2019 1 次提交
- N
  4. do the trt_engine optim during init. · 2070fb24
  由 nhzlx 提交于 2月 18, 2019
```
add simple static mode loading
test=develop
```
  2070fb24
29 1月, 2019 1 次提交
- Y
  
  AnalysisConfig remove contrib namespace (#15540) · 65517908
  由 Yan Chunwei 提交于 1月 29, 2019
  
  65517908
26 1月, 2019 1 次提交
- Y
  
  add dynamic memory optim (#15457) · e2818c86
  由 Yan Chunwei 提交于 1月 26, 2019
  
  e2818c86
25 1月, 2019 1 次提交
- N
  fix comments · 92cf4a4c
  由 nhzlx 提交于 1月 25, 2019
```
test=develop
```
  92cf4a4c
21 1月, 2019 1 次提交
- Y
  
  fea/infer memory optim2 (#14953) · 885c4e57
  由 Yan Chunwei 提交于 1月 21, 2019
  
  885c4e57
16 1月, 2019 1 次提交
- N
  add trt int8 calibration support · 312fe0ec
  由 nhzlx 提交于 1月 16, 2019
```
fix comments

test=develop
```
  312fe0ec
09 1月, 2019 1 次提交
- N
  add trt int8 support · 4e3522e5
  由 nhzlx 提交于 1月 09, 2019
```
test=develop
```
  4e3522e5
07 1月, 2019 1 次提交
- Y
  
  refactor tensorrt node teller (#15181) · 6ccf8685
  由 Yan Chunwei 提交于 1月 07, 2019
  
  6ccf8685
26 12月, 2018 1 次提交
- N
  add min_subgraph_size attr to tensorrt config · 71636e67
  由 nhzlx 提交于 12月 26, 2018
```
test=develop
```
  71636e67
08 12月, 2018 1 次提交

One possible solution to add flexibility for mkldnn placement pass (#14768) · 943ad478

由 bingyanghuang 提交于 12月 08, 2018

* Choose to turn on use_mkldnn attribute v1

* Fix mkldnn_op empty bug

* format change test=develop

* fix ci test=develop

* fix ci test and add test in dam test=develop

* add example to dam compare test test=develop

* review changes test=develop

943ad478

06 12月, 2018 2 次提交
- T
  update with comments · 743cb840
  由 Tao Luo 提交于 12月 06, 2018
```
test=develop
```
  743cb840
- T
  support loading from memory · 405b2486
  由 Tao Luo 提交于 12月 04, 2018
```
test=develop
```
  405b2486
16 11月, 2018 1 次提交

fix gpu load model · 4bf6817c

由 superjomn 提交于 11月 16, 2018

the parameters will load from CPUPlace, that will keep copying data
between CPU and GPU places.

test=develop

4bf6817c

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功