提交 · 0a51098a715fd648d8ff9cf95f48f5db6ba3ebf9 · BaiXuePrincess / Paddle

06 1月, 2020 1 次提交

Add TRT support for BERT (#21135) · 0a51098a

由 Pei Yang 提交于 1月 06, 2020

* add gelu plugin

* align trt bert with gpu

* add support for fused fc with relu,

* add unittest for bert trt

0a51098a

24 12月, 2019 1 次提交
- L
  change qat_performance with mobilenet, change batch_size of qat2_resnet50 (#21895) · 9dff56e8
  由 lidanqing 提交于 12月 24, 2019
```
test=develop
```
  9dff56e8
16 12月, 2019 1 次提交
- M
  Re-anble vgg and resnet101 models download (#21713) · a5159d84
  由 Michał Gallus 提交于 12月 16, 2019
```
test=develop
```
  a5159d84
12 12月, 2019 1 次提交

Add reshape int8 mkldnn op (#21428) · d419b859

由 joanna.wozna.intel 提交于 12月 12, 2019

* Add reshape int8 op

test=develop

* Change test to CPUPlace

test=develop

* Correct tests

test=develop

d419b859

10 12月, 2019 2 次提交

MKL-DNN 1.0 Update (#20162) · e81f0228

由 Adam 提交于 12月 10, 2019

* MKLDNN v1.0 rebase to Paddle 1.6
test=develop

* Add hacky paddle::string::to_string() implementation

* vectorize<int64-t>() -> vectorize() cleanup
test=develop

* PADDLE_ENFORCE and void_cast fixes
test=develop

* Rebase changes
test=develop

* Cosmetics
test=develop

* Delete MKL from mkldnn.cmake
test=develop

* CMake debug commands
test=develop

* Delete MKLDNN_VERBOSE and rebase fixes
test=develop

* Rebase fixes
test=develop

* Temporarily disable int8 resnet101 vgg16 and vgg19 tests
test=develop

* Add libmkldnn.so.1 to python setup
test=develop

* Add libmkldnn.so.1 to inference_lib cmake after rebase
test=develop

* Post rebase fixes + FC int8 changes
test=develop

* Fix LRN NHWC
test=develop

* Fix NHWC conv3d
test=develop

* Windows build fix + next conv3d fix
test=develop

* Fix conv2d on AVX2 machines
test=develop

e81f0228

R
fix: fail to call ZeroCopyTensor::mutable_data() when device_id is no… (#21461) · 7f5d532a
由 rensilin 提交于 12月 10, 2019
```
* ZeroCopyTensor::mutable_data in the right device, test=develop

* add unittest for zerocopy, test=develop
```
7f5d532a

09 12月, 2019 1 次提交

QAT Int8 document (#21360) · fbf9eca0

由 lidanqing 提交于 12月 09, 2019

* update benchmark for int8v2, QAT1, QAT2 accuracy and performance
test=document_fix

* change according to reviews
test=develop test=document_fix

* improve some descriptions and some models
test=develop test=document_fix

* update models benchmark data
test=develop test=document_fix

* update int8v2 and qat2 performance
test=develop test=document_fix

fbf9eca0

03 12月, 2019 1 次提交
- G
  Add ernie large c++ inference test (#21365) · 250a1921
  由 GaoWei8 提交于 12月 03, 2019
```
* add ernie-large test
test=develop

* add ernie large c++ inference test
test=develop
```
  250a1921
02 12月, 2019 1 次提交

fix -Wno-error=sign-compare warning in gcc8 (#21434) · 01fa4ead

由 Tao Luo 提交于 12月 02, 2019

* fix -Wno-error=sign-compare warning in gcc8

test=develop

* fix warning in distributed codes

test=develop

01fa4ead

28 11月, 2019 1 次提交

Fp32 vs int8 qat C++ performance (#21244) · c0aa1367

由 lidanqing 提交于 11月 28, 2019

* add ut for comparing FP32 and QAT INT8

* add save qat transformed model python script
test=develop

* updated

* added missing file

* add "with_label"
test=develop

* performance benchmark as unit test
test=develop

* change names of unnecessary thing

* Change CMakeList.txt for model downloading and UT
test=develop

* change names of functions and params for more readable code
test=develop

* Change PADDLE_ENFORCE messages
test=develop

* fix indent problems
test=develop

* indent problems
test=develop

c0aa1367

26 11月, 2019 1 次提交

Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972) · 234060f8

由 GaoWei8 提交于 11月 26, 2019

* Add fc padding to solve mkl performance
test=develop

* fix gpu pass and error information
test=develop

* fix fc_fuse_pass_test
test=develop

* fix error information
test=develop

* fix error information
test=develop

* fix name and add fc op padding test
test=develop

* fix attributes
test=develop

* optimize fc padding
test=develop

* fix test
test=develop

234060f8

20 11月, 2019 1 次提交

Fix the CAPI ZeroCopy shape error and reuse the code to get output (#21240) · 3cb6c0a0

由 liu zhengxi 提交于 11月 20, 2019

* fix the CAPI ZeroCopy shape error and reconstruct the output obtain

* use an anonymous namespace to cover the functor

* fix unit tests because of the output of typeid(T).name() is different from linux and windows, test=develop

3cb6c0a0

18 11月, 2019 1 次提交
- Z
  TRT int8: refine trt int8 for dynamic range set (#21112) · 65f70525
  由 Zhaolong Xing 提交于 11月 18, 2019
```
* refine trt int8 for dynamic range set
test=develop

* refine trt int8
test=develop
```
  65f70525
15 11月, 2019 1 次提交

fix cmake fails on inference_download_and_uncompress (#21185) · a9d4eed3

由 GaoWei8 提交于 11月 15, 2019

* solve cmake fails on inference_download_and_uncompress
test=develop

* solve cmake fails on inference_download_and_uncompress
test=develop

a9d4eed3

14 11月, 2019 1 次提交

Add relative error measure when (value > 1) (#21144) · d74ea085

由 Adam 提交于 11月 14, 2019

* Add relative error measure when value > 1
test=develop

* Move code to CheckError function
test=develop

d74ea085

08 11月, 2019 2 次提交

Add transpose2 INT8 for mkl-dnn (#19424) · 77c20835

由 joanna.wozna.intel 提交于 11月 08, 2019

* Add transpose2 INT8 for mkl-dnn

test=develop

* Fix test_transpose_int8_mkldnn

test=develop

* Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"

This reverts commit 34011bdb, reversing
changes made to 2ce6473f.

* Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2""

This reverts commit 23754dd7.

* Add template to TransposeMKLDNNHandler

test=develop

* Resolve conflict

test=develop

* Restore get_size and refactor

test=develop

77c20835

Add ernie c++ inference test (#21015) · 829bf871

由 GaoWei8 提交于 11月 08, 2019

* Add ernie unit test
test=develop

* Add ernie unit test
test=develop

* Add ernie unit test
test=develop

* remove ngraph

* optimize gpu test
test=develop

* optimize codes
test=develop

829bf871

23 10月, 2019 1 次提交

Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and... · e89c16b9

由 Pei Yang 提交于 10月 23, 2019

Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733)

* fix pool2d trt converter, test=develop

* add fix for split op converter, test=develop

e89c16b9

20 10月, 2019 1 次提交
- B
  
  update int8 benchmark with 6271 data, test=develop test=document_fix (#20736) · fd49ebcb
  由 bingyanghuang 提交于 10月 20, 2019
  
  fd49ebcb
18 10月, 2019 1 次提交
- L
  alter the capi of PD_PredictorRun to provide proper function, test=develop (#20697) · d39777fe
  由 liu zhengxi 提交于 10月 18, 2019
```
modify the way to pass parameter out_size in function. 
```
  d39777fe
16 10月, 2019 1 次提交
- L
  
  Add document for int8 object detection quantization (#19356) · 57b656f9
  由 lidanqing 提交于 10月 16, 2019
  
  57b656f9
15 10月, 2019 1 次提交

fix the PD_ZeroCopyPredictorRun output problem (#20612) · 922d4324

由 liu zhengxi 提交于 10月 15, 2019

* fix the PD_ZeroCopyPredictorRun output problem and add some checks and logs for users

* modify the cmakelists depends and fix the cmakelists problem

922d4324

14 10月, 2019 2 次提交
- B
  
  Modify the helper information in full_pascalvoc_test_preprocess.py (#20475) · 85e1f215
  由 bingyanghuang 提交于 10月 14, 2019
  
  85e1f215
- P
  
  add DisableGlogInfo() to AnalysisConfig, test=develop (#20581) · 443f604c
  由 Pei Yang 提交于 10月 14, 2019
  
  443f604c
05 10月, 2019 1 次提交

Add capi for fluid inference api (#20092) · 301eeb5b

由 liu zhengxi 提交于 10月 05, 2019

* add capi for fluid inference api, including AnalysisConfig, AnalysisPredictor, PaddleBuf, PaddleTensor, ZeroCopyTensor

301eeb5b

25 9月, 2019 2 次提交

FIx C++ inference BUG: When open memory optim and enable trt subgraph at the... · e89b1288

由 Zhaolong Xing 提交于 9月 25, 2019

FIx C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969)

* fix memory optimization type
test=develop

* 1. fix BUG: open trt and memory optim will trigger bug.
2. Clean memory optim bug.
test=develop

e89b1288

Removing length dims constraints of seq_pad and seq_unpad (#19497) · 99a9615a

由 Aurelius84 提交于 9月 25, 2019

* Removing last dims constraints of seq_pad and seq_unpad test=develop

* fix test_layer api code test=develop

* fix sequence_pad_op.cc conflict test=develop

* remove test_analyzer_mm_dnn test=develop

* fix vectorize bug test=develop

* fix vectorize<int> test=develop

99a9615a

21 9月, 2019 2 次提交
- P
  Add two extra flags for test_analyzer_int8_image_classification to disable fp32/int8 (#19840) · 2c5c6365
  由 pawelpiotrowicz 提交于 9月 21, 2019
```
test=develop
```
  2c5c6365
- P
  Fix BUGS: paddle-TRT repeatedly sets weight_map and overdeletes repetitive_params (#19825) · 74812d1c
  由 Pei Yang 提交于 9月 21, 2019
```
* fix trt bugs when sharing params, test=develop

* add unittest for cascade_rcnn
```
  74812d1c
17 9月, 2019 1 次提交

zerocopytensor support uint8, analysis config support profile, analysis... · 9cbc1eff

由 Pei Yang 提交于 9月 17, 2019

zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)

9cbc1eff

16 9月, 2019 1 次提交

Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733) · c67c8758

由 Yiqun Liu 提交于 9月 16, 2019

* Refine the codes related to fc op.

* Add GPU implementation for fc functor.

* Apply fc_fuse_pass in GPU inference.
test=develop

* Change the cmake for fc op.

* Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ.

* Add an attribute to set the activation type in fc_op.

* Enhance the unittest of fc_op.
test=develop

* Remove the declaration of FCOpGrad back to the header file.
test=develop

* Set default value for newly added arguments in test_fc_op.
test=develop

* Enhance fc_fuse_pass to enable fusing relu.

* Allow print the shapes of var_desc in graph.
test=develop

* Enhance fc_fuse_pass_tester.

* Remove the use of PADDLE_ENFORCE.
test=develop

* Correct the number of ops after fusing.
test=develop

* Fix a typo.
test=develop

* Set activation_type to null when there is no relu in fc.
test=develop

* Refine fc_fuse_pass's codes.

* Enable the set of shape for tensor.

* Refine repeated_fc_relu_pass and add unittest.
test=develop

c67c8758

03 9月, 2019 1 次提交

A a pass to enable the use of cudnn (#19346) · c5548178

由 Yiqun Liu 提交于 9月 03, 2019

* Add a interface to enable cudnn for inference.

* Add cudnn_placement_pass.
test=develop

* Set the default value of cudnn_enabled_op_types to null.
test=develop

* Write the common basic class, placement_pass_base, to refine the codes.
test=develop

* Call EnableCUDNN in unittest.
test=develop

* Refine cudnn_placement_pass tester.

* Enable the testing of cudnn_placement_pass in inference's unittest.
test=develop

* Add the check of op kernels.
test=develop

c5548178

22 8月, 2019 1 次提交

add local user data conversion into full_pascalvoc_test_preprocess.py (#19283) · 9240e532

由 lidanqing 提交于 8月 22, 2019

* add local user data conversion into full_pascalvoc_test_preprocess.py
test=develop

* change PADDLE_ENFORCE to PADDLE_ENFORCE_GE
test=develop

* change according to reviews
test=develop

9240e532

15 8月, 2019 1 次提交

Fix mAP problem in unit test of int8 object detection test (#18946) · 07a4d8f8

由 lidanqing 提交于 8月 15, 2019

* change the top1 comparison to mAP comparison
test=develop

* change the mobilenet-ssd tester demo data and batch_size settings
test=develop

07a4d8f8

30 7月, 2019 1 次提交

Revert "use static variable to do cache instead of thread local in thread... · 10eeed93

由 Leo Zhao 提交于 7月 30, 2019

Revert "use static variable to do cache instead of thread local in thread frequent switching case (#18428)" (#18879)

This reverts commit ce38bb53.

test=develop

10eeed93

11 7月, 2019 1 次提交

add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy (#18580) · 076f8331

由 Tao Luo 提交于 7月 11, 2019

* add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy

test=develop

* enhance MkldnnPostReset

test=develop

* add comments for mkldnn_cache_capacity field

test=develop

076f8331

08 7月, 2019 2 次提交
- L
  
  use static variable to do cache instead of thread local in thread frequent switching case (#18428) · ce38bb53
  由 Leo Zhao 提交于 7月 08, 2019
  
  ce38bb53
- T
  add mkldnn shapeblob cache clear strategy (#18513) · fe32879d
  由 Tao Luo 提交于 7月 08, 2019
```
* add mkldnn shapeblob cache clear strategy

test=develop

* refine with comments

test=develop

* make cache clear strategy more safey

test=develop

* add lock for GetShapeBlobSize

test=develop
```
  fe32879d
05 7月, 2019 1 次提交
- B
  
  fix command line bug in int8v2 readme (#18507) · 3fe6bf5e
  由 bingyanghuang 提交于 7月 05, 2019
  
  3fe6bf5e
03 7月, 2019 1 次提交
- 石
  Remove the obsolete cmake options (#18481) · 047bba85
  由石晓伟提交于 7月 03, 2019
```
* remove the obsolete cmake options, test=develop

* remove unittests, test=develop
```
  047bba85

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致