提交 · fbbd94a6ce2c0a3389f06b89a48f75447a0b5193 · Crayon鑫 / Paddle

11 12月, 2019 1 次提交
- Z
  there is bug for inference using auto grwoth allocator (#21621) · fbbd94a6
  由 Zhaolong Xing 提交于 12月 11, 2019
```
test=develop
```
  fbbd94a6
10 12月, 2019 2 次提交

MKL-DNN 1.0 Update (#20162) · e81f0228

由 Adam 提交于 12月 10, 2019

* MKLDNN v1.0 rebase to Paddle 1.6
test=develop

* Add hacky paddle::string::to_string() implementation

* vectorize<int64-t>() -> vectorize() cleanup
test=develop

* PADDLE_ENFORCE and void_cast fixes
test=develop

* Rebase changes
test=develop

* Cosmetics
test=develop

* Delete MKL from mkldnn.cmake
test=develop

* CMake debug commands
test=develop

* Delete MKLDNN_VERBOSE and rebase fixes
test=develop

* Rebase fixes
test=develop

* Temporarily disable int8 resnet101 vgg16 and vgg19 tests
test=develop

* Add libmkldnn.so.1 to python setup
test=develop

* Add libmkldnn.so.1 to inference_lib cmake after rebase
test=develop

* Post rebase fixes + FC int8 changes
test=develop

* Fix LRN NHWC
test=develop

* Fix NHWC conv3d
test=develop

* Windows build fix + next conv3d fix
test=develop

* Fix conv2d on AVX2 machines
test=develop

e81f0228

R
fix: fail to call ZeroCopyTensor::mutable_data() when device_id is no… (#21461) · 7f5d532a
由 rensilin 提交于 12月 10, 2019
```
* ZeroCopyTensor::mutable_data in the right device, test=develop

* add unittest for zerocopy, test=develop
```
7f5d532a

09 12月, 2019 1 次提交

QAT Int8 document (#21360) · fbf9eca0

由 lidanqing 提交于 12月 09, 2019

* update benchmark for int8v2, QAT1, QAT2 accuracy and performance
test=document_fix

* change according to reviews
test=develop test=document_fix

* improve some descriptions and some models
test=develop test=document_fix

* update models benchmark data
test=develop test=document_fix

* update int8v2 and qat2 performance
test=develop test=document_fix

fbf9eca0

05 12月, 2019 1 次提交
- P
  
  fix glog warning, test=develop (#21573) · 20d61414
  由 Pei Yang 提交于 12月 05, 2019
  
  20d61414
04 12月, 2019 2 次提交
- P
  make config option DisableGlogInfo() able to mute all inference logs (#21318) · 122b37ce
  由 Pei Yang 提交于 12月 04, 2019
```
* make DisableGlogInfo able to mute all logs in inference. 
```
  122b37ce
- Z
  add conv, depthwise_conv, pooling (#20966) · da7748c5
  由 Zhaolong Xing 提交于 12月 04, 2019
```
test=develop
```
  da7748c5
03 12月, 2019 2 次提交
- G
  Add ernie large c++ inference test (#21365) · 250a1921
  由 GaoWei8 提交于 12月 03, 2019
```
* add ernie-large test
test=develop

* add ernie large c++ inference test
test=develop
```
  250a1921
- Z
  specify the auto growth allocator for inference. (#21448) · b39c0116
  由 Zhaolong Xing 提交于 12月 03, 2019
```
test=develop
```
  b39c0116
02 12月, 2019 2 次提交
- T
  fix -Wno-error=sign-compare warning in gcc8 (#21434) · 01fa4ead
  由 Tao Luo 提交于 12月 02, 2019
```
* fix -Wno-error=sign-compare warning in gcc8

test=develop

* fix warning in distributed codes

test=develop
```
  01fa4ead
- L
  Fix transpose conv (#21406) · 37f3e56d
  由 Lv Mengsi 提交于 12月 02, 2019
```
* fix transpose conv,test=develop

* fix comments
test=develop
```
  37f3e56d
28 11月, 2019 1 次提交

Fp32 vs int8 qat C++ performance (#21244) · c0aa1367

由 lidanqing 提交于 11月 28, 2019

* add ut for comparing FP32 and QAT INT8

* add save qat transformed model python script
test=develop

* updated

* added missing file

* add "with_label"
test=develop

* performance benchmark as unit test
test=develop

* change names of unnecessary thing

* Change CMakeList.txt for model downloading and UT
test=develop

* change names of functions and params for more readable code
test=develop

* Change PADDLE_ENFORCE messages
test=develop

* fix indent problems
test=develop

* indent problems
test=develop

c0aa1367

27 11月, 2019 2 次提交

Z
fix C++ multicard inference bug. (#20955) · d1a6e112
由 Zhaolong Xing 提交于 11月 27, 2019
```
test=develop
```
d1a6e112

INT8 Fully-connected (#17641) · 5d7d5482

由 Michał Gallus 提交于 11月 27, 2019

* Implement Int8 FC

* Integrate FC into INT8v2

test=develop

* int8 FC: transpose weights before computing scales

test=develop

* Add support for activation_type string in FC

test=develop

* Disable MKL-DNN's FC in VGG16 and 19

test=develop

* Disable FC quantization when mkldnn FC is disabled

test=develop

* Solve PADDLE_ENFORCES in FC int8

* Fix Paddle enforces and remove const cast

test=develop

* Fix style changes

test=develop

* Fix quantizer_tester test and add fc quantization

test=develop

* Fix FC test fail on CUDA

* Remove unnecessary log from quantize placement pass

test=develop

* Add Thread ID to FC hash key

test=develop

* Add comments to MKL-DNN FC Kernel

test=develop

* Refactor quantizer

test=develop

* Fix linter issues

test=develop

* Fix crash in slim googlenet

test=develop

* Fix PADDLE_ENFORCE messages

test=develop

5d7d5482

26 11月, 2019 2 次提交

Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972) · 234060f8

由 GaoWei8 提交于 11月 26, 2019

* Add fc padding to solve mkl performance
test=develop

* fix gpu pass and error information
test=develop

* fix fc_fuse_pass_test
test=develop

* fix error information
test=develop

* fix error information
test=develop

* fix name and add fc op padding test
test=develop

* fix attributes
test=develop

* optimize fc padding
test=develop

* fix test
test=develop

234060f8

S

add prediction demo and script on windows (#21248) · 45c1e7bb
由 silingtong123 提交于 11月 26, 2019

45c1e7bb

25 11月, 2019 1 次提交
- Z
  
  remove warning LNK4006 and warning LNK4221 (#21226) · 345b67b5
  由 zhouwei25 提交于 11月 25, 2019
  
  345b67b5
20 11月, 2019 2 次提交

Fix the CAPI ZeroCopy shape error and reuse the code to get output (#21240) · 3cb6c0a0

由 liu zhengxi 提交于 11月 20, 2019

* fix the CAPI ZeroCopy shape error and reconstruct the output obtain

* use an anonymous namespace to cover the functor

* fix unit tests because of the output of typeid(T).name() is different from linux and windows, test=develop

3cb6c0a0

P
fix trt weight bug (#21231) · 2e2f92a5
由 Pei Yang 提交于 11月 20, 2019
```
added splitter "__" between weight name and suffix number to avoid conflicts.
```
2e2f92a5

19 11月, 2019 1 次提交
- Z
  
  Determine whether to copy and link inference lib by ON_INFER (#20931) · c0dcb090
  由 zhouwei25 提交于 11月 19, 2019
  
  c0dcb090
18 11月, 2019 1 次提交
- Z
  TRT int8: refine trt int8 for dynamic range set (#21112) · 65f70525
  由 Zhaolong Xing 提交于 11月 18, 2019
```
* refine trt int8 for dynamic range set
test=develop

* refine trt int8
test=develop
```
  65f70525
15 11月, 2019 1 次提交

fix cmake fails on inference_download_and_uncompress (#21185) · a9d4eed3

由 GaoWei8 提交于 11月 15, 2019

* solve cmake fails on inference_download_and_uncompress
test=develop

* solve cmake fails on inference_download_and_uncompress
test=develop

a9d4eed3

14 11月, 2019 1 次提交

Add relative error measure when (value > 1) (#21144) · d74ea085

由 Adam 提交于 11月 14, 2019

* Add relative error measure when value > 1
test=develop

* Move code to CheckError function
test=develop

d74ea085

13 11月, 2019 1 次提交

Add examples for error message writing specification - PreconditionNotMet,... · 8414575b

由 Chen Weihang 提交于 11月 13, 2019

Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137)

* add examples for error spec, test=develop

* change ENFORCE to ENFORCE_**, test=develop

8414575b

08 11月, 2019 2 次提交

Add transpose2 INT8 for mkl-dnn (#19424) · 77c20835

由 joanna.wozna.intel 提交于 11月 08, 2019

* Add transpose2 INT8 for mkl-dnn

test=develop

* Fix test_transpose_int8_mkldnn

test=develop

* Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"

This reverts commit 34011bdb, reversing
changes made to 2ce6473f.

* Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2""

This reverts commit 23754dd7.

* Add template to TransposeMKLDNNHandler

test=develop

* Resolve conflict

test=develop

* Restore get_size and refactor

test=develop

77c20835

Add ernie c++ inference test (#21015) · 829bf871

由 GaoWei8 提交于 11月 08, 2019

* Add ernie unit test
test=develop

* Add ernie unit test
test=develop

* Add ernie unit test
test=develop

* remove ngraph

* optimize gpu test
test=develop

* optimize codes
test=develop

829bf871

23 10月, 2019 2 次提交
- P
  Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and... · e89c16b9
  由 Pei Yang 提交于 10月 23, 2019
```
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733)

* fix pool2d trt converter, test=develop

* add fix for split op converter, test=develop
```
  e89c16b9
- 石
  
  optimize version error, test=develop (#20715) · e742760f
  由石晓伟提交于 10月 23, 2019
  
  e742760f
20 10月, 2019 1 次提交
- B
  
  update int8 benchmark with 6271 data, test=develop test=document_fix (#20736) · fd49ebcb
  由 bingyanghuang 提交于 10月 20, 2019
  
  fd49ebcb
18 10月, 2019 2 次提交
- 石
  Ensure backward compatibility with the anakin interface, test=develop (#20691) · d8f4f423
  由石晓伟提交于 10月 18, 2019
```
* support MLU nums, test=develop

* change anakin apis, test=develop
```
  d8f4f423
- L
  alter the capi of PD_PredictorRun to provide proper function, test=develop (#20697) · d39777fe
  由 liu zhengxi 提交于 10月 18, 2019
```
modify the way to pass parameter out_size in function. 
```
  d39777fe
17 10月, 2019 1 次提交
- L
  
  improve the performance of capi in PD_PredictorRun (#20665) · dbc2bb33
  由 liu zhengxi 提交于 10月 17, 2019
  
  dbc2bb33
16 10月, 2019 1 次提交
- L
  
  Add document for int8 object detection quantization (#19356) · 57b656f9
  由 lidanqing 提交于 10月 16, 2019
  
  57b656f9
15 10月, 2019 1 次提交

fix the PD_ZeroCopyPredictorRun output problem (#20612) · 922d4324

由 liu zhengxi 提交于 10月 15, 2019

* fix the PD_ZeroCopyPredictorRun output problem and add some checks and logs for users

* modify the cmakelists depends and fix the cmakelists problem

922d4324

14 10月, 2019 2 次提交
- B
  
  Modify the helper information in full_pascalvoc_test_preprocess.py (#20475) · 85e1f215
  由 bingyanghuang 提交于 10月 14, 2019
  
  85e1f215
- P
  
  add DisableGlogInfo() to AnalysisConfig, test=develop (#20581) · 443f604c
  由 Pei Yang 提交于 10月 14, 2019
  
  443f604c
13 10月, 2019 1 次提交

Add Multihead matmul fuse pass (#20167) · b8333ede

由 zhaoyuchen2018 提交于 10月 13, 2019

* Add multihead fuse pass for ernie opt

* Refine softmax

test=develop

* Refine cuda kernel

* Refine cuda version

* Refine cmake

test=develop

* refine header file

* refine test case and pass
* refine comments

b8333ede

12 10月, 2019 1 次提交

Add ConvTranspose + BatchNorm fuse pass (#20161) · 7faa3e95

由 Adam 提交于 10月 12, 2019

* Add ConvTranspose + BatchNorm fuse pass
test=develop

* Add tests for conv+bn and conv_transpose+bn passes
test=develop

7faa3e95

11 10月, 2019 1 次提交
- L
  remove incorrect new in c style, test=develop (#20370) · 53d8799b
  由 liu zhengxi 提交于 10月 11, 2019
```
remove incorrect "new" in c style. 
```
  53d8799b
10 10月, 2019 1 次提交
- 石
  
  fix analysis_predictor ci, test=release/1.6 (#20141) · 2c28e328
  由石晓伟提交于 10月 10, 2019
  
  2c28e328

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致