提交 · 2e2f92a5b10d1c5cf7b1d5384bc3c7db5e6ed25b · 机器未来 / Paddle

20 11月, 2019 1 次提交
- P
  fix trt weight bug (#21231) · 2e2f92a5
  由 Pei Yang 提交于 11月 20, 2019
```
added splitter "__" between weight name and suffix number to avoid conflicts.
```
  2e2f92a5
19 11月, 2019 1 次提交
- Z
  
  Determine whether to copy and link inference lib by ON_INFER (#20931) · c0dcb090
  由 zhouwei25 提交于 11月 19, 2019
  
  c0dcb090
18 11月, 2019 1 次提交
- Z
  TRT int8: refine trt int8 for dynamic range set (#21112) · 65f70525
  由 Zhaolong Xing 提交于 11月 18, 2019
```
* refine trt int8 for dynamic range set
test=develop

* refine trt int8
test=develop
```
  65f70525
15 11月, 2019 1 次提交

fix cmake fails on inference_download_and_uncompress (#21185) · a9d4eed3

由 GaoWei8 提交于 11月 15, 2019

* solve cmake fails on inference_download_and_uncompress
test=develop

* solve cmake fails on inference_download_and_uncompress
test=develop

a9d4eed3

14 11月, 2019 1 次提交

Add relative error measure when (value > 1) (#21144) · d74ea085

由 Adam 提交于 11月 14, 2019

* Add relative error measure when value > 1
test=develop

* Move code to CheckError function
test=develop

d74ea085

13 11月, 2019 1 次提交

Add examples for error message writing specification - PreconditionNotMet,... · 8414575b

由 Chen Weihang 提交于 11月 13, 2019

Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137)

* add examples for error spec, test=develop

* change ENFORCE to ENFORCE_**, test=develop

8414575b

08 11月, 2019 2 次提交

Add transpose2 INT8 for mkl-dnn (#19424) · 77c20835

由 joanna.wozna.intel 提交于 11月 08, 2019

* Add transpose2 INT8 for mkl-dnn

test=develop

* Fix test_transpose_int8_mkldnn

test=develop

* Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"

This reverts commit 34011bdb, reversing
changes made to 2ce6473f.

* Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2""

This reverts commit 23754dd7.

* Add template to TransposeMKLDNNHandler

test=develop

* Resolve conflict

test=develop

* Restore get_size and refactor

test=develop

77c20835

Add ernie c++ inference test (#21015) · 829bf871

由 GaoWei8 提交于 11月 08, 2019

* Add ernie unit test
test=develop

* Add ernie unit test
test=develop

* Add ernie unit test
test=develop

* remove ngraph

* optimize gpu test
test=develop

* optimize codes
test=develop

829bf871

23 10月, 2019 2 次提交
- P
  Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and... · e89c16b9
  由 Pei Yang 提交于 10月 23, 2019
```
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733)

* fix pool2d trt converter, test=develop

* add fix for split op converter, test=develop
```
  e89c16b9
- 石
  
  optimize version error, test=develop (#20715) · e742760f
  由石晓伟提交于 10月 23, 2019
  
  e742760f
20 10月, 2019 1 次提交
- B
  
  update int8 benchmark with 6271 data, test=develop test=document_fix (#20736) · fd49ebcb
  由 bingyanghuang 提交于 10月 20, 2019
  
  fd49ebcb
18 10月, 2019 2 次提交
- 石
  Ensure backward compatibility with the anakin interface, test=develop (#20691) · d8f4f423
  由石晓伟提交于 10月 18, 2019
```
* support MLU nums, test=develop

* change anakin apis, test=develop
```
  d8f4f423
- L
  alter the capi of PD_PredictorRun to provide proper function, test=develop (#20697) · d39777fe
  由 liu zhengxi 提交于 10月 18, 2019
```
modify the way to pass parameter out_size in function. 
```
  d39777fe
17 10月, 2019 1 次提交
- L
  
  improve the performance of capi in PD_PredictorRun (#20665) · dbc2bb33
  由 liu zhengxi 提交于 10月 17, 2019
  
  dbc2bb33
16 10月, 2019 1 次提交
- L
  
  Add document for int8 object detection quantization (#19356) · 57b656f9
  由 lidanqing 提交于 10月 16, 2019
  
  57b656f9
15 10月, 2019 1 次提交

fix the PD_ZeroCopyPredictorRun output problem (#20612) · 922d4324

由 liu zhengxi 提交于 10月 15, 2019

* fix the PD_ZeroCopyPredictorRun output problem and add some checks and logs for users

* modify the cmakelists depends and fix the cmakelists problem

922d4324

14 10月, 2019 2 次提交
- B
  
  Modify the helper information in full_pascalvoc_test_preprocess.py (#20475) · 85e1f215
  由 bingyanghuang 提交于 10月 14, 2019
  
  85e1f215
- P
  
  add DisableGlogInfo() to AnalysisConfig, test=develop (#20581) · 443f604c
  由 Pei Yang 提交于 10月 14, 2019
  
  443f604c
13 10月, 2019 1 次提交

Add Multihead matmul fuse pass (#20167) · b8333ede

由 zhaoyuchen2018 提交于 10月 13, 2019

* Add multihead fuse pass for ernie opt

* Refine softmax

test=develop

* Refine cuda kernel

* Refine cuda version

* Refine cmake

test=develop

* refine header file

* refine test case and pass
* refine comments

b8333ede

12 10月, 2019 1 次提交

Add ConvTranspose + BatchNorm fuse pass (#20161) · 7faa3e95

由 Adam 提交于 10月 12, 2019

* Add ConvTranspose + BatchNorm fuse pass
test=develop

* Add tests for conv+bn and conv_transpose+bn passes
test=develop

7faa3e95

11 10月, 2019 1 次提交
- L
  remove incorrect new in c style, test=develop (#20370) · 53d8799b
  由 liu zhengxi 提交于 10月 11, 2019
```
remove incorrect "new" in c style. 
```
  53d8799b
10 10月, 2019 1 次提交
- 石
  
  fix analysis_predictor ci, test=release/1.6 (#20141) · 2c28e328
  由石晓伟提交于 10月 10, 2019
  
  2c28e328
08 10月, 2019 1 次提交
- L
  add dll to inference capi (#20180) · acb02fd6
  由 liu zhengxi 提交于 10月 08, 2019
```
* add dll to inference capi, test=develop

* add if win32 in cmakelists, test=develop
```
  acb02fd6
05 10月, 2019 1 次提交

Add capi for fluid inference api (#20092) · 301eeb5b

由 liu zhengxi 提交于 10月 05, 2019

* add capi for fluid inference api, including AnalysisConfig, AnalysisPredictor, PaddleBuf, PaddleTensor, ZeroCopyTensor

301eeb5b

30 9月, 2019 1 次提交

fix compile paddle with anakin bug · 276b5e34

由 Wilber 提交于 9月 30, 2019

* fix compile with anakin bug

* remove useless deps test=develop

- 修复了联编anakin时，遇到的bug.
- 编译test_anakin_activate 不通过
- 编译test_anakin_engine 不通过

276b5e34

27 9月, 2019 1 次提交

石

update operator compatible info, test=develop (#19978) · 01b9d079

由石晓伟提交于 9月 27, 2019

* update operator compatible info, test=develop

* revert cmake/version.cmake, test=develop

* add unit_tests and fix bugs, test=develop

* update ../paddle/fluid/framework/framework.proto, test=develop

* fix bug of paddle/fluid/inference/api/analysis_predictor.cc, test=develop

* update paddle/fluid/framework/version_test.cc, test=develop

* add comments and rename interfaces, test=develop

01b9d079

25 9月, 2019 2 次提交

FIx C++ inference BUG: When open memory optim and enable trt subgraph at the... · e89b1288

由 Zhaolong Xing 提交于 9月 25, 2019

FIx C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969)

* fix memory optimization type
test=develop

* 1. fix BUG: open trt and memory optim will trigger bug.
2. Clean memory optim bug.
test=develop

e89b1288

Removing length dims constraints of seq_pad and seq_unpad (#19497) · 99a9615a

由 Aurelius84 提交于 9月 25, 2019

* Removing last dims constraints of seq_pad and seq_unpad test=develop

* fix test_layer api code test=develop

* fix sequence_pad_op.cc conflict test=develop

* remove test_analyzer_mm_dnn test=develop

* fix vectorize bug test=develop

* fix vectorize<int> test=develop

99a9615a

21 9月, 2019 3 次提交
- P
  Add two extra flags for test_analyzer_int8_image_classification to disable fp32/int8 (#19840) · 2c5c6365
  由 pawelpiotrowicz 提交于 9月 21, 2019
```
test=develop
```
  2c5c6365
- P
  Add TRT input shape check between model and runtime (#19864) · baccd7e2
  由 Pei Yang 提交于 9月 21, 2019
```
* add TRT shape check, test=develop

* model_input_shape == runtime_input_shape, refine message, test=develop
```
  baccd7e2
- P
  Fix BUGS: paddle-TRT repeatedly sets weight_map and overdeletes repetitive_params (#19825) · 74812d1c
  由 Pei Yang 提交于 9月 21, 2019
```
* fix trt bugs when sharing params, test=develop

* add unittest for cascade_rcnn
```
  74812d1c
20 9月, 2019 1 次提交
- 石
  
  fix multi-thread exec of trt, test=develop (#19338) · d004a0f5
  由石晓伟提交于 9月 20, 2019
  
  d004a0f5
19 9月, 2019 1 次提交

Add a pass to fuse fc+elementwise_add+layernorm (#19776) · 3cd985a6

由 Yiqun Liu 提交于 9月 19, 2019

* Add fc_elementwise_layernorm_fuse pass and unittest.

* Add fused_fc_elementwise_layernorm op and its GPU kernel.
test=develop

* Apply fc_elementwise_layernorm_fuse_pass to GPU inference.

* Add the setting of attrs in the definition of binary_op.
test=develop

* Add comment.

* Implement the unittest.
test=develop

* Change the unittest name of layer_norm.
test=develop

3cd985a6

18 9月, 2019 1 次提交
- 石
  
  support MLU nums, test=develop (#19372) · 71b2ed61
  由石晓伟提交于 9月 18, 2019
  
  71b2ed61
17 9月, 2019 2 次提交
- P
  zerocopytensor support uint8, analysis config support profile, analysis... · 9cbc1eff
  由 Pei Yang 提交于 9月 17, 2019
```
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
```
  9cbc1eff
- Z
  fix memory optimization type (#19781) · 110be57c
  由 Zhaolong Xing 提交于 9月 17, 2019
```
test=develop
```
  110be57c
16 9月, 2019 1 次提交

Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733) · c67c8758

由 Yiqun Liu 提交于 9月 16, 2019

* Refine the codes related to fc op.

* Add GPU implementation for fc functor.

* Apply fc_fuse_pass in GPU inference.
test=develop

* Change the cmake for fc op.

* Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ.

* Add an attribute to set the activation type in fc_op.

* Enhance the unittest of fc_op.
test=develop

* Remove the declaration of FCOpGrad back to the header file.
test=develop

* Set default value for newly added arguments in test_fc_op.
test=develop

* Enhance fc_fuse_pass to enable fusing relu.

* Allow print the shapes of var_desc in graph.
test=develop

* Enhance fc_fuse_pass_tester.

* Remove the use of PADDLE_ENFORCE.
test=develop

* Correct the number of ops after fusing.
test=develop

* Fix a typo.
test=develop

* Set activation_type to null when there is no relu in fc.
test=develop

* Refine fc_fuse_pass's codes.

* Enable the set of shape for tensor.

* Refine repeated_fc_relu_pass and add unittest.
test=develop

c67c8758

11 9月, 2019 1 次提交

Implement the GPU kernel of fc operator (#19687) · a65c728e

由 Yiqun Liu 提交于 9月 11, 2019

* Refine the codes related to fc op.

* Add GPU implementation for fc functor.

* Apply fc_fuse_pass in GPU inference.
test=develop

* Change the cmake for fc op.

* Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ.

* Add an attribute to set the activation type in fc_op.

* Enhance the unittest of fc_op.
test=develop

* Remove the declaration of FCOpGrad back to the header file.
test=develop

* Set default value for newly added arguments in test_fc_op.
test=develop

a65c728e

09 9月, 2019 1 次提交

paddle::framework::vectorize() templatization [PART3] (#19643) · f05d2c51

由 Tao Luo 提交于 9月 09, 2019

* paddle::framework::vectorize() templatization

test=develop

* update pybind/imperative.cc

test=develop

* revert update on unsqueeze_op.cc and warpctc_cudnn_op.cu.cc

test=develop

f05d2c51

05 9月, 2019 1 次提交

unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631) · 3ae939e4

由 Tao Luo 提交于 9月 05, 2019

* remove assert.h

* change PADDLE_ASSERT_MSG to PADDLE_ENFORCE

test=develop

* fix tensorrt paddle_enforce

test=develop

3ae939e4

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致