- 07 1月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* add TRT support for instance_norm op
-
- 06 1月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* add gelu plugin * align trt bert with gpu * add support for fused fc with relu, * add unittest for bert trt
-
- 24 12月, 2019 1 次提交
-
-
由 lidanqing 提交于
test=develop
-
- 16 12月, 2019 1 次提交
-
-
由 Michał Gallus 提交于
test=develop
-
- 12 12月, 2019 1 次提交
-
-
由 joanna.wozna.intel 提交于
* Add reshape int8 op test=develop * Change test to CPUPlace test=develop * Correct tests test=develop
-
- 10 12月, 2019 2 次提交
-
-
由 Adam 提交于
* MKLDNN v1.0 rebase to Paddle 1.6 test=develop * Add hacky paddle::string::to_string() implementation * vectorize<int64-t>() -> vectorize() cleanup test=develop * PADDLE_ENFORCE and void_cast fixes test=develop * Rebase changes test=develop * Cosmetics test=develop * Delete MKL from mkldnn.cmake test=develop * CMake debug commands test=develop * Delete MKLDNN_VERBOSE and rebase fixes test=develop * Rebase fixes test=develop * Temporarily disable int8 resnet101 vgg16 and vgg19 tests test=develop * Add libmkldnn.so.1 to python setup test=develop * Add libmkldnn.so.1 to inference_lib cmake after rebase test=develop * Post rebase fixes + FC int8 changes test=develop * Fix LRN NHWC test=develop * Fix NHWC conv3d test=develop * Windows build fix + next conv3d fix test=develop * Fix conv2d on AVX2 machines test=develop
-
由 rensilin 提交于
* ZeroCopyTensor::mutable_data in the right device, test=develop * add unittest for zerocopy, test=develop
-
- 09 12月, 2019 1 次提交
-
-
由 lidanqing 提交于
* update benchmark for int8v2, QAT1, QAT2 accuracy and performance test=document_fix * change according to reviews test=develop test=document_fix * improve some descriptions and some models test=develop test=document_fix * update models benchmark data test=develop test=document_fix * update int8v2 and qat2 performance test=develop test=document_fix
-
- 03 12月, 2019 1 次提交
-
-
由 GaoWei8 提交于
* add ernie-large test test=develop * add ernie large c++ inference test test=develop
-
- 02 12月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* fix -Wno-error=sign-compare warning in gcc8 test=develop * fix warning in distributed codes test=develop
-
- 28 11月, 2019 1 次提交
-
-
由 lidanqing 提交于
* add ut for comparing FP32 and QAT INT8 * add save qat transformed model python script test=develop * updated * added missing file * add "with_label" test=develop * performance benchmark as unit test test=develop * change names of unnecessary thing * Change CMakeList.txt for model downloading and UT test=develop * change names of functions and params for more readable code test=develop * Change PADDLE_ENFORCE messages test=develop * fix indent problems test=develop * indent problems test=develop
-
- 26 11月, 2019 1 次提交
-
-
由 GaoWei8 提交于
* Add fc padding to solve mkl performance test=develop * fix gpu pass and error information test=develop * fix fc_fuse_pass_test test=develop * fix error information test=develop * fix error information test=develop * fix name and add fc op padding test test=develop * fix attributes test=develop * optimize fc padding test=develop * fix test test=develop
-
- 20 11月, 2019 1 次提交
-
-
由 liu zhengxi 提交于
* fix the CAPI ZeroCopy shape error and reconstruct the output obtain * use an anonymous namespace to cover the functor * fix unit tests because of the output of typeid(T).name() is different from linux and windows, test=develop
-
- 18 11月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* refine trt int8 for dynamic range set test=develop * refine trt int8 test=develop
-
- 15 11月, 2019 1 次提交
-
-
由 GaoWei8 提交于
* solve cmake fails on inference_download_and_uncompress test=develop * solve cmake fails on inference_download_and_uncompress test=develop
-
- 14 11月, 2019 1 次提交
-
-
由 Adam 提交于
* Add relative error measure when value > 1 test=develop * Move code to CheckError function test=develop
-
- 08 11月, 2019 2 次提交
-
-
由 joanna.wozna.intel 提交于
* Add transpose2 INT8 for mkl-dnn test=develop * Fix test_transpose_int8_mkldnn test=develop * Revert "Merge branch 'develop' into transpose_int8_mkldnn_2" This reverts commit 34011bdb, reversing changes made to 2ce6473f. * Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"" This reverts commit 23754dd7. * Add template to TransposeMKLDNNHandler test=develop * Resolve conflict test=develop * Restore get_size and refactor test=develop
-
由 GaoWei8 提交于
* Add ernie unit test test=develop * Add ernie unit test test=develop * Add ernie unit test test=develop * remove ngraph * optimize gpu test test=develop * optimize codes test=develop
-
- 23 10月, 2019 1 次提交
-
-
由 Pei Yang 提交于
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733) * fix pool2d trt converter, test=develop * add fix for split op converter, test=develop
-
- 20 10月, 2019 1 次提交
-
-
由 bingyanghuang 提交于
-
- 18 10月, 2019 1 次提交
-
-
由 liu zhengxi 提交于
modify the way to pass parameter out_size in function.
-
- 16 10月, 2019 1 次提交
-
-
由 lidanqing 提交于
-
- 15 10月, 2019 1 次提交
-
-
由 liu zhengxi 提交于
* fix the PD_ZeroCopyPredictorRun output problem and add some checks and logs for users * modify the cmakelists depends and fix the cmakelists problem
-
- 14 10月, 2019 2 次提交
-
-
由 bingyanghuang 提交于
-
由 Pei Yang 提交于
-
- 05 10月, 2019 1 次提交
-
-
由 liu zhengxi 提交于
* add capi for fluid inference api, including AnalysisConfig, AnalysisPredictor, PaddleBuf, PaddleTensor, ZeroCopyTensor
-
- 25 9月, 2019 2 次提交
-
-
由 Zhaolong Xing 提交于
FIx C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969) * fix memory optimization type test=develop * 1. fix BUG: open trt and memory optim will trigger bug. 2. Clean memory optim bug. test=develop
-
由 Aurelius84 提交于
* Removing last dims constraints of seq_pad and seq_unpad test=develop * fix test_layer api code test=develop * fix sequence_pad_op.cc conflict test=develop * remove test_analyzer_mm_dnn test=develop * fix vectorize bug test=develop * fix vectorize<int> test=develop
-
- 21 9月, 2019 2 次提交
-
-
由 pawelpiotrowicz 提交于
test=develop
-
由 Pei Yang 提交于
* fix trt bugs when sharing params, test=develop * add unittest for cascade_rcnn
-
- 17 9月, 2019 1 次提交
-
-
由 Pei Yang 提交于
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
-
- 16 9月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
* Refine the codes related to fc op. * Add GPU implementation for fc functor. * Apply fc_fuse_pass in GPU inference. test=develop * Change the cmake for fc op. * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ. * Add an attribute to set the activation type in fc_op. * Enhance the unittest of fc_op. test=develop * Remove the declaration of FCOpGrad back to the header file. test=develop * Set default value for newly added arguments in test_fc_op. test=develop * Enhance fc_fuse_pass to enable fusing relu. * Allow print the shapes of var_desc in graph. test=develop * Enhance fc_fuse_pass_tester. * Remove the use of PADDLE_ENFORCE. test=develop * Correct the number of ops after fusing. test=develop * Fix a typo. test=develop * Set activation_type to null when there is no relu in fc. test=develop * Refine fc_fuse_pass's codes. * Enable the set of shape for tensor. * Refine repeated_fc_relu_pass and add unittest. test=develop
-
- 03 9月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
* Add a interface to enable cudnn for inference. * Add cudnn_placement_pass. test=develop * Set the default value of cudnn_enabled_op_types to null. test=develop * Write the common basic class, placement_pass_base, to refine the codes. test=develop * Call EnableCUDNN in unittest. test=develop * Refine cudnn_placement_pass tester. * Enable the testing of cudnn_placement_pass in inference's unittest. test=develop * Add the check of op kernels. test=develop
-
- 22 8月, 2019 1 次提交
-
-
由 lidanqing 提交于
* add local user data conversion into full_pascalvoc_test_preprocess.py test=develop * change PADDLE_ENFORCE to PADDLE_ENFORCE_GE test=develop * change according to reviews test=develop
-
- 15 8月, 2019 1 次提交
-
-
由 lidanqing 提交于
* change the top1 comparison to mAP comparison test=develop * change the mobilenet-ssd tester demo data and batch_size settings test=develop
-
- 30 7月, 2019 1 次提交
-
- 11 7月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy test=develop * enhance MkldnnPostReset test=develop * add comments for mkldnn_cache_capacity field test=develop
-
- 08 7月, 2019 2 次提交
- 05 7月, 2019 1 次提交
-
-
由 bingyanghuang 提交于
-